Skip to content

fix(net): DpdkBackend::send_frame dropped every frame → zero TCP connections#108

Merged
gspivey merged 1 commit into
developmentfrom
agent/dpdk-tx-frame-fix
Jul 2, 2026
Merged

fix(net): DpdkBackend::send_frame dropped every frame → zero TCP connections#108
gspivey merged 1 commit into
developmentfrom
agent/dpdk-tx-frame-fix

Conversation

@gspivey

@gspivey gspivey commented Jul 1, 2026

Copy link
Copy Markdown
Owner

The bug behind the zero-connection perf run

Perf run 28552788420 got all the way to running traffic — TRex opened 150,002 flows and sent 21.3M packets — but Active-flows: 0. The DUT tcp-echo never completed a single handshake.

Root cause (found by code investigation, not another EC2 run): DpdkBackend::send_frame called mbuf.data_mut() on a freshly-allocated mbuf, whose data_len is 0, so the returned slice was zero-length. The data.len() < frame.len() capacity check was therefore always true and the function returned Frame too large: mbuf capacity: 0 before ever calling tx_burst. Every outbound frame — SYN-ACK, ACK, data, RST, retransmit — for both TCP and QUIC real-NIC was silently dropped. The engine driver discarded the error (let _ =), so it was invisible.

Fix

  • Reorder send_frame to compute capacity from buf_len - data_offset and call set_data_len() before data_mut() — matching the proven UDP TX path (dpdk-udp/src/lib.rs:1498-1523, which even has the comment "Set data_len first so data_mut() returns the right size slice").
  • Tighten test_dpdk_backend_send_frame — it tolerated the capacity error (is_err() || …), which masked this. Now asserts the frame reaches tx_burst and never fails the capacity check. (Fails before the fix, passes after.)
  • runtime.rs: route all 3 TX sites through send_or_warn(), which logs the first non-transient TX error instead of let _ =. A silent TX drop cost us a full EC2 run to diagnose; it won't be silent again.

Impact

Unblocks both TCP and QUIC real-NIC transmit. Verified locally (cargo build + 220+ tests pass, no EC2).

⚠️ Needs the companion SSM-truncation transport fix (separate PR) before a perf run goes fully green — see the thread.

🤖 Generated with Claude Code

…e set_data_len)

Root cause of the DUT establishing ZERO TCP connections on real hardware
(perf run 28552788420: TRex opened 150k flows / sent 21M packets, Active-flows=0).

DpdkBackend::send_frame called mbuf.data_mut() on a freshly-allocated mbuf —
whose data_len is 0 — so the returned slice was zero-length, the
'data.len() < frame.len()' capacity check was ALWAYS true, and it returned
'Frame too large: mbuf capacity: 0' before ever reaching tx_burst. Every
outbound frame (SYN-ACK, ACK, data, RST, retransmit) for both TCP and QUIC
real-NIC was silently dropped. Reorder to compute capacity from
buf_len-data_offset and set_data_len() BEFORE data_mut(), matching the proven
UDP TX path (dpdk-udp/src/lib.rs:1498-1523).

- Tighten test_dpdk_backend_send_frame: it tolerated the capacity error
  ('is_err() || ...'), masking the bug. Now asserts the frame reaches tx_burst
  (Ok, or WouldBlock under stubs) and never fails the capacity check.
- runtime.rs: the engine driver discarded send_frame errors ('let _ ='), so the
  silent drop was invisible and cost a full EC2 run to find. Route all 3 TX
  sites through send_or_warn(), which logs the first non-transient TX error.

Verified locally (cargo build + test, no EC2). Found via a code-level
investigation of the RX->SYN->SYN-ACK path.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@github-actions

github-actions Bot commented Jul 2, 2026

Copy link
Copy Markdown

Synthetic Performance Results (run)

Commit: ba973e7a

✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)

Synthetic UDP Performance Results

Measures framework overhead: sync dpdk_udp::UdpSocket vs async (std::sync::Mutex + try_recv_from).

IPv4 Baseline

Test Payload Sync PPS Async PPS Ratio (sync/async) Async ns/op
TX send_to 64B 11.7M 11.2M 1.0x 89
RX recv_from 64B 3.6M 4.6M 0.8x 216
TX send_to 1400B 1.8M 1.8M 1.0x 543
RX recv_from 1400B 1.1M 1.2M 1.0x 846

IPv6

Test Payload Sync PPS Async PPS Ratio (sync/async) Async ns/op
TX send_to (IPv6) 64B 9.1M 8.7M 1.0x 114
RX recv_from (IPv6) 64B 4.1M 5.4M 0.7x 183
TX send_to (IPv6) 1400B 3.0M 3.0M 1.0x 329
RX recv_from (IPv6) 1400B 1.2M 1.3M 0.9x 769

IPv6 vs IPv4 Comparison (sync path)

Test Payload IPv4 PPS IPv6 PPS IPv4/IPv6 Ratio
TX send_to (sync) 64B 11.7M 9.1M 1.29x
RX recv_from (sync) 64B 3.6M 4.1M 0.87x
TX send_to (sync) 1400B 1.8M 3.0M 0.60x
RX recv_from (sync) 1400B 1.1M 1.2M 0.94x

IPv4 avg sync/async ratio: 0.9x, worst: 1.0x | IPv6 vs IPv4 worst ratio: 1.29x (OK)

OK: IPv6 is 29.2% slower than IPv4 — within acceptable threshold (<50%). Expected due to larger headers (40B vs 20B) and mandatory UDP checksum.

Good: Async wrapper is within 1.0x of sync — minimal framework overhead.

@github-actions

github-actions Bot commented Jul 2, 2026

Copy link
Copy Markdown

Synthetic Performance Results — Graviton (run)

Commit: ba973e7a

✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to 10.0.0.1:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)
✅ synthetic UDP socket bound to [2001:db8::1]:9000 (MAC: 02:00:00:00:00:01)

Synthetic UDP Performance Results

Measures framework overhead: sync dpdk_udp::UdpSocket vs async (std::sync::Mutex + try_recv_from).

IPv4 Baseline

Test Payload Sync PPS Async PPS Ratio (sync/async) Async ns/op
TX send_to 64B 11.4M 11.0M 1.0x 90
RX recv_from 64B 3.4M 4.5M 0.8x 222
TX send_to 1400B 1.6M 1.6M 1.0x 613
RX recv_from 1400B 1.0M 1.1M 0.9x 891

IPv6

Test Payload Sync PPS Async PPS Ratio (sync/async) Async ns/op
TX send_to (IPv6) 64B 8.6M 8.4M 1.0x 118
RX recv_from (IPv6) 64B 3.9M 5.4M 0.7x 184
TX send_to (IPv6) 1400B 2.7M 2.7M 1.0x 376
RX recv_from (IPv6) 1400B 1.1M 1.1M 0.9x 873

IPv6 vs IPv4 Comparison (sync path)

Test Payload IPv4 PPS IPv6 PPS IPv4/IPv6 Ratio
TX send_to (sync) 64B 11.4M 8.6M 1.32x
RX recv_from (sync) 64B 3.4M 3.9M 0.89x
TX send_to (sync) 1400B 1.6M 2.7M 0.62x
RX recv_from (sync) 1400B 1.0M 1.1M 0.97x

IPv4 avg sync/async ratio: 0.9x, worst: 1.0x | IPv6 vs IPv4 worst ratio: 1.32x (OK)

OK: IPv6 is 32.4% slower than IPv4 — within acceptable threshold (<50%). Expected due to larger headers (40B vs 20B) and mandatory UDP checksum.

Good: Async wrapper is within 1.0x of sync — minimal framework overhead.

@github-actions

github-actions Bot commented Jul 2, 2026

Copy link
Copy Markdown

[CI] Stage: Deploy

Infrastructure ready.

  • Sender: i-0a2bcf14c864999e3 (DPDK ENI: 10.0.1.28)
  • Receiver: i-0a20364b70a6cb2c5 (DPDK ENI: 10.0.1.10)
  • Both instances SSM-ready.

@github-actions

github-actions Bot commented Jul 2, 2026

Copy link
Copy Markdown

[CI] Stage: Deploy

Infrastructure ready.

  • Sender: i-042d9c2e895cc87cc (DPDK ENI: 10.0.1.168)
  • Receiver: i-0b6464b52a79fc914 (DPDK ENI: 10.0.1.129)
  • Both instances SSM-ready.

@github-actions

github-actions Bot commented Jul 2, 2026

Copy link
Copy Markdown

[CI] Stage: Summary

All tests PASSED.

ARP seeding: kernel /proc/net/arp (automatic)

  • tier1-dpdk-echo: 6 tests, 0 failures
  • tier2-kernel-interop: 4 tests, 0 failures
  • tier3-iperf-interop: 1 tests, 0 failures
  • tier3-iperf-interop: 1 tests, 0 failures

1 similar comment
@github-actions

github-actions Bot commented Jul 2, 2026

Copy link
Copy Markdown

[CI] Stage: Summary

All tests PASSED.

ARP seeding: kernel /proc/net/arp (automatic)

  • tier1-dpdk-echo: 6 tests, 0 failures
  • tier2-kernel-interop: 4 tests, 0 failures
  • tier3-iperf-interop: 1 tests, 0 failures
  • tier3-iperf-interop: 1 tests, 0 failures

@github-actions

github-actions Bot commented Jul 2, 2026

Copy link
Copy Markdown

✅ Integration Tests Passed (Run 28555793043)

Branch: 108/merge | Commit: ba973e7a

Test Results

  • tier1-dpdk-echo.xml: 6 tests, 0 failures, skipped
  • tier2-kernel-interop.xml: 4 tests, 0 failures, skipped
  • tier3-iperf-sends.xml: 1 tests, 0 failures, skipped
  • tier3-our-app-sends.xml: 1 tests, 0 failures, skipped

Application Logs (last 20 lines)

receiver-echo-server.log

EAL: Detected CPU lcores: 2
EAL: Detected NUMA nodes: 1
EAL: Detected shared linkage of DPDK
EAL: Multi-process socket /var/run/dpdk/rte/mp_socket
EAL: Selected IOVA mode 'PA'
EAL: VFIO support initialized
EAL: Using IOMMU type 8 (No-IOMMU)
EAL: Probe PCI driver: net_ena (1d0f:ec20) device: 0000:00:06.0 (socket -1)
TELEMETRY: No legacy callbacks, legacy socket not created
Warning: Some RX offloads not supported by device (flags: 0x1)
Warning: Some TX offloads not supported by device (flags: 0x1)
✅ DPDK UDP socket bound to 10.0.1.129:9000 (MAC: 02:2c:70:d5:90:bb)
echo listening on 10.0.1.129:9000 (MTU=9001, max_udp_payload=8973)
Shutting down gracefully...

sender-echo-server.log

EAL: Detected CPU lcores: 2
EAL: Detected NUMA nodes: 1
EAL: Detected shared linkage of DPDK
EAL: Multi-process socket /var/run/dpdk/rte/mp_socket
EAL: Selected IOVA mode 'PA'
EAL: VFIO support initialized
EAL: Using IOMMU type 8 (No-IOMMU)
EAL: Probe PCI driver: net_ena (1d0f:ec20) device: 0000:00:06.0 (socket -1)
TELEMETRY: No legacy callbacks, legacy socket not created
Warning: Some RX offloads not supported by device (flags: 0x1)
Warning: Some TX offloads not supported by device (flags: 0x1)
✅ DPDK UDP socket bound to 10.0.1.168:9000 (MAC: 02:02:89:fa:2c:39)
echo listening on 10.0.1.168:9000 (MTU=9001, max_udp_payload=8973)
Shutting down gracefully...

sender-test-client.log

EAL: Multi-process socket /var/run/dpdk/rte/mp_socket
EAL: Selected IOVA mode 'PA'
EAL: VFIO support initialized
TELEMETRY: No legacy callbacks, legacy socket not created
DPDK bind failed (Port init failed: Invalid port ID: 0), falling back to tokio
Backend: tokio
Sending packets...
Sent 12 bytes: 'arp-probe #1'
Received 12 bytes from 10.0.1.129:9000: 'arp-probe #1'
Test complete
[2026-07-02T00:15:40Z] INFO: ARP resolution succeeded (got response from peer)
[2026-07-02T00:15:40Z] INFO: Test: udp_send_receive
[2026-07-02T00:15:41Z] INFO: UDP send/receive succeeded
[2026-07-02T00:15:41Z] INFO: Test: echo_roundtrip
[2026-07-02T00:15:41Z] INFO: Echo roundtrip succeeded: 5/5 responses received
[2026-07-02T00:15:41Z] INFO: Test: payload_integrity
[2026-07-02T00:15:42Z] INFO: Response received, checking payload match...
[2026-07-02T00:15:42Z] INFO: Payload integrity verified (found in response)
[2026-07-02T00:15:42Z] INFO: JUnit XML written to: /tmp/test-results/tier2-kernel-interop.xml
[2026-07-02T00:15:42Z] INFO: Tier 2 sender tests complete. Results: /tmp/test-results/tier2-kernel-interop.xml

receiver-test-client-iperf.log

[2026-07-02T00:22:39Z] INFO: iperf-sends: sent 10 packets, received 10 responses
[2026-07-02T00:22:39Z] INFO: iperf-sends: PASS (sent >= 5 packets)
[2026-07-02T00:22:39Z] INFO: JUnit XML written to: /tmp/test-results/tier3-iperf-sends.xml
[2026-07-02T00:22:39Z] INFO: iperf-sends test complete

sender-test-client-iperf.log

Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #3'
Sent 30 bytes: 'dpdk-to-kernel-test-payload #4'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #4'
Sent 30 bytes: 'dpdk-to-kernel-test-payload #5'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #5'
Sent 30 bytes: 'dpdk-to-kernel-test-payload #6'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #6'
Sent 30 bytes: 'dpdk-to-kernel-test-payload #7'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #7'
Sent 30 bytes: 'dpdk-to-kernel-test-payload #8'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #8'
Sent 30 bytes: 'dpdk-to-kernel-test-payload #9'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #9'
Sent 31 bytes: 'dpdk-to-kernel-test-payload #10'
Received 31 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #10'
Test complete
[2026-07-02T00:21:39Z] INFO: our-app-sends: sent 10 packets, received 10 responses
[2026-07-02T00:21:39Z] INFO: our-app-sends: PASS (sent >= 5 packets)
[2026-07-02T00:21:39Z] INFO: JUnit XML written to: /tmp/test-results/tier3-our-app-sends.xml
[2026-07-02T00:21:39Z] INFO: our-app-sends test complete
Full Application Logs (last 200 lines each)

receiver-echo-server.log

EAL: Detected CPU lcores: 2
EAL: Detected NUMA nodes: 1
EAL: Detected shared linkage of DPDK
EAL: Multi-process socket /var/run/dpdk/rte/mp_socket
EAL: Selected IOVA mode 'PA'
EAL: VFIO support initialized
EAL: Using IOMMU type 8 (No-IOMMU)
EAL: Probe PCI driver: net_ena (1d0f:ec20) device: 0000:00:06.0 (socket -1)
TELEMETRY: No legacy callbacks, legacy socket not created
Warning: Some RX offloads not supported by device (flags: 0x1)
Warning: Some TX offloads not supported by device (flags: 0x1)
✅ DPDK UDP socket bound to 10.0.1.129:9000 (MAC: 02:2c:70:d5:90:bb)
echo listening on 10.0.1.129:9000 (MTU=9001, max_udp_payload=8973)
Shutting down gracefully...

sender-echo-server.log

EAL: Detected CPU lcores: 2
EAL: Detected NUMA nodes: 1
EAL: Detected shared linkage of DPDK
EAL: Multi-process socket /var/run/dpdk/rte/mp_socket
EAL: Selected IOVA mode 'PA'
EAL: VFIO support initialized
EAL: Using IOMMU type 8 (No-IOMMU)
EAL: Probe PCI driver: net_ena (1d0f:ec20) device: 0000:00:06.0 (socket -1)
TELEMETRY: No legacy callbacks, legacy socket not created
Warning: Some RX offloads not supported by device (flags: 0x1)
Warning: Some TX offloads not supported by device (flags: 0x1)
✅ DPDK UDP socket bound to 10.0.1.168:9000 (MAC: 02:02:89:fa:2c:39)
echo listening on 10.0.1.168:9000 (MTU=9001, max_udp_payload=8973)
Shutting down gracefully...

sender-test-client.log

[2026-07-02T00:11:57Z] INFO: Test: arp_resolution
UDP Test Client
Target: 10.0.1.129:9000
Bind address: 10.0.1.168:0
Message: 'arp-probe'
Count: 1
EAL: Detected CPU lcores: 2
EAL: Detected NUMA nodes: 1
EAL: Detected shared linkage of DPDK
EAL: Multi-process socket /var/run/dpdk/rte/mp_socket
EAL: Selected IOVA mode 'PA'
EAL: VFIO support initialized
EAL: Using IOMMU type 8 (No-IOMMU)
EAL: Probe PCI driver: net_ena (1d0f:ec20) device: 0000:00:06.0 (socket -1)
TELEMETRY: No legacy callbacks, legacy socket not created
Warning: Some RX offloads not supported by device (flags: 0x1)
Warning: Some TX offloads not supported by device (flags: 0x1)
✅ DPDK UDP socket bound to 10.0.1.168:32768 (MAC: 02:02:89:fa:2c:39)
Backend: dpdk
Sending packets...
Sent 12 bytes: 'arp-probe #1'
Received 12 bytes from 10.0.1.129:9000: 'arp-probe #1'
Test complete
[2026-07-02T00:11:58Z] INFO: ARP resolution succeeded (got response from peer)
[2026-07-02T00:11:58Z] INFO: Test: udp_send_receive
[2026-07-02T00:12:00Z] INFO: UDP send/receive succeeded
[2026-07-02T00:12:00Z] INFO: Test: echo_roundtrip
[2026-07-02T00:12:02Z] INFO: Echo roundtrip succeeded: 5/5 responses received
[2026-07-02T00:12:02Z] INFO: Test: payload_integrity
[2026-07-02T00:12:02Z] INFO: Response received, checking payload match...
[2026-07-02T00:12:02Z] INFO: Payload integrity verified (found in response)
[2026-07-02T00:12:02Z] INFO: Test: jumbo_diagnostics
[2026-07-02T00:12:02Z] INFO: === JUMBO FRAME DIAGNOSTICS ===
[2026-07-02T00:12:02Z] INFO: Interface MTU:
  9001
  65536
[2026-07-02T00:12:02Z] INFO:   ens5: MTU=9001
[2026-07-02T00:12:02Z] INFO:   lo: MTU=65536
[2026-07-02T00:12:02Z] INFO: Routing table MTU column:
Iface	Destination	Gateway 	Flags	RefCnt	Use	Metric	Mask		MTU	Window	IRTT                                                       
ens5	00000000	0101000A	0003	0	0	512	00000000	0	0	0                                                                             
ens5	0200000A	0101000A	0007	0	0	512	FFFFFFFF	0	0	0                                                                             
ens5	0001000A	00000000	0001	0	0	512	00FFFFFF	0	0	0                                                                             
ens5	0101000A	00000000	0005	0	0	512	FFFFFFFF	0	0	0                                                                             
[2026-07-02T00:12:02Z] INFO: DPDK port config (from echo server log):
[2026-07-02T00:12:02Z] INFO:   (no MTU info in echo log)
[2026-07-02T00:12:02Z] INFO: === END JUMBO DIAGNOSTICS ===
[2026-07-02T00:12:02Z] INFO: Test: jumbo_echo_8000
[2026-07-02T00:12:04Z] INFO: Jumbo output: UDP Test Client
Target: 10.0.1.129:9000
Bind address: 10.0.1.168:0
Payload size: 8000 bytes
Count: 3
EAL: Detected CPU lcores: 2
EAL: Detected NUMA nodes: 1
EAL: Detected shared linkage of DPDK
EAL: Multi-process socket /var/run/dpdk/rte/mp_socket
EAL: Selected IOVA mode 'PA'
EAL: VFIO support initialized
EAL: Using IOMMU type 8 (No-IOMMU)
EAL: Probe PCI driver: net_ena (1d0f:ec20) device: 0000:00:06.0 (socket -1)
TELEMETRY: No legacy callbacks, legacy socket not created
Warning: Some RX offloads not supported by device (flags: 0x1)
Warning: Some TX offloads not supported by device (flags: 0x1)
✅ DPDK UDP socket bound to 10.0.1.168:32768 (MAC: 02:02:89:fa:2c:39)
Backend: dpdk
Sending packets...
Sent 8000 bytes (binary payload)
Received 8000 bytes from 10.0.1.129:9000 (expected 8000, OK)
Sent 8000 bytes (binary payload)
Received 8000 bytes from 10.0.1.129:9000 (expected 8000, OK)
Sent 8000 bytes (binary payload)
Received 8000 bytes from 10.0.1.129:9000 (expected 8000, OK)
Test complete
[2026-07-02T00:12:04Z] INFO: Jumbo frame echo succeeded: 3/3 responses with correct size
[2026-07-02T00:12:04Z] INFO: JUnit XML written to: /tmp/test-results/tier1-dpdk-echo.xml
[2026-07-02T00:12:04Z] INFO: Tier 1 sender tests complete. Results: /tmp/test-results/tier1-dpdk-echo.xml
[2026-07-02T00:15:39Z] INFO: Test: arp_resolution
UDP Test Client
Target: 10.0.1.129:9000
Bind address: 0.0.0.0:0
Message: 'arp-probe'
Count: 1
EAL: Detected CPU lcores: 2
EAL: Detected NUMA nodes: 1
EAL: Detected shared linkage of DPDK
EAL: Multi-process socket /var/run/dpdk/rte/mp_socket
EAL: Selected IOVA mode 'PA'
EAL: VFIO support initialized
TELEMETRY: No legacy callbacks, legacy socket not created
DPDK bind failed (Port init failed: Invalid port ID: 0), falling back to tokio
Backend: tokio
Sending packets...
Sent 12 bytes: 'arp-probe #1'
Received 12 bytes from 10.0.1.129:9000: 'arp-probe #1'
Test complete
[2026-07-02T00:15:40Z] INFO: ARP resolution succeeded (got response from peer)
[2026-07-02T00:15:40Z] INFO: Test: udp_send_receive
[2026-07-02T00:15:41Z] INFO: UDP send/receive succeeded
[2026-07-02T00:15:41Z] INFO: Test: echo_roundtrip
[2026-07-02T00:15:41Z] INFO: Echo roundtrip succeeded: 5/5 responses received
[2026-07-02T00:15:41Z] INFO: Test: payload_integrity
[2026-07-02T00:15:42Z] INFO: Response received, checking payload match...
[2026-07-02T00:15:42Z] INFO: Payload integrity verified (found in response)
[2026-07-02T00:15:42Z] INFO: JUnit XML written to: /tmp/test-results/tier2-kernel-interop.xml
[2026-07-02T00:15:42Z] INFO: Tier 2 sender tests complete. Results: /tmp/test-results/tier2-kernel-interop.xml

receiver-test-client-iperf.log

[2026-07-02T00:22:39Z] INFO: iperf-sends: sent 10 packets, received 10 responses
[2026-07-02T00:22:39Z] INFO: iperf-sends: PASS (sent >= 5 packets)
[2026-07-02T00:22:39Z] INFO: JUnit XML written to: /tmp/test-results/tier3-iperf-sends.xml
[2026-07-02T00:22:39Z] INFO: iperf-sends test complete

sender-test-client-iperf.log

[2026-07-02T00:21:36Z] INFO: Pre-flight: checking DPDK state and ARP cache...
[2026-07-02T00:21:36Z] INFO: Local IP: 10.0.1.168, Peer IP: 10.0.1.129, Port: 9000
[2026-07-02T00:21:36Z] INFO: /proc/net/arp contents:
IP address       HW type     Flags       HW address            Mask     Device
10.0.1.168       0x1         0x2         02:02:89:fa:2c:39     *        ens5
10.0.1.1         0x1         0x2         02:ef:8f:74:ee:5d     *        ens5
10.0.1.79        0x1         0x2         02:3e:9b:d0:71:9f     *        ens5
10.0.1.129       0x1         0x2         02:2c:70:d5:90:bb     *        ens5
[2026-07-02T00:21:36Z] INFO: DPDK runtime state:
No /var/run/dpdk/ directory
[2026-07-02T00:21:36Z] INFO: vfio-pci bindings:
0000:00:06.0
bind
module
new_id
remove_id
uevent
unbind
[2026-07-02T00:21:36Z] INFO: Test binary: /opt/dpdk-stdlib/target/release/test-client
-rwxr-xr-x. 2 root root 1950784 Jul  2 00:09 /opt/dpdk-stdlib/target/release/test-client
[2026-07-02T00:21:36Z] INFO: Launching test-client: /opt/dpdk-stdlib/target/release/test-client --target 10.0.1.129 --port 9000 --bind-ip 10.0.1.168 --count 10 --delay 200
[2026-07-02T00:21:39Z] INFO: Test client output: UDP Test Client
Target: 10.0.1.129:9000
Bind address: 10.0.1.168:0
Message: 'dpdk-to-kernel-test-payload'
Count: 10
EAL: Detected CPU lcores: 2
EAL: Detected NUMA nodes: 1
EAL: Detected shared linkage of DPDK
EAL: Multi-process socket /var/run/dpdk/rte/mp_socket
EAL: Selected IOVA mode 'PA'
EAL: VFIO support initialized
EAL: Using IOMMU type 8 (No-IOMMU)
EAL: Probe PCI driver: net_ena (1d0f:ec20) device: 0000:00:06.0 (socket -1)
TELEMETRY: No legacy callbacks, legacy socket not created
Warning: Some RX offloads not supported by device (flags: 0x1)
Warning: Some TX offloads not supported by device (flags: 0x1)
✅ DPDK UDP socket bound to 10.0.1.168:32768 (MAC: 02:02:89:fa:2c:39)
Backend: dpdk
Sending packets...
Sent 30 bytes: 'dpdk-to-kernel-test-payload #1'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #1'
Sent 30 bytes: 'dpdk-to-kernel-test-payload #2'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #2'
Sent 30 bytes: 'dpdk-to-kernel-test-payload #3'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #3'
Sent 30 bytes: 'dpdk-to-kernel-test-payload #4'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #4'
Sent 30 bytes: 'dpdk-to-kernel-test-payload #5'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #5'
Sent 30 bytes: 'dpdk-to-kernel-test-payload #6'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #6'
Sent 30 bytes: 'dpdk-to-kernel-test-payload #7'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #7'
Sent 30 bytes: 'dpdk-to-kernel-test-payload #8'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #8'
Sent 30 bytes: 'dpdk-to-kernel-test-payload #9'
Received 30 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #9'
Sent 31 bytes: 'dpdk-to-kernel-test-payload #10'
Received 31 bytes from 10.0.1.129:9000: 'dpdk-to-kernel-test-payload #10'
Test complete
[2026-07-02T00:21:39Z] INFO: our-app-sends: sent 10 packets, received 10 responses
[2026-07-02T00:21:39Z] INFO: our-app-sends: PASS (sent >= 5 packets)
[2026-07-02T00:21:39Z] INFO: JUnit XML written to: /tmp/test-results/tier3-our-app-sends.xml
[2026-07-02T00:21:39Z] INFO: our-app-sends test complete
⚠️ SSM Command Failures (receiver-ssm-failure.log)
=== Polling timeout after 30s ===
Status: InProgress
Instance: i-0b6464b52a79fc914 (receiver)
Command ID: c1b2da2f-9911-4bcc-94ec-2a67291ab546

=== STDOUT ===


=== STDERR ===


=== Polling timeout after 30s ===
Status: InProgress
Instance: i-0b6464b52a79fc914 (receiver)
Command ID: 639cd847-c7c5-4050-aca1-b059f1ecd5d6

=== STDOUT ===


=== STDERR ===


=== Polling timeout after 30s ===
Status: InProgress
Instance: i-0b6464b52a79fc914 (receiver)
Command ID: b84c45a3-1ece-47e9-b653-9db5fde59aed

=== STDOUT ===


=== STDERR ===


=== Polling timeout after 30s ===
Status: InProgress
Instance: i-0b6464b52a79fc914 (receiver)
Command ID: 3b49453d-75b2-4391-9453-311791e94ef3

=== STDOUT ===


=== STDERR ===


⚠️ SSM Command Failures (sender-ssm-failure.log)
=== Polling timeout after 30s ===
Status: InProgress
Instance: i-042d9c2e895cc87cc (sender)
Command ID: 309bf646-a091-4804-83be-d61607f7a866

=== STDOUT ===


=== STDERR ===


Network & PCI State

receiver-network-interfaces.log

1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
       valid_lft forever preferred_lft forever
    inet6 ::1/128 scope host noprefixroute 
       valid_lft forever preferred_lft forever
2: ens5: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 9001 qdisc mq state UP group default qlen 1000
    link/ether 02:05:0c:f2:10:a3 brd ff:ff:ff:ff:ff:ff
    altname enp0s5
    altname eni-0cf4ce889e78931a6
    altname device-number-0.0
    inet 10.0.1.173/24 metric 512 brd 10.0.1.255 scope global dynamic ens5
       valid_lft 2134sec preferred_lft 2134sec
    inet6 fe80::5:cff:fef2:10a3/64 scope link proto kernel_ll 
       valid_lft forever preferred_lft forever

sender-network-interfaces.log

1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
       valid_lft forever preferred_lft forever
    inet6 ::1/128 scope host noprefixroute 
       valid_lft forever preferred_lft forever
2: ens5: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 9001 qdisc mq state UP group default qlen 1000
    link/ether 02:79:ec:41:e9:8f brd ff:ff:ff:ff:ff:ff
    altname enp0s5
    altname eni-0e6eb2c9e495d854f
    altname device-number-0.0
    inet 10.0.1.220/24 metric 512 brd 10.0.1.255 scope global dynamic ens5
       valid_lft 2157sec preferred_lft 2157sec
    inet6 fe80::79:ecff:fe41:e98f/64 scope link proto kernel_ll 
       valid_lft forever preferred_lft forever

receiver-networking-diag-baseline.txt

=== NETWORKING DIAGNOSTICS ===
timestamp: 2026-07-02T00:11:18Z
hostname: ip-10-0-1-173.ec2.internal
kernel: 6.18.35-68.129.amzn2023.x86_64

=== DPDK PORT STATUS ===

Network devices using DPDK-compatible driver
============================================
0000:00:06.0 'Elastic Network Adapter (ENA) ec20' drv=vfio-pci unused=ena

Network devices using kernel driver
===================================
0000:00:05.0 'Elastic Network Adapter (ENA) ec20' if=ens5 drv=ena unused=vfio-pci *Active*

No 'Baseband' devices detected
==============================

No 'Crypto' devices detected
============================

No 'DMA' devices detected
=========================

No 'Eventdev' devices detected
==============================

No 'Mempool' devices detected
=============================

No 'Compress' devices detected
==============================

No 'Misc (rawdev)' devices detected
===================================

No 'Regex' devices detected
===========================

=== IP ADDRESSES ===
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
       valid_lft forever preferred_lft forever
    inet6 ::1/128 scope host noprefixroute 
       valid_lft forever preferred_lft forever
2: ens5: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 9001 qdisc mq state UP group default qlen 1000
    link/ether 02:05:0c:f2:10:a3 brd ff:ff:ff:ff:ff:ff
    altname enp0s5
    altname eni-0cf4ce889e78931a6
    altname device-number-0.0
    inet 10.0.1.173/24 metric 512 brd 10.0.1.255 scope global dynamic ens5
       valid_lft 3118sec preferred_lft 3118sec
    inet6 fe80::5:cff:fef2:10a3/64 scope link proto kernel_ll 
       valid_lft forever preferred_lft forever

=== ARP TABLE ===
10.0.1.79 dev ens5 lladdr 02:3e:9b:d0:71:9f REACHABLE 
10.0.1.1 dev ens5 lladdr 02:ef:8f:74:ee:5d REACHABLE 
10.0.1.53 dev ens5 lladdr 02:4a:49:b3:f0:7d REACHABLE 

=== ROUTE TABLE ===
default via 10.0.1.1 dev ens5 proto dhcp src 10.0.1.173 metric 512 
10.0.0.2 via 10.0.1.1 dev ens5 proto dhcp src 10.0.1.173 metric 512 
10.0.1.0/24 dev ens5 proto kernel scope link src 10.0.1.173 metric 512 
10.0.1.1 dev ens5 proto dhcp scope link src 10.0.1.173 metric 512 

=== IMDS: ENI INFORMATION ===
ENI MACs found: 02:05:0c:f2:10:a3/ 02:2c:70:d5:90:bb/ 

--- ENI: 02:05:0c:f2:10:a3/ ---
  device-number: 0
  local-ipv4s: 10.0.1.173
  subnet-id: subnet-08003d764f3ce9986
  subnet-cidr: 10.0.1.0/24

--- ENI: 02:2c:70:d5:90:bb/ ---
  device-number: 1
  local-ipv4s: 10.0.1.129
  subnet-id: subnet-08003d764f3ce9986
  subnet-cidr: 10.0.1.0/24


=== GATEWAY ARP TEST ===
Gateway IP: 10.0.1.1
Gateway ARP entry:
10.0.1.1 dev ens5 lladdr 02:ef:8f:74:ee:5d REACHABLE 

arping result:
ARPING 10.0.1.1 from 10.0.1.173 ens5
Unicast reply from 10.0.1.1 [02:EF:8F:74:EE:5D]  0.533ms
Sent 1 probes (1 broadcast(s))
Received 1 response(s)

=== HUGEPAGE STATUS ===
AnonHugePages:         0 kB
ShmemHugePages:        0 kB
FileHugePages:     51200 kB
HugePages_Total:    1024
HugePages_Free:     1024
HugePages_Rsvd:        0
HugePages_Surp:        0
Hugepagesize:       2048 kB
Hugetlb:         2097152 kB

=== VFIO STATUS ===
total 0
drwxr-xr-x.  2 root root       80 Jul  2 00:11 .
drwxr-xr-x. 14 root root     3100 Jul  2 00:10 ..
crw-------.  1 root root 243,   0 Jul  2 00:11 noiommu-0
crw-rw-rw-.  1 root root  10, 196 Jul  2 00:03 vfio

noiommu mode:
Y

=== DPDK SHARED MEMORY ===
no /var/run/dpdk/ directory (clean state)

=== DPDK-RELATED DMESG (last 30 lines) ===
[    0.053586] printk: legacy console [ttyS0] enabled
[    0.054597] x2apic enabled
[    0.058724] mitigations: Enabled attack vectors: user_kernel, user_user, guest_host, guest_guest, SMT mitigations: auto
[    0.058831] x86/fpu: Enabled xstate features 0x2ff, context size is 2568 bytes, using 'compacted' format.
[    0.068569] audit: type=2000 audit(1782950591.300:1): state=initialized audit_enabled=0 res=1
[    0.088068] kprobes: kprobe jump-optimization is enabled. All kprobes are optimized if possible.
[    0.140183] ACPI: Interpreter enabled
[    0.140183] ACPI: Enabled 2 GPEs in block 00 to 0F
[    0.158135] pci 0000:00:05.0: enabling Extended Tags
[    0.234717] SGI XFS with ACLs, security attributes, quota, no debug enabled
[    0.249905] ACPI: \_SB_.LNKD: Enabled at IRQ 11
[    0.348991] IPI shorthand broadcast: enabled
[    3.066464] systemd[1]: Mounting dev-hugepages.mount - Huge Pages File System...
[    3.096733] systemd[1]: Mounted dev-hugepages.mount - Huge Pages File System.
[    3.192824] VFIO - User Level meta-driver version: 0.3
[    3.550159] ena 0000:00:05.0: Elastic Network Adapter (ENA) v2.17.1g
[    3.567402] ena 0000:00:05.0: ENA device version: 0.10
[    3.568117] ena 0000:00:05.0: ENA controller version: 0.0.1 implementation version 1
[    3.674573] ena 0000:00:05.0: ENA Large LLQ is disabled
[    3.686567] ena 0000:00:05.0: Elastic Network Adapter (ENA) found at mem c0500000, mac addr 02:05:0c:f2:10:a3
[    3.734338] ena 0000:00:05.0 ens5: renamed from eth0
[  432.396663] pci 0000:00:06.0: enabling Extended Tags
[  432.400570] ena 0000:00:06.0: enabling device (0000 -> 0002)
[  432.411963] ena 0000:00:06.0: ENA device version: 0.10
[  432.412720] ena 0000:00:06.0: ENA controller version: 0.0.1 implementation version 1
[  432.511944] ena 0000:00:06.0: ENA Large LLQ is disabled
[  432.524097] ena 0000:00:06.0: Elastic Network Adapter (ENA) found at mem c0508000, mac addr 02:2c:70:d5:90:bb
[  432.530519] ena 0000:00:06.0 ens6: renamed from eth0
[  473.805573] vfio-pci 0000:00:06.0: Adding to iommu group 0
[  473.807017] vfio-pci 0000:00:06.0: Adding kernel taint for vfio-noiommu group on device

=== DPDK-RELATED PROCESSES ===
no DPDK processes running

=== END DIAGNOSTICS ===

sender-networking-diag-baseline.txt

=== NETWORKING DIAGNOSTICS ===
timestamp: 2026-07-02T00:11:10Z
hostname: ip-10-0-1-220.ec2.internal
kernel: 6.18.35-68.129.amzn2023.x86_64

=== DPDK PORT STATUS ===

Network devices using DPDK-compatible driver
============================================
0000:00:06.0 'Elastic Network Adapter (ENA) ec20' drv=vfio-pci unused=ena

Network devices using kernel driver
===================================
0000:00:05.0 'Elastic Network Adapter (ENA) ec20' if=ens5 drv=ena unused=vfio-pci *Active*

No 'Baseband' devices detected
==============================

No 'Crypto' devices detected
============================

No 'DMA' devices detected
=========================

No 'Eventdev' devices detected
==============================

No 'Mempool' devices detected
=============================

No 'Compress' devices detected
==============================

No 'Misc (rawdev)' devices detected
===================================

No 'Regex' devices detected
===========================

=== IP ADDRESSES ===
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
       valid_lft forever preferred_lft forever
    inet6 ::1/128 scope host noprefixroute 
       valid_lft forever preferred_lft forever
2: ens5: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 9001 qdisc mq state UP group default qlen 1000
    link/ether 02:79:ec:41:e9:8f brd ff:ff:ff:ff:ff:ff
    altname enp0s5
    altname eni-0e6eb2c9e495d854f
    altname device-number-0.0
    inet 10.0.1.220/24 metric 512 brd 10.0.1.255 scope global dynamic ens5
       valid_lft 3127sec preferred_lft 3127sec
    inet6 fe80::79:ecff:fe41:e98f/64 scope link proto kernel_ll 
       valid_lft forever preferred_lft forever

=== ARP TABLE ===
10.0.1.1 dev ens5 lladdr 02:ef:8f:74:ee:5d REACHABLE 
10.0.1.79 dev ens5 lladdr 02:3e:9b:d0:71:9f REACHABLE 

=== ROUTE TABLE ===
default via 10.0.1.1 dev ens5 proto dhcp src 10.0.1.220 metric 512 
10.0.0.2 via 10.0.1.1 dev ens5 proto dhcp src 10.0.1.220 metric 512 
10.0.1.0/24 dev ens5 proto kernel scope link src 10.0.1.220 metric 512 
10.0.1.1 dev ens5 proto dhcp scope link src 10.0.1.220 metric 512 

=== IMDS: ENI INFORMATION ===
ENI MACs found: 02:02:89:fa:2c:39/ 02:79:ec:41:e9:8f/ 

--- ENI: 02:02:89:fa:2c:39/ ---
  device-number: 1
  local-ipv4s: 10.0.1.168
  subnet-id: subnet-08003d764f3ce9986
  subnet-cidr: 10.0.1.0/24

--- ENI: 02:79:ec:41:e9:8f/ ---
  device-number: 0
  local-ipv4s: 10.0.1.220
  subnet-id: subnet-08003d764f3ce9986
  subnet-cidr: 10.0.1.0/24


=== GATEWAY ARP TEST ===
Gateway IP: 10.0.1.1
Gateway ARP entry:
10.0.1.1 dev ens5 lladdr 02:ef:8f:74:ee:5d REACHABLE 

arping result:
ARPING 10.0.1.1 from 10.0.1.220 ens5
Unicast reply from 10.0.1.1 [02:EF:8F:74:EE:5D]  0.537ms
Sent 1 probes (1 broadcast(s))
Received 1 response(s)

=== HUGEPAGE STATUS ===
AnonHugePages:         0 kB
ShmemHugePages:        0 kB
FileHugePages:     51200 kB
HugePages_Total:    1024
HugePages_Free:     1024
HugePages_Rsvd:        0
HugePages_Surp:        0
Hugepagesize:       2048 kB
Hugetlb:         2097152 kB

=== VFIO STATUS ===
total 0
drwxr-xr-x.  2 root root       80 Jul  2 00:10 .
drwxr-xr-x. 14 root root     3100 Jul  2 00:10 ..
crw-------.  1 root root 243,   0 Jul  2 00:10 noiommu-0
crw-rw-rw-.  1 root root  10, 196 Jul  2 00:03 vfio

noiommu mode:
Y

=== DPDK SHARED MEMORY ===
no /var/run/dpdk/ directory (clean state)

=== DPDK-RELATED DMESG (last 30 lines) ===
[    0.053849] printk: legacy console [ttyS0] enabled
[    0.054851] x2apic enabled
[    0.058982] mitigations: Enabled attack vectors: user_kernel, user_user, guest_host, guest_guest, SMT mitigations: auto
[    0.059092] x86/fpu: Enabled xstate features 0x2ff, context size is 2568 bytes, using 'compacted' format.
[    0.068853] audit: type=2000 audit(1782950592.328:1): state=initialized audit_enabled=0 res=1
[    0.088346] kprobes: kprobe jump-optimization is enabled. All kprobes are optimized if possible.
[    0.140901] ACPI: Interpreter enabled
[    0.140901] ACPI: Enabled 2 GPEs in block 00 to 0F
[    0.152141] pci 0000:00:05.0: enabling Extended Tags
[    0.234005] SGI XFS with ACLs, security attributes, quota, no debug enabled
[    0.249661] ACPI: \_SB_.LNKD: Enabled at IRQ 11
[    0.346968] IPI shorthand broadcast: enabled
[    2.896589] systemd[1]: Mounting dev-hugepages.mount - Huge Pages File System...
[    2.935042] systemd[1]: Mounted dev-hugepages.mount - Huge Pages File System.
[    2.983386] VFIO - User Level meta-driver version: 0.3
[    3.409915] ena 0000:00:05.0: Elastic Network Adapter (ENA) v2.17.1g
[    3.432491] ena 0000:00:05.0: ENA device version: 0.10
[    3.433179] ena 0000:00:05.0: ENA controller version: 0.0.1 implementation version 1
[    3.544935] ena 0000:00:05.0: ENA Large LLQ is disabled
[    3.556833] ena 0000:00:05.0: Elastic Network Adapter (ENA) found at mem c0500000, mac addr 02:79:ec:41:e9:8f
[    3.573725] ena 0000:00:05.0 ens5: renamed from eth0
[  431.901098] pci 0000:00:06.0: enabling Extended Tags
[  431.904881] ena 0000:00:06.0: enabling device (0000 -> 0002)
[  431.917877] ena 0000:00:06.0: ENA device version: 0.10
[  431.918671] ena 0000:00:06.0: ENA controller version: 0.0.1 implementation version 1
[  432.019269] ena 0000:00:06.0: ENA Large LLQ is disabled
[  432.031485] ena 0000:00:06.0: Elastic Network Adapter (ENA) found at mem c0508000, mac addr 02:02:89:fa:2c:39
[  432.039029] ena 0000:00:06.0 ens6: renamed from eth0
[  465.712937] vfio-pci 0000:00:06.0: Adding to iommu group 0
[  465.714354] vfio-pci 0000:00:06.0: Adding kernel taint for vfio-noiommu group on device

=== DPDK-RELATED PROCESSES ===
no DPDK processes running

=== END DIAGNOSTICS ===
⚠️ Crash Diagnostics

receiver-dmesg-crashes.log

[    0.067974] pid_max: default: 32768 minimum: 301
[    0.161915] iommu: Default domain type: Translated
[    0.161938] NetLabel:  unlabeled traffic allowed by default
[    0.182763] PCI: CLS 0 bytes, default 64
[    0.270386] nvme nvme0: 2/0/0 default/read/poll queues
[    0.419063] systemd[1]: systemd 252.23-12.amzn2023 running in system mode (+PAM +AUDIT +SELINUX -APPARMOR +IMA +SMACK +SECCOMP -GCRYPT -GNUTLS +OPENSSL +ACL +BLKID +CURL +ELFUTILS +FIDO2 +IDN2 -IDN -IPTC +KMOD +LIBCRYPTSETUP +LIBFDISK +PCRE2 +PWQUALITY +P11KIT +QRENCODE +TPM2 -BZIP2 -LZ4 +XZ +ZLIB -ZSTD +BPF_FRAMEWORK +XKBCOMMON +UTMP +SYSVINIT default-hierarchy=unified)
[    0.419185] systemd[1]: No hostname configured, using default hostname.
[    0.488506] systemd[1]: Queued start job for default target initrd.target.
[    2.195989] systemd[1]: systemd 252.23-12.amzn2023 running in system mode (+PAM +AUDIT +SELINUX -APPARMOR +IMA +SMACK +SECCOMP -GCRYPT -GNUTLS +OPENSSL +ACL +BLKID +CURL +ELFUTILS +FIDO2 +IDN2 -IDN -IPTC +KMOD +LIBCRYPTSETUP +LIBFDISK +PCRE2 +PWQUALITY +P11KIT +QRENCODE +TPM2 -BZIP2 -LZ4 +XZ +ZLIB -ZSTD +BPF_FRAMEWORK +XKBCOMMON +UTMP +SYSVINIT default-hierarchy=unified)
[  517.154526] vfio-pci 0000:00:06.0: vfio-noiommu device opened by user (echo:8346)
[  732.993238] vfio-pci 0000:00:06.0: vfio-noiommu device opened by user (echo:8783)
[ 1331.159485] vfio-pci 0000:00:06.0: vfio-noiommu device opened by user (echo:9868)

sender-dmesg-crashes.log

[    0.068242] pid_max: default: 32768 minimum: 301
[    0.155952] iommu: Default domain type: Translated
[    0.162370] NetLabel:  unlabeled traffic allowed by default
[    0.179184] PCI: CLS 0 bytes, default 64
[    0.265208] nvme nvme0: 2/0/0 default/read/poll queues
[    0.449414] systemd[1]: systemd 252.23-12.amzn2023 running in system mode (+PAM +AUDIT +SELINUX -APPARMOR +IMA +SMACK +SECCOMP -GCRYPT -GNUTLS +OPENSSL +ACL +BLKID +CURL +ELFUTILS +FIDO2 +IDN2 -IDN -IPTC +KMOD +LIBCRYPTSETUP +LIBFDISK +PCRE2 +PWQUALITY +P11KIT +QRENCODE +TPM2 -BZIP2 -LZ4 +XZ +ZLIB -ZSTD +BPF_FRAMEWORK +XKBCOMMON +UTMP +SYSVINIT default-hierarchy=unified)
[    0.449543] systemd[1]: No hostname configured, using default hostname.
[    0.509946] systemd[1]: Queued start job for default target initrd.target.
[    2.280488] systemd[1]: systemd 252.23-12.amzn2023 running in system mode (+PAM +AUDIT +SELINUX -APPARMOR +IMA +SMACK +SECCOMP -GCRYPT -GNUTLS +OPENSSL +ACL +BLKID +CURL +ELFUTILS +FIDO2 +IDN2 -IDN -IPTC +KMOD +LIBCRYPTSETUP +LIBFDISK +PCRE2 +PWQUALITY +P11KIT +QRENCODE +TPM2 -BZIP2 -LZ4 +XZ +ZLIB -ZSTD +BPF_FRAMEWORK +XKBCOMMON +UTMP +SYSVINIT default-hierarchy=unified)
[ 1149.753313] vfio-pci 0000:00:06.0: vfio-noiommu device opened by user (echo:9597)
Kernel Console (dmesg)

receiver-console-output.log (PCI/driver events only)

[  517.152998] vfio-pci 0000:00:06.0: reset done
[  517.154526] vfio-pci 0000:00:06.0: vfio-noiommu device opened by user (echo:8346)
[  517.155926] vfio-pci 0000:00:06.0: resetting
[  517.372835] vfio-pci 0000:00:06.0: reset done
[  592.242806] vfio-pci 0000:00:06.0: Removing from iommu group 0
[  593.250919] ena 0000:00:06.0: ENA device version: 0.10
[  593.251679] ena 0000:00:06.0: ENA controller version: 0.0.1 implementation version 1
[  593.350902] ena 0000:00:06.0: ENA Large LLQ is disabled
[  593.363713] ena 0000:00:06.0: Elastic Network Adapter (ENA) found at mem c0508000, mac addr 02:2c:70:d5:90:bb
[  593.370977] ena 0000:00:06.0 ens6: renamed from eth0
[  613.154693] vfio-pci 0000:00:06.0: Adding to iommu group 0
[  613.155887] vfio-pci 0000:00:06.0: Adding kernel taint for vfio-noiommu group on device
[  732.764868] vfio-pci 0000:00:06.0: resetting
[  732.991682] vfio-pci 0000:00:06.0: reset done
[  732.993238] vfio-pci 0000:00:06.0: vfio-noiommu device opened by user (echo:8783)
[  732.994644] vfio-pci 0000:00:06.0: resetting
[  733.211453] vfio-pci 0000:00:06.0: reset done
[  808.305609] vfio-pci 0000:00:06.0: Removing from iommu group 0
[  809.318605] ena 0000:00:06.0: ENA device version: 0.10
[  809.319361] ena 0000:00:06.0: ENA controller version: 0.0.1 implementation version 1
[  809.419563] ena 0000:00:06.0: ENA Large LLQ is disabled
[  809.431896] ena 0000:00:06.0: Elastic Network Adapter (ENA) found at mem c0508000, mac addr 02:2c:70:d5:90:bb
[  809.442441] ena 0000:00:06.0 ens6: renamed from eth0 (while UP)
[ 1250.830690] vfio-pci 0000:00:06.0: Adding to iommu group 0
[ 1250.832093] vfio-pci 0000:00:06.0: Adding kernel taint for vfio-noiommu group on device
[ 1330.937986] vfio-pci 0000:00:06.0: resetting
[ 1331.157937] vfio-pci 0000:00:06.0: reset done
[ 1331.159485] vfio-pci 0000:00:06.0: vfio-noiommu device opened by user (echo:9868)
[ 1331.160901] vfio-pci 0000:00:06.0: resetting
[ 1331.387732] vfio-pci 0000:00:06.0: reset done

sender-console-output.log (PCI/driver events only)

[ 1149.754800] vfio-pci 0000:00:06.0: resetting
[ 1149.971688] vfio-pci 0000:00:06.0: reset done
[ 1212.073814] vfio-pci 0000:00:06.0: Removing from iommu group 0
[ 1213.086907] ena 0000:00:06.0: ENA device version: 0.10
[ 1213.087619] ena 0000:00:06.0: ENA controller version: 0.0.1 implementation version 1
[ 1213.188742] ena 0000:00:06.0: ENA Large LLQ is disabled
[ 1213.201318] ena 0000:00:06.0: Elastic Network Adapter (ENA) found at mem c0508000, mac addr 02:02:89:fa:2c:39
[ 1213.212118] ena 0000:00:06.0 ens6: renamed from eth0 (while UP)
[ 1242.842913] vfio-pci 0000:00:06.0: Adding to iommu group 0
[ 1242.844255] vfio-pci 0000:00:06.0: Adding kernel taint for vfio-noiommu group on device
[ 1345.855716] vfio-pci 0000:00:06.0: resetting
[ 1346.069336] vfio-pci 0000:00:06.0: reset done
[ 1346.070843] vfio-pci 0000:00:06.0: vfio-noiommu device opened by user (tokio-rt-worker:10039)
[ 1346.072353] vfio-pci 0000:00:06.0: resetting
[ 1346.289208] vfio-pci 0000:00:06.0: reset done
[ 1346.567100] vfio-pci 0000:00:06.0: resetting
[ 1346.789215] vfio-pci 0000:00:06.0: reset done
[ 1346.790745] vfio-pci 0000:00:06.0: vfio-noiommu device opened by user (tokio-rt-worker:10059)
[ 1346.792279] vfio-pci 0000:00:06.0: resetting
[ 1347.009171] vfio-pci 0000:00:06.0: reset done
[ 1348.299625] vfio-pci 0000:00:06.0: resetting
[ 1348.519190] vfio-pci 0000:00:06.0: reset done
[ 1348.520694] vfio-pci 0000:00:06.0: vfio-noiommu device opened by user (tokio-rt-worker:10082)
[ 1348.522215] vfio-pci 0000:00:06.0: resetting
[ 1348.739128] vfio-pci 0000:00:06.0: reset done
[ 1350.019627] vfio-pci 0000:00:06.0: resetting
[ 1350.239174] vfio-pci 0000:00:06.0: reset done
[ 1350.240685] vfio-pci 0000:00:06.0: vfio-noiommu device opened by user (tokio-rt-worker:10105)
[ 1350.242197] vfio-pci 0000:00:06.0: resetting
[ 1350.459109] vfio-pci 0000:00:06.0: reset done

@github-actions

github-actions Bot commented Jul 2, 2026

Copy link
Copy Markdown

✅ Integration Tests Passed — Graviton (run)

Branch: 108/merge | Commit: ba973e7a

Test Results

  • tier1-dpdk-echo.xml: 6 tests, 0 failures
  • tier2-kernel-interop.xml: 4 tests, 0 failures
  • tier3-iperf-sends.xml: 1 tests, 0 failures
  • tier3-our-app-sends.xml: 1 tests, 0 failures

Application Logs (last 20 lines)

receiver-echo-server.log

EAL: Detected CPU lcores: 2
EAL: Detected NUMA nodes: 1
EAL: Detected shared linkage of DPDK
EAL: Multi-process socket /var/run/dpdk/rte/mp_socket
EAL: Selected IOVA mode 'PA'
EAL: VFIO support initialized
EAL: Using IOMMU type 8 (No-IOMMU)
EAL: Probe PCI driver: net_ena (1d0f:ec20) device: 0000:00:06.0 (socket -1)
TELEMETRY: No legacy callbacks, legacy socket not created
Warning: Some RX offloads not supported by device (flags: 0x1)
Warning: Some TX offloads not supported by device (flags: 0x1)
✅ DPDK UDP socket bound to 10.0.1.10:9000 (MAC: 02:90:a6:69:14:fb)
echo listening on 10.0.1.10:9000 (MTU=9001, max_udp_payload=8973)
Shutting down gracefully...

sender-echo-server.log

EAL: Detected CPU lcores: 2
EAL: Detected NUMA nodes: 1
EAL: Detected shared linkage of DPDK
EAL: Multi-process socket /var/run/dpdk/rte/mp_socket
EAL: Selected IOVA mode 'PA'
EAL: VFIO support initialized
EAL: Using IOMMU type 8 (No-IOMMU)
EAL: Probe PCI driver: net_ena (1d0f:ec20) device: 0000:00:06.0 (socket -1)
TELEMETRY: No legacy callbacks, legacy socket not created
Warning: Some RX offloads not supported by device (flags: 0x1)
Warning: Some TX offloads not supported by device (flags: 0x1)
✅ DPDK UDP socket bound to 10.0.1.28:9000 (MAC: 02:6f:88:82:bf:ed)
echo listening on 10.0.1.28:9000 (MTU=9001, max_udp_payload=8973)
Shutting down gracefully...

sender-test-client.log

EAL: Multi-process socket /var/run/dpdk/rte/mp_socket
EAL: Selected IOVA mode 'PA'
EAL: VFIO support initialized
TELEMETRY: No legacy callbacks, legacy socket not created
DPDK bind failed (Port init failed: Invalid port ID: 0), falling back to tokio
Backend: tokio
Sending packets...
Sent 12 bytes: 'arp-probe #1'
Received 12 bytes from 10.0.1.10:9000: 'arp-probe #1'
Test complete
[2026-07-02T00:15:45Z] INFO: ARP resolution succeeded (got response from peer)
[2026-07-02T00:15:45Z] INFO: Test: udp_send_receive
[2026-07-02T00:15:46Z] INFO: UDP send/receive succeeded
[2026-07-02T00:15:46Z] INFO: Test: echo_roundtrip
[2026-07-02T00:15:47Z] INFO: Echo roundtrip succeeded: 5/5 responses received
[2026-07-02T00:15:47Z] INFO: Test: payload_integrity
[2026-07-02T00:15:47Z] INFO: Response received, checking payload match...
[2026-07-02T00:15:47Z] INFO: Payload integrity verified (found in response)
[2026-07-02T00:15:47Z] INFO: JUnit XML written to: /tmp/test-results/tier2-kernel-interop.xml
[2026-07-02T00:15:47Z] INFO: Tier 2 sender tests complete. Results: /tmp/test-results/tier2-kernel-interop.xml

@gspivey gspivey merged commit dcb2b12 into development Jul 2, 2026
7 checks passed
@gspivey gspivey deleted the agent/dpdk-tx-frame-fix branch July 2, 2026 00:35
gspivey added a commit that referenced this pull request Jul 2, 2026
…#109)

Companion to #108. Perf run
[28552788420](https://github.com/gspivey/dpdk-stdlib-rust/actions/runs/28552788420)
actually produced real per-config result JSON — but retrieving it with
`cat` over SSM truncated it at **~24,000 chars** (AWS SSM
`StandardOutputContent` cap). Every downloaded file was exactly **23984
bytes** and ended mid-structure with `--output truncated--`, so JSON
parsing failed and the numbers were lost.

**Fix:** retrieve with `gzip -c file | base64 -w0` between markers,
decode locally. gzip shrinks the JSON ~15× (24 KB → **1.6 KB** on the
wire), far under the cap. Verified locally end-to-end (marker-wrap →
extract → `base64 -d` → `gunzip` recovers the bytes byte-exact); harness
unit tests green.

**Both #108 and this are needed for a green TCP perf run:** #108 makes
the DUT actually establish connections; this delivers the resulting
metrics intact. Merge both, then re-dispatch.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant