26 Seconds to Find a Straggler: Fleet v0.10 End-to-End on A100 and GH200

Ingero Team
TL;DR

Ingero Fleet v0.10 FOSS is live. We validated the full pipeline end-to-end on two 3-node Lambda Cloud clusters: 3x A100 SXM4 (x86_64) and 3x GH200 (aarch64, 64k pages, Grace kernel 6.8.0-1013-nvidia-64k). Same Fleet + agent + straggler-sink stack on both. One straggler per cluster, injected by removing the matmul workload from one node.

| | A100 | GH200 |
|---|---|---|
| Region | us-east-1 | us-east-3 |
| Kernel | 6.8.0-60-generic, 4k pages | 6.8.0-1013-nvidia-64k, 64k pages |
| Steady-state fleet threshold | 0.88 | 0.88 |
Time to