Human-Aligned Decision Transformers for satellite anomaly response operations with inverse simulation verification A Discovery Born from a Late-Night Simulation It was 2:47 AM, and I was staring at a terminal window filled with telemetry data from a simulated satellite constellation. For weeks, I had been experimenting with Decision Transformers—a class of models that frame reinforcement learning as a sequence modeling problem—and I was stuck. The models could predict optimal actions for nominal

Human-Aligned Decision Transformers for satellite anomaly response operations with inverse simulation verification
Rikin Patel
