How I Built an AI Agent That Handles On-Call Incidents and Pauses for Human Approval Before Touching Production

Krishna shakula
The Problem It's 3 AM. PagerDuty fires. You drag yourself to your laptop. Open Grafana. Squint at a spike. Switch to Kibana, filter logs, grep for errors. Cross-reference a recent deployment. Form a hypothesis. Write a Slack message explaining what you found. Wait for someone to approve your fix. Apply it. Verify it worked. Then spend an hour writing a post-mortem that goes into a folder nobody opens. You do this for every incident. Every single time. I've been that engineer. So I built IRAS an