
devops

This week, I was updating the image of a FastAPI app in our Kubernetes cluster, but I took the whole app down because the process failed due to an incompatible dependency with our server. The updated pod was unable to start, but we didn't have health checks in place, so the deployment continued to update the other replicas, taking down all app instances. In this tutorial, I will explain how to ad…
Stop hardcoding! Use AWS Parameter Store instead (Hands-On) This article is a practical guide for DevOps engineers who want to manage configuration and secrets the right way on AWS. Introduction If you have a database hostname, a password, and an API URL your application needs to call, where do you put them? The wrong answer is “environment variables hardcoded into your Lambda function” . The iss…
Serverless deployment with NEXUS AI: custom domains, scaling, rollback, and more Published: April 25, 2026 Category: Platform · DevOps Reading time: 16 minutes Author: NEXUS AI Team Serverless means different things to different teams. For most, it means: don't manage servers, don't think about capacity until it matters, and pay for what you actually run. That premise is right. The implementation…
Manual ML retraining is a reliability risk. Azure ML Pipelines orchestrates the ML workflow while Azure DevOps automates testing, validation, and deployment on every code push. Here's how to build the full CI/CD stack with Terraform. Through Series 5, we've built the workspace, deployed endpoints, and set up the feature store. The final piece is automation. Right now, retraining means a data scie…
Trunk-Based Development vs. Git Flow A decision document comparing two branching strategies. Evaluates each approach against our CI/CD maturity, team velocity, and DevOps best practices — then recommends a path forward. Team: Decision Document Owner: William Weeks-Balconi Updated: April 2026 Audience: Engineering, DevOps, Product Management, Leadership Table of Contents Executive Summary What Is …
How I Built a Self-Healing Database on a 10-Year-Old Laptop (Using Docker + Ansible) A practical experiment in resilience engineering on aging hardware—with modern DevOps tools. 🚀 Introduction Running production-grade systems on old hardware sounds like a bad idea… until you treat it as a lab. I set out to build a self-healing database system on a 10-year-old laptop —but this time with a more mod…
Hi all. This article aims to give you the happy path to get your service (or services) up and running on a kubernetes-managed cluster. Disclaimer I expect you already have a running k3s or kubernetes environment with direct access, such as SSH, for troubleshooting and reviewing changes after applying them. If you're completely new to Kubernetes, please take a moment to learn more about it, becaus…
Introduction In modern DevOps workflows, deploying applications is no longer just about writing code—it’s about automating, scaling, and managing services efficiently. In this project, I built a 2-tier microservices application and deployed it using: Docker for containerization Kubernetes for orchestration Minikube for local cluster Helm for production-style deployment The goal was to simulate a …
“There are only two hard things in computer science: cache invalidation and naming things.” ~ Phil Karlton It’s been about a year since I started using AI daily. In my core role as an SRE/DevOps engineer, AI behaves like a very fast and very obedient assistant. It helps me collect data across services via MCP, generate Terraform modules, work with Helm charts, and automate routine tasks. There is…
Infrastructure as code (IaC) is powerful, but deploying untested changes can be risky. On Day 18 of my 30-Day Terraform Challenge, I focused on automating testing for Terraform code, covering unit tests, integration tests, and end-to-end tests, all tied together in a CI/CD pipeline. Unit tests are fast, cheap, and safe because they test your module plan only—no real resources are created. Each un…
In this blog, let’s take a deep dive into the GH-900, the GitHub Foundation Certification exam. From “What is the GH-900 exam?”, what to study, to “Dos and Don’ts, tricks and more to pass your exam in your 2025 attempt. A complete beginner-friendly starter resource to dominate in the GitHub ecosystem. Let’s get started. What is the GH-900 GitHub Foundation Certification? GH 90…
Platform Engineer Intern eagleview.com About the Internship EagleView is offering an exciting opportunity for a Platform Engineer Intern to work with cutting-edge technologies in cloud engineering, DevOps, and automation. In this role, you will collaborate with senior engineers on projects involving Jenkins, Kubernetes, Terraform, and AWS, gaining hands-on experience in managing and automating cl…
The Hashicorp Certified Terraform Associate (002) Certification is one of the hardest tech certifications in the industry. The certification has a great demand and validates skills for cloud and DevOps engineer roles. In fact, it imparts in-depth knowledge of all the open-source Terraform tools. However, numerous fundamentals exist in the certification syllabus and need thorough practice to crack…
Abstract There has been a fundamental shift amongst software developers and engineers in the past few years. The software development life cycle (SDLC) for a developer has increased in complexity and scale. Changes that were developed and deployed over a matter of days or weeks are now deployed in a matter of hours. Due to greater availability of compute, storage, better tooling, and the necessit…



