devops

DEV Community

The Problem: We Were Flying Blind At Refer , we're on a mission to enable talented individuals to fulfill their professional potential by helping them pursue their ideal job. Behind the scenes, that means a lot of microservices, and recently we decided to consolidate everything into a mono-repository. If you've ever migrated dozens of microservices into a monorepo, you know the drill: contracts b…

computer-sciencedevopssoftware-engineering
DEV Community

This week, I was updating the image of a FastAPI app in our Kubernetes cluster, but I took the whole app down because the process failed due to an incompatible dependency with our server. The updated pod was unable to start, but we didn't have health checks in place, so the deployment continued to update the other replicas, taking down all app instances. In this tutorial, I will explain how to ad…

computer-sciencedevopsfastapikubernetes
DEV Community

Stop hardcoding! Use AWS Parameter Store instead (Hands-On) This article is a practical guide for DevOps engineers who want to manage configuration and secrets the right way on AWS. Introduction If you have a database hostname, a password, and an API URL your application needs to call, where do you put them? The wrong answer is “environment variables hardcoded into your Lambda function” . The iss…

computer-sciencedevops
DEV Community

Serverless deployment with NEXUS AI: custom domains, scaling, rollback, and more Published: April 25, 2026 Category: Platform · DevOps Reading time: 16 minutes Author: NEXUS AI Team Serverless means different things to different teams. For most, it means: don't manage servers, don't think about capacity until it matters, and pay for what you actually run. That premise is right. The implementation…

aicloud-computingcomputer-sciencedevops
DEV Community

Manual ML retraining is a reliability risk. Azure ML Pipelines orchestrates the ML workflow while Azure DevOps automates testing, validation, and deployment on every code push. Here's how to build the full CI/CD stack with Terraform. Through Series 5, we've built the workspace, deployed endpoints, and set up the feature store. The final piece is automation. Right now, retraining means a data scie…

aidevopsmachine-learning
DEV Community

Trunk-Based Development vs. Git Flow A decision document comparing two branching strategies. Evaluates each approach against our CI/CD maturity, team velocity, and DevOps best practices — then recommends a path forward. Team: Decision Document Owner: William Weeks-Balconi Updated: April 2026 Audience: Engineering, DevOps, Product Management, Leadership Table of Contents Executive Summary What Is …

devopsengineering
DEV Community

How I Built a Self-Healing Database on a 10-Year-Old Laptop (Using Docker + Ansible) A practical experiment in resilience engineering on aging hardware—with modern DevOps tools. 🚀 Introduction Running production-grade systems on old hardware sounds like a bad idea… until you treat it as a lab. I set out to build a self-healing database system on a 10-year-old laptop —but this time with a more mod…

devopsengineering
DEV Community

Hi all. This article aims to give you the happy path to get your service (or services) up and running on a kubernetes-managed cluster. Disclaimer I expect you already have a running k3s or kubernetes environment with direct access, such as SSH, for troubleshooting and reviewing changes after applying them. If you're completely new to Kubernetes, please take a moment to learn more about it, becaus…

computer-sciencedevopssoftware-engineering
DEV Community

Introduction In modern DevOps workflows, deploying applications is no longer just about writing code—it’s about automating, scaling, and managing services efficiently. In this project, I built a 2-tier microservices application and deployed it using: Docker for containerization Kubernetes for orchestration Minikube for local cluster Helm for production-style deployment The goal was to simulate a …

computer-sciencedevops
DEV Community

“There are only two hard things in computer science: cache invalidation and naming things.” ~ Phil Karlton It’s been about a year since I started using AI daily. In my core role as an SRE/DevOps engineer, AI behaves like a very fast and very obedient assistant. It helps me collect data across services via MCP, generate Terraform modules, work with Helm charts, and automate routine tasks. There is…

aicomputer-sciencedevopsmachine-learning
DEV Community

Infrastructure as code (IaC) is powerful, but deploying untested changes can be risky. On Day 18 of my 30-Day Terraform Challenge, I focused on automating testing for Terraform code, covering unit tests, integration tests, and end-to-end tests, all tied together in a CI/CD pipeline. Unit tests are fast, cheap, and safe because they test your module plan only—no real resources are created. Each un…

computer-sciencedevopssoftware-engineering
Whizlabs Blog

In this blog, let’s take a deep dive into the GH-900, the GitHub Foundation Certification exam. From “What is the GH-900 exam?”, what to study, to “Dos and Don’ts,  tricks and more to pass your exam in your 2025 attempt. A complete beginner-friendly starter resource to dominate in the GitHub ecosystem. Let’s get started.  What is the GH-900 GitHub Foundation Certification? GH 90…

aicomputer-sciencedevopsprogramming-languages
Upcoming Engineer
Sri Rahul Raghav
12/25/2024

Platform Engineer Intern eagleview.com About the Internship EagleView is offering an exciting opportunity for a Platform Engineer Intern to work with cutting-edge technologies in cloud engineering, DevOps, and automation. In this role, you will collaborate with senior engineers on projects involving Jenkins, Kubernetes, Terraform, and AWS, gaining hands-on experience in managing and automating cl…

cloud-computingcomputer-sciencedevops
Whizlabs Blog

The Hashicorp Certified Terraform Associate (002) Certification is one of the hardest tech certifications in the industry. The certification has a great demand and validates skills for cloud and DevOps engineer roles. In fact, it imparts in-depth knowledge of all the open-source Terraform tools. However, numerous fundamentals exist in the certification syllabus and need thorough practice to crack…

cloud-computingdevopstechnology
Software Engineering Research Group

Abstract There has been a fundamental shift amongst software developers and engineers in the past few years. The software development life cycle (SDLC) for a developer has increased in complexity and scale. Changes that were developed and deployed over a matter of days or weeks are now deployed in a matter of hours. Due to greater availability of compute, storage, better tooling, and the necessit…

aidevopsmachine-learning