Experience

Production work with real stakes — and real lessons learned.

Since September 2024, I've been at a high-availability service provider where resilient systems, delivery velocity, and safe change management aren't just goals — they're daily necessities.

Keeping the lights on

I operate payment-critical AWS workloads across EC2, ECS, RDS, and EMR. That means on-call rotations, runbooks, incident response, and the occasional 3 AM debugging session. I've learned that the best metric isn't zero incidents — it's how fast we recover and how much we learn from each one.

CloudWatch New Relic SNS

Making releases less scary

I design GitLab CI/CD flows with BuildKit, scalable runners, and Terraform-based release safety. My favorite part is watching a team go from 'we deploy once a month because we're afraid' to 'we deploy daily because we trust the pipeline.'

GitLab CI/CD BuildKit Terraform

Saving legacy systems

I led CentOS 7 to AlmaLinux 8 migrations and production cutovers with zero-downtime expectations. These projects taught me that rollback planning and clear communication matter just as much as the technical execution.

Linux Runbooks SRE

Stack

Tools I know and use regularly.

The stack is broad, but the pattern is consistent: strong automation, strong operations, low-noise tooling.

Cloud

  • AWS (EC2, ECS, RDS, EMR, Route 53)
  • Azure workload support
  • PrivateLink / VPC networking

DevOps

  • Terraform modules
  • GitLab CI/CD
  • Docker / BuildKit
  • Blue-green rollout hygiene

Programming

  • Python automation
  • Go services
  • Bash scripting
  • PowerShell

Tools

  • CloudWatch
  • New Relic
  • Cloudflare
  • ALB / NLB