Experience & Stack | Wafiy Firdaus

Experience

Production work with real stakes — and real lessons learned.

Since September 2024, I've been at a high-availability service provider where resilient systems, delivery velocity, and safe change management aren't just goals — they're daily necessities.

Keeping the lights on

I operate payment-critical AWS workloads across EC2, ECS, RDS, and EMR. That means on-call rotations, runbooks, incident response, and the occasional 3 AM debugging session. I've learned that the best metric isn't zero incidents — it's how fast we recover and how much we learn from each one.

CloudWatch New Relic SNS

Making releases less scary

I design GitLab CI/CD flows with BuildKit, scalable runners, and Terraform-based release safety. My favorite part is watching a team go from 'we deploy once a month because we're afraid' to 'we deploy daily because we trust the pipeline.'

GitLab CI/CD BuildKit Terraform

Saving legacy systems

I led CentOS 7 to AlmaLinux 8 migrations and production cutovers with zero-downtime expectations. These projects taught me that rollback planning and clear communication matter just as much as the technical execution.

Linux Runbooks SRE

Stack

Tools I know and use regularly.

The stack is broad, but the pattern is consistent: strong automation, strong operations, low-noise tooling.

Cloud

AWS (EC2, ECS, RDS, EMR, Route 53)
Azure workload support
PrivateLink / VPC networking

DevOps

Terraform modules
GitLab CI/CD
Docker / BuildKit
Blue-green rollout hygiene

Programming

Python automation
Go services
Bash scripting
PowerShell

Tools

CloudWatch
New Relic
Cloudflare
ALB / NLB