devopsofworld.comMigrating Redis OSS Across AWS Accounts — Real Issues Faced and the Production-Safe SolutionMigrating Redis OSS data across AWS accounts sounds simple: Export snapshot → Restore in another account. In practice, it is not that straightforward. In this case, we migrated Redis OSS from Account 5d ago·5 min read
devopsofworld.comCross-Region RDS Disaster Recovery: Production Failover Architecture1. Overview This post documents how I designed and implemented cross-region disaster recovery for a production MySQL database running on Amazon RDS. The requirement was straightforward: If the primary region (ap-south-1) becomes unavailable, the data...6d ago·4 min read
devopsofworld.comMigrating from Amazon Linux 2 to Amazon Linux 2023A Practical Production Playbook (200 Instance Scenario) Amazon Linux 2 (AL2) will reach end-of-support on June 30, 2026. After that date, AWS will no longer provide security updates, patches, or new packages. Although AL2 continues to receive mainten...Feb 26·5 min read
devopsofworld.comProduction Incident: Control Plane Latency During Large-Scale Rollout on Amazon EKS1. Context As part of readiness planning for high-demand production scenarios, we executed a large-scale rollout simulation on one of our production clusters running onAmazon Elastic Kubernetes Service. The cluster hosts thousands of pods, supports a...Feb 25·3 min read
devopsofworld.comProduction Change: Migrating a StatefulSet from Large to Smaller Nodes in EKS (Without Downtime)We had a production application running on Amazon EKS as a StatefulSet.Each replica had its own PersistentVolumeClaim backed by Amazon EBS. During the initial launch phase, we deployed the workload onFeb 24·5 min read