aditmodi.hashnode.dev · EKS Multi-Account Administration: From GitHub Issue to Production Architecture · In April 2022, a deceptively simple GitHub issue landed in the AWS containers roadmap: "Looking to start a discussion and get some feedback to help guide our strategy around helping customers administ... · 1d ago · 22 min read
cloudtech.hashnode.dev · 👋 Everything about Cloud & Tech Newsletter "#56" ☁️❤👨💻 · Dear Cloud and Tech enthusiasts, Welcome to Everything about Cloud & Tech #56. The infrastructure we’re building right now has a strange quality: it’s both harder and easier than it’s ever been. Harder because the problems are genuinely difficult—dis... · 6d ago · 13 min read
cloudtech.hashnode.dev · 👋 Everything about Cloud & Tech Newsletter "#55" ☁️❤👨💻 · Dear Cloud and Tech enthusiasts, Welcome to Everything about Cloud & Tech #55. This week sits at the intersection of two very different feelings: on one side, we have genuinely hard, exciting problems in front of us—optimizing LLM serving without bur... · Feb 8 · 12 min read
aditmodi.hashnode.dev · Your GPU Training Job Has Been Silently Deadlocking for Months — And Your Scheduler Is the Reason · How the default Kubernetes scheduler, Karpenter, and Volcano create a three-way conflict that wastes GPU hours and nobody warns you about. I've been running distributed GPU training jobs on EKS for a... · Feb 7 · 13 min read
cloudtech.hashnode.dev · 👋 Everything about Cloud & Tech Newsletter "#54" ☁️❤👨💻 · Dear Cloud and Tech enthusiasts, Welcome to Everything about Cloud & Tech #54. Something shifted in the last few weeks, and it wasn't just another feature launch. Engineers who've been writing code for decades are publicly saying their workflow has f... · Feb 2 · 15 min read