@droidkaran22
Nothing here yet.
Nothing here yet.
5d ago · 12 min read · AI agents are finally showing up inside real incident workflows. One agent triages alerts, another scrapes dashboards, a third drafts the remediation plan. Yet 62% of organizations experimenting with
Join discussionOct 11, 2024 · 7 min read · Introduction For Meta, reducing downtime has been crucial to ensuring millions (or should I say Billions?) of users have a seamless experience. Recently, Meta shared about one of their internal platforms that helped reduce MTTR by ~50% for critical a...
Join discussion
Sep 2, 2024 · 8 min read · Introduction Big Tech companies often have scale enough to justify allocating resources to building internal tools. In this blog, we discuss about RCACoPilot -- an automated incident classification and investigation engine built by Microsoft to impro...
Join discussion
Aug 28, 2024 · 2 min read · A playbook is a set of instructions that a Doctor Droid bot or an on-call engineer follows during a production incident. https://www.youtube.com/watch?v=T9KfunP9juA A playbook consists of tasks. A task is an instruction that's executed through the ...
Join discussion