Efficient RL Framework: GRPO eliminates costly critic models in RL. (A critic model in reinforcement learning evaluates the actions taken by an agent (the actor model) by estimating expected rewards, also known as the value function; it guides the agent by providing feedback on how good a particular action or decision is. While effective, critic models are expensive because they often need to be as large and complex as the actor model, roughly doubling the computational cost. They also require constant updates to stay aligned with the actor's learning, adding memory and GPU/CPU overhead. This makes training RL systems with critic models resource-intensive, especially for large-scale models such as language models.)

Floating Point 8 Precision: Reduces memory and compute needs during training and inference.

Mixture-of-Experts Design: Activates only a subset of parameters per query, optimizing the performance-to-cost ratio. (This technique was popularized in open models by Mistral.)

DeepSeek models are trained with custom kernels for efficient GPU-to-GPU communication over NVLink and InfiniBand. Liger-Kernel takes a somewhat similar approach. Ref: https://www.linkedin.com/blog/engineering/open-source/liger-kernel-open-source-ecosystem-for-efficient-llm-training
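The critic-free idea behind GRPO can be sketched in a few lines: instead of a learned value function, the advantage of each sampled response is its reward normalized against the other responses in the same group. This is a minimal illustration (function name and epsilon are my own), not DeepSeek's actual implementation.

```python
def group_relative_advantages(rewards, eps=1e-8):
    """Critic-free advantage estimate: normalize each response's
    reward against the mean/std of its sampled group, instead of
    querying a separate value network."""
    n = len(rewards)
    mean = sum(rewards) / n
    var = sum((r - mean) ** 2 for r in rewards) / n
    std = var ** 0.5
    # eps avoids division by zero when all rewards in the group tie
    return [(r - mean) / (std + eps) for r in rewards]
```

For example, a group of rewards [1.0, 0.0, 0.5, 0.5] yields a positive advantage for the best response, a negative one for the worst, and near-zero for the average ones, with no critic forward pass anywhere.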
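The FP8 memory saving is easy to see with back-of-envelope arithmetic: weights stored in 8-bit floats take half the bytes of 16-bit floats. The parameter count below is a hypothetical example, not a DeepSeek figure.

```python
def weight_memory_gb(num_params, bytes_per_param):
    """Memory for model weights alone, in GiB."""
    return num_params * bytes_per_param / 1024**3

params = 7e9  # hypothetical 7B-parameter model
fp16_gb = weight_memory_gb(params, 2)  # 16-bit floats: 2 bytes each
fp8_gb = weight_memory_gb(params, 1)   # 8-bit floats: 1 byte each
```

The same halving applies to activation and communication bandwidth during training, which is where much of the compute saving comes from.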
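The "subset of parameters per query" point of the MoE design can be sketched with a top-k gate: only the k highest-scoring experts run for a given token, so compute scales with k rather than with the total expert count. This is an illustrative sketch (names are mine), not any model's actual router.

```python
import math

def topk_gate(scores, k=2):
    """Pick the k highest-scoring experts and softmax their scores,
    so only those k experts' parameters are activated for this token."""
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    exps = [math.exp(scores[i]) for i in top]
    z = sum(exps)
    return {i: e / z for i, e in zip(top, exps)}
```

With router scores for four experts and k=2, only two experts receive nonzero weight; the remaining experts contribute no compute for this token.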
