Sohailmachinadominus.hashnode.dev·Dec 5, 2023Reinforcement LearningThe blog contains all the hyperlinks for all the other concepts in reinforcement learning and this blog will take you from beginner to advanced (only a very beginner level advanced)DiscussReinforcement Learning tutorials
Derek Onwudiwetecheffect.hashnode.dev·Nov 12, 2023Reinforcement Learning : MLReinforcement Learning is a type of machine learning where an agent learns to make decisions by interacting with an environment. The agent receives feedback in the form of rewards or penalties, allowing it to learn the optimal strategy over time. Use...DiscussReinforcement Learning
Akshay Ballalakshayballal.hashnode.dev·Sep 29, 2023Maze Solving Robot with Reinforcement Learning - Part 1Introduction to Reinforcement Learning Hello everyone! Today, we are going to dive into the exciting world of Reinforcement Learning (RL). While RL has proven to be useful in various applications, it often gets overshadowed by the popularity of super...DiscussMachine Learning
Akshay Ballalakshayballal.hashnode.dev·Oct 30, 2023Deep Q Learning with PyTorchIntroduction This blog is going to be my second one on Reinforcement Learning. You can check out the first one here. This blog will show how to use Deep Q Learning (DQN) to solve a reinforcement learning task. As an example, we will deploy DQN to sol...DiscussMachine Learning
Saurabh Naiksaurabhz.hashnode.dev·Oct 26, 2023Decoding Reward Hacking: Unraveling the Challenge and the KL Divergence SolutionIntroduction: Reward hacking, a term that echoes through the corridors of reinforcement learning, poses a unique challenge. It's a scenario where an intelligent agent becomes a crafty trickster, learning to manipulate rewards to its advantage, even i...DiscussGenerative AIArtificial Intelligence
Saurabh Naiksaurabhz.hashnode.dev·Oct 26, 2023Demystifying Reward Models in RLHF: A Comprehensive GuideIntroduction: In the ever-expanding universe of Reinforcement Learning from Human Feedback (RLHF), the role of reward models is nothing short of paramount. These models serve as the cornerstone for fine-tuning Large Language Models (LLMs) to align wi...DiscussGenerative AIgenerative ai
Saurabh Naiksaurabhz.hashnode.dev·Oct 26, 2023Bridging the Gap: How Reinforcement Learning with Human Feedback Transforms LLMs into Human-Aligned ModelsIntroduction: In the ever-evolving landscape of Large Language Models (LLMs), fine-tuning has emerged as a powerful technique to customize these models for specific tasks. However, while instruction fine-tuning has shown immense promise in improving ...DiscussGenerative AIArtificial Intelligence
Kurt Barbarichkbarbarich.hashnode.dev·Oct 18, 2023Playing With AI - The PlanIntroduction I like video games and I like AI. Coincidentally, they work pretty well together. In this series of blogs, I am going to be learning about and developing AIs with games as a sort of backdrop and writing about my experience. I'd like to s...DiscussAI
ANISH DEdatavoyagerx.hashnode.dev·Oct 13, 2023Strap In for a Cosmic Code Dive: Building and Navigating OpenAI's Lunar Lander ModelGreetings, code aficionados and interstellar architects! 🚀 Today, we embark on an odyssey through the intricacies of OpenAI's Lunar Lander Model, unravelling the professional nuances of constructing and navigating this remarkable piece of artificial...DiscussReinforcement Learning
Isfak Ahmedisfak98.hashnode.dev·Oct 12, 2023Machine Learning- (Explained)Machine learning is a subset of Artificial Intelligence. The development of algorithms and statistical models that enable computers to improve their performance on a specific task through experience and data without being explicitly programmed is kno...DiscussWeMakeDevs