Siddartha Pullakhandamsiddartha10.hashnode.dev·Sep 20, 2024Getting Started with Reinforcement Learning with Human FeedbackReinforcement Learning with Human Feedback(RLHF) is a technique combined with Reinforcement learning and human feedback to better align the LLMs with human preferences. This blog covers most of the concepts that i learnt. Before diving deep, let's un...20 likes·37 readsPolicy gradient
Nidal Iguerblog.inidal.dev·Mar 5, 2024The Dawn of AI: From Ancient Myths to Modern RealitiesThe Journey Begins The journey of Artificial Intelligence (AI) began in antiquity, with tales of artificial beings endowed with intelligence or consciousness. This fascination led to philosophical discussions about the nature of intelligence and the ...29 readsDev Journeyhistory
tagxtagxvikas.hashnode.dev·Jan 19, 2024How does Reinforcement Learning from Human Feedback work?In the dynamic realm of artificial intelligence, the integration of Reinforcement Learning from Human Feedback (RLHF) has emerged as a crucial strategy to enhance machine learning algorithms. RLHF introduces a human-in-the-loop element to conventiona...Machine Learning
Ibrahim V Kibrahimvk.hashnode.dev·Dec 23, 2023Decoding ChatGPT: A Journey from Curiosity to UnderstandingIn the rapidly evolving tech landscape, understanding advanced technologies like ChatGPT isn't just a professional requisite; it's a journey of continuous learning and curiosity. My quest to comprehend how ChatGPT works began with a simple yet profou...2 likeschatgptexplained
Toan HoforDwarves Foundation's Team Blogdwarvesf.hashnode.dev·Aug 11, 2023Challenges faced when researching RLHF with OpenAssistantAt Dwarves, we've been working on researching various topics, focused on full-stack engineering as well as AI. One of my research goals was to find out how LLMs and RLHF training worked end-to-end through a chatbot interface: https://www.youtube.com/...25 likes·102 readsAI
Nitin Agarwalcognibits.hashnode.dev·Jun 28, 2023Understanding Reinforcement Learning (RL)Introduction Reinforcement Learning (RL) is a subfield of machine learning that focuses on how agents can learn to make sequential decisions by interacting with an environment. It has gained significant attention in recent years due to its potential ...1 likeReinforcement Learning