Getting Started with Reinforcement Learning with Human Feedback
Reinforcement Learning with Human Feedback(RLHF) is a technique combined with Reinforcement learning and human feedback to better align the LLMs with human preferences. This blog covers most of the concepts that i learnt.
Before diving deep, let's un...
siddartha10.hashnode.dev6 min read
Subhasya Tippareddy
Understood the concept in single read!