Aniblog.anirudha.dev路Oct 22, 2024Designing The Perfect Incentivised System - Part 1As I started writing this, it became apparent as to how deep the rabbit hole really is... 馃惏 So, I'm breaking up the discovery phase in multiple parts. If you're just catching up, start at the beginning... 馃弳 Incentive systems are frameworks desig...Perpetual Incentive Systemperpetual
Eshan Jairatheshanjairath.hashnode.dev路May 29, 2023Simple Intro to Reinforcement LearningWhat do think when the word reinforcement comes to your mind? Let's go into the technical theory first according to which Reinforcement learning (RL) is a type of machine learning that involves training an agent to make decisions in an environment by...71 readsArtificial Intelligence
Shahul EsforExploding Gradientsexplodinggradients.com路May 8, 2023Reward Modeling for Large language models (with code)Reward modeling combined with reinforcement learning has enabled the widespread application of large language models by aligning models to accepted human values. Reward Modelling and RLHF have been the hottest words in AI alignment since the release ...2 likes路8.1K readsopensource