Mike Youngmikeyoung44.hashnode.dev·Apr 22, 2024From $r$ to $Q^*$: Your Language Model is Secretly a Q-FunctionThis is a Plain English Papers summary of a research paper called [From $r$ to $Q^$: Your Language Model is Secretly a Q-Function](https://aimodels.fyi/papers/arxiv/from-dollarrdollar-to-dollarqdollar-your-language-model). If you like these kinds of ...Beginner DevelopersAdd a thoughtful commentNo comments yetBe the first to start the conversation.