RLHF: How ChatGPT Learned What You Actually Want
You have probably noticed that ChatGPT gives surprisingly useful answers.
Not just correct — useful. It matches your tone, reads between the lines, knows when to be brief and when to go deep.
But here
changeofbasis.hashnode.dev5 min read