InstructGPT: fine-tuned GPT-3
Dec 2, 2025 · 2 min read · Large language models (LLMs) frequently produce nonsensical, toxic, or fabricated text that can easily mislead typical users. These unintended behaviours stem from the inherent shortcomings of the language modelling objective: predicting the next token...