OpenAI Misled You on RLHF
11 fpgaminer 2 8/17/2025, 6:37:10 AM aerial-toothpaste-34a.notion.site
Comments (2)
macleginn · 1h ago
Everything the post says about the behaviour of OpenAI models seems to be based on pure speculation.
yorwba · 9m ago
Yeah, in my opinion you can just skip that part and go straight to the author's description of failing to train their own model at first and what they ended up changing to make it work: https://aerial-toothpaste-34a.notion.site/How-OpenAI-Misled-...