
🔄 RLHF → RLVR → Rubrics → OnlineRubrics
👤 Human feedback = noisy & coarse
🧮 Verifiable rewards = too narrow
📋 Static rubrics = rigid, easy to hack, miss emergent behaviors
💡 We introduce OnlineRubrics: elicited rubrics that evolve as models train.
arxiv.org/abs/2510.07284

Sat down with @lennysan to talk about where AI is headed and how we’re making it work for model builders, enterprises and governments. Also went down memory lane about my time at Uber Eats. 🙂
“I think one of the misunderstandings is that AI is this magic wand or it can solve all problems, and that’s not true today. But there is a ton of value when you get it right.” Our CEO @jdroege shared his AI success framework with CNN's @claresduffy. cnn.com/2025/09/30/tec…
New @Scale_AI paper! The culprit behind reward hacking? We trace it to misspecification in the high-reward tail. Our fix: rubric-based rewards to tell “excellent” responses apart from “great.” The result: less hacking, stronger post-training! arxiv.org/pdf/2509.21500


We’re introducing SEAL Showdown, the AI leaderboard that actually captures real preferences, powered by a platform used by real people. Public benchmarks today rely on contrived tasks or narrow user groups. That leaves us guessing which models are actually preferred by people.…