Matthew Siegel
@LargerLanguage
AI technical writer and poet. Not simultaneously. Research, warm takes, and cool stuff we're doing at @Scale_AI. For poems: @MatthewSiegel_
thoroughly excited to launch this video series where I break down new research from @Scale_AI. this first one is how we knock out 90% of errors in data that require human review using an autorater. check it out 👀
Lots of people are talking about an AI bubble. Sure. I don't think these people are reading the same research I'm reading.
New @Scale_AI paper! The culprit behind reward hacking? We trace it to misspecification in the high-reward tail. Our fix: rubric-based rewards to tell “excellent” responses apart from “great.” The result: Less hacking, stronger post-training! arxiv.org/pdf/2509.21500
This was a HUGE lift from our research team, huge thanks to everyone who contributed to this benchmark. ...and the blog doesn't look so bad either: scale.com/blog/swe-bench…
Working on this leaderboard page and blog was a LIFT! Huge thank you to every cook in the kitchen!! 🧑‍🍳👩‍🍳👨‍🍳