Surge AI
@HelloSurgeAI
Our mission is to raise AGI with the richness of humanity — curious, witty, imaginative, and full of breathtaking brilliance.
GPT-5.1: now with 20% more warmth and personality 😅 When GPT-5 launched in Aug, users were furious that they lost 4o. Did 4o have a better tone & personality? Yup - we'd actually measured it: 850 convos later, they were right. 4o was slightly preferred surgehq.ai/blog/bringing-……
GPT-5.1 is out! It's a nice upgrade. I particularly like the improvements in instruction following, and the adaptive thinking. The intelligence and style improvements are good too.
a bit of good grounding in a world of hype
Everyone's acting like models are ready to replace humans in work settings. We put that to the test by creating an entire company and having 9 models act as a customer service agent handling 150 tickets and requests of increasing complexity. Verdict: without common sense,…
incredible case study of what goes into building realistic RL environments. very exciting to see Surge creating some excellent resources here :)
Is AI more likely to make you a billion dollars … or lose it?
we made gpt-5, claude, and gemini do real wall street work. then we asked 200 finance pros to grade them. one model produced basel capital numbers that would get a real bank fined. 😱 study -> surgehq.ai/blog/finance-e…
You don't get to $1 billion in revenue by sitting around. Don't worry, you didn't miss the chance to hear from @HelloSurgeAI founder @echen in person. November 12th in SF. RSVP below.
Teaching LLMs to follow instructions? Step 1. Teaching them to have taste? That's the endgame. An 8-line poem about the moon can check every box: ✅ Moon: mentioned ✅ Lines: 8 ✅ Rhymes: yes! ...and still be completely forgettable. The models that win aren't the most obedient.…
The Startup Powering The Data Behind AGI
A good model can typically ace an academic benchmark. But even the best model out there will often hit a wall as soon as it’s handed a messy, real-world problem to solve. That’s why we build our own RL environments, to help frontier labs create models that can cope with the…
We’ve been chatting with our Surge Research Fellows - Fields Medalists, Harvard professors, frontier scholars. The consensus: the real bottleneck in AI isn’t intelligence. It’s reliability. As one Fellow told us: > Models look convincing but are wrong. In math, that’s fixable…