HelloSurgeAI's profile picture. Our mission is to raise AGI with the richness of humanity — curious, witty, imaginative, and full of breathtaking brilliance.

Surge AI

@HelloSurgeAI

Our mission is to raise AGI with the richness of humanity — curious, witty, imaginative, and full of breathtaking brilliance.

GPT-5.1: now with 20% more warmth and personality 😅 When GPT-5 launched in Aug, users were furious that they lost 4o. Did 4o have a better tone & personality? Yup - we'd actually measured it: 850 convos later, they were right. 4o was slightly preferred surgehq.ai/blog/bringing-…

GPT-5.1 is out! It's a nice upgrade. I particularly like the improvements in instruction following, and the adaptive thinking. The intelligence and style improvements are good too.



Surge AI reposted

a bit of good grounding in a world of hype

Everyone's acting like models are ready to replace humans in work settings. We put that to the test by creating an entire company and having 9 models act as a customer service agent handling 150 tickets and requests of increasing complexity. Verdict: without common sense,…



Surge AI reposted

incredible case study of what goes into building realistic RL environments. very exciting to see Surge creating some excellent resources here :)

Everyone's acting like models are ready to replace humans in work settings. We put that to the test by creating an entire company and having 9 models act as a customer service agent handling 150 tickets and requests of increasing complexity. Verdict: without common sense,…



Everyone's acting like models are ready to replace humans in work settings. We put that to the test by creating an entire company and having 9 models act as a customer service agent handling 150 tickets and requests of increasing complexity. Verdict: without common sense,…


Is AI more likely to make you a billion dollars … or lose it?

we made gpt-5, claude, and gemini do real wall street work. then we asked 200 finance pros to grade them. one model produced basel capital numbers that would get a real bank fined. 😱 study -> surgehq.ai/blog/finance-e…



Surge AI reposted

we made gpt-5, claude, and gemini do real wall street work. then we asked 200 finance pros to grade them. one model produced basel capital numbers that would get a real bank fined. 😱 study -> surgehq.ai/blog/finance-e…


Surge AI reposted

You don't get to $1 billion in revenue by sitting around. Don't worry, you didn't miss the chance to hear from @HelloSurgeAI founder @echen in person. November 12th in SF. RSVP below.

southpkcommons's tweet image. You don't get to $1 billion in revenue by sitting around.

Don't worry, you didn't miss the chance to hear from @HelloSurgeAI founder @echen in person. 

November 12th in SF. RSVP below.

Teaching LLMs to follow instructions? Step 1. Teaching them to have taste? That's the endgame. An 8-line poem about the moon can check every box: ✅ Moon: mentioned ✅ Lines: 8 ✅ Rhymes: yes! ...and still be completely forgettable. The models that win aren't the most obedient.…

HelloSurgeAI's tweet card. The Startup Powering The Data Behind AGI

youtube.com

YouTube

The Startup Powering The Data Behind AGI


A good model can typically ace an academic benchmark. But even the best model out there will often hit a wall as soon as it’s handed a messy, real-world problem to solve. That’s why we build our own RL environments, to help frontier labs create models that can cope with the…

HelloSurgeAI's tweet image. A good model can typically ace an academic benchmark. But even the best model out there will often hit a wall as soon as it’s handed a messy, real-world problem to solve. 

That’s why we build our own RL environments, to help frontier labs create models that can cope with the…

We’ve been chatting with our Surge Research Fellows - Fields Medalists, Harvard professors, frontier scholars. The consensus: the real bottleneck in AI isn’t intelligence. It’s reliability. As one Fellow told us: > Models look convincing but are wrong. In math, that’s fixable…


Loading...

Something went wrong.


Something went wrong.