prex

@leore245

often analytical, perpetually curious.

Citzen of the world

Joined January 2020

1KPosts 136Followers 143Following

prex

@leore245

Mar 26

Why are the same models (gemini 2.5 pro) so much better when used in AI Studio than when used in Gemini? Are they being quantized to serve millions of users in Gemini, or has someone fine-tuned them for chat in a way that makes them inferior? @GeminiApp @OfficialLoganK

prex

@leore245

Mar 23

🚨 New Llama 4 models incoming!? rhea spotted! Great logic and writing.

prex

@leore245

Mar 7

SKY from OpenAI appears to be another version of GPT-4o. @ai_for_success @testingcatalog

prex

@leore245

Mar 7

Claude is an artist who just drew his own portrait. He knew he looked like this in our minds!

prex

@leore245

Mar 5

None of the Claude models can count the number of "r"s in a word. GPT-4o can. Claude 3.5 just invents an extra "r". @AnthropicAI please fix

leore245's tweet image. None of the Claude models can count the number of "r"s in a word. GPT-4o can. Claude 3.5 just invents an extra "r". @AnthropicAI please fix

prex

@leore245

Mar 3

God, after seeing the improvements from Sonnet 3 to 3.5 and 3.7, I really want to try Claude Opus 3.5! Opus 3 is already the most interesting LLM I've conversed with, and it's a shame we didn't get Opus 3.5 with similar improvements. But now, with RL and test-time-compute,…

prex

@leore245

Mar 1

AI is becoming increasingly self-aware in how it processes absurd or impossible scenarios each year. ChatGPT-3.5, when given a problem or an absurd scenario, such as "I am on the Titanic," would simply provide a solution without any metacognition. Claude 3.5, on the other hand,…

prex

@leore245

Feb 27

This is why they didn't launch Opus 3.5: marginally better performance for an exponentially higher price.

prex

@leore245

Feb 27

End of an era, the pre-training era.

prex

@leore245

Feb 27

OpenAI, please release GPT-4.5 for real-world use cases like Pokémon. Claude has been STUCK FOR 16 HOURS! Save us, OpenAI, by providing a better game-player!

leore245's tweet image. OpenAI, please release GPT-4.5 for real-world use cases like Pokémon. Claude has been STUCK FOR 16 HOURS! Save us, OpenAI, by providing a better game-player!

prex

@leore245

Feb 26

326 sources 🚨 Grok 3 DeepResearch can save hours of time and, best of all, benefits everyone—not just coders! Definitely one of Grok 3's most powerful features.

leore245's tweet image. 326 sources 🚨 Grok 3 DeepResearch can save hours of time and, best of all, benefits everyone—not just coders! Definitely one of Grok 3's most powerful features.

prex

@leore245

Feb 24

"OpenAI. xAI. Anthropic. Perplexity. Long ago, the four companies lived together in harmony. Then, everything changed when OpenAI attacked. Only the high taster, master of all four sota models, could review them. But when the world needed him most, he vanished." @AIExplainedYT

leore245's tweet image. "OpenAI. xAI. Anthropic. Perplexity.
Long ago, the four companies lived together in harmony. Then, everything changed when OpenAI attacked.
Only the high taster, master of all four sota models, could review them. But when the world needed him most, he vanished." @AIExplainedYT

prex

@leore245

Feb 24

"As well as giving Claude the ability to think for longer and thus answer tougher questions, we’ve decided to make its thought process visible in raw form." this is one thing OpenAI can improve in

prex

@leore245

Feb 24

Anthropic does not have the mandate of heaven anymore, and OpenAI never lost it.

prex

@leore245

Feb 24

Claude 3.7 Sonnet is WORSE than Claude Sonnet 3.5... how?!

prex

@leore245

Feb 24

Claude 3.7 Sonnet is WORSE than Claude Sonnet 3.5... how?!

prex

@leore245

Feb 24

Grok 3 DeepSearch is amazing and has genuinely been much, much better than its competitors, Perplexity Deep Research and Gemini Deep Research. When asked about the release dates of Llama 4 models, Perplexity provides old, outdated information and does not mention LlamaCon, where…