
Aditya Gupta

@ag_i_2211

@xAI

🚀

Great products coming from @xAI!



Aditya Gupta reposted

MathArena Update: Claims about Grok 4 Fast seem to check out, it matches the performance of Grok 4 but is much faster and 20-50x cheaper. Good release! This holds across final-answer competitions, Apex problems, and Project Euler. 🧵


Aditya Gupta reposted

Deterministic inference, here you are. True on-policy RL is on the way. Although we are mostly using off-policy, having a deterministic mode will make many things easier!

SGLang now supports deterministic LLM inference! Building on @thinkymachines batch-invariant kernels, we integrated deterministic attention & sampling ops into a high-throughput engine - fully compatible with chunked prefill, CUDA graphs, radix cache, and non-greedy sampling. ✅…



Aditya Gupta reposted

Hiring for a new team building computer control agents. Join us to build Grok5 / macrohard later this year. DM me! Will send out a job post soon too.


~10x engineer~ 25x model.

xAI has released Grok 4 Fast - breaking through our intelligence vs cost frontier by achieving Gemini 2.5 Pro level intelligence at a ~25X cheaper cost. Intelligence: @xai shared with us pre-release access to Grok 4 Fast. In reasoning mode, the model scores an impressive 60 on…






Aditya Gupta reposted

Grok4 Fast won at Search Arena! Showing how strong a lightweight model can be when it has good tools. Also, it is #8 on the standard Text Arena, best of its class, much ahead of Gemini-2.5-Flash #17 or O4-mini #25. The post-training team really cooked!

🚨 Leaderboard Disrupted! Grok-4-fast by @xAI has arrived in the Arena, and it’s shaking things up! ⚡️ 🏆 #1 on the Search Leaderboard Tested under the codename “menlo,” Grok-4-fast-search just rocketed to the top spot with the community. 💠 Tied for #8 on the Text Leaderboard…



that's one way we welcome our new team members ;)

A cheekier version



Aditya Gupta reposted

A cheekier version


Grok4 Fast maximizing intelligence density.



Aditya Gupta reposted

xAI has released Grok 4 Fast - breaking through our intelligence vs cost frontier by achieving Gemini 2.5 Pro level intelligence at a ~25X cheaper cost. Intelligence: @xai shared with us pre-release access to Grok 4 Fast. In reasoning mode, the model scores an impressive 60 on…


excited to share the early glimpse of our reasoning post-training! maximizing intelligence density and usability.

🚨 Leaderboard Disrupted! Grok-4-fast by @xAI has arrived in the Arena, and it’s shaking things up! ⚡️ 🏆 #1 on the Search Leaderboard Tested under the codename “menlo,” Grok-4-fast-search just rocketed to the top spot with the community. 💠 Tied for #8 on the Text Leaderboard…



Aditya Gupta reposted

🚨 Leaderboard Disrupted! Grok-4-fast by @xAI has arrived in the Arena, and it’s shaking things up! ⚡️ 🏆 #1 on the Search Leaderboard Tested under the codename “menlo,” Grok-4-fast-search just rocketed to the top spot with the community. 💠 Tied for #8 on the Text Leaderboard…


Introducing Grok 4 Fast, a multimodal reasoning model with a 2M context window that sets a new standard for cost-efficient intelligence. Available for free on grok.com, grok.x.com, iOS and Android apps, and OpenRouter. x.ai/news/grok-4-fa…



come play and let the models play with jax + sglang + GB200/300s.

Come join us to work on 🔢🧮 for 🔄! x.com/i/jobs/1968405…



expert.

1/7 We release FinSearchComp, the first expert-level benchmark for financial search & reasoning — 639 questions from 70+ finance pros. #Grok4 ranked #1 🏆 and close to human experts, GPT-5 the second, while others fail at basic analyst tasks. Page: randomtutu.github.io/FinSearchComp/



Aditya Gupta reposted

I chose Grok 4 for my ARC-AGI solution because it had the most logical consistency when thinking for long periods of time. You can feel the RL that went into it

Grok 5 starts training in a few weeks



Aditya Gupta reposted

New SOTA on ARC-AGI

- V1: 79.6%, $8.42/task
- V2: 29.4%, $30.40/task

Custom submissions by @jerber888 and @_eric_pang_ are now the best known solutions to ARC-AGI

Both:
* Are open source
* Use Grok 4
* Implement program-synthesis outer loops with test-time adaptation

Aditya Gupta reposted

Imagine if we built a truly intelligent recommendation system that surfaces content with healthy discussion, letting users discover the truth. We could break the echo chambers and curb the propaganda.


Aditya Gupta reposted

Grok Code is now seeing more usage on OpenRouter than all others combined

