Aditya Gupta

@ag_i_2211

@xAI

San Francisco, CA

Inscrit en Décembre 2013

273Posts 7KAbonnés 308Abonnements

Vous pourriez aimer

@tanyaagoyal

@ssgrn

@aviaviavi__

@ysu_nlp

@fredahshi

@sarahwiegreffe

@rachelrudinger

@byryuer

@scottyih

@chrome1996

@alan_ritter

@dipanjand

@feiliu_nlp

@bhuwandhingra

@mandarjoshi_

Elon Musk

@elonmusk

7 oct.

Great products coming from @xAI!

Aditya Gupta a reposté

MathArena Update: Claims about Grok 4 Fast seem to check out, it matches the performance of Grok 4 but is much faster and 20-50x cheaper. Good release! This holds across final-answer competitions, Apex problems, and Project Euler. 🧵

ni_jovanovic's tweet image. MathArena Update: Claims about Grok 4 Fast seem to check out, it matches the performance of Grok 4 but is much faster and 20-50x cheaper. Good release!

This holds across final-answer competitions, Apex problems, and Project Euler. 🧵

Aditya Gupta a reposté

Ying Sheng

@ying11231

22 sept.

Deterministic inference, here you are. True on-policy RL is on the way. Although we are mostly using off-policy, having a deterministic mode will make many things easier!

LMSYS Org

@lmsysorg

22 sept.

SGLang now supports deterministic LLM inference! Building on @thinkymachines batch-invariant kernels, we integrated deterministic attention & sampling ops into a high-throughput engine - fully compatible with chunked prefill, CUDA graphs, radix cache, and non-greedy sampling. ✅…

lmsysorg's tweet image. SGLang now supports deterministic LLM inference! Building on @thinkymachines batch-invariant kernels, we integrated deterministic attention &amp; sampling ops into a high-throughput engine - fully compatible with chunked prefill, CUDA graphs, radix cache, and non-greedy sampling.

✅…

Aditya Gupta a reposté

Yuhuai (Tony) Wu

@Yuhu_ai_

23 sept.

Hiring for a new team building computer control agents. Join us to build Grok5 / macrohard later this year. DM me! Will send out a job post soon too.

Aditya Gupta

@ag_i_2211

21 sept.

📈

Aditya Gupta

@ag_i_2211

20 sept.

~10x engineer~ 25x model.

Artificial Analysis

@ArtificialAnlys

19 sept.

xAI has released Grok 4 Fast - breaking through our intelligence vs cost frontier by achieving Gemini 2.5 Pro level intelligence at a ~25X cheaper cost Intelligence: @xai shared with us pre-release access to Grok 4 Fast. In reasoning mode, the model scores an impressive 60 on…

ArtificialAnlys's tweet image. xAI has released Grok 4 Fast - breaking through our intelligence vs cost frontier by achieving Gemini 2.5 Pro level intelligence at a ~25X cheaper cost

Intelligence: @xai shared with us pre-release access to Grok 4 Fast. In reasoning mode, the model scores an impressive 60 on…

Aditya Gupta

@ag_i_2211

20 sept.

job-boards.greenhouse.io/xai/jobs/47992…

Yuhuai (Tony) Wu

@Yuhu_ai_

19 sept.

Grok4 Fast won at Search Arena! Showing how strong a lightweight model can be when it has good tools. Also, it is #8 on the standard Text Arena, best of its class, much ahead of Gemini-2.5-Flash #17 or O4-mini #25. The post-training team really cooked!

Aditya Gupta a reposté

Yuhuai (Tony) Wu

@Yuhu_ai_

19 sept.

lmarena.ai

@arena

19 sept.

🚨 Leaderboard Disrupted! Grok-4-fast by @xAI has arrived in the Arena, and it’s shaking things up! ⚡️ 🏆 #1 on the Search Leaderboard Tested under the codename “menlo,” Grok-4-fast-search just rocketed to the top spot with the community. 💠 Tied for #8 on the Text Leaderboard…

arena's tweet image. 🚨 Leaderboard Disrupted!

Grok-4-fast by @xAI has arrived in the Arena, and it’s shaking things up! ⚡️

🏆 #1 on the Search Leaderboard
Tested under the codename “menlo,” Grok-4-fast-search just rocketed to the top spot with the community.
💠 Tied for #8 on the Text Leaderboard…

Aditya Gupta

@ag_i_2211

20 sept.

that's one way we welcome our new team members ;)

Dustin Tran

@dustinvtran

19 sept.

A cheekier version

Aditya Gupta a reposté

Dustin Tran

@dustinvtran

19 sept.

A cheekier version

Yuhuai (Tony) Wu

@Yuhu_ai_

19 sept.

Grok4 Fast maximizing intelligence density.

Aditya Gupta a reposté

Artificial Analysis

@ArtificialAnlys

19 sept.

Aditya Gupta

@ag_i_2211

20 sept.

excited to share the early glimpse of our reasoning post-training! maximizing intelligence density and usability.

lmarena.ai

@arena

19 sept.

Aditya Gupta a reposté

lmarena.ai

@arena

19 sept.

xAI

@xai

19 sept.

Introducing Grok 4 Fast, a multimodal reasoning model with a 2M context window that sets a new standard for cost-efficient intelligence. Available for free on grok.com, grok.x.com, iOS and Android apps, and OpenRouter. x.ai/news/grok-4-fa…

xai's tweet card. Pushing the Frontier of Cost-Efficient Intelligence

Grok 4 Fast | xAI

Source: x.ai

Aditya Gupta

@ag_i_2211

19 sept.

come play and let the models play with jax + sglang + GB200/300s.

Szymon Tworkowski

@s_tworkowski

18 sept.

Come join us to work on 🔢🧮 for 🔄! x.com/i/jobs/1968405…

Aditya Gupta

@ag_i_2211

19 sept.

expert.

liang hu

@lianghu349103

18 sept.

1/7 We release FinSearchComp, the first expert-level benchmark for financial search & reasoning — 639 questions from 70+ finance pros. #Grok4 ranked #1 🏆 and close to human experts, GPT-5 the second, while others fail at basic analyst tasks. Page: randomtutu.github.io/FinSearchComp/

lianghu349103's tweet image. 1/7 We release FinSearchComp, the first expert-level benchmark for financial search &amp; reasoning — 639 questions from 70+ finance pros. #Grok4 ranked #1 🏆 and close to human experts, GPT-5 the second, while others fail at basic analyst tasks.

Page: randomtutu.github.io/FinSearchComp/

Aditya Gupta a reposté

Jeremy Berman

@jerber888

17 sept.

I chose Grok 4 for my ARC-AGI solution because it had the most logical consistency when thinking for long periods of time. You can feel the RL that went into it

Elon Musk

@elonmusk

17 sept.

Grok 5 starts training in a few weeks

Aditya Gupta a reposté

ARC Prize

@arcprize

16 sept.

New SOTA on ARC-AGI - V1: 79.6%, $8.42/task - V2: 29.4%, $30.40/task Custom submissions by @jerber888 and @_eric_pang_ are now the best known solutions to ARC-AGI Both: * Are open source * Use Grok 4 * Implement program-synthesis outer loops with test-time adaptation

arcprize's tweet image. New SOTA on ARC-AGI

- V1: 79.6%, $8.42/task
- V2: 29.4%, $30.40/task

Custom submissions by @jerber888 and @_eric_pang_ are now the best known solutions to ARC-AGI

Both:
* Are open source
* Use Grok 4
* Implement program-synthesis outer loops with test-time adaptation

Aditya Gupta a reposté

Aditya Paliwal

@VastoLorde95

13 sept.

Imagine if we built a truly intelligent recommendation system that surfaces content with healthy discussion, letting users discover the truth. We could break the echo chambers and curb the propaganda.