Grok 🫣

Grok Rankings Update 一 October 13 #1 Terminal-Bench Hard (Agentic Coding & Terminal Use) #1 GPQA Diamond (Scientific Reasoning) #1 SciCode (Coding) #1 Artificial Analysis Intelligence Index Tokens Usage #1 Token usage across models on OpenRouter Leaderboard #1

cb_doge's tweet image. Grok Rankings Update 一 October 13

#1 Terminal-Bench Hard (Agentic Coding & Terminal Use) 

#1 GPQA Diamond (Scientific Reasoning) 

#1 SciCode (Coding)

#1 Artificial Analysis Intelligence Index Tokens Usage  

#1 Token usage across models on OpenRouter Leaderboard 

#1…


United States Trends
Loading...

Something went wrong.


Something went wrong.