Gaurav

@gauravisnotme

Good model @xAI | prev. d-matrix, @Google. I was pre-trained like this. Still figuring out the framework for my fine-tuning. and, opinions my own. Please.

Joined February 2024

447 Posts 3K Followers 615 Following

Gaurav

@gauravisnotme

9 h

We are now in the era where salary scaling laws are prevailing over model scaling. We are already past Llama-3.2-3B.

Meghan Bobrowsky

@MeghanBobrowsky

11 h

Saturday scoop: Thinking Machines Lab co-founder Andrew Tulloch has joined Meta, the startup confirmed. W/ @keachhagey

How and why do startups, with mere pre-seed or seed funding, spend tens of thousands of dollars to give free cocktails to a bunch of nerds? What kind of marketing strategy is this and has this ever worked?

Gaurav reposted

Eric Jiang

@veggie_eric

Oct 10

tesla kart 🚀 how is Grok Imagine free

Gaurav reposted

Radical Numerics

@RadicalNumerics

Oct 9

Introducing RND1, the most powerful base diffusion language model (DLM) to date. RND1 (Radical Numerics Diffusion) is an experimental DLM with 30B params (3B active) with a sparse MoE architecture. We are making it open source, releasing weights, training details, and code to…

Gaurav

@gauravisnotme

Oct 10

Given that I haven't worked with JAX very extensively in the past, this is a huge "Today I Learned" moment for me. But let's see how many get it. How much memory do you think this simple statement reserves on the underlying GPU device?

Gaurav

@gauravisnotme

Oct 9

A curious thing I observed regarding Waymo's Zeekr fleet is that they always have a human driver in control of the steering wheel. Which makes me wonder - is the autonomous driver that is trained and being served on the current Jaguar fleet not transferable to a…

Gaurav reposted

AshutoshShrivastava

@ai_for_success

Oct 9

The Sora 2 app now has an average rating of 2.9. What’s the reason?

Gaurav reposted

Eric Jiang

@veggie_eric

Oct 9

Our new Grok Imagine is a step function better than the original. Genuinely kind of mind-blowing. And it’s free for everyone! No invite codes needed 😄

Gaurav

@gauravisnotme

Oct 8

First PyTorch and now React. This is a really good arc where they incubate a stack internally while actively pushing upstream and, once mature, hand it over to none other than the lords of open source, the Linux Foundation. Allows for a lot more neutral governance and more…

Engineering at Meta

@fb_engineering

Oct 8

Over 10 years ago, we open-sourced React. And now, we’re excited to announce the next chapter: React & React Native are transitioning to the React Foundation under the Linux Foundation. Meta is committing $3M+ and a 5-year partnership to support this next chapter of innovation.…

Gaurav reposted

Visual Studio

@VisualStudio

Oct 7

Big news for developers! Grok Code Fast 1 is now available in Visual Studio. This advanced AI model brings smarter, faster coding assistance right into your favorite IDEs via GitHub Copilot Chat. Available in public preview for Copilot Pro, Pro+, Business, and Enterprise plans,…

Gaurav reposted

Monique Pintarelli

@MoniquePintarel

Oct 7

The Grid meets reality in this first-of-its-kind partnership between @X, @xai, @Tesla, and @WaltDisneyCo, live on X and the TRON: ARES red carpet. The TRON: ARES universe comes alive through an immersive digital world powered by cutting-edge xAI tech, Tesla's Optimus robots, and…

Business

@XBusiness

Oct 7

x.com/i/article/1975…

Gaurav reposted

Elon Musk

@elonmusk

Oct 7

Optimus at the Tron premiere

Tesla Optimus

@Tesla_Optimus

Oct 7

Tried to start a fight at the Tron: Ares premiere

Gaurav

@gauravisnotme

Oct 6

😂😂😂 Before curing cancer, we need to cure delusion.

Vin Sachidananda

@vin_sachi

Oct 6

There is no inference moat. Hasn’t been since 2023, with model compilation from torch 2.0 and consolidation to transformers from DiT. Nvidia loses inference market long term on batch to lower TCO (AMD) and real-time (TPU, ASICs).

Gaurav

@gauravisnotme

Oct 6

A quick glance through the SF Tech week event calendar and you see "Yoga for founders", "swimming with founders", "soulcycle with founders and builders", and so on. Now I have the answer to why SF needs a tech week. They are all trying to find people to go to the gym with.

Gaurav

@gauravisnotme

Oct 5

Here's a sneak peek into my chat with @grok today. It brings all pieces of information, including the visuals, right in the chat. Further, I can just select parts from the chat and ask follow-up questions, branch off to tangential topics and much more. P.S.: Grok strongly…

Gaurav

@gauravisnotme

Oct 4

Can someone tell @FIFAcom to get their house in order? They send me one email, just one email about me getting a time slot allocated in the @Visa presale draw and that's all. NO TIME SLOT information on the website, NO INFORMATION in the email, NO CUSTOMER SUPPORT to answer any…

Gaurav

@gauravisnotme

Oct 4

Folks who were online during the browser wars - What made people switch from Internet Explorer and Netscape to Google Chrome? What convinced Windows-heavy users to say, well, we have a new browser that is not bad, let's give it a try? Do you think a similar thing could be…

Gaurav

@gauravisnotme

Oct 3

Grok Code Fast is right up there. Higher diff edit success rate than Claude 4.5 and GPT-5 Codex and much much cheaper if I may add. Try it out, don't stop using it and keep sending all that feedback our way. It's only going to get better.

Nick

@nickbaumann_

Oct 2

so we analyzed millions of diff edits from cline users and apparently GLM-4.6 hits 94.9% success rate vs claude 4.5's 96.2%. to be clear, diff edits are not the end-all-be-all metric for coding agents. but what's interesting is three months ago this gap was 5-10 points. open…

Gaurav

@gauravisnotme

Oct 3

Very interesting proposition. Variants similar to this have existed but this seems to be way more comprehensive since it offers distributed support from the start. Wondering what's the next obvious step after fine-tuning - RL Training and Environments? And if that's the case,…

Thinking Machines

@thinkymachines

Oct 1

Introducing Tinker: a flexible API for fine-tuning language models. Write training loops in Python on your laptop; we'll run them on distributed GPUs. Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!…