Devendra Chaplot

@dchaplot

Founding team @thinkymachines. Past: Founding team @MistralAI, RS at Facebook AI Research. Ph.D. @SCSatCMU, BTech @iitbombay CS.

SF Bay Area

devendrachaplot.github.io

Joined April 2010

659Posts 14KFollowers 450Following

You might like

@silviocinguetta

@brandondamos

@DhruvBatra_

@pathak2206

@coreylynch

@TheGregYang

@FidlerSanja

@marcgbellemare

@ShamKakade6

@animesh_garg

@j_foerst

@pulkitology

@zicokolter

@StefanoErmon

@bneyshabur

Pinned

Devendra Chaplot

@dchaplot

Oct 1

Announcing our first product: Tinker! Tinker is a training API for everyone! It lets you focus on what matters in LLM training - your data and algorithms - while we handle the heavy lifting of distributed training. You can train your own models using Tinker even if you have no…

Tinker

Source: thinkingmachines.ai

Thinking Machines

@thinkymachines

Oct 1

Introducing Tinker: a flexible API for fine-tuning language models. Write training loops in Python on your laptop; we'll run them on distributed GPUs. Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!…

thinkymachines's tweet image. Introducing Tinker: a flexible API for fine-tuning language models.

Write training loops in Python on your laptop; we'll run them on distributed GPUs.

Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!…

Devendra Chaplot reposted

Thinking Machines

@thinkymachines

17 h

Congratulations to @axiommathai on their achievement! AxiomProver, a mathematics model fine-tuned with Tinker, got top scores on the Putnam Math Competition.

Axiom

@axiommathai

Dec 7

Putnam, the world's hardest college-level math test, ended yesterday 4p PT. Noon today, AxiomProver solved 9/12 problems in Lean autonomously (3:58p PT yesterday, it was 8/12). Our score would've been #1 of ~4000 participants last year and Putnam Fellow (top 5) in recent years

Devendra Chaplot

@dchaplot

Dec 2

Many of us from @thinkymachines are at NeurIPS this week. Would love to chat with people interested in joining us or using Tinker. We are also giving away free Tinker credits! Open roles: job-boards.greenhouse.io/thinkingmachin… Signup for Tinker: thinkingmachines.ai/tinker/

Tinker

Source: thinkingmachines.ai

Devendra Chaplot reposted

Nous Research

@NousResearch

Nov 18

Nous Research proudly presents tinker-atropos: a seamless integration layer between the Thinking Machines Tinker API and our Atropos RL framework. Run scalable RL training on @thinkymachines managed clusters, no GPUs required! Full post by @yaboilyrical: nousresearch.com/tinker-atropos…

NousResearch's tweet image. Nous Research proudly presents tinker-atropos: a seamless integration layer between the Thinking Machines Tinker API and our Atropos RL framework.

Run scalable RL training on @thinkymachines managed clusters, no GPUs required!

Full post by @yaboilyrical: nousresearch.com/tinker-atropos…

Devendra Chaplot reposted

Thinking Machines

@thinkymachines

Nov 7

Science is best shared! Tell us about what you’ve built or discovered with Tinker, so we can tell the world about it on our blog. More details at thinkingmachines.ai/blog/call-for-…

thinkymachines's tweet card. Announcing Tinker Community Projects

Tinker: Call for Community Projects

Source: thinkingmachines.ai

Devendra Chaplot reposted

Ramp Labs

@RampLabs

Nov 3

We got early access to @thinkymachines’ Tinker and have been experimenting with it this week at Ramp Labs. We used it to compare RL post training performance between an ensemble of domain specific models vs. a singular model. It handled infra, model updating, and GPUs smoothly…

Ramp Labs

@RampLabs

Nov 3

x.com/i/article/1985…

Devendra Chaplot reposted

Thinking Machines

@thinkymachines

Oct 29

Today we’re announcing research and teaching grants for Tinker: credits for scholars and students to fine-tune and experiment with open-weight LLMs. Read more and apply at: thinkingmachines.ai/blog/tinker-re…

Devendra Chaplot reposted

Thinking Machines

@thinkymachines

Oct 29

Roadmap update: Tinker launched into private beta a month ago, and we've seen hundreds of builders and researchers train and experiment with models on our platform. This month we've added new models, expanded the cookbook, and improved overall capacity and performance.

Thinking Machines

@thinkymachines

Oct 28

We just added 4 new models to Tinker from the gpt-oss and DeepSeek-V3.1 families. Sign up for the waitlist: thinkingmachines.ai/tinker/

Devendra Chaplot

@dchaplot

Oct 28

Today we’re excited to add gpt-oss and DeepSeek model families to Tinker - one of our top community requests. With Tinker, you can train a 671B parameter model on your laptop in just a few lines of code. No GPU rentals. No CUDA. No cluster setup. Just train.

Thinking Machines

@thinkymachines

Oct 28

We just added 4 new models to Tinker from the gpt-oss and DeepSeek-V3.1 families. Sign up for the waitlist: thinkingmachines.ai/tinker/

Devendra Chaplot reposted

Thinking Machines

@thinkymachines

Oct 27

Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training it for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other…

thinkymachines's tweet image. Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training it for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other…

Devendra Chaplot reposted

Thinking Machines

@thinkymachines

Sep 29

LoRA makes fine-tuning more accessible, but it's unclear how it compares to full fine-tuning. We find that the performance often matches closely---more often than you might expect. In our latest Connectionism post, we share our experimental results and recommendations for LoRA.…

thinkymachines's tweet image. LoRA makes fine-tuning more accessible, but it's unclear how it compares to full fine-tuning. We find that the performance often matches closely---more often than you might expect. In our latest Connectionism post, we share our experimental results and recommendations for LoRA.…

Devendra Chaplot reposted

Thinking Machines

@thinkymachines

Sep 26

Efficient training of neural networks is difficult. Our second Connectionism post introduces Modular Manifolds, a theoretical step toward more stable and performant training by co-designing neural net optimizers with manifold constraints on weight matrices.…

thinkymachines's tweet image. Efficient training of neural networks is difficult. Our second Connectionism post introduces Modular Manifolds, a theoretical step toward more stable and performant training by co-designing neural net optimizers with manifold constraints on weight matrices.…

Devendra Chaplot

@dchaplot

Sep 24

Proud to invest in Emergent, backing Mukund and Madhav - few founders make building look this fun and easy. 0 -> $15M ARR in 3 months 1M+ users 40K apps built every day!

Mukund Jha

@mukundjha

Sep 24

We raised $23M Series A @EmergentLabsHQ , everyone celebrates that. But we aren't Here's what we truly want to celebrate

Devendra Chaplot reposted

Thinking Machines

@thinkymachines

Sep 10

Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference” We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to…

thinkymachines's tweet image. Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference”

We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to…

Devendra Chaplot reposted

Skild AI

@SkildAI

Jul 29

Modern AI is confined to the digital world. At Skild AI, we are building towards AGI for the real world, unconstrained by robot type or task — a single, omni-bodied brain. Today, we are sharing our journey, starting with early milestones, with more to come in the weeks ahead.…

Devendra Chaplot reposted

Mistral AI

@MistralAI

Jul 15

Introducing the world's best (and open) speech recognition models!

Devendra Chaplot reposted

Mira Murati

@miramurati

Jul 15

Thinking Machines Lab exists to empower humanity through advancing collaborative general intelligence. We're building multimodal AI that works with how you naturally interact with the world - through conversation, through sight, through the messy way we collaborate. We're…