dchaplot's profile picture. Founding team @thinkymachines. Past: Founding team @MistralAI, RS at Facebook AI Research. Ph.D. @SCSatCMU, BTech @iitbombay CS.

Devendra Chaplot

@dchaplot

Founding team @thinkymachines. Past: Founding team @MistralAI, RS at Facebook AI Research. Ph.D. @SCSatCMU, BTech @iitbombay CS.

Закреплено

Announcing our first product: Tinker! Tinker is a training API for everyone! It lets you focus on what matters in LLM training - your data and algorithms - while we handle the heavy lifting of distributed training. You can train your own models using Tinker even if you have no…

Introducing Tinker: a flexible API for fine-tuning language models. Write training loops in Python on your laptop; we'll run them on distributed GPUs. Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!…

thinkymachines's tweet image. Introducing Tinker: a flexible API for fine-tuning language models.

Write training loops in Python on your laptop; we'll run them on distributed GPUs.

Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!…


Many of us from @thinkymachines are at NeurIPS this week. Would love to chat with people interested in joining us or using Tinker. We are also giving away free Tinker credits! Open roles: job-boards.greenhouse.io/thinkingmachin… Signup for Tinker: thinkingmachines.ai/tinker/


Devendra Chaplot сделал(а) репост

Nous Research proudly presents tinker-atropos: a seamless integration layer between the Thinking Machines Tinker API and our Atropos RL framework. Run scalable RL training on @thinkymachines managed clusters, no GPUs required! Full post by @yaboilyrical: nousresearch.com/tinker-atropos…

NousResearch's tweet image. Nous Research proudly presents tinker-atropos: a seamless integration layer between the Thinking Machines Tinker API and our Atropos RL framework.

Run scalable RL training on @thinkymachines managed clusters, no GPUs required!

Full post by @yaboilyrical: nousresearch.com/tinker-atropos…

Devendra Chaplot сделал(а) репост

Science is best shared! Tell us about what you’ve built or discovered with Tinker, so we can tell the world about it on our blog. More details at thinkingmachines.ai/blog/call-for-…


Devendra Chaplot сделал(а) репост

We got early access to @thinkymachines’ Tinker and have been experimenting with it this week at Ramp Labs. We used it to compare RL post training performance between an ensemble of domain specific models vs. a singular model. It handled infra, model updating, and GPUs smoothly…


Devendra Chaplot сделал(а) репост

Today we’re announcing research and teaching grants for Tinker: credits for scholars and students to fine-tune and experiment with open-weight LLMs. Read more and apply at: thinkingmachines.ai/blog/tinker-re…


Devendra Chaplot сделал(а) репост

Roadmap update: Tinker launched into private beta a month ago, and we've seen hundreds of builders and researchers train and experiment with models on our platform. This month we've added new models, expanded the cookbook, and improved overall capacity and performance.

We just added 4 new models to Tinker from the gpt-oss and DeepSeek-V3.1 families. Sign up for the waitlist: thinkingmachines.ai/tinker/

thinkymachines's tweet image. We just added 4 new models to Tinker from the gpt-oss and DeepSeek-V3.1 families.

Sign up for the waitlist:
thinkingmachines.ai/tinker/


Today we’re excited to add gpt-oss and DeepSeek model families to Tinker - one of our top community requests. With Tinker, you can train a 671B parameter model on your laptop in just a few lines of code. No GPU rentals. No CUDA. No cluster setup. Just train.

We just added 4 new models to Tinker from the gpt-oss and DeepSeek-V3.1 families. Sign up for the waitlist: thinkingmachines.ai/tinker/

thinkymachines's tweet image. We just added 4 new models to Tinker from the gpt-oss and DeepSeek-V3.1 families.

Sign up for the waitlist:
thinkingmachines.ai/tinker/


Devendra Chaplot сделал(а) репост

Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training it for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other…

thinkymachines's tweet image. Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training it for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other…

Devendra Chaplot сделал(а) репост

LoRA makes fine-tuning more accessible, but it's unclear how it compares to full fine-tuning. We find that the performance often matches closely---more often than you might expect. In our latest Connectionism post, we share our experimental results and recommendations for LoRA.…

thinkymachines's tweet image. LoRA makes fine-tuning more accessible, but it's unclear how it compares to full fine-tuning. We find that the performance often matches closely---more often than you might expect. In our latest Connectionism post, we share our experimental results and recommendations for LoRA.…

Devendra Chaplot сделал(а) репост

Efficient training of neural networks is difficult. Our second Connectionism post introduces Modular Manifolds, a theoretical step toward more stable and performant training by co-designing neural net optimizers with manifold constraints on weight matrices.…

thinkymachines's tweet image. Efficient training of neural networks is difficult. Our second Connectionism post introduces Modular Manifolds, a theoretical step toward more stable and performant training by co-designing neural net optimizers with manifold constraints on weight matrices.…

Proud to invest in Emergent, backing Mukund and Madhav - few founders make building look this fun and easy. 0 -> $15M ARR in 3 months 1M+ users 40K apps built every day!

We raised $23M Series A @EmergentLabsHQ , everyone celebrates that. But we aren't Here's what we truly want to celebrate



Devendra Chaplot сделал(а) репост

Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference” We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to…

thinkymachines's tweet image. Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference”

We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to…

Devendra Chaplot сделал(а) репост

Modern AI is confined to the digital world. At Skild AI, we are building towards AGI for the real world, unconstrained by robot type or task — a single, omni-bodied brain. Today, we are sharing our journey, starting with early milestones, with more to come in the weeks ahead.…


Devendra Chaplot сделал(а) репост

Introducing the world's best (and open) speech recognition models!

MistralAI's tweet image. Introducing the world's best (and open) speech recognition models!

Devendra Chaplot сделал(а) репост

Thinking Machines Lab exists to empower humanity through advancing collaborative general intelligence. We're building multimodal AI that works with how you naturally interact with the world - through conversation, through sight, through the messy way we collaborate. We're…


Devendra Chaplot сделал(а) репост

Today, We’re launching Genesis AI — a global physical AI lab and full-stack robotics company — to build generalist robots and unlock unlimited physical labor. We’re backed by $105M in seed funding from @EclipseVentures, @khoslaventures, @Bpifrance, HSG, and visionaries…

gs_ai_'s tweet image. Today, We’re launching Genesis AI — a global physical AI lab and full-stack robotics company — to build generalist robots and unlock unlimited physical labor. 

We’re backed by $105M in seed funding from @EclipseVentures, @khoslaventures, @Bpifrance, HSG, and visionaries…

Loading...

Something went wrong.


Something went wrong.