dchaplot's profile picture. Building next-gen AI at @thinkymachines. Past: Founding team @MistralAI, RS at Facebook AI Research. Ph.D. @SCSatCMU, BTech @iitbombay CS.

Devendra Chaplot

@dchaplot

Building next-gen AI at @thinkymachines. Past: Founding team @MistralAI, RS at Facebook AI Research. Ph.D. @SCSatCMU, BTech @iitbombay CS.

置頂

Announcing our first product: Tinker! Tinker is a training API for everyone! It lets you focus on what matters in LLM training - your data and algorithms - while we handle the heavy lifting of distributed training. You can train your own models using Tinker even if you have no…

Introducing Tinker: a flexible API for fine-tuning language models. Write training loops in Python on your laptop; we'll run them on distributed GPUs. Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!…

thinkymachines's tweet image. Introducing Tinker: a flexible API for fine-tuning language models.

Write training loops in Python on your laptop; we'll run them on distributed GPUs.

Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!…


Devendra Chaplot 已轉發

LoRA makes fine-tuning more accessible, but it's unclear how it compares to full fine-tuning. We find that the performance often matches closely---more often than you might expect. In our latest Connectionism post, we share our experimental results and recommendations for LoRA.…

thinkymachines's tweet image. LoRA makes fine-tuning more accessible, but it's unclear how it compares to full fine-tuning. We find that the performance often matches closely---more often than you might expect. In our latest Connectionism post, we share our experimental results and recommendations for LoRA.…

Devendra Chaplot 已轉發

Efficient training of neural networks is difficult. Our second Connectionism post introduces Modular Manifolds, a theoretical step toward more stable and performant training by co-designing neural net optimizers with manifold constraints on weight matrices.…

thinkymachines's tweet image. Efficient training of neural networks is difficult. Our second Connectionism post introduces Modular Manifolds, a theoretical step toward more stable and performant training by co-designing neural net optimizers with manifold constraints on weight matrices.…

Proud to invest in Emergent, backing Mukund and Madhav - few founders make building look this fun and easy. 0 -> $15M ARR in 3 months 1M+ users 40K apps built every day!

We raised $23M Series A @EmergentLabsHQ , everyone celebrates that. But we aren't Here's what we truly want to celebrate



Devendra Chaplot 已轉發

Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference” We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to…

thinkymachines's tweet image. Today Thinking Machines Lab is launching our research blog, Connectionism. Our first blog post is “Defeating Nondeterminism in LLM Inference”

We believe that science is better when shared. Connectionism will cover topics as varied as our research is: from kernel numerics to…

Devendra Chaplot 已轉發

Modern AI is confined to the digital world. At Skild AI, we are building towards AGI for the real world, unconstrained by robot type or task — a single, omni-bodied brain. Today, we are sharing our journey, starting with early milestones, with more to come in the weeks ahead.…


Devendra Chaplot 已轉發

Introducing the world's best (and open) speech recognition models!

MistralAI's tweet image. Introducing the world's best (and open) speech recognition models!

Devendra Chaplot 已轉發

Thinking Machines Lab exists to empower humanity through advancing collaborative general intelligence. We're building multimodal AI that works with how you naturally interact with the world - through conversation, through sight, through the messy way we collaborate. We're…


Devendra Chaplot 已轉發

Today, We’re launching Genesis AI — a global physical AI lab and full-stack robotics company — to build generalist robots and unlock unlimited physical labor. We’re backed by $105M in seed funding from @EclipseVentures, @khoslaventures, @Bpifrance, HSG, and visionaries…

gs_ai_'s tweet image. Today, We’re launching Genesis AI — a global physical AI lab and full-stack robotics company — to build generalist robots and unlock unlimited physical labor. 

We’re backed by $105M in seed funding from @EclipseVentures, @khoslaventures, @Bpifrance, HSG, and visionaries…

Devendra Chaplot 已轉發

Thinking Machines is hosting a happy hour in Singapore during #ICLR2025 on Friday, April 25: lu.ma/ecgmuhmx Come eat, drink, and learn more about us!


Devendra Chaplot 已轉發

Introducing Mistral Small 3.1. Multimodal, Apache 2.0, outperforms Gemma 3 and GPT 4o-mini. mistral.ai/news/mistral-s…

MistralAI's tweet image. Introducing Mistral Small 3.1. 
Multimodal, Apache 2.0, outperforms Gemma 3 and GPT 4o-mini.
mistral.ai/news/mistral-s…

Devendra Chaplot 已轉發

Introducing the world's best OCR model! mistral.ai/news/mistral-o…


Devendra Chaplot 已轉發

Today, we are excited to announce Thinking Machines Lab (thinkingmachines.ai), an artificial intelligence research and product company. We are scientists, engineers, and builders behind some of the most widely used AI products and libraries, including ChatGPT,…


Career Update: Incredibly fortunate and excited to be part of the founding team at Thinking Machines Lab! thinkingmachines.ai Join us: 6wajk07p.paperform.co

dchaplot's tweet image. Career Update: Incredibly fortunate and excited to be part of the founding team at Thinking Machines Lab!

thinkingmachines.ai

Join us: 
6wajk07p.paperform.co

Career update: After an incredible journey at Mistral AI, I made the hard decision to leave and pursue another exciting opportunity. Will share more details very soon! Very proud of the Mistral team and their accomplishments, I wish them continued success!


Devendra Chaplot 已轉發

Everyone says Europe can't compete with America in tech. But 48 hours ago, Mistral's 'Le Chat' just proved them wrong: • 13x faster than ChatGPT • 100% open-source • Completely free (vs $20/month) The European AI breakthrough Silicon Valley didn't see coming 🧵:

itsolelehmann's tweet image. Everyone says Europe can't compete with America in tech.

But 48 hours ago, Mistral's 'Le Chat' just proved them wrong:

• 13x faster than ChatGPT
• 100% open-source
• Completely free (vs $20/month)

The European AI breakthrough Silicon Valley didn't see coming 🧵:
itsolelehmann's tweet image. Everyone says Europe can't compete with America in tech.

But 48 hours ago, Mistral's 'Le Chat' just proved them wrong:

• 13x faster than ChatGPT
• 100% open-source
• Completely free (vs $20/month)

The European AI breakthrough Silicon Valley didn't see coming 🧵:

Devendra Chaplot 已轉發

No this video is not sped up, genuinely mind blowing And yes this is available to all users right now.

Introducing the all new Le Chat: your ultimate AI sidekick for life and work! Now live on web and mobile!



Devendra Chaplot 已轉發

Le Chat is fast (1,100 tok/s for flash queries on an updated Mistral Large). Download it at mistral.ai/app/android or mistral.ai/app/ios


Loading...

Something went wrong.


Something went wrong.