
Shaojie Bai

@shaojieb

Doing AI at @thinkymachines. Previously GenAI+RLR @ Meta. CMU MLD. Twitter account for more than AI.

Post-training made (extremely) easy. Try it out!

Introducing Tinker: a flexible API for fine-tuning language models. Write training loops in Python on your laptop; we'll run them on distributed GPUs. Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!…
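The announcement is light on specifics, so here is a hypothetical sketch of the paradigm it describes: the training loop is ordinary Python on your laptop, and each heavy step is a call to a remote GPU service. Every name below (RemoteTrainer, forward_backward, optim_step, the model string) is an illustrative stand-in, not Tinker's actual API.

```python
import random

class RemoteTrainer:
    """Stand-in client that would proxy each training step to a GPU service."""
    def __init__(self, base_model: str):
        self.base_model = base_model

    def forward_backward(self, batch) -> float:
        # In a real service, this call would run the forward/backward pass on
        # distributed GPUs, accumulate gradients server-side, and return the loss.
        return random.random()  # dummy loss so the sketch runs locally

    def optim_step(self, lr: float) -> None:
        # In a real service, this would apply the accumulated gradients remotely.
        pass

trainer = RemoteTrainer(base_model="an-open-weights-llm")  # placeholder name
dataset = [["example batch 1"], ["example batch 2"]]
for epoch in range(2):
    for batch in dataset:
        loss = trainer.forward_backward(batch)
        trainer.optim_step(lr=1e-4)
    print(f"epoch {epoch}: last loss {loss:.3f}")
```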




Shaojie Bai reposted

LoRA makes fine-tuning more accessible, but it's unclear how it compares to full fine-tuning. We find that the performance often matches closely---more often than you might expect. In our latest Connectionism post, we share our experimental results and recommendations for LoRA.…
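For readers who haven't used it, a minimal LoRA layer in PyTorch makes the comparison concrete: the pretrained weight is frozen and only a low-rank update B @ A is trained, so the effective weight is W + (alpha/r) * B A. This follows the original LoRA paper's recipe, not necessarily the exact setup in the post.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # pretrained weights stay frozen
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))  # zero init: update starts at 0
        self.scale = alpha / r

    def forward(self, x):
        # Base output plus the scaled low-rank correction.
        return self.base(x) + self.scale * (x @ self.A.T) @ self.B.T

layer = LoRALinear(nn.Linear(768, 768), r=8)
out = layer(torch.randn(2, 768))  # only A and B receive gradients
```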


Shaojie Bai reposted

Efficient training of neural networks is difficult. Our second Connectionism post introduces Modular Manifolds, a theoretical step toward more stable and performant training by co-designing neural net optimizers with manifold constraints on weight matrices.…
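As a rough illustration of what a manifold constraint on a weight matrix can mean, the sketch below keeps a matrix on the Stiefel manifold (orthonormal columns) by retracting after every gradient step. This is generic projected-gradient intuition, not the optimizer construction from the post.

```python
import torch

def retract_to_stiefel(W: torch.Tensor) -> torch.Tensor:
    # Polar retraction: the nearest matrix with orthonormal columns is
    # U @ Vh from the (thin) SVD of W.
    U, _, Vh = torch.linalg.svd(W, full_matrices=False)
    return U @ Vh

W = retract_to_stiefel(torch.randn(256, 128))  # start on the manifold
for _ in range(10):
    grad = torch.randn_like(W)                 # stand-in for a real gradient
    W = retract_to_stiefel(W - 0.01 * grad)    # step, then retract

print(torch.dist(W.T @ W, torch.eye(128)))     # ~0: columns stay orthonormal
```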


Shaojie Bai reposted

Thinking Machines Lab exists to empower humanity through advancing collaborative general intelligence. We're building multimodal AI that works with how you naturally interact with the world - through conversation, through sight, through the messy way we collaborate. We're…


We’ll be mixing ideas, cocktails and discussions for a night of *neural-networking* in Singapore! Come learn more about Thinking Machines at our happy hour at #iclr2025 😎

Thinking Machines is hosting a happy hour in Singapore during #ICLR2025 on Friday, April 25: lu.ma/ecgmuhmx Come eat, drink, and learn more about us!



Shaojie Bai reposted

Introducing our first set of Llama 4 models! We’ve been hard at work doing a complete re-design of the Llama series. I’m so excited to share it with the world today and mark another major milestone for the Llama herd as we release the *first* open source models in the Llama 4…


Shaojie Bai reposted

Mitigating racial bias from LLMs is a lot easier than removing it from humans! Can’t believe this happened at the best AI conference @NeurIPSConf We have ethical reviews for authors, but missed it for invited speakers? 😡


Shaojie Bai reposted

I'm excited to announce that I am joining the OpenAI Board of Directors. I'm looking forward to sharing my perspectives and expertise on AI safety and robustness to help guide the amazing work being done at OpenAI.

Zico Kolter from Carnegie Mellon joins OpenAI’s Board, bringing technical and AI safety expertise; he also joins the Safety & Security Committee. openai.com/index/zico-kol…



Shaojie Bai reposted

I am very excited to start working with the GenAI team at @Meta, focusing on multimodal LLM agents, joining alongside my amazing CMU colleagues Jing Yu Koh @kohjingyu and Daniel Fried @dan_fried!


Shaojie Bai reposted

I'm thrilled to share that I will become the next Director of the Machine Learning Department at Carnegie Mellon. MLD is a true gem, a department dedicated entirely to ML. Faculty and past directors have been personal role models in my career. cs.cmu.edu/news/2024/kolt…


Shaojie Bai reposted

Say hello to GPT-4o, our new flagship model which can reason across audio, vision, and text in real time: openai.com/index/hello-gp… Text and image input rolling out today in API and ChatGPT with voice and video in the coming weeks.
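For reference, the announced model is reachable through the standard OpenAI Python SDK (pip install openai); a minimal text-only call is sketched below, assuming OPENAI_API_KEY is set in the environment. Audio and video modes rolled out separately, per the tweet.

```python
from openai import OpenAI  # pip install openai

client = OpenAI()  # reads OPENAI_API_KEY from the environment
resp = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Describe GPT-4o in one sentence."}],
)
print(resp.choices[0].message.content)
```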


Shaojie Bai reposted

🚀Our latest blog post unveils the power of Consistency Models and introduces Easy Consistency Tuning (ECT), a new way to fine-tune pretrained diffusion models into consistency models. SoTA fast generative models at 1/32 of the training cost! 🔽 Get ready to speed up your generative…

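To give a flavor of the consistency objective behind this line of work: the model f(x_t, t) is trained so that two points on the same noising trajectory map to (nearly) the same output, with the target produced by a stop-gradient copy. The toy code below is a simplified caricature of that idea, not ECT's actual schedule or parameterization.

```python
import torch
import torch.nn as nn

f = nn.Sequential(nn.Linear(3, 64), nn.SiLU(), nn.Linear(64, 2))  # toy f(x, t)

def consistency_loss(x0: torch.Tensor) -> torch.Tensor:
    t = torch.rand(x0.shape[0], 1)   # random timestep in (0, 1)
    r = t * 0.9                      # an earlier, less-noisy step
    noise = torch.randn_like(x0)
    x_t = x0 + t * noise             # toy noising process
    x_r = x0 + r * noise             # same trajectory, earlier step
    pred = f(torch.cat([x_t, t], dim=1))
    with torch.no_grad():            # stop-gradient teacher target
        target = f(torch.cat([x_r, r], dim=1))
    return ((pred - target) ** 2).mean()

loss = consistency_loss(torch.randn(16, 2))
loss.backward()
```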

Shaojie Bai reposted

I feel like a lot of people leverage LLMs suboptimally, especially for long-form interactions that span a whole project. So I wrote a VSCode extension that supports what I think is a better use paradigm. 🧵 1/N Extension: marketplace.visualstudio.com/items?itemName… Code: github.com/locuslab/chatl…


Exciting new work with Evonne, @AlexRichardCS et al! 🤯 We (generatively) animate photorealistic full-body avatars using conversational audio (of anyone!). Next step, seeing GPT4 + photorealistic avatar [argue with]/[point a finger at]/[mock] you in VR?😏

Motion generation for photorealistic avatars? Say no more! Check out how we animate full body avatars exclusively from audio input! Paper: arxiv.org/abs/2401.01885 Project page: people.eecs.berkeley.edu/~evonne_ng/pro… Dataset + Code: github.com/facebookresear…



Shaojie Bai reposted

My core ML team (@AIatMeta) is hiring research interns! Our projects span optimization, optimal transport, optimal control, generative modeling, complex systems, and geometry. Please apply here and reach out ([email protected]) if you're interested: metacareers.com/jobs/627997209…


Shaojie Bai reposted

@CadeMetz at the New York Times just published a piece on a new paper we are releasing today, on adversarial attacks against LLMs. You can read the piece here: nytimes.com/2023/07/27/tec… And find more info and the paper at: llm-attacks.org [1/n]


Shaojie Bai reposted

NeurIPS!!! My first in-person conference in the 3 years since I started my research. 🥳 Glad to chat about anything: neural dynamics, deep equilibrium models (DEQ), symmetries, protein folding/AF2, etc. I'll be working on the intersection of DEQ and AF2 and am looking forward to all the collaboration opportunities!


Shaojie Bai reposted

🆕📜When can **Equilibrium Models** learn from simple examples to handle complex ones? We identify a property — Path Independence — that enables this by letting EMs think for longer on hard examples. (NeurIPS) 📝: arxiv.org/abs/2211.09961

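A minimal deep equilibrium model makes the "think for longer" framing concrete: the layer's output is the fixed point z* of z = f(z, x), found by iteration, and path independence means z* does not depend on the starting z, so harder inputs can simply be given more iterations. The code below is a toy illustration with a naive fixed-point solver, not the paper's setup.

```python
import torch
import torch.nn as nn

class DEQ(nn.Module):
    def __init__(self, dim: int):
        super().__init__()
        self.lin_z = nn.Linear(dim, dim)
        self.lin_x = nn.Linear(dim, dim)

    def f(self, z, x):
        return torch.tanh(self.lin_z(z) + self.lin_x(x))

    def forward(self, x, iters: int = 50):
        z = torch.zeros_like(x)   # any init works if the model is path-independent
        for _ in range(iters):    # more iterations = more "thinking"
            z = self.f(z, x)
        return z

model = DEQ(16)
x = torch.randn(4, 16)
z_easy = model(x, iters=20)
z_hard = model(x, iters=200)      # spend extra test-time compute on hard inputs
```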

Shaojie Bai reposted

I just posted our Deep Learning Systems Lecture 6 on Fully Connected Networks, Optimization, and Initialization: youtu.be/CukpVt-1PA4 However, the real topic of interest here is that I used @OpenAI's whisper to caption it entirely. A thread 🧵on my experience. 1/N
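The captioning workflow the thread describes can be reproduced with the open-source openai-whisper package (pip install openai-whisper; it also needs ffmpeg on the PATH). The model size and file names below are placeholders.

```python
import whisper

def ts(seconds: float) -> str:
    # SRT timestamp format: HH:MM:SS,mmm
    ms = int(seconds * 1000)
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1_000)
    return f"{h:02}:{m:02}:{s:02},{ms:03}"

model = whisper.load_model("base")        # larger checkpoints caption better
result = model.transcribe("lecture6.mp4")

# Each segment carries start/end times and text: enough to emit SRT captions.
with open("lecture6.srt", "w") as srt:
    for i, seg in enumerate(result["segments"], start=1):
        srt.write(f"{i}\n{ts(seg['start'])} --> {ts(seg['end'])}\n"
                  f"{seg['text'].strip()}\n\n")
```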
