maybe: shivam

@kaffeinated

ai researcher @spotify, former matrix multiplier @twitter @x cortex, nyu. techno-optimist. probably on a bike 🚲☕.

maybe: shivam reposted

Announcing Olmo 3, a leading fully open LM suite built for reasoning, chat, & tool use, and an open model flow—not just the final weights, but the entire training journey. Best fully open 32B reasoning model & best 32B base model. 🧵


maybe: shivam reposted

I need to take more 14-hour flights without internet. Carefully read three new (amazing!) papers, wrote two letters, and watched two movies. Need a Faraday cage at home.


maybe: shivam reposted

The UK is a great country with an extraordinary history. Our stagnation is real, but it's fixable and worth fixing. Enjoyed giving this talk at @lfg_uk last week and so encouraged by the optimistic responses I've had from people who are building a brilliant future for Britain 🚀


maybe: shivam reposted

⛵Marin 32B Base (mantis) is done training! It is the best open-source base model (beating OLMo 2 32B Base) and it’s even close to the best comparably-sized open-weight base models, Gemma 3 27B PT and Qwen 2.5 32B Base. Ranking across 19 benchmarks:


maybe: shivam reposted

LLMs are injective and invertible. In our new paper, we show that different prompts always map to different embeddings, and this property can be used to recover input tokens from individual embeddings in latent space. (1/6)

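The injectivity claim is easy to see in miniature at the embedding layer: a random embedding table almost surely maps distinct tokens to distinct vectors, so exact nearest-neighbor search inverts it. A toy numpy sketch (the table `E` and all sizes are made up; the paper's actual algorithm inverts much deeper latents):

```python
# Toy illustration of the injectivity claim, not the paper's method:
# a random embedding table is almost surely injective over a finite
# vocabulary, so tokens are exactly recoverable from their embeddings.
import numpy as np

rng = np.random.default_rng(0)
vocab_size, d_model = 10_000, 256
E = rng.standard_normal((vocab_size, d_model))  # hypothetical embedding table

tokens = rng.integers(0, vocab_size, size=16)   # a "prompt" of token ids
latents = E[tokens]                             # what we observe in latent space

# Nearest-neighbor inversion: ||x - e||^2 = ||e||^2 - 2*x.e + const(x),
# so the argmin over the vocabulary recovers each token exactly.
d2 = (E ** 2).sum(axis=1)[None, :] - 2.0 * (latents @ E.T)
recovered = d2.argmin(axis=1)
assert (recovered == tokens).all()
print(f"recovered all {len(tokens)} tokens from their embeddings")
```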

maybe: shivam reposted

You spend $1B training a model A. Someone on your team leaves and launches their own model API B. You're suspicious. Was B derived (e.g., fine-tuned) from A? But you only have black-box access to B... With our paper, you can still tell with strong statistical guarantees…

🔎Did someone steal your language model? We can tell you, as long as you shuffled your training data🔀. All we need is some text from their model! Concretely, suppose Alice trains an open-weight model and Bob uses it to produce text. Can Alice prove Bob used her model?🚨
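A hedged sketch of the flavor of test the thread describes, not the paper's actual statistic: if Alice's training order was a uniform random shuffle, then under the null hypothesis that Bob's model is unrelated to her run, any per-document score of Bob's model is exchangeable with respect to that order, and a permutation test yields an exact p-value. The score and the planted order effect below are purely illustrative:

```python
# Sketch of a permutation test for "was model B derived from A's training
# run?". Assumes Alice can compute, for each of her training documents, a
# score of Bob's model (e.g., its loss on that document) that should track
# training position only if B was derived from A. Under the null, her
# shuffled training order is independent of the scores, so reshuffling it
# simulates the null distribution exactly.
import numpy as np

def permutation_pvalue(order, scores, n_perm=10_000, seed=0):
    rng = np.random.default_rng(seed)
    order, scores = np.asarray(order, float), np.asarray(scores, float)
    observed = abs(np.corrcoef(order, scores)[0, 1])
    null = np.array([abs(np.corrcoef(rng.permutation(order), scores)[0, 1])
                     for _ in range(n_perm)])
    # add-one correction keeps the p-value valid at finite n_perm
    return (1 + (null >= observed).sum()) / (n_perm + 1)

# Hypothetical usage with a faint planted order effect:
rng = np.random.default_rng(1)
order = np.arange(500)
scores = -0.002 * order + rng.standard_normal(500)
print(permutation_pvalue(order, scores))  # small p-value => evidence of derivation
```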



maybe: shivam reposted

Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,…


maybe: shivam reposted

We wrote a book about representation learning! It’s fully open source, available and readable online, and covers everything from theoretical foundations to practical algorithms. 👷‍♂️ We’re hard at work updating the content for v2.0, and would love your feedback and contributions


maybe: shivam reposted

A lot of younger people who’re new to the industry ask me how to do “networking”. How do you go from knowing no one in San Francisco to having a network of people that you can do business with, learn from, hire, etc. The trick is I never set out to “network”. Maybe due to…


maybe: shivam reposted

it's called encoder-decoder for a reason


maybe: shivam reposted

I have never even slightly wavered in my position that Trump is bad and stupid


maybe: shivam reposted

this image has singlehandedly saved me from so much overthinking


maybe: shivam reposted

companies like Facebook record every imaginable interaction their users have with the platform. they log each of your clicks and taps. they keep track of how long your gaze lingered on a post, whether you were on the same WiFi as that woman who might be your friend, which…


maybe: shivam reposted

Sure, you could use AI to summarize papers and explain them at a level anyone could understand... or you can turn the abstracts into music videos for no reason. The tools are not perfect yet, but the disparate elements (consistent characters, lip syncing, etc.) are evolving fast


maybe: shivam reposted

There's only so much attention available in a day. Invest it in enthusiasm, wonder, and progress.


maybe: shivam reposted

It's kind of predictable that it would get defined down, but it's still wild that "AI Agent" went from meaning "AI that can act autonomously to achieve a goal by planning, executing and self-correcting when it makes mistakes" to "a wrapper that calls an LLM a bunch of times"

This is what they call "an agent."

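And the defined-down version really is about this much code. A deliberately literal sketch, with `call_llm` as a hypothetical stub rather than any real API:

```python
# The defined-down "agent" from the tweet, taken literally: a wrapper that
# calls an LLM a bunch of times. `call_llm` is a hypothetical stub, not a
# real API.
def call_llm(prompt: str) -> str:
    """Hypothetical model call, stubbed so the sketch runs."""
    return f"thought about: {prompt[:48]}"

def agent(goal: str, max_steps: int = 5) -> str:
    context = goal
    for _ in range(max_steps):       # "plan, execute, self-correct"...
        context = call_llm(context)  # ...or just call the model again
    return context

print(agent("book me a flight"))
```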


maybe: shivam reposted

Our latest CS336 Language Modeling from Scratch lectures are now available! View the entire playlist here: youtube.com/playlist?list=…


maybe: shivam reposted

This is actually how it feels. Remember feature engineering?


maybe: shivam reposted

Seeing text-to-text regression work for Google’s massive compute cluster (billion $$ problem!) was the final result to convince us we can reward model literally any world feedback. Paper: arxiv.org/abs/2506.21718 Code: github.com/google-deepmin… Just train a simple encoder-decoder…

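The recipe in the tweet reduces to a small amount of glue: write the system state as text, write the numeric feedback as text, and train an off-the-shelf encoder-decoder on the pairs. A hedged sketch using the standard Hugging Face T5 pattern (the record fields and target are invented; this is not the paper's code):

```python
# Minimal sketch of text-to-text regression: the state is a string, the
# regression target is a string, and an ordinary encoder-decoder is trained
# to emit the number. The record and target below are invented; the T5
# calls follow the standard Hugging Face seq2seq pattern.
from transformers import T5ForConditionalGeneration, T5TokenizerFast

tok = T5TokenizerFast.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

x = "job: training, cpus: 4096, priority: high, region: us-central"  # made-up state
y = "0.87"                                    # numeric feedback, written as text

enc = tok(x, return_tensors="pt")
labels = tok(y, return_tensors="pt").input_ids
loss = model(**enc, labels=labels).loss       # cross-entropy on the digit tokens
loss.backward()                               # one training step of the "regressor"

# Inference: decode the prediction back out of the text head.
out = model.generate(**enc, max_new_tokens=8)
print(tok.decode(out[0], skip_special_tokens=True))
```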
