
Joshua Batson
@thebasepoint
Trying to understand evolved systems (🖥 and 🧬). Interpretability research @anthropicai; formerly @czbiohub, @mit math.
3→5, 4→6, 9→11, 7→? LLMs solve this via In-Context Learning (ICL); but how is ICL represented and transmitted in LLMs? We build new tools identifying “extractor” and “aggregator” subspaces for ICL, and use them to understand ICL addition tasks like the one above. Come to…
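The addition task in the tweet can be sketched as a tiny pattern-inference check. The number pairs come from the tweet itself; the code is purely illustrative and is not from the referenced work:

```python
# In-context examples from the tweet: 3→5, 4→6, 9→11, query: 7→?
examples = [(3, 5), (4, 6), (9, 11)]

# Every pair differs by the same offset, so the implied rule is "add 2".
offsets = {y - x for x, y in examples}
assert offsets == {2}

def complete(query: int, offset: int = 2) -> int:
    """Apply the inferred add-2 rule to a new query input."""
    return query + offset

print(complete(7))  # → 9
```

An LLM given the three example pairs in its prompt infers this offset rule from context alone, with no weight updates, which is what makes the internal representation of the rule interesting to probe.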

This was so cool to be a part of. Jack led an incredible effort to quickly analyze the internals of a new model, as versions were coming in, to assess alignment. Research at the speed of model development.
Prior to the release of Claude Sonnet 4.5, we conducted a white-box audit of the model, applying interpretability techniques to “read the model’s mind” in order to validate its reliability and alignment. This was the first such audit on a frontier LLM, to our knowledge. (1/15)

We asked every version of Claude to make a clone of Claude.ai, including today’s Sonnet 4.5… see what happened in the video
We’re hiring someone to run the Anthropic Fellows Program! Our research collaborations have led to some of our best safety research and hires. We’re looking for an exceptional ops generalist, TPM, or research/eng manager to help us significantly scale and improve our collabs 🧵
Arc Institute trained their foundation model Evo 2 on DNA from all domains of life. What has it learned about the natural world? Our new research finds that it represents the tree of life, spanning thousands of species, as a curved manifold in its neuronal activations. (1/8)

Join Anthropic interpretability researchers @thebasepoint, @mlpowered, and @Jack_W_Lindsey as they discuss looking into the mind of an AI model - and why it matters:
We’re running another round of the Anthropic Fellows program. If you're an engineer or researcher with a strong coding or technical background, you can apply to receive funding, compute, and mentorship from Anthropic, beginning this October. There'll be around 32 places.

New research with coauthors at @Anthropic, @GoogleDeepMind, @AiEleuther, and @decode_research! We expand on and open-source Anthropic’s foundational circuit-tracing work. Brief highlights in thread: (1/7)