
Jason Yim

@json_yim

PhD student @MIT_CSAIL. Generative models, protein design. 🦋 Bluesky handle: https://bsky.app/profile/jyim.bsky.social On X until the exodus is complete.

Pinned

Combining discrete and continuous data is an important capability for generative models. To address this for protein design, we introduce Multiflow, a generative model for structure and sequence generation. Preprint: arxiv.org/abs/2402.04997 Code: github.com/jasonkyuyim/mu… 1/8
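
To make the "discrete + continuous" combination concrete, here is a minimal, hedged sketch of a joint training step: linear-interpolation flow matching on coordinates plus a masking-style discrete flow on the sequence. The function name, model interface, and equal loss weighting are illustrative assumptions, not Multiflow's exact objective.

```python
import torch
import torch.nn.functional as F

def joint_flow_loss(model, coords1, seq1, mask_token_id):
    """Sketch of one training step for a joint continuous/discrete flow.
    coords1: (B, L, 3) clean coordinates; seq1: (B, L) integer residue types.
    Continuous part: standard flow matching (regress the velocity coords1 - coords0).
    Discrete part: mask each residue with prob (1 - t), recover it with cross-entropy.
    Illustrative recipe only, not Multiflow's exact loss or parametrization."""
    B = coords1.shape[0]
    t = torch.rand(B, 1, 1)                              # one interpolation time per example
    coords0 = torch.randn_like(coords1)                  # Gaussian source sample
    coords_t = (1 - t) * coords0 + t * coords1           # linear interpolant at time t
    target_v = coords1 - coords0                         # flow-matching velocity target

    keep = torch.rand(seq1.shape) < t.squeeze(-1)        # keep a residue with prob t
    seq_t = torch.where(keep, seq1, torch.full_like(seq1, mask_token_id))

    v_pred, seq_logits = model(coords_t, seq_t, t)       # model sees both modalities jointly
    loss_structure = F.mse_loss(v_pred, target_v)
    masked = ~keep
    loss_sequence = F.cross_entropy(seq_logits[masked], seq1[masked]) if masked.any() else 0.0
    return loss_structure + loss_sequence
```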


Jason Yim reposted

We introduce a new "rule" for understanding diffusion models: Selective Underfitting. It explains: 🚨 How diffusion models generalize beyond training data 🚨 Why popular training recipes (e.g., DiT, REPA) are effective and scale well. Co-led with @kiwhansong0! (1/n)


It's odd that the performance is significantly worse than the AR base model. Starting with a much more powerful AR model, dropping its performance just enough to still beat all other diffusion LLMs, and then saying it's better than them is weird...


Jason Yim reposted

New work: “GLASS Flows: Transition Sampling for Alignment of Flow and Diffusion Models”. GLASS generates images by sampling stochastic Markov transitions with ODEs - allowing us to boost text-image alignment for large-scale models at inference time! arxiv.org/pdf/2509.25170 [1/7]


Jason Yim reposted

We've cleaned up the story big time on flow maps. Check out @nmboffi's slick repo implementing all the many ways to go about them, and stay tuned for a bigger release 🤠 arxiv.org/pdf/2505.18825 flow-maps.github.io

Consistency models, CTMs, shortcut models, align your flow, mean flow... What's the connection, and how should you learn them in practice? We show they're all different sides of the same coin connected by one central object: the flow map. arxiv.org/abs/2505.18825 🧵(1/n)
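
For a concrete anchor, the central object can be written as a two-time map (generic notation here, not necessarily the paper's): the flow map \(\Psi_{s,t}\) carries a sample from time \(s\) to time \(t\) along the probability-flow ODE with velocity field \(v\),

\[
\frac{d}{dt}\,\Psi_{s,t}(x) = v_t\big(\Psi_{s,t}(x)\big), \qquad \Psi_{s,s}(x) = x, \qquad \Psi_{t,u}\circ\Psi_{s,t} = \Psi_{s,u}.
\]

Roughly, a one-step consistency-style model learns the map from an arbitrary time straight to the data endpoint, few-step samplers chain intermediate maps, and the methods listed above become different parametrizations and training objectives for the same \(\Psi\).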



Jason Yim reposted

Very excited to share our preprint: Self-Speculative Masked Diffusions We speed up sampling of masked diffusion models by ~2x by using speculative sampling and a hybrid non-causal / causal transformer arxiv.org/abs/2510.03929 w/ @ValentinDeBort1 @thjashin @ArnaudDoucet1
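
For context on the "speculative sampling" ingredient, here is a minimal sketch of the classic accept/reject rule for a single position: a cheap draft distribution q proposes a token, and the target distribution p either accepts it or resamples from the residual. This is the generic rule (function name and interface are hypothetical), not the paper's self-speculative scheme, where the draft comes from the same hybrid non-causal/causal model.

```python
import torch

def speculative_accept(p, q, draft_token):
    """Generic speculative-sampling step for one position (not the paper's exact rule).
    p, q: target and draft categorical distributions over the vocabulary (1-D, summing to 1).
    Accept the drafted token with prob min(1, p/q); otherwise resample from (p - q)_+."""
    accept_prob = torch.clamp(p[draft_token] / q[draft_token], max=1.0)
    if torch.rand(()) < accept_prob:
        return draft_token, True                       # draft accepted
    residual = torch.clamp(p - q, min=0.0)             # correction distribution
    residual = residual / residual.sum()
    return int(torch.multinomial(residual, 1)), False  # rejected: sample a replacement token
```

Roughly, the ~2x speedup comes from drafting several unmasking decisions cheaply and verifying them in one pass of the full model; see the paper for the actual scheme.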


Jason Yim reposted

🎉Personal update: I'm thrilled to announce that I'm joining Imperial College London @imperialcollege as an Assistant Professor of Computing @ICComputing starting January 2026. My future lab and I will continue to work on building better Generative Models 🤖, the hardest…


Jason Yim reposted

We've open sourced Adjoint Sampling! It's part of a bundled release showcasing FAIR's research and open source commitment to AI for science. github.com/facebookresear… x.com/AIatMeta/statu…

Announcing the newest releases from Meta FAIR. We’re releasing new groundbreaking models, benchmarks, and datasets that will transform the way researchers approach molecular property prediction, language processing, and neuroscience. 1️⃣ Open Molecules 2025 (OMol25): A dataset…



Jason Yim reposted

#FPIworkshop best paper award goes to @peholderrieth @msalbergo and Tommi Jaakkola. Congrats and great talk Peter!


Jason Yim reposted

I won't be at ICLR 🥲 but you can talk to these other cool people at my poster, Thursday 3-5:30 PM in Hall 3+2B #10!

Excited to share my #ICLR2025 paper, with JC Hütter and friends! Genetic perturbation screens allow biologists to manipulate and measure the genes in cells = discover causal relationships! BUT they are expensive to run, expensive to interpret. ... We use LLMs to help!



Jason Yim reposted

Had fun exploring guidance for backbone designability within this latent framework, excited to chat more about guidance with experimental data @gembioworkshop ICLR


I'll be at the ICLR @gembioworkshop workshop presenting latent and structure diffusion for protein backbone generation. Come by to talk all things latent for biology. openreview.net/forum?id=Ek7Hs… arxiv.org/abs/2504.09374



I'll be at ICLR. Come check out our generative modeling work! Reach out if you want to chat. Proteina: x.com/karsten_kreis/… Protcomposer: x.com/HannesStaerk/s… Generator matching: x.com/peholderrieth/…

New paper out! We introduce “Generator Matching” (GM), a method to build GenAI models for any data type (incl. multimodal) with any Markov process. GM unifies a range of state-of-the-art models and enables new designs of generative models. arxiv.org/abs/2410.20587 (1/5)
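
For readers new to the terminology, the "generator" here is the standard infinitesimal generator of a Markov process (standard textbook notation; the paper's training objective is not reproduced here):

\[
(\mathcal{L}_t f)(x) = \lim_{h \to 0^+} \frac{\mathbb{E}\big[f(X_{t+h}) \mid X_t = x\big] - f(x)}{h}.
\]

A deterministic flow gives \((\mathcal{L}_t f)(x) = v_t(x)\cdot\nabla f(x)\), a diffusion adds \(\tfrac{\sigma_t^2}{2}\Delta f(x)\), and a jump process over discrete states gives \(\sum_y Q_t(x,y)\,\big(f(y)-f(x)\big)\); regressing a parametrized generator onto a conditional one then plays the role that regressing the velocity field plays in flow matching.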



RFdiffusion => generative binder design. RFdiffusion2 => generative enzyme design. It's rare to find scientists with deep knowledge in chemistry, machine learning, and software engineering like Woody. The complexity of enzymes matches the complexity of his skills. Check out RFD2

New enzymes can unlock chemistry we never had access to before. Here, we introduce RFdiffusion2 (RFD2), a generative model that makes significant strides in de novo enzyme design. Preprint: biorxiv.org/content/10.110… Code: coming soon Animation credit: x.com/ichaydon (1/n)




Jason Yim reposted

Excited to share our preprint “BoltzDesign1: Inverting All-Atom Structure Prediction Model for Generalized Biomolecular Binder Design” — a collaboration with @MartinPacesa, @ZhidianZ , Bruno E. Correia, and @sokrypton. 🧬 Code will be released in a couple weeks


Jason Yim reposted

Protein dynamics was the first research to enchant me >10 yrs ago, but I left it during my PhD because I couldn't find big experimental data to evaluate models. Today, with @ginaelnesr, I'm thrilled to share the big dynamics data I've been dreaming of, and the model we trained: Dyna-1. rb.gy/de5axp


Combining prediction, generation, and modalities (sequence, structure, nucleic acids, small molecules, proteins) is the future. Congrats to the authors! Looking forward to the technical report.

Announcing Neo-1: the world’s most advanced atomistic foundation model, unifying structure prediction and all-atom de novo generation for the first time - to decode and design the structure of life 🧵(1/10)



Jason Yim reposted

Introducing All-atom Diffusion Transformers — towards Foundation Models for generative chemistry, from my internship with the FAIR Chemistry team @OpenCatalyst @AIatMeta There are a couple ML ideas which I think are new and exciting in here 👇


Awesome work by Hannes and Bowen towards improved control of protein structure generation with MultiFlow!

New paper (and #ICLR2025 Oral :)): ProtComposer: Compositional Protein Structure Generation with 3D Ellipsoids arxiv.org/abs/2503.05025 Condition on your 3D layout (of ellipsoids) to generate proteins like this or to get better designability/diversity/novelty tradeoffs. 1/6



I really enjoyed seeing how protein generation models scale with more data and weights. Congrats to Nvidia and the core contributors for this amazing work!

📢📢 "Proteina: Scaling Flow-based Protein Structure Generative Models" #ICLR2025 (Oral Presentation) 🔥 Project page: research.nvidia.com/labs/genair/pr… 📜 Paper: arxiv.org/abs/2503.00710 🛠️ Code and weights: github.com/NVIDIA-Digital… 🧵Details in thread... (1/n)


