Jaskirat Singh @ ICCV2025🌴

@1jaskiratsingh

Researcher

Seattle, Washington

1jsingh.github.io

於六月 2018 加入

238貼文 322位跟隨者 414個跟隨中

你可能會喜歡

@tha_ajanthan

@DongxuLi_

@DavidZhang9873

@ChaminHewa

@dylanjcampbell_

@JiahuiZhang__32

@SameeraRamasin1

@leike_lk

@LiangZheng_06

@patrick_j_ramos

@yoonwoojeong

@han_junlin

置頂

Jaskirat Singh @ ICCV2025🌴

@1jaskiratsingh

年4月16日

Can we optimize both the VAE tokenizer and diffusion model together in an end-to-end manner? Short Answer: Yes. 🚨 Introducing REPA-E: the first end-to-end tuning approach for jointly optimizing both the VAE and the latent diffusion model using REPA loss 🚨 Key Idea: 🧠…

1jaskiratsingh's tweet image. Can we optimize both the VAE tokenizer and diffusion model together in an end-to-end manner? Short Answer: Yes.

🚨 Introducing REPA-E: the first end-to-end tuning approach for jointly optimizing both the VAE and the latent diffusion model using REPA loss 🚨

Key Idea:
🧠…

Jaskirat Singh @ ICCV2025🌴 已轉發

John Yang

@jyangballin

8 小時

New eval! Code duels for LMs ⚔️ Current evals test LMs on *tasks*: "fix this bug," "write a test" But we code to achieve *goals*: maximize revenue, cut costs, win users Meet CodeClash: LMs compete via their codebases across multi-round tournaments to achieve high-level goals

Jaskirat Singh @ ICCV2025🌴 已轉發

Linjie (Lindsey) Li

@LINJIEFUN

年11月4日

Check out our work ThinkMorph, which thinks in multi-modalities, not just with them.

Jiawei Gu

@Kuvvius

年11月3日

🚨Sensational title alert: we may have cracked the code to true multimodal reasoning. Meet ThinkMorph — thinking in modalities, not just with them. And what we found was... unexpected. 👀 Emergent intelligence, strong gains, and …🫣 🧵 arxiv.org/abs/2510.27492 (1/16)

Kuvvius's tweet image. 🚨Sensational title alert: we may have cracked the code to true multimodal reasoning.
Meet ThinkMorph — thinking in modalities, not just with them.
And what we found was... unexpected. 👀
Emergent intelligence, strong gains, and …🫣
🧵 arxiv.org/abs/2510.27492
(1/16)

Jaskirat Singh @ ICCV2025🌴 已轉發

Manish Shetty

@slimshetty_

年11月3日

Tests certify functional behavior; they don’t judge intent. GSO, our code optimization benchmark, now combines tests with a rubric-driven HackDetector to identify models that game the benchmark. We found that up to 30% of a model’s attempts are non-idiomatic reward hacks, which…

slimshetty_'s tweet image. Tests certify functional behavior; they don’t judge intent. GSO, our code optimization benchmark, now combines tests with a rubric-driven HackDetector to identify models that game the benchmark.

We found that up to 30% of a model’s attempts are non-idiomatic reward hacks, which…

Jaskirat Singh @ ICCV2025🌴 已轉發

Naman Jain

@StringChaos

年11月3日

We added LLM judge based hack detector to our code optimization evals and found models perform non-idiomatic code changes in upto 30% of the problems 🤯

Manish Shetty

@slimshetty_

年11月3日

Jaskirat Singh @ ICCV2025🌴 已轉發

Jaskirat Singh @ ICCV2025🌴

@1jaskiratsingh

年10月22日

end-to-end training just makes latent diffusion transformers better! with repa-e, we showed the power of end-to-end training on imagenet. today we are extending it to text-to-image (T2I) generation. #ICCV2025 🌴 🚨 Introducing "REPA-E for T2I: family of end-to-end tuned VAEs for…

1jaskiratsingh's tweet image. end-to-end training just makes latent diffusion transformers better! with repa-e, we showed the power of end-to-end training on imagenet. today we are extending it to text-to-image (T2I) generation. #ICCV2025 🌴

🚨 Introducing "REPA-E for T2I: family of end-to-end tuned VAEs for…

Jaskirat Singh @ ICCV2025🌴 已轉發

Sayak Paul

@RisingSayak

年10月30日

With simple changes, I was able to cut down @krea_ai's new real-time video gen's timing from 25.54s to 18.14s 🔥🚀 1. FA3 through `kernels` 2. Regional compilation 3. Selective (FP8) quantization Notes are in 🧵 below

Jaskirat Singh @ ICCV2025🌴 已轉發

Chieh-Hsin (Jesse) Lai

@JCJesseLai

年10月29日

Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on! 📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core…

JCJesseLai's tweet image. Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on!

📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon.

It traces the core…

Jaskirat Singh @ ICCV2025🌴 已轉發

Wenhao Chai

@wenhaocha1

年10月27日

Back in 2024, LMMs-Eval built a complete evaluation ecosystem for the MLLM/LMM community, with countless researchers contributing their models and benchmarks to raise the whole edifice. I was fortunate to be one of them: our series of video-LMM works (MovieChat, AuroraCap, VDC)…

Brian Bo Li

@BoLi68567011

年10月25日

Throughout my journey in developing multimodal models, I’ve always wanted a framework that lets me plug & play modality encoders/decoders on top of an auto-regressive LLM. I want to prototype fast, try new architectures, and have my demo files scale effortlessly — with full…

Jaskirat Singh @ ICCV2025🌴 已轉發

Sida Wang

@sidawxyz

年10月23日

I have one PhD intern opening to do research as a part of a model training effort at the FAIR CodeGen team (latest: Code World Model). If interested, email me directly and apply at metacareers.com/jobs/214557081…

Jaskirat Singh @ ICCV2025🌴 已轉發

Sihyun Yu

@sihyun_yu

年10月21日

Arash and his team are fantastic! I highly recommend applying if you’re interested

Arash Vahdat

@ArashVahdat

年10月16日

📢 The Fundamental Generative AI Research (GenAIR) team at NVIDIA is looking for outstanding candidates to join us as summer 2026 interns. Apply via: nvidia.wd5.myworkdayjobs.com/en-US/NVIDIAEx… Email: [email protected] Group website: research.nvidia.com/labs/genair/ 👇

Jaskirat Singh @ ICCV2025🌴 已轉發

Nupur Kumari

@nupurkmr9

年10月17日

🚀 New preprint! We present NP-Edit, a framework for training an image editing diffusion model without paired supervision. We use differentiable feedback from Vision-Language Models (VLMs) combined with distribution-matching loss (DMD) to learn editing directly. webpage:…

Jaskirat Singh @ ICCV2025🌴 已轉發

Sijun Tan

@sijun_tan

年10月17日

I am incredibly excited to introduce rLLM v0.2. Zooming back to a year ago: @OpenAI's o1-preview just dropped, and RL + test-time scaling suddenly became the hype. But no one knew how they did it. @kylepmont and I had this idea - what if we built a solver-critique loop for…

rLLM

@rllm_project

年10月17日

🚀 Introducing rLLM v0.2 - train arbitrary agentic programs with RL, with minimal code changes. Most RL training systems adopt the agent-environment abstraction. But what about complex workflows? Think solver-critique pairs collaborating, or planner agents orchestrating multiple…