r0

@romitjain_

0049 I like organizing matrices

github.com/romitjain

Joined February 2010

480Posts 197Followers 441Following

You might like

@AppLovin

@mattmiller1973

@priyansh_mangal

@jawabdeyh

@ScheilUdo

@vamsikrishnap97

@gittisun

r0

@romitjain_

Nov 5

I love @askalphaxiv , but please fix this. 🙏🙏 All of my highlights and notes went away. I am ready to pay for a subscription.

"ah, it's 19 bits then, right" "well not the storage, that's still 32 bit" "oh, so then the accumulation is done in 19 bits then, right" "no, the accumulation is still done in full fp32" "oh, so then there's basically no precision loss from fp32 then" "well, also no"

r0

@romitjain_

Oct 27

Only works for DDP, but amazing speedups. Good for inference setups, not for training workloads (yet)

fal

@fal

Oct 25

🚨 Introducing FlashPack: Lightning-fast model loading package for PyTorch! ⚡ 3-6x faster model loading than current methods 📦 Convert existing checkpoints in one command 🔧 Works on any system Read our blogpost for more details!👇️ blog.fal.ai/introducing-fl…

fal's tweet card. When using machine learning models in the real world, performance isn’t just about how fast your GPU can crunch numbers — it’s also about how quickly you can get your model there. Every second spent...

Introducing FlashPack: Lightning-Fast Model Loading for PyTorch

Source: blog.fal.ai

r0

@romitjain_

Oct 16

every morning, I wake up, and get ready to face my battle.. bangalore traffic

r0 reposted

Jascha Sohl-Dickstein

@jaschasd

Sep 28

Title: Advice for a young investigator in the first and last days of the Anthropocene Abstract: Within just a few years, it is likely that we will create AI systems that outperform the best humans on all intellectual tasks. This will have implications for your research and…

jaschasd's tweet image. Title: Advice for a young investigator in the first and last days of the Anthropocene

Abstract: Within just a few years, it is likely that we will create AI systems that outperform the best humans on all intellectual tasks. This will have implications for your research and…

r0 reposted

λux

@novasarc01

Oct 8

FlashInfer redefines how attention kernels, kv-cache layouts and dynamic runtimes are compiled and scheduled for efficient LLM serving. Check out my latest blog "Dissecting FlashInfer - A Systems Perspective on High-Performance LLM Inference".

novasarc01's tweet image. FlashInfer redefines how attention kernels, kv-cache layouts and dynamic runtimes are compiled and scheduled for efficient LLM serving. Check out my latest blog "Dissecting FlashInfer - A Systems Perspective on High-Performance LLM Inference".

r0 reposted

Red Hat AI

@RedHat_AI

Sep 26

Missed our latest vLLM office hours? We covered hybrid models as first-class citizens in @vllm_project. ✅ Hybrid model support in v1 ✅ Mamba, Mamba2, linear attention ✅ Performance from v0 → v1 ▶️ Recording: youtube.com/live/uWQ489ONv… 📑 Slides: docs.google.com/presentation/d…

r0

@romitjain_

Sep 14

Michael Jordan comes close, but Novak is truly the GOAT. His mental toughness is beyond imagination.

SK

@Djoko_UTD

Sep 13

Messi never suffered war LeBron wasn’t treated unfairly. Phelps didn’t grow up in poverty Brady can’t talk 12 languages Woods never had 2 Goat rivals Sachin didn’t face this much hate. Bolt didn’t come from a war torn country. But Novak Djokovic faced it all and became The GOAT

Djoko_UTD's tweet image. Messi never suffered war
LeBron wasn’t treated unfairly.
Phelps didn’t grow up in poverty
Brady can’t talk 12 languages
Woods never had 2 Goat rivals
Sachin didn’t face this much hate.
Bolt didn’t come from a war torn country.

But Novak Djokovic faced it all and became The GOAT

r0 reposted

Simo Ryu

@cloneofsimo

Sep 12

ok final post for today did youall know there is this golden blogpost on github in markdown format, burried with zero visibility because its not github pages or readme but its so good?

cloneofsimo's tweet image. ok final post for today
did youall know there is this golden blogpost on github in markdown format, burried with zero visibility because its not github pages or readme
but its so good?

r0

@romitjain_

Aug 26

If you procrastinate, try scheduling your procrastination time earlier in the day so it doesn’t interfere with your productivity.

Bryan Johnson

@bryan_johnson

Aug 26

If you have anxiety, try scheduling your worry time earlier in the day so it doesn’t interfere with your sleep.

r0 reposted

Alex L Zhang

@a1zhang

Aug 18

announcing the @GPU_MODE x @scaleml summer speaker series happening next week, a 5⃣-day series where top researchers will teach about the algorithmic and systems-level advances that underpin `gpt-oss`! all content will be live-streamed & recorded for FREE on GPU MODE's YouTube!

a1zhang's tweet image. announcing the @GPU_MODE x @scaleml summer speaker series happening next week, a 5⃣-day series where top researchers will teach about the algorithmic and systems-level advances that underpin `gpt-oss`!

all content will be live-streamed &amp; recorded for FREE on GPU MODE's YouTube!

r0

@romitjain_

Aug 1

too real

Ramp Capital

@RampCapitalLLC

Jul 31

Everything is computer

r0

@romitjain_

Jul 17

Sometimes a couple of bad weeks can do wonders. Motivates you to be better and you come back stronger.

r0

@romitjain_

Jun 25

What an amazing course. I did a few lectures on optimization and kernels. They seem to be good (for high-level understanding). For low-level, their assignments are worth it..

Percy Liang

@percyliang

Jun 18

Wrapped up Stanford CS336 (Language Models from Scratch), taught with an amazing team @tatsu_hashimoto @marcelroed @neilbband @rckpudi. Researchers are becoming detached from the technical details of how LMs work. In CS336, we try to fix that by having students build everything:

r0

@romitjain_

Jun 20

These two statements - "The hottest new programming language is English" by Karpathy and "gpus go brrrr" by Horace he perfectly sums up the current LLM era

r0 reposted

will brown

@willccbb

Jun 18

everybody wants to do fun experiments nobody wants to write core infrastructure code

r0

@romitjain_

Jun 18

Have been tinkering with @vllm_project since its release. It's a beautiful library. I hope it remains the same. One of my earlier long-form articles was around understanding vLLM's behaviour - cmeraki.github.io/throughput-is-…

dr. jack morris

@jxmnop

Jun 17

the "Design" pages of vLLM are actually incredible. found just the other day and bingeread them all over the weekend how delightful to know there are still AI researchers doing Real Computer Science custom hashing, careful memory management... they even use linked lists...