Manoj Rao

@manojrajarao

prev: AI infra @perplexity_ai, @tesla_ai, @aws_ai

mlai.blog

انضم في يونيو 2014

575المنشورات 426المتابعون 645المتابَعون

قد يعجبك

@dneorej

@antonbaumann101

@AllGoneTomorrow

@JasonPatel13

@manojsinghkira1

@alarig_

@rimonamit

@SantilloDavide

@vladblagoje

@unseriousprof

@KeevinOrourke

@moreirajosedani

Manoj Rao أعاد

PyTorch

@PyTorch

22 س

Training massive Mixture-of-Experts (MoE) models like DeepSeek-V3 and Llama 4-Scout efficiently is one of the challenges in modern AI. These models push GPUs, networks, and compilers to their limits. To tackle this, AMD and Meta’s PyTorch teams joined forces to tune TorchTitan…

PyTorch's tweet image. Training massive Mixture-of-Experts (MoE) models like DeepSeek-V3 and Llama 4-Scout efficiently is one of the challenges in modern AI. These models push GPUs, networks, and compilers to their limits.

To tackle this, AMD and Meta’s PyTorch teams joined forces to tune TorchTitan…

Manoj Rao

@manojrajarao

٣٠ نوفمبرم

I enjoyed this one.

Nikhil Kamath

@nikhilkamathcio

٣٠ نوفمبرم

Out now @elonmusk

Manoj Rao

@manojrajarao

١٩ نوفمبرم

They (finally) shipped the obvious feature. Might bring order to our mess, though parts still feel oddly manual.

Manoj Rao أعاد

Curiosity

@MAstronomers

١٨ نوفمبرم

BREAKING🚨: NASA will reveal the latest images of Interstellar visitor, 3I/ATLAS, on Nov. 19.

Manoj Rao أعاد

Andrej Karpathy

@karpathy

١٦ نوفمبرم

I heard Gemini 3 answers questions before you ask them. And that it can talk to your cat.

Manoj Rao

@manojrajarao

١١ نوفمبرم

🔥🔥

Simran Arora

@simran_s_arora

١١ نوفمبرم

AI has been built on one vendor’s stack for too long. AMD’s GPUs now offer state-of-the-art peak compute and memory bandwidth — but the lack of mature software / the “CUDA moat” keeps that power locked away. Time to break it and ride into our multi-silicon future. 🌊 It's been a…

simran_s_arora's tweet image. AI has been built on one vendor’s stack for too long.
AMD’s GPUs now offer state-of-the-art peak compute and memory bandwidth — but the lack of mature software / the “CUDA moat” keeps that power locked away. Time to break it and ride into our multi-silicon future. 🌊

It's been a…

Manoj Rao أعاد

Simran Arora

@simran_s_arora

١١ نوفمبرم

Manoj Rao

@manojrajarao

٥ نوفمبرم

This was a fun event. gpumode.com/v2/news Thanks @marksaroufim @caseyaylward for organizing @danielhanchen @cHHillee for feedback PR soon!

manojrajarao's tweet image. This was a fun event. gpumode.com/v2/news

Thanks @marksaroufim @caseyaylward for organizing
@danielhanchen @cHHillee for feedback

PR soon!

Ethan Boneh

@ethanboneh

٤ نوفمبرم

A week ago I went to my first @gpu_mode hackathon, and, together with @manojrajarao, @Ameen_ml and Emily Shen, placed fourth with HelionEvolve, an OpenEvolve-based autotuner for (Helion) GPU kernels.

ethanboneh's tweet image. A week ago I went to my first @gpu_mode hackathon, and, together with @manojrajarao, @Ameen_ml and Emily Shen, placed fourth with HelionEvolve, an OpenEvolve-based autotuner for (Helion) GPU kernels.

Manoj Rao

@manojrajarao

٤ نوفمبرم

Mandatory for free-tier, opt-in for plus++ and Stargate would be paid for.

Andrew Carr 🤸

@andrew_n_carr

٤ نوفمبرم

I want ads in chat gpt so badly. Please tell me what to buy my wife, where to take my mother on vacation, what to think, what to wear, what to read.

Manoj Rao أعاد

Yukang Chen

@yukangchen_

١٤ أكتوبرم

We open-sourced QeRL — Quantization-enhanced Reinforcement Learning ! 🧠 4-bit quantized RL training 💪 Train a 32B LLM on a single H100 GPU ⚙️ 1.7× faster overall training 🎯 Accuracy on par with bfloat16-level accuracy 🔥 Supports NVFP4 quantization format Moreover, we show…

Manoj Rao

@manojrajarao

١ نوفمبرم

Just UX: why does @windsurf feel way faster than @cursor_ai or vscode + copilot (all running Sonnet 4.5)? As a ex-DoomEmacs user, I’d been missing that snappiness in AI IDEs.

Manoj Rao

@manojrajarao

٢٤ أكتوبرم

Today, I overheard the legendary @jeremyphoward dissuading another man from trying to change Physics with this whole AI thing. (thankfully, he sounded convinced!) Also, I realized I blew the chance for an epic selfie with Jeremy @iScienceLuvr & @johnowhitaker :(

Manoj Rao

@manojrajarao

١٨ أكتوبرم

The "...thethethethe..." explanation of process-based provisioning was eye-opening. Many other great ones in this. 👏 @dwarkesh_sp @karpathy

Dwarkesh Patel

@dwarkesh_sp

١٧ أكتوبرم

The @karpathy interview 0:00:00 – AGI is still a decade away 0:30:33 – LLM cognitive deficits 0:40:53 – RL is terrible 0:50:26 – How do humans learn? 1:07:13 – AGI will blend into 2% GDP growth 1:18:24 – ASI 1:33:38 – Evolution of intelligence & culture 1:43:43 - Why self…

Manoj Rao

@manojrajarao

١٠ أكتوبرم

An absolute Rockstar!

Neeraj Chopra

@Neeraj_chopra1

١٠ أكتوبرم

Recharging with a Javelin in the Swiss Alps. 🔋🎯

Manoj Rao

@manojrajarao

١٠ أكتوبرم

w00t!!

Dylan Patel

@dylan522p

٩ أكتوبرم

Today we are launching InferenceMAX! We have support from Nvidia, AMD, OpenAI, Microsoft, Pytorch, SGLang, vLLM, Oracle, CoreWeave, TogetherAI, Nebius, Crusoe, HPE, SuperMicro, Dell It runs every day on the latest software (vLLM, SGLang, etc) across hundreds of GPUs, $10Ms of…