Tired of going back to the original papers again and again? Our monograph is a systematic, foundational recipe you can rely on! 📘 We're excited to release "The Principles of Diffusion Models" — with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core…
Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training models for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other…
In diffusion LMs, discrete methods have all but displaced continuous ones (🥲). Interesting new trend: why not both? Use continuous methods to make discrete diffusion better. Diffusion duality: arxiv.org/abs/2506.10892 CADD: arxiv.org/abs/2510.01329 CCDD: arxiv.org/abs/2510.03206
New survey on diffusion language models: arxiv.org/abs/2508.10875 (via @NicolasPerezNi1). Covers pre/post-training, inference and multimodality, with very nice illustrations. I can't help but feel a bit wistful about the apparent extinction of the continuous approach after 2023🥲
Have you given Mojo a try for this? It has a bunch of infra and existing basic support for NEON matmuls - I bet you could make it significantly faster!
Jeremy builds on his years of AI and teaching experience, embracing AI coding by using it the right way: increasing productivity and understanding of code, rather than replacing programmers with "vibe code". solveit is an innovative platform for learning and building apps. Check it out! 👇
It's a strange time to be a programmer—easier than ever to get started, but easier to let AI steer you into frustration. We've got an antidote that we've been using ourselves with 1000 preview users for the last year: "solveit" Now you can join us.🧵 answer.ai/posts/2025-10-…
Makes sense. Mojo gives you the full power of the hardware, it doesn't "abstract" it like some other systems, so it is perfect for doing this sort of work. It provides helper libraries that you can optionally use to make some things (incl tiling etc) more declarative, and…
Let's gooooooo Modular 🚀🚀🚀🚀🚀🚀🚀🚀🚀🚀🚀🚀🚀🚀🚀🚀🚀
torch.compile is PyTorch's Achilles' heel!
Compiling large #PyTorch models at Meta could take an hour or more. Engineers cut PT2 compile time by 80% with parallel Triton compilation, dynamic shape marking, autotuning config pruning, and cache improvements, now integrated into the stack. 🔗 hubs.la/Q03J-6P20
You're a simple person who is so shy and doesn't know anything about these sorts of things :-)
It did occur to me that they're starting to understand why Mojo is actually needed...
Did you tell Jeremy that Mojo runs on all the NV and AMD consumer GPUs and is starting to work on Apple GPUs too? :-)
It was wonderful to spend the day with you in Austin today Kyle. Very excited about the collaboration and our path ahead 🦾!
One more thing to be skeptical about. It's only a matter of time before that DSL dies too. Mojo 🔥 ftw. Stay tuned for part 4: modular.com/blog/matrix-mu…
For more context, see github.com/triton-lang/tr… and youtube.com/watch?v=5e1YKq…
(YouTube: Triton Community Meetup 2025-07-09, meeting recording)
If you'd like to learn more about Mojo + Blackwell, please read: P1: modular.com/blog/matrix-mu… P2: modular.com/blog/matrix-mu… P3: modular.com/blog/matrix-mu… P4: Coming really soon. 🚀
Triton is nice if you want to get something onto a GPU but don't need full performance/TCO. However, if you want peak perf or other HW, then Mojo🔥 could be a better fit. I'm glad OpenAI folk are acknowledging this publicly, but I wrote about it here: modular.com/blog/democrati…
TIL. RIP Triton, killed by its inability to deliver good Blackwell performance.
I’d say Hell froze over, but that might just be because I’m old enough to remember when Mike Hara, VP of Investor Relations at NVIDIA, got in trouble for saying (in 2002) that NVIDIA would be bigger than Intel. wired.com/2002/07/nvidia/
(Wired: profile of Nvidia CEO Jen-Hsun Huang, "the man who plans to make the CPU obsolete")
Huge deal between $NVDA and $INTC. NVIDIA and Intel announced a multi-generation collaboration across PC and datacenter, and NVIDIA will invest $5B in Intel at $23.28 per share. The joint solution will be a tight coupling of Intel x86 CPUs and NVIDIA RTX GPUs over NVLink for PCs…