Program Counter

@program_counter

all things toward agi

Valhalla

انضم في ديسمبر 2022

1ألفالمنشورات 464المتابعون 7ألفالمتابَعون

قد يعجبك

@lucas__durante

@RetainTraffic

@SeniorPartner_

@Xavier_web_dev

@cameraenvy

Program Counter أعاد

Thien Tran

@gaunernst

٢ نوفمبرم

gau-nernst.github.io/amd-a2a/

Program Counter أعاد

Yacine Mahdid

@yacinelearning

٣١ أكتوبرم

I quit my job so I can have enough time to read this book btw

Training LLMs end to end is hard. Very excited to share our new blog (book?) that cover the full pipeline: pre-training, post-training and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably huggingface.co/spaces/Hugging…

eliebakouch's tweet image. Training LLMs end to end is hard. Very excited to share our new blog (book?) that cover the full pipeline: pre-training, post-training and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably

huggingface.co/spaces/Hugging…

Program Counter أعاد

Edward Z. Yang

@ezyang

٣١ أكتوبرم

I've been brainstorming episodes for the next season of PyTorch Developer Podcast. DTensor StridedShard, FSDP-TP order Redistributing a DTensor Prefetching vs Bucketing History of FSDP in PyTorch Multiprocessing: DataParallel versus DistributedDataParallel Monarch Parallelism…

Program Counter أعاد

Philipp Singer

@ph_singer

٣٠ أكتوبرم

Kaggle is still extremely underrated to learn these basics. On Kaggle you need to produce (i) strong models, (ii) that are robust with strong validation setups, and that are (iii) optimized for inference speed. So you actually learn about many parts of the DS pipeline.

kalomaze

@kalomaze

٢٩ أكتوبرم

my honest advice to people who this resonated with: spend less time reading shiny papers & more time working on the "boring" things, focus on the basics. by basics, i mean like deduplication (on the data side), understanding dp/tp/pp abstractions (on the training side), etc

Program Counter أعاد

Jeremy Howard

@jeremyphoward

٣٠ أكتوبرم

Recently we released a 3000+ word book chapter written by @KeremTurgutlu, based on @karpathy's marvelous "Let's built the GPT tokenizer" video. It's got pics, links, code, diagrams, … Kerem has now written a detailed walk-through of how he made it: answer.ai/posts/2025-10-…

Program Counter أعاد

Andrej Karpathy

@karpathy

٢٦ أكتوبرم

Beautiful technical debugging detective longread that starts with a suspicious loss curve and ends all the way in the Objective-C++ depths of PyTorch MPS backend of addcmul_ that silently fails on non-contiguous output tensors. I wonder how long before an LLM can do all of this.

Elana Simon

@ElanaPearl

٢٣ أكتوبرم

New blog post: The bug that taught me more about PyTorch than years of using it started with a simple training loss plateau... ended up digging through optimizer states, memory layouts, kernel dispatch, and finally understanding how PyTorch works!

ElanaPearl's tweet image. New blog post: The bug that taught me more about PyTorch than years of using it

started with a simple training loss plateau... ended up digging through optimizer states, memory layouts, kernel dispatch, and finally understanding how PyTorch works!

Program Counter أعاد

Eric Zhang

@ekzhang1

٢٤ أكتوبرم

192 weeks notes.ekzhang.com/reflections/19…

ekzhang1's tweet card. “How many weeks are between Jan 31, 2022 and Oct 6, 2025?”

192 Weeks

المصدر: notes.ekzhang.com

Program Counter أعاد

judah

@joodalooped

٢٣ أكتوبرم

24 years old, still holds up

Program Counter أعاد

Simon Guo

@simonguozirui

٢٢ أكتوبرم

RL is pain sometimes and torchforge could handle a lot of the messiness! Thanks for the @PyTorch and @CoreWeave team for letting us test it out!

PyTorch

@PyTorch

٢٢ أكتوبرم

Today Meta announced torchforge, a brand-new PyTorch-native library that makes it easy to use reinforcement learning (RL) to train AI agents. Forge provides high-performance building blocks and ready-to-use examples, so you can focus on what’s novel about your use case rather…

Program Counter أعاد

@fclc

@FelixCLC_

٢٢ أكتوبرم

One of the best SIMD programmers I’ve had the pleasure of interacting with is becoming available. Real work, with real code, that you almost certainly interact with every single day.

Robert Clausecker

@FUZxxl

٢٢ أكتوبرم

I am looking for a job starting May 2026. I am an expert in SIMD programming, in particular for non-numeric applications such as text processing or database programming. Please have a look at my website for the sort of work I do. I am located in Berlin, Germany.

Program Counter أعاد

Nicholas Wilt

@CUDAHandbook

٢٢ أكتوبرم

At PyTorch 2025, where NVIDIA decided to unveil more details about cuTile and TileIR.

Program Counter أعاد

Aarno

@TheGlobalMinima

٢٢ أكتوبرم

The OG PyTorch blog, explaining the mechanics and concepts of the internals of the framework. This basically allows you to explore the complete codebase, enabling better contributions. Definitely worth a read, then another ! Blog by - @ezyang

TheGlobalMinima's tweet image. The OG PyTorch blog, explaining the mechanics and concepts of the internals of the framework. This basically allows you to explore the complete codebase, enabling better contributions. Definitely worth a read, then another !

Blog by - @ezyang

Program Counter أعاد

Sylvain Gugger

@GuggerSylvain

٢٩ يوليو ٢٠٢٠ م

One year and half after starting the first draft of the first chapter, look what arrived in the mail!

Program Counter أعاد

Surya Ganguli

@SuryaGanguli

٢١ أكتوبرم

Good point. My first paper was on time travel in the Gödel universe. ML was easy to pick up after that :) journals.aps.org/prd/abstract/1…

Ankit Singhal

@notankitsinghal

٢١ أكتوبرم

We’ve found a ton of value hiring folks with strong theory backgrounds with little to no production ML experience. One of our members of technical staff got his phd in pure math/the geometry of black holes and had no prior ML experience. Within days of hiring him we released our…

Program Counter أعاد

jd ❄️

@jdchawla29

٢٠ أكتوبرم

it is indeed blog post catch-up day (i'm behind by 6 weeks)

stochasm

@stochasticchasm

١٩ أكتوبرم

I guess it's blog post catch-up day instead of paper catch-up day x.com/omouamoua/stat…

Program Counter أعاد

Kris Wu

@yh_kris

٢٠ أكتوبرم

I've left NVIDIA Research and joined AIRoA Tokyo as Team Lead, VLA Dev. We're pushing VLA and building a Japan-wide real-world data ecosystem with major partners in retail/logistics/construction to deploy hundreds of humanoids. 🔥We're hiring researchers and DM me if interested!

Program Counter أعاد

George Grigorev

@iamgrigorev

١٨ أكتوبرم

bro @karpathy literally re-implemented the entire lm-eval-harness in 2 Python files It's been very useful for my own repo and easy to adapt for SuperBPE case

iamgrigorev's tweet image. bro @karpathy literally re-implemented the entire lm-eval-harness in 2 Python files
It's been very useful for my own repo and easy to adapt for SuperBPE case

Program Counter أعاد

Vincent Abbott

@vtabbott_

١٨ أكتوبرم

I'll be presenting Formalized Kernel Derivation to @GPU_MODE w/ @GioeleZardini; discord dot gg/gpumode at noon PST today! Will be uploaded to the GPU Mode YT afterward. Somewhere at the intersection of art and science. Come for the diagrams, stay for the math.

vtabbott_'s tweet image. I'll be presenting Formalized Kernel Derivation to
@GPU_MODE w/ @GioeleZardini; discord dot gg/gpumode at noon PST today! Will be uploaded to the GPU Mode YT afterward.
Somewhere at the intersection of art and science. Come for the diagrams, stay for the math.

Program Counter أعاد

Michael Choi

@michaelchchoi

١٨ أكتوبرم

RIP. Markov processes and Yang-Mills: tinyurl.com/3huay75x

Tsinghua University

@Tsinghua_Uni

١٨ أكتوبرم

Prof. Chen Ning Yang, a world-renowned physicist, Nobel Laureate in Physics, Academician of the Chinese Academy of Sciences, Professor at Tsinghua University, and Honorary Director of the Institute for Advanced Study at Tsinghua University, passed away in Beijing due to illness…

Tsinghua_Uni's tweet image. Prof. Chen Ning Yang, a world-renowned physicist, Nobel Laureate in Physics, Academician of the Chinese Academy of Sciences, Professor at Tsinghua University, and Honorary Director of the Institute for Advanced Study at Tsinghua University, passed away in Beijing due to illness…

Program Counter أعاد

Ruilong Li

@ruilong_li

١٧ أكتوبرم

Checkout our latest work on Gaussian Splatting for LiDAR with 3DGUT!

Haithem Turki

@Haithem_Turki

١٧ أكتوبرم

[1/N] Excited to introduce "SimULi: Real-Time LiDAR and Camera Simulation with Unscented Transforms." We extend 3DGUT with LiDAR support and render a wide range of sensors 10-20x faster than ray tracing and 1.5-10x faster than prior rasterization work. research.nvidia.com/labs/sil/proje…