Dheer Patel

@dheerpatel1710

M.Sc student in Robotics, Cognition, Intelligence at @TU_Muenchen. Crazy about automobiles, AI, football and really heavy music.

Dheer Patel reposted

Bro that’s Rafael and Fabio Da Silva!

An 1800-year-old portrait of two brothers from Roman Egypt, c. 140.


Dheer Patel reposted

It is basically DAGGER but with a key difference: On-policy distill for LLMs typically works best with reverse KL (or Jeffreys which is half forward, half reverse) instead of typical "forward KL" in robotics / RL (log-likelihood or MSE loss using expert action). Interestingly,…

Very late to the party. When Thinking Machines Lab released their blog on On Policy Distillation, my first reaction was that it should be just like DAGGER from 15 years ago: arxiv.org/abs/1011.0686. I finally had time to read the blog today and sure enough, they mentioned DAGGER.…


Dheer Patel reposted

When GTA 5 released I was 14. Since then I’ve dropped out of school, gotten arrested, smoked crack, gotten arrested again, gone to rehab, started doing blow, got arrested again, got sober, got 1,000,000 subscribers, made a weed brand, crashed my car, made a podcast, crashed my…

Hi everyone, Grand Theft Auto VI will now release on Thursday, November 19, 2026. We are sorry for adding additional time to what we realize has been a long wait, but these extra months will allow us to finish the game with the level of polish you have come to expect and…


Dheer Patel reposted

Leaving Meta and PyTorch I'm stepping down from PyTorch and leaving Meta on November 17th. tl;dr: Didn't want to be doing PyTorch forever, seemed like the perfect time to transition right after I got back from a long leave and the project built itself around me. Eleven years…

Dheer Patel reposted

the last thing a model sees before being captured in a cuda graph

of course merged by the 500IQ Tsinghua GOAT himself


Dheer Patel reposted

Correct take.

Hot take: DAgger (Ross 2011) should be the first paper you read to get into RL, instead of Sutton's book. Maybe also read scheduled sampling (Bengio 2015). And before RL, study supervised learning thoroughly.


Dheer Patel reposted

🇩🇰🇪🇬 Men are simple creatures. Egyptian Foreign Minister Badr Abdelaty was gifted a LEGO set of the Great Pyramid of Giza by the Danish Foreign Minister during his visit for the Grand Egyptian Museum’s opening. Look how happy he is.

Dheer Patel reposted

CONVERT YOUR CODEBASES TO REAL NUMBERS, I REPEAT REAL NUMBERS



Dheer Patel reposted

PewDiePie in 2025:

– built a 10×4090 rig
– runs Llama 70B, gpt-oss-120B & Qwen 245B locally via vLLM
– built a custom web UI (chat, RAG, search, TTS)
– ran protein-folding simulations for charity
– created an AI “council”, a swarm of 64 models
– now fine-tuning his own model…

Dheer Patel reposted

p = 0.051

Apart from breakup, what else can make a man be like this?


Dheer Patel reposted

Finally Germany has a serious LB. This new Germany generation looks very promising, many insane 17-22 year olds. Germany will hit its prime when the 1995 generation is gone and the Musiala generation is like 27 years old

nathaniel brown vs bvb



Dheer Patel reposted

Guido van Rossum builds a python package for RAG

At #PyBay25, @gvanrossum demo'd a Python package for "structured RAG". During ingestion, it uses LLM to extract structured data (entities/topics/verbs) and stores in standard DB, and then retrieves by structuring the user query as well. Try it out at: github.com/microsoft/type…


Dheer Patel reposted

paper rejection

Apart from breakup, what else can make a man be like this?


Dheer Patel reposted

Chelsea LOSE
Liverpool LOSE
Man City LOSE
Man Utd WIN

What a beautiful weekend of football. 😂

Dheer Patel reposted

This is NOT normal.

Dheer Patel reposted

Must have been that unchanged team 😏


Dheer Patel reposted

Excited and honored to welcome @goodside to Google DeepMind and the AI Studio team as our first staff prompt engineer : )


Dheer Patel reposted

Nice, short post illustrating how simple text (discrete) diffusion can be. Diffusion (i.e. parallel, iterated denoising, top) is the pervasive generative paradigm in image/video, but autoregression (i.e. go left to right, bottom) is the dominant paradigm in text. For audio I've…

BERT is just a Single Text Diffusion Step! (1/n) When I first read about language diffusion models, I was surprised to find that their training objective was just a generalization of masked language modeling (MLM), something we’ve been doing since BERT from 2018. The first…



Dheer Patel reposted

study graph theory the entire universe runs on connections. neurons, roads, networks, friendships, computers, logistics, even ideas; all are graphs. graph theory is the math of relationships. it teaches you how things interact, not just what they are. you’ll start seeing…
