Michael

@Querisity

Working on DL and RL

Joined January 2025

992Posts 43Followers 155Following

Michael reposted

Sam Altman

@sama

Nov 14

Small-but-happy win: If you tell ChatGPT not to use em-dashes in your custom instructions, it finally does what it's supposed to do!

Michael reposted

We’ve developed a new way to train small AI models with internal mechanisms that are easier for humans to understand. Language models like the ones behind ChatGPT have complex, sometimes surprising structures, and we don’t yet fully understand how they work. This approach…

OpenAI's tweet card. We trained models to think in simpler, more traceable steps—so we can better understand how they work.

Understanding neural networks through sparse circuits

Source: openai.com

Michael

@Querisity

Nov 13

Windows is shit, and nobody on earth can even change that.

Pavan Davuluri

@pavandavuluri

Nov 10

Windows is evolving into an agentic OS, connecting devices, cloud, and AI to unlock intelligent productivity and secure work anywhere. Join us at #MSIgnite to see how frontier firms are transforming with Windows and what’s next for the platform. We can’t wait to show you!…

Michael reposted

Peter Richtarik

@peter_richtarik

Nov 13

I am an AC for ICLR 2026. One of the papers in my batch was just withdrawn. The authors wrote a brief response, explaining why the reviewers failed at their job. I agree with most of their comments. The authors gave up. They are fed up. Just like many of us. I understand. We…

Michael reposted

Julian Schrittwieser

@Mononofu

Nov 12

Very excited that our AlphaProof paper is finally out! It's the final thing I worked on at DeepMind, very satisfying to be able to share the full details now - very fun project and awesome team! julian.ac/blog/2025/11/1…

Michael

@Querisity

Nov 12

Definitely not with that horrible UX

Andrea Volpini

@cyberandy

Nov 10

Google is coming after n8n and similar platforms.

Michael

@Querisity

Nov 10

Nice jooke

Peyman Milanfar

@docmilanfar

Nov 10

I'm so sorry for your loss

Michael reposted

yifei e/λ (meetmeinshibuya nov 16)

@yifever

Nov 5

congrats to llama 3 large for winning the LLM trading contest by not participating

Michael

@Querisity

Nov 4

What a horrible term “user damaged”, while never mentioning the bad design of the product

VideoCardz.com

@VideoCardz

Nov 4

NVIDIA agrees to replace RTX 5090 FE with user-damaged PCIe connector videocardz.com/newz/nvidia-ag…

VideoCardz's tweet card. NVIDIA RTX 5090 Founders Edition will be replaced NorthridgeFix has an update on the Founders Edition RTX 5090 card that entered their service recently. As we reported, the card had a broken connec...

NVIDIA agrees to replace RTX 5090 FE with user-damaged PCIe connector - VideoCardz.com

Source: videocardz.com

Michael

@Querisity

Nov 4

AI done wrong

kepano

@kepano

Nov 3

When you email issues to Obsidian Entertainment (the video game company) their AI support hallucinates and tells you to email Obsidian (the note-taking company) instead. The perils of trusting an LLM with your customer support.

kepano's tweet image. When you email issues to Obsidian Entertainment (the video game company) their AI support hallucinates and tells you to email Obsidian (the note-taking company) instead.

The perils of trusting an LLM with your customer support.

Michael

@Querisity

Oct 31

Wow, fire up my training run this weekend

Dimitris Papailiopoulos

@DimitrisPapail

Oct 31

we're building sand castles, on top of sandcastles, on top of sandcastles

Michael reposted

MIT CSAIL

@MIT_CSAIL

Oct 30

How robots would celebrate Halloween at MIT's Stata Center, according to ChatGPT, Midjourney, & Gemini.

MIT_CSAIL's tweet image. How robots would celebrate Halloween at MIT's Stata Center, according to ChatGPT, Midjourney, &amp; Gemini.

Michael reposted

Jaana Dogan ヤナドガン

@rakyll

Oct 31

There is a reason why most large companies are doomed. They value complexity more than anything else. People fabricate complexity to get recognized.

Michael reposted

Keller Jordan

@kellerjordan0

Oct 31

TIL that Muon is in PyTorch stable now. Pretty cool.

Michael

@Querisity

Oct 31

What happened to chatgpt? Even with gpt-5 extended thinking, it just gives you answer right away without doing any kind of thinking or use any tools? Did they nuke the gpt-5 model again?

Michael reposted

Percy Liang

@percyliang

Oct 29

⛵Marin 32B Base (mantis) is done training! It is the best open-source base model (beating OLMo 2 32B Base) and it’s even close to the best comparably-sized open-weight base models, Gemma 3 27B PT and Qwen 2.5 32B Base. Ranking across 19 benchmarks:

percyliang's tweet image. ⛵Marin 32B Base (mantis) is done training! It is the best open-source base model (beating OLMo 2 32B Base) and it’s even close to the best comparably-sized open-weight base models, Gemma 3 27B PT and Qwen 2.5 32B Base. Ranking across 19 benchmarks:

Michael reposted

Alexia Jolicoeur-Martineau

@jm_alexia

Oct 30

Recordings for the talk on "Tiny Recursive Models" are up. youtu.be/ETukUNsn_wQ

Alexia Jolicoeur-Martineau

@jm_alexia

Oct 28

I will give a presentation on "Tiny Recursion Models" tomorrow at 1pm in the Mila Agora (6650 Saint-Urbain, Montreal). Its open to everyone, feel free to come by!

Michael

@Querisity

Oct 28

While this looks interesting, I’m still waiting for people to solve the chicken vs egg problem. One that do not require a teacher model

Thinking Machines

@thinkymachines

Oct 27

Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training it for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other…

thinkymachines's tweet image. Our latest post explores on-policy distillation, a training approach that unites the error-correcting relevance of RL with the reward density of SFT. When training it for math reasoning and as an internal chat assistant, we find that on-policy distillation can outperform other…

Michael reposted

Kyunghyun Cho

@kchonyc

Oct 27

wow

Michael reposted

Jorge Bravo Abad

@bravo_abad

Oct 24

Discovering state-of-the-art reinforcement learning algorithms Reinforcement learning agents usually learn with rules we program by hand (TD, Q-learning, PPO…). But humans didn’t hand-design our learning rules—evolution did. What if we let machines discover their own RL update…