You might like
We are looking for a post-training lead at @datologyai. We have GPUs, you can make them go brrrr

When I have multiple Anneals and some CPT runs
not your weights not your model not your mind
Am I wrong in sensing a paradigm shift in AI? Feels like we’re moving from a world obsessed with generalist LLM APIs to one where more and more companies are training, optimizing, and running their own models built on open source (especially smaller, specialized ones) Some…

Give it a year or two and we will have nanoagent. Then nanoagent speed run. Then nanoASI and so on.
Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,…

Still thinking about this banger
when you're the eval guy and you check up on the experiments channel in slack

HAHAHAHAHAHA IT'S A VIDEO FROM THE PORT OF LONG BEACH CALIFORNIA 🇺🇸🇺🇸🇺🇸🇺🇸🇺🇸
🇨🇳#China’s intelligent port operates with high efficiency At China’s smart port, autonomous transport vehicles move in an orderly and tireless manner.
Here’s a scaling law that matters more: every two years the gap to frontier closes by a factor of 2x
Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,…

I’m fairly convinced the first rabbit in a hat trick was an accident



Talk from Wenting Zhao of Qwen on their plans during COLM. Seems like 1 word is the plan still: scaling training up! Let’s go.

Doing some on the ground research on power (football) analysis
Was watching the Georgia game and noticed @dwarkesh_sp, Host of @dwarkeshpodcast was literally on TV 🤯🤯🤯🤯🤯
One day, @code_star will swap profile pics with @willccbb and the world will tremble.
One thing I wonder is, these GPUs are going to remain incredibly powerful and the next generation or two of GPUs are going to be rolled at scales we have never really seen before. Now obviously electricity isn't free, but if we find ourselves with much cheaper electricity in…
Quit hoggin'em all
One thing I wonder is, these GPUs are going to remain incredibly powerful and the next generation or two of GPUs are going to be rolled at scales we have never really seen before. Now obviously electricity isn't free, but if we find ourselves with much cheaper electricity in…
So in like 5 ish years ... what is going to happen to all the H100s ... landfill?