내가 좋아할 만한 콘텐츠
I spent months illustrating how Transformers actually work. Not just what they do, but why they’re built this way. The history, design choices, and intuition behind every layer. From RNNs → Attention → Multi-Head → FFNs → Positional Encoding. Here's everything I wish I…
Did I cross from madness to genius or vice versa? Only one way to find out, check out the latest @CogRev_Podcast with @labenz to find out!
Just watching Emad Mostaque's latest interview on the Cognitive Revolution podcast There's a fine line between madness and genius and Emad has probably overstepped it ... a lot
Google introduces Test-Time Diffusion Deep Researcher Don't sleep on diffusion models. Test-Time Diffusion Deep Researcher (TTD-DR) is a deep research agent that models research writing as a diffusion process. Instead of static reasoning or bolted-on tools, the system drafts…
Thanks @friedberg & the @theallinpod crew for inviting me to the summit, it was a blast! Fun conversation discussing everything from the limitations of today's AI systems to the latest world models like Genie 3 and their key potential role in robotics - enjoy!
Google DeepMind CEO Demis Hassabis on AI, Creativity, and a Golden Age of Science (0:00) Introducing Sir Demis Hassabis, reflecting on his Nobel Prize win (2:39) What is Google DeepMind? How does it interact with Google and Alphabet? (4:01) Genie 3 world model (9:21) State of…
The most important AI paper of 2025 might have just dropped. NVIDIA lays out a framework for Small Language Model agents that could outcompete LLMs. Here’s the full breakdown (and why it matters):
With Cursor/Lovable/Claude Code, the cost of software creation is approaching zero. I think this will fundamentally change software business models over the next decade. Software itself will be less differentiated, meaning differentiation will have to come elsewhere; in many…
Though Greece may have birthed Western philosophy, Rome revitalized it. Here are the 10 greatest Roman philosophers that any true fan of Rome needs to know🧵
12 Foundational AI Model Types ▪️ LLM ▪️ SLM ▪️ VLM ▪️ MLLM ▪️ LAM ▪️ LRM ▪️ MoE ▪️ SSM ▪️ RNN ▪️ CNN ▪️ SAM ▪️ LNN Save the list and check this out for explanations and links to the useful resources: huggingface.co/posts/Kseniase…
if you want to learn RL but focused on LLMs, this is the free book to read: RLHF by @natolambert > RHLF training and fine-tuning > ml, nlp, rl background > preference data > reward modeling > policy gradient algorithms it’s an easy read, check it out.
5 huge AI agents, MCP, and LLM updates today: 1. China just dropped an opensource Deep Research multi-agent framework with MCP support. It connects AI Agents with specialized tools for tasks like web search, crawling, and Python code execution. 100% opensource.
Reasoning LLMs Guide Here is my practical guide to building with Reasoning LLMs. Lots of dev tips in it. It covers: - What are Reasoning LLMs? - Top Reasoning Models - Reasoning Model Design Patterns & Use Cases - Reasoning LLM Usage Tips - Limitations with Reasoning Models
Massive release here! First, MCP. Then, A2A. Now, we have a new AI protocol. AG-UI is the Agent-User Interaction Protocol. This is a protocol for building user-facing AI agents. It's a bridge between a backend AI agent and a full-stack application. Up to this point, most…
MCP vs API: Simplifying AI Agent Integration with External Data youtu.be/7j1t3UZA1TY?si… 来自 @YouTube
youtube.com
YouTube
MCP vs API: Simplifying AI Agent Integration with External Data
it's quite easy to calculate VRAM requirements for a large language model
You might wonder how models like @OpenAI o3 are trained The key behind them is "multi-turn reinforcement learning". It means that the model is trained to perform reasoning steps, call functions, get function call results, do more reasoning, call more functions etc. all in one go
Diffusion models generalize *really* well: if you give them a million pictures of cats, they'll learn to generate reasonable-looking cats no one's ever seen before. But the weird thing is that no one knows why they work! In a theory paper accepted to #ICLR2025, I dug into this.
Google announced LLMs are Greedy Agents on Hugging Face Effects of RL Fine-tuning on Decision-Making Abilities
High school-level Math Problems with Solutions 1/21
🇨🇳🇮🇳 Why China builds and India brags - the real story behind all those Indian CEOs. 🧵 1/ Indian netizens love to boast: “We have CEOs at Google, Microsoft, IBM, Adobe, Pepsi…” China? “Only has factories and copycats.” But let’s break this down. Who’s really winning in the…
New blog post: Questions about the future of AI A 6,000-word clusterfuck of considerations about economics, history, training, investment, and more. Thread of select questions below:
United States 트렌드
- 1. Raiders 82.6K posts
- 2. #WWERaw 176K posts
- 3. Cowboys 52.2K posts
- 4. Pickens 21.6K posts
- 5. #Dragula N/A
- 6. #WickedForGood 8,540 posts
- 7. #GMMTV2026 291K posts
- 8. Gunther 22.2K posts
- 9. Geno 15.9K posts
- 10. Chip Kelly 2,434 posts
- 11. Jeanty 7,133 posts
- 12. Grok 4.1 1,424 posts
- 13. Sigourney N/A
- 14. Pete Carroll 3,611 posts
- 15. Jlexis 8,636 posts
- 16. Roman 75.8K posts
- 17. Quiet Piggy 3,282 posts
- 18. Mark Davis 1,552 posts
- 19. Becky 55.3K posts
- 20. Dolph 44K posts
내가 좋아할 만한 콘텐츠
-
Crypto 豆 🌱
@DeFi_Bean -
0xSamoⓂ️Ⓜ️T
@BirkSamo -
万物岛ThreeDAO
@ThreeDAOspace -
observerdq 🦇🔊
@observerdq -
Web3Buidler.Tech
@Web3BuidlerTech -
Jademont
@shanshan521 -
ViNc | Pendle Sensei 🌸
@ViNc2453 -
HAP
@ethHap -
darkforest
@darkforesttri -
CosmosMan宇宙侠 😈
@iCosmosMan -
Evie | JE Labs🦄🍀
@0xEvieYang -
LukeStarlord
@luckyekinevil -
R
@RR_hodl -
Jiawei
@0xjiawei -
CRYPTOHUBKE
@CryptoHubKE
Something went wrong.
Something went wrong.