Siddharth Goyal
@siddharth22dev
Code @RubrikInc, prev @Innovaccer @owasp
1/ Since @JayaGup10 and I wrote about context graphs, the response has been huge...but I've also noticed a few misconceptions worth addressing. The tldr: context graphs aren't a graph database or structured memory. They require a fundamentally different approach to schema and…
Ok this is sweet. You can now request multiple coding agents to build an app in parallel and compare them head to head. Complete with an integrated IDE for live development. 🔥🔥🔥
Shoutout to @xai team for hustling at 2:00 am to help bring this over the finish line Introducing Agent Runner: the first open-source agent harness run with real users to create a live benchmark of real-world coding We trace tool-calls, reprompting, and multifile edits,…
Leaving Meta and PyTorch I'm stepping down from PyTorch and leaving Meta on November 17th. tl;dr: Didn't want to be doing PyTorch forever, seemed like the perfect time to transition right after I got back from a long leave and the project built itself around me. Eleven years…
If @PriyankKharge thinks these are “huge” subsidies - then Karnataka is in for a tough ride. Bengaluru’s municipal governance is trash like Mumbai. But worse - their state leaders don’t know how to drive investment. Chandrababu Naidu will gobble them up in a contest.
"Andhra Pradesh Govt is giving huge subsidies to Google for its Visakhapatnam Data Centre, like 25% land, free water and electricity. Can any state afford it?" - Karnataka Minister Priyank Kharge
They thought the war was over. They were wrong. The hunt for the final fragments begins. The world shifts with DQN. ▶️⏺️⏸️ DQN anime premieres in 2026. Produced by @morphic.
Namma Bengaluru has the best talent and the best weather but the worst infrastructure - if we fix garbage debris and roads, we can be among the best cities in the world. GBA has a great opportunity to do this. Let’s use collective will to do this @DKShivakumar @BBMPCOMM
Einstein wasted the second half of his life on a fruitless quest. In the second half of his life, von Neumann invented game theory, computer architecture, implosion nuclear weapons, cellular automata and weather prediction, among other things.
Daniel clearly hasn't seen the language balls.
why does everyone pick sides with programming languages, and then refuse to use anything else who cares, they're basically all the same, just marginally different syntax
This is my lecture from 2 months ago at @Cornell “How do I increase my output?” One natural answer is "I will just work a few more hours." Working longer can help, but eventually you hit a physical limit. A better question is, “How do I increase my output without increasing…
Dopamine from information gathering is a dangerous drug. Your entire life will change the moment you stop looking for more information and start acting on the information you already have. Always get your dopamine from action.
This explains why LLaMA 4 failed. The tokens per parameter (TPP) is way off. You can’t defy scaling laws and expect miracles. === Llama 4 Maverick was 400B(17B active) and >30T tokens, TPP = 1764 Llama 4 Behemoth was 2T(288B active) and > 30T tokens, TPP = 104 DeepSeek v3 is…
Llama 4 Maverick was 400B(70B active) and >30T tokens = 429 tokens / active param Llama 4 Behemoth was 2T(288B active) and > 30T tokens = 104 tokens / active param DeepSeek v3 is 671B(37B active) and 14.8T tokens = 400 tokens / active param Kimi K2 is 1T(32B active) and…
We got a call from @xai 24 hours ago “We want to test Grok 4 on ARC-AGI” We heard the rumors. We knew it would be good. We didn’t know it would become the #1 public model on ARC-AGI Here’s the testing story and what the results mean: Yesterday, we chatted with Jimmy from the…
Just opened a PR yesterday that will reduce the binary size PyTorch by 40% by adding 1 flag to NVCC With ~50M monthly of downloads of Pytorch, this one change will reduce global internet traffic by ~20PB. High impact changes like this is why I love OSS. github.com/pytorch/pytorc…
The best open-source reasoning model will be dropped next Thursday if everything goes well. OpenAI hasn't open-sourced an LLM since GPT-2 in 2019, so I'm excited. We’re hosting it on Hyperbolic. Buckle up.
🔍 SEAL and Red Team at @scale_ai present a position paper outlining what we’ve learned from red teaming LLMs so far—what matters, what’s missing, and how model safety fits into broader system safety and monitoring. 🔗 scale.com/research/red_t… 📝 scale.com/blog/rethink-r…
🧵 (1/6) Bringing together diverse mindsets – from in-the-trenches red teamers to ML & policy researchers, we write a position paper arguing crucial research priorities for red teaming frontier models, followed by a roadmap towards system-level safety, AI monitoring, and…
there’s a palpable tension in the air as hundreds of AI researchers (including me!) quietly work nights and weekends trying to figure out the “right way” to scale RL math & code are not the universe we will not rest until post-training is as clean and elegant as pre-training
my favorite version of the finetuning argument is Smolin’s theory that universes “reproduce” via black holes, and the conditions that are optimal for black hole production also happen to be near optimal for creating life unclear whether true but it’s a fun idea
our universe is pretty rare in configuration space - if strong nuclear force were: * 1% weaker - stars wouldnt make much carbon preventing carbon based life & less heavy element production would delay planet formation & take longer for evolution to occur before stars die * 1%…
Good treatment of the Anthropic Principle / fine tuning here: bretthall.org/an-anthropic-u… @ToKTeacher
our universe is pretty rare in configuration space - if strong nuclear force were: * 1% weaker - stars wouldnt make much carbon preventing carbon based life & less heavy element production would delay planet formation & take longer for evolution to occur before stars die * 1%…
What if I told you that Jane Street made ₹36,500 crores from Indian markets in just 2 years, and ₹4,800 crores of that was allegedly through market manipulation? They turned India's stock market into their personal ATM using a strategy so clever. Here's the complete details 🧵
United States Trends
- 1. #WWENXT N/A
- 2. Purdue N/A
- 3. Nebraska N/A
- 4. Jai Lucas N/A
- 5. Real ID N/A
- 6. SWAT N/A
- 7. Courtney N/A
- 8. RINO N/A
- 9. Hubert N/A
- 10. Jackson Drake N/A
- 11. Jaida Parker N/A
- 12. Kurt Cobain N/A
- 13. Baylor N/A
- 14. Alaska N/A
- 15. Rob Wright N/A
- 16. #RingRoyale N/A
- 17. Cluff N/A
- 18. Murkowski N/A
- 19. Collins N/A
- 20. Matt Painter N/A
Something went wrong.
Something went wrong.