Dev Nag

@devnag

Founder/CEO, @TryQueryPal. Previously Founder/CTO, @WavefrontHQ (funded by @Sequoia, acq by @vmware). Oregon-born, universe-raised

Pinned

The Periodic Table of Machine Learning (Part 1 in a series): medium.com/@devnag/what-s…

Dev Nag reposted

(1/n) Since its publication in 2017, PPO has essentially become synonymous with RL. Today, we are excited to provide you with a better alternative - EPO.


Dev Nag reposted

lowkey i think ilya 30u30 needs an upgrade to 50u50 now
btw if you go through all of these, you’ll know at least 70% of what matters today

Dev Nag reposted

In this thread I'll record some brief impressions from trying to use o3/o4-mini (the new OpenAI models) for mathematical tasks.


Dev Nag reposted

The financial services industry is uniquely positioned to advocate for collaborative AI leadership, given its vested interest in trust, transparency and global cooperation, writes @DevNag of @TryQueryPal, in @AmerBanker @BankThink. bit.ly/4jhAgvE


Dev Nag reposted

pretty mind-blowing fact I just learned about transformer language models:
the positional embeddings don't really do anything. you can just get rid of them and the model still works just as well
sounds impossible, doesn't it?
turns out standard LLMs aren't actually…

Dev Nag reposted

Simply, no. I've been looking at my old results from doing RL with "verifiable" rewards (math puzzle games, python code to pass unit tests) starting from 2019 with GPT-1/2 to 2024 with Qwen Math
Deepseek's success likely lies in the base models improving, the RL is constant

Is it feasible to do a true tabula rasa version of deepseek R1 zero, starting from an LLM with random weights, similar to alpha zero? Or is starting with an LLM which is pre trained on math required?



Dev Nag reposted

Another key reason people are spooked: around 2016ish we started seeing the *insane* power of purely self-improving Reinforcement Learning (RL) (think AlphaZero going from no knowledge to superhuman at chess in hours), and it was formative for a lot of folks, in terms of their…


Dev Nag reposted

The bandwidth of this single chip is the entire internet’s traffic??? WHAT
We are nowhere close to seeing the top of intelligent systems

Jensen Huang shows off the NVIDIA GB200 NVL72: a data center superchip with 72 Blackwell GPUs, 1.4 exaFLOPS of compute and 130 trillion transistors



Dev Nag reposted

Ladies, has your Christmas vacation been ruined by the Deepseek launch? You may be entitled to compensation.


Dev Nag reposted

It's a bit sad and confusing that LLMs ("Large Language Models") have little to do with language; It's just historical. They are highly general purpose technology for statistical modeling of token streams. A better name would be Autoregressive Transformers or something. They…


Dev Nag reposted

the cathedrals in code and hardware are invisible to people. those who deeply understand that know the wonder that they hold in their hands to post this tweet instead of a slab of glass and metal.

Why did humans stop building wonders?

Dev Nag reposted

Timing.

You know what the secret to comedy is?



Dev Nag reposted

Left: Hierarchical model based RL with a large-scale pre-trained world model, auxiliary tasks and skill-discovery and a model for inverse kinematics. Right: PID

Dev Nag reposted

thinking: as execution of ideas gets easier (thx to agents, APIs for every imaginable service, etc), ideas become more of the differentiator. good ideas aren’t derived solely from logic or patterns of the past, they’re also the exhaust of human experiences and traumas, mistakes…


Dev Nag reposted

🌟 Excited to be featured in the AI Tool Spotlight by @EverydayAI_! Check out the feature here: read.youreverydayai.com/p/sam-altman-c…. 🚀 Follow us for the latest updates and install QueryPal for FREE today at querypal.com!


Dev Nag reposted

🎉 Exciting News!! We're thrilled to announce that QueryPal clinched second place as Product of the Day on @ProductHunt and soared to the top as the #1 SaaS Product! Plus, we were featured in the Product Hunt newsletter, spreading our reach even further. A heartfelt THANK YOU to…

Dev Nag reposted

The last time Dev worked at a Pal company, it was PayPal 👀

Human attention is the scarcest and most valuable resource in the world. This is why we built QueryPal, launching today on Product Hunt (t.ly/6YXfm). It's an AI chatbot that automatically answers all those incoming repetitive questions at work. (1/10)

