An exciting new approach to continual learning, using nested optimization to enhance long-context processing.
Introducing Nested Learning: A new ML paradigm for continual learning that views models as nested optimization problems to enhance long context processing. Our proof-of-concept model, Hope, shows improved performance in language modeling. Learn more: goo.gle/47LJrzI…
Excited to share our new paper, "DataRater: Meta-Learned Dataset Curation"! We explore a fundamental question: How can we *automatically* learn which data is most valuable for training foundation models? Paper: arxiv.org/pdf/2505.17895 to appear @NeurIPSConf Thread 👇
Our TPUs are headed to space! Inspired by our history of moonshots, from quantum computing to autonomous driving, Project Suncatcher is exploring how we could one day build scalable ML compute systems in space, harnessing more of the sun’s power (which emits more power than 100…
A large solar-powered AI satellite constellation would be able to prevent global warming by making tiny adjustments to how much solar energy reaches Earth
To push self-driving into situations wilder than reality, we built a neural network world simulator that can create entirely synthetic worlds for the Tesla to drive in. Video below is fully generated & not a real video
My pleasure to come on Dwarkesh last week, I thought the questions and conversation were really good. I re-watched the pod just now too. First of all, yes I know, and I'm sorry that I speak so fast :). It's to my detriment because sometimes my speaking thread out-executes my…
The @karpathy interview
0:00:00 – AGI is still a decade away
0:30:33 – LLM cognitive deficits
0:40:53 – RL is terrible
0:50:26 – How do humans learn?
1:07:13 – AGI will blend into 2% GDP growth
1:18:24 – ASI
1:33:38 – Evolution of intelligence & culture
1:43:43 – Why self…
Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,…
Introducing Tinker: a flexible API for fine-tuning language models. Write training loops in Python on your laptop; we'll run them on distributed GPUs. Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!…
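(The tweet doesn't show Tinker's actual API, so the names below — FineTuneClient, submit_step, and the placeholder model string — are purely hypothetical. This is just a sketch of the workflow shape being described: an ordinary Python training loop written locally, with each step handed off to a service that runs it on distributed GPUs.)

```python
# Hypothetical sketch only: these names are NOT Tinker's real API, just an
# illustration of "write the loop on your laptop, run it on remote GPUs".
from dataclasses import dataclass


@dataclass
class FineTuneClient:
    """Stand-in for a service that executes training steps on distributed GPUs."""
    model: str

    def submit_step(self, batch: list[dict], lr: float) -> float:
        # A real service would ship the batch to remote workers, run
        # forward/backward there, apply the optimizer update, and return the loss.
        return 0.0  # placeholder loss


client = FineTuneClient(model="some-open-weights-model")

# The training loop itself is plain Python you could run from a laptop.
dataset = [{"prompt": "2+2=", "completion": "4"}] * 8
for epoch in range(3):
    for i in range(0, len(dataset), 4):
        loss = client.submit_step(dataset[i:i + 4], lr=1e-5)
        print(f"epoch {epoch} step {i // 4} loss {loss:.4f}")
```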
Starting today, we’re introducing a major new update to AI Mode in Search, making visual exploration more natural than ever before. Now you can ask a question conversationally and get a range of visual results, with the ability to continuously refine your search in more natural…
Excited to launch Sora 2! Video models have come a long way; this is a tremendous research achievement. Sora is also the most fun I've had with a new product in a long time. The iOS app is available in the App Store in the US and Canada; we will expand quickly.
New on the Anthropic Engineering Blog: Most developers have heard of prompt engineering. But to get the most out of AI agents, you need context engineering. We explain how it works: anthropic.com/engineering/ef…
LoRA makes fine-tuning more accessible, but it's unclear how it compares to full fine-tuning. We find that its performance often closely matches full fine-tuning, more often than you might expect. In our latest Connectionism post, we share our experimental results and recommendations for LoRA.…
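(Not the post's experimental setup — just a minimal sketch of the low-rank idea LoRA rests on: freeze the pretrained weight W and learn a rank-r update BA, so the effective weight is W + (alpha/r)·BA and only A and B are trained. Layer name and hyperparameters are illustrative.)

```python
import torch
import torch.nn as nn


class LoRALinear(nn.Module):
    """A frozen linear layer plus a trainable low-rank update: W + (alpha/r) * B @ A."""

    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():  # freeze the pretrained weights
            p.requires_grad = False
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)  # rank-r factor
        self.B = nn.Parameter(torch.zeros(base.out_features, r))        # zero init: update starts at 0
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)


# usage: wrap an existing projection and train only A and B
layer = LoRALinear(nn.Linear(768, 768), r=8)
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(trainable)  # far fewer trainable parameters than full fine-tuning
```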
Introducing Claude Sonnet 4.5—the best coding model in the world. It's the strongest model for building complex agents. It's the best model at using computers. And it shows substantial gains on tests of reasoning and math.
Parents, guardians, educators and students can now find our latest AI resources and tools in the new AI Literacy hub, from insightful podcasts and Googler-led courses to how-to videos. Learn more → goo.gle/47v7yEq
in a normal world i'd make this a blogpost but i'm quite busy with tmr's ✨big announcement✨. so basically, here are the 4 valid scenarios in the local models dream: most big lab people believe 1, most investors/biz people believe 2, huggingface et al believe either 3 or 4, and…
suspect we're a few months (if not weeks!) away from a model that's good enough for tool calling / small enough to be shipped on devices/OSes by default. these already exist in research space, but hitting mainstream/consumer space is when it'll really take off. apps will be…
In the era of pretraining, what mattered was internet text. You'd primarily want a large, diverse, high-quality collection of internet documents to learn from. In the era of supervised finetuning, it was conversations. Contract workers are hired to create answers for questions, a bit…
Introducing the Environments Hub
RL environments are the key bottleneck to the next wave of AI progress, but big labs are locking them down
We built a community platform for crowdsourcing open environments, so anyone can contribute to open-source AGI
+1 for "context engineering" over "prompt engineering". People associate prompts with short task descriptions you'd give an LLM in your day-to-day use. When in every industrial-strength LLM app, context engineering is the delicate art and science of filling the context window…
I really like the term “context engineering” over prompt engineering. It describes the core skill better: the art of providing all the context for the task to be plausibly solvable by the LLM.
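(A toy sketch, not from either tweet, of what "filling the context window" can look like in practice: assembling system instructions, retrieved documents, and recent conversation history under a fixed token budget. The whitespace "tokenizer" and all names are placeholders, not any particular framework.)

```python
def num_tokens(text: str) -> int:
    return len(text.split())  # crude stand-in for a real tokenizer


def build_context(system: str, docs: list[str], history: list[str],
                  question: str, budget: int = 2000) -> str:
    """Pack as much useful context as fits under the token budget."""
    parts = [system]
    used = num_tokens(system) + num_tokens(question)
    for chunk in docs + history[-4:]:  # retrieved docs plus the most recent turns
        cost = num_tokens(chunk)
        if used + cost > budget:
            continue  # drop what doesn't fit rather than truncating mid-thought
        parts.append(chunk)
        used += cost
    parts.append(question)
    return "\n\n".join(parts)


context = build_context(
    system="You are a careful coding assistant.",
    docs=["[retrieved] API reference for the payments module ..."],
    history=["user: how do I retry failed charges?",
             "assistant: use the retry queue ..."],
    question="user: and how do I cap the number of retries?",
)
print(context)
```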
From my friend @vinayramasesh: "I would benefit most from an explanation style in which you frequently pause to confirm, via asking me test questions, that I've understood your explanations so far. Particularly helpful are test questions related to simple, explicit examples.…
We're missing (at least one) major paradigm for LLM learning. Not sure what to call it, possibly it has a name - system prompt learning? Pretraining is for knowledge. Finetuning (SL/RL) is for habitual behavior. Both of these involve a change in parameters but a lot of human…