Mike Knoop

@mikeknoop

co-founder @ndea and @zapier @arcprize

sf bay area

mikeknoop.com

Joined July 2009

4KPosts 23KFollowers 347Following

You might like

@wadefoster

@jesslivingston

@Suhail

@zapier

@rahulvohra

@ttunguz

@vhmth

@briannekimmel

@bryanhelmig

@harjtaggar

@mikekarnj

@zck

@bryantchou

@ilyaf

@andreasklinger

Pinned

Mike Knoop

@mikeknoop

Jul 18

Today we’re releasing our first public preview of ARC-AGI-3: the first three games. Version 3 is a big upgrade over v1 and v2 which are designed to challenge pure deep learning and static reasoning. In contrast, v3 challenges interactive reasoning (eg. agents). The full version…

Mike Knoop

@mikeknoop

Nov 26

I've learned that building useful AI benchmarks has much in common with building useful products. You cannot design either in isolation. Both need strong contact with reality and iteration to make them great.

Mike Knoop

@mikeknoop

Nov 26

DNA is sometimes viewed as information compression. What minimal information is needed to create life/intelligence? But DNA is useless without a cell and its machinery. To fully describe life, you must describe both.

Mike Knoop reposted

will brown

@willccbb

Nov 25

come hang with me + @SalimansRobin next week to hear about using RL environments for evaluating + optimizing agents in prod at @zapier :) will be demoing some fun new features we've been collaborating on 👀

willccbb's tweet image. come hang with me + @SalimansRobin next week to hear about using RL environments for evaluating + optimizing agents in prod at @zapier :)

will be demoing some fun new features we've been collaborating on 👀

Mike Knoop

@mikeknoop

Nov 25

"New ideas needed" for AGI has been the headline on arcprize.org since June 2024

Mike Knoop

@mikeknoop

Nov 25

"scaling sucked out all the oxygen in the room, everyone converged to the same ideas" --> new ideas still needed!

Mike Knoop

@mikeknoop

Nov 25

"scaling sucked out all the oxygen in the room, everyone converged to the same ideas" --> new ideas still needed!

Lisan al Gaib

@scaling01

Nov 25

Ilya Sutskever: We are no longer in the age of scaling, we are back to the age of research

Mike Knoop

@mikeknoop

Nov 25

Really exciting to see! This is important work to assemble these scientific datasets. Also, one item close to my heart: > launch funding opportunities or prize competitions to incentivize private-sector participation in AI-driven scientific research

Dean W. Ball

@deanwball

Nov 24

Very excited to see this AI for Science Executive Order—the Genesis Mission. The Administration has appropriately ambitious goals here; we may be on the verge of world-changing breakthroughs. Congratulations to all involved!

deanwball's tweet image. Very excited to see this AI for Science Executive Order—the Genesis Mission. The Administration has appropriately ambitious goals here; we may be on the verge of world-changing breakthroughs. Congratulations to all involved!

Mike Knoop

@mikeknoop

Nov 24

For those studying AI reasoning systems, Opus' token efficiency scaling curves on ARC v1 and v2 are worth looking at. Very clean-looking results. Raw data is here: huggingface.co/arcprize

arcprize (ARC Prize Foundation)

Source: huggingface.co

ARC Prize

@arcprize

Nov 24

Opus 4.5 (Thinking, 64k) on ARC-AGI Semi-Private Eval - ARC-AGI-1: 80.00%, $1.47/task - ARC-AGI-2: 37.64%, $2.40/task New SOTA for released frontier models from @AnthropicAI

arcprize's tweet image. Opus 4.5 (Thinking, 64k) on ARC-AGI Semi-Private Eval

- ARC-AGI-1: 80.00%, $1.47/task
- ARC-AGI-2: 37.64%, $2.40/task

New SOTA for released frontier models from @AnthropicAI

Mike Knoop

@mikeknoop

Nov 24

Anthropic's new Claude 4.5 Opus (Thinking 64k) is on par with Gemini 3 Pro, released just 1 week ago! These are both very impressive new results. Intentional strategy? Timing coincidence? Or are there simply no secrets?

ARC Prize

@arcprize

Nov 24

Opus 4.5 (Thinking, 64k) on ARC-AGI Semi-Private Eval - ARC-AGI-1: 80.00%, $1.47/task - ARC-AGI-2: 37.64%, $2.40/task New SOTA for released frontier models from @AnthropicAI

Mike Knoop reposted

Mike Knoop

@mikeknoop

Nov 19

Grid size is definitely correlated. But so is solution program length (eg. kolmogorov complexity). This would be the basis of a good experiment -- disentangle these.

Mike Knoop

@mikeknoop

Nov 18

I'm live now chatting Gemini 3 ARC results!

TBPN

@tbpn

Nov 18

Good morning. On today’s show: – @mikeknoop (Ndea) – @JonnyNemo (Sweetgreen) – @ashleevance (Core Memory) – @jeremy_epling (Vanta) – @keoneHD (Monad) – @stephenbalaban (Lambda) See you on stream.

Mike Knoop

@mikeknoop

Nov 18

We just verified Gemini 3 Pro and Deep Think (Preview) are over 2X SOTA on ARC v2! This is really impressive and frankly a bit surprising. Impressive because many of the v2 solves indicate clear complexity scaling over v1. Such as tasks 65b59efc, e3721c99, and dd6b8c4b We’re…

ARC Prize

@arcprize

Nov 18

Gemini 3 models from @Google @GoogleDeepMind have made a significant 2X SOTA jump on ARC-AGI-2 (Semi-Private Eval) Gemini 3 Pro: 31.11%, $0.81/task Gemini 3 Deep Think (Preview): 45.14%, $77.16/task

arcprize's tweet image. Gemini 3 models from @Google @GoogleDeepMind have made a significant 2X SOTA jump on ARC-AGI-2 (Semi-Private Eval)

Gemini 3 Pro:
31.11%, $0.81/task

Gemini 3 Deep Think (Preview):
45.14%, $77.16/task

Mike Knoop

@mikeknoop

Nov 18

One strange thing is despite significant inference cost reductions, ARC v1 pareto frontier continues to mostly hold up. You'd naively expect frontier AI to use cheap inference to get much more reasoning search coverage.

Sam Altman

@sama

Nov 18

The rate reduction in price per unit of intelligence has been thing I've most consistently underestimated the past couple of years. 300x in a year is nuts!

Mike Knoop

@mikeknoop

Nov 10

To materially beat 2% GDP growth we need AI capable of innovation. Unlike automation, innovation inherently requires the ability to adapt to change. This is what ARC-AGI measures.

Steve Hou

@stevehou

Nov 7

Joking aside, here's my base case for thinking about AI's impact on GDP growth. I think we'll keep growing at 2%. Stuff like AI that comes once in a while how we managed to grow at 2%/y as we've done for millennia since the first industrial revolution. There’s no sense in which…

stevehou's tweet image. Joking aside, here's my base case for thinking about AI's impact on GDP growth. I think we'll keep growing at 2%. Stuff like AI that comes once in a while how we managed to grow at 2%/y as we've done for millennia since the first industrial revolution.

There’s no sense in which…

Mike Knoop

@mikeknoop

Nov 4

Wow! Unprecedented movement on the leaderboard over the last few days. ARC Prize 2025 is now closed. I'm looking forward to reviewing all the papers (still a few more days to submit those). We'll announce final results on December 5.

ARC Prize

@arcprize

Nov 4

ARC Prize 2025 - Submissions Closed! Thank you to the 1,495 teams that made 15,923 submissions Final results depend on open-source verification and private leaderboard standings Verified winners announced December 5, 2025

arcprize's tweet image. ARC Prize 2025 - Submissions Closed!

Thank you to the 1,495 teams that made 15,923 submissions

Final results depend on open-source verification and private leaderboard standings

Verified winners announced December 5, 2025

Mike Knoop reposted

François Chollet

@fchollet

Nov 2

One day left to submit to ARC Prize 2025 on Kaggle! Big changes at the top of the leaderboard these past few days, with the rise of teams NVARC and North Stars. Close contest between GiottoAI and ARChitects for the top spot. Keep in mind the final score will be evaluated on a…

ARC Prize

@arcprize

Nov 2

ARC Prize 2025 - 1 day left for Top Score submissions The leaderboard is heating up, over 1.4K teams participating Guaranteed prizes: - Top Score ($50K) - Highest private-set scores, Nov 3 - Paper Prize ($75K) - Best conceptual progress, Nov 9 Grand Prize locked till 85%

arcprize's tweet image. ARC Prize 2025 - 1 day left for Top Score submissions

The leaderboard is heating up, over 1.4K teams participating

Guaranteed prizes:
- Top Score ($50K) - Highest private-set scores, Nov 3
- Paper Prize ($75K) - Best conceptual progress, Nov 9

Grand Prize locked till 85%

Mike Knoop

@mikeknoop

Oct 30

At NeruIPS this year and interested in ARC? Come say hi to @fchollet and myself on Saturday night!

ARC Prize

@arcprize

Oct 30

NeurIPS Party - ARC Prize Foundation + Y Combinator Join us in San Diego for a NeurIPS party co-hosted with @ycombinator Meet ARC Prize and YC leadership, researchers and industry leaders pushing the boundaries of frontier AI San Diego 6-8 PM, December 6, 2025

arcprize's tweet image. NeurIPS Party - ARC Prize Foundation + Y Combinator

Join us in San Diego for a NeurIPS party co-hosted with @ycombinator

Meet ARC Prize and YC leadership, researchers and industry leaders pushing the boundaries of frontier AI

San Diego
6-8 PM, December 6, 2025

Mike Knoop

@mikeknoop

Oct 28

This is the final week for ARC Prize 2025! And the paper prize deadline is one week after close. Last year, there was a ton of action in the final days. Good luck to all teams!

ARC Prize

@arcprize

Oct 28

ARC Prize 2025 - 6 days go to Over 1.3K teams have submitted 13.9K entries Guaranteed prizes: - Paper Prize ($75K) - Awarded to the best conceptual progress - Top Score ($50K) - Awarded to the submissions with the highest private-set scores Winners announced Dec 5

arcprize's tweet image. ARC Prize 2025 - 6 days go to

Over 1.3K teams have submitted 13.9K entries

Guaranteed prizes:

- Paper Prize ($75K) - Awarded to the best conceptual progress
- Top Score ($50K) - Awarded to the submissions with the highest private-set scores

Winners announced Dec 5

Mike Knoop reposted

ARC Prize

@arcprize

Oct 24

.@fchollet + @mikeknoop fireside chat @ MIT Listen to ARC Prize Co-Foundres, Francois Chollet + Mike Knoop talk about ARC-AGI-3, game development, and measuring intelligence with Interactive Benchmarks youtu.be/1u2DkoqEfhk