
Burny - Effective Curiosity

@burny_tech

On the quest to understand the fundamental mathematics of intelligence and of the universe with curiosity. http://burnyverse.com Upskilling @StanfordOnline

Pinned

Hey! Follow me for explorations of intelligence, mathematics, science, engineering, technology, artificial intelligence, machine learning, physics, computer science, (not only computational) neuroscience, cognitive science, transhumanism, AI engineering, AI's benefits, risks,…


Burny - Effective Curiosity reposted

Hi new followers! I’m a mathematician at Harvard. I have a YouTube channel, where I discuss math the way I think about it, now with two playlists on differential geometry and complex geometry (in progress). Comments welcome!
DG: youtube.com/watch?v=rVTN7V…
CG: youtube.com/watch?v=N5FQHg…


Burny - Effective Curiosity reposted

An interesting property of ARC 3 is that it is more accessible to children than ARC 1 & 2, while being much more difficult for current AI systems


Burny - Effective Curiosity reposted

2024 evals
can it count letters 🥺
can it do college stuff 🤓
are its solutions diverse 👉👈

2025 evals
has it worked for 30 hours yet 🦾
has it increased gdp 📈
has it discovered novel math 🧮


Burny - Effective Curiosity reposted

/Humanitys-Last-Exam/
├─ Humanity_Last_Exam.docx
├─ Humanity_Last_Exam_final.docx
├─ Humanity_Last_Exam_FINAL.docx
├─ Humanity_Last_Exam_FINAL_FINAL.docx
├─ Humanity_Last_Exam_REAL_FINAL.docx
├─ Humanity_Last_Exam_REAL_FINAL_v2.docx
├─…


Burny - Effective Curiosity reposted

We recently wrote that GPT-5 is likely the first mainline GPT release to be trained on less compute than its predecessor. How did we reach this conclusion, and what do we actually know about how GPT-5 was trained? 🧵


Burny - Effective Curiosity reposted

sota on arc-agi-1 and -2, sota on artificial intelligence (composite), blows everything (including 4.5 sonnet) away on METR task length...

it might not be forever, and it might not be for your use case, but gpt-5 is the world's best overall model; any other claim is cope


Burny - Effective Curiosity reposted

A calculation by the late physicist Freeman Dyson suggested that no plausible experiment could be conducted to confirm the existence of gravitons, the hypothetical particles of gravity. A new proposal overturns that conventional wisdom. quantamagazine.org/it-might-be-po…


Burny - Effective Curiosity reposted

One AI year is seven Internet ones.


Burny - Effective Curiosity reposted

Official METR results for Claude 4.5 Sonnet

it doesn't beat GPT-5
at the 80% success-rate threshold it is even below o3, Opus 4 and Opus 4.1


We estimate that Claude Sonnet 4.5 has a 50%-time-horizon of around 1 hr 53 min (95% confidence interval of 50 to 235 minutes) on our agentic multi-step software engineering tasks. This estimate is lower than the current highest time-horizon point estimate of around 2 hr 15 min.



Burny - Effective Curiosity reposted

We estimate that Claude Sonnet 4.5 has a 50%-time-horizon of around 1 hr 53 min (95% confidence interval of 50 to 235 minutes) on our agentic multi-step software engineering tasks. This estimate is lower than the current highest time-horizon point estimate of around 2 hr 15 min.

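For context on where a figure like 1 hr 53 min comes from: METR estimates a model's 50% time horizon by fitting a logistic curve of task success probability against the human time each task takes, then solving for the task duration at which predicted success is 50%. A minimal sketch of that calculation with made-up task data and a hand-rolled fit (illustrative only, not METR's pipeline):

```python
# Hedged sketch: METR-style 50% time horizon. Fit logistic regression of
# task success against log(human task duration), then solve for the
# duration where predicted success is 50%. All data below is made up.
import numpy as np

# (human minutes to complete the task, did the model succeed 0/1)
tasks = [(2, 1), (5, 1), (15, 1), (30, 1), (60, 1), (90, 0),
         (120, 1), (180, 0), (240, 0), (480, 0)]
x = np.log([t for t, _ in tasks])            # log task duration
y = np.array([s for _, s in tasks], float)   # success indicator

w, b = 0.0, 0.0
for _ in range(20000):                       # plain gradient descent on log-loss
    p = 1 / (1 + np.exp(-(w * x + b)))
    w -= 0.1 * ((p - y) * x).mean()
    b -= 0.1 * (p - y).mean()

# p = 0.5 exactly when w*log(t) + b = 0, i.e. t = exp(-b/w)
print(f"50% time horizon ≈ {np.exp(-b / w):.0f} minutes")
```

The reported 95% confidence interval would come from resampling over tasks (e.g. bootstrapping), which this sketch omits.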


2.5 years of AI progress
Modelscope (left)
Grok Imagine 0.9 (right)



Burny - Effective Curiosity reposted

New ARC-AGI SOTA: GPT-5 Pro
- ARC-AGI-1: 70.2%, $4.78/task
- ARC-AGI-2: 18.3%, $7.41/task

@OpenAI’s GPT-5 Pro now holds the highest verified frontier LLM score on ARC-AGI’s Semi-Private benchmark


Didn't expect such a jump over the non-Pro version.

New ARC-AGI SOTA: GPT-5 Pro
- ARC-AGI-1: 70.2%, $4.78/task
- ARC-AGI-2: 18.3%, $7.41/task

@OpenAI’s GPT-5 Pro now holds the highest verified frontier LLM score on ARC-AGI’s Semi-Private benchmark



Burny - Effective Curiosity reposted

A group of physicists say they know the entropy of what is causing gravity. youtube.com/watch?v=qNt2bh…

YouTube: This Paper Might Change How We See Gravity


Burny - Effective Curiosity reposted

Yann LeCun's team is continuously advancing JEPA. Their new study reveals that the anti-collapse term in Joint Embedding Predictive Architectures (JEPAs) does more than just prevent trivial representations — it implicitly estimates data density. This means any trained JEPA…


Burny - Effective Curiosity reposted

Evolution Strategies can be applied at scale to fine-tune LLMs, and outperforms PPO and GRPO in many model settings! Fantastic paper “Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning” by @yule_gan, Risto Miikkulainen and team. arxiv.org/abs/2509.24372

Reinforcement Learning (RL) has long been the dominant method for fine-tuning, powering many state-of-the-art LLMs. Methods like PPO and GRPO explore in action space. But can we instead explore directly in parameter space? YES we can. We propose a scalable framework for…
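The core loop behind "exploring directly in parameter space" is short. Below is a minimal Evolution Strategies sketch using the standard population-based gradient estimator; the toy reward function and all sizes are illustrative stand-ins for an episode-level LLM reward, not the paper's actual setup:

```python
# Hedged sketch of a basic Evolution Strategies loop: sample Gaussian
# perturbations of the parameters, weight them by normalized reward, and
# step the mean parameters. A toy quadratic stands in for an LLM reward.
import numpy as np

rng = np.random.default_rng(0)
theta = rng.normal(size=10)            # stand-in for model parameters
sigma, lr, pop = 0.1, 0.02, 64         # noise scale, step size, population

def reward(params):
    # Hypothetical reward: higher as params approach an arbitrary target.
    return -np.sum((params - 1.0) ** 2)

for step in range(300):
    eps = rng.normal(size=(pop, theta.size))
    rewards = np.array([reward(theta + sigma * e) for e in eps])
    adv = (rewards - rewards.mean()) / (rewards.std() + 1e-8)
    theta += lr / (pop * sigma) * eps.T @ adv   # ES gradient estimate

print("final reward:", reward(theta))
```

What makes this attractive for LLM fine-tuning is that the reward only needs forward passes: no backpropagation through generation, so the whole population can be evaluated with inference-only infrastructure.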



Burny - Effective Curiosity reposted

Congratulations to Michel Devoret, Google Quantum AI’s Chief Scientist of Quantum Hardware, who was awarded the 2025 Nobel Prize in Physics today. Google now has five Nobel laureates among our ranks, including three prizes in the past two years. goo.gle/46EwKaG


Burny - Effective Curiosity reposted

🧠How can LLMs self-evolve over time? They need memory. LLMs burn huge compute on each query and forget everything afterward. ArcMemo introduces abstraction memory, which stores reusable reasoning patterns and recombines them to strengthen compositional reasoning. 📈On…


ArcMemo yields +7.5% relative on ARC-AGI vs o4-mini (same backbone). It extends the LLM idea of “compressing knowledge for generalization” into a lightweight, continually learnable abstract memory—model-agnostic and text-based. Preprint: Lifelong LM Learning via Abstract Memory

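A rough sketch of the abstraction-memory idea as the two posts describe it: store reusable, text-form reasoning patterns across episodes, then retrieve the relevant ones into the prompt for a new task. The class name, the example patterns, and the keyword-overlap retrieval heuristic are all illustrative stand-ins, not ArcMemo's actual code:

```python
# Hedged sketch of an abstraction memory: reusable text-based reasoning
# patterns stored across episodes and retrieved by keyword overlap so they
# can be prepended to a new query. Illustrative only.
from dataclasses import dataclass, field

@dataclass
class AbstractionMemory:
    patterns: list = field(default_factory=list)

    def store(self, pattern: str) -> None:
        if pattern not in self.patterns:   # keep the memory deduplicated
            self.patterns.append(pattern)

    def retrieve(self, query: str, k: int = 3) -> list:
        q = set(query.lower().split())
        return sorted(self.patterns,
                      key=lambda p: -len(q & set(p.lower().split())))[:k]

    def compose_prompt(self, query: str) -> str:
        hints = "\n".join(f"- {p}" for p in self.retrieve(query))
        return f"Reusable patterns:\n{hints}\n\nTask:\n{query}"

mem = AbstractionMemory()
mem.store("if a shape repeats with one cell changed, the change is the rule")
mem.store("count objects per color before guessing the transformation")
print(mem.compose_prompt("grid puzzle: repeated shapes with color changes"))
```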


Burny - Effective Curiosity reposted

Impressive work.

My brain broke when I read this paper.

A tiny 7 million parameter model just beat DeepSeek-R1, Gemini 2.5 Pro, and o3-mini at reasoning on both ARC-AGI-1 and ARC-AGI-2.

It's called Tiny Recursive Model (TRM) from Samsung.

How can a model 10,000x smaller be smarter?

Here's how…

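A schematic of the recursive-refinement idea behind a tiny model punching above its weight: one small network is reused many times, alternating between updating a latent scratchpad and revising the current answer, so effective depth comes from recursion rather than parameter count. The shapes, step counts, and update rules below are illustrative guesses, not TRM's actual architecture:

```python
# Hedged sketch of recursive refinement: reuse one tiny network to
# repeatedly update a latent scratchpad z from (x, y, z), then revise the
# answer y from (y, z). Sizes and step counts are illustrative.
import numpy as np

rng = np.random.default_rng(0)
d = 32                                     # embedding width (illustrative)
W1 = rng.normal(0, 0.1, (3 * d, d))        # latent-update weights
W2 = rng.normal(0, 0.1, (2 * d, d))        # answer-update weights

def refine_latent(x, y, z):
    return np.tanh(np.concatenate([x, y, z]) @ W1)   # z <- f(x, y, z)

def refine_answer(y, z):
    return np.tanh(np.concatenate([y, z]) @ W2)      # y <- g(y, z)

x = rng.normal(size=d)                     # encoded puzzle input
y, z = np.zeros(d), np.zeros(d)            # initial answer and latent
for _ in range(6):                         # outer improvement steps
    for _ in range(4):                     # inner latent-reasoning steps
        z = refine_latent(x, y, z)
    y = refine_answer(y, z)                # commit an improved answer

print("answer embedding norm:", np.linalg.norm(y))
```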


Burny - Effective Curiosity reposted

the new ai benchmark next year will be "when you ask a model to make you a $1b arr saas, how much money actually shows up in your bank account"

