Michael Hodel

@bayesilicon

writer (of programs)

Zurich, Switzerland

github.com/michaelhodel

Beigetreten im November 2021

102Posts 979Follower 699Folge ich

Was dir gefallen könnte

@jcb_31416

@laurolangosco

@taylorhowell

@CRSegerie

@an1lam

@Ocient

@KlausWohlrabe

@JohanWinn

@Henrymiller2012

@nkfd_deficiency

@jenner_erik

@MarkNicoll6

@geekblake

@RubenRabadan2

@abraham_airs

Michael Hodel

@bayesilicon

31.10.

Just made a repository with golfed solutions to ARC-AGI-1-train for the NeurIPS 2025 Google Code Golf Championship as well as some golfing tricks that I learned: github.com/michaelhodel/a…

bayesilicon's tweet card. Solutions to ARC Tasks for the Google Code Golf Championship - michaelhodel/arc-golfing

GitHub - michaelhodel/arc-golfing: Solutions to ARC Tasks for the Google Code Golf Championship

Quelle: github.com

Michael Hodel hat repostet

Keller Jordan

@kellerjordan0

12.10.

There's been significant recent progress in the NanoGPT speedrun. Highly recommend this post by @classiclarryd lesswrong.com/posts/j3gp8teb…

Michael Hodel hat repostet

📝 New research: AlphaWrite applies evolutionary algorithms to creative writing. Inspired by AlphaEvolve, we use iterative generation + Elo ranking to systematically improve story quality through inference-time compute scaling. Results: 72% preference over baseline generation…

tobyrsimonds's tweet image. 📝 New research: AlphaWrite applies evolutionary algorithms to creative writing.

Inspired by AlphaEvolve, we use iterative generation + Elo ranking to systematically improve story quality through inference-time compute scaling.

Results: 72% preference over baseline generation…

Michael Hodel hat repostet

Jack Cole

@MindsAI_Jack

13.05.

Excited to advance our lead and SoTA score on ARC-AGI-2 (@arcprize) by 3 points to 15.28. @DriesSmit1 @MohamedOsmanML @bayesilicon @GregKamradt @tufalabs kaggle.com/competitions/a…

Michael Hodel hat repostet

Dimitri von Rütte

@dvruette

10.03.

🚨 NEW PAPER DROP! Wouldn't it be nice if LLMs could spot and correct their own mistakes? And what if we could do so directly from pre-training, without any SFT or RL? We present a new class of discrete diffusion models, called GIDD, that are able to do just that: 🧵1/12

Michael Hodel hat repostet

Akira Yoshiyama ⁂

@yoshiyama_akira

06.03.

Happy to announce we outperformed @OpenAI o1 with a 7B model :) We released two self-improvement methods for verifiable domains in our preliminary paper -->

yoshiyama_akira's tweet image. Happy to announce we outperformed @OpenAI o1 with a 7B model :)

We released two self-improvement methods for verifiable domains in our preliminary paper --&gt;

Michael Hodel hat repostet

Tufalabs

@tufalabs

21.11.

Today, MindsAI (@MindsAI_Jack @MohamedOsmanML @bayesilicon) becomes part of @tufalabs First assignment: complete the @arcprize challenge

Michael Hodel hat repostet

Jack Cole

@MindsAI_Jack

17.11.

Great presentation on some unique TTT ideas and experiments by Jonas Hübotter @tufalabs. youtu.be/vei7uf9wOxI?si…

MindsAI_Jack's tweet card. Learning at test time in LLMs

youtube.com

YouTube

Learning at test time in LLMs

Quelle: youtube.com

Michael Hodel hat repostet

François Chollet

@fchollet

14.11.2024

Consulting my heart... Ok, looks like you haven't. But whenever you have a SotA (or close) solution built on top of the OpenAI API we're more than happy to verify it and add it to the public ARC Prize leaderboard. Anything using less than $10k worth of API calls is eligible.

Sam Altman

@sama

14.11.2024

in your heart do you believe we’ve solved that one or no?

Michael Hodel hat repostet

Andreas Köpf

@neurosp1ke

13.11.2024

ARC prize 2024 🥈place paper by the ARChitects who scored 53.5 (56.5): github.com/da-fr/arc-priz… - Transformers/LLMs are for ARC what ConvNets were for Imagenet - strong base model, TTT, specialized datasets (e.g. @bayesilicon’s re-arc) + novel: DFS sampling with LLM critique

neurosp1ke's tweet image. ARC prize 2024 🥈place paper by the ARChitects who scored 53.5 (56.5): github.com/da-fr/arc-priz…
- Transformers/LLMs are for ARC what ConvNets were for Imagenet
- strong base model, TTT, specialized datasets (e.g. @bayesilicon’s re-arc) + novel: DFS sampling with LLM critique

Michael Hodel

@bayesilicon

11.11.2024

very excited to win guys, it's been such a blast! let's goo

Jack Cole

@MindsAI_Jack

11.11.2024

@arcprize 2024 with more than 16k entrants just ended after 5 months, and we rank #1 (@bayesilicon @MohamedOsmanML)! We just scored 58% with a submission that finished after the deadline! We're just getting started. We hope to have an announcement about @tufalabs soon.…

MindsAI_Jack's tweet image. @arcprize 2024 with more than 16k entrants just ended after 5 months, and we rank #1 (@bayesilicon @MohamedOsmanML)! We just scored 58% with a submission that finished after the deadline! We're just getting started. We hope to have an announcement about @tufalabs soon.…

Michael Hodel hat repostet

Andreas Köpf

@neurosp1ke

08.11.2024

Have been working on my 2nd synthetic ARC riddle generator (agent: ideation -> prog generation). Got >1k diverse generator+solver pairs as PoC so far. Some nice examples:

neurosp1ke's tweet image. Have been working on my 2nd synthetic ARC riddle generator (agent: ideation -&gt; prog generation). Got &gt;1k diverse generator+solver pairs as PoC so far. Some nice examples:

Michael Hodel hat repostet

Machine Learning Street Talk

@MLStreetTalk

06.11.2024

I finally got to meet @fchollet in person recently to interview him about @arcprize, intelligence vs memorization, human cognitive development, learning abstractions, limits of pattern recognition and consciousness development. These are the best bits. Full show released tomorrow

Michael Hodel hat repostet

Mohamed Osman

@MohamedOsmanML

03.11.2024

We got upto 55.5% on the @arcprize leaderboard today! Progress towards the 60.2 % milestone of median human performance reported by arxiv.org/pdf/2409.01374 is not slowing down. @MindsAI_Jack @bayesilicon

MohamedOsmanML's tweet image. We got upto 55.5% on the @arcprize leaderboard today!
Progress towards the 60.2 % milestone of median human performance reported by arxiv.org/pdf/2409.01374 is not slowing down.
@MindsAI_Jack @bayesilicon

Michael Hodel hat repostet

Kevin Ellis

@ellisk_kellis

02.11.2024

New ARC-AGI paper @arcprize w/ fantastic collaborators @xu3kev @HuLillian39250 @ZennaTavares @evanthebouncy @BasisOrg For few-shot learning: better to construct a symbolic hypothesis/program, or have a neural net do it all, ala in-context learning? cs.cornell.edu/~ellisk/docume…

ellisk_kellis's tweet image. New ARC-AGI paper
@arcprize w/ fantastic collaborators @xu3kev @HuLillian39250 @ZennaTavares @evanthebouncy @BasisOrg
For few-shot learning: better to construct a symbolic hypothesis/program, or have a neural net do it all, ala in-context learning?
cs.cornell.edu/~ellisk/docume…

Michael Hodel hat repostet

Jack Cole

@MindsAI_Jack

29.10.2024

New SoTA on ARC-AGI. Nothing like the synergy of an awesome team (@bayesilicon @MohamedOsmanML). From 53 to 54.5 today. Onward and upward! 🚀 @arcprize @mikeknoop @fchollet @bryanlanders @GregKamradt @MLStreetTalk First. #kaggle - kaggle.com/competitions/a…

Michael Hodel

@bayesilicon

23.10.2024

yes

Jack Cole

@MindsAI_Jack

23.10.2024

I told you Michael Hodel was cooking something hot, but this is pure 🔥. Great work Michael. 🏆 Shall we go for 60?

Michael Hodel

@bayesilicon

23.10.2024

just achieved a score of 53% on the @arcprize - what a feeling! @MindsAI_Jack @MohamedOsmanML lets gooo!

Michael Hodel hat repostet

Wenhao Li

@WenhaoLi29

15.10.2024

We trained a Vision Transformer to solve ONE single task from @fchollet and @mikeknoop’s @arcprize. Unexpectedly, it failed to produce the test output, even when using 1 MILLION examples! Why is this the case? 🤔