Leland Richardson
@intelligibabble
Technical Staff @AnthropicAI. Prev: Jetpack Compose @Google. I like learning, discussing, and diving into challenges.
Why is pasting into VSCode Terminal slow? Because it sleeps for 5ms every 50 characters.
An important lesson that ARC-AGI has internalized, but not many others have, is that benchmark perf is a function of test-time compute. @OpenAI publishes single-number benchmark results because it's simpler and people expect to see it, but ideally all evals would have an x-axis.
A year ago, we verified a preview of an unreleased version of @OpenAI o3 (High) that scored 88% on ARC-AGI-1 at est. $4.5k/task. Today, we’ve verified a new GPT-5.2 Pro (X-High) SOTA score of 90.5% at $11.64/task. This represents a ~390X efficiency improvement in one year.
If you understand the world you know it's actually @AmandaAskell.
Japanese Prime Minister Sanae Takaichi rockets to number 3 on the Forbes World's Most Powerful Women list, behind Christine Lagarde and Ursula von der Leyen forbes.com/lists/power-wo…
derivedStateOf is explained in depth in this great post by Zach Klippenstein, including its performance overhead. Zach's posts never disappoint. Give it a read! blog.zachklipp.com/how-derivedsta…
Claude Code on Android! Check it out - just the beginning, lots more useful things coming, but I've been finding it surprisingly useful!
You can now run Claude Code tasks from the Claude Android app, in research preview. Kick off cloud-based tasks from your phone, let Claude run, then pick them up later to review and merge work. Download Claude for Android: play.google.com/store/apps/det…
FYI we just shipped Claude Code on Android!
I'm genuinely surprised / impressed that we broke the ARC-AGI-2 50% mark in 2025.
I think I’ve published the first research article in theoretical physics in which the main idea came from an AI - GPT5 in this case. The physics research paper itself (on QFT and state-dependent quantum mechanics) has been published in Physics Letters B. I've written an…
One of you AI companies better buy us next or I'm going to make a bunch of breaking changes and invalidate all of the code you generate.
few will understand this. for the last few decades, UI/UX has been built on a stable background assumption: the human is the only general intelligence in the loop. software is procedural. UI's job is to make that procedure discoverable and efficient: - show the state of the…
Me: Ok. This is a really tricky bug. Gonna turn on thinking for Opus bc honestly, i don't even know if it'll be able to fix it *even with thinking*. Opus: *reads one file, zero thinking tokens, edits file*. "Fixed!" Me: you don't have to be a jerk about it...
Google's vision of AI Tutors is amazing. This is Project Astra which is now productized inside Gemini:
Really happy to see this research on reward hacking and misalignment get published by Anthropic: anthropic.com/research/emerg… When I first heard about this internally it was a real "holy sh*t" moment for me. Reading through some of these transcripts is legitimately surreal. Aside…
An emergent capability of Nano Banana Pro that took me by surprise: the ability to generate beautiful & accurate charts that are to scale.
My most amusing interaction was where the model (I think I was given some earlier version with a stale system prompt) refused to believe me that it is 2025 and kept inventing reasons why I must be trying to trick it or playing some elaborate joke on it. I kept giving it images…
Cosign. It's interesting to think about what would be different if iOS and Android didn't have these astronomical toll booths for just allowing users the privilege of *installing software* on the pocket computer they bought
It's truly depressing to see the Android org's leadership repeatedly shooting themselves in the foot. It takes 20 years to build a reputation and five minutes to ruin it. f-droid.org/en/2025/10/28/… Users should be able to choose what software they run on the devices they own.
I recently decided to put a few of my astro photos on display in my office. Ended up with a few of these 18x12 metal prints, which I think really show off some punchy contrast! Very happy with how this ended up looking - will probably be filling this out with another 6 or so if…