Jing Yuan
@joekina
靖源 Focus on N.O.W
When you use an autonomous AI agent to automate your corporate expense reports and it makes a mistake or worse, who is responsible? Does it matter if I created the agent myself, if it was provided by the company, or if it was required by the company? And what about when you use…
Interesting experiment found that an AI agent built around the obsolete GPT-3.5 and GPT-4 models beat experienced human venture capital analysts in predicting which early-stage startups would survive based on early screening (at much lower costs as well). sciencedirect.com/science/articl…
Lex, thank you for the discussion!! Was great to see you.
Here's my conversation with Michael Levin (@drmichaellevin) about the nature of intelligence in biological systems, including unconventional & alien intelligence, agency, memory, consciousness, and life in all its forms here on Earth and beyond. It's here on X in full and is up…
One point I made that didn’t come across: - Scaling the current thing will keep leading to improvements. In particular, it won’t stall. - But something important will continue to be missing.
Demis Hassabis talks about the moment he wanted to pursue research. At 12, he was the world’s 2nd best chess player for his age. He went to a tournament and lost to a 30-year-old player, who was overly happy beating a kid. Demis loved chess but realized all the brainpower in that…
here are the most important points from today's ilya sutskever podcast: - superintelligence in 5-20 years - current scaling will stall hard; we're back to real research - superintelligence = super-fast continual learner, not finished oracle - models generalize 100x worse than…
From the makers of the popular AlphaGo documentary, The Thinking Game gives a much broader picture of the story of DeepMind and our mission to build AGI, drawing on interviews with myself and others going back many years. You can now freely watch it here: youtube.com/watch?v=d95J8y…
The Thinking Game | Full documentary | Tribeca Film Festival official...
New Harvard+MIT+Georgia Tech paper argues that truly understanding language means linking words to rich nonverbal brain systems that model reality. First, it explains that the brain's language regions mostly track patterns in words and grammar, similar to phone typing…
As one of the authors of the original “jagged frontier” paper, I think this undersells how jagged AI is (& likely will be) at even the level of individual jobs: having a couple of critical tasks that AI can’t do creates deep bottlenecks especially as shape of frontier is unknown.
How Claudey is Opus 4.5? We previously described Claudiness as "good at agentic tasks while being weaker at multimodal and math". This pattern remains when comparing Opus 4.5 to other newly-released models, though the gap on agentic coding and tool-calling benchmarks is small.
Ilya Sutskever – We're moving from the age of scaling to the age of...
This is insane… OpenAI, Anthropic & Google just got access to petabytes of proprietary data. The data is coming from the 17 National Laboratories, which have been hoarding experimental data for decades. We aren't just talking about better chatbots anymore. The US Government’s…
Introducing Claude Opus 4.5: the best model in the world for coding, agents, and computer use. Opus 4.5 is a step forward in what AI systems can do, and a preview of larger changes to how work gets done.
To my surprise, Opus 4.5 one-shot my hardest λ-calculus problem (tying with Gemini 3), and it did solve the stack underflow bug that an old checkpoint of Gemini 3 (NOT the deployed version) solved. So, in terms of first hour impression, that couldn't be more promising I guess...
14MB ram / 9MB disk (MB, *not* GB!) to index all of Windows 10, in 1 second. Index stays updated automatically. It's amazing what's possible with a modern computer if you actually care about engineering. voidtools.com
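Exactly! Marketing in this era doesn't need many demos or benchmarks at all. Just have your good-looking employee with the magnetic voice come out and say a few words 😆 This is also an ability AI can't replace 😂 youtu.be/56kq0VTkU4k?si…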
Introducing Claude Opus 4.5