Electrik Dreams

@techdreamzai

Seeing the world through AI-tinted glasses ✨

San Francisco

Joined May 2023

91Posts 26Followers 106Following

Electrik Dreams reposted

AISecHub

@AISecHub

Nov 9

Chatbot Privacy: An Analysis of Frontier AI Policies - arxiv.org/pdf/2509.05382 | youtu.be/3EuVeBql9bc A study of six major U.S. LLM developers’ privacy policies (PPs) found all use user chat data for training by default, often requiring users to opt-out. This data can include…

AISecHub's tweet image. Chatbot Privacy: An Analysis of Frontier AI Policies - arxiv.org/pdf/2509.05382 | youtu.be/3EuVeBql9bc

A study of six major U.S. LLM developers’ privacy policies (PPs) found all use user chat data for training by default, often requiring users to opt-out. This data can include…

Electrik Dreams reposted

Kirk Borne

@KirkDBorne

Nov 9

Get a #Python Workout with these 50 ten-minute exercises: amzn.to/3vUkpz2 by @reuvenmlerner ———— #ML #AI #DataScience #ComputationalScience #MachineLearning #DataScientist #SoftwareDevelopment ———— ➡️➡️GitHub repo for the book: github.com/reuven/python-…

KirkDBorne's tweet card. Summary The only way to master a skill is to practice. In Python Workout, author Reuven M. Lerner guides you through 50 carefully selected exercises that invite you to flex your programming muscles....

Python Workout: 50 ten-minute exercises

Source: amazon.com

Electrik Dreams reposted

机器之心 JIQIZHIXIN

@jiqizhixin

Nov 9

Can AI truly understand tools like humans do? Researchers introduce PhysToolBench, the first benchmark testing MLLMs’ grasp of physical tools—from recognizing and explaining how they work to creatively inventing new ones when none are available. Tests on 32 leading models show…

jiqizhixin's tweet image. Can AI truly understand tools like humans do?

Researchers introduce PhysToolBench, the first benchmark testing MLLMs’ grasp of physical tools—from recognizing and explaining how they work to creatively inventing new ones when none are available.

Tests on 32 leading models show…

Electrik Dreams reposted

shre

@itsDrDrewithaSh

18 h

Good to be back 🇬🇧 And what better way to kick off the jet lagged morning with energy from a @vercel v0 x @meetgranola hackathon!

mehedi

@mehedih_

18 h

hackathon day @meetgranola

Electrik Dreams reposted

Y Combinator

@ycombinator

14 h

Kestrel (@kestrel__ai) is the agentic platform that unifies Kubernetes ops & security, replacing manual triage and on-call firefighting with autonomous investigations and one-click fixes. ycombinator.com/launches/Ona-k… Congrats on the launch, @ramanv_ & @Evanj80!

Electrik Dreams reposted

TestingCatalog News 🗞

@testingcatalog

Nov 8

ERNIE-5.0-Preview-1022 from Baidu got a preliminary high ranking on LMArena and scored 1432 points. Feels like the gap is getting very small 👀

testingcatalog's tweet image. ERNIE-5.0-Preview-1022 from Baidu got a preliminary high ranking on LMArena and scored 1432 points.

Feels like the gap is getting very small 👀

ERNIE for Developers

@ErnieforDevs

Nov 7

🎉We’re thrilled to share that the ERNIE-5.0-Preview-1022 now ranks #2 (tied) globally on the LMArena Text leaderboard — one of the world’s most recognized benchmarks for large language models driven by real-world use. As part of our commitment, we plan to officially release…

ErnieforDevs's tweet image. 🎉We’re thrilled to share that the ERNIE-5.0-Preview-1022 now ranks #2 (tied) globally on the LMArena Text leaderboard — one of the world’s most recognized benchmarks for large language models driven by real-world use.

As part of our commitment, we plan to officially release…

Electrik Dreams reposted

fly51fly

@fly51fly

Nov 8

[CL] Towards Robust Mathematical Reasoning T Luong, D Hwang, H H. Nguyen, G Ghiasi... [Google DeepMind] (2025) arxiv.org/abs/2511.01846

Electrik Dreams reposted

VraserX e/acc

@VraserX

21 h

OpenAI’s CFO says an IPO isn’t on the near-term agenda, cutting through the trillion-dollar hype. The company is focused on scaling its models, infrastructure, and products, not chasing market valuation. It’s a reminder that the real race in AI isn’t about listings or stock…

VraserX's tweet image. OpenAI’s CFO says an IPO isn’t on the near-term agenda, cutting through the trillion-dollar hype.

The company is focused on scaling its models, infrastructure, and products, not chasing market valuation.

It’s a reminder that the real race in AI isn’t about listings or stock…

Electrik Dreams reposted

Sarbjeet Johal

@sarbjeetjohal

21 h

THE TALK WHICH STARTED THE RUMORS OF #GovernmentalAIBackstop, and media took those to government bailout of AI companies levels: Sarah Friar, @OpenAI's CFO, discusses the company’s funding strategies, computing challenges and rapid growth, while emphasizing the importance of…

Electrik Dreams reposted

DailyPapers

@HuggingPapers

Nov 8

PHUMA: A new physically-grounded dataset for humanoid locomotion This large-scale dataset, developed by @DAVIANRobotics & KAIST AI, uses human video and physics-constrained retargeting to eliminate physical artifacts like floating and foot skating. It's 3x larger than AMASS and…

Electrik Dreams reposted

fly51fly

@fly51fly

Nov 7

[LG] The Illusion of Certainty: Uncertainty quantification for LLMs fails under ambiguity T Tomov, D Fuchsgruber, T Wollschläger, S Günnemann [Technical University of Munich] (2025) arxiv.org/abs/2511.04418

fly51fly's tweet image. [LG] The Illusion of Certainty: Uncertainty quantification for LLMs fails under ambiguity
T Tomov, D Fuchsgruber, T Wollschläger, S Günnemann [Technical University of Munich] (2025)
arxiv.org/abs/2511.04418

Electrik Dreams reposted

ARC Prize

@arcprize

Nov 8

ARC Prize 2025 - Paper Submission Due Tomorrow Final submissions to the ARC Prize 2025 Paper Prize are due on November 9, 2025

arcprize's tweet image. ARC Prize 2025 - Paper Submission Due Tomorrow

Final submissions to the ARC Prize 2025 Paper Prize are due on November 9, 2025

Electrik Dreams reposted

Charles Foster

@CFGeek

Nov 8

Highly recommend this 3-hour video. Makes me feel jealous of the researchers who get to explore model internals!

Neel Nanda

@NeelNanda5

Nov 7

We discuss their papers showing that model diffing is unexpectedly easy when fine-tuning in a narrow domain, and on finding and fixing flaws with crosscoders, a sparse autoencoder based approach Video: youtu.be/VQ_7zLXHf3s

NeelNanda5's tweet card. What do models learn during finetuning? A model diffing paper...

youtube.com

YouTube

What do models learn during finetuning? A model diffing paper...

Source: youtube.com

Electrik Dreams reposted

Robert Scoble

@Scobleizer

Nov 7

“Your video to 3D worlds in one second.” Wow. Everything I talked about with @3duaun for three years is coming true.

Hunyuan

@TencentHunyuan

Nov 7

The training code for Hunyuan World 1.1 (WorldMirror) is released now!🔥🔥🔥 This release provides researchers and developers the full stack for customization and fine-tuning: 📷 Your video to 3D worlds in 1 second. 🪄ANY input (image, video, 3D prior) to ANY output (3DGS,…

Electrik Dreams reposted

机器之心 JIQIZHIXIN

@jiqizhixin

Nov 7

Sakana AI is building artificial life and they can evolve! Petri Dish Neural Cellular Automata (PD-NCA) let multiple NCA agents learn and adapt during simulation, not just after training. Each cell updates its own parameters via gradient descent, turning morphogenesis into a…

Electrik Dreams reposted

fly51fly

@fly51fly

Nov 7

[CL] Are language models aware of the road not taken? Token-level uncertainty and hidden state dynamics A Zur, A Geiger, E S Lubana, E Bigelow [Stanford University & Goodfire & NTT Research] (2025) arxiv.org/abs/2511.04527

fly51fly's tweet image. [CL] Are language models aware of the road not taken? Token-level uncertainty and hidden state dynamics
A Zur, A Geiger, E S Lubana, E Bigelow [Stanford University &amp; Goodfire &amp; NTT Research] (2025)
arxiv.org/abs/2511.04527

Electrik Dreams reposted

Thinking Machines

@thinkymachines

Nov 7

Science is best shared! Tell us about what you’ve built or discovered with Tinker, so we can tell the world about it on our blog. More details at thinkingmachines.ai/blog/call-for-…

thinkymachines's tweet card. Announcing Tinker Community Projects

Tinker: Call for Community Projects

Source: thinkingmachines.ai

Electrik Dreams reposted

Unwind AI

@unwind_ai_

Nov 7

Spec-driven development tool for AI coding agents. Forces humans and AI to agree on specs before writing code. Separates current truth from proposed changes. Native slash command for Claude Code, Cursor, Codex, and more. 100% open-source.

unwind_ai_'s tweet image. Spec-driven development tool for AI coding agents.

Forces humans and AI to agree on specs before writing code. Separates current truth from proposed changes.

Native slash command for Claude Code, Cursor, Codex, and more.

100% open-source.

Electrik Dreams reposted

Chris Barber

@chrisbarber

Nov 7

benchmark is: scale.com/leaderboard/rli

Electrik Dreams reposted

Will Manidis

@WillManidis

Nov 7

if you write a good enough lesswrong post about how lenders repossessing data centers after default is actually an existential risk and massively increases the chance of unaligned private credit owned AGI I’m sure one of the labs would pay you a few billion a year at this point