Developer65537's profile picture. 🇺🇸 Data Analyst (PM) ← AI Startup Co-founder | IBM Quantum Developer | Toptal-Certified Full-Stack Engineer | Math Olympiad Camp | Invest, AI, Quantum, Space

Tyler

@Developer65537

🇺🇸 Data Analyst (PM) ← AI Startup Co-founder | IBM Quantum Developer | Toptal-Certified Full-Stack Engineer | Math Olympiad Camp | Invest, AI, Quantum, Space

Tyler reposted

Scientists demonstrated a method for scalable spin squeezing using the dipolar interactions in an ultracold gas of fermionic erbium atoms, achieving 7.1 dB of metrologically useful squeezing using a protocol that can be implemented with neutral atoms. go.aps.org/3LLdgZN

PhysRevX's tweet image. Scientists demonstrated a method for scalable spin squeezing using the dipolar interactions in an ultracold gas of fermionic erbium atoms, achieving 7.1 dB of metrologically useful squeezing using a protocol that can be implemented with neutral atoms.

go.aps.org/3LLdgZN

Tyler reposted

I highly encourage anyone interested in Delta Attention/Deltanet to read through this whole thread. You can see how I start from practically 0 and am trying to understand Kimi Delta Attention and related linear attention literature by spamming Grad with questions.

nrehiew_'s tweet image. I highly encourage anyone interested in Delta Attention/Deltanet to read through this whole thread.

You can see how I start from practically 0 and am trying to understand Kimi Delta Attention and related linear attention literature by spamming Grad with questions.

Originally a while back i got some intuition with it from the query key value perspective which might help (theres also the gradient descent perspective which is good too). Scraped this from a chat with @stochasticchasm a year ago so might be a bit dodgy. Imagine u want to store…



Tyler reposted

It's a shame Kaluza-Klein theory didn't pan out, because we'd have badass particle called the 'Graviphoton' Graviphoton cannons, Graviphoton thrusters, etc

Andercot's tweet image. It's a shame Kaluza-Klein theory didn't pan out, because we'd have badass particle called the 'Graviphoton' 

Graviphoton cannons, Graviphoton thrusters, etc

Tyler reposted

Optimal Flow Matching (OFM) is a cutting-edge deep learning technique for training generative models. It learns the most efficient path (an "optimal flow") to transform simple noise into complex data, like a realistic image. It's known for being much faster and more stable to…

probnstat's tweet image. Optimal Flow Matching (OFM) is a cutting-edge deep learning technique for training generative models. It learns the most efficient path (an "optimal flow") to transform simple noise into complex data, like a realistic image. It's known for being much faster and more stable to…

Tyler reposted

Diffusion Language Models are Super Data Learners… now on arXiv with MegaDLMs, the full large-scale training framework (6.1K H100s, 462B-param run, 47 % MFU). Supports diffusion and autoregressive LMs, dense and MoE architectures, FP8/BF16/FP16 precision, and multi-axis…

gm8xx8's tweet image. Diffusion Language Models are Super Data Learners… now on arXiv with MegaDLMs, the full large-scale training framework (6.1K H100s, 462B-param run, 47 % MFU).
Supports diffusion and autoregressive LMs, dense and MoE architectures, FP8/BF16/FP16 precision, and multi-axis…

Tyler reposted

Breaking release! OpenHands Software Agent SDK — a complete redesign of the popular 64k⭐️ OpenHands framework. It offers: • Plug-and-play agent interfaces • Sandboxed & portable execution • Multi-LLM routing • Built-in security analysis Benchmarks on SWE-Bench Verified &…

jiqizhixin's tweet image. Breaking release!

OpenHands Software Agent SDK — a complete redesign of the popular 64k⭐️ OpenHands framework. It offers:

• Plug-and-play agent interfaces
• Sandboxed & portable execution
• Multi-LLM routing
• Built-in security analysis

Benchmarks on SWE-Bench Verified &…

Tyler reposted

🚨 MIT just humiliated every major AI lab and nobody’s talking about it. They built a new benchmark called WorldTest to see if AI actually understands the world… and the results are brutal. Even the biggest models Claude, Gemini 2.5 Pro, OpenAI o3 got crushed by humans.…

bigaiguy's tweet image. 🚨 MIT just humiliated every major AI lab and nobody’s talking about it.

They built a new benchmark called WorldTest to see if AI actually understands the world… and the results are brutal.

Even the biggest models Claude, Gemini 2.5 Pro, OpenAI o3 got crushed by humans.…

Tyler reposted

congrats to llama 3 large for winning the LLM trading contest by not participating

yifever's tweet image. congrats to llama 3 large for winning the LLM trading contest by not participating

Tyler reposted

When the XPENG IRON gracefully approaches you, @Tesla_Optimus how will you greet her?


Tyler reposted

AI can become a fully autonomous data scientist. DeepAnalyze-8B is the first agentic LLM capable of handling the entire data science pipeline—from raw data to analyst-grade research reports—without predefined workflows. It learns like a human via a curriculum-based agentic…

jiqizhixin's tweet image. AI can become a fully autonomous data scientist.

DeepAnalyze-8B is the first agentic LLM capable of handling the entire data science pipeline—from raw data to analyst-grade research reports—without predefined workflows. It learns like a human via a curriculum-based agentic…

Tyler reposted

New: Qwen-Image-2509-MultipleAngles. Very solid model and probably a lot of creative use cases to find with it. ⬇️ Free demo available on Hugging Face

victormustar's tweet image. New: Qwen-Image-2509-MultipleAngles. Very solid model and probably a lot of creative use cases to find with it.

⬇️ Free demo available on Hugging Face

Tyler reposted

Met a Meta AI researcher. He studied Physics in China, came to the US for a PhD in Physics, and then fell in love with AI, despite never having studied computer science. He watched Andrej Karpathy and Andrew Ng, bought a GPU, read every arXiv paper title daily, and dived into…


Tyler reposted

Top AI Papers of the Week (Oct 27 - Nov 2): - SmolLM2 - AgentFold - Precision-RL - Multi-Agent Evolve - Agent Data Protocol - Graph-based Agent Planning - Introspective Awareness in LLMs Read on for more:


Tyler reposted

Google $GOOG director just said HALF of all code is now being written by AI. Wow…


Tyler reposted

Wow, language models can talk without words. A new framework, Cache-to-Cache (C2C), lets multiple LLMs communicate directly through their KV-caches instead of text, transferring deep semantics without token-by-token generation. It fuses cache representations via a neural…

jiqizhixin's tweet image. Wow, language models can talk without words.

A new framework, Cache-to-Cache (C2C), lets multiple LLMs communicate directly through their KV-caches instead of text, transferring deep semantics without token-by-token generation. 

It fuses cache representations via a neural…

Tyler reposted

Quantum computing is best done in the permanently shadowed craters on the Moon


Tyler reposted

Most multi-drone systems struggle with one thing: coordination under real-world constraints. A new paper in Science Robotics from TU Delft proposes a model-based approach that lets multiple quadrotors jointly move and orient a cable-suspended load. BUT: without relying on…


Tyler reposted

NEO The Home Robot Order Today


Tyler reposted

Im confused about "10,000 more efficient" part. This means you can train stable-diffusion-3 like model with 20$~ ish amount of electricity. What stops them from building a model and demonstrating it, beyond *checks note* ... Fashion MNIST? Im genuinely curious whats stopping them…

Hello Thermo World.



Tyler reposted

learn a lot more here: extropic.ai


Loading...

Something went wrong.


Something went wrong.