Tyler
@Developer65537
🇺🇸 Data Analyst (PM) ← AI Startup Co-founder | IBM Quantum Developer | Toptal-Certified Full-Stack Engineer | Math Olympiad Camp | Invest, AI, Quantum, Space
Scientists demonstrated a method for scalable spin squeezing using the dipolar interactions in an ultracold gas of fermionic erbium atoms, achieving 7.1 dB of metrologically useful squeezing with a protocol that can be implemented with neutral atoms. go.aps.org/3LLdgZN
I highly encourage anyone interested in Delta Attention/DeltaNet to read through this whole thread. You can see how I start from practically zero and try to understand Kimi Delta Attention and the related linear attention literature by spamming Grad with questions.
Originally, a while back, I got some intuition for it from the query-key-value perspective, which might help (there's also the gradient-descent perspective, which is good too). Scraped this from a chat with @stochasticchasm a year ago, so it might be a bit dodgy. Imagine you want to store…
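To make that key-value storage picture concrete, here is a minimal NumPy sketch of the delta-rule fast-weight update that DeltaNet-style linear attention builds on. The shapes, the per-token beta gate, and the unit-norm-key assumption are illustrative simplifications, not Kimi Delta Attention's actual implementation.

```python
import numpy as np

def delta_rule_memory(q, k, v, beta):
    """Per-token delta-rule fast-weight memory (the key-value view of DeltaNet).

    q, k, v: arrays of shape (T, d); beta: shape (T,) write strengths in [0, 1].
    Keys are assumed roughly unit-norm so the update stays stable.
    Returns the per-step read-outs o_t = S_t q_t.
    """
    T, d = q.shape
    S = np.zeros((d, d))          # fast-weight matrix storing value <- key associations
    outputs = np.zeros((T, d))
    for t in range(T):
        v_old = S @ k[t]          # what the memory currently returns for this key
        # Delta rule: write only the part of v_t the memory got wrong,
        # instead of blindly adding another rank-1 copy of (v_t, k_t).
        S = S + beta[t] * np.outer(v[t] - v_old, k[t])
        outputs[t] = S @ q[t]     # read out with the query
    return outputs
```

The point of the delta rule versus a plain additive update is that writing to a key the memory has already seen corrects the stored value rather than piling a second copy on top of it.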
It's a shame Kaluza-Klein theory didn't pan out, because we'd have a badass particle called the 'Graviphoton'. Graviphoton cannons, Graviphoton thrusters, etc.
Optimal Flow Matching (OFM) is a cutting-edge deep learning technique for training generative models. It learns the most efficient path (an "optimal flow") to transform simple noise into complex data, like a realistic image. It's known for being much faster and more stable to…
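For context on what the "optimal flow" is being fit to: below is a minimal PyTorch sketch of the standard conditional flow-matching training step (straight-line paths from noise to data) that OFM builds on. The `velocity_net` model and batch shapes are hypothetical, and OFM's distinguishing step of choosing an optimal-transport pairing between noise and data samples is omitted here.

```python
import torch
import torch.nn as nn

def flow_matching_loss(velocity_net: nn.Module, x1: torch.Tensor) -> torch.Tensor:
    """One conditional flow-matching training step with straight-line paths.

    x1: a batch of data samples, shape (B, D). velocity_net(x_t, t) is a
    hypothetical model predicting the velocity field at time t in [0, 1].
    """
    x0 = torch.randn_like(x1)                         # Gaussian noise endpoints
    t = torch.rand(x1.shape[0], 1, device=x1.device)  # one random time per sample
    xt = (1 - t) * x0 + t * x1                        # point on the straight noise->data path
    target_velocity = x1 - x0                         # velocity of that path (constant in t)
    pred = velocity_net(xt, t)
    return ((pred - target_velocity) ** 2).mean()     # regress onto the target field
```

At sampling time the learned velocity field is integrated from t = 0 (noise) to t = 1 (data) with an ODE solver; the straighter the learned paths, the fewer solver steps are needed.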
Diffusion Language Models are Super Data Learners… now on arXiv with MegaDLMs, the full large-scale training framework (6.1K H100s, 462B-param run, 47% MFU). Supports diffusion and autoregressive LMs, dense and MoE architectures, FP8/BF16/FP16 precision, and multi-axis…
Breaking release! OpenHands Software Agent SDK — a complete redesign of the popular 64k⭐️ OpenHands framework. It offers:
• Plug-and-play agent interfaces
• Sandboxed & portable execution
• Multi-LLM routing
• Built-in security analysis
Benchmarks on SWE-Bench Verified &…
🚨 MIT just humiliated every major AI lab and nobody's talking about it. They built a new benchmark called WorldTest to see if AI actually understands the world… and the results are brutal. Even the biggest models, Claude, Gemini 2.5 Pro, and OpenAI o3, got crushed by humans.…
congrats to llama 3 large for winning the LLM trading contest by not participating
When the XPENG IRON gracefully approaches you, @Tesla_Optimus how will you greet her?
AI can become a fully autonomous data scientist. DeepAnalyze-8B is the first agentic LLM capable of handling the entire data science pipeline—from raw data to analyst-grade research reports—without predefined workflows. It learns like a human via a curriculum-based agentic…
New: Qwen-Image-2509-MultipleAngles. Very solid model, and there are probably a lot of creative use cases to find with it. ⬇️ Free demo available on Hugging Face
Met a Meta AI researcher. He studied Physics in China, came to the US for a PhD in Physics, and then fell in love with AI, despite never having studied computer science. He watched Andrej Karpathy and Andrew Ng, bought a GPU, read every arXiv paper title daily, and dived into…
Top AI Papers of the Week (Oct 27 - Nov 2):
- SmolLM2
- AgentFold
- Precision-RL
- Multi-Agent Evolve
- Agent Data Protocol
- Graph-based Agent Planning
- Introspective Awareness in LLMs
Read on for more:
Google $GOOG director just said HALF of all code is now being written by AI. Wow…
Wow, language models can talk without words. A new framework, Cache-to-Cache (C2C), lets multiple LLMs communicate directly through their KV-caches instead of text, transferring deep semantics without token-by-token generation. It fuses cache representations via a neural…
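As a rough illustration of the idea only (not the paper's actual fusion module, which the tweet cuts off before naming), one way to picture cache-to-cache communication is a learned projection from the sender model's KV-cache space into the receiver's, blended with the receiver's own cache. Everything below, including the gating, is an assumption for illustration.

```python
import torch
import torch.nn as nn

class ToyCacheFuser(nn.Module):
    """Illustrative sketch only: project a sender model's KV-cache entries into
    the receiver's hidden size and gate them into the receiver's cache."""

    def __init__(self, sender_dim: int, receiver_dim: int):
        super().__init__()
        self.proj = nn.Linear(sender_dim, receiver_dim)  # learned cross-model projection
        self.gate = nn.Parameter(torch.zeros(1))         # how much injected context to trust

    def forward(self, receiver_kv: torch.Tensor, sender_kv: torch.Tensor) -> torch.Tensor:
        # receiver_kv: (T, receiver_dim); sender_kv: (T, sender_dim), aligned per position.
        injected = self.proj(sender_kv)
        alpha = torch.sigmoid(self.gate)                 # starts at 0.5; learned during fusion training
        return (1 - alpha) * receiver_kv + alpha * injected
```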
Quantum computing is best done in the permanently shadowed craters on the Moon
Most multi-drone systems struggle with one thing: coordination under real-world constraints. A new paper in Science Robotics from TU Delft proposes a model-based approach that lets multiple quadrotors jointly move and orient a cable-suspended load. BUT: without relying on…
I'm confused about the "10,000× more efficient" part. This means you could train a Stable Diffusion 3-like model with roughly $20 worth of electricity. What stops them from building a model and demonstrating it, beyond *checks notes* ... Fashion MNIST? I'm genuinely curious what's stopping them…
United States Trends
- 1. #GRAMMYs 211K posts
- 2. Clipse 7,857 posts
- 3. addison rae 12.7K posts
- 4. olivia dean 7,617 posts
- 5. #FanCashDropPromotion 2,813 posts
- 6. gaga 69.6K posts
- 7. Dizzy 7,525 posts
- 8. Katseye 75.7K posts
- 9. Leon Thomas 9,115 posts
- 10. Kehlani 24.8K posts
- 11. ravyn lenae N/A
- 12. lorde 8,844 posts
- 13. #FridayVibes 5,984 posts
- 14. AOTY 12.5K posts
- 15. Album of the Year 37.9K posts
- 16. Best New Artist 13.8K posts
- 17. GAME DAY 27.8K posts
- 18. Good Friday 65.3K posts
- 19. Benito 5,853 posts
- 20. Durand 3,093 posts
You might like
-
𝒏🅞𝒈𝒖❶𝒔𝒔
@n0gu1ss_yt -
idea
@IdeaSimulated -
tttppp
@TtttpppppM -
狐坂
@kitsunezaka55 -
黒猫さん🐈⬛
@StudioNoGimmick -
青月猫
@seigetsubyo -
hugues.freesoul
@FreesoulHugues -
透明色素
@toumei_shikiso -
ΛIBΞN
@aibenxyz -
AI育生計画
@questionsnake -
スノク200
@sunouku -
HO・KA・U
@hrathnir7 -
しずのり@AIイラスト投稿用
@uni_nori_ -
voxeloops
@voxeloops -
びとびとさん
@betoo3_char
Something went wrong.
Something went wrong.