
Deep Fried Net
@DeepFriedNet
Human in the loop
Every time ASI has to construct a low dimensional UI to bridge communication... "This is a manifestation of the Continuum that we hope falls within your level of comprehension."

One more paper from the lab this week! 🥴 Multi-objective optimization of biological sequences isn’t limited to discrete diffusion. We present AReUReDi, our new framework that extends rectified discrete flows to provably converge to the Pareto front! Hope you're ready! 👇 📜:…



To the Moon with @NASA! Our second Blue Moon MK1 lander is already in production and well-suited to support the VIPER rover. Building on the learnings from our first MK1 lander, this mission is important for future lunar permanence and will teach us about the origin and…

Can we use video diffusion to generate 3D scenes? 𝐖𝐨𝐫𝐥𝐝𝐄𝐱𝐩𝐥𝐨𝐫𝐞𝐫 (#SIGGRAPHAsia25) creates fully-navigable scenes via autoregressive video generation. Text input -> 3DGS scene output & interactive rendering! 🌍 mschneider456.github.io/world-explorer/ 📽️ youtu.be/N6NJsNyiv6I
Tiny SOTA model release today: v3 of the Smart Turn semantic VAD model. Smart Turn is a native audio, open source, open data, open training code model for detecting whether a human has stopped speaking and expects a voice agent to respond. The model now runs in <60ms on most…

Introducing Alterego: the world’s first near-telepathic wearable that enables silent communication at the speed of thought. Alterego makes AI an extension of the human mind. We’ve made several breakthroughs since our work started at MIT. We’re announcing those today.
Visual Story-Writing. While you write, our word processor visualizes the timeline, world map, and character relationships. Editing these visuals updates the story (e.g. drag a character on the map to move them). This summarizes our #UIST2025 paper. #HCI #LLMs #AI Thread 🧵 (1/8)
Check out what you can do when you mix Gemini's world knowledge with the ability to show things visually. Multimodal communication abilities unlock new use cases!
Made a walkthrough vid for Magenta RealTime “Audio Injection”! The notebook takes ~10m to spin up, but totally worth it for the surreal experience 🎤💻🎧⁉️
LongSplat: Robust Unposed 3D Gaussian Splatting for Casual Long Videos
National security reframe (in order to get funding to solve the problem) - could an adversary be performing a death by 1000 spam calls attack to agitate and distract an entire population (of engineers)? 😅
I get ~10 spam calls per day (various automated voicemails, "loan pre-approval" etc) and ~5 spam messages per day (usually phishing). - I have AT&T Active Armor, all of the above still slips through. - All of the above is always from new, unique numbers so blocking doesn't work.…
🚀 GLiNER x SmolLM: a new joint encoder-decoder architecture 🚀 We are excited to release a new kind of GLiNER model built with the mantra "you do the same things only once." Built on top of DeBERTa + @huggingface SmolLM2 — full details below 👇
Realtime interactive generative models FTW! Announcing a new 🌊 of details and features for Magenta RealTime, the open weights live music AI model from GDM! * Live Jamming with audio input 🎤🎸🎵 * Personalize your own models 🔧 * Tech report 📜 Links below in the 🧵...
LMStudio are using the upstream ggml implementation which is significantly better and well optimized. Looking at ollama's modifications in ggml, they have too much branching in their MXFP4 kernels and the attention sinks implementation is really inefficient. Along with other…
There's a new tiny TTS model in town: Kitten TTS! 🐱 With just 15M parameters (<25 MB), it delivers impressive quality for its size, and can even run in real time without a GPU. So, I created a web demo for it: featuring text normalization, chunking, and real-time playback. 🤗
Introducing Kitten TTS, a SOTA tiny text-to-speech model - Just 15M parameters - Runs without a GPU - Model size less than 25 MB - Multiple high-quality voices - Ultra-fast - even runs on low-end edge devices Github and HF links below
"Infinito Particular"
Introducing Genie 3, our state-of-the-art world model that generates interactive worlds from text, enabling real-time interaction at 24 fps with minutes-long consistency at 720p. 🧵👇
In a few years, on a cool summer night, Generation Betas will bring their family robots out for a neighborhood game of hide and seek. A father will be out on the porch, drinking lemonade, proudly reflecting on successfully convincing them to name the game, "Terminator".
I don't have any special inside knowledge about how @Kimi_Moonshot trained Kimi K2. I just read the paper and this part is what I've been telling anyone who will listen about. Their data generation steps to get lots of high quality, multi-turn agent traces to train on is so much…


I'm going around telling anyone who will listen about how @Kimi_Moonshot Kimi K2 was trained
Smart Turn v2: open source, native audio turn detection in 14 languages. New checkpoint of the open source, open data, open training code, semantic VAD model on @huggingface, @FAL, and @pipecat_ai. - 3x faster inference (12ms on an L40) - 14 languages (13 more than v1, which…
Everyone knows action chunking is great for imitation learning. It turns out that we can extend its success to RL to better leverage prior data for improved exploration and online sample efficiency! colinqiyangli.github.io/qc/ The recipe to achieve this is incredibly simple. 🧵 1/N
Can an AI model predict perfectly and still have a terrible world model? What would that even mean? Our new ICML paper formalizes these questions. One result tells the story: A transformer trained on 10M solar systems nails planetary orbits. But it botches gravitational laws. 🧵