nandahkrishna's profile picture. PhD student at @Mila_Quebec & @UMontreal, @MacHomebrew maintainer.

Nanda H Krishna

@nandahkrishna

PhD student at @Mila_Quebec & @UMontreal, @MacHomebrew maintainer.

Pinned

New preprint! 🧠🤖 How do we build neural decoders that are: ⚡️ fast enough for real-time use 🎯 accurate across diverse tasks 🌍 generalizable to new sessions, subjects, and species? We present POSSM, a hybrid SSM architecture that optimizes for all three of these axes! 🧵1/7

nandahkrishna's tweet image. New preprint! 🧠🤖
How do we build neural decoders that are:
⚡️ fast enough for real-time use
🎯 accurate across diverse tasks
🌍 generalizable to new sessions, subjects, and species?
We present POSSM, a hybrid SSM architecture that optimizes for all three of these axes!
🧵1/7

Nanda H Krishna reposted

[1/9] While pretraining data might be hitting a wall, novel methods for modeling it are just getting started! We introduce future summary prediction (FSP), where the model predicts future sequence embeddings to reduce teacher forcing & shortcut learning. 📌Predict a learned…


Nanda H Krishna reposted

Mila's annual supervision request process is now open to receive MSc and PhD applications for Fall 2026 admission! For more information, visit mila.quebec/en/prospective…

Mila_Quebec's tweet image. Mila's annual supervision request process is now open to receive MSc and PhD applications for Fall 2026 admission! For more information, visit mila.quebec/en/prospective…

Nanda H Krishna reposted

Qwen3-4B can match DeepSeek-R1 and o3-mini (high) with ONLY test-time scaling?🤯 Introducing Recursive Self-Aggregation (RSA), a new test-time scaling method: - parallel + sequential✅ - no verifiers✅ - no scaffolding✅ Then we use aggregation-aware RL to push further!🚀 🧵👇

thevineetjain's tweet image. Qwen3-4B can match DeepSeek-R1 and o3-mini (high) with ONLY test-time scaling?🤯

Introducing Recursive Self-Aggregation (RSA), a new test-time scaling method:
- parallel + sequential✅
- no verifiers✅
- no scaffolding✅

Then we use aggregation-aware RL to push further!🚀
🧵👇

Nanda H Krishna reposted

Two exciting updates 🚀 1️⃣ POSSM has been accepted to NeurIPS 2025! We'll see you in San Diego 🏖️! 2️⃣ I've officially started my PhD! Very grateful to stay at Mila, and excited to continue working on advancing both deep learning + science! 🧪🧬🧠

Super stoked to share my first first-author paper that introduces a hybrid architecture approach for real-time neural decoding. It's been a lot of work, but happy to showcase some very cool results!



Nanda H Krishna reposted

🚨Reasoning LLMs are e̵f̵f̵e̵c̵t̵i̵v̵e̵ ̵y̵e̵t̵ inefficient! Large language models (LLMs) now solve multi-step problems by emitting extended chains of thought. During the process, they often re-derive the same intermediate steps across problems, inflating token usage and…


Nanda H Krishna reposted

🚨 The call for demos is still open, the deadline is tomorrow! If you have a tool for visualizing large-scale data, pipelines for training foundation models, or BCI demos, we want to see it! Submission is only 500 words, and it's a great opportunity to showcase your work.

Excited to announce the Foundation Models for the Brain and Body workshop at #NeurIPS2025!🧠 We invite short papers or interactive demos on AI for neural, physiological or behavioral data. Submit by Aug 22 👉 brainbodyfm-workshop.github.io

mehdiazabou's tweet image. Excited to announce the Foundation Models for the Brain and Body workshop at #NeurIPS2025!🧠

We invite short papers or interactive demos on AI for neural, physiological or behavioral data.

Submit by Aug 22 👉 brainbodyfm-workshop.github.io


Nanda H Krishna reposted

🚨 We are extending the paper submission deadline to Friday, August 29, 11:59 pm AoE. Check our website for the latest updates on the Foundation Models for the Brain and Body workshop #NeurIPS2025 #BrainBodyFM

Excited to announce the Foundation Models for the Brain and Body workshop at #NeurIPS2025!🧠 We invite short papers or interactive demos on AI for neural, physiological or behavioral data. Submit by Aug 22 👉 brainbodyfm-workshop.github.io

mehdiazabou's tweet image. Excited to announce the Foundation Models for the Brain and Body workshop at #NeurIPS2025!🧠

We invite short papers or interactive demos on AI for neural, physiological or behavioral data.

Submit by Aug 22 👉 brainbodyfm-workshop.github.io


Excited to be organising the BrainBodyFM Workshop – in spirit, a successor to our #COSYNE Workshop on Neuro-foundation Models – at #NeurIPS2025! Check out the website for more details. 🧠🤖

Excited to announce the Foundation Models for the Brain and Body workshop at #NeurIPS2025!🧠 We invite short papers or interactive demos on AI for neural, physiological or behavioral data. Submit by Aug 22 👉 brainbodyfm-workshop.github.io

mehdiazabou's tweet image. Excited to announce the Foundation Models for the Brain and Body workshop at #NeurIPS2025!🧠

We invite short papers or interactive demos on AI for neural, physiological or behavioral data.

Submit by Aug 22 👉 brainbodyfm-workshop.github.io


Nanda H Krishna reposted

How to align your diffusion model with unseen objectives at inference time? Presenting Diffusion Tree Sampling/Search (DTS/DTS*) 🥳 Using MCTS-style search, DTS steadily improves sample quality with compute, matching the best baseline with 5× less compute!


Nanda H Krishna reposted

Preprint Alert 🚀 Multi-agent reinforcement learning (MARL) often assumes that agents know when other agents cooperate with them. But for humans, this isn’t always true. Example, plains indigenous groups used to leave resources for others to use at effigies called Manitokan. 1/8

dvnxmvl_hdf5's tweet image. Preprint Alert  🚀
Multi-agent reinforcement learning (MARL) often assumes that agents know when other agents cooperate with them. But for humans, this isn’t always true. Example, plains indigenous groups used to leave resources for others to use at effigies called Manitokan.
1/8

Nanda H Krishna reposted

The recordings from the 🌐🧠 Neuro Foundation Model workshop are up on the workshop website! Thanks again to our speakers, and everyone who attended. And thanks to the entire team @cole_hurwitz, @nandahkrishna, @averyryoo, @evadyer and @tyrell_turing for making this happen 🙌

mehdiazabou's tweet image. The recordings from the 🌐🧠 Neuro Foundation Model workshop are up on the workshop website! 

Thanks again to our speakers, and everyone who attended.

And thanks to the entire team @cole_hurwitz, @nandahkrishna, @averyryoo, @evadyer and @tyrell_turing  for making this happen 🙌

How can large-scale models + datasets revolutionize neuroscience 🧠🤖🌐? We are excited to announce our workshop: “Building a foundation model for the brain: datasets, theory, and models” at @CosyneMeeting #COSYNE2025. Join us in Mont-Tremblant, Canada from March 31 - April 1.

mehdiazabou's tweet image. How can large-scale models + datasets revolutionize neuroscience 🧠🤖🌐? We are excited to announce our workshop: “Building a foundation model for the brain: datasets, theory, and models” at @CosyneMeeting #COSYNE2025. Join us in Mont-Tremblant, Canada from March 31 - April 1.


Nanda H Krishna reposted

Just a couple days until Cosyne - stop by [3-083] this Saturday and say hi! @nandahkrishna @XimengMao

averyryoo's tweet image. Just a couple days until Cosyne - stop by [3-083] this Saturday and say hi! @nandahkrishna @XimengMao

Loading...

Something went wrong.


Something went wrong.