MousaviPooneh's profile picture.

Pooneh Mousavi

@MousaviPooneh

مثبتة

“Ever tried. Ever failed. No matter. Try again. Fail again. Fail better.” Samuel Becket


Pooneh Mousavi أعاد

📢 Join our Conversational AI Reading Group! 📅 Thursday, Nov 6th | 11 AM - 12 PM EST 🎙 Speaker: Emmanouil Benetos (@emmanouilb) - Queen Mary University of London 📖 Topic: "Machine learning paradigms for music and audio understanding" 🔗 Details: (poonehmousavi.github.io/rg)


Pooneh Mousavi أعاد

📢 Join our Conversational AI Reading Group to know more about Google Gemini 2.5, a natively multimodal audio model developed over the past year. 📅 Thursday, Oct 30th | 11 AM - 12 PM EST 🎙 Speaker: Michael Han - Google DeepMind 🔗 Details: (poonehmousavi.github.io/rg)


Pooneh Mousavi أعاد

📢 Join our Conversational AI Reading Group! 📅 Thursday, Oct 23rd | 11 AM - 12 PM EST 🎙 Speaker: Joan Serrà @serrjoa - Sony AI 📖 Topic: "Supervised contrastive learning from weakly-labeled audio segments for musical version matching" 🔗 Details: (poonehmousavi.github.io/rg)


📢 Schedule Update! The Oct 16th session will start at 12PM . Please make sure to mark this change in your calendar so you don’t miss this great talk!

📢This week, our Conversational AI Reading Group is excited to have Jinyu Li from Microsoft. Please note: This week’s session will start one hour later than usual, at 12:00 PM instead of 11:00 AM. 📅 Thursday, Oct 16th | 12:00 - 13:00 EST 📖 Topic: The development of spoken LM



Pooneh Mousavi أعاد

This Thursday Oct 2nd , our Conversational AI RG is honored to host @Yoshua_Bengio , one of the world’s leading pioneers in AI. He will present: “A Safety Case for the Scientist AI.” Don’t miss this unique opportunity to join us online ! 🔗 Details: poonehmousavi.github.io/rg.html


Pooneh Mousavi أعاد

📢 Join our Conversational AI Reading Group! 📅 Thurs, Sep 25th | 11 AM - 12 PM EST 🎙 Speaker: Themos Stafylakis @themosst 📖 Topic: "Advances in Speaker Recognition: Pruning, Deepfake Detection, and Learning without Temporal Labels" 🔗 Details: (poonehmousavi.github.io/rg)


If you missed my session presenting our recent work “Discrete Audio Tokens: More Than a Survey!”, you can now find the recording on our YouTube channel and the slides on our website: ▶️ YouTube: youtu.be/iGNotmn5J5A?si… 🌐 Website: poonehmousavi.github.io/rg.html#fall20…

MousaviPooneh's tweet card. Discrete Audio Tokens: More Than a Survey! - Pooneh Mousavi

youtube.com

YouTube

Discrete Audio Tokens: More Than a Survey! - Pooneh Mousavi

📢 Our Conversational AI Reading Group is back! Join the first Fall 2025 session! 🤖 📅 Thursday, Sept 18 | 11 AM–12 PM EST 🎙 Speaker: Pooneh Mousavi (Mila) @MousaviPooneh 📖 Topic: “Discrete Audio Tokens: More Than a Survey!” 🌐 Details: poonehmousavi.github.io/rg



I’ll be presenting our survey paper “Discrete Audio Tokens: More Than a Survey!” at the first Fall 2025 session of the Conversational AI Reading Group. Looking forward to seeing you there and discussing ideas!

📢 Our Conversational AI Reading Group is back! Join the first Fall 2025 session! 🤖 📅 Thursday, Sept 18 | 11 AM–12 PM EST 🎙 Speaker: Pooneh Mousavi (Mila) @MousaviPooneh 📖 Topic: “Discrete Audio Tokens: More Than a Survey!” 🌐 Details: poonehmousavi.github.io/rg



Pooneh Mousavi أعاد

We’re back with a new series of Conversational AI Talks. Everyone’s invited! Feel free to share with your network. 🗓 Every Thursday, 11:00 AM – 12:00 PM EDT 🚀 Kicking off on September 18th with an exciting lineup of speakers. 🔗  More details: poonehmousavi.github.io/rg


I’m happy to share that our paper, "Discrete Audio Tokens: More Than a Survey!", has been accepted at TMLR. 🎉 📄 Read: arxiv.org/pdf/2506.10274 🔎 Explore our tokenizer database & submit yours: poonehmousavi.github.io/dates-website/…

🎉🥳 I am thrilled to share that our work on audio tokenisers has been accepted to #TMLR The tokeniser DB is ever updating so submit your new tokenisers 💪 poonehmousavi.github.io/dates-website/



📢 Presenting our paper “LiSTEN: Learning Soft Token Embeddings for Neural Audio LLMs” — an interpretable fine-tuning method for spoken language understanding. 🗓 Wed, Aug 20 | 08:30–10:30 📍 A11-P2B-03 Hope to see you there! 📄 arxiv.org/pdf/2505.18517 @ISCAInterspeech


Pooneh Mousavi أعاد

Our pick of the week by @beomseok_lee_: "ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs" by Pooneh Mousavi, @yingzhi_wang, @mirco_ravanelli, and @CemSubakan (2025) arxiv.org/abs/2505.19937 #SLU #speech #multimodal #LLM

Speech-language models show promise in multimodal tasks—but how well are speech & text actually aligned? 🤔 This paper arxiv.org/abs/2505.19937 proposes a new metric to measure layer-wise correlation between the two, with a focus on SLU tasks. 🔍🗣️📄



Pooneh Mousavi أعاد

📢 Join our Conversational AI Reading Group! 📅 Thursday, June 19th | 11 AM - 12 PM EST 🎙 Speaker: Yuki Mitsufuji (@mittu1204) - SonyAI 📖 Topic: "AI for Creators: Pushing Creative Abilities to the Next Level" 🔗 Details: (poonehmousavi.github.io/rg)


Pooneh Mousavi أعاد

``Discrete Audio Tokens: More Than a Survey!,'' Pooneh Mousavi, Gallil Maimon, Adel Moumen, Darius Petermann, Jiatong Shi, Haibin Wu, Haici Yang, Anastasia Kuznetsova, Artem Ploujnikov, Ricard Marxer, Bhuvana Ramabhadran, Benjamin Elizalde, Loren Lugosch… ift.tt/GA4ZC6u


Pooneh Mousavi أعاد

🎵💬 If you are interested in Audio Tokenisers, you should check out our new work! We empirically analysed existing tokenisers from every way - reconstruction, downstream, LMs and more. Grab yourself a ☕/🍺 and sit down for a read!

GallilMaimon's tweet image. 🎵💬 If you are interested in Audio Tokenisers, you should check out our new work!
We empirically analysed existing tokenisers from every way - reconstruction, downstream, LMs and more.

Grab yourself a ☕/🍺 and sit down for a read!

Pooneh Mousavi أعاد

🌟🌟 Great collaboration, with a diverse all-star team led by @MousaviPooneh - check it out👇 📄Paper - arxiv.org/abs/2506.10274 🌐Website (+updating tokeniser DB!) - poonehmousavi.github.io/dates-website/


🚀 We're excited to announce our latest work: "Discrete Audio Tokens: More Than a Survey!" It presents a comprehensive survey and benchmark of audio tokenizers across speech, music, and general audio. preprint: arxiv.org/pdf/2506.10274 website: poonehmousavi.github.io/dates-website/


Pooneh Mousavi أعاد

📢 Join our Conversational AI Reading Group! 📅 Thursday, June 12th | 11 AM - 12 PM EST 🎙 Speaker: Andros Tjandra 📖 Topic: "Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound" 🔗 Details: (poonehmousavi.github.io/rg)


Pooneh Mousavi أعاد

📢 Join our Conversational AI Reading Group! 📅 Thursday, May 29th | 11 AM - 12 PM EST 🎙 Speaker: Yossi Adi @adiyossLC 📖 Topic: "On The Landscape of Spoken Language Models" 🔗 Details: (poonehmousavi.github.io/rg)


Pooneh Mousavi أعاد

Learn about speaker diarization, the science behind it, and the future of diarization at ⁦@pyannoteAI⁩ research labs youtu.be/ECqxZgVevuI?fe…

hbredin's tweet card. "Speaker diarization, a (love) loss story" - Hervé Bredin

youtube.com

YouTube

"Speaker diarization, a (love) loss story" - Hervé Bredin


United States الاتجاهات

Loading...

Something went wrong.


Something went wrong.