Josh Meyer
@_josh_meyer_
https://www.linkedin.com/in/josh-r-meyer/
You might like
Kyutai Speech-To-Text is now open-source! It’s streaming, supports batched inference, and runs blazingly fast: perfect for interactive applications. Check out the details here: kyutai.org/next/stt
We had to downsize due to NIH funding cuts and lay off a junior software engineer who is proficient in Python coding, crawling, LLMs, RAG, and other related areas. He is currently on OPT (24 months) and will need an H1B sponsor. If any startups are interested, pls DM. RT for…
🚨The ASR Hackathon is still ON! 💻🎙️ We’re building the future of speech recognition for low-resource African languages — and we want you to be part of it. It's not too late to submit your model! For more information: digital-umuganda.github.io/kasr_hackathon/ #AfrivoicekinyarwandaASR
The NaijaVoices Dataset (accepted to Interspeech 2025) arXiv link: arxiv.org/abs/2505.20564 video overview: supabase.manatee.work/storage/v1/obj…
𝚜/𝚁𝙴𝙰𝙳𝙼𝙴.𝚖𝚍/𝚆𝙰𝚃𝙲𝙷𝙼𝙴.𝚖𝚙𝟺/𝚐 I built a technical document -> video converter... with voice narration :) It's great for arXiv papers / API docs / server logs / etc Check it out! manatee.work
These aren't real... Signup at fluxions.ai for access to closed alpha. This is v0.0.1 so more improvements to come. Follow me for updates :)
After spending some hours on F5, I found passion to finalize this small post. I'm telling this for quite some time already though. alphacephei.com/nsh/2024/10/18…
Awesome new project: Whisper Turbo MLX by Josef Albers. A clean, single file (< 250 lines), and blazing fast implementation of Whisper Turbo in MLX:
190ms TTFB 👀
Today we’re introducing our latest Text-To-Speech model, Play 3.0 mini. It’s faster, more accurate, handles multiple languages, supports streaming from LLMs, and it’s more cost-efficient than ever before. Try it out here: play.ht/playground/?ut…
Inspired by the @AIatMeta's Chameleon and Llama Herd papers, llama3-s (Ichigo) is an early-fusion, audio and text, multimodal model. We're experimenting with this research entirely in the open, with an open-source codebase, open data, and open weights. 2/10
3 steps to run @huggingface "Parler TTS" AI Voice on your local machine. New tutorial video out now 😊! My step-by-step technical tutorial is now available on my "Thorsten-Voice" youtube channel. youtu.be/1X2LxAGn9tU
youtube.com
YouTube
3 steps to run HuggingFace 🤗 "Parler TTS" AI Voice on your local...
We just released Pixtral 12B paper on Arxiv: arxiv.org/abs/2410.07073
🍏 Apple ML research in Paris has multiple open internship positions!🍎 We are looking for Ph.D. students interested in generative modeling, optimization, large-scale learning or uncertainty quantification, with applications to challenging scientific problems. Details below 👇
I’ll be presenting a deep dive into how Moshi works at the next NLP Meetup in Paris, this Wednesday the 9th at 7pm. Register if you want to attend ! 🧩🔎🟢 meetup.com/fr-FR/paris-nl…
impressive
🎥 Today we’re premiering Meta Movie Gen: the most advanced media foundation models to-date. Developed by AI research teams at Meta, Movie Gen delivers state-of-the-art results across a range of capabilities. We’re excited for the potential of this line of research to usher in…
👀
United States Trends
- 1. Under Armour 3,104 posts
- 2. Blue Origin 9,931 posts
- 3. Megyn Kelly 33.9K posts
- 4. New Glenn 10.5K posts
- 5. Vine 36.3K posts
- 6. Senator Fetterman 20.4K posts
- 7. Brainiac 7,713 posts
- 8. Curry Brand 2,589 posts
- 9. CarPlay 4,587 posts
- 10. Eric Swalwell 30.6K posts
- 11. World Cup 106K posts
- 12. Operation SOUTHERN SPEAR 3,081 posts
- 13. Portugal 66.8K posts
- 14. Nike 25.8K posts
- 15. Matt Gaetz 16.6K posts
- 16. Padres 28.6K posts
- 17. #2025CaracasWordExpo 8,626 posts
- 18. GeForce Season 1,156 posts
- 19. Man of Tomorrow 7,690 posts
- 20. Grade 1 27.4K posts
You might like
-
arXiv Sound
@ArxivSound -
BUT Speech
@ButSpeech -
Shinji Watanabe
@shinjiw_at_cmu -
Neil Zeghidour
@neilzegh -
erica
@erica_cooper -
Mirco Ravanelli
@mirco_ravanelli -
AlphaCephei
@alphacep -
WAVLab | @CarnegieMellon
@WavLab -
Desh Raj
@rdesh26 -
erogol
@erogol -
Hervé "pyannote" Bredin
@hbredin -
CDT in Speech and Language Technologies
@sltcdt -
Yacine Jernite
@YJernite -
ISCA - Emmanuelle Foxonet
@ISCAFOX -
laurent besacier
@laurent_besacie
Something went wrong.
Something went wrong.