#speechrecognition 검색 결과

🚀 New patent application alert: #US20250335739A1 by #Google. Discover how Convolution-Augmented Transformer Models enhance data processing tasks like #SpeechRecognition, sound separation, and #NLP. This innovative conformer model integrates feed-forward, self-attention, and…

PatentPulse's tweet image. 🚀 New patent application alert: #US20250335739A1 by #Google.

Discover how Convolution-Augmented Transformer Models enhance data processing tasks like #SpeechRecognition, sound separation, and #NLP.

This innovative conformer model integrates feed-forward, self-attention, and…
PatentPulse's tweet image. 🚀 New patent application alert: #US20250335739A1 by #Google.

Discover how Convolution-Augmented Transformer Models enhance data processing tasks like #SpeechRecognition, sound separation, and #NLP.

This innovative conformer model integrates feed-forward, self-attention, and…
PatentPulse's tweet image. 🚀 New patent application alert: #US20250335739A1 by #Google.

Discover how Convolution-Augmented Transformer Models enhance data processing tasks like #SpeechRecognition, sound separation, and #NLP.

This innovative conformer model integrates feed-forward, self-attention, and…

speech_recognition Power your Python projects with speech recognition! This library supports 15+ engines & APIs, including Google Cloud Speech API & OpenAI Whisper. #speechrecognition #python

D4Vinci1's tweet image. speech_recognition

Power your Python projects with speech recognition! This library supports 15+ engines & APIs, including Google Cloud Speech API & OpenAI Whisper. #speechrecognition #python

📢 New patent application #US20250278240A1 by #Google explores #SpeechRecognition using active acoustic sensing. A hearable device uses ultrasound signals in the ear canal to capture speech and muscle movements, enhancing recognition accuracy. This method fuses ultrasound data…

PatentPulse's tweet image. 📢 New patent application #US20250278240A1 by #Google explores #SpeechRecognition using active acoustic sensing. A hearable device uses ultrasound signals in the ear canal to capture speech and muscle movements, enhancing recognition accuracy. This method fuses ultrasound data…
PatentPulse's tweet image. 📢 New patent application #US20250278240A1 by #Google explores #SpeechRecognition using active acoustic sensing. A hearable device uses ultrasound signals in the ear canal to capture speech and muscle movements, enhancing recognition accuracy. This method fuses ultrasound data…
PatentPulse's tweet image. 📢 New patent application #US20250278240A1 by #Google explores #SpeechRecognition using active acoustic sensing. A hearable device uses ultrasound signals in the ear canal to capture speech and muscle movements, enhancing recognition accuracy. This method fuses ultrasound data…

moshi Introducing Moshi: a speech-text foundation model for real-time dialogue! Talk to Moshi, a full-duplex AI that can understand your inner monologue. Learn more about this innovative tech [link] #AI #SpeechRecognition

D4Vinci1's tweet image. moshi

Introducing Moshi: a speech-text foundation model for real-time dialogue! Talk to Moshi, a full-duplex AI that can understand your inner monologue. Learn more about this innovative tech [link] #AI #SpeechRecognition

1/4 🤖 Remember when we did Bot or Not: Text Edition? Guess what? With thanks to Hope McVean, @GeorgebrownLing and I have just launched the essential follow-up - Bot or Not: Audio Edition. 🤩 Can you distinguish between humans and AI just by listening? #AI #SpeechRecognition

DrClaireH's tweet image. 1/4 🤖 Remember when we did Bot or Not: Text Edition? Guess what? With thanks to Hope McVean, @GeorgebrownLing and I have just launched the essential follow-up - Bot or Not: Audio Edition. 🤩

Can you distinguish between humans and AI just by listening? #AI #SpeechRecognition

Exploring unintended memorization in #SpeechRecognition! 🗣️ #Google's patent application #US20250279112A1 reveals a method to assess how audio encoders memorize data. By using un-transcribed speech & synthetic canary utterances, the technique measures memorization levels in #AI

PatentPulse's tweet image. Exploring unintended memorization in #SpeechRecognition! 🗣️

#Google's patent application #US20250279112A1 reveals a method to assess how audio encoders memorize data.

By using un-transcribed speech & synthetic canary utterances, the technique measures memorization levels in #AI…
PatentPulse's tweet image. Exploring unintended memorization in #SpeechRecognition! 🗣️

#Google's patent application #US20250279112A1 reveals a method to assess how audio encoders memorize data.

By using un-transcribed speech & synthetic canary utterances, the technique measures memorization levels in #AI…
PatentPulse's tweet image. Exploring unintended memorization in #SpeechRecognition! 🗣️

#Google's patent application #US20250279112A1 reveals a method to assess how audio encoders memorize data.

By using un-transcribed speech & synthetic canary utterances, the technique measures memorization levels in #AI…

New patent application #US20250299678A1 by #Meta introduces advanced #SpeechRecognition tech. It uses multi-path acoustic echo cancellation and beamforming to refine audio from multiple microphones, enabling accurate speech-to-text transcription. #AI $META

PatentPulse's tweet image. New patent application #US20250299678A1 by #Meta introduces advanced #SpeechRecognition tech. 

It uses multi-path acoustic echo cancellation and beamforming to refine audio from multiple microphones, enabling accurate speech-to-text transcription. #AI $META
PatentPulse's tweet image. New patent application #US20250299678A1 by #Meta introduces advanced #SpeechRecognition tech. 

It uses multi-path acoustic echo cancellation and beamforming to refine audio from multiple microphones, enabling accurate speech-to-text transcription. #AI $META
PatentPulse's tweet image. New patent application #US20250299678A1 by #Meta introduces advanced #SpeechRecognition tech. 

It uses multi-path acoustic echo cancellation and beamforming to refine audio from multiple microphones, enabling accurate speech-to-text transcription. #AI $META

voice-pro Transform multimedia content creation with Voice-Pro, a web app for AI-powered speech recognition, translation, and dubbing, featuring top-tier speech recognition, zero-shot voice cloning, and multilingual text-to-speech. #VoicePro #AI #SpeechRecognition

D4Vinci1's tweet image. voice-pro

Transform multimedia content creation with Voice-Pro, a web app for AI-powered speech recognition, translation, and dubbing, featuring top-tier speech recognition, zero-shot voice cloning, and multilingual text-to-speech. #VoicePro #AI #SpeechRecognition

🎉 Woohoo! We’ve fixed the Whisper hallucination issue when no one’s speaking — thanks to Apple’s in-built speech detection 🗣️✨ Now silence stays silent, and transcripts stay clean. 🚀 Retweet & Reply "Doggy" if you want access! #ai #whisper #speechRecognition

geetpurwar's tweet image. 🎉 Woohoo! We’ve fixed the Whisper hallucination issue when no one’s speaking — thanks to Apple’s in-built speech detection 🗣️✨

Now silence stays silent, and transcripts stay clean. 🚀

Retweet & Reply "Doggy" if you want access!

#ai #whisper #speechRecognition

wenet Introducing WeNet, a production-ready, lightweight, and accurate ASR (Automatic Speech Recognition) model. Transcribe audio with ease using Python! #WeNet #ASR #SpeechRecognition

D4Vinci1's tweet image. wenet

Introducing WeNet, a production-ready, lightweight, and accurate ASR (Automatic Speech Recognition) model. Transcribe audio with ease using Python! #WeNet #ASR #SpeechRecognition

STT that actually keeps up with real-time voice 🎙️⚡️ We just expanded our @DeepgramAI integration to Telnyx Voice API + TeXML! This means enterprise-grade transcription in 30+ languages, ready for production A quick thread 🧵 ↓ #VoiceAI #DevTools #SpeechRecognition


🎙️ Noisy interviews? Rap songs? 11 languages? Qwen3 ASR handles it all—even singing with background music. Speech recognition built for the real world, not studios. Why this might be the breakthrough 👇 c-sharpcorner.com/article/qwen3-… by @sarthak_v2 via @CsharpCorner #SpeechRecognition

sarthak_v2's tweet image. 🎙️ Noisy interviews? Rap songs? 11 languages?
Qwen3 ASR handles it all—even singing with background music.
Speech recognition built for the real world, not studios.
Why this might be the breakthrough 👇 c-sharpcorner.com/article/qwen3-… by @sarthak_v2 via @CsharpCorner
#SpeechRecognition

Pay respects to the Dark Queen👑at her shrine. Say 'We Live to Serve' and you are granted an audience! Then you can ask for treasure🏆, or healing and more! The game listens to what you and your friends say. 👄 #indiegame #horrorgame #speechrecognition #gamedev


.@KnuEdge is taking kneural knetworks to the knext level, delivering true #datasecurity for both proprietary business information as well as sensitive personal customer information across the network. Their #voicebiometrics use #speechrecognition for #machinecontrol, #sowenamedit

WhereWords's tweet image. .@KnuEdge is taking kneural knetworks to the knext level, delivering true #datasecurity for both proprietary business information as well as sensitive personal customer information across the network. Their #voicebiometrics use #speechrecognition for #machinecontrol, #sowenamedit…

From #SpeechRecognition to sound event detection, machines are learning to listen. #ComputerAudition is moving from single-task tools to powerful, multitasking foundation models. Learn more in a new article published in @ProceedingsIEEE's April 2025 issue: bit.ly/ProceedingsIEE…

ProceedingsIEEE's tweet image. From #SpeechRecognition to sound event detection, machines are learning to listen. #ComputerAudition is moving from single-task tools to powerful, multitasking foundation models. Learn more in a new article published in @ProceedingsIEEE's April 2025 issue: bit.ly/ProceedingsIEE…

Your voice holds business intelligence. We help you unlock it. ✔ Live meeting transcription ✔ Voice-enabled enterprise tools ✔ Industry-specific ASR models 👉 inexture.ai #ArtificialIntelligence #VoiceAI #SpeechRecognition #InextureSolutions #TechForBusiness

inexture's tweet image. Your voice holds business intelligence.
We help you unlock it.
✔ Live meeting transcription
 ✔ Voice-enabled enterprise tools
 ✔ Industry-specific ASR models

👉 inexture.ai 

#ArtificialIntelligence #VoiceAI #SpeechRecognition #InextureSolutions #TechForBusiness

💻 Empower productivity with Nuance Assistive Technology! 🚀 From speech recognition to advanced dictation tools, discover solutions designed to transform how you work and communicate. bit.ly/3zJ5M43 #AssistiveTech #Nuance #SpeechRecognition #WorkSmart

_DSEDU's tweet image. 💻 Empower productivity with Nuance Assistive Technology! 🚀 From speech recognition to advanced dictation tools, discover solutions designed to transform how you work and communicate.

bit.ly/3zJ5M43 

#AssistiveTech #Nuance #SpeechRecognition #WorkSmart

meta ai's omnilingual ASR covers 1600+ languages, tackling speech recognition for underserved tongues. impressive scale, but real-world deployment in noisy enterprise settings remains unproven. thoughts? marktechpost.com/2025/11/11/met…... #AI #SpeechRecognition


🎙️ Noisy interviews? Rap songs? 11 languages? Qwen3 ASR handles it all—even singing with background music. Speech recognition built for the real world, not studios. Why this might be the breakthrough 👇 c-sharpcorner.com/article/qwen3-… by @sarthak_v2 via @CsharpCorner #SpeechRecognition

sarthak_v2's tweet image. 🎙️ Noisy interviews? Rap songs? 11 languages?
Qwen3 ASR handles it all—even singing with background music.
Speech recognition built for the real world, not studios.
Why this might be the breakthrough 👇 c-sharpcorner.com/article/qwen3-… by @sarthak_v2 via @CsharpCorner
#SpeechRecognition

Announcing CoSHE-Eval — a Hindi–English code-switching ASR evaluation benchmark from Soket AI Labs. 30 hours of curated, human-verified bilingual speech for ASR evaluation. Read more soket.ai/blogs/coshe_ev… Dataset → huggingface.co/datasets/soket… #ASR #SpeechRecognition


Raw audio's messy! Feature extraction (like MFCCs) cuts noise, keeps speech essence so machines understand words. #SpeechRecognition #AI milvus.io/ai-quick-refer…


Optimize speech recognition: track WER, latency, robustness to ensure great voice tech. #SpeechRecognition #DevTips milvus.io/ai-quick-refer…


Meta AI unveils Omnilingual ASR! 🗣️🌍 Transcribes 1600+ languages, including low-resource ones, with impressive accuracy. Built on wav2vec 2.0, it offers ZST and supports endangered languages. A game-changer for global communication! #AI #SpeechRecognition #Language


💻 Empower productivity with Nuance Assistive Technology! 🚀 From speech recognition to advanced dictation tools, discover solutions designed to transform how you work and communicate. bit.ly/3zJ5M43 #AssistiveTech #Nuance #SpeechRecognition #WorkSmart

_DSEDU's tweet image. 💻 Empower productivity with Nuance Assistive Technology! 🚀 From speech recognition to advanced dictation tools, discover solutions designed to transform how you work and communicate.

bit.ly/3zJ5M43 

#AssistiveTech #Nuance #SpeechRecognition #WorkSmart

✨ Thank you to our gold sponsor, JPMorgan Chase, for your generous support of IEEE ASRU 2025! We’re looking forward to a successful conference thanks to your support! 🔗 2025.ieeeasru.org/sponsors/spons… #SpeechRecognition #Hawaii #ArtificialIntelligence @IEEEsps


Shoutout to Badr al-Absi (Badrex) & contributors for the Speech, Languages of Ethiopia collection on Hugging Face! Covers Amharic,Tigrinya, Wolaytta, Afaan Oromo, Sidama & more key for low-resource speech AI. 🔗 huggingface.co/collections/ba… #Leyu #SpeechRecognition #EthiopianLanguages

Leyu_Ai's tweet image. Shoutout to Badr al-Absi (Badrex) & contributors for the Speech, Languages of Ethiopia collection on Hugging Face! Covers Amharic,Tigrinya, Wolaytta, Afaan Oromo, Sidama & more key for low-resource speech AI.
🔗 huggingface.co/collections/ba…
#Leyu #SpeechRecognition #EthiopianLanguages

📢 New patent application #US20250278240A1 by #Google explores #SpeechRecognition using active acoustic sensing. A hearable device uses ultrasound signals in the ear canal to capture speech and muscle movements, enhancing recognition accuracy. This method fuses ultrasound data…

PatentPulse's tweet image. 📢 New patent application #US20250278240A1 by #Google explores #SpeechRecognition using active acoustic sensing. A hearable device uses ultrasound signals in the ear canal to capture speech and muscle movements, enhancing recognition accuracy. This method fuses ultrasound data…
PatentPulse's tweet image. 📢 New patent application #US20250278240A1 by #Google explores #SpeechRecognition using active acoustic sensing. A hearable device uses ultrasound signals in the ear canal to capture speech and muscle movements, enhancing recognition accuracy. This method fuses ultrasound data…
PatentPulse's tweet image. 📢 New patent application #US20250278240A1 by #Google explores #SpeechRecognition using active acoustic sensing. A hearable device uses ultrasound signals in the ear canal to capture speech and muscle movements, enhancing recognition accuracy. This method fuses ultrasound data…

The easiest way to convert messy thoughts into clear text. Just hit record. Then start rambling. AudioPen will clean things up when you're done. Price: Fremium Source: Audiopen(.)ai #ai #speechrecognition #speaker #speechtotext #audiopen #futuretools #foan82

foan82's tweet image. The easiest way to convert messy thoughts into clear text.
Just hit record. Then start rambling.
AudioPen will clean things up when you're done.

Price: Fremium
Source: Audiopen(.)ai

#ai #speechrecognition #speaker #speechtotext #audiopen #futuretools #foan82

🚀 New patent application alert: #US20250335739A1 by #Google. Discover how Convolution-Augmented Transformer Models enhance data processing tasks like #SpeechRecognition, sound separation, and #NLP. This innovative conformer model integrates feed-forward, self-attention, and…

PatentPulse's tweet image. 🚀 New patent application alert: #US20250335739A1 by #Google.

Discover how Convolution-Augmented Transformer Models enhance data processing tasks like #SpeechRecognition, sound separation, and #NLP.

This innovative conformer model integrates feed-forward, self-attention, and…
PatentPulse's tweet image. 🚀 New patent application alert: #US20250335739A1 by #Google.

Discover how Convolution-Augmented Transformer Models enhance data processing tasks like #SpeechRecognition, sound separation, and #NLP.

This innovative conformer model integrates feed-forward, self-attention, and…
PatentPulse's tweet image. 🚀 New patent application alert: #US20250335739A1 by #Google.

Discover how Convolution-Augmented Transformer Models enhance data processing tasks like #SpeechRecognition, sound separation, and #NLP.

This innovative conformer model integrates feed-forward, self-attention, and…

From #SpeechRecognition to sound event detection, machines are learning to listen. #ComputerAudition is moving from single-task tools to powerful, multitasking foundation models. Learn more in a new article published in @ProceedingsIEEE's April 2025 issue: bit.ly/ProceedingsIEE…

ProceedingsIEEE's tweet image. From #SpeechRecognition to sound event detection, machines are learning to listen. #ComputerAudition is moving from single-task tools to powerful, multitasking foundation models. Learn more in a new article published in @ProceedingsIEEE's April 2025 issue: bit.ly/ProceedingsIEE…

When the listener instantly recognizes a latent meaning in their consciousness(unconscious) through hearing #words. #language #speechrecognition #sphot2023

saarthi_ai's tweet image. When the listener instantly recognizes a latent meaning in their consciousness(unconscious) through hearing #words.

#language #speechrecognition #sphot2023
saarthi_ai's tweet image. When the listener instantly recognizes a latent meaning in their consciousness(unconscious) through hearing #words.

#language #speechrecognition #sphot2023
saarthi_ai's tweet image. When the listener instantly recognizes a latent meaning in their consciousness(unconscious) through hearing #words.

#language #speechrecognition #sphot2023

speech_recognition Power your Python projects with speech recognition! This library supports 15+ engines & APIs, including Google Cloud Speech API & OpenAI Whisper. #speechrecognition #python

D4Vinci1's tweet image. speech_recognition

Power your Python projects with speech recognition! This library supports 15+ engines & APIs, including Google Cloud Speech API & OpenAI Whisper. #speechrecognition #python

Looking for lightning-fast #ASR? Check out NVIDIA NeMo Parakeet-TDT. The open-source model is #1 on the @HuggingFace Open ASR Leaderboard and is 64% faster than the previous best model. #SpeechRecognition #AInvda.ws/3SxaVBs

NVIDIAAIDev's tweet image. Looking for lightning-fast #ASR? Check out NVIDIA NeMo Parakeet-TDT.  

The open-source model is #1 on the @HuggingFace Open ASR Leaderboard and is 64% faster than the previous best model. #SpeechRecognition #AI

⚡ nvda.ws/3SxaVBs

moshi Introducing Moshi: a speech-text foundation model for real-time dialogue! Talk to Moshi, a full-duplex AI that can understand your inner monologue. Learn more about this innovative tech [link] #AI #SpeechRecognition

D4Vinci1's tweet image. moshi

Introducing Moshi: a speech-text foundation model for real-time dialogue! Talk to Moshi, a full-duplex AI that can understand your inner monologue. Learn more about this innovative tech [link] #AI #SpeechRecognition

🔥 Unleash the power of #LargeLanguageModels to transform text into spoken magic! 🎙️ #NaturalLanguageProcessing #SpeechRecognition #AI🧵👇

bitlauncherai's tweet image. 🔥 Unleash the power of #LargeLanguageModels to transform text into spoken magic! 🎙️ #NaturalLanguageProcessing #SpeechRecognition #AI🧵👇

Thrilled to share our paper on multilingual ASR for code-switched Yoruba-English speech was accepted at @DeepIndaba #LyngualLabs #LowResourceAI #SpeechRecognition #DLI2025

LyngualLabs's tweet image. Thrilled to share our paper on multilingual ASR for code-switched Yoruba-English speech was accepted at @DeepIndaba

#LyngualLabs #LowResourceAI #SpeechRecognition #DLI2025
LyngualLabs's tweet image. Thrilled to share our paper on multilingual ASR for code-switched Yoruba-English speech was accepted at @DeepIndaba

#LyngualLabs #LowResourceAI #SpeechRecognition #DLI2025

.@KnuEdge is taking kneural knetworks to the knext level, delivering true #datasecurity for both proprietary business information as well as sensitive personal customer information across the network. Their #voicebiometrics use #speechrecognition for #machinecontrol, #sowenamedit

WhereWords's tweet image. .@KnuEdge is taking kneural knetworks to the knext level, delivering true #datasecurity for both proprietary business information as well as sensitive personal customer information across the network. Their #voicebiometrics use #speechrecognition for #machinecontrol, #sowenamedit…

Big Tech loves using African data. But do African communities benefit? Let’s talk about a new framework that changes the game 🎙️👇 - T H R E A D - #AfricanAI #MultilingualAI #SpeechRecognition #EthicalAI #DataForGood #TechForAfrica #LanguageEquity #AIForAll

LelapaAI's tweet image. Big Tech loves using African data. But do African communities benefit? Let’s talk about a new framework that changes the game 🎙️👇

- T H R E A D -

#AfricanAI #MultilingualAI #SpeechRecognition #EthicalAI #DataForGood #TechForAfrica #LanguageEquity #AIForAll

voice-pro Transform multimedia content creation with Voice-Pro, a web app for AI-powered speech recognition, translation, and dubbing, featuring top-tier speech recognition, zero-shot voice cloning, and multilingual text-to-speech. #VoicePro #AI #SpeechRecognition

D4Vinci1's tweet image. voice-pro

Transform multimedia content creation with Voice-Pro, a web app for AI-powered speech recognition, translation, and dubbing, featuring top-tier speech recognition, zero-shot voice cloning, and multilingual text-to-speech. #VoicePro #AI #SpeechRecognition

Meet @ijaonline Associate Editors: Karen Banai, Professor and Director of #Audiology at @UofHaifa. Her research interests include #speechlearning and its contribution to individual differences in #speechrecognition and to the success of #hearingrehabilitation

ijaonline's tweet image. Meet @ijaonline Associate Editors: Karen Banai, Professor and Director of #Audiology at @UofHaifa. Her research interests include #speechlearning and its contribution to individual differences in #speechrecognition and to the success of #hearingrehabilitation

Ever wonder what powers smart speakers, voice assistants, and real-time transcription apps? Let's take today's #TechConnectQuiz to see how much you know about technology! Leave a comment with your response. #AI #SpeechRecognition #DeepLearning #MachineLearning #EdTech

CUUttarPradesh's tweet image. Ever wonder what powers smart speakers, voice assistants, and real-time transcription apps?

Let's take today's #TechConnectQuiz to see how much you know about technology!

Leave a comment with your response.

#AI #SpeechRecognition #DeepLearning #MachineLearning #EdTech…

wenet Introducing WeNet, a production-ready, lightweight, and accurate ASR (Automatic Speech Recognition) model. Transcribe audio with ease using Python! #WeNet #ASR #SpeechRecognition

D4Vinci1's tweet image. wenet

Introducing WeNet, a production-ready, lightweight, and accurate ASR (Automatic Speech Recognition) model. Transcribe audio with ease using Python! #WeNet #ASR #SpeechRecognition

Loading...

Something went wrong.


Something went wrong.


United States Trends