#automatic_speech_recognition_and_understanding 검색 결과

"#automatic_speech_recognition_and_understanding"에 대한 결과가 없습니다

4PSA

14 시간

🧠 From neural networks to natural language, speech-to-text has evolved into one of AI’s most practical tools. It's transforming how teams capture, analyze & share information, faster than ever before. Here's how it works 👉 bit.ly/4njiUj8 #AI #SpeechRecognition

4psa's tweet image. 🧠 From neural networks to natural language, speech-to-text has evolved into one of AI’s most practical tools. It's transforming how teams capture, analyze &amp; share information, faster than ever before.
Here's how it works 👉 bit.ly/4njiUj8 #AI #SpeechRecognition

Umesh

@umesh_ai

. 5. 28.

Left : ChatGPT 4o Right : Ideogram Prompt in ALT.

Tom Dörr

@tom_doerr

. 9. 9.

open-source model for speech-to-speech and audio understanding

Tom Dörr

@tom_doerr

. 1. 30.

Vision-language model for images and text

Dogan Ural

@doganuraldesign

2024. 8. 9.

Google just released Imagen 3! Their latest text-to-image generator. Here's a couple of side-by-side with Midjourney & Flux

doganuraldesign's tweet image. Google just released Imagen 3!

Their latest text-to-image generator.

Here's a couple of side-by-side with Midjourney &amp; Flux

ΜΔDΞRΔS

@hackermaderas

2019. 9. 2.

#CyberpunkisNow A new AI/algorithm can accurately reconstructs faces from tiny 16×16 pixel input images. Top row are low resolution images, middle row are the AI's output, bottom row are the original photos. More Info- iforcedabot.com/photo-realisti… arxiv.org/abs/1908.08239

hackermaderas's tweet image. #CyberpunkisNow A new AI/algorithm can accurately reconstructs faces from tiny 16×16 pixel input images.

Top row are low resolution images, middle row are the AI's output, bottom row are the original photos.

More Info-
iforcedabot.com/photo-realisti…

arxiv.org/abs/1908.08239

ahhhhfs

@abskoop

. 5. 18.

Image Describer X：免费AI图像描述神器让每张图片“开口说话” 👉ahhhhfs.com/71441/

cts🌸

@gf_256

2022. 12. 1.

OpenAI: We have the most sophisticated content filtering system in the world OpenAI's content filtering system:

June

@askjuneai

11 시간

AI Models learn patterns through training, which sets fixed rules (weights). During responses, they use attention mechanisms to dynamically focus on relevant parts of your specific input.

askjuneai's tweet image. AI Models learn patterns through training, which sets fixed rules (weights). During responses, they use attention mechanisms to dynamically focus on relevant parts of your specific input.

madison

@dearmadisonblue

2023. 2. 20.

if i'm understanding this correctly, you can use a pure text encoder model to find text that lets you reconstruct an image from the text encoding. basically, the latent space of a text model is expressive enough to serve as a compilation target for images

dearmadisonblue's tweet image. if i'm understanding this correctly, you can use a pure text encoder model to find text that lets you reconstruct an image from the text encoding. basically, the latent space of a text model is expressive enough to serve as a compilation target for images

Tom Dörr

@tom_doerr

. 11. 8.

"WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)"

tom_doerr's tweet image. "WhisperX: Automatic Speech Recognition with Word-level Timestamps (&amp; Diarization)"

LisaVote50

@LisaVote50

. 6. 12.

ChatGPT returned a clearer image. AI is crazy

James 

@BellianJames

2018. 10. 11.

Since y’all seem to be interested in this tech, here’s a simple diagram showing how it works. The solid colored objects on the right are what the computer sees, it will compare their shape to hundreds of thousands of samples in its database to label them.

BellianJames's tweet image. Since y’all seem to be interested in this tech, here’s a simple diagram showing how it works. The solid colored objects on the right are what the computer sees, it will compare their shape to hundreds of thousands of samples in its database to label them.

Tom Dörr

@tom_doerr

. 10. 16.

multimodal AI model for real-time text, image, audio, and video chat

Jaynit Makwana

@JaynitMakwana

. 10. 21.

🚨BREAKING: AI just killed censorship. EternalAI just dropped an uncensored image & video model. It creates exactly what you ask for: text, image, or video. It’s fast, free, and totally unfiltered. Here's how it works:

JaynitMakwana's tweet image. 🚨BREAKING: AI just killed censorship.

EternalAI just dropped an uncensored image &amp; video model.

It creates exactly what you ask for: text, image, or video.

It’s fast, free, and totally unfiltered.

Here's how it works:

Jinpeng Wang

@awinyimgprocess

2023. 4. 12.

A image to paragraph model with ChatGPT. Low-level visual semantic extraction with BLIP2, OFA, GRIT, Segment-anything. High-level reasoning with ChatGPT. Can Run on 1 8GB GPU card! Github: github.com/showlab/Image2…

awinyimgprocess's tweet image. A image to paragraph model with ChatGPT.

Low-level visual semantic extraction with BLIP2, OFA, GRIT, Segment-anything.

High-level reasoning with ChatGPT.

Can Run on 1 8GB GPU card!

Github: github.com/showlab/Image2…

Chubby♨️

@kimmonismus

2024. 10. 5.

It is hard to grasp how far we have already come with AI. Images can no longer be distinguished from reality. After the meme a few examples. All using Flux1.1

kimmonismus's tweet image. It is hard to grasp how far we have already come with AI. Images can no longer be distinguished from reality.
After the meme a few examples. All using Flux1.1

Tom Dörr

@tom_doerr

. 2. 16.

Multimodal speech LLM for voice interactions

ExcitedUtterance

@Extd_utterance

. 10. 19.

The images aren’t AI. All of the type is clear and not garbled on each image. There appears to be a sharpening filter which could use some AI technology though.

Extd_utterance's tweet image. The images aren’t AI. All of the type is clear and not garbled on each image. There appears to be a sharpening filter which could use some AI technology though.

Kit 𓏲 ๋࣭ 🌹 cr: Jojolion

@k1t_catt

2024. 9. 28.

Hey, I think this image you used is actually an ai generated image. So here are some real examples you could use instead /nm

Something went wrong.

United States Trends

1. Wemby 52.2K posts
2. Clippers 10.6K posts
3. Spurs 38.2K posts
4. Cooper Flagg 12K posts
5. Mavs 15.5K posts
6. #QueenRadio 14.5K posts
7. Maxey 10.4K posts
8. Sixers 23.2K posts
9. Embiid 13.5K posts
10. VJ Edgecombe 23.3K posts
11. #AEWDynamite 23.2K posts
12. Victor Wembanyama 11.3K posts
13. Knicks 34K posts
14. Anthony Davis 4,190 posts
15. Jazz 23K posts
16. Klay 7,501 posts
17. Bulls 24.4K posts
18. Pistons 6,797 posts
19. Celtics 25.7K posts
20. #PorVida 2,085 posts