#automatic_speech_recognition_and_understanding 검색 결과
🧠 From neural networks to natural language, speech-to-text has evolved into one of AI’s most practical tools. It's transforming how teams capture, analyze & share information, faster than ever before. Here's how it works 👉 bit.ly/4njiUj8 #AI #SpeechRecognition

Google just released Imagen 3! Their latest text-to-image generator. Here's a couple of side-by-side with Midjourney & Flux

#CyberpunkisNow A new AI/algorithm can accurately reconstructs faces from tiny 16×16 pixel input images. Top row are low resolution images, middle row are the AI's output, bottom row are the original photos. More Info- iforcedabot.com/photo-realisti… arxiv.org/abs/1908.08239

OpenAI: We have the most sophisticated content filtering system in the world OpenAI's content filtering system:

AI Models learn patterns through training, which sets fixed rules (weights). During responses, they use attention mechanisms to dynamically focus on relevant parts of your specific input.

if i'm understanding this correctly, you can use a pure text encoder model to find text that lets you reconstruct an image from the text encoding. basically, the latent space of a text model is expressive enough to serve as a compilation target for images




"WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)"

Since y’all seem to be interested in this tech, here’s a simple diagram showing how it works. The solid colored objects on the right are what the computer sees, it will compare their shape to hundreds of thousands of samples in its database to label them.

🚨BREAKING: AI just killed censorship. EternalAI just dropped an uncensored image & video model. It creates exactly what you ask for: text, image, or video. It’s fast, free, and totally unfiltered. Here's how it works:

A image to paragraph model with ChatGPT. Low-level visual semantic extraction with BLIP2, OFA, GRIT, Segment-anything. High-level reasoning with ChatGPT. Can Run on 1 8GB GPU card! Github: github.com/showlab/Image2…




It is hard to grasp how far we have already come with AI. Images can no longer be distinguished from reality. After the meme a few examples. All using Flux1.1




The images aren’t AI. All of the type is clear and not garbled on each image. There appears to be a sharpening filter which could use some AI technology though.




Hey, I think this image you used is actually an ai generated image. So here are some real examples you could use instead /nm




Something went wrong.
Something went wrong.
United States Trends
- 1. Wemby 52.2K posts
- 2. Clippers 10.6K posts
- 3. Spurs 38.2K posts
- 4. Cooper Flagg 12K posts
- 5. Mavs 15.5K posts
- 6. #QueenRadio 14.5K posts
- 7. Maxey 10.4K posts
- 8. Sixers 23.2K posts
- 9. Embiid 13.5K posts
- 10. VJ Edgecombe 23.3K posts
- 11. #AEWDynamite 23.2K posts
- 12. Victor Wembanyama 11.3K posts
- 13. Knicks 34K posts
- 14. Anthony Davis 4,190 posts
- 15. Jazz 23K posts
- 16. Klay 7,501 posts
- 17. Bulls 24.4K posts
- 18. Pistons 6,797 posts
- 19. Celtics 25.7K posts
- 20. #PorVida 2,085 posts