PyImageSearch's profile picture. Leading deep learning educator. Tutorials and courses on #DeepLearning, #ComputerVision, #LLMs, #GenAI, #OpenCV, #Keras, #TensorFlow, #PyTorch, and many more.

PyImageSearch

@PyImageSearch

Leading deep learning educator. Tutorials and courses on #DeepLearning, #ComputerVision, #LLMs, #GenAI, #OpenCV, #Keras, #TensorFlow, #PyTorch, and many more.

OCR just got a major update. 📄 Tencent dropped HunyuanOCR—complete with the paper, code, and weights. The vision capabilities looking pretty impressive. Dive into the repo here: github.com/Tencent-Hunyua… #ComputerVision #OpenSource

PyImageSearch's tweet image. OCR just got a major update. 📄 Tencent dropped HunyuanOCR—complete with the paper, code, and weights. The vision capabilities looking pretty impressive. 

Dive into the repo here: github.com/Tencent-Hunyua…

#ComputerVision #OpenSource

It's Cyber Monday, and this is your last chance for 30% OFF (or 70% OFF all products). The sale discount drops again at midnight PST. buff.ly/4nGlNCy

PyImageSearch's tweet image. It's Cyber Monday, and this is your last chance for 30% OFF (or 70% OFF all products). 

The sale discount drops again at midnight PST. 

buff.ly/4nGlNCy

Happy Black Friday! If you missed the Early Bird, you can still get 30% OFF any product or 'Double Your Discount' to 70% OFF our 'All Products' Package all weekend long. Get you discount here: buff.ly/4nGlNCy

PyImageSearch's tweet image. Happy Black Friday! 

If you missed the Early Bird, you can still get 30% OFF any product or 'Double Your Discount' to 70% OFF our 'All Products' Package all weekend long. 

Get you discount here: buff.ly/4nGlNCy

LAST CALL: Our 70% OFF 'Double Your Discount' offer (and the 35% OFF site-wide sale) ends at midnight PST. Don't miss the best price of the year buff.ly/4nGlNCy

PyImageSearch's tweet image. LAST CALL: Our 70% OFF 'Double Your Discount' offer (and the 35% OFF site-wide sale) ends at midnight PST. 

Don't miss the best price of the year 

buff.ly/4nGlNCy

What if you could build a interactive app UI just by describing it? A project from Google Research, Generative UI, aims to do just that, turning natural language prompts into custom interfaces. A potential game-changer for prototyping. generativeui.github.io #GenUI #AI

PyImageSearch's tweet image. What if you could build a interactive app UI just by describing it? 

A project from Google Research, Generative UI, aims to do just that, turning natural language prompts into custom interfaces. 

A potential game-changer for prototyping. 

generativeui.github.io 

#GenUI #AI

Ready to streamline your AI model deployment? Our guide, demonstrates how to build a FastAPI inference server for an ONNX model, containerize it with Docker, and prepare it for serverless deployment on AWS Lambda. Check it out! pyimg.co/a6uo2 #FastAPI #AWSLambda

PyImageSearch's tweet image. Ready to streamline your AI model deployment? 
Our guide, demonstrates how to build a FastAPI inference server for an ONNX model, containerize it with Docker, and prepare it for serverless deployment on AWS Lambda. 

Check it out! pyimg.co/a6uo2 

#FastAPI #AWSLambda

The 2025 PyImageSearch Black Friday Sale is LIVE. Get 35% OFF any course, or 'Double Your Discount' to 70% OFF our 'All Products' Package. This 'Early Bird' price is the best we'll offer all year. buff.ly/4nGlNCy

PyImageSearch's tweet image. The 2025 PyImageSearch Black Friday Sale is LIVE. 

Get 35% OFF any course, or 'Double Your Discount' to 70% OFF our 'All Products' Package. 

This 'Early Bird' price is the best we'll offer all year.

buff.ly/4nGlNCy

How robust can 3D human mesh recovery get from a single image, even with occlusions? The open-source project Meta AI's SAM-3D Body is showing incredible results. This looks like a huge step for AR/VR development. What do you think? github.com/facebookresear… #AI #3D #MetaAI

PyImageSearch's tweet image. How robust can 3D human mesh recovery get from a single image, even with occlusions? The open-source project Meta AI's SAM-3D Body is showing incredible results. 

This looks like a huge step for AR/VR development. 

What do you think? 

github.com/facebookresear… 
#AI #3D #MetaAI

Ever wished you could turn any object in a photo into a 3D model? A fascinating project from Meta AI, SAM-3D, does exactly that by lifting 2D masks into 3D point clouds. What are your thoughts on this? You can explore the model here: huggingface.co/facebook/sam-3… #AI #3D

PyImageSearch's tweet image. Ever wished you could turn any object in a photo into a 3D model? A fascinating project from Meta AI, SAM-3D, does exactly that by lifting 2D masks into 3D point clouds. 

What are your thoughts on this? 

You can explore the model here: huggingface.co/facebook/sam-3… 
#AI #3D

Fascinating development in computer vision: Meta's SAM-3 isn't just segmenting pixels, it's understanding concepts. A huge leap from just seeing an 'object' to identifying a ' red car'. What are the biggest implications of this? huggingface.co/facebook/sam3 #AI #ComputerVision

PyImageSearch's tweet image. Fascinating development in computer vision: Meta's SAM-3 isn't just segmenting pixels, it's understanding concepts. 

A huge leap from just seeing an 'object' to identifying a ' red car'. 

What are the biggest implications of this? 
huggingface.co/facebook/sam3 
#AI #ComputerVision

Ready to turbocharge your AI model deployment? We walk you through converting PyTorch models to ONNX, setting them up with FastAPI for amazingly fast, production-ready inference. See how we got a 2x speed boost on batch processing! pyimg.co/muf0c #PyTorch #ONNX

PyImageSearch's tweet image. Ready to turbocharge your AI model deployment? We walk you through converting PyTorch models to ONNX, setting them up with FastAPI for amazingly fast, production-ready inference. 
See how we got a 2x speed boost on batch processing! 

pyimg.co/muf0c 
#PyTorch #ONNX

Could one AI model eventually understand every spoken language? Meta's new Omnilingual ASR project is a fascinating step in that direction. Check out their approach to this massive challenge. What are the implications? ai.meta.com/blog/omnilingu… #AI #SpeechRecognition

PyImageSearch's tweet image. Could one AI model eventually understand every spoken language? 

Meta's new Omnilingual ASR project is a fascinating step in that direction. Check out their approach to this massive challenge. 

What are the implications? 
ai.meta.com/blog/omnilingu… 

#AI #SpeechRecognition

Fascinating new release from BAAI! Emu3.5 looks like a major step forward in multimodal AI, aiming to seamlessly blend text, image, and video understanding. What are your thoughts on its potential impact? Check out the announcement: emu.world #AI #Multimodal

PyImageSearch's tweet image. Fascinating new release from BAAI! Emu3.5 looks like a major step forward in multimodal AI, aiming to seamlessly blend text, image, and video understanding. 

What are your thoughts on its potential impact? 

Check out the announcement: emu.world 

#AI #Multimodal

Ever wonder how to parse complex documents in over 100 languages? This paper on PaddleOCR-VL presents a resource-efficient AI model that's setting a new standard for performance. What are your thoughts on this perspective? arxiv.org/pdf/2510.14528 #AI #DocumentParsing

PyImageSearch's tweet image. Ever wonder how to parse complex documents in over 100 languages? This paper on PaddleOCR-VL presents a resource-efficient AI model that's setting a new standard for performance. 

What are your thoughts on this perspective? 

arxiv.org/pdf/2510.14528 

#AI #DocumentParsing

Loading...

Something went wrong.


Something went wrong.