
Satya Mallick
@LearnOpenCV
CEO, http://OpenCV.org. Course Director, http://OpenCV.org/Courses Entrepreneur. Ph.D. ( Computer Vision & Machine Learning ). Author: http://LearnOpenCV.com
قد يعجبك
📢VideoRAG: Redefining Long-Context Video Comprehension In this week’s deep dive, we explore another interesting approach for performing RAG on videos. VideoRAG, a groundbreaking framework that brings RAG to the world of extremely long videos. Unlike traditional LVLMs that…
San Diego Investors👇🏽 AI Beyond the Buzz: Smarter Money, Leaner Work, Safer Decisions in 2025 by Satya Mallick @LearnOpenCV Come on over to San Diego meeting this Saturday 11th October at 9am hosted by AAII San Diego!! <Details in link below> Want to know how AI is disrupting…

The more you show your 💜, the more we keep it coming every week :) +600 people have signed up for today's @SensAIHackademy hands-on workshop, starting in 2 hours, with Christoph Spinger from Ballee and @LearnOpenCV from @opencvlive ! And here is another interesting workshop…

📢The Ultimate Guide To VLM Evaluation Metrics, Datasets, And Benchmarks Evaluating Vision-Language Models (VLMs) is more than just checking accuracy. How would you know if your model understands a scene or is just hallucinating? Our new, comprehensive guide on LearnOpenCV…

The 3rd edition of my book Deep Learning with Python is being printed right now, and will be in bookstores within 2 weeks. You can order it now from Amazon or from Manning. This time, we're also releasing the whole thing as a 100% free website. I don't care if it reduces book…

📢 Getting Started with VLM on Jetson Nano Tiny Vision Language Models (VLMs) like Moondream2, LiquidAI’s LFM2-VL, Apple’s FastVLM, and Huggingface’s SmolVLM2 are bringing vision-language capabilities to the edge. In this tutorial, LearnOpenCV demonstrates how to deploy and run…
📢New Post Alert: 📙VLM on Edge Devices: Worth the Hype or Just a Novelty? The rise of Vision Language Models (VLMs) has been meteoric but can they really run effectively on edge devices? Our latest post is first in the series of experiments that we will continue for VLMs on…
📢AnomalyCLIP: Harnessing CLIP for Weakly-Supervised Video Anomaly Recognition In this week’s deep dive, we explore AnomalyCLIP, the first method to adapt CLIP’s vision–language latent space for Video Anomaly Recognition (VAR) under weak supervision. We break down how it learns…
📢AI for Video Understanding: From Content Moderation to Summarization In this blog post, we explore how to build a practical pipeline for AI-powered video understanding. We look at two main applications: video content moderation using CLIP and Gemini, and video summarization…
📢DINOv3: Scaling Self-Supervised Learning for Vision Foundation Models (Meta AI) DINOv3 is a next-generation vision foundation model trained purely with self-supervised learning. It introduces innovations that allow robust dense feature learning at scale with models reaching 7B…
📢☑️Video-RAG: Training-Free Retrieval for Long-Video LVLMs In this week’s deep dive, we implement Video-RAG as a training-free, single-pass pipeline and integrate it with LLaVA-Video-7B (Qwen2, 32K context), without APE - to keep things reproducible on today’s stacks. We enable…
Created this video using a single image using grok. Quite impressive
Huge computer science result: A Tsinghua professor JUST discovered the fastest shortest path algorithm for graphs in 40yrs. This improves on Turing award winner Tarjan’s O(m + nlogn) with Dijkstra’s, something every Computer Science student learns in college.

One word: relentless. just in the past two weeks, we’ve shipped: 🌐 Genie 3 - the most advanced world simulator ever 🤔 Gemini 2.5 Pro Deep Think available to Ultra subs 🎓 Gemini Pro free for uni students & $1B for US ed 🌍 AlphaEarth - a geospatial model of the entire planet…
📢Object Detection and Spatial Understanding with VLMs ft. Qwen2.5-VL Object Detection used to mean bounding boxes and pre-trained classes. Now? You can upload an image and ask: “What brand are the sneakers the person on the left is wearing?” Welcome to the world of…
Huge thanks to all the open source projects that've made a lot of the tech we rely on in the world possible: Linux Git FFmpeg PyTorch & TensorFlow Apache & Nginx MySQL, PostgreSQL, SQLite Chromium & Firefox GCC & LLVM Docker & Kubernetes Also, all the open-weight LLMs... and…
We released two open-weight reasoning models—gpt-oss-120b and gpt-oss-20b—under an Apache 2.0 license. Developed with open-source community feedback, these models deliver meaningful advancements in both reasoning capabilities & safety. openai.com/index/introduc…
📢LangGraph: Building a Self-Correcting RAG Agent for Code Generation Ready to level up your AI workflows? 🔄 In our latest #LangGraph post, we built a self-correcting RAG agent that writes Python code with Hugging Face Diffusers, runs it, learns from errors, and iterates until…

United States الاتجاهات
- 1. Baker 36.5K posts
- 2. Packers 32.7K posts
- 3. 49ers 34.3K posts
- 4. #BNBdip N/A
- 5. Bucs 11.7K posts
- 6. Flacco 12.4K posts
- 7. Cowboys 74.6K posts
- 8. Fred Warner 11.5K posts
- 9. Niners 5,601 posts
- 10. Cam Ward 3,004 posts
- 11. Zac Taylor 3,235 posts
- 12. Panthers 75.9K posts
- 13. #FTTB 4,422 posts
- 14. #GoPackGo 4,139 posts
- 15. Mac Jones 5,988 posts
- 16. Titans 24.3K posts
- 17. Tez Johnson 3,443 posts
- 18. #TNABoundForGlory 7,941 posts
- 19. #Bengals 3,283 posts
- 20. Browns 67K posts
قد يعجبك
-
Soumith Chintala
@soumithchintala -
Ian Goodfellow
@goodfellow_ian -
François Chollet
@fchollet -
Dr. Angelica Lim @petitegeek.bsky.social
@petitegeek -
Sebastian Ruder
@seb_ruder -
Sylvain Gugger
@GuggerSylvain -
OpenCV Live
@opencvlive -
Jeremy Howard
@jeremyphoward -
Gary Marcus
@GaryMarcus -
Russ Salakhutdinov
@rsalakhu -
Sander Dieleman
@sedielem -
Oriol Vinyals
@OriolVinyalsML -
Nando de Freitas
@NandoDF -
Tejas Kulkarni
@tejasdkulkarni -
Kyunghyun Cho
@kchonyc
Something went wrong.
Something went wrong.