#computervisionmodels résultats de recherche

new research from Meta FAIR: Code World Model (CWM), a 32B research model we encourage the research community to research this open-weight model! pass@1 evals, for the curious: 65.8 % on SWE-bench Verified 68.6 % on LiveCodeBench 96.6 % on Math-500 76.0 % on AIME 2024 🧵

alexandr_wang's tweet image. new research from Meta FAIR: Code World Model (CWM), a 32B research model

we encourage the research community to research this open-weight model!

pass@1 evals, for the curious:

65.8 % on SWE-bench Verified
68.6 % on LiveCodeBench
96.6 % on Math-500
76.0 % on AIME 2024

🧵

Introducing our new tiny vision language model: LFM2-VL-3B 👀 > Expanded multilingual visual understanding: English, Japanese, French, Spanish, German, Italian, Portuguese, Arabic, Chinese, Korean > 51.8% on MM-IFEval (instruction following) > 71.4% on RealWorldQA (real-world…

LiquidAI_'s tweet image. Introducing our new tiny vision language model: LFM2-VL-3B 👀

> Expanded multilingual visual understanding: English, Japanese, French, Spanish, German, Italian, Portuguese, Arabic, Chinese, Korean
> 51.8% on MM-IFEval (instruction following)
> 71.4% on RealWorldQA (real-world…

1/4 What is Computer Vision ? Computer Vision is a branch of AI that enables machines to see, identify, and process images and videos in a way similar to human vision . The goal is not just to "see" but to understand the visual world by interpreting and making decisions based…

Compute is the most valuable resource of the digital age. It’s time to redefine our digital infrastructure. Introducing @cysic_xyz Network, making the #ComputeFi vision a reality : ✅ Owned, not rented A novel Proof of Compute consensus allows contributors with computing…



(🧵) Today, we release Meta Code World Model (CWM), a 32-billion-parameter dense LLM that enables novel research on improving code generation through agentic reasoning and planning with world models. ai.meta.com/research/publi…


15 ChatGPT prompts for better research _____________ P.S. Meet Dreamina — the world’s #1 AI image model. It lets you create cinematic visuals, edit interactively, and export in 4K quality all in one seamless experience. bit.ly/MuhammadAyan

socialwithaayan's tweet image. 15 ChatGPT prompts for better research

_____________
P.S. Meet Dreamina — the world’s #1 AI image model. It lets you create cinematic visuals, edit interactively, and export in 4K quality all in one seamless experience.

bit.ly/MuhammadAyan

There is no god tier video model. Ideal use cases for each model: (Full piece by @venturetwins in the a16z newsletter)

a16z's tweet image. There is no god tier video model.

Ideal use cases for each model:

(Full piece by @venturetwins in the a16z newsletter)

New from Meta FAIR: Code World Model (CWM), a 32B-parameter research model designed to explore how world models can transform code generation and reasoning about code. We believe in advancing research in world modeling and are sharing CWM under a research license to help empower…


AI Models learn patterns through training, which sets fixed rules (weights). During responses, they use attention mechanisms to dynamically focus on relevant parts of your specific input.

askjuneai's tweet image. AI Models learn patterns through training, which sets fixed rules (weights). During responses, they use attention mechanisms to dynamically focus on relevant parts of your specific input.

these are some computer vision papers that everyone must go through atleast once: 1. ResNets: arxiv.org/pdf/1512.03385… 2. YOLO: arxiv.org/abs/1506.02640 3. DeConv: lxu.me/mypapers/dcnn_… 4. GAN: arxiv.org/abs/1406.2661 5. Unet: arxiv.org/abs/1505.04597 6. Focal Loss:…


how a computer vision researcher sees the world (and solves most problems in vision).

jbhuang0604's tweet image. how a computer vision researcher sees the world (and solves most problems in vision).

how a mathematician sees the world

oprydai's tweet image. how a mathematician sees the world


A wave of computer vision breakthroughs reshapes how machines “see”—new models generate dynamic 3D worlds from text, edit images with physical precision, and accelerate diffusion by 5× for real-time creativity. #ComputerVision #GenAI youtube.com/watch?v=U6afOP…

NetworkTrend's tweet card. AI Vision Revolution: 16 Breakthrough Papers Transform Computer...

youtube.com

YouTube

AI Vision Revolution: 16 Breakthrough Papers Transform Computer...


Video generation is slow. 5 techniques to make your model 10-40x faster! I spoke yesterday at the AI Infra & Open-Source Meetup by @dstackai about how to make video models more efficient. Here are the main highlights - with all the links you need for faster video diffusion…

deyneka_e's tweet image. Video generation is slow. 5 techniques to make your model 10-40x faster!

I spoke yesterday at the AI Infra & Open-Source Meetup by @dstackai about how to make video models more efficient.

Here are the main highlights - with all the links you need for faster video diffusion…

want to fine-tune models for OCR/document??? 📑 two tutorials for you 🫡 > fine-tune Kosmos2.5 with grounding: if you have data with bounding boxes + text inside > fine-tune Florence-2 on DocVQA: if you search for answers in a document plug and play with other VLMs! 💗

mervenoyann's tweet image. want to fine-tune models for OCR/document??? 📑

two tutorials for you 🫡
> fine-tune Kosmos2.5 with grounding: if you have data with bounding boxes + text inside 
> fine-tune Florence-2 on DocVQA: if you search for answers in a document

plug and play with other VLMs! 💗

Check out this collection of over 50 #computervision examples that include code! ow.ly/pkTm30esoOa

MATLAB's tweet image. Check out this collection of over 50 #computervision examples that include code! ow.ly/pkTm30esoOa

Humans see text — but LLMs don’t. I wrote a short blog post exploring how models can perceive text visually rather than tokenize it: 🔗 csu-jpg.github.io/Blog/people_se… From PIXEL, CLIPPO, VisInContext, VIST to DeepSeek-OCR, this is a quick story of how vision-centric modeling is…


Say hello to DINOv3 🦖🦖🦖 A major release that raises the bar of self-supervised vision foundation models. With stunning high-resolution dense features, it’s a game-changer for vision tasks! We scaled model size and training data, but here's what makes it special 👇

BaldassarreFe's tweet image. Say hello to DINOv3 🦖🦖🦖

A major release that raises the bar of self-supervised vision foundation models.
With stunning high-resolution dense features, it’s a game-changer for vision tasks!

We scaled model size and training data, but here's what makes it special 👇
BaldassarreFe's tweet image. Say hello to DINOv3 🦖🦖🦖

A major release that raises the bar of self-supervised vision foundation models.
With stunning high-resolution dense features, it’s a game-changer for vision tasks!

We scaled model size and training data, but here's what makes it special 👇
BaldassarreFe's tweet image. Say hello to DINOv3 🦖🦖🦖

A major release that raises the bar of self-supervised vision foundation models.
With stunning high-resolution dense features, it’s a game-changer for vision tasks!

We scaled model size and training data, but here's what makes it special 👇
BaldassarreFe's tweet image. Say hello to DINOv3 🦖🦖🦖

A major release that raises the bar of self-supervised vision foundation models.
With stunning high-resolution dense features, it’s a game-changer for vision tasks!

We scaled model size and training data, but here's what makes it special 👇

15 ChatGPT prompts for better research _____________ P.S. Meet Dreamina — the world’s #1 AI image model. It lets you create cinematic visuals, edit interactively, and export in 4K quality all in one seamless experience. bit.ly/MuhammadAyan

socialwithaayan's tweet image. 15 ChatGPT prompts for better research

_____________
P.S. Meet Dreamina — the world’s #1 AI image model. It lets you create cinematic visuals, edit interactively, and export in 4K quality all in one seamless experience.

bit.ly/MuhammadAyan

🚨 BREAKING: ByteDance just updated Dreamina, powered by Seedream 4.0 the AI image model that beat Nano Banana to rank #1 globally. It’s not just a generator. It’s your all-in-one creative studio 👇

socialwithaayan's tweet image. 🚨 BREAKING: ByteDance just updated Dreamina, powered by Seedream 4.0 the AI image model that beat Nano Banana to rank #1 globally.

It’s not just a generator. It’s your all-in-one creative studio 👇


Announcing our State of Generative Media Survey Report 2025! Based on responses from the community, we’ve put together a report covering the key trends in media generation adoption Responses are based on our survey of ~300 developers and creators to understand how the community…

ArtificialAnlys's tweet image. Announcing our State of Generative Media Survey Report 2025! Based on responses from the community, we’ve put together a report covering the key trends in media generation adoption

Responses are based on our survey of ~300 developers and creators to understand how the community…

Introducing DINOv3: a state-of-the-art computer vision model trained with self-supervised learning (SSL) that produces powerful, high-resolution image features. For the first time, a single frozen vision backbone outperforms specialized solutions on multiple long-standing dense…


🤔 Wondering how your competitors use #ComputerVision solutions and whether your company could benefit from the technology, too? ⬇️ Our latest article will give you a hint! bit.ly/CV-applications #ComputerVisionApplications #ComputerVisionModels #AI


Many preventable car accidents happen each year because drivers lose focus. Car companies like #BMW are using #computervisionmodels and #annotateddatasets with #keypoints and #boundingboxes to improve driver monitoring systems and reduce accidents. #AI linkedin.com/feed/update/ur…


By identifying a fundamental property called #PerceptualStraightness, #MIT researchers have made strides in training #ComputerVisionModels to learn like humans. How? By enhancing the computer's ability to represent the visual world in a predictable manner: bit.ly/448EwFT


Betterview Data Scientist Sean Ridge will be at InsurTech NY Spring Conference this year! Sean is an expert on Betterview’s #PredictiveAnalytics and #ComputerVisionModels which help #PredictAndPRevent losses. Set up time to talk with Sean today. hubs.ly/Q0153dMd0


Object-recognition dataset stumped the world’s best computer vision models news.mit.edu/2019/object-re… #Object-recognition #WorldS #ComputerVisionModels


This new Object-Recognition Dataset, ‘ObjectNet’, stumped the leading Computer Vision Models marktechpost.com/2019/12/11/thi… #ComputerVisionModels

darioandriani's tweet image. This new Object-Recognition Dataset, ‘ObjectNet’, stumped the leading Computer Vision Models
marktechpost.com/2019/12/11/thi…  #ComputerVisionModels

Aucun résultat pour "#computervisionmodels"

Gregory S. Dawson, Kevin C. Desouza, and James S. Denford. Read more 👉 aikn.co/560eb0 #ExistingBiases #CommercialSystems #ComputerVisionModels #VisionModels

brandposture1's tweet image. Gregory S. Dawson, Kevin C. Desouza, and James S. Denford.

Read more 👉 aikn.co/560eb0

#ExistingBiases #CommercialSystems #ComputerVisionModels #VisionModels

UNDERSTANDING ARTIFICIAL INTELLIGENCE SPENDING Read more 👉 aikn.co/517dba #CommercialSystems #ComputerVisionModels #VisionModels #Pre-trained

brandposture1's tweet image. UNDERSTANDING ARTIFICIAL INTELLIGENCE SPENDING

Read more 👉 aikn.co/517dba

#CommercialSystems #ComputerVisionModels #VisionModels #Pre-trained

NONRESIDENT FELLOW - GOVERNANCE STUDIES, CENTER FOR TECHNOLOGY INNOVATION. Read more 👉 aikn.co/b2a69b #CommercialSystems #ComputerVisionModels #VisionModels #Pre-trained

brandposture1's tweet image. NONRESIDENT FELLOW - GOVERNANCE STUDIES, CENTER FOR TECHNOLOGY INNOVATION.

Read more 👉 aikn.co/b2a69b

#CommercialSystems #ComputerVisionModels #VisionModels #Pre-trained

In comparison, faces of men were autocompleted with suits or career-related attire 42 percent of the time. Read more 👉 aikn.co/71b667 #CommercialSystems #ComputerVisionModels #VisionModels #Pre-trained

brandposture1's tweet image. In comparison, faces of men were autocompleted with suits or career-related attire 42 percent of the time.

Read more 👉 aikn.co/71b667

#CommercialSystems #ComputerVisionModels #VisionModels #Pre-trained

This new Object-Recognition Dataset, ‘ObjectNet’, stumped the leading Computer Vision Models marktechpost.com/2019/12/11/thi… #ComputerVisionModels

darioandriani's tweet image. This new Object-Recognition Dataset, ‘ObjectNet’, stumped the leading Computer Vision Models
marktechpost.com/2019/12/11/thi…  #ComputerVisionModels

Loading...

Something went wrong.


Something went wrong.


United States Trends