Computer Vision and Pattern Recognition Papers

@CSVisionPapers

Covers image processing, computer vision, pattern recognition, and scene understanding. (new submissions to http://arxiv.org, not affiliated with arXiv)

Prompt-Guided Spatial Understanding with RGB-D Transformers for Fine-Grained Object Relation Reasoning. arxiv.org/abs/2510.11996


PanoTPS-Net: Panoramic Room Layout Estimation via Thin Plate Spline Transformation. arxiv.org/abs/2510.11992


Task-Specific Dual-Model Framework for Comprehensive Traffic Safety Video Description and Analysis. arxiv.org/abs/2510.11907


MammoDINO: Anatomically Aware Self-Supervision for Mammographic Images. arxiv.org/abs/2510.11883


Data or Language Supervision: What Makes CLIP Better than DINO? arxiv.org/abs/2510.11835


Enhancing the Quality of 3D Lunar Maps Using JAXA's Kaguya Imagery. arxiv.org/abs/2510.11817


Fast Self-Supervised depth and mask aware Association for Multi-Object Tracking. arxiv.org/abs/2510.09878


Cluster-Aware Prompt Ensemble Learning for Few-Shot Vision-Language Model Adaptation. arxiv.org/abs/2510.09867


Cell Instance Segmentation: The Devil Is in the Boundaries. arxiv.org/abs/2510.09848


Post Processing of image segmentation using Conditional Random Fields. arxiv.org/abs/2510.09833


Task-Aware Resolution Optimization for Visual Large Language Models. arxiv.org/abs/2510.09822


Towards Understanding Ambiguity Resolution in Multimodal Inference of Meaning. arxiv.org/abs/2510.09815


Constructive Distortion: Improving MLLMs with Attention-Guided Image Warping. arxiv.org/abs/2510.09741


Multi Camera Connected Vision System with Multi View Analytics: A Comprehensive Survey. arxiv.org/abs/2510.09731


Adaptive Fusion Network with Temporal-Ranked and Motion-Intensity Dynamic Images for Micro-expression Recognition. arxiv.org/abs/2510.09730


Knowledge-Aware Mamba for Joint Change Detection and Classification from MODIS Times Series. arxiv.org/abs/2510.09679


Ultralytics YOLO Evolution: An Overview of YOLO26, YOLO11, YOLOv8 and YOLOv5 Object Detectors for Computer Vision and Pattern Recognition. arxiv.org/abs/2510.09653

