
Vishal Patel

@vishalm_patel

Associate Professor @JohnsHopkins working on computer vision, biometrics, and medical imaging.

Vishal Patel reposted

Why can language teach MLLMs to see? Read thoughts on: 👀 Visual priors embedded in language ➡️ How reasoning transfers across modalities 🧠 How SVGs from #Gemini 3 pro hint at a model’s inner “cognitive imagery”. ✨ Blog: medium.com/@yanawei/throu… (Stay tuned!)


CGCE: a plug-and-play framework for robustly erasing unsafe concepts from generative models without retraining. Achieves state-of-the-art safety while preserving image & video quality. Great work by @vng_sofw! @JHUengineering Project page: viettmab.github.io/cgce.github.io/


Vishal Patel reposted

💥New paper: CGCE: Classifier-Guided Concept Erasure in Generative Models 📄Arxiv: arxiv.org/abs/2511.05865 🌐Project Page: viettmab.github.io/cgce.github.io Details in 🧵[1/n]
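For readers skimming the thread, the classifier-guided idea can be sketched as a guidance step applied at sampling time, with no retraining. This is a toy linear stand-in, not CGCE's actual method: the real framework guides diffusion latents with a trained classifier, whereas here `unsafe_dir` and the linear "classifier" are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
DIM = 8

# Toy "unsafe concept" direction standing in for a trained safety classifier.
unsafe_dir = rng.normal(size=DIM)
unsafe_dir /= np.linalg.norm(unsafe_dir)

def unsafe_logit(x):
    """Linear 'classifier' scoring how strongly x expresses the concept."""
    return float(x @ unsafe_dir)

def erase_step(x, scale=0.5):
    """Steer the sample away from the unsafe direction, plug-and-play:
    no model weights change, only the sampling trajectory is guided."""
    return x - scale * unsafe_logit(x) * unsafe_dir

x = rng.normal(size=DIM)
for _ in range(10):
    x = erase_step(x)

print(abs(unsafe_logit(x)))  # component along the unsafe direction is near 0
```

Each step halves the component along `unsafe_dir` while leaving the orthogonal content of `x` untouched, which is the intuition behind erasing a concept while preserving image and video quality.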


Vishal Patel reposted

Congratulations to Malone researchers Swaroop Vedula, @ShameemaSikder, @vishalm_patel, and Masaru Ishii on their $1.2 million @NSF grant, which will fund the development of #AI capable of giving surgeons expert feedback from videos of their performance: malonecenter.jhu.edu/malone-researc…


Vishal Patel reposted

🤯 Think better visuals mean better world models? Think again. 💥 Surprise: Agents don’t need eye candy— they need wins. Meet World-in-World, the first open benchmark that ranks world models by closed-loop task success, not pixels. We uncover 3 shocks: 1️⃣ Visuals ≠ utility 2️⃣…


Our new work FreeViS: Training-free Video Stylization with Inconsistent References introduces a training-free framework for generating high-quality, temporally coherent stylized videos! 🌀 Check it out: xujiacong.github.io/FreeViS/ @HopkinsEngineer @myq_1997 #AI #CV #DeepLearning


Exciting news! My former student, Dr. Shao-Yuan Lo—now an Asst. Prof. at NTU—has been named a Yushan Young Fellow by Taiwan’s Ministry of Education, the nation’s highest honor for rising faculty. Huge congratulations—so proud of you! 👏@HopkinsEngineer @JHUECE @shaoyuanlo


Vishal Patel reposted

Project webpage & code - virobo-15.github.io/srdd.github.io/ Arxiv - arxiv.org/pdf/2509.22636 This project was co-led with Nithin Gopalakrishnan Nair (@NithinGK10) under the guidance of Vishal Patel (@vishalm_patel).


Vishal Patel reposted

Scaling Transformer-Based Novel View Synthesis Models with Token Disentanglement and Synthetic Data Nithin Gopalakrishnan Nair, Srinivas Kaza, @XuanLuo14, @vishalm_patel, Stephen Lombardi, Jungyeon Park tl;dr: layer-wise modulation of source and target tokens->synthetic data…


Vishal Patel reposted

#HopkinsDSAI welcomes 22 new faculty members, who join more than 150 DSAI faculty members across @JohnsHopkins in advancing the study of data science, machine learning, and #AI and their translation to a range of critical and emerging fields. ai.jhu.edu/news/data-scie…


Vishal Patel reposted

Think before you diffuse: DiffPhy from @vishalm_patel and team delivers realistic physics in AI video generation by enlisting LLMs to reason about the physical context. Multimodal LLMs evaluate and fine-tune the model. GitHub page: bwgzk-keke.github.io/DiffPhy/


Vishal Patel reposted

🚀 Check out our ✈️ ICML 2025 work: Perception in Reflection! A reasonable perception paradigm for LVLMs should be iterative rather than a single-pass. 💡 Key Ideas 👉 Builds a perception-feedback loop through a curated visual reflection dataset. 👉 Utilizes Reflective…


🪞 We'll present Perception in Reflection at ICML this week! We introduce RePer, a dual-model framework that improves visual understanding through reflection. Better captions, fewer hallucinations, stronger alignment. 📄 arxiv.org/pdf/2504.07165 #ICML2025 @yanawei_ @JHUCompSci
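The dual-model idea—perceive, reflect, revise—can be sketched as a loop. This is a toy illustration only: plain Python functions stand in for RePer's two LVLMs, and the function names `perceiver`, `reflector`, and `reflective_loop` are assumptions, not the paper's API.

```python
def perceiver(image, feedback=None):
    """Captioner model: a single pass, or a revision when given feedback."""
    caption = f"caption of {image}"
    if feedback:
        caption += f" (revised per: {feedback})"
    return caption

def reflector(image, caption):
    """Critic model: returns a critique, or None once the caption suffices."""
    return None if "revised" in caption else "describe object colors"

def reflective_loop(image, max_rounds=3):
    """Iterative perception instead of a single pass."""
    caption = perceiver(image)
    for _ in range(max_rounds):
        critique = reflector(image, caption)
        if critique is None:
            break
        caption = perceiver(image, feedback=critique)
    return caption

print(reflective_loop("photo.jpg"))
# → caption of photo.jpg (revised per: describe object colors)
```

The feedback loop terminates either when the critic is satisfied or after a fixed budget, which is how iterative perception stays bounded in practice.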


🚀 Open Vision Reasoner (OVR) Transferring linguistic cognitive behaviors to visual reasoning via large-scale multimodal RL. SOTA on MATH500 (95.3%), MathVision, and MathVerse. 💻 Code: github.com/Open-Reasoner-… 🌐 Project: weiyana.github.io/Open-Vision-Re… #LLM @yanawei @HopkinsEngineer


Vishal Patel reposted

#ICCV2025 🌺FaceXFormer has been accepted to ICCV!


🎨 New work: Training-Free Stylized Abstraction Generate stylized avatars (LEGO, South Park, dolls) from a single image! 💡 VLM-guided identity distillation 📊 StyleBench eval @HopkinsDSAI @JHUECE @jhucs @KartikNarayan10 @HopkinsEngineer 🔗 kartik-3004.github.io/TF-SA/


Vishal Patel reposted

STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models tldr: iteratively discover and mitigate adversarial prompts, then go back to original model and mitigate in parallel (with anchor concepts to reduce unwanted side effects)…


Vishal Patel reposted

Hopkins researchers including @JHUECE Tinoosh Mohsenin and @JHU_BDPs Rama Chellappa are speaking at booth 1317 of the IEEE / CVF Computer Vision and Pattern Recognition Conference today! Come meet #HopkinsDSAI #CVPR2025


Vishal Patel reposted

The #WACV2026 Call for Papers is live at wacv.thecvf.com/Conferences/20……! First round paper registration is coming up on July 11th, with the submission deadline on July 18th (all deadlines are 23:59 AoE).


Vishal Patel reposted

@CVPR IS AROUND THE CORNER! #CVPR2025 Welcome to join our Medical Vision Foundation Model Workshop on June 11th, from 8:30 to 12:00 in Room 212! We are also proud to host an esteemed lineup of speakers: Dr. Jakob Nikolas Kather @jnkath Dr. Faisal Mahmood @AI4Pathology Dr.…

Excited to announce the 2nd Workshop on Foundation Models for Medical Vision (FMV) at #CVPR2025! @CVPR 🌐 fmv-cvpr25workshop.github.io FMV brings together researchers pushing the boundaries of medical AGI. We are also proud to host an esteemed lineup of speakers: Dr. Jakob Nikolas…


