
Vishal Patel

@vishalm_patel

Associate Professor @JohnsHopkins working on computer vision, biometrics, and medical imaging.

Vishal Patel reposted

Why can language teach MLLMs to see? Read thoughts on: 👀 Visual priors embedded in language ➡️ How reasoning transfers across modalities 🧠 How SVGs from #Gemini 3 pro hint at a model’s inner “cognitive imagery”. ✨ Blog: medium.com/@yanawei/throu… (Stay tuned!)


CGCE: a plug-and-play framework for robustly erasing unsafe concepts from generative models without retraining. Achieves state-of-the-art safety while preserving image & video quality. Great work by @vng_sofw! @JHUengineering Project page: viettmab.github.io/cgce.github.io/


Vishal Patel reposted

💥 New paper: CGCE: Classifier-Guided Concept Erasure in Generative Models 📄 arXiv: arxiv.org/abs/2511.05865 🌐 Project Page: viettmab.github.io/cgce.github.io Details in 🧵 [1/n]
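For readers who want the gist of classifier guidance applied to erasure, here is a minimal, hypothetical sketch: at sampling time a lightweight concept classifier scores the latent, and its gradient steers each denoising step away from the unsafe concept. Every name below (ToyDenoiser, ConceptClassifier, erase_guided_step) is an illustrative stand-in, not the CGCE code.

```python
# Hypothetical sketch of classifier-guided concept erasure at sampling time.
import torch
import torch.nn as nn

class ToyDenoiser(nn.Module):
    """Stand-in for a diffusion denoiser: predicts noise from x_t."""
    def __init__(self, dim=16):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim, 64), nn.SiLU(), nn.Linear(64, dim))
    def forward(self, x_t):
        return self.net(x_t)

class ConceptClassifier(nn.Module):
    """Stand-in classifier scoring how strongly x_t expresses the unsafe concept."""
    def __init__(self, dim=16):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim, 64), nn.SiLU(), nn.Linear(64, 1))
    def forward(self, x_t):
        return self.net(x_t)

def erase_guided_step(denoiser, classifier, x_t, step_size=0.1, guidance=1.0):
    """One denoising step, pushed *away* from the concept via the classifier gradient."""
    x_t = x_t.detach().requires_grad_(True)
    concept_score = classifier(x_t).sum()
    grad = torch.autograd.grad(concept_score, x_t)[0]  # direction that increases the concept
    with torch.no_grad():
        eps = denoiser(x_t)
        # follow the denoiser, minus a push along the concept gradient
        x_next = x_t - step_size * eps - guidance * grad
    return x_next

x = torch.randn(4, 16)
denoiser, clf = ToyDenoiser(), ConceptClassifier()
for _ in range(10):
    x = erase_guided_step(denoiser, clf, x)
print(x.shape)  # torch.Size([4, 16])
```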


Vishal Patel reposted

Congratulations to Malone researchers Swaroop Vedula, @ShameemaSikder, @vishalm_patel, and Masaru Ishii on their $1.2 million @NSF grant, which will fund the development of #AI capable of giving surgeons expert feedback from videos of their performance: malonecenter.jhu.edu/malone-researc…


Vishal Patel reposted

🤯 Think better visuals mean better world models? Think again. 💥 Surprise: Agents don’t need eye candy— they need wins. Meet World-in-World, the first open benchmark that ranks world models by closed-loop task success, not pixels. We uncover 3 shocks: 1️⃣ Visuals ≠ utility 2️⃣…

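A toy illustration of the closed-loop idea: instead of scoring a world model on how its rollouts look, drop an agent inside it and measure task success. All interfaces below are hypothetical stand-ins, not the benchmark's API.

```python
# Illustrative sketch of closed-loop evaluation: rank a world model by task
# success rather than pixel quality. Every function here is a toy stand-in.
import random

def make_world_model(noise):
    """Toy world model: predicts the next 1-D state, with some error."""
    def step(state, action):
        return state + action + random.gauss(0.0, noise)
    return step

def greedy_agent(state, goal):
    """Toy agent: move one unit toward the goal."""
    return 1.0 if goal > state else -1.0

def closed_loop_success(world_model, episodes=200, horizon=20, tol=1.0):
    """Fraction of episodes where the agent reaches the goal inside the model."""
    wins = 0
    for _ in range(episodes):
        state, goal = 0.0, random.uniform(-5, 5)
        for _ in range(horizon):
            state = world_model(state, greedy_agent(state, goal))
            if abs(state - goal) < tol:
                wins += 1
                break
    return wins / episodes

random.seed(0)
# A model can render beautifully and still be useless in closed loop:
for noise in (0.1, 2.0):
    print(f"noise={noise}: success={closed_loop_success(make_world_model(noise)):.2f}")
```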

Our new work FreeViS: Training-free Video Stylization with Inconsistent References introduces a training-free framework for generating high-quality, temporally coherent stylized videos! 🌀 Check it out: xujiacong.github.io/FreeViS/ @HopkinsEngineer @myq_1997 #AI #CV #DeepLearning


Exciting news! My former student, Dr. Shao-Yuan Lo—now an Asst. Prof. at NTU—has been named a Yushan Young Fellow by Taiwan’s Ministry of Education, the nation’s highest honor for rising faculty. Huge congratulations—so proud of you! 👏 @HopkinsEngineer @JHUECE @shaoyuanlo


Vishal Patel reposted

Project webpage & code - virobo-15.github.io/srdd.github.io/ arXiv - arxiv.org/pdf/2509.22636 This project was co-led with Nithin Gopalakrishnan Nair (@NithinGK10) under the guidance of Vishal Patel (@vishalm_patel).


Vishal Patel reposted

Scaling Transformer-Based Novel View Synthesis Models with Token Disentanglement and Synthetic Data Nithin Gopalakrishnan Nair, Srinivas Kaza, @XuanLuo14, @vishalm_patel, Stephen Lombardi, Jungyeon Park tl;dr: layer-wise modulation of source and target tokens->synthetic data…

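Reading the tl;dr loosely, one way layer-wise token disentanglement could look in code is a per-layer (scale, shift) modulation applied separately to source-view and target-view tokens before attention. The module below is an assumption-laden sketch, not the paper's implementation.

```python
# Hedged sketch of layer-wise source/target token modulation for novel view
# synthesis; module names, shapes, and the modulation form are assumptions.
import torch
import torch.nn as nn

class ModulatedBlock(nn.Module):
    """Attention block whose normalized tokens are modulated differently for
    source-view and target-view tokens, disentangling the two roles."""
    def __init__(self, dim=32, heads=4):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        # one learnable (scale, shift) pair per token role, per layer
        self.src_mod = nn.Parameter(torch.zeros(2, dim))
        self.tgt_mod = nn.Parameter(torch.zeros(2, dim))

    def forward(self, tokens, is_target):
        # is_target: (B, N) bool mask marking target-view tokens
        h = self.norm(tokens)
        mod = torch.where(is_target[..., None],
                          (1 + self.tgt_mod[0]) * h + self.tgt_mod[1],
                          (1 + self.src_mod[0]) * h + self.src_mod[1])
        out, _ = self.attn(mod, mod, mod)
        return tokens + out

tokens = torch.randn(2, 10, 32)
is_target = torch.zeros(2, 10, dtype=torch.bool)
is_target[:, 5:] = True  # last 5 tokens belong to the target view
block = ModulatedBlock()
print(block(tokens, is_target).shape)  # torch.Size([2, 10, 32])
```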

Vishal Patel reposted

#HopkinsDSAI welcomes 22 new faculty members, who join more than 150 DSAI faculty members across @JohnsHopkins in advancing the study of data science, machine learning, and #AI, and their translation to a range of critical and emerging fields. ai.jhu.edu/news/data-scie…


Vishal Patel reposted

Think before you diffuse: DiffPhy from @vishalm_patel and team delivers realistic physics in AI video generation by enlisting LLMs to reason about the physical context. Multimodal LLMs evaluate and fine-tune the model. GitHub page: bwgzk-keke.github.io/DiffPhy/
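The "think before you diffuse" loop can be summarized in a few stub functions: an LLM expands the prompt with physical reasoning, the video model generates, and an MLLM judge scores plausibility as a fine-tuning signal. Everything below is a hypothetical stand-in, not the DiffPhy API.

```python
# Hypothetical sketch of a DiffPhy-style pipeline; all functions are stubs.
def llm_physical_reasoning(prompt: str) -> str:
    """Stub: ask an LLM to spell out the physics implied by the prompt."""
    return prompt + " (objects fall under gravity; momentum is conserved)"

def generate_video(prompt: str) -> list[str]:
    """Stub: a 'video' represented as a list of frame descriptions."""
    return [f"frame {i}: {prompt}" for i in range(4)]

def mllm_physics_score(video: list[str]) -> float:
    """Stub: an MLLM judge returning a plausibility score in [0, 1]."""
    return 1.0 if "gravity" in video[0] else 0.3

def diffphy_step(prompt: str) -> float:
    enriched = llm_physical_reasoning(prompt)  # think before you diffuse
    video = generate_video(enriched)           # diffuse
    reward = mllm_physics_score(video)         # evaluate
    # in training, `reward` would drive fine-tuning of the video model
    return reward

print(diffphy_step("a ball rolls off a table"))  # 1.0
```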


Vishal Patel reposted

🚀 Check out our ✈️ ICML 2025 work: Perception in Reflection! A reasonable perception paradigm for LVLMs should be iterative rather than single-pass. 💡 Key Ideas 👉 Builds a perception-feedback loop through a curated visual reflection dataset. 👉 Utilizes Reflective…


🪞 We'll present Perception in Reflection at ICML this week! We introduce RePer, a dual-model framework that improves visual understanding through reflection. Better captions, fewer hallucinations, stronger alignment. 📄 arxiv.org/pdf/2504.07165 #ICML2025 @yanawei_ @JHUCompSci
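The perception-feedback loop is easy to sketch: a perceiver model drafts a caption, a reflector critiques it, and the draft is revised until the critic is satisfied. Both models are stubbed here; the names are illustrative, not RePer's actual interface.

```python
# Minimal sketch of a dual-model reflection loop; both models are toy stubs.
def perceiver(image: str, feedback: str = "") -> str:
    """Stub LVLM: produce (or revise) a caption for the image."""
    caption = f"a photo of {image}"
    return caption + (f", {feedback}" if feedback else "")

def reflector(image: str, caption: str) -> str:
    """Stub critic: point out what the caption missed, or nothing if satisfied."""
    return "" if "on a table" in caption else "on a table"

def perceive_in_reflection(image: str, rounds: int = 3) -> str:
    caption = perceiver(image)
    for _ in range(rounds):
        feedback = reflector(image, caption)
        if not feedback:  # critic is satisfied; stop iterating
            break
        caption = perceiver(image, feedback)
    return caption

print(perceive_in_reflection("a red apple"))
# a photo of a red apple, on a table
```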


🚀 Open Vision Reasoner (OVR) Transferring linguistic cognitive behaviors to visual reasoning via large-scale multimodal RL. SOTA on MATH500 (95.3%), MathVision, and MathVerse. 💻 Code: github.com/Open-Reasoner-… 🌐 Project: weiyana.github.io/Open-Vision-Re… #LLM @yanawei_ @HopkinsEngineer


Vishal Patel reposted

#ICCV2025 🌺 FaceXFormer has been accepted to ICCV!


🎨 New work: Training-Free Stylized Abstraction Generate stylized avatars (LEGO, South Park, dolls) from a single image! 💡 VLM-guided identity distillation 📊 StyleBench eval @HopkinsDSAI @JHUECE @jhucs @KartikNarayan10 @HopkinsEngineer 🔗 kartik-3004.github.io/TF-SA/


Vishal Patel reposted

STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models tl;dr: iteratively discover and mitigate adversarial prompts, then go back to the original model and mitigate in parallel (with anchor concepts to reduce unwanted side effects)…

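The two stages in the tl;dr map naturally onto a small sketch: stage one alternates between searching for an adversarial prompt and mitigating it, and stage two returns to the original weights and erases every discovered prompt in parallel, with anchor concepts as a regularizer. The toy model and losses below are assumptions, not the STEREO objectives.

```python
# Hedged sketch of the two-stage recipe on a toy linear "model"; the prompt
# search and erasure objectives are illustrative stand-ins.
import copy
import torch

def find_adversarial_prompt(model, concept_dir, steps=50):
    """Stage-1 stub: optimize an embedding that still elicits the concept."""
    adv = torch.randn_like(concept_dir, requires_grad=True)
    opt = torch.optim.Adam([adv], lr=0.1)
    for _ in range(steps):
        opt.zero_grad()
        loss = -(model(adv) * concept_dir).sum()  # maximize concept response
        loss.backward()
        opt.step()
    return adv.detach()

def erase(model, prompts, anchors, steps=100):
    """Suppress responses to `prompts` while pinning behavior on `anchors`."""
    opt = torch.optim.Adam(model.parameters(), lr=0.01)
    for _ in range(steps):
        opt.zero_grad()
        erase_loss = sum(model(p).pow(2).sum() for p in prompts)
        keep_loss = sum((model(a) - a).pow(2).sum() for a in anchors)
        (erase_loss + keep_loss).backward()
        opt.step()

torch.manual_seed(0)
model = torch.nn.Linear(8, 8)
original_state = copy.deepcopy(model.state_dict())
concept = torch.randn(8)
anchors = [torch.randn(8) for _ in range(3)]

# Stage 1: alternate between discovering an adversarial prompt and mitigating it
adversarial = []
for _ in range(3):
    adversarial.append(find_adversarial_prompt(model, concept))
    erase(model, [concept] + adversarial, anchors, steps=20)

# Stage 2: go back to the original model and erase all prompts in parallel,
# with anchor concepts regularizing against unwanted side effects
model.load_state_dict(original_state)
erase(model, [concept] + adversarial, anchors)
print(model(concept).norm())  # concept response shrinks toward zero
```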

Vishal Patel reposted

Hopkins researchers including @JHUECE Tinoosh Mohsenin and @JHU_BDPs Rama Chellappa are speaking at booth 1317 of the IEEE/CVF Computer Vision and Pattern Recognition Conference today! Come meet #HopkinsDSAI #CVPR2025


Vishal Patel reposted

The #WACV2026 Call for Papers is live at wacv.thecvf.com/Conferences/20…! First round paper registration is coming up on July 11th, with the submission deadline on July 18th (all deadlines are 23:59 AoE).


Vishal Patel reposted

@CVPR IS AROUND THE CORNER! #CVPR2025 Join our Medical Vision Foundation Model Workshop on June 11th, from 8:30 to 12:00 in Room 212! We are also proud to host an esteemed lineup of speakers: Dr. Jakob Nikolas Kather @jnkath Dr. Faisal Mahmood @AI4Pathology Dr.…

Excited to announce the 2nd Workshop on Foundation Models for Medical Vision (FMV) at #CVPR2025! @CVPR 🌐 fmv-cvpr25workshop.github.io FMV brings together researchers pushing the boundaries of medical AGI. We are also proud to host an esteemed lineup of speakers: Dr. Jakob Nikolas…


