Zaid Khan

@codezakh

NDSEG Fellow / PhD @uncnlp with @mohitban47 working on automating env/data generation + program synthesis formerly @allenai @neclabsamerica

Boston, USA

zaidkhan.me

6月 2023に登録

527ポスト 568フォロワー 963フォロー中

おすすめツイート

@alexxthiery

@olivia__white1

@Molly_M_Miller

@NicholasRebold

@0xCrispy

@saibayadon

@LordAlirezaF

@danieljvdm

@_Andros__

@Aishwarya_R_M

固定されたツイート

Zaid Khan

@codezakh

/10/15

How can an agent reverse engineer the underlying laws of an unknown, hostile & stochastic environment in “one life”, without millions of steps + human-provided goals / rewards? In our work, we: 1️⃣ infer an executable symbolic world model (a probabilistic program capturing…

Zaid Khan さんがリポスト

Alex Shaw

@alexgshaw

/11/07

Today, we’re announcing the next chapter of Terminal-Bench with two releases: 1. Harbor, a new package for running sandboxed agent rollouts at scale 2. Terminal-Bench 2.0, a harder version of Terminal-Bench with increased verification

alexgshaw's tweet image. Today, we’re announcing the next chapter of Terminal-Bench with two releases:

1. Harbor, a new package for running sandboxed agent rollouts at scale
2. Terminal-Bench 2.0, a harder version of Terminal-Bench with increased verification

Zaid Khan さんがリポスト

Arijit Ray

@ARRay693

/11/07

We live, feel, and create by perceiving the world as visual spaces unfolding through time — videos. Our memories and even our language are spatial: mind-palaces, mind-maps, "taking steps in the right direction..." Super excited to see Cambrian-S pushing this frontier! And,…

Saining Xie

@sainingxie

/11/07

Introducing Cambrian-S it’s a position, a dataset, a benchmark, and a model but above all, it represents our first steps toward exploring spatial supersensing in video. 🧶

Zaid Khan さんがリポスト

Ellis Brown

@_ellisbrown

/11/07

MLLMs are great at understanding videos, but struggle with spatial reasoning—like estimating distances or tracking objects across time. the bottleneck? getting precise 3D spatial annotations on real videos is expensive and error-prone. introducing SIMS-V 🤖 [1/n]

Zaid Khan さんがリポスト

Justin Chih-Yao Chen

@cyjustinchen

/11/05

I'll be presenting ✨MAgICoRe✨ virtually tonight at 7 PM ET / 8 AM CST (Gather Session 3)! I'll discuss 3 key challenges in LLM refinement for reasoning, and how MAgICoRe tackles them jointly: 1⃣ Over-correction on easy problems 2⃣ Failure to localize & fix its own errors 3⃣…

Mohit Bansal

@mohitban47

/11/04

🚨 Check out our awesome students/postdocs' papers at #EMNLP2025 and say hi to them 👋! Also, I will give a keynote (virtually) on "Attributable, Conflict-Robust, and Multimodal Summarization with Multi-Source Retrieval" at the NewSumm workshop. -- Jaehong (in-person) finished…

mohitban47's tweet image. 🚨 Check out our awesome students/postdocs' papers at #EMNLP2025 and say hi to them 👋!

Also, I will give a keynote (virtually) on "Attributable, Conflict-Robust, and Multimodal Summarization with Multi-Source Retrieval" at the NewSumm workshop.

-- Jaehong (in-person) finished…

Zaid Khan さんがリポスト

Ziyang Wang

@ZiyangW00

/11/05

🎉Thanks for the shoutout! I’ll be virtually presenting our new work Video-RTS at #EMNLP2025 (my co-lead @jaeh0ng_yoon will present in person). If you’re into advanced video-reasoning frameworks, check it out: - No SFT, pure RL: trains with simple output-based rewards (GRPO)—no…

Mohit Bansal

@mohitban47

/11/04

Zaid Khan さんがリポスト

Mohit Bansal

@mohitban47

/11/04

Zaid Khan さんがリポスト

hyunji amy lee

@hyunji_amy_lee

/11/04

🚨 Excited to announce Gistify!, where a coding agent must extract the gist of a repository: generate a single, executable, and self-contained file that faithfully reproduces the behavior of a given command (e.g., a test or entrypoint). ✅ It is a lightweight, broadly applicable…

hyunji_amy_lee's tweet image. 🚨 Excited to announce Gistify!, where a coding agent must extract the gist of a repository: generate a single, executable, and self-contained file that faithfully reproduces the behavior of a given command (e.g., a test or entrypoint).

✅ It is a lightweight, broadly applicable…

Zaid Khan さんがリポスト

Jaehong Yoon

@jaeh0ng_yoon

/11/03

🎉 Excited to share that 5/5 of my papers (3 main, 2 findings) have been accepted at #EMNLP2025, in video/multimodal reasoning, instructional video editing, and efficient LLM adaptation & reasoning! 🚨 I’m recruiting Ph.D. students to join the Multimodal AI Group at NTU College…

jaeh0ng_yoon's tweet image. 🎉 Excited to share that 5/5 of my papers (3 main, 2 findings) have been accepted at #EMNLP2025, in video/multimodal reasoning, instructional video editing, and efficient LLM adaptation &amp; reasoning!

🚨 I’m recruiting Ph.D. students to join the Multimodal AI Group at NTU College…

Zaid Khan さんがリポスト

Mohit Bansal

@mohitban47

/10/29

It was an honor and pleasure to give a keynote at the 28th European Conference on Artificial Intelligence (#ECAI2025) in beautiful Bologna, and engage in enthusiastic discussions about trustworthy + calibrated agents, collaborative reasoning + privacy, and controllable multimodal…

mohitban47's tweet image. It was an honor and pleasure to give a keynote at the 28th European Conference on Artificial Intelligence (#ECAI2025) in beautiful Bologna, and engage in enthusiastic discussions about trustworthy + calibrated agents, collaborative reasoning + privacy, and controllable multimodal…

Zaid Khan さんがリポスト

Vaidehi Patil

@vaidehi_patil_

/10/27

🥳🥳 Honored and grateful to be awarded a 2025 Google PhD Fellowship in Machine Learning and ML Foundations for my research on machine unlearning, defenses against adversarial attacks, and multi-agent privacy! ✨ Deep gratitude to my advisor @mohitban47 for his constant…

Google.org

@Googleorg

/10/23

🎉 We're excited to announce the 2025 Google PhD Fellows! @GoogleOrg is providing over $10 million to support 255 PhD students across 35 countries, fostering the next generation of research talent to strengthen the global scientific landscape. Read more: goo.gle/43wJWw8

Googleorg's tweet image. 🎉 We're excited to announce the 2025 Google PhD Fellows! @GoogleOrg is providing over $10 million to support 255 PhD students across 35 countries, fostering the next generation of research talent to strengthen the global scientific landscape. Read more: goo.gle/43wJWw8

Zaid Khan さんがリポスト

Mohit Bansal

@mohitban47

/10/27

🎉 Big congratulations to Vaidehi on being awarded a Google PhD Fellowship in Machine Learning and ML Foundations for her important research contributions in machine unlearning for LLMs/VLMs, defenses against adversarial attacks, and multi-agent privacy! #ProudAdvisor 👇👇

Vaidehi Patil

@vaidehi_patil_

/10/27

Zaid Khan さんがリポスト

Yi Lin Sung

@yilin_sung

/10/23

Tough week! I also got impacted less than 3 months after joining. Ironically, I just landed some new RL infra features the day before. Life moves on. My past work spans RL, PEFT, Quantization, and Multimodal LLMs. If your team is working on these areas, I’d love to connect.

Jiaxun Cui 🐿️

@cuijiaxun

/10/23

Meta has gone crazy on the squid game! Many new PhD NGs are deactivated today (I am also impacted🥲 happy to chat)

Zaid Khan さんがリポスト

Mohit Bansal

@mohitban47

/10/23

🚨 🤯 Wow! Yi Lin is an amazing researcher, who works on very hard and important problems in LLM and VLM training, RL, PEFT, Quantization, etc. -- ironically, he had several other top offers just a few months ago! Hire him ASAP if you want to pick up a top talent (and several…

Yi Lin Sung

@yilin_sung

/10/23

Zaid Khan さんがリポスト

Elias Stengel-Eskin

@EliasEskin

/10/23

🚨 Excited to share PoSH, a graph-based, fine-grained, and interpretable metric for detailed image descriptions! PoSH allows us to not only evaluate generated image descriptions but localize hallucinations and errors in them. To test PoSH, we also introduce DOCENT, a new and…

Amith Ananthram

@AmithAnanthram

/10/23

🚨 Are your detailed image descriptions what you (really really) want? Let PoSh be the judge. Introducing PoSh, a new graph-based metric for detailed image descriptions, and DOCENT, a novel & challenging benchmark of art w/ detailed descriptions and strong human judgments 🧵

AmithAnanthram's tweet image. 🚨 Are your detailed image descriptions what you (really really) want? Let PoSh be the judge.

Introducing PoSh, a new graph-based metric for detailed image descriptions, and DOCENT, a novel &amp; challenging benchmark of art w/ detailed descriptions and strong human judgments

🧵

Zaid Khan さんがリポスト

Gedas Bertasius

@gberta227

/10/19

Can AI models teach you to shoot like Steph Curry? 🏀 Come to my talk on Challenges in Expert-Level Skill Analysis at 4:30 pm in Room 318-A tomorrow (Sunday) to find out! sauafg-workshop.github.io #ICCV2025

sauafg-workshop.github.io

SAUAFG Workshop – ICCV 2025

ICCV 2025 SAUAFG Workshop on AI-driven skill assessment, understanding, and feedback generation.

ソース: sauafg-workshop.github.io

Paritosh Parmar

@ParitoshParmar_

/10/11

🗓Oct 19, 2025 | 📍Hawaii Convention Center, Room 318-A 👉 Learn more: sauafg-workshop.github.io 🔍 We'll explore AI-driven Skilled Activity Understanding, Assessment & Guidance generation in various domains from Surgery to Sports, from Robotics and Manufacturing to Education

sauafg-workshop.github.io

SAUAFG Workshop – ICCV 2025

ICCV 2025 SAUAFG Workshop on AI-driven skill assessment, understanding, and feedback generation.

ソース: sauafg-workshop.github.io

Zaid Khan さんがリポスト

Mohit Bansal

@mohitban47

/10/22

🎉 Big congrats to Zaid on being awarded the NDSEG PhD Fellowship, for his innovative contributions in environment/data generation, skill-based self-improvement and adaptable agents, visual program synthesis, and world model inference! #ProudAdvisor 👇👇

Zaid Khan

@codezakh

/10/22

🥳 Honored and grateful to be awarded an NDSEG Fellowship in Computer Science! 💫🇺🇸 Big thanks to my advisor @mohitban47 for his guidance, and shoutout to my lab mates at @unc_ai_group, collaborators, internship advisors, and mentors for their support 🤗 Excited to continue…

Zaid Khan

@codezakh

/10/22

UNC Computer Science

@unccs

/10/22

🎉 Congratulations to our student Zaid Khan (advised by @mohitban47) for being awarded a prestigious NDSEG Fellowship for his work on environment generation! Established in 1989, the fellowship has an acceptance rate of <7% and covers diverse science and engineering disciplines.

unccs's tweet image. 🎉 Congratulations to our student Zaid Khan (advised by @mohitban47) for being awarded a prestigious NDSEG Fellowship for his work on environment generation!

Established in 1989, the fellowship has an acceptance rate of &lt;7% and covers diverse science and engineering disciplines.

Zaid Khan さんがリポスト

Jaemin Cho

@jmin__cho

/10/21

It's today! Come check out CAPTURe at #ICCV2025 Poster #280 3pm-5pm 🤗

Elias Stengel-Eskin

@EliasEskin

/04/23

Check out 🚨CAPTURe🚨 -- a new benchmark and task testing spatial reasoning by making VLMs count objects under occlusion. Key Takeaways: ➡️ SOTA VLMs (GPT-4o, Qwen2-VL, Intern-VL2) have high error rates on CAPTURe (but humans get very low error ✅) and models struggle to reason…

EliasEskin's tweet image. Check out 🚨CAPTURe🚨 -- a new benchmark and task testing spatial reasoning by making VLMs count objects under occlusion.

Key Takeaways:
➡️ SOTA VLMs (GPT-4o, Qwen2-VL, Intern-VL2) have high error rates on CAPTURe (but humans get very low error ✅) and models struggle to reason…

Zaid Khan さんがリポスト

FUCAI KE

@Fucai_Ke

/10/21

I will be presenting our recent work, “DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning,” at #ICCV2025 . ⌚️Oct. 21, 11:30-13:30 👉Exhibit Hall I, #314

Fucai_Ke's tweet image. I will be presenting our recent work, “DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation &amp; Instruct-Masking Tuning,” at #ICCV2025 .

⌚️Oct. 21, 11:30-13:30
👉Exhibit Hall I, #314

Zaid Khan さんがリポスト

Mohit Bansal

@mohitban47

/10/21

🚨 If you are at #ICCV2025, make sure to talk to Jaemin for his new group at @jhuclsp @JHUCompSci -- he has done a lot of foundational research in multimodality+other areas & will be a great advisor! 👇👇

Jaemin Cho

@jmin__cho

/10/20

Excited to be at #ICCV2025 in Hawaii!🌴 I'll present two papers: M3DocVQA/M3DocRAG (Mon) and CAPTURe (Tue). Check our poster sessions👇 and feel free to ping me to grab a coffee together I'm hiring PhD students to work on multimodal AI and robotics with me at JHU from Fall 2026!