UNC NLP

@uncnlp

NLP (+ML/AI/CV) research at @UNCCS @UNC Faculty: @mohitban47+@gberta227+@snigdhac25+@shsriva+@tianlongchen4+@huaxiuyaoml+@dingmyu+@zhun_deng +@SenguptRoni et al

nlp.cs.unc.edu

Joined June 2017

2KPosts 3KFollowers 410Following

You might like

@EdinburghNLP

@uwnlp

@YejinChoinka

@emnlpmeeting

@mohitban47

@xiangrenNLP

@LTIatCMU

@kaiwei_chang

@jhuclsp

@naaclmeeting

@HannaHajishirzi

@CopeNLU

@iatitov

@cambridgenlp

@yoavartzi

Pinned

UNC AI

@unc_ai_group

Nov 26, 2023

🚨🎓 We have several PhD (and postdoc) openings in NLP+CV+ML+AI in beautiful Chapel Hill 👇 Please RT+apply & ping our faculty for any questions (application-fee waivers & no GRE requirement)! @mohitban47 @gberta227 @snigdhac25 @TianlongChen4 @shsriva @HuaxiuYaoML + others 🧵

UNC AI

@unc_ai_group

Nov 22, 2021

Check out our @uncnlp group for your PhD applications! We have no GRE requirements + continue our application-fee waivers! Please RT+apply (we have postdoc openings too) & ping us for any questions!🙂 We strongly encourage diversity. Direct FAQ page link: cs.unc.edu/graduate/gradu…

unc_ai_group's tweet image. Check out our @uncnlp group for your PhD applications! We have no GRE requirements + continue our application-fee waivers! Please RT+apply (we have postdoc openings too) &amp; ping us for any questions!🙂

We strongly encourage diversity. Direct FAQ page link: cs.unc.edu/graduate/gradu…

UNC AI reposted

David Wan

@meetdavidwan

Nov 7

So excited to share this work (which received very high reviewer scores!) led by Sapir! Why wait for a full sentence to find a hallucination? We built a model that detects factual errors on prefixes as they’re generated. This leads to huge faithfulness gains (our 3B model…

Sapir Harary

@SapirHarary

Nov 7

🚨 New paper alert! We’re thrilled to share our new preprint “PrefixNLI: Detecting Factual Inconsistencies as Soon as They Arise” ✨ LLMs generate text one token at a time, but factuality checks still wait for a full sentence. We extend NLI to text prefixes, enabling the…

SapirHarary's tweet image. 🚨 New paper alert!

We’re thrilled to share our new preprint “PrefixNLI: Detecting Factual Inconsistencies as Soon as They Arise” ✨

LLMs generate text one token at a time, but factuality checks still wait for a full sentence.

We extend NLI to text prefixes, enabling the…

UNC AI reposted

Sapir Harary

@SapirHarary

Nov 7

UNC AI reposted

Yue Dong @ NeurIPS 2023

@YueDongCS

Nov 5

[2/2] 🎤 We’re excited to host an amazing lineup of keynote speakers for #NewSumm2025 at #EMNLP2025! Mohit Bansal (UNC) @mohitban47 Greg Durrett (NYU) @gregd_nlp Arman Cohan (Yale) @armancohan Alexander R. Fabbri (Scale AI) Jey Han Lau (Univ. of Melbourne) #NLProc

Yue Dong @ NeurIPS 2023

@YueDongCS

Nov 5

📣 As #EMNLP2025 begins, join the 5th New Frontiers in Summarization Workshop (NewSumm 2025) this Saturday, Nov 8! 🔗 newsumm.github.io/2025/ Explore grounded, multimodal, and long-form summarization with reliable evaluation and efficient LLMs. #NLProc #NLP #LLMs #Summarization

UNC AI reposted

Huaxiu Yao

@HuaxiuYaoML

Nov 6

🚨 Introducing MIRA — a challenging Visual-CoT benchmark that requires multimodal LLMs to draw to think. Even the strongest models (GPT-5, Gemini 2.5 Pro, o3) score > 80% on popular benchmarks, yet collapse to < 20% on MIRA, where reasoning demands visual imagination. Text-only…

HuaxiuYaoML's tweet image. 🚨 Introducing MIRA — a challenging Visual-CoT benchmark that requires multimodal LLMs to draw to think.

Even the strongest models (GPT-5, Gemini 2.5 Pro, o3) score &gt; 80% on popular benchmarks, yet collapse to &lt; 20% on MIRA, where reasoning demands visual imagination.

Text-only…

UNC AI reposted

Elias Stengel-Eskin

@EliasEskin

Nov 5

🚨 Excited to share Gistify! Often the easiest way to understand large/complicated repos is by playing around with test cases and tracing back through the code that is executed. Gistify tasks models with turning a codebase and an entry-point (e.g. command, unit test) into a…

hyunji amy lee

@hyunji_amy_lee

Nov 4

🚨 Excited to announce Gistify!, where a coding agent must extract the gist of a repository: generate a single, executable, and self-contained file that faithfully reproduces the behavior of a given command (e.g., a test or entrypoint). ✅ It is a lightweight, broadly applicable…

hyunji_amy_lee's tweet image. 🚨 Excited to announce Gistify!, where a coding agent must extract the gist of a repository: generate a single, executable, and self-contained file that faithfully reproduces the behavior of a given command (e.g., a test or entrypoint).

✅ It is a lightweight, broadly applicable…

UNC AI reposted

Justin Chih-Yao Chen

@cyjustinchen

Nov 5

I'll be presenting ✨MAgICoRe✨ virtually tonight at 7 PM ET / 8 AM CST (Gather Session 3)! I'll discuss 3 key challenges in LLM refinement for reasoning, and how MAgICoRe tackles them jointly: 1⃣ Over-correction on easy problems 2⃣ Failure to localize & fix its own errors 3⃣…

Mohit Bansal

@mohitban47

Nov 4

🚨 Check out our awesome students/postdocs' papers at #EMNLP2025 and say hi to them 👋! Also, I will give a keynote (virtually) on "Attributable, Conflict-Robust, and Multimodal Summarization with Multi-Source Retrieval" at the NewSumm workshop. -- Jaehong (in-person) finished…

mohitban47's tweet image. 🚨 Check out our awesome students/postdocs' papers at #EMNLP2025 and say hi to them 👋!

Also, I will give a keynote (virtually) on "Attributable, Conflict-Robust, and Multimodal Summarization with Multi-Source Retrieval" at the NewSumm workshop.

-- Jaehong (in-person) finished…

UNC AI reposted

Snigdha Chaturvedi

@snigdhac25

Nov 5

Last week, I gave a talk on Ethical Issues in LLMs at UNC's Parr Center for Ethics. The extended Q&A session was the best part! Thoughtful questions about bias, responsibility & accountability of AI judgment. Inspiring to see people thinking so deeply about ethics in tech. @unccs

UNC AI reposted

Kerem Zaman

@KeremZaman3

Nov 5

don’t forget to stop by my poster during Gather Session 3 at 8 AM Nov 6 (CST) / 7 PM Nov 5 (ET)! #EMNLP2025

Kerem Zaman

@KeremZaman3

Sep 3

🚨 NEW PAPER 🚨 Recent work shows CoT can be unfaithful to true model reasoning. Yet everyone measures faithfulness differently and there's still no systematic comparison. Our #EMNLP2025 paper introduces CausalDiagnosticity, a meta-evaluation framework for faithfulness metrics.…

KeremZaman3's tweet image. 🚨 NEW PAPER 🚨 Recent work shows CoT can be unfaithful to true model reasoning. Yet everyone measures faithfulness differently and there's still no systematic comparison.

Our #EMNLP2025 paper introduces CausalDiagnosticity, a meta-evaluation framework for faithfulness metrics.…

UNC AI reposted

David Wan

@meetdavidwan

Nov 3

🚨 Proud to share our #TACL work on localizing factual inconsistencies in attributable text generation! To find where LLMs hallucinate, we need to get granular. We introduce QASemConsistency, a new method that decomposes text into simple question-answer pairs to precisely…

Arie Cattan

@ArieCattan

Nov 3

LLMs love to hallucinate, but *where* exactly? 🤔 We're thrilled to announce that our paper "Localizing Factual Inconsistencies in Attributable Text Generation" has been accepted to #TACL #nlproc ! 🎉 🧵👇

UNC AI reposted

Daeun Lee

@danadaeun

Nov 4

Even though I cannot attend EMNLP in person, plz enjoy our work Video-Skill-CoT at Suzhou 🇨🇳 Appreciate all of my collaborators and @mohitban47! 😆

Mohit Bansal

@mohitban47

Nov 4

UNC AI reposted

Ziyang Wang

@ZiyangW00

Nov 5

🎉Thanks for the shoutout! I’ll be virtually presenting our new work Video-RTS at #EMNLP2025 (my co-lead @jaeh0ng_yoon will present in person). If you’re into advanced video-reasoning frameworks, check it out: - No SFT, pure RL: trains with simple output-based rewards (GRPO)—no…

Mohit Bansal

@mohitban47

Nov 4

UNC AI reposted

Shoubin Yu @ EMNLP

@shoubin621

Nov 5

Excited to be at #EMNLP2025 in Suzhou! I’ll present our work: (1) MEXA (Fri 12:30 PM CST) about general multimodal reasoning with dynamic multi-expert aggregation and (2) RACCooN (Wed 4:30 PM CST) about editing videos via auto-generated narratives. Please stop by our…

Mohit Bansal

@mohitban47

Nov 4

UNC AI reposted

Elias Stengel-Eskin

@EliasEskin

Nov 4

🚨 I'll be presenting virtually tonight at 7PM ET/ 8AM CST on Gather! I'll be talking about how strong LLMs can exploit loopholes introduced by ambiguous instructions, and what that means for safety! P.s. I am hiring Ph.D. students for my lab at UT Austin CS, applications due…

Mohit Bansal

@mohitban47

Nov 4

UNC AI reposted

Kerem Zaman

@KeremZaman3

Oct 31

unfortunately I won’t be in Suzhou but I’ll be presenting our paper at Gather Session 3 on Nov 6 at 8 AM CST. save the date! #EMNLP2025

Kerem Zaman

@KeremZaman3

Sep 3

UNC AI reposted

Gedas Bertasius

@gberta227

Oct 31

Is language a "terrible abstraction" for video understanding? Many in the video community often dismiss language-driven approaches in favor of complex, video-native solutions. However, I believe this resistance stems more from internal bias—validating a research identity as a…

UNC AI reposted

Canyu Chen

@CanyuChen3

Nov 2

🔥The deadline (Nov 3, 2025 AoE) for 𝐍𝐞𝐮𝐫𝐈𝐏𝐒 𝟐𝟎𝟐𝟓 𝐖𝐨𝐫𝐤𝐬𝐡𝐨𝐩 𝐨𝐧 𝐒𝐨𝐜𝐢𝐚𝐥𝐥𝐲 𝐑𝐞𝐬𝐩𝐨𝐧𝐬𝐢𝐛𝐥𝐞 𝐚𝐧𝐝 𝐓𝐫𝐮𝐬𝐭𝐰𝐨𝐫𝐭𝐡𝐲 𝐅𝐨𝐮𝐧𝐝𝐚𝐭𝐢𝐨𝐧 𝐌𝐨𝐝𝐞𝐥𝐬 (𝐑𝐞𝐬𝐩𝐨𝐧𝐬𝐢𝐛𝐥𝐞𝐅𝐌) is approaching!🔥 📍 Hybrid (Hilton Mexico City Reforma +…

CanyuChen3's tweet image. 🔥The deadline (Nov 3, 2025 AoE) for 𝐍𝐞𝐮𝐫𝐈𝐏𝐒 𝟐𝟎𝟐𝟓 𝐖𝐨𝐫𝐤𝐬𝐡𝐨𝐩 𝐨𝐧 𝐒𝐨𝐜𝐢𝐚𝐥𝐥𝐲 𝐑𝐞𝐬𝐩𝐨𝐧𝐬𝐢𝐛𝐥𝐞 𝐚𝐧𝐝 𝐓𝐫𝐮𝐬𝐭𝐰𝐨𝐫𝐭𝐡𝐲 𝐅𝐨𝐮𝐧𝐝𝐚𝐭𝐢𝐨𝐧 𝐌𝐨𝐝𝐞𝐥𝐬 (𝐑𝐞𝐬𝐩𝐨𝐧𝐬𝐢𝐛𝐥𝐞𝐅𝐌) is approaching!🔥
📍 Hybrid (Hilton Mexico City Reforma +…

UNC AI reposted

Yiyang Zhou

@AiYiyangZ

Nov 4

🚨 BREAKING: AI Can't Actually See Videos. New benchmark shows mainstream LVLMs barely hit 60% accuracy—while humans reach 94.82%. This isn’t a glitch—it’s a fundamental failure in video understanding. LVLMs are doing visual theater, not real comprehension.

AiYiyangZ's tweet image. 🚨 BREAKING: AI Can't Actually See Videos.
New benchmark shows mainstream LVLMs barely hit 60% accuracy—while humans reach 94.82%.
This isn’t a glitch—it’s a fundamental failure in video understanding. LVLMs are doing visual theater, not real comprehension.

UNC AI reposted

Huaxiu Yao

@HuaxiuYaoML

Nov 4

🚨 #EMNLP2025 Oral: AI Can’t Actually “See” Videos - New Benchmark Exposes the Truth LVLMs aren’t thinking with video - they’re performing it. What looks like understanding is just visual theater. Introducing GLIMPSE - a benchmark revealing how today’s models fail when…