feiliu_nlp's profile picture. Associate professor @EmoryUniversity. Working on large language models, LLM inference, reasoning, natural language generation, and various aspects of GenAI.

Fei Liu

@feiliu_nlp

Associate professor @EmoryUniversity. Working on large language models, LLM inference, reasoning, natural language generation, and various aspects of GenAI.

Pinned

🏆 Thrilled that our paper #PlanGenLLMs (arxiv.org/abs/2502.11221) won the SAC Award at #ACL2025!! Couldn't have done it without the amazing team: @HuiWei15, Zihao Zhang, Shenghua He, Tian Xia, and Shijia Pan. So thankful and beyond proud! 💖 #ACL2025NLP #NLProc 🧠 Planning is…

feiliu_nlp's tweet image. 🏆 Thrilled that our paper #PlanGenLLMs  (arxiv.org/abs/2502.11221) won the SAC Award at #ACL2025!!

Couldn't have done it without the amazing team: @HuiWei15, Zihao Zhang, Shenghua He, Tian Xia, and Shijia Pan. So thankful and beyond proud! 💖 #ACL2025NLP #NLProc

🧠 Planning is…

I'm happy to be one of the NAACL Board candidates this year! NAACL helps shape the direction of the NLP community across the Americas, and board members play a key role in that mission. As Agentic AI/LLMs reshapes the field, what do you think should be our top priorities?


Overwhelmed by the flood of papers? I highly recommend these must-read technical reports: K2, Qwen3, Qwen2.5-Omni, V3, and Claude 4. Huge thanks to the engineering teams who've done the heavy lifting, so we can learn from their insights! - Kimi K2: arxiv.org/abs/2507.20534 -…


Missed the Berkeley #AgenticAI Summit in person last week, and grateful to the organizers for making it available online (rdi.berkeley.edu/events/agentic…)! I found these talks especially fascinating: - Chi Wang (@Chi_Wang_) on multi-agent orchestration and key insights - Sergey Levine…


Fei Liu reposted

📢 Call for Papers: NewSumm 2025 - The 5th New Frontiers in Summarization Workshop at EMNLP 2025 The summarization research community is invited to submit to NewSumm 2025, co-located with EMNLP 2025! As LLMs continue to transform our field, we're expanding beyond traditional…

ruizhang_nlp's tweet image. 📢 Call for Papers: NewSumm 2025 - The 5th New Frontiers in Summarization Workshop at EMNLP 2025

The summarization research community is invited to submit to NewSumm 2025, co-located with EMNLP 2025! As LLMs continue to transform our field, we're expanding beyond traditional…

Loving the thoughtful scheduling at #ACL2025! The 14:00–15:30 CEST afternoon slot for in-person oral presentations is perfect for attendees from around the world. Kudos to the organizers! ❤️ @aclmeeting #NLProc #ACL2025NLP

Today at ACL 2025 in Vienna! 🇦🇹 Don't miss in person orals 🎙️ from 14:00 - 15:30 CEST across Levels 1 & 2. Cutting-edge research presentations from leading NLP experts! #ACL2025NLP #NLProc #OralPresentations



Day 1 at #ACL2025 in Vienna done! Presented both an oral (Session 3, 2-3:30pm) and a poster (Session 5, 6-7:30pm). So nice catching up with old friends and meeting new ones. Grateful for all the conversations. Excited for the rest of the conference! #ACL2025NLP

feiliu_nlp's tweet image. Day 1 at #ACL2025 in Vienna done! Presented both an oral (Session 3, 2-3:30pm) and a poster (Session 5, 6-7:30pm). So nice catching up with old friends and meeting new ones. Grateful for all the conversations. Excited for the rest of the conference! #ACL2025NLP

Just watched John Oliver's episode on AI Slop and loved it. He describes AI slop as the flood of low-quality, AI-generated content: music, images, short videos, maybe even news articles, books, research papers, code... you name it. Feels like we're (almost) drowning in it. Real…

feiliu_nlp's tweet image. Just watched John Oliver's episode on AI Slop and loved it.

He describes AI slop as the flood of low-quality, AI-generated content: music, images, short videos, maybe even news articles, books, research papers, code... you name it. Feels like we're (almost) drowning in it. Real…

Fei Liu reposted

try creating reels from longer videos with reka vision: app.reka.ai/vision/reels watch the video below or check out this user guide for more details reka.ai/reel-generatio… we are adding more prompt-based editing capabilities very soon!


Happy to share our paper got selected as an Oral Presentation at #ACL2025! Out of 8,000+ submissions and 3,000+ accepted papers, only 245 were chosen for oral (<3%)! Our amazing first author Hui Wei can't travel, so I'll be presenting in Vienna. Hope to see you there! 🇦🇹 📄…

feiliu_nlp's tweet image. Happy to share our paper got selected as an Oral Presentation at #ACL2025! 

Out of 8,000+ submissions and 3,000+ accepted papers, only 245 were chosen for oral (&amp;lt;3%)!

Our amazing first author Hui Wei can&apos;t travel, so I&apos;ll be presenting in Vienna. Hope to see you there! 🇦🇹 

📄…

Fei Liu reposted

I’m looking for a new postdoc to start this fall working on AI for Science/Science-Inspired AI (focusing on chemistry and bioengineering domains for now). Please drop me a CV if interested.


Fei Liu reposted

Pokémon Red has recently emerged as an evaluation benchmark, adopted by several top AI labs. But is it really a good benchmark for evaluating LLM capabilities or guiding LLM research? We wrote this blog to dive into the challenges, surface the opportunities, and introduce…

🔥 Pokémon Red is becoming a go-to benchmark for testing advanced AIs such as Gemini. But is Pokémon Red really a good eval? We study this problem and identify three issues: 1️⃣ Navigation tasks are too hard. 2️⃣ Combat control is too simple. 3️⃣ Raising a strong Pokémon team is…



Autonomous agents are powerful, but without guardrails, they drift into inefficiency. We view 'cost' as a form of guardrail and use Monte Carlo Tree Search with explicit cost-awareness to guide LLM-based planning. Tight cost constraints push the planner to quickly identify…

feiliu_nlp's tweet image. Autonomous agents are powerful, but without guardrails, they drift into inefficiency.

We view &apos;cost&apos; as a form of guardrail and use Monte Carlo Tree Search with explicit cost-awareness to guide LLM-based planning.

Tight cost constraints push the planner to quickly identify…

Anthropic staff realized they could ask Claude to buy things that weren’t just food & drink. After someone randomly decided to ask it to order a tungsten cube, Claude ended up with an inventory full of (as it put it) “specialty metal items” that it ended up selling at a loss.

AnthropicAI's tweet image. Anthropic staff realized they could ask Claude to buy things that weren’t just food &amp;amp; drink. 

After someone randomly decided to ask it to order a tungsten cube, Claude ended up with an inventory full of (as it put it) “specialty metal items” that it ended up selling at a loss.


Just watched Nathan's recent talk and really enjoyed it. Around 19:30 is where it gets really interesting. Totally agree that planning is the exciting frontier. If you're curious, our recent survey #PlanGenLLMs is a great place to start: arxiv.org/pdf/2502.11221…

Here's a recent talk I gave recapping the last 6-12 months of AI progress, why getting perfect models is hard, how labs are likely approaching the next phase of training (for agents), and other interesting tidbits across the reasoning landscape. Topics: 00:00 Introduction & the…



Fei Liu reposted

Q-learning is not yet scalable seohong.me/blog/q-learnin… I wrote a blog post about my thoughts on scalable RL algorithms. To be clear, I'm still highly optimistic about off-policy RL and Q-learning! I just think we haven't found the right solution yet (the post discusses why).

seohong_park's tweet image. Q-learning is not yet scalable

seohong.me/blog/q-learnin…

I wrote a blog post about my thoughts on scalable RL algorithms.

To be clear, I&apos;m still highly optimistic about off-policy RL and Q-learning! I just think we haven&apos;t found the right solution yet (the post discusses why).

Fei Liu reposted

My high-level take on why multimodal reasoning is fundamentally harder than text-only reasoning: Language is structured and directional, while images are inherently unstructured—you can start reasoning from anywhere. This visual freedom makes step-by-step logical inference much…

zhuokaiz's tweet image. My high-level take on why multimodal reasoning is fundamentally harder than text-only reasoning: Language is structured and directional, while images are inherently unstructured—you can start reasoning from anywhere. This visual freedom makes step-by-step logical inference much…

Revisited @andy_l_jones's RL debugging post from a few years back. Still one of the most insightful guides out there. If your agent's acting weird, here's a great checklist: andyljones.com/posts/rl-debug…

feiliu_nlp's tweet image. Revisited @andy_l_jones&apos;s RL debugging post from a few years back. Still one of the most insightful guides out there. If your agent&apos;s acting weird, here&apos;s a great checklist: andyljones.com/posts/rl-debug…

✨ Our paper #PlanGenLLMs: A Modern Survey of LLM Planning Capabilities (arxiv.org/pdf/2502.11221) is accepted to the #ACL2025 main conference! Huge thanks to the reviewers for the unanimous 4-4-4 reviews and meta score ❤️ Grateful for your thoughtful feedback! #ACL2025 #NLProc

feiliu_nlp's tweet image. ✨ Our paper #PlanGenLLMs: A Modern Survey of LLM Planning Capabilities (arxiv.org/pdf/2502.11221) is accepted to the #ACL2025 main conference! 

Huge thanks to the reviewers for the unanimous 4-4-4 reviews and meta score ❤️ Grateful for your thoughtful feedback! #ACL2025 #NLProc

Loading...

Something went wrong.


Something went wrong.