Zhuang_Li_NLP's profile picture. PhD, Prev Research Fellow @MonashUni | Prev SDE @Microsoft | NLP | Contribution @BigCodeProject | Lecturer @RMIT | ARR Area Chair

Zhuang Li

@Zhuang_Li_NLP

PhD, Prev Research Fellow @MonashUni | Prev SDE @Microsoft | NLP | Contribution @BigCodeProject | Lecturer @RMIT | ARR Area Chair

Social Event at EMNLP Suzhou, China!


Zhuang Li reposted

Modern LLMs create a dilemma: powerful models are costly, cheap models are unreliable. xRouter is a learned routing system trained via reinforcement learning with explicit cost-performance trade-offs. 🤖 Paper: bit.ly/49e0v3E Key results: substantial cost reductions…

SFResearch's tweet image. Modern LLMs create a dilemma: powerful models are costly, cheap models are unreliable.

xRouter is a learned routing system trained via reinforcement learning with explicit cost-performance trade-offs. 🤖

Paper: bit.ly/49e0v3E

Key results: substantial cost reductions…
SFResearch's tweet image. Modern LLMs create a dilemma: powerful models are costly, cheap models are unreliable.

xRouter is a learned routing system trained via reinforcement learning with explicit cost-performance trade-offs. 🤖

Paper: bit.ly/49e0v3E

Key results: substantial cost reductions…
SFResearch's tweet image. Modern LLMs create a dilemma: powerful models are costly, cheap models are unreliable.

xRouter is a learned routing system trained via reinforcement learning with explicit cost-performance trade-offs. 🤖

Paper: bit.ly/49e0v3E

Key results: substantial cost reductions…
SFResearch's tweet image. Modern LLMs create a dilemma: powerful models are costly, cheap models are unreliable.

xRouter is a learned routing system trained via reinforcement learning with explicit cost-performance trade-offs. 🤖

Paper: bit.ly/49e0v3E

Key results: substantial cost reductions…

Zhuang Li reposted

Happy to announce that my #EMNLP2025 paper Humanizing Machines: Rethinking LLM Anthropomorphism Through a Multi-Level Framework of Design have finally made it to arxiv! This work REDEFINES anthropomorphism in LLM!! arxiv.org/abs/2508.17573…


Zhuang Li reposted

✨We are thrilled to announce that over 3200 papers have been accepted to #EMNLP2025 ✨ This includes over 1800 main conference papers and over 1400 papers in findings! Congratulations to all authors!! 🎉🎉🎉


Zhuang Li reposted

Yeah, we won an award.#ACL2025 😉😉😉

We start with the Outstanding Papers (1/6)

aclmeeting's tweet image. We start with the Outstanding Papers (1/6)


Zhuang Li reposted

Tbh I'm happy to see ChatGPT’s downloads reaching major social media apps combined! Beyond work (startup & coding), it's been a life-changer for me: - Cured my dizziness. Two doctors couldn't help, but ChatGPT suggested electrolyte water. It worked! - fixed my e-bike myself. New…


🎉 Our paper SCAR (Style Consistency-Aware Ranking) is accepted to ACL 2025 as a poster presentation! See you in Vienna! 🚀 In the best case, using just 0.7% of the original data, SCAR enables OLMo-7B to outperform its full-data fine-tuned counterpart and consistently surpasses…


Zhuang Li reposted

BESSTIE is a new dataset for sentiment and sarcasm classification of Australian, Indian and British English. Do check it out! #ACL2025


Zhuang Li reposted

Keep those reviews coming in! We are currently up to 78% of papers with three reviews submitted. ▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓▓░░░░░░░ 78%


📢 ACL 2025 | LAZYREVIEW (Meta Score: 5/5!) Is peer review always fair and constructive? We present LAZYREVIEW: the first dataset that exposes lazy thinking in NLP reviews: vague rejections like “Not novel enough” or “Only tested on German” without solid justification. 💡 What…


Zhuang Li reposted

#ACL2025 #NLProc BESSTIE comes to ACL. See you in Vienna!! @UNSWCOMPUTING

aadi_joshi's tweet image. #ACL2025 #NLProc BESSTIE comes to ACL. See you in Vienna!! @UNSWCOMPUTING

6 papers accepted to ACL 2025 main track! Work mainly focuses on LLM safety, global diversity, and low-resource NLP. Thanks to all amazing collaborators—see you in Vienna! More details on papers soon.


Zhuang Li reposted

🌏 New position paper analyzes 2,000+ multilingual LLM benchmarks from 148 countries (2021-2024) and reveals key challenges in multilingual AI evaluation. 🌺

wangly0229's tweet image. 🌏 New position paper analyzes 2,000+ multilingual LLM benchmarks from 148 countries (2021-2024) and reveals key challenges in multilingual AI evaluation. 🌺

Zhuang Li reposted

🥳🥳🥳New dataset: huggingface.co/datasets/mingh… We recently received some free compute, so we created a synthetic dataset of 10M realistic personas using meta-llama/Llama-3.3-70B-Instruct & Qwen/Qwen2.5-72B-Instruct. Each persona includes features like name, DOB, personality, and…


Zhuang Li reposted

Can LLMs spot signs of self-harm in multilingual, culturally nuanced contexts? JiraiBench is our new benchmark focusing on the Jirai (地雷系) subculture in Japan & China—offering a hard testbed for LLMs on detection + reasoning. arxiv.org/pdf/2503.21679


Zhuang Li reposted

Hiring Lecturer in NLP: jobs.surrey.ac.uk/vacancy.aspx?r…. Lectureship position aligned with Nature Inspired Computing and Engineering (NICE) research group (lnkd.in/ew6xpiJm) within the CS. Happy to be contacted for discussing our research/strengths. #nlproc


Just finished wrapping up 6 submissions in ACL. Being both an AC and a reviewer, I guess my review load will be exploding this time…


Zhuang Li reposted

Hao's PhD research in audio-safety red teaming of LLMs has now extended into a new exciting direction in his latest #NAACL2025 paper. In his recent work "Audio Is the Achilles' Heel: Red Teaming Audio Large Multimodal Models" we ask the following questions: (1) Do text-only LLMs…

Speech (or audio to be more specific) related safety is literally unexplored beyond content. If the focus is only placed on safeguarding "what" is being said but not "how" it is sad or in "which context" it is said, then we are left with very weak safety measures for speech. A…

EhsanShareghi's tweet image. Speech (or audio to be more specific) related safety is literally unexplored beyond content. If the focus is only placed on safeguarding "what" is being said but not "how" it is sad or in "which context" it is said, then we are left with very weak safety measures for speech.  A…


Thrilled to share that our paper “CultureInstruct: Curating Large-Scale Multi-Cultural Instructions” has been accepted to #NAACL2025 main! 🎉 It introduces a novel data synthesis method for reducing cultural bias in LLMs. Stay tuned-preprint coming soon on arXiv! 🚀 #AI #NLP

Zhuang_Li_NLP's tweet image. Thrilled to share that our paper “CultureInstruct: Curating Large-Scale Multi-Cultural Instructions” has been accepted to #NAACL2025 main! 🎉 It introduces a novel data synthesis method for reducing cultural bias in LLMs. Stay tuned-preprint coming soon on arXiv! 🚀 #AI #NLP

Loading...

Something went wrong.


Something went wrong.