Robert Vacareanu

@robert_nlp

PhD from @UofArizona Working on #nlproc Past: 2022, 2023: Applied Scientist Intern (@AWS)

於四月 2022 加入

125貼文 256位跟隨者 2K個跟隨中

你可能會喜歡

@nikita_moghe

@DanRothNLP

@preslav_nakov

@agostina_cal

@mjjzha

@DennisFucci

@xianjun_agi

@atanasovapepa

@noriyuki_kojima

@gradient_step

@_salvatoregreco

@juliencolin_

@BurlyWilder

@PDufter

@nlp_pranav

Robert Vacareanu 已轉發

Bing Liu

@vbingliu

年10月1日

New @Scale_AI paper! The culprit behind reward hacking? We trace it to misspecification in high-reward tail. Our fix: rubric-based rewards to tell “excellent” responses apart from “great.” The result: Less hacking, stronger post-training! arxiv.org/pdf/2509.21500

vbingliu's tweet image. New @Scale_AI paper!

The culprit behind reward hacking? We trace it to misspecification in high-reward tail.

Our fix: rubric-based rewards to tell “excellent” responses apart from “great.”

The result: Less hacking, stronger post-training! arxiv.org/pdf/2509.21500

Robert Vacareanu 已轉發

Francesco Orabona

@bremen79

年5月28日

As promised, we put on Arxiv the proof we did with Gemini. arxiv.org/pdf/2505.20219 This shows that the Polyak stepsize not only will not reach the optimum, but it can cycle, when used without the knowledge of f*. Gemini failed when prompted directly ("Find an example where the…

bremen79's tweet image. As promised, we put on Arxiv the proof we did with Gemini. arxiv.org/pdf/2505.20219

This shows that the Polyak stepsize not only will not reach the optimum, but it can cycle, when used without the knowledge of f*.

Gemini failed when prompted directly ("Find an example where the…

Francesco Orabona

@bremen79

年4月22日

This is a turning point: I just proved a complex math result useful for my research using an LLM. I am not sure if I should be happy or scared...

Robert Vacareanu 已轉發

MohammadHossein Rezaei

@mhrezaeics

年4月30日

If you’re at NAACL today, I’ll be presenting this poster in Hall 3 from 2:00 – 3:30 PM. Paper link: aclanthology.org/2025.naacl-lon…

MohammadHossein Rezaei

@mhrezaeics

年2月13日

1/🚨 Thrilled to share that our paper (w/ @eduardo_nlp), "Making Language Models Robust Against Negation," has been accepted to the #NAACL2025 main conference! 🎉 #Negation has always been a challenge for language models. Here's our self-supervised method to tackle this issue:

mhrezaeics's tweet image. 1/🚨 Thrilled to share that our paper (w/ @eduardo_nlp), "Making Language Models Robust Against Negation," has been accepted to the #NAACL2025 main conference! 🎉

#Negation has always been a challenge for language models. Here's our self-supervised method to tackle this issue:

Robert Vacareanu 已轉發

Francesco Orabona

@bremen79

年4月22日

This is a turning point: I just proved a complex math result useful for my research using an LLM. I am not sure if I should be happy or scared...

Robert Vacareanu 已轉發

Stanford NLP Group

@stanfordnlp

年4月4日

Look who we found hanging out in her new @StanfordEng Gates Computer Science office! We’re truly delighted to welcome @YejinChoinka as a new @stanfordnlp faculty member, starting full-time in September. ❤️ nlp.stanford.edu/people/

stanfordnlp's tweet image. Look who we found hanging out in her new @StanfordEng Gates Computer Science office!

We’re truly delighted to welcome @YejinChoinka as a new @stanfordnlp faculty member, starting full-time in September. ❤️

nlp.stanford.edu/people/

Robert Vacareanu 已轉發

Zifan (Sail) Wang

@_zifan_wang

年3月28日

Exciting that @scale_AI is sponsoring Agent Workshop at CMU in April. Students and researchers who work on agents feel free to visit CMU to present your work! I will also be traveling to Pittsburgh to share my recent focuses on agents, both capability and safety.

_zifan_wang's tweet image. Exciting that @scale_AI is sponsoring Agent Workshop at CMU in April. Students and researchers who work on agents feel free to visit CMU to present your work! I will also be traveling to Pittsburgh to share my recent focuses on agents, both capability and safety.

Faria Huq | 🦋: fariahuqoaishi

@FariaHuqOaishi

年3月11日

📢 Join us at the CMU Agent Workshop 2025, April 10-11! Don't miss our esteemed invited speakers: - Qingyun Wu (PSU) - Diyi Yang (Stanford) - Aviral Kumar (CMU) - Graham Neubig (CMU) ...and many more to come! To register, visit: cmu-agent-workshop.github.io

Robert Vacareanu 已轉發

Amanda Bertsch

@abertsch72

年3月6日

coming to a NAACL 2025 near you! 🌞 Looking forward to discussing with folks in Albuquerque :) The camera-ready is on arxiv now, with more models, more tasks, and more compared settings-- including results comparing ICL to full finetuning! arxiv.org/abs/2405.00200

Amanda Bertsch

@abertsch72

2024年5月3日

In-context learning provides an LLM with a few examples to improve accuracy. But with long-context LLMs, we can now use *thousands* of examples in-context. We find that this long-context ICL paradigm is surprisingly effective– and differs in behavior from short-context ICL! 🧵

abertsch72's tweet image. In-context learning provides an LLM with a few examples to improve accuracy. But with long-context LLMs, we can now use *thousands* of examples in-context.

We find that this long-context ICL paradigm is surprisingly effective– and differs in behavior from short-context ICL! 🧵

Robert Vacareanu 已轉發

Prateek Yadav

@prateeky2806

年3月4日

Excited to share our work on RSQ — enhancing quantization by focusing on the most impactful tokens. - Rotate, Scale, Quantize: delivering strong performance - Dynamic, attention-based token importance drives better efficiency - Results across LLaMA3, Mistral, Qwen-2.5, and more

Yi Lin Sung

@yilin_sung

年3月4日

🚀 New Paper: RSQ: Learning from Important Tokens Leads to Better Quantized LLMs We show that not all tokens should be treated equally during quantization. By prioritizing important tokens through a three-step process—Rotate, Scale, and Quantize—we achieve better-quantized…

yilin_sung's tweet image. 🚀 New Paper: RSQ: Learning from Important Tokens Leads to Better Quantized LLMs

We show that not all tokens should be treated equally during quantization. By prioritizing important tokens through a three-step process—Rotate, Scale, and Quantize—we achieve better-quantized…

Robert Vacareanu 已轉發

Diyi Yang

@Diyi_Yang

年3月4日

Check out 🔥 EgoNormia: a benchmark for physical social norm understanding egonormia.org Can we really trust VLMs to make decisions that align with human norms? 👩‍⚖️ With EgoNormia, a 1800 ego-centric video 🥽 QA benchmark, we show that this is surprisingly challenging…

Robert Vacareanu 已轉發

MohammadHossein Rezaei

@mhrezaeics

年3月4日

🔥 Excited to share EgoNormia! A benchmark for physical social norm understanding. Can we really trust VLMs to make decisions that align with human norms? 🌐 Check out our website for the answer: egonormia.org Proud to be part of this amazing team! 🚀

mhrezaeics's tweet card. A large scale video dataset and a benchmark for evaluating frontier models' understanding of physical social norms through videos.

EgoNormia: A Benchmark for Embodied Normative Reasoning

來源: opensocial.world

Diyi Yang

@Diyi_Yang

年3月4日

Robert Vacareanu 已轉發

Tanmoy Chakraborty

@Tanmoy_Chak

年2月22日

**Kindly consider sharing the post** We are seeking opinions about the current quality of reviewing in *CL conferences. We (@emnlpmeeting PCs along with @ReviewAcl EiCs) are committed to improving the review quality. We are bringing a series of changes in the review process.…

Christos Christodoulopoulos

@c_christodoulop

年2月21日

Do you have opinions about the current state of reviewing at *CL conferences? Do you want to help? We (@emnlpmeeting PCs) want to hear from you: forms.office.com/r/P68uvwXYqf

Please fill out this form

來源: forms.office.com

Robert Vacareanu 已轉發

Mihai Surdeanu

@msurd

年2月16日

Our new paper in Findings of NAACL 2025, with Vlad Negru, @robert_nlp, @CameliaLemnaru, and Rodica Potolea, proposes a new, softer take on Natural Logic, where alignment is generated through text morphing. This yields robust performance cross domain. arxiv.org/abs/2502.09567

Robert Vacareanu 已轉發

Zifan (Sail) Wang

@_zifan_wang

年2月11日

🧵 1/N) Excited to share our recent work at @scale_AI, "Jailbreaking to Jailbreak (J2)".😈 We present a novel LLM-as-red-teamer approach in which a human jailbreaks a refusal-trained LLM to make it willing to jailbreak itself or other LLMs. We refer to this process as…

_zifan_wang's tweet image. 🧵 1/N) Excited to share our recent work at @scale_AI, "Jailbreaking to Jailbreak (J2)".😈 We present a novel LLM-as-red-teamer approach in which a human jailbreaks a refusal-trained LLM to make it willing to jailbreak itself or other LLMs. We refer to this process as…

Robert Vacareanu 已轉發

Jacob Andreas

@jacobandreas

年12月16日

Is your CS dept worried about what academic research should be in the age of LLMs? Hire one of my lab members! Leshem Choshen (@LChoshen), Pratyusha Sharma (@pratyusha_PS) and Ekin Akyürek (@akyurekekin) are all on the job market with unique perspectives on the future of NLP: 🧵

Robert Vacareanu 已轉發

Summer Yue

@summeryue0

年12月18日

🚀Big update: 4 new SEAL multilingual leaderboards are LIVE — Arabic, Chinese, Japanese, and Korean! 🌍 Arabic: Gemini 1.5 Pro (gemini-exp-1121) leads the pack 🏮 Chinese: Gemini 1.5 Pro (gemini-1.5-pro-exp-0827) holds the crown 💫 Japanese & Korean: o1-preview dominates 📊 See…

Robert Vacareanu 已轉發

Summer Yue

@summeryue0

年12月18日

SEAL Visual-Understanding Leaderboard Launch 🏆 Today, we’re introducing VISTA—a new rubric-based visual task assessment benchmark that pushes beyond simple Q&A. The leading models achieve under 40% on this eval, compared to a human baseline of ~55.4%. This highlights that…

summeryue0's tweet image. SEAL Visual-Understanding Leaderboard Launch 🏆

Today, we’re introducing VISTA—a new rubric-based visual task assessment benchmark that pushes beyond simple Q&amp;A.

The leading models achieve under 40% on this eval, compared to a human baseline of ~55.4%. This highlights that…

Robert Vacareanu 已轉發

maharshi

@mrsiipa

年11月24日

what an amazing read: converting json to regex then regex to finite state machines, and then optimising it is brilliant!

Robert Vacareanu 已轉發

Prateek Yadav

@prateeky2806

年11月7日

I'm on the job market! Please reach out if you are looking to hire someone to work on - RLHF - Efficiency - MoE/Modular models - Synthetic Data - Test time compute - other phases of pre/post-training. If you are not hiring then I would appreciate a retweet! More details👇

Robert Vacareanu 已轉發

Roberta Raileanu

@robertarail

年10月19日

I’m looking for a PhD intern for next year to work at the intersection of LLM-based agents and open-ended learning, part of the Llama Research Team in London. If interested please send me an email with a short paragraph with some research ideas and apply at the link below.

Robert Vacareanu 已轉發

Sharon Levy

@sharonlevy21

2024年10月11日

Come join our reading group!

Rutgers Computer Science Department

@RutgersCS

2024年10月11日

New Reading Group! Trustworthy AI Reading Group; dedicated to exploring critical topics in AI, including fairness, security, explainability, and their broader applications. Open to CS undergraduate, master’s, and PhD students. ruixiangtang.net/teaching-mento…