
Chinmaya Andukuri

@chinmaya_mohan

applied research @CapitalOne, previously @StanfordAILab / @StanfordHAI

Chinmaya Andukuri reposted

One of my takeaways from #COLM2025 was that people are thinking a lot about user simulation (I've been thinking about this myself in the context of tutoring!). Really exciting to see this work on the topic 🤩

Simulating user–AI conversations helps us understand how LMs work in multi-turn settings. Prompting LMs like GPT-4o to simulate users is common, but their assistant nature makes it hard to replicate user behavior. We introduce User LMs - trained to be users, not assistants.
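
As a rough illustration of the prompted-simulator baseline the thread contrasts against, here is a minimal sketch of a user-simulation loop, assuming the OpenAI chat API; the prompts and model names are placeholders, not the paper's setup:

```python
# Sketch of the common baseline: prompting an assistant-trained LM
# (e.g. GPT-4o) to play the user in a multi-turn conversation.
# Prompts and model names are placeholders, not the paper's setup.
from openai import OpenAI

client = OpenAI()
SIM_PROMPT = ("You are simulating a human user with this goal: {goal}. "
              "Write only the user's next message, briefly and informally.")

def flip_roles(history):
    # From the simulator's point of view, the assistant's turns are its
    # inputs ("user") and its own past turns are its outputs ("assistant").
    return [{"role": "user" if m["role"] == "assistant" else "assistant",
             "content": m["content"]} for m in history]

def simulate(goal: str, turns: int = 3) -> list[dict]:
    history = []
    for _ in range(turns):
        user_turn = client.chat.completions.create(
            model="gpt-4o",  # the prompted user simulator
            messages=[{"role": "system", "content": SIM_PROMPT.format(goal=goal)}]
                     + flip_roles(history),
        ).choices[0].message.content
        history.append({"role": "user", "content": user_turn})
        assistant_turn = client.chat.completions.create(
            model="gpt-4o-mini",  # the assistant under study
            messages=history,
        ).choices[0].message.content
        history.append({"role": "assistant", "content": assistant_turn})
    return history
```

The failure mode the tweet points at: no matter how the system prompt is worded, the simulator tends to drift back into helpful-assistant behavior (long, polite, over-informative turns), which is what motivates training a dedicated user LM instead.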



have been enjoying dipping my toes into `verifiers` and the @PrimeIntellect Environments Hub - just pushed an eval environment for MultiChallenge (@scale_AI) to the hub. my env: app.primeintellect.ai/dashboard/envi… MultiChallenge main page: scale.com/leaderboard/mu…
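
For anyone curious what running a Hub environment looks like, a rough sketch using `verifiers`' load_environment/evaluate entry points; the environment id and the evaluate() keyword arguments here are assumptions, so check the env's Hub page for the real interface:

```python
# Rough sketch of evaluating a model against a Hub environment with
# `verifiers`. The environment id and evaluate() keyword arguments are
# assumptions - consult the env's page on the Environments Hub.
from openai import OpenAI
import verifiers as vf

client = OpenAI()
env = vf.load_environment("multichallenge")  # id is a guess
results = env.evaluate(client=client, model="gpt-4o-mini", num_examples=10)
print(results)
```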


if you’re at @COLM_conf, come say hi tomorrow and talk to us about LM self-improvement + clarification!

Presenting this tomorrow at @COLM_conf! Poster 36 (11:00 AM-1:00 PM). We’ll have a demo—come along if you want to try our models and talk about multi-turn dialogue!



Chinmaya Andukuri reposted

Constitutional AI showed LMs can learn to follow constitutions by labeling their own outputs. But why can't we just tell a base model the principles of desired behavior and rely on it to act appropriately? Introducing SAMI: Self-Supervised Alignment with Mutual Information!
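
From the abstract, the core quantity seems to be a contrastive (InfoNCE-style) lower bound on the mutual information between constitutions and the model's own responses. A hedged sketch of what that bound could look like, given a batch logprob matrix (my reading, not necessarily the paper's exact objective):

```python
import torch
import torch.nn.functional as F

# Hedged sketch: an InfoNCE-style lower bound on I(constitution; response).
# logp[i, j] = log p_theta(y_i | x, c_j), where response y_i was sampled
# under constitution c_i. This is my reading of the abstract, not
# necessarily the paper's exact objective.
def mi_lower_bound(logp: torch.Tensor) -> torch.Tensor:
    n = logp.size(0)
    targets = torch.arange(n, device=logp.device)
    # Each response should be most likely under its own constitution,
    # contrasted against the other constitutions in the batch (and
    # symmetrically for each constitution over responses).
    row_loss = F.cross_entropy(logp, targets)      # pick c_i given y_i
    col_loss = F.cross_entropy(logp.t(), targets)  # pick y_i given c_i
    return -(row_loss + col_loss) / 2              # maximize this bound
```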


Chinmaya Andukuri reposted

Excited to share OffTheRails: A moral reasoning benchmark beyond trolley problems! We present a simple prompting pipeline for generating moral reasoning evaluations with language models using causal templates 🔵→🟠
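
The tweet doesn't spell out the templates, but a causal-template pipeline plausibly works like this: fill slots in an abstract cause-effect skeleton, then have an LM render the concrete scenario. A toy sketch with invented slots (not the paper's actual templates):

```python
# Toy sketch of a causal-template prompting pipeline. The template, its
# slots, and the rendering prompt are invented for illustration and are
# not the paper's actual templates.
from openai import OpenAI

client = OpenAI()
TEMPLATE = ("An agent ({agent}) takes an action ({action}) that causes "
            "a side effect ({effect}) while pursuing a goal ({goal}).")

def generate_scenario(slots: dict[str, str]) -> str:
    skeleton = TEMPLATE.format(**slots)
    prompt = ("Turn this causal skeleton into a short, concrete moral "
              f"dilemma, then ask whether the action was permissible:\n{skeleton}")
    return client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": prompt}],
    ).choices[0].message.content

print(generate_scenario({"agent": "a hospital administrator",
                         "action": "reallocating a ventilator",
                         "effect": "one patient's condition worsening",
                         "goal": "saving two incoming patients"}))
```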


Chinmaya Andukuri reposted

Language models struggle to search, not due to an architecture problem, but a data one! They rarely see how to search or backtrack. We show how LLMs can be taught to search by representing the process of search in language as a flattened string, a stream of search (SoS)!
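
Concretely, "search as a flattened string" can be pictured with a toy example: run a depth-first search and serialize every expansion, dead end, and backtrack into one string an LM could be trained on. The trace format below is invented for illustration (the paper's task and format differ):

```python
# Toy "stream of search": serialize a depth-first search over a tiny
# arithmetic puzzle - including dead ends and backtracking - as one flat
# string. The trace format is invented; see the paper for the real setup.
def search(nums, target, trace):
    trace.append(f"state {sorted(nums)}")
    if target in nums:
        trace.append(f"goal {target}")
        return True
    if len(nums) == 1:
        trace.append("dead end, backtrack")
        return False
    for i in range(len(nums)):
        for j in range(len(nums)):
            if i == j:
                continue
            rest = [nums[k] for k in range(len(nums)) if k not in (i, j)]
            for op, fn in (("+", lambda a, b: a + b), ("*", lambda a, b: a * b)):
                trace.append(f"try {nums[i]}{op}{nums[j]}={fn(nums[i], nums[j])}")
                if search(rest + [fn(nums[i], nums[j])], target, trace):
                    return True
    trace.append("dead end, backtrack")
    return False

trace = []
search([2, 3, 5], 17, trace)
print(" | ".join(trace))  # the flattened search string, failures included
```

Training on such strings exposes the model to unsuccessful branches and recovery, which plain solution-only data never shows it.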


Chinmaya Andukuri reposted

Multi-turn interactive RL should be a bigger focus. Current methods are not well-suited for this - e.g. PPO generally can't train with a user in the loop, and offline Q-learning still does not work at scale. It's interesting to see more work in that direction.

When prompting language models to complete a task, users often leave important things unsaid. Can language models teach themselves to ask clarifying questions? In STaR-GATE, we explore LMs' ability to self-improve by rewarding the model for generating useful questions!
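
Mechanically, the tweet suggests a STaR-style loop: sample clarifying questions, keep the ones whose eventual answer beats answering blind, and finetune on those. A schematic sketch with placeholder callables (not the paper's implementation):

```python
# Schematic sketch of a STaR-style loop for clarifying questions, inferred
# from the tweet alone: reward a question when the answer it enables beats
# answering without it. All callables and the scorer are placeholders.
from typing import Callable, NamedTuple

class Task(NamedTuple):
    prompt: str       # under-specified user request
    hidden_pref: str  # what the user wants but left unsaid
    gold: str         # reference answer reflecting the hidden preference

def collect_finetune_data(
    ask: Callable[[str], str],           # LM: prompt -> clarifying question
    reply: Callable[[str, str], str],    # simulated user: (question, hidden_pref) -> answer
    respond: Callable[[str], str],       # LM: context -> final response
    score: Callable[[str, str], float],  # similarity of response to gold
    tasks: list[Task],
) -> list[tuple[str, str]]:
    kept = []
    for t in tasks:
        q = ask(t.prompt)
        ctx = f"{t.prompt}\nQ: {q}\nA: {reply(q, t.hidden_pref)}"
        # Keep the question only if it actually improved the final response.
        if score(respond(ctx), t.gold) > score(respond(t.prompt), t.gold):
            kept.append((t.prompt, q))
    return kept  # finetune the model on these (prompt -> question) pairs
```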



Chinmaya Andukuri reposted

New work where language models learn to ask questions? So they can better understand user needs? With an amazing method name? Oh, yes!

When prompting language models to complete a task, users often leave important things unsaid. Can language models teach themselves to ask clarifying questions? In STaR-GATE, we explore LMs' ability to self-improve by rewarding the model for generating useful questions!



really enjoyed working on STaR-GATE! thanks to the team + many others for helpful discussions 🚀 check out the arXiv here: arxiv.org/abs/2403.19154

When prompting language models to complete a task, users often leave important things unsaid. Can language models teach themselves to ask clarifying questions? In STaR-GATE, we explore LMs' ability to self-improve by rewarding the model for generating useful questions!


