Joykirat

@joykiratsingh

CS PhD Student @unc_ai_group @UNC, advised by. @mohitban47 | ex RF @MSFTResearch

joykirat18.github.io

Joined April 2017

451Posts 400Followers 614Following

You might like

@avgupt

@TanejaAryan

@RishitG57144297

@pr3khar

@KuchAlagKar

@d_silent_quill

@seaweeddbrainn

@Itida_99

@jainnandika

@SamyakGupta3

@dhattarwalm0hit

@vsushmita_

@not_bhaskar

@_karanjot

Pinned

Joykirat

@joykiratsingh

Oct 3

🚨 Excited to announce TRAAC, an online difficulty-adaptive, attention-based method that handles the tradeoff of under & overthinking in reasoning models to improve both accuracy and efficiency. Underthinking ❌: Models terminate reasoning too early on harder problems, leading…

joykiratsingh's tweet image. 🚨 Excited to announce TRAAC, an online difficulty-adaptive, attention-based method that handles the tradeoff of under &amp; overthinking in reasoning models to improve both accuracy and efficiency.

Underthinking ❌: Models terminate reasoning too early on harder problems, leading…

Joykirat reposted

Elias Stengel-Eskin

@EliasEskin

11 h

🚨 Excited to share new work on inferring symbolic world models from observations! OneLife can infer world models in stochastic, complex environments by proposing rules via LLM and reweighting code-based environment laws from observations collected in a single interaction…

Zaid Khan

@codezakh

14 h

How can an agent reverse engineer the underlying laws of an unknown, hostile & stochastic environment in “one life”, without millions of steps + human-provided goals / rewards? In our work, we: 1️⃣ infer an executable symbolic world model (a probabilistic program capturing…

Joykirat reposted

Archiki Prasad

@ArchikiPrasad

13 h

🚨 Excited to share our new work ✨ OneLife ✨, which investigates how an agent can infer executable symbolic world models 🌐 from a single unguided trajectory in a stochastic environment. I’m especially excited about our planning + evaluation contributions: 1️⃣ We support…

Zaid Khan

@codezakh

14 h

Joykirat reposted

Zaid Khan

@codezakh

14 h

Joykirat reposted

Shoubin Yu

@shoubin621

Oct 10

🚨 New Paper Alert! Introducing SciVideoBench — a comprehensive benchmark for scientific video reasoning! 🔬SciVideoBench: 1. Spans Physics, Chemistry, Biology & Medicine with authentic experimental videos. 2. Features 1,000 challenging MCQs across three reasoning types:…

shoubin621's tweet image. 🚨 New Paper Alert! Introducing SciVideoBench — a comprehensive benchmark for scientific video reasoning!

🔬SciVideoBench:

1. Spans Physics, Chemistry, Biology &amp; Medicine with authentic experimental videos.

2. Features 1,000 challenging MCQs across three reasoning types:…

Joykirat reposted

Zun Wang

@ZunWang919

Oct 8

🚨 Thrilled to introduce Self-Improving Demonstrations (SID) for Goal-Oriented Vision-and-Language Navigation — a scalable paradigm where navigation agents learn to explore by teaching themselves. ➡️ Agents iteratively generate and learn from their own successful trajectories ➡️…

ZunWang919's tweet image. 🚨 Thrilled to introduce Self-Improving Demonstrations (SID) for Goal-Oriented Vision-and-Language Navigation — a scalable paradigm where navigation agents learn to explore by teaching themselves.

➡️ Agents iteratively generate and learn from their own successful trajectories
➡️…

Joykirat reposted

Hanqi Xiao

@hanqi_xiao

Oct 7

Landed in Montreal 🇨🇦 for #COLM2025 to present my first-author work on task-conditioned mixed-precision quantization: “Task-Circuit Quantization” (Thursday 11am, Poster Session 5). I'm applying to PhD programs this cycle and am excited to chat about this or other interests (LLM…

Mohit Bansal

@mohitban47

Oct 6

@ArchikiPrasad

Oct 3

Models often think too much on easy problems and not enough on harder reasoning problems. Our new method ✨TRAAC✨ fixes this by teaching models to adaptively compress their "thinking budget" to the difficulty of the task during GRPO rollouts. Result? The model uses…

Joykirat

@joykiratsingh

Oct 3

Joykirat reposted

Justin Chih-Yao Chen

@cyjustinchen

Oct 3

Large reasoning models suffer from under-adaptiveness, which underthink on hard problems and overthink on easy ones. TRAAC addresses this by introducing ✨difficulty calibration and attention-based compression✨→ +8.4% accuracy & +36.8% efficiency! 1️⃣ TRAAC adaptively mitigates…