Joykirat

@joykiratsingh

CS PhD Student @unc_ai_group @UNC, advised by. @mohitban47 | ex RF @MSFTResearch

joykirat18.github.io

Tham gia vào Tháng 4 2017

451Bài đăng 400Người theo dõi 614Đang theo dõi

Bạn có thể thích

@avgupt

@TanejaAryan

@RishitG57144297

@pr3khar

@KuchAlagKar

@d_silent_quill

@seaweeddbrainn

@Itida_99

@jainnandika

@SamyakGupta3

@dhattarwalm0hit

@vsushmita_

@not_bhaskar

@_karanjot

Ghim

Joykirat

@joykiratsingh

3 thg 10

🚨 Excited to announce TRAAC, an online difficulty-adaptive, attention-based method that handles the tradeoff of under & overthinking in reasoning models to improve both accuracy and efficiency. Underthinking ❌: Models terminate reasoning too early on harder problems, leading…

joykiratsingh's tweet image. 🚨 Excited to announce TRAAC, an online difficulty-adaptive, attention-based method that handles the tradeoff of under &amp; overthinking in reasoning models to improve both accuracy and efficiency.

Underthinking ❌: Models terminate reasoning too early on harder problems, leading…

Joykirat đã đăng lại

Elias Stengel-Eskin

@EliasEskin

20 giờ

🚨 Excited to share new work on inferring symbolic world models from observations! OneLife can infer world models in stochastic, complex environments by proposing rules via LLM and reweighting code-based environment laws from observations collected in a single interaction…

Zaid Khan

@codezakh

24 giờ

How can an agent reverse engineer the underlying laws of an unknown, hostile & stochastic environment in “one life”, without millions of steps + human-provided goals / rewards? In our work, we: 1️⃣ infer an executable symbolic world model (a probabilistic program capturing…

Joykirat đã đăng lại

Archiki Prasad

@ArchikiPrasad

23 giờ

🚨 Excited to share our new work ✨ OneLife ✨, which investigates how an agent can infer executable symbolic world models 🌐 from a single unguided trajectory in a stochastic environment. I’m especially excited about our planning + evaluation contributions: 1️⃣ We support…

Zaid Khan

@codezakh

24 giờ

Joykirat đã đăng lại

Zaid Khan

@codezakh

24 giờ

Joykirat đã đăng lại

Shoubin Yu

@shoubin621

10 thg 10

🚨 New Paper Alert! Introducing SciVideoBench — a comprehensive benchmark for scientific video reasoning! 🔬SciVideoBench: 1. Spans Physics, Chemistry, Biology & Medicine with authentic experimental videos. 2. Features 1,000 challenging MCQs across three reasoning types:…

shoubin621's tweet image. 🚨 New Paper Alert! Introducing SciVideoBench — a comprehensive benchmark for scientific video reasoning!

🔬SciVideoBench:

1. Spans Physics, Chemistry, Biology &amp; Medicine with authentic experimental videos.

2. Features 1,000 challenging MCQs across three reasoning types:…

Joykirat đã đăng lại

Zun Wang

@ZunWang919

8 thg 10

🚨 Thrilled to introduce Self-Improving Demonstrations (SID) for Goal-Oriented Vision-and-Language Navigation — a scalable paradigm where navigation agents learn to explore by teaching themselves. ➡️ Agents iteratively generate and learn from their own successful trajectories ➡️…

ZunWang919's tweet image. 🚨 Thrilled to introduce Self-Improving Demonstrations (SID) for Goal-Oriented Vision-and-Language Navigation — a scalable paradigm where navigation agents learn to explore by teaching themselves.

➡️ Agents iteratively generate and learn from their own successful trajectories
➡️…

Joykirat đã đăng lại

Hanqi Xiao

@hanqi_xiao

7 thg 10

Landed in Montreal 🇨🇦 for #COLM2025 to present my first-author work on task-conditioned mixed-precision quantization: “Task-Circuit Quantization” (Thursday 11am, Poster Session 5). I'm applying to PhD programs this cycle and am excited to chat about this or other interests (LLM…

Mohit Bansal

@mohitban47

6 thg 10

@ArchikiPrasad

3 thg 10

Models often think too much on easy problems and not enough on harder reasoning problems. Our new method ✨TRAAC✨ fixes this by teaching models to adaptively compress their "thinking budget" to the difficulty of the task during GRPO rollouts. Result? The model uses…

Joykirat

@joykiratsingh

3 thg 10

Joykirat đã đăng lại

Justin Chih-Yao Chen

@cyjustinchen

3 thg 10

Large reasoning models suffer from under-adaptiveness, which underthink on hard problems and overthink on easy ones. TRAAC addresses this by introducing ✨difficulty calibration and attention-based compression✨→ +8.4% accuracy & +36.8% efficiency! 1️⃣ TRAAC adaptively mitigates…