Joykirat

@joykiratsingh

CS PhD Student @unc_ai_group @UNC, advised by. @mohitban47 | ex RF @MSFTResearch

joykirat18.github.io

4월 2017에 가입

451게시물 404팔로워 617팔로우 중

내가 좋아할 만한 콘텐츠

@avgupt

@TanejaAryan

@RishitG57144297

@pr3khar

@ananyalohani_

@KuchAlagKar

@d_silent_quill

@seaweeddbrainn

@jainnandika

@SamyakGupta3

@dhattarwalm0hit

@vsushmita_

@not_bhaskar

@_karanjot

고정된 트윗

Joykirat

@joykiratsingh

. 10. 3.

🚨 Excited to announce TRAAC, an online difficulty-adaptive, attention-based method that handles the tradeoff of under & overthinking in reasoning models to improve both accuracy and efficiency. Underthinking ❌: Models terminate reasoning too early on harder problems, leading…

joykiratsingh's tweet image. 🚨 Excited to announce TRAAC, an online difficulty-adaptive, attention-based method that handles the tradeoff of under &amp; overthinking in reasoning models to improve both accuracy and efficiency.

Underthinking ❌: Models terminate reasoning too early on harder problems, leading…

Joykirat 님이 재게시함

Elias Stengel-Eskin

@EliasEskin

. 10. 15.

🚨 Excited to share new work on inferring symbolic world models from observations! OneLife can infer world models in stochastic, complex environments by proposing rules via LLM and reweighting code-based environment laws from observations collected in a single interaction…

Zaid Khan

@codezakh

. 10. 15.

How can an agent reverse engineer the underlying laws of an unknown, hostile & stochastic environment in “one life”, without millions of steps + human-provided goals / rewards? In our work, we: 1️⃣ infer an executable symbolic world model (a probabilistic program capturing…

Joykirat 님이 재게시함

Archiki Prasad

@ArchikiPrasad

. 10. 15.

🚨 Excited to share our new work ✨ OneLife ✨, which investigates how an agent can infer executable symbolic world models 🌐 from a single unguided trajectory in a stochastic environment. I’m especially excited about our planning + evaluation contributions: 1️⃣ We support…

Zaid Khan

@codezakh

. 10. 15.

Joykirat 님이 재게시함

Zaid Khan

@codezakh

. 10. 15.

Joykirat 님이 재게시함

Shoubin Yu@ICCV🌺

@shoubin621

. 10. 10.

🚨 New Paper Alert! Introducing SciVideoBench — a comprehensive benchmark for scientific video reasoning! 🔬SciVideoBench: 1. Spans Physics, Chemistry, Biology & Medicine with authentic experimental videos. 2. Features 1,000 challenging MCQs across three reasoning types:…

shoubin621's tweet image. 🚨 New Paper Alert! Introducing SciVideoBench — a comprehensive benchmark for scientific video reasoning!

🔬SciVideoBench:

1. Spans Physics, Chemistry, Biology &amp; Medicine with authentic experimental videos.

2. Features 1,000 challenging MCQs across three reasoning types:…

Joykirat 님이 재게시함

Zun Wang

@ZunWang919

. 10. 8.

🚨 Thrilled to introduce Self-Improving Demonstrations (SID) for Goal-Oriented Vision-and-Language Navigation — a scalable paradigm where navigation agents learn to explore by teaching themselves. ➡️ Agents iteratively generate and learn from their own successful trajectories ➡️…

ZunWang919's tweet image. 🚨 Thrilled to introduce Self-Improving Demonstrations (SID) for Goal-Oriented Vision-and-Language Navigation — a scalable paradigm where navigation agents learn to explore by teaching themselves.

➡️ Agents iteratively generate and learn from their own successful trajectories
➡️…

Joykirat 님이 재게시함

Hanqi Xiao

@hanqi_xiao

. 10. 7.

Landed in Montreal 🇨🇦 for #COLM2025 to present my first-author work on task-conditioned mixed-precision quantization: “Task-Circuit Quantization” (Thursday 11am, Poster Session 5). I'm applying to PhD programs this cycle and am excited to chat about this or other interests (LLM…

Mohit Bansal

@mohitban47

. 10. 6.

@ArchikiPrasad

. 10. 3.

Models often think too much on easy problems and not enough on harder reasoning problems. Our new method ✨TRAAC✨ fixes this by teaching models to adaptively compress their "thinking budget" to the difficulty of the task during GRPO rollouts. Result? The model uses…

Joykirat

@joykiratsingh

. 10. 3.

Joykirat 님이 재게시함

Justin Chih-Yao Chen

@cyjustinchen

. 10. 3.

Large reasoning models suffer from under-adaptiveness, which underthink on hard problems and overthink on easy ones. TRAAC addresses this by introducing ✨difficulty calibration and attention-based compression✨→ +8.4% accuracy & +36.8% efficiency! 1️⃣ TRAAC adaptively mitigates…