Kyle Montgomery

@kylepmont

PhD student at UC Santa Cruz

Santa Cruz, CA

Joined January 2016

15 Posts · 44 Followers · 30 Following

Kyle Montgomery reposted

Chenguang Wang (hiring)

Nov 5

Attending #EMNLP2025, Dr @hengjinlp is now giving the keynote! pls do dm me if you plan to do a #postdoc on #agenticai and #aisafety and are interested in working with me, we should chat! If you plan to pursue #phd in the space, also don’t hesitate to reach out (limited spots…

Kyle Montgomery

Oct 17

Thrilled to have been a part of this release — looking forward to what’s coming next with rLLM!

rLLM

Oct 17

🚀 Introducing rLLM v0.2 - train arbitrary agentic programs with RL, with minimal code changes. Most RL training systems adopt the agent-environment abstraction. But what about complex workflows? Think solver-critique pairs collaborating, or planner agents orchestrating multiple…

Kyle Montgomery

Apr 25

Excited to share our work at #ICLR2025! JudgeBench ⚖️ tests the reliability of LLM-based judges with a focus on objective correctness. JudgeBench converts tough 🧠 datasets in knowledge, reasoning, math & code into labeled response pairs, forcing objective grading over vibes.…

Kyle Montgomery reposted

Sijun Tan

Oct 18, 2024

Introducing JudgeBench – the ultimate benchmark designed to push LLM-based judges to their limits! 🚀 ❓Why do we need a new benchmark for LLM-based judges? As LLMs continue to evolve, their responses become more complex, demanding stronger judges to assess them accurately.…
