(((ل()(ل() 'yoav))))👾

@yoavgo

Israel

cs.biu.ac.il/~yogo/

於五月 2009 加入

139千貼文 69千位跟隨者 2千個跟隨中

你可能會喜歡

@srush_nlp

@soumithchintala

@SchmidhuberAI

@sleepinyourhat

@percyliang

@arankomatsuzaki

@chrmanning

@ch402

@hardmaru

@seb_ruder

@jacobandreas

@kchonyc

@Thom_Wolf

@_rockt

@ericjang11

置頂

(((ل()(ل() 'yoav))))👾

2023年11月4日

what there is no overwhelming agreement on is what will happen the day after. how do we build a livable future here, for both israelis and palestinians. israel leadership is not thinking of this now, future looks grim. THIS is where protest effort and outrage should be invested.

(((ل()(ل() 'yoav))))👾

22 小時

remind me why we call the LLM reasoning samples during training "trajectories" and not "sampled responses"?

(((ل()(ل() 'yoav))))👾

年12月6日

wait so the GRPO everyone are drooling about is just REINFORCE with the baseline computed as an average over a large sample (and the usual kl regularization in llm models)?

(((ل()(ل() 'yoav))))👾 已轉發

Hadi Vafaii ✈️ NeurIPS 2025

年12月5日

yes. sensorimotorai.github.io/debates/

(((ل()(ل() 'yoav))))👾

年12月5日

this classic figure is wrong.

yoavgo's tweet image. this classic figure is wrong.

(((ل()(ل() 'yoav))))👾

年12月5日

I complain a lot about RL lately, and here we go again. The CS view of RL is wrong in how it thinks about rewards, already at the setup level. Briefly, the reward computation should be part of the agent, not part of the environment. More at length here: gist.github.com/yoavg/3eb3e722…

yoavgo's tweet card. GitHub Gist: instantly share code, notes, and snippets.

rl-wrong-about-rewards.md

來源: gist.github.com

(((ل()(ل() 'yoav))))👾

年12月5日

I complain a lot about RL lately, and here we go again. The CS view of RL is wrong in how it thinks about rewards, already at the setup level. Briefly, the reward computation should be part of the agent, not part of the environment. More at length here: gist.github.com/yoavg/3eb3e722…

yoavgo's tweet card. GitHub Gist: instantly share code, notes, and snippets.

rl-wrong-about-rewards.md

來源: gist.github.com

(((ل()(ل() 'yoav))))👾

年12月5日

candy attracts mutant ants with variable number of legs

ᴅᴏᴜʙʟᴇ-🆁

@Naam_Hi_Kafi_H

年12月4日

What does this picture teach you??

Naam_Hi_Kafi_H's tweet image. What does this picture teach you??

(((ل()(ل() 'yoav))))👾

年12月4日

this is a survey. when you think of a "model" as in "model based RL", what do you have in mind? (in other words, what is a model in this sense?)

(((ل()(ل() 'yoav))))👾

年12月4日

turns out some disciplines/people use "a bellman equation" to mean *any* recursive equation which is amenable to DP. in that sense clearly the concept is important. i was talking specifically about the update rule for computing a value function using tabulation.

(((ل()(ل() 'yoav))))👾

年12月3日

why are the bellman equations considered foundational or important today? arent they just a straightforward application of DP to solve a problem that only arises in extremely simplified cases that never occur in practice?

(((ل()(ل() 'yoav))))👾

年12月3日

it is not "a confession" stop calling things with the most misleading names you can

(((ل()(ل() 'yoav))))👾

年12月3日

whats the least french sandwich you could think off

Declaration of Memes

年12月2日

Say what you want about the French, they get sandwiches right... 👇👇👇💯

(((ل()(ل() 'yoav))))👾

年12月3日

i am feeling a bit sick and cannot concentrate, perfect time for some science-adjacent online fighting

(((ل()(ل() 'yoav))))👾

年12月3日

hot take on the bun purchase: turns out coding agents cannot replace engineers just yet, huh.

(((ل()(ل() 'yoav))))👾

年12月2日

actually this should make companies more reluctant to rely on bun, not less.

Jarred Sumner

年12月2日

People frequently ask: > How is Bun sustainable? If I bet my company’s tech stack on Bun, will Bun still be around in a few years? We didn’t have a great answer to this question, until today

Delip Rao e/σ

@deliprao

Kyunghyun Cho

@kchonyc

Sasha Rush

@srush_nlp

Percy Liang

@percyliang

Christopher Manning

@chrmanning

Sam Bowman

@sleepinyourhat

Graham Neubig

@gneubig

Zachary Lipton

@zacharylipton

Jason Wei

@_jasonwei

Tal Linzen

@tallinzen

Felix Hill

@FelixHill84

Naomi Saphra

@nsaphra

Akari Asai (@NeurIPS 2025)

@AkariAsai

Jacob Andreas

@jacobandreas

Tim Dettmers

@Tim_Dettmers

Thomas G. Dietterich

@tdietterich

Danish Pruthi

@danish037

Leo Boytsov

@srchvrs

Douwe Kiela

@douwekiela

William Wang

@WilliamWangNLP

Ekonia

@KalihoseMigisha

Haneul Shin

@haneulshin1030

Wassim Meziani

@wassim_meziani2

井上大輔 | EURELIX代表取締役 | クエシス執行役員

@quesis_inoue

Fuatcan Başlık

@fuattcann

ModernWayMotion

@ModernWayMotion

Shaowen Wang

@wangsw5653

Lampedusa

@Lampedusallz

mh86

@mh861176458

Line

@linexjlin

KAVYA

@k9patel1

M.arif Pradana

@M43771Arif

Justin Daniel

@justin_m_daniel

SHOPQUANTUM.AI

@makezbrightgift

Atuhaire Ivan

@IvoAtuhaire

Connor

@Connor2jm

Serhii Honcharenko

@serhiiH100

Eric Howard

@EricHow19656998

StoneLean

@StoneLeanTao

rahcrypto

@rahcrypto1

L. John Silver

@boychocks

web3 fish

@g1134506645

trapboyhuncho

@trapboyhuncho99

parisyg

@parissyg

Itay Hazan

@itayhzn

Davidzjw

@zhujiwi07086836

David Halprin

@TheHalpi

M (Parody)

@M0924318635339

Samswara

@samswoora

nomāda visas // Carpe annum.™

@nomadavisas

$stefanos_ch3's profile picture. Physical AI @mimicrobotics Prev. {@SonyAI_global @leggedrobotics @GTrobotics) Opinions are my own$

Stefanos Charalambous

@stefanos_ch3

Stephen Paek

@stpaek

Ravi Agrawal

@ravi20036

Harsh Jalan

@harshjalaan

Kaiwen Zha

@KaiwenZha

John Gkountouras

@j0hngou

Asif.(AR) 🚀

@Asif_Legendary

Ali Larian

@ali_larian

RIan

@RIan182132

.

@COtpyr

Omojasola

@RalzyTaiwo

Leszek Bukowski 🧠💻🏛️👾

@LeszBuk

Pratik Patel

@pratikpatel

Tejas

@tejas_k0

dream_agent

@dreamagent22817

Prodigal Snacker

@Mightyjuge

Murat

@Murat89486896

Sobhan Shukueian Tabrizi

@sobhanshukueian

Lie B

@LieB329845

somuSan

@somuSan_

Delip Rao e/σ

@deliprao

Kyunghyun Cho

@kchonyc

Sasha Rush

@srush_nlp

Percy Liang

@percyliang

Christopher Manning

@chrmanning

Sam Bowman

@sleepinyourhat

Graham Neubig

@gneubig

Zachary Lipton

@zacharylipton

Jason Wei

@_jasonwei

Tal Linzen

@tallinzen

Felix Hill

@FelixHill84

Naomi Saphra

@nsaphra

Yoav Artzi

@yoavartzi

Thomas Wolf

@Thom_Wolf

Ferenc Huszár

@fhuszar

Jacob Andreas

@jacobandreas

Tim Dettmers

@Tim_Dettmers

Thomas G. Dietterich

@tdietterich

Leo Boytsov

@srchvrs

Douwe Kiela

@douwekiela

Hadi Vafaii ✈️ NeurIPS 2025

@hadivafaii

Matthias Schmidt

@eurofounder

Rishabh Agarwal

@agarwl_

Dwarkesh Patel

@dwarkesh_sp

Sarah Ettedgui

@SarahEttedgui

OZ Party - מפלגת עוז

@ozparty2026

Cynde

@Cyndesama

كابتن إيلا Captain Ella

@CaptainElla1

Aastha

@aastha_mhaske

Inbal Talgam-Cohen

@InbalTalgam

Irene Chen

@irenetrampoline

Translating Falasteen (Palestine)

@translatingpal

MMitchell

@mmitchell_ai

Alireza Talakoubnejad

@websterkaroon

Habeeb Habeeb

@habeebhabeeb

TheFatRat

@ThisIsTheFatRat

Sebastien Bubeck

@SebastienBubeck

Kareem Jouda

@kareem_1087

htmx.org / CEO of Complete Wrongness (same thing)

@htmx_org

Casey Muratori

@cmuratori

Benjamin Bratton

@bratton

Joscha Bach

@Plinz

Xenova

@xenovacom

Computer Science Bar Ilan

@ComscienceBiu

Dmitrii Kovanikov

@ChShersh

Josh McGrath

@j_mcgraph

Jason Lee

@jasondeanlee

Luca Ambrogioni

@LucaAmb

Asaf🇺🇦🇮🇱🎗️

@Asaf1139

corey scher

@coreymaps

Noy Sternlicht

@NoySternlicht

Mo Ghaoui

@moghaoui

Seyed Abbas Araghchi

@araghchi

מענדי גרוזמן

@mendy_gruzman

Mahmoud "Mo" Shawki 🇵🇸

@mo_shawki2

Mahmood Sharif

@mahmoods01

חיים גולדברג

@haim_goldberg

Ridvan Aydemir | Apostate Prophet 🎗

@ApostateProphet

Thamar E. Gindin

@thmr

Oz Katerji

@OzKaterji

Ghaya Ben Mbarek غاية بن مبارك

@Ghaya_BM

Charles Goddard

@chargoddard

zed

@zmkzmkz

Taelin

@VictorTaelin

Eleanor Berger

@intellectronica

Ph.Gritti

@Philipp27960841

Orian Sharoni

@OrianSharoni

@Geopolitics_is_like_AoE

@Geopolitics_AoE

Yehuda Lahav

@DrLahav

inigo quilez

@iquilezles

United States 趨勢

1. #UFC323 129K posts
2. Merab 47.6K posts
3. Indiana 107K posts
4. Good Sunday 51.9K posts
5. SB19 ACONic PERFORMANCE 107K posts
6. Roach 29.4K posts
7. Petr Yan 28.6K posts
8. Ohio State 64.9K posts
9. Duke 61.7K posts
10. Pantoja 36K posts
11. Mendoza 42.6K posts
12. Benin 38.8K posts
13. Walt 8,505 posts
14. TOP CALL 8,927 posts
15. Vtuber 88.7K posts
16. Pitbull 18.6K posts
17. Joshua Van 11.5K posts
18. Heisman 19.7K posts
19. Pearl Harbor 6,358 posts
20. Curt Cignetti 12.1K posts

你可能會喜歡

Something went wrong.

Something went wrong.