(((ل()(ل() 'yoav))))👾

@yoavgo

Israel

cs.biu.ac.il/~yogo/

Joined May 2009

139K posts, 69K followers, 2K following

Pinned

(((ل()(ل() 'yoav))))👾

Nov 4, 2023

what there is no overwhelming agreement on is what will happen the day after: how do we build a livable future here, for both israelis and palestinians? israel's leadership is not thinking about this now, and the future looks grim. THIS is where protest effort and outrage should be invested.

(((ل()(ل() 'yoav))))👾

14h

remind me why we call the LLM reasoning samples during training "trajectories" and not "sampled responses"?

(((ل()(ل() 'yoav))))👾

20h

wait, so the GRPO everyone is drooling about is just REINFORCE with the baseline computed as an average over a large sample (and the usual KL regularization used for LLMs)?
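
To make the question above concrete, here is a minimal sketch of the reading it proposes: REINFORCE where the baseline is the mean reward of a group of responses sampled for the same prompt, plus an optional KL penalty toward a reference model. The function names, the plain-list inputs, and the simple sample-mean KL estimate are all assumptions made for illustration. Published descriptions of GRPO additionally divide the advantage by the group's reward standard deviation and use a PPO-style clipped probability ratio, both of which this sketch leaves out.

```python
def group_baseline_advantages(rewards):
    """Advantage of each sampled response: its reward minus the group mean."""
    baseline = sum(rewards) / len(rewards)
    return [r - baseline for r in rewards]


def reinforce_group_loss(logprobs, rewards, ref_logprobs=None, kl_coef=0.0):
    """Scalar surrogate loss for one prompt (hypothetical helper).

    logprobs[i]     : log-probability of sampled response i under the policy
    rewards[i]      : scalar reward of sampled response i
    ref_logprobs[i] : log-probability of response i under a reference model
    """
    n = len(rewards)
    advantages = group_baseline_advantages(rewards)
    # REINFORCE term: push up log-probs of responses that beat the group mean.
    policy_term = -sum(a * lp for a, lp in zip(advantages, logprobs)) / n
    # Assumed KL estimate: sample mean of (log pi - log pi_ref).
    kl_term = 0.0
    if ref_logprobs is not None:
        kl_term = sum(lp - rlp for lp, rlp in zip(logprobs, ref_logprobs)) / n
    return policy_term + kl_coef * kl_term
```

In a real training loop the log-probabilities would be differentiable tensors and the advantages would be treated as constants; the plain floats here only show the arithmetic.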

(((ل()(ل() 'yoav))))👾 reposted

Hadi Vafaii ✈️ NeurIPS 2025

Dec 5

yes. sensorimotorai.github.io/debates/

(((ل()(ل() 'yoav))))👾

Dec 5

this classic figure is wrong.

(((ل()(ل() 'yoav))))👾

Dec 5

I complain a lot about RL lately, and here we go again. The CS view of RL is wrong in how it thinks about rewards, already at the setup level. Briefly, the reward computation should be part of the agent, not part of the environment. More at length here: gist.github.com/yoavg/3eb3e722…

rl-wrong-about-rewards.md

Source: gist.github.com
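
As a rough illustration of the one-sentence summary in the post above (and not of the linked gist's actual content), the sketch below shows an interaction loop in which the environment returns only observations and the agent computes its own reward from them. The class names, the one-dimensional toy world, and the distance-to-goal reward are all hypothetical.

```python
class Environment:
    """Toy 1-D world: it only reports state, and knows nothing about rewards."""

    def __init__(self):
        self.position = 0

    def step(self, action):
        self.position += action
        return self.position  # observation only, no reward signal


class Agent:
    """The agent owns its goal and therefore its reward computation."""

    def __init__(self, goal):
        self.goal = goal

    def reward(self, observation):
        # Assumed reward: negative distance between the observation and the goal.
        return -abs(self.goal - observation)

    def act(self, observation):
        if observation < self.goal:
            return 1
        if observation > self.goal:
            return -1
        return 0


def run_episode(goal, steps):
    env, agent = Environment(), Agent(goal)
    observation, rewards = env.position, []
    for _ in range(steps):
        observation = env.step(agent.act(observation))
        rewards.append(agent.reward(observation))
    return rewards
```

The contrast with the usual setup is only in the signature: `step` returns an observation rather than an (observation, reward) pair, so swapping in an agent with a different goal changes the reward without touching the environment.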

(((ل()(ل() 'yoav))))👾

Dec 5

candy attracts mutant ants with variable number of legs

ᴅᴏᴜʙʟᴇ-🆁

@Naam_Hi_Kafi_H

Dec 4

What does this picture teach you??

(((ل()(ل() 'yoav))))👾

Dec 4

this is a survey. when you think of a "model" as in "model based RL", what do you have in mind? (in other words, what is a model in this sense?)

(((ل()(ل() 'yoav))))👾

Dec 4

turns out some disciplines/people use "a bellman equation" to mean *any* recursive equation which is amenable to DP. in that sense clearly the concept is important. i was talking specifically about the update rule for computing a value function using tabulation.
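
For readers unfamiliar with the narrower sense mentioned above, here is a minimal sketch of a tabular Bellman backup, written as value iteration over a small, fully known MDP. The `transitions` layout, the function name, and the three-state toy problem are assumptions made for illustration.

```python
def value_iteration(transitions, gamma=0.9, tol=1e-8):
    """Tabular value iteration.

    transitions[state][action] is a list of (probability, next_state, reward)
    tuples; a state with no actions is treated as terminal.
    """
    values = {state: 0.0 for state in transitions}
    while True:
        delta = 0.0
        for state, actions in transitions.items():
            if not actions:
                continue  # terminal state keeps value 0
            # Bellman optimality backup: best expected one-step return.
            best = max(
                sum(p * (r + gamma * values[nxt]) for p, nxt, r in outcomes)
                for outcomes in actions.values()
            )
            delta = max(delta, abs(best - values[state]))
            values[state] = best
        if delta < tol:
            return values


# Hypothetical chain: a -> b -> end, with reward 1 on the final step.
toy_mdp = {
    "a": {"go": [(1.0, "b", 0.0)]},
    "b": {"go": [(1.0, "end", 1.0)]},
    "end": {},
}
```

The table-based loop requires enumerating every state and knowing every transition probability, which is the restriction the earlier post on December 3 objects to.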

(((ل()(ل() 'yoav))))👾

Dec 3

why are the bellman equations considered foundational or important today? aren't they just a straightforward application of DP to solve a problem that only arises in extremely simplified cases that never occur in practice?

(((ل()(ل() 'yoav))))👾

Dec 3

it is not "a confession". stop calling things by the most misleading names you can.

(((ل()(ل() 'yoav))))👾

Dec 3

what's the least french sandwich you could think of?

Declaration of Memes

Dec 2

Say what you want about the French, they get sandwiches right... 👇👇👇💯

(((ل()(ل() 'yoav))))👾

Dec 3

i am feeling a bit sick and cannot concentrate, perfect time for some science-adjacent online fighting

(((ل()(ل() 'yoav))))👾

Dec 3

hot take on the bun purchase: turns out coding agents cannot replace engineers just yet, huh.

(((ل()(ل() 'yoav))))👾

Dec 2

actually this should make companies more reluctant to rely on bun, not less.

Jarred Sumner

Dec 2

People frequently ask: "How is Bun sustainable? If I bet my company's tech stack on Bun, will Bun still be around in a few years?" We didn't have a great answer to this question, until today
