(((ل()(ل() 'yoav))))👾

@yoavgo

Israel

cs.biu.ac.il/~yogo/

Joined May 2009

139K Posts · 69K Followers · 2K Following

Pinned

(((ل()(ل() 'yoav))))👾

Nov 4, 2023

what there is no overwhelming agreement on is what will happen the day after. how do we build a livable future here, for both israelis and palestinians. israel leadership is not thinking of this now, future looks grim. THIS is where protest effort and outrage should be invested.

(((ل()(ل() 'yoav))))👾

Dec 6

remind me why we call the LLM reasoning samples during training "trajectories" and not "sampled responses"?

(((ل()(ل() 'yoav))))👾

Dec 6

wait so the GRPO everyone is drooling about is just REINFORCE with the baseline computed as an average over a large sample (and the usual kl regularization in llm models)?
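The reading in the post above can be sketched in a few lines. This is a minimal illustration, not the published GRPO objective: the function names, the `beta` coefficient, and the per-sample `kl_estimates` input are assumptions made for the example. GRPO as commonly described also divides the advantages by the group standard deviation and uses a PPO-style clipped probability ratio; the first is optional here and the second is left out.

```python
from statistics import mean, pstdev


def group_baseline_advantages(rewards, normalize=False, eps=1e-8):
    """Advantage of each sampled response relative to the group mean reward."""
    baseline = mean(rewards)  # the baseline is just the average over the sample
    advantages = [r - baseline for r in rewards]
    if normalize:
        # Optional scaling by the group standard deviation (an assumption here).
        scale = pstdev(rewards) + eps
        advantages = [a / scale for a in advantages]
    return advantages


def surrogate_loss(logprobs, rewards, kl_estimates, beta=0.0, normalize=False):
    """REINFORCE-with-baseline loss plus a KL penalty, averaged over the group.

    logprobs[i]     : log-probability of sampled response i under the policy
    rewards[i]      : scalar reward assigned to response i
    kl_estimates[i] : estimate of KL(policy || reference) for response i
    """
    advantages = group_baseline_advantages(rewards, normalize)
    n = len(rewards)
    policy_term = -sum(a * lp for a, lp in zip(advantages, logprobs)) / n
    kl_term = beta * sum(kl_estimates) / n
    return policy_term + kl_term


# Four responses sampled for one prompt; two were rewarded.
advantages = group_baseline_advantages([1.0, 0.0, 0.0, 1.0])
```

Because the baseline is the group mean, the advantages always sum to zero: responses are only ever scored relative to the other samples for the same prompt.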

(((ل()(ل() 'yoav))))👾 reposted

Hadi Vafaii ✈️ NeurIPS 2025

Dec 5

yes. sensorimotorai.github.io/debates/

(((ل()(ل() 'yoav))))👾

Dec 5

this classic figure is wrong.

(((ل()(ل() 'yoav))))👾

Dec 5

I complain a lot about RL lately, and here we go again. The CS view of RL is wrong in how it thinks about rewards, already at the setup level. Briefly, the reward computation should be part of the agent, not part of the environment. More at length here: gist.github.com/yoavg/3eb3e722…

rl-wrong-about-rewards.md

Source: gist.github.com
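One way to picture the claim that reward computation belongs to the agent is the interface sketch below. It is not taken from the linked gist, whose URL is truncated in this feed. The class names, the one-dimensional "position" world, and the distance-based reward are all hypothetical, chosen only to show an environment that returns observations and nothing else.

```python
class Environment:
    """Hypothetical environment: it reports observations and never a reward."""

    def __init__(self):
        self.position = 0

    def step(self, action: int) -> int:
        self.position += action
        return self.position  # observation only


class Agent:
    """Hypothetical agent that owns its goal and scores observations itself."""

    def __init__(self, target: int):
        self.target = target

    def reward(self, observation: int) -> int:
        # The reward is computed inside the agent from what it observes.
        return -abs(self.target - observation)

    def act(self, observation: int) -> int:
        if observation < self.target:
            return 1
        if observation > self.target:
            return -1
        return 0


def run(agent: Agent, env: Environment, steps: int) -> list:
    """Interaction loop: the environment is never asked for a reward."""
    observation = env.position
    rewards = []
    for _ in range(steps):
        observation = env.step(agent.act(observation))
        rewards.append(agent.reward(observation))
    return rewards


rewards = run(Agent(target=3), Environment(), steps=5)
```

In the more familiar setup, `step` would return an `(observation, reward)` pair. Here two agents with different targets could share the same `Environment` and still receive different rewards.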

(((ل()(ل() 'yoav))))👾

Dec 5

candy attracts mutant ants with variable number of legs

ᴅᴏᴜʙʟᴇ-🆁

@Naam_Hi_Kafi_H

Dec 4

What does this picture teach you??

(((ل()(ل() 'yoav))))👾

Dec 4

this is a survey. when you think of a "model" as in "model based RL", what do you have in mind? (in other words, what is a model in this sense?)

(((ل()(ل() 'yoav))))👾

Dec 4

turns out some disciplines/people use "a bellman equation" to mean *any* recursive equation which is amenable to DP. in that sense clearly the concept is important. i was talking specifically about the update rule for computing a value function using tabulation.
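For reference, the narrow sense mentioned above (a tabulated update rule for a value function) looks roughly like the sketch below. The three-state MDP, its state and action names, and the discount `gamma=0.9` are made up for illustration. The update is the standard Bellman optimality backup, applied in place until the table stops changing.

```python
def value_iteration(transitions, gamma=0.9, tol=1e-10, max_iters=10_000):
    """Tabular value iteration.

    transitions[state][action] is a list of (probability, next_state, reward).
    A state with no actions is treated as terminal and keeps value 0.
    """
    values = {state: 0.0 for state in transitions}
    for _ in range(max_iters):
        delta = 0.0
        for state, actions in transitions.items():
            if not actions:
                continue  # terminal state
            # Bellman optimality backup: best expected one-step return.
            best = max(
                sum(p * (r + gamma * values[nxt]) for p, nxt, r in outcomes)
                for outcomes in actions.values()
            )
            delta = max(delta, abs(best - values[state]))
            values[state] = best
        if delta < tol:
            break
    return values


# Hypothetical toy MDP: reaching "end" from "middle" pays 1.
toy_mdp = {
    "start": {"go": [(1.0, "middle", 0.0)], "stay": [(1.0, "start", 0.0)]},
    "middle": {"go": [(1.0, "end", 1.0)]},
    "end": {},
}
values = value_iteration(toy_mdp)
```

The table only works because every state can be enumerated, which is the "extremely simplified" setting the original question was about.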

(((ل()(ل() 'yoav))))👾

Dec 3

why are the bellman equations considered foundational or important today? aren't they just a straightforward application of DP to solve a problem that only arises in extremely simplified cases that never occur in practice?

(((ل()(ل() 'yoav))))👾

Dec 3

it is not "a confession". stop calling things by the most misleading names you can.

(((ل()(ل() 'yoav))))👾

Dec 3

what's the least french sandwich you could think of?

Declaration of Memes

Dec 2

Say what you want about the French, they get sandwiches right... 👇👇👇💯

(((ل()(ل() 'yoav))))👾

Dec 3

i am feeling a bit sick and cannot concentrate, perfect time for some science-adjacent online fighting

(((ل()(ل() 'yoav))))👾

Dec 3

hot take on the bun purchase: turns out coding agents cannot replace engineers just yet, huh.

(((ل()(ل() 'yoav))))👾

Dec 2

actually this should make companies more reluctant to rely on bun, not less.

Jarred Sumner

Dec 2

People frequently ask: > How is Bun sustainable? If I bet my company’s tech stack on Bun, will Bun still be around in a few years? We didn’t have a great answer to this question, until today
