NeuralComputing

@NeuralComputing

Eylül 2017’de katıldı

1Gönderiler 2Takipçiler 8Takip edilenler

NeuralComputing gönderiyi yeniden yayınladı

Nathan Lambert

1 Ara 2023

DPO Debate: Is RL needed for RLHF? All things as we cannot settle if DPO or RL is better. At least it is a good exercise. 1. Derivations in the DPO paper. Hint, the authors are good at math 2. cDPO, IPO, and related equations 3. Speculation on potential oddities of DPO vs RL…

Neuroethics and AI LatAm

@Neuroethic76909

Power progress

@PowerprogressiA

stephanie yee

@stephaniejyee

Andrew Ng

@AndrewYNg

Tony Chu

@tonyhschu

r2d3.us

@r2d3us

Cisco

@Cisco

Deep Learning Hub

@DeepLearningHub

NVIDIA

@nvidia

AMD

@AMD

United States Trendler

1. Raindotgg 1,570 posts
2. Sam Houston 1,362 posts
3. #TalusLabs N/A
4. Oregon State 4,721 posts
5. Lubin 5,410 posts
6. Louisville 14.2K posts
7. #GoAvsGo 1,517 posts
8. UCLA 7,731 posts
9. Batum N/A
10. Nuss 5,778 posts
11. Emmett Johnson 2,395 posts
12. #Huskers 1,087 posts
13. Oilers 4,852 posts
14. #T1WIN 21.3K posts
15. Miller Moss 1,216 posts
16. #FlyTogether 1,935 posts
17. Bama 13.8K posts
18. Lateef 2,297 posts
19. Brohm 1,154 posts
20. Nikki Glaser N/A

Something went wrong.

Something went wrong.