NeuralComputing reposted
DPO Debate: Is RL needed for RLHF? Alas, we cannot settle whether DPO or RL is better. At least it is a good exercise. 1. Derivations in the DPO paper (hint: the authors are good at math). 2. cDPO, IPO, and related equations. 3. Speculation on potential oddities of DPO vs. RL… The relevant objectives are sketched below.
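For reference, here is a minimal sketch of the objectives the thread names, using the notation of the DPO paper (policy π_θ, reference π_ref, preferred/dispreferred completions y_w/y_l). The cDPO smoothing parameter ε and the IPO regularizer τ follow my reading of those papers, not the thread itself.

```latex
% DPO (Rafailov et al., 2023): logistic loss on the implicit reward margin,
% where the implicit reward is the log-ratio against the reference policy.
\hat{r}_\theta(x, y) = \log \frac{\pi_\theta(y \mid x)}{\pi_{\mathrm{ref}}(y \mid x)}
\qquad
\mathcal{L}_{\mathrm{DPO}}
  = -\,\mathbb{E}_{(x, y_w, y_l) \sim \mathcal{D}}
    \Big[ \log \sigma\big(\beta\,\hat{r}_\theta(x, y_w) - \beta\,\hat{r}_\theta(x, y_l)\big) \Big]

% cDPO: label-smoothed DPO, assuming preference labels are flipped with
% probability \varepsilon; it mixes the loss on the pair and its flip.
\mathcal{L}_{\mathrm{cDPO}}
  = (1 - \varepsilon)\,\mathcal{L}_{\mathrm{DPO}}(y_w \succ y_l)
  + \varepsilon\,\mathcal{L}_{\mathrm{DPO}}(y_l \succ y_w)

% IPO (Azar et al., 2023): a squared loss that regresses the log-ratio
% margin toward 1/(2\tau) rather than letting it grow without bound,
% which is one of the "oddities" of DPO the thread alludes to.
\mathcal{L}_{\mathrm{IPO}}
  = \mathbb{E}_{(x, y_w, y_l) \sim \mathcal{D}}
    \Big[ \Big( \hat{r}_\theta(x, y_w) - \hat{r}_\theta(x, y_l) - \tfrac{1}{2\tau} \Big)^{2} \Big]
```

The contrast worth noticing: DPO's sigmoid saturates, so it keeps pushing the margin up on confidently labeled pairs, while IPO's squared loss pins the margin to a finite target, which is the usual motivation given for IPO's better robustness to noisy preferences.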