Gustav
@guidotrev
rl enthusiast | math @ uchicago | @zfellows
We are happy to announce SkyRL tx 0.0.3! SkyRL tx is an open source library that implements a backend for the Tinker API and allows people to set up their own Tinker-like service running on their own hardware. This release has full MoE support, better checkpointing and the first…
novasky-ai.notion.site
SkyRL tx v0.0.3 Release
Philipp Moritz, Tyler Griggs, and the SkyRL Team
i don't understand the asynchronous rl claim for higher throughput. you can colocate training and generation on the same set of gpus and the switching bottleneck is minimal. this still achieves high throughput while avoiding off policy training.
practical, modern GRPO tweaks as described in Meta's Code World Models paper
thank god kl is useless 🙏 fucking hate having to deal with the ref model
I am making agents that fix performance bottlenecks in code. Here, it made search in @WerWolv ImHex 2x faster! End-to-end, producing a ready-to-compile updated project with no changes in functionality
yo chat?
5 votes · Final results
great read
Meet SFR-DeepResearch (SFR-DR) 🤖: our RL-trained autonomous agents that can reason, search, and code their way through deep research tasks. 🚀SFR-DR-20B achieves 28.7% on Humanity's Last Exam (text-only) using only web search 🔍, browsing 🌐, and Python interpreter 🐍,…
cool graph from @thinkymachines blog that shows performance peaks when batches are size 2^n
rl frameworks fail or succeed based on how photons hit silicon on your laptop. the same script that worked 12 days ago with pinned dependencies and versions now fails
Sometimes I open my stripe and realize my saas autopilot side hustle casually made $150 yesterday 🦍
i saw another tweet about someone beating a benchmark with rl after training directly on it and open sourcing the project. what's literally the point? are startups just showing off or have we forgotten basic train/val/test splits ever since rl went mainstream?