rasdani

@rasdani_

hill climbing @PrimeIntellect

~/.cache/huggingface

Joined April 2022

467Posts 536Followers 3KFollowing

You might like

@Ruesavatar

@student201012

@luka_spahija

@KommuriShreya

@derekisnt

@az0o0z_97

@sbkobaidze

rasdani reposted

Dvij Kalaria

@DvijKalaria

Nov 16

Why should humans have all the fun :) Thanks @ZhiSu22 for HITTER on g1!

rasdani

@rasdani_

Nov 16

memetic convergence @extropic 🤝 @PrimeIntellect

rasdani reposted

CIX 🦾

@cixliv

Nov 15

A taste of the Los Angeles event a few days ago. @cookedbykimchi starting his professional robot fighting career with a 1-0. This trip has been a wild ride to put it lightly.

rasdani reposted

will brown

@willccbb

Nov 14

this job is way too fun lol

rasdani reposted

i cannot overstate how absurdly impressive primeintellect's rl infra is the people working on it clearly view it as art and probably forget they get paid if you like rl, there’s really no better place on earth to work on it

rasdani reposted

Sriraam

@27upon2

Nov 13

day 8 of RL: building open-source swe-grep by @cognition with just a bash tool got 37.6% increase in F1 from Qwen3-4B base by GRPOing for 120 steps. more data 👇 once i built my env with @PrimeIntellect verifiers, starting the rl run with prime-rl was super simple. thanks…

27upon2's tweet image. day 8 of RL: building open-source swe-grep by @cognition with just a bash tool

got 37.6% increase in F1 from Qwen3-4B base by GRPOing for 120 steps. more data 👇

once i built my env with @PrimeIntellect verifiers, starting the rl run with prime-rl was super simple. thanks…

rasdani reposted

kalomaze

@kalomaze

Nov 9

RL LEARNING WITH LORA: A DIVERSE DEEP DIVE

rasdani

@rasdani_

Nov 9

💯

Johannes Hagemann

@johannes_hage

Nov 9

> Train models end-to-end with RL in your own environment/application > RL facilitates building specialized models > RL is an infra challenge While all the big labs are doing everything they can to convince companies to build on their closed APIs/models, despite telling everyone…

johannes_hage's tweet image. &gt; Train models end-to-end with RL in your own environment/application
&gt; RL facilitates building specialized models
&gt; RL is an infra challenge

While all the big labs are doing everything they can to convince companies to build on their closed APIs/models, despite telling everyone…

rasdani reposted

will brown

@willccbb

Nov 7

if you want the tweet version and not the 10min video version: this is now all it takes to train with prime-rl after installing verifiers

willccbb's tweet image. if you want the tweet version and not the 10min video version:

this is now all it takes to train with prime-rl after installing verifiers

will brown

@willccbb

Nov 7

verifiers v0.1.7 is released 🚀 this one's all about making RL training and experimentation waaaay easier: - single-command installation for prime-rl - single-command training w/ unified configs - overhauled vf.RLTrainer for hacking on new algorithms quick demo + links below :)

rasdani reposted

will brown

@willccbb

Nov 7

rasdani reposted

Logan Olson

@jloganolson

Nov 4

Quick clip of the final full-speed crawl (no costume)

rasdani reposted

Johannes Hagemann

@johannes_hage

Nov 6

The PipelineRL paper getting rejected at NeurIPS reminds me of when the Megatron-LM paper got rejected from every conference back in 2020 scientific reviewers still don’t recognize a good systems paper when they see one openreview.net/forum?id=Eqlmp…

johannes_hage's tweet image. The PipelineRL paper getting rejected at NeurIPS reminds me of when the Megatron-LM paper got rejected from every conference back in 2020

scientific reviewers still don’t recognize a good systems paper when they see one

openreview.net/forum?id=Eqlmp…

Rishabh Agarwal

@agarwl_

Nov 6

Don't sleep on PipelineRL -- this is one of the biggest jumps in compute efficiency of RL setups that we found in the ScaleRL paper (also validated by Magistral & others before)! What's the problem PipelineRL solves? In RL for LLMs, we need to send weight updates from trainer to…

agarwl_'s tweet image. Don't sleep on PipelineRL -- this is one of the biggest jumps in compute efficiency of RL setups that we found in the ScaleRL paper (also validated by Magistral &amp; others before)!

What's the problem PipelineRL solves? In RL for LLMs, we need to send weight updates from trainer to…

rasdani reposted

will brown

@willccbb

Nov 5

Frida Kalo

rasdani reposted

Sriraam

@27upon2

Nov 4

Day 5 of RL: open source is beautiful i had a timeout issue in @PrimeIntellect sandboxes sdk so i forked, fixed it, symlinked in my swe-bench env with uv and got positive rewards. u can just do things with open source software. also made a PR with a test. also running a…

27upon2's tweet image. Day 5 of RL: open source is beautiful

i had a timeout issue in @PrimeIntellect sandboxes sdk so i forked, fixed it, symlinked in my swe-bench env with uv and got positive rewards. u can just do things with open source software. also made a PR with a test.

also running a…

rasdani reposted

Johannes Hagemann

@johannes_hage

Nov 3

gonna be a big month

rasdani reposted

Charlie Marsh

@charliermarsh

Nov 2

It finally happened: someone asked me if I work at @PrimeIntellect while wearing the @PrimeIntellect hat

rasdani reposted

Kimi.ai

@Kimi_Moonshot

Oct 31

Introducing Kimi CLI Technical Preview & Kimi For Coding! Kimi CLI powers your terminal: - Shell-like UI + shell command execution - Seamless Zsh integration - MCP support -Agent Client Protocol (now compatible with @zeddotdev) More features incoming!

Kimi_Moonshot's tweet image. Introducing Kimi CLI Technical Preview &amp; Kimi For Coding!

Kimi CLI powers your terminal:
- Shell-like UI + shell command execution
- Seamless Zsh integration
- MCP support
-Agent Client Protocol (now compatible with @zeddotdev)

More features incoming!

rasdani reposted

wh

@nrehiew_

Nov 1

From my tests, the new Cursor Composer 1 model is likely some variant of a DeepSeek model since it uses the same tokenizer. You can verify this by looking at the input tokens per request in your usage dashboard.