code_star's profile picture. Data Dawg @datologyai | Formerly Data Research Lead @DbrxMosaicAI | Visiting Researcher @ Facebook | Ph.D | #TXSTFOOTBALL fan | http://linktr.ee/code_star

Cody Blakeney

@code_star

Data Dawg @datologyai | Formerly Data Research Lead @DbrxMosaicAI | Visiting Researcher @ Facebook | Ph.D | #TXSTFOOTBALL fan | http://linktr.ee/code_star

Pinned

We are looking for a post-training lead at @datologyai we have gpus, you can make them go brrrr

code_star's tweet image. We are looking for a post-training lead at @datologyai 

we have gpus, you can make them go brrrr

Cody Blakeney reposted

time for us to AllGather once again

Nvidia x Prime Intellect: Open-Source Model Builders & Scaling Meetup October 23 · 5:30-8 PM · San Francisco Join us luma.com/yndatvdq

PrimeIntellect's tweet image. Nvidia x Prime Intellect:
Open-Source Model Builders & Scaling Meetup

October 23 · 5:30-8 PM · San Francisco

Join us luma.com/yndatvdq


those tech radicals trying to legalize everything

code_star's tweet image. those tech radicals trying to legalize everything

The real ones know

code_star's tweet image. The real ones know

Things are transpiring

ok i start to see this bubble thing

Dorialexander's tweet image. ok i start to see this bubble thing


Are there memes you associate with one of your friends? I have a few. Their go to memes. When I see the template used I immediately think of them and my heart is full.


I have no dog in this fight, but as an outsider to the long time RL community this is pretty funny.

It's funny how almost a decade later, the frontier labs are back at the same spot, building RL gyms

TheSeaMouse's tweet image. It's funny how almost a decade later, the frontier labs are back at the same spot, building RL gyms


Cody Blakeney reposted

It's funny how almost a decade later, the frontier labs are back at the same spot, building RL gyms

TheSeaMouse's tweet image. It's funny how almost a decade later, the frontier labs are back at the same spot, building RL gyms

Our reinforcement learning toolkit, OpenAI Gym, is now in public beta: gym.openai.com.



I love that he looks like he is doing a bit even while he is shredding.

I don’t think nearly enough people know that Tim Robinson was photographed for Thrasher magazine

girldrawsghosts's tweet image. I don’t think nearly enough people know that Tim Robinson was photographed for Thrasher magazine
girldrawsghosts's tweet image. I don’t think nearly enough people know that Tim Robinson was photographed for Thrasher magazine


If scene kids existed in the Bay Area would they call their band "Panic! at the Frisco"?


Me learning what mxfp8 is from this tweet.

code_star's tweet image. Me learning what mxfp8 is from this tweet.

it may not be interesting to anyone but this is where i left off working tonight. i will start back again in a few hours from this same tmux session

_xjdr's tweet image. it may not be interesting to anyone but this is where i left off working tonight. i will start back again in a few hours from this same tmux session


Cody Blakeney reposted

to put this in real terms this probably is like going from a cost of ~$100k -> $500 in just 2 years from hardware, training techniques, and data to reach this capability.

crazy how far data and models have come in such a short time. I could probably train an MoE in a single day with 1 H100 node with a higher pass@1



There seems to be strange parallel worlds that exist in academic research. One side seems to have decided SFTing on reasoning traces is all you need for post-training small models. And one side thinks GRPO is all you need. I'm sort of baffled by the whole thing.


nanogpt pass@1 > 30% would go so hard

new speedrun for nanogpt :)



For the record we did the radar plots for the exact reason you are thinking. We were thinking about video games.

code_star's tweet image. For the record we did the radar plots for the exact reason you are thinking. We were thinking about video games.

A lot of people were really mad at us when we made radar plots for the gauntlet lol if we had known how far people would take it we would have reconsidered

code_star's tweet image. A lot of people were really mad at us when we made radar plots for the gauntlet lol if we had known how far people would take it we would have reconsidered


Loading...

Something went wrong.


Something went wrong.