Cody Blakeney

@code_star

Data Dawg @datologyai | Formerly Data Research Lead @DbrxMosaicAI | Visiting Researcher @ Facebook | Ph.D | #TXSTFOOTBALL fan | http://linktr.ee/code_star

Pinned

We are looking for a post-training lead at @datologyai

we have gpus, you can make them go brrrr

Me eating my hot Cheetos: “I told you that huel shit was bad for you”


Cody Blakeney reposted

not possible for someone to be more personally victimized by a song than I am by this one spotify.link/zSzDpYoUuXb


Cody Blakeney reposted

i think about this everyday

when you're the eval guy and you check up on the experiments channel in slack

[Image attached to jecdohmann's tweet]


Cody Blakeney reposted

looking at data is like eating lava. it warms me up inside but kills me


Cody Blakeney reposted

Accidentally said « AdamW » instead of « muon » and they kicked me out of SF

Accidentally said "looks good" instead of "SOTA" and they kicked me out of SF



Cody Blakeney reposted

GPU computing before CUDA was *weird*. Memory primitives were graphics shaped, not computer science shaped. Want to do math on an array? Store it as an RGBA texture. Fragment Shader for processing. *Paint* the result in a big rectangle.

[Two images attached to lauriewired's tweet]

Cody Blakeney reposted

Accidentally said “orthogonal” instead of “opposite” in florida and desantis is shipping me back to SF

Accidentally said "hard" instead of "non-trivial" and they kicked me out of SF



banger

looking at data is like eating lava. it warms me up inside but kills me



Unbelievably good hire for @AnthropicAI . Excited to see what y'all cook up!

Excited to announce that @dsmilkov and I have joined @AnthropicAI to work on tooling for mechanistic interpretability research 🐜 🐜 If you have worked with me you know I’ve been obsessed with Claude and now we will be opening it up to understand what’s going on inside, and help…

[Image attached to nsthorat's tweet]


When I have multiple Anneals and some CPT runs


Cody Blakeney reposted

not your weights not your model not your mind

Am I wrong in sensing a paradigm shift in AI? Feels like we’re moving from a world obsessed with generalist LLM APIs to one where more and more companies are training, optimizing, and running their own models built on open source (especially smaller, specialized ones) Some…

[Image attached to ClementDelangue's tweet]


Give it a year or two and we will have nanoagent. Then nanoagent speed run. Then nanoASI and so on.

Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,…

[Image attached to karpathy's tweet]


Still thinking about this banger

when you're the eval guy and you check up on the experiments channel in slack

[Image attached to jecdohmann's tweet]


Cody Blakeney reposted

HAHAHAHAHAHA IT'S A VIDEO FROM THE PORT OF LONG BEACH CALIFORNIA 🇺🇸🇺🇸🇺🇸🇺🇸🇺🇸

This Tweet is no longer available.

Cody Blakeney reposted

Its that time again

Cody Blakeney reposted

Here’s a scaling law that matters more: every two years the gap to frontier closes by a factor of 2x

Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,…

[Image attached to karpathy's tweet]

