David Marx (@digthatdata.bsky.social)

@DigThatData

Generative AI MLE, FOSS toolmaker, innovation catalyst @CoreWeave + @AiEleuther. https://bsky.app/profile/digthatdata.bsky.social

Science & Technology

Seattle, WA

github.com/dmarx

انضم في نوفمبر 2013

10Kالمنشورات 4Kالمتابعون 2Kالمتابَعون

قد يعجبك

@devdef

@multimodalart

@pharmapsychotic

@NerdyRodent

@rainisto

@huemin_art

@sureailabs

@Takyon

@GlennIsZen

@proximasan

@nin_artificial

@Infinite__Vibes

@dome_271

@thibaudz

@williamcusick

مثبتة

David Marx (@digthatdata.bsky.social)

@DigThatData

١ فبراير ٢٠٢٤ م

David Marx (@digthatdata.bsky.social)

@DigThatData

٦ يوليوم

Remember: by participating on twitter, you are adding value to it, thereby incentivizing others to join/stay, and amplifying the voice/reach of the people who set its algorithm's biases. Is this the ideology you want to amplify? Because if you are here, you are amplifying this.

DigThatData's tweet image. Remember: by participating on twitter, you are adding value to it, thereby incentivizing others to join/stay, and amplifying the voice/reach of the people who set its algorithm's biases.

Is this the ideology you want to amplify? Because if you are here, you are amplifying this.

David Marx (@digthatdata.bsky.social)

@DigThatData

١١ يونيوم

youtube.com/shorts/IF3bzkz…

David Marx (@digthatdata.bsky.social)

@DigThatData

٢٤ مايوم

David Marx (@digthatdata.bsky.social)

@DigThatData

١٢ مارسم

TAXATION WITHOUT REPRESENTATION.

David Marx (@digthatdata.bsky.social) أعاد

anton 🇺🇸

@atroyn

٢٩ ينايرم

'we're in this bizarre world where the best way to learn about llms... is to read papers by chinese companies. i do not think this is a good state of the world' - us labs keeping their architectures and algorithms secret is ultimately hurting ai development in the us.

David Marx (@digthatdata.bsky.social)

@DigThatData

٢٥ ينايرم

> Republicans: "We love America! It has the greatest system of governance. Look, I even carry the constitution next to my heart like a little bible :*) " > Also republicans: "DISMANTLE THE GOVERNMENT! FUCK THE SEPARATION OF POWERS! GOD KING PRESIDENT CULT OF PERSONALITY!"

David Marx (@digthatdata.bsky.social)

@DigThatData

٣ ينايرم

sora can't handle my prompts

David Marx (@digthatdata.bsky.social) أعاد

Visu_AI_Poetry

@Visu_AI_Poetry

٢٥ ديسمبرم

Happy Christmas friends

David Marx (@digthatdata.bsky.social) أعاد

Haiwen Huang

@HaiwenHuang_

١٠ ديسمبرم

🔥 Are you ever dissatisfied with the imprecise names in vision-language datasets? 🚀 At #NeurIPS2024, we introduce 𝐑𝐄𝐍𝐎𝐕𝐀𝐓𝐄, showing how better segmentation dataset names lead to 𝐛𝐞𝐭𝐭𝐞𝐫 𝐭𝐫𝐚𝐢𝐧𝐢𝐧𝐠 & 𝐞𝐯𝐚𝐥𝐮𝐚𝐭𝐢𝐨𝐧. Let’s dive in! 🧵👇

HaiwenHuang_'s tweet image. 🔥 Are you ever dissatisfied with the imprecise names in vision-language datasets?

🚀 At #NeurIPS2024, we introduce 𝐑𝐄𝐍𝐎𝐕𝐀𝐓𝐄, showing how better segmentation dataset names lead to 𝐛𝐞𝐭𝐭𝐞𝐫 𝐭𝐫𝐚𝐢𝐧𝐢𝐧𝐠 &amp; 𝐞𝐯𝐚𝐥𝐮𝐚𝐭𝐢𝐨𝐧.

Let’s dive in! 🧵👇

David Marx (@digthatdata.bsky.social)

@DigThatData

٣ ديسمبر ٢٠٢٤ م

Yo this paper is wild.

Daniel Kunin

@KuninDaniel

٢٦ سبتمبر ٢٠٢٤ م

🌟Announcing NeurIPS spotlight paper on the transition from lazy to rich🔦 We reveal through exact gradient flow dynamics how unbalanced initializations promote rapid feature learning co-led @AllanRaventos and @ClementineDomi6 @FCHEN_AI @klindt_david @SaxeLab @SuryaGanguli

David Marx (@digthatdata.bsky.social) أعاد

Fern

@hi_tysam

٢٥ نوفمبر ٢٠٢٤ م

New NanoGPT training speed record: 3.28 FineWeb val loss in 4.66 minutes Previous record: 5.03 minutes Changelog: - FlexAttention blocksize warmup - hyperparameter tweaks

hi_tysam's tweet image. New NanoGPT training speed record: 3.28 FineWeb val loss in 4.66 minutes

Previous record: 5.03 minutes
Changelog:
- FlexAttention blocksize warmup
- hyperparameter tweaks

David Marx (@digthatdata.bsky.social) أعاد

Keller Jordan

@kellerjordan0

٢٥ نوفمبر ٢٠٢٤ م

This is officially the new record! Congrats @hi_tysam (who is also an OG of CIFAR-10 speedrunning) x.com/hi_tysam/statu…

Fern

@hi_tysam

٢٥ نوفمبر ٢٠٢٤ م

New NanoGPT training speed record: 3.28 FineWeb val loss in 4.66 minutes Previous record: 5.03 minutes Changelog: - FlexAttention blocksize warmup - hyperparameter tweaks

David Marx (@digthatdata.bsky.social) أعاد

Dylan Foster 🐢

@canondetortugas

٢٢ أكتوبر ٢٠٢٤ م

Is KL-regularization the right tool for language model alignment? The χPO algorithm: We show that a one-line change to DPO—moving from KL to chi-squared regularization—is sufficient to achieve state-of-the-art theoretical guarantees, provably alleviating over-optimization.

canondetortugas's tweet image. Is KL-regularization the right tool for language model alignment?

The χPO algorithm: We show that a one-line change to DPO—moving from KL to chi-squared regularization—is sufficient to achieve state-of-the-art theoretical guarantees, provably alleviating over-optimization.

David Marx (@digthatdata.bsky.social)

@DigThatData

٢٤ نوفمبر ٢٠٢٤ م

David Marx (@digthatdata.bsky.social)

@DigThatData

٢٤ نوفمبر ٢٠٢٤ م

And just like that, the AI/ML migration off twitter finally happened. RIP, this shitty pay-to-play platform.