
Vivek
@cuda_optimized
powerful dreamer @iiscbangalore Interests: ai, tech, f1, cricket, music
winter arc #3 (8.5hrs): -> read the deepseek-r1 paper -> finished most of the chapters in the rlhfbook -> watched @hkproj vid on rlhf -> contemplated some future paths -> read an opinionated rl blog by @jsuarez5341 -> went through the cublas docs -> watched @elliotarledge tensor core vids

does anyone actually use LinkedIn, or do we all just log in once a month to accept random connection requests from strangers?
open source gives ideas. closed source takes them, scales them, hides them. fair game or just pure cheating?
highlight your notes in a quick and easy way. credit: @adivekar_ -> green - quick read -> yellow - read slowly, important -> red - read, think and understand

yeah, now this makes sense.
winter arc #2 (9hrs): -> read the deepseek-math paper and grpo -> finished the first 8 chapters of the rlhfbook by @natolambert -> read @kipperrii transformer inference arithmetic -> watched @elliotarledge vid on cublas and cublasLt -> wrote sgemm & hgemm in cublas -> @karpathy nanochat
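for anyone curious what "wrote sgemm in cublas" computes: the sgemm contract is C ← αAB + βC in float32. a minimal numpy sketch of just the math (this is a semantics sketch, not the cublas API; column-major layout and transpose flags are omitted):

```python
import numpy as np

def sgemm(alpha, A, B, beta, C):
    """Sketch of sgemm semantics: C <- alpha * A @ B + beta * C, in float32."""
    return (alpha * (A @ B) + beta * C).astype(np.float32)

# example: scale an identity-times-ones product and blend in the old C
A = np.eye(2, dtype=np.float32)
B = np.ones((2, 2), dtype=np.float32)
C = np.ones((2, 2), dtype=np.float32)
out = sgemm(2.0, A, B, 0.5, C)  # every entry becomes 2*1 + 0.5*1 = 2.5
```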

do the kl penalty and grad norm ultimately have the same effect on the grpo loss? if so, why can't we just use grad norm instead of the kl penalty term?
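one way to see why they're not interchangeable: the kl penalty adds a state-dependent term to the loss, so it can change the *direction* of the update (pulling the policy back toward the reference), while grad-norm clipping only rescales the gradient's *magnitude* and never flips its sign. a toy 1-D sketch (the functions below are illustrative stand-ins, not the actual grpo loss):

```python
def pg_grad(theta):
    # stand-in for the policy-gradient term: always pushes theta up
    return 1.0

def kl_grad(theta, theta_ref):
    # gradient of a quadratic stand-in for KL(pi_theta || pi_ref)
    return theta - theta_ref

def grad_with_kl(theta, theta_ref, beta):
    # kl penalty adds a pull back toward the reference policy
    return pg_grad(theta) + beta * kl_grad(theta, theta_ref)

def clipped_grad(theta, max_norm):
    # grad-norm clipping only rescales; the direction is unchanged
    g = pg_grad(theta)
    return g * min(1.0, max_norm / abs(g))
```

at theta far below the reference, `grad_with_kl(-20.0, 0.0, 0.1)` is negative while `pg_grad(-20.0)` is positive: the penalty flipped the direction, which clipping can never do. that's the effect a grad-norm term can't replicate.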
notes on the deepseek-math paper - deepseek-math-base -> pretrained model on code and math data - deepseekmath-instruct 7B -> sft using CoT, PoT and tool reasoning - deepseekmath-rl -> grpo on gsm8k and math questions - rl increases the prob of the correct response
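the core grpo trick from the paper: instead of a learned value baseline, the advantage for each sampled completion is its reward normalized against the group of completions for the same prompt, A_i = (r_i - mean(r)) / std(r). a minimal sketch of that normalization (epsilon added for numerical safety is my assumption, not from the paper):

```python
def group_relative_advantages(rewards, eps=1e-8):
    """GRPO-style advantages: normalize each reward by its group's mean/std."""
    n = len(rewards)
    mean = sum(rewards) / n
    var = sum((r - mean) ** 2 for r in rewards) / n
    std = var ** 0.5
    return [(r - mean) / (std + eps) for r in rewards]

# example: two correct (reward 1) and two wrong (reward 0) completions
adv = group_relative_advantages([1.0, 0.0, 1.0, 0.0])
# correct responses get positive advantage, wrong ones negative,
# which is exactly "rl increases the prob of the correct response"
```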

winter arc #1 (9.5hrs): -> read context & sequence parallel -> failed to impl ring attn -> binge watched @willccbb vids on yt -> went deep into deepseek-r1 and watched some vids -> posted a tweet on a gpt2 impl in triton and got a like from karpathy -> overall not a bad day!!
i love how @dwarkesh_sp is trying to convince richard sutton that next token prediction is kinda like rl
just saw @elliotarledge's latest yt vid. man, that's so deep and thoughtful, how you spoke about your highs and lows. just wanted to say you're an absolute inspiration, man. good things will definitely happen soon brother!! keep inspiring us with your work and time lapses!!
why chatgpt is better than google -> compression: quick answers + stores a lot of information. the compression ratio is very good. -> context: able to identify your problems/questions that aren't on the internet and answer them specifically.
man, these llms are so good without any context. i wonder what happens if we give the right context to these llms.
this is not what i expected for humans vs robots to be