
Vivek

@cuda_optimized

powerful dreamer @iiscbangalore Interests : ai, tech, f1, cricket, music

Pinned Tweet

woke up to this!! 😱😱


does anyone actually use LinkedIn, or do we all just log in once a month to accept random connection requests from strangers?


open source gives ideas. closed source takes them, scales them, hides them. fair game or just pure cheating?


highlight your notes in a quick and easy way. credit : @adivekar_
-> green - quick read
-> yellow - read slowly and imp
-> red - read, think and understand


yeah, now this makes sense.


btw, @waitin4agi_ cooked this one!!


winter arc #2 (9hrs):
-> read deepseek-math paper and grpo
-> finished first 8 chapters of rlhfbook by @natolambert
-> read @kipperrii transformer inference arithmetic
-> watched @elliotarledge vid on cublas and cublasLt
-> wrote sgemm & hgemm in cublas
-> @karpathy nanochat

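as a rough illustration of the transformer inference arithmetic item in the list above, here is a back-of-the-envelope sketch. the model shape (a 7B dense decoder, 32 layers, 32 heads, fp16 weights) is an assumed example for illustration, not a number from the thread.

```python
# rough transformer inference arithmetic (back-of-the-envelope, assumed model shape)
n_params   = 7e9      # total parameters (assumed 7B dense decoder)
n_layers   = 32       # assumed
n_heads    = 32       # assumed
d_head     = 128      # assumed: d_model 4096 / 32 heads
bytes_fp16 = 2

# per generated token, decode FLOPs are roughly 2 * n_params
# (one multiply + one add per weight)
flops_per_token = 2 * n_params

# KV cache per token: 2 (K and V) * layers * heads * d_head * bytes per element
kv_bytes_per_token = 2 * n_layers * n_heads * d_head * bytes_fp16

# weights resident in GPU memory at fp16
weight_bytes = n_params * bytes_fp16

print(f"~{flops_per_token:.1e} FLOPs per generated token")
print(f"~{kv_bytes_per_token / 2**20:.2f} MiB of KV cache per token")
print(f"~{weight_bytes / 2**30:.1f} GiB of fp16 weights")
```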

do the kl penalty and grad norm ultimately have the same effect on the grpo loss? if so, why can't we just use grad norm instead of the kl penalty term?
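for what it's worth, the two act at different levels: the kl penalty is a term inside the loss that pulls the policy back toward a frozen reference model, while grad-norm clipping only rescales the gradient after backprop and says nothing about staying close to the reference. a minimal pytorch-style sketch of that difference; the function names and the beta value are illustrative assumptions, not the paper's implementation.

```python
import torch

beta = 0.04  # kl coefficient, illustrative value only

def grpo_loss_with_kl(logp, logp_ref, advantages):
    """Simplified GRPO-style objective: a policy-gradient term plus a per-token
    KL penalty toward the frozen reference policy (the unbiased k3-style
    estimator used in deepseek-math). The penalty changes WHAT is optimized."""
    ratio_ref = torch.exp(logp_ref - logp)      # pi_ref / pi
    kl = ratio_ref - (logp_ref - logp) - 1      # non-negative KL estimate
    pg = -(advantages * logp)                   # reinforce-style surrogate
    return (pg + beta * kl).mean()

def step(optimizer, model, loss):
    """Grad-norm clipping, by contrast, only rescales HOW BIG the update is;
    it does not keep the policy distribution close to the reference model."""
    optimizer.zero_grad()
    loss.backward()
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
    optimizer.step()
```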


notes on deepseek-math paper
- deepseek-math-base -> pretrained model on code and math data
- deepseekmath-instruct 7B -> sft using CoT, PoT and tool reasoning
- deepseekmath-rl -> grpo on gsm8k and math questions
- rl is increasing prob of correct response

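a tiny sketch of the group-relative part of grpo described in the notes above: sample several responses per prompt, score them, and standardize each reward against the group mean and std so no critic/value network is needed. the rewards below are made-up examples, not numbers from the paper.

```python
import torch

def grpo_advantages(rewards: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """Group-relative advantages: each response's reward is standardized
    against the other responses sampled for the same prompt."""
    return (rewards - rewards.mean()) / (rewards.std() + eps)

# e.g. 4 sampled answers to one gsm8k question, reward = 1 if the final answer is correct
rewards = torch.tensor([1.0, 0.0, 0.0, 1.0])
print(grpo_advantages(rewards))  # correct answers get positive advantage, wrong ones negative
```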

winter arc #1 (9.5hrs):
-> read context & sequence parallel
-> failed to impl ring attn
-> binge watched @willccbb vids on yt
-> went deep into deepseek-r1 and watched some vids
-> posted tweet on gpt2 impl in triton and got a like from karpathy
-> overall not a bad day!!


most of the llms today are simps.


i love how @dwarkesh_sp is trying to convince richard sutton that next token prediction is kinda like rl


just saw @elliotarledge's latest yt vid. man, that's so deep and thoughtful, how you spoke about your highs and lows. just wanted to say you're an absolute inspiration man. good things will definitely happen soon brother!! keep inspiring us with your work and time lapses!!


why chatgpt is better than google
-> compression : quick answers + stores a lot of information. compression ratio is very good.
-> context : able to identify your problems/questions which are not on the internet and answer them specifically.


man these llms are so good without any context. i wonder what happens if we give the right context to these llms


this is not what i expected for humans vs robots to be

I’m sorry but WHAT THE FUCK?!



sam altman has a way of answering the question without actually answering the question while doing a podcast


not sure why, but after a number of responses grok tends to repeat the previous answer. @xai

