Aman Swar

@Compile_Conquer

Building fast & efficient AI systems, from low-level CUDA kernels to distributed training frameworks. AI Systems Engineer | 3rd Year undergraduate

India

amanswar.github.io

Joined July 2023

95 Posts · 13 Followers · 536 Following

Aman Swar

@Compile_Conquer

Nov 10

Started learning inline PTX assembly in CUDA and built a small header that implements:
- Guarded global loads/stores using predicate registers
- cp.async asynchronous copy from global → shared memory
- Vectorized 128-bit loads/stores to improve bandwidth
#CUDA #PyTorch
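
For readers unfamiliar with the three techniques named in the post, here is a minimal sketch of what such helpers might look like. It is not the author's header: every helper and kernel name (`ld_guarded`, `st_guarded`, `ld_vec4`, `st_vec4`, `cp_async_16B`, `cp_async_wait_all`, `demo`) is hypothetical, and the buffer sizes are arbitrary. It assumes a GPU with compute capability 8.0 or newer, since `cp.async` is not available on older architectures, and pointers that are 16-byte aligned for the 128-bit paths.

```cuda
// Illustrative sketch only; all helper names are hypothetical, not from the author's header.
// cp.async needs compute capability 8.0+, e.g.: nvcc -arch=sm_80 ptx_sketch.cu
#include <cstdio>
#include <cuda_runtime.h>

constexpr int kThreads = 128;

// Guarded load: ld.global runs only if predicate p is set; otherwise v keeps the fallback.
__device__ __forceinline__ float ld_guarded(const float* ptr, bool ok, float fallback) {
    float v = fallback;
    asm volatile(
        "{\n"
        "  .reg .pred p;\n"
        "  setp.ne.b32 p, %2, 0;\n"
        "  @p ld.global.f32 %0, [%1];\n"
        "}\n"
        : "+f"(v) : "l"(ptr), "r"(static_cast<int>(ok)));
    return v;
}

// Guarded store: st.global is skipped entirely when the predicate is false.
__device__ __forceinline__ void st_guarded(float* ptr, float v, bool ok) {
    asm volatile(
        "{\n"
        "  .reg .pred p;\n"
        "  setp.ne.b32 p, %2, 0;\n"
        "  @p st.global.f32 [%0], %1;\n"
        "}\n"
        :: "l"(ptr), "f"(v), "r"(static_cast<int>(ok)) : "memory");
}

// Vectorized 128-bit load/store: four floats per instruction (ptr must be 16-byte aligned).
__device__ __forceinline__ float4 ld_vec4(const float* ptr) {
    float4 v;
    asm volatile("ld.global.v4.f32 {%0, %1, %2, %3}, [%4];"
                 : "=f"(v.x), "=f"(v.y), "=f"(v.z), "=f"(v.w) : "l"(ptr));
    return v;
}
__device__ __forceinline__ void st_vec4(float* ptr, float4 v) {
    asm volatile("st.global.v4.f32 [%0], {%1, %2, %3, %4};"
                 :: "l"(ptr), "f"(v.x), "f"(v.y), "f"(v.z), "f"(v.w) : "memory");
}

// Asynchronous 16-byte copy from global to shared memory.
__device__ __forceinline__ void cp_async_16B(void* smem_dst, const void* gmem_src) {
    unsigned smem_addr = static_cast<unsigned>(__cvta_generic_to_shared(smem_dst));
    asm volatile("cp.async.cg.shared.global [%0], [%1], 16;"
                 :: "r"(smem_addr), "l"(gmem_src) : "memory");
}
// Commit the outstanding copies as one group and wait until that group has landed.
__device__ __forceinline__ void cp_async_wait_all() {
    asm volatile("cp.async.commit_group;\n\tcp.async.wait_group 0;" ::: "memory");
}

__global__ void demo(const float* in, float* out_vec, float* out_guard, int n_guard) {
    __shared__ __align__(16) float tile[kThreads * 4];
    int t = threadIdx.x;
    // Stage 4 floats per thread into shared memory with cp.async.
    cp_async_16B(&tile[t * 4], in + t * 4);
    cp_async_wait_all();
    __syncthreads();
    // Add the staged copy to a direct 128-bit load, so out_vec = 2 * in.
    float4 s = *reinterpret_cast<float4*>(&tile[t * 4]);
    float4 g = ld_vec4(in + t * 4);
    st_vec4(out_vec + t * 4, make_float4(g.x + s.x, g.y + s.y, g.z + s.z, g.w + s.w));
    // Guarded path: threads with t >= n_guard neither load nor store.
    float x = ld_guarded(in + t, t < n_guard, 0.0f);
    st_guarded(out_guard + t, 2.0f * x, t < n_guard);
}

int main() {
    const int n_vec = kThreads * 4, n_guard = 100;
    float h_in[n_vec], h_vec[n_vec], h_guard[kThreads];
    for (int i = 0; i < n_vec; ++i) h_in[i] = float(i + 1);

    float *d_in, *d_vec, *d_guard;
    cudaMalloc(&d_in, sizeof(h_in));
    cudaMalloc(&d_vec, sizeof(h_vec));
    cudaMalloc(&d_guard, sizeof(h_guard));
    cudaMemcpy(d_in, h_in, sizeof(h_in), cudaMemcpyHostToDevice);
    cudaMemset(d_guard, 0, sizeof(h_guard));  // untouched slots must stay 0

    demo<<<1, kThreads>>>(d_in, d_vec, d_guard, n_guard);
    cudaError_t err = cudaDeviceSynchronize();
    if (err != cudaSuccess) { printf("CUDA error: %s\n", cudaGetErrorString(err)); return 1; }
    cudaMemcpy(h_vec, d_vec, sizeof(h_vec), cudaMemcpyDeviceToHost);
    cudaMemcpy(h_guard, d_guard, sizeof(h_guard), cudaMemcpyDeviceToHost);

    bool ok = true;
    for (int i = 0; i < n_vec; ++i) ok &= (h_vec[i] == 2.0f * h_in[i]);
    for (int i = 0; i < kThreads; ++i)
        ok &= (h_guard[i] == (i < n_guard ? 2.0f * h_in[i] : 0.0f));
    printf("%s\n", ok ? "all checks passed" : "mismatch");

    cudaFree(d_in); cudaFree(d_vec); cudaFree(d_guard);
    return ok ? 0 : 1;
}
```

The predicate form avoids a branch around the memory instruction, which is why it is a common choice for boundary handling in tiled kernels. Whether the 128-bit and `cp.async` paths actually improve bandwidth depends on the access pattern and hardware, so this sketch only checks correctness, not speed.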

Aman Swar reposted

ThePrimeagen

Nov 4

ASI will happen. Because of tech? No. Because the average IQ will experience such a steep drop that exceeding human intelligence becomes trivial.

Aman Swar

@Compile_Conquer

Nov 2

Going to share some of my old LinkedIn posts here...

Aman Swar

@Compile_Conquer

Sep 10

After 3 freakin hours of debugging a fused Grouped Query Attention kernel, I finally see this plot. Feeling happy.
blue = PyTorch attention (dies as seq_len ↑), green = my Triton kernel (🚀)
Check out: github.com/AmanSwar/Model…
#PyTorch #CUDA

Aman Swar

@Compile_Conquer

Aug 21

Eating CuTe today for lunch and dinner #CUDA #CUTLASS #Nvidia

Shlok Limbhare

@limbizzz11

Amara

@Vlaleesaur5717

Mayank Sharma

@sharmayank16

DividendAristo🇺🇸

@Hileal207

MaxineRoy

@b4cVG131dM0Tpr9

sudeep swar

@SwarSudeep64728

MyrnaCowper

@v8lVxS7hhYlts

StackOverflowed

@StackOverflowe

mateo

@matejhladky_dev

Priyansh Saxena

@Priyansh77718

Redamancy

@Sophia9is

Katie

@cruice_katie81

Drishan Arora

@drishanarora

Deep Cogito

@DeepCogito

Joseph Suarez 🐡

@jsuarez5341

EleutherAI

@AiEleuther

Casper Hansen

@casper_hansen_

Jonathan Blow

@Jonathan_Blow

Casey Muratori

@cmuratori

samsja

@samsja19

Joseph Spisak

@joespeez

marius eriksen

@marius

difficultyang

@difficultyang

Yan Chernikov

@TheCherno

Hedgie

@HedgieMarkets

Iain Dunning

@iaindunning

Wenting Zhao

@wzhao_nlp

Vedant Misra

@vedantmisra

tender

@tenderizzation

Ayush Jaiswal

@ayushjaiswal

steve

@gpusteve

NEXA AI

@nexa_ai

Shlomi Fruchter

@shlomifruchter

Jack Parker-Holder

@jparkerholder

Stuart Sul

@stuart_sul

Asuka

@HighFreqAsuka

Periodic Labs

@periodiclabs

Max Mynter

@MaxMynter

Krish Shah

@KrishRShah

Zeeshan Patel

@zeeshanp_

Manas Kala, PhD

@Anonymanasensei

Vishal S Pandey

@its_vayishu

Raj Dabre

@prajdabre

Scott Gray

@scottgray76

Deep-ML

@real_deep_ml

mobicham

@mobicham

Thien Tran

@gaunernst

Thinking Machines

@thinkymachines

Ivan Yashchuk

@IvanYashchuk

Alexander Amini

@xanamini

Tulsee Doshi

@tulseedoshi

Minna Song

@minnasong

Ahmad

@TheAhmadOsman

Igor Babuschkin

@ibab

Sholto Douglas

@_sholtodouglas

Quentin Gallouédec

@QGallouedec

Ava Amini

@avapamini

Chen Sun 🤖🧠🇨🇦

@ChenSun92

Shengjia Zhao

@shengjia_zhao

Hatice Ozen

@ozenhati

David Pfau

@pfau

Toby Pohlen

@TobyPhln

