Mohit acharya

@mohitxya

Peeling off the layers of abstraction.

India

十月 2014 加入

7帖子 6关注者 242正在关注

Mohit acharya 已转帖

Jeremy Howard

年11月9日

Side effect of blocking Chinese firms from buying the best NVIDIA cards: top models are now explicitly being trained to work well on older/cheaper GPUs. The new SoTA model from @Kimi_Moonshot uses plain old BF16 ops (after dequant from INT4); no need for expensive FP4 support.

jeremyphoward's tweet image. Side effect of blocking Chinese firms from buying the best NVIDIA cards: top models are now explicitly being trained to work well on older/cheaper GPUs.

The new SoTA model from @Kimi_Moonshot uses plain old BF16 ops (after dequant from INT4); no need for expensive FP4 support.

Zhihu Frontier

年11月8日

🚀 "Quantization is not a compromise — it's the next paradigm." After K2-Thinking's release, many developers have been curious about its native INT4 quantization format. 刘少伟, infra engineer at @Kimi_Moonshot and Zhihu contributor, shares an insider's view on why this choice…

ZhihuFrontier's tweet image. 🚀 "Quantization is not a compromise — it's the next paradigm."
After K2-Thinking's release, many developers have been curious about its native INT4 quantization format.
刘少伟, infra engineer at @Kimi_Moonshot and Zhihu contributor, shares an insider's view on why this choice…

ZhihuFrontier's tweet image. 🚀 "Quantization is not a compromise — it's the next paradigm."
After K2-Thinking's release, many developers have been curious about its native INT4 quantization format.
刘少伟, infra engineer at @Kimi_Moonshot and Zhihu contributor, shares an insider's view on why this choice…

ZhihuFrontier's tweet image. 🚀 "Quantization is not a compromise — it's the next paradigm."
After K2-Thinking's release, many developers have been curious about its native INT4 quantization format.
刘少伟, infra engineer at @Kimi_Moonshot and Zhihu contributor, shares an insider's view on why this choice…

ZhihuFrontier's tweet image. 🚀 "Quantization is not a compromise — it's the next paradigm."
After K2-Thinking's release, many developers have been curious about its native INT4 quantization format.
刘少伟, infra engineer at @Kimi_Moonshot and Zhihu contributor, shares an insider's view on why this choice…

Mohit acharya

年6月22日

Finally wrapped up the hardware half of Nand2Tetris, took me a week. It had been on my to-do list for way longer than I’d like to admit.

mohitxya's tweet image. Finally wrapped up the hardware half of Nand2Tetris, took me a week. It had been on my to-do list for way longer than I’d like to admit.

Mohit acharya 已转帖

shrihacker

年1月29日

Deepseek engineers so cracked they bypassed cuda

shrihacker's tweet image. Deepseek engineers so cracked they bypassed cuda

shrihacker's tweet image. Deepseek engineers so cracked they bypassed cuda

SUJAL KHANDELWAL💥

@SUJALKH26064008

Andrew Joy Hembrom

@andrewhembrom_

sps

@S_P_S_4

theone_97

@_theone_97

ramján

@ramjankdl

Dr Rachana Bhatawdekar

@AstroRach

Ion Stoica

@istoica05

$realwillreil's profile picture. wannabe soft\hard eng, actual retard. 🇨🇦$

Will

@realwillreil

Jino Rohit

@jino_rohit

Umar Jamil

@hkproj

François Chollet

@fchollet

Dwarkesh Patel

@dwarkesh_sp

Igor Michalak

@igorjmichalak

Joseph Suarez 🐡

@jsuarez5341

Radek Osmulski

@radekosmulski

NextSilicon

@NextSilicon

Teknium (e/λ)

@Teknium

Tarek Mansour

@mansourtarek_

rajan agarwal

@_rajanagarwal

chastronomic

@chastronomic

anandmaj

@Almondgodd

Kenny Guo

@kennykgguo

Henry Ko

@henryHM_ko

Sholto Douglas

@_sholtodouglas

Ali Taha

@AliesTaha

Glinert 🇺🇸 🏭

@StevenGlinert

actual hog

@actualhog

SIGARCH

@sigarch

Computer Architecture Student Association (CASA)

@CompArchSA

Ben Dicken

@BenjDicken

LiveOverflow 🔴

@LiveOverflow

Andreas Kling

@awesomekling

Matthew Hartensveld, PhD

@MattHartensveld

@fclc

@FelixCLC_

Daniel Liu

@p1nosaur

Nicholas von Bodungen

@nvonbodungen

leloy!

@leloykun

David Gomes

@davidrfgomes

OXMIQ

@realoxmiqlabs

Dorsa Rohani

@dorsa_rohani

Casey Muratori

@cmuratori

Piotr Mazurek @ NeurIPS 🇺🇸

@tugot17

@levelsio

@levelsio

Tony Dinh 🎯

@tdinh_me

Jason Benn

@jasoncbenn

Yacine Mahdid

@yacinelearning

Sudarshan Kamath

@kamath_sutra

rh

@_renhau

Joe Fioti

@joefioti

em m0shou

@emm0sh

zack

@zack_overflow

Andy Pavlo (@andypavlo.bsky.social)

@andy_pavlo

Bryan Johnson

@bryan_johnson

Zettascale Computing Corporation

@thezettascale

Adithya S K

@adithya_s_k

MacCallister Higgins

@macjshiggins

United States 趋势

1. Duke 32.8K posts
2. Auburn 41K posts
3. Stockton 25.3K posts
4. Bama 29.8K posts
5. Miami 137K posts
6. Ole Miss 38.6K posts
7. Lane Kiffin 48.7K posts
8. Notre Dame 25.9K posts
9. Stanford 9,977 posts
10. #SurvivorSeries 191K posts
11. #JimmySeaFanconD2 198K posts
12. Austin Theory 5,326 posts
13. Virginia 48.7K posts
14. #BNewEraBirthdayConcert 690K posts
15. #INDvSA 35.8K posts
16. PERTHSANTA LUMINOUS SKIN 275K posts
17. Cam Coleman 2,048 posts
18. Ewing 1,311 posts
19. #NIVEASkinGlowxPerthSanta 319K posts
20. BECKY BIRTHDAY CONCERT 679K posts

Something went wrong.

Something went wrong.