Mohit acharya

@mohitxya

Peeling off the layers of abstraction.

India

10월 2014에 가입

7게시물 6팔로워 242팔로우 중

Mohit acharya 님이 재게시함

Jeremy Howard

. 11. 9.

Side effect of blocking Chinese firms from buying the best NVIDIA cards: top models are now explicitly being trained to work well on older/cheaper GPUs. The new SoTA model from @Kimi_Moonshot uses plain old BF16 ops (after dequant from INT4); no need for expensive FP4 support.

jeremyphoward's tweet image. Side effect of blocking Chinese firms from buying the best NVIDIA cards: top models are now explicitly being trained to work well on older/cheaper GPUs.

The new SoTA model from @Kimi_Moonshot uses plain old BF16 ops (after dequant from INT4); no need for expensive FP4 support.

Zhihu Frontier

. 11. 8.

🚀 "Quantization is not a compromise — it's the next paradigm." After K2-Thinking's release, many developers have been curious about its native INT4 quantization format. 刘少伟, infra engineer at @Kimi_Moonshot and Zhihu contributor, shares an insider's view on why this choice…

ZhihuFrontier's tweet image. 🚀 "Quantization is not a compromise — it's the next paradigm."
After K2-Thinking's release, many developers have been curious about its native INT4 quantization format.
刘少伟, infra engineer at @Kimi_Moonshot and Zhihu contributor, shares an insider's view on why this choice…

ZhihuFrontier's tweet image. 🚀 "Quantization is not a compromise — it's the next paradigm."
After K2-Thinking's release, many developers have been curious about its native INT4 quantization format.
刘少伟, infra engineer at @Kimi_Moonshot and Zhihu contributor, shares an insider's view on why this choice…

ZhihuFrontier's tweet image. 🚀 "Quantization is not a compromise — it's the next paradigm."
After K2-Thinking's release, many developers have been curious about its native INT4 quantization format.
刘少伟, infra engineer at @Kimi_Moonshot and Zhihu contributor, shares an insider's view on why this choice…

ZhihuFrontier's tweet image. 🚀 "Quantization is not a compromise — it's the next paradigm."
After K2-Thinking's release, many developers have been curious about its native INT4 quantization format.
刘少伟, infra engineer at @Kimi_Moonshot and Zhihu contributor, shares an insider's view on why this choice…

Mohit acharya

. 6. 22.

Finally wrapped up the hardware half of Nand2Tetris, took me a week. It had been on my to-do list for way longer than I’d like to admit.

mohitxya's tweet image. Finally wrapped up the hardware half of Nand2Tetris, took me a week. It had been on my to-do list for way longer than I’d like to admit.

Mohit acharya 님이 재게시함

shrihacker

. 1. 29.

Deepseek engineers so cracked they bypassed cuda

shrihacker's tweet image. Deepseek engineers so cracked they bypassed cuda

shrihacker's tweet image. Deepseek engineers so cracked they bypassed cuda

SUJAL KHANDELWAL💥

@SUJALKH26064008

Andrew Joy Hembrom

@andrewhembrom_

sps

@S_P_S_4

theone_97

@_theone_97

ramján

@ramjankdl

Dr Rachana Bhatawdekar

@AstroRach

Ion Stoica

@istoica05

$realwillreil's profile picture. wannabe soft\hard eng, actual retard. 🇨🇦$

Will

@realwillreil

Jino Rohit

@jino_rohit

Umar Jamil

@hkproj

François Chollet

@fchollet

Dwarkesh Patel

@dwarkesh_sp

Igor Michalak

@igorjmichalak

Joseph Suarez 🐡

@jsuarez5341

Radek Osmulski

@radekosmulski

NextSilicon

@NextSilicon

Teknium (e/λ)

@Teknium

Tarek Mansour

@mansourtarek_

rajan agarwal

@_rajanagarwal

chastronomic

@chastronomic

anandmaj

@Almondgodd

Kenny Guo

@kennykgguo

Henry Ko

@henryHM_ko

Sholto Douglas

@_sholtodouglas

Ali Taha

@AliesTaha

Glinert 🇺🇸 🏭

@StevenGlinert

actual hog

@actualhog

SIGARCH

@sigarch

Computer Architecture Student Association (CASA)

@CompArchSA

Ben Dicken

@BenjDicken

LiveOverflow 🔴

@LiveOverflow

Andreas Kling

@awesomekling

Matthew Hartensveld, PhD

@MattHartensveld

@fclc

@FelixCLC_

Daniel Liu

@p1nosaur

Nicholas von Bodungen

@nvonbodungen

leloy!

@leloykun

David Gomes

@davidrfgomes

OXMIQ

@realoxmiqlabs

Dorsa Rohani

@dorsa_rohani

Casey Muratori

@cmuratori

Piotr Mazurek @ NeurIPS 🇺🇸

@tugot17

@levelsio

@levelsio

Tony Dinh

@tdinh_me

Jason Benn

@jasoncbenn

Yacine Mahdid

@yacinelearning

Sudarshan Kamath

@kamath_sutra

rh

@_renhau

Joe Fioti

@joefioti

em m0shou

@emm0sh

zack

@zack_overflow

Andy Pavlo (@andypavlo.bsky.social)

@andy_pavlo

Bryan Johnson

@bryan_johnson

Zettascale Computing Corporation

@thezettascale

Adithya S K

@adithya_s_k

MacCallister Higgins

@macjshiggins

United States 트렌드

1. Cyber Monday 39.4K posts
2. #Fivepillarstoken 1,549 posts
3. #IDontWantToOverreactBUT 1,189 posts
4. Alina Habba 19K posts
5. TOP CALL 10.8K posts
6. #MondayMotivation 9,115 posts
7. #GivingTuesday 2,318 posts
8. Shopify 3,919 posts
9. $MSTR 13.7K posts
10. Mainz Biomed N/A
11. Token Signal 3,085 posts
12. #Rashmer 17K posts
13. Check Analyze N/A
14. Market Focus 2,597 posts
15. Victory Monday 1,570 posts
16. Good Monday 42.2K posts
17. JUST ANNOUNCED 18.5K posts
18. Clarie 3,141 posts
19. World AIDS Day 17.5K posts
20. GreetEat Corp. N/A

Something went wrong.

Something went wrong.