tsuki
@tensorcore
Hiroyuki Ootomo. High-precision GEMM emulation on Tensor Cores. Work at #76B900. Ph.D. from @TokyoTech_en. Cooking @cp_async. Hai-to-Yoka: https://x.enp1s0.dev
你可能會喜歡
Good news. SCAsia/HPCAsia2026 Early bird reg. extended to Jan 5! ディスカウントの早期登録を1月5日まで延長しました。This is to accommodate late posters and visa issues, but everyone could take advantage. ポスターやビザ関係を考慮してですが、全ての参加者に適当されます。日本語…
I feel more relaxed working on weekends than on weekdays, and late at night rather than during the day...
Attached an ADS-B receiver to a Raspberry Pi Zero W for Flightradar24. I still need to update the OS, though...
I’d like to try making a picture in the style of 黃文勇. I saw his work at his solo exhibition in Kaohsiung last year (相無所相) and really liked it.
Nago City Hall in Okinawa. I love the architecture. It feels somewhat brutalist to me.
visited Okinawa Institute of Science and Technology. It’s in Japan, but it doesn’t feel like Japan at all. I want to be a Ph.D. student here.
昔書いた、カメラ映像から流れ星を検出して自動的にお願い事を標準出力に唱えるプログラム(autopray)を、良いレンズのある今こそ動かすときな気がしてきた。ふたご座流星群でお願い事叶え放題。マニ車的祈念に効果があるのか分からないけれど。
gemm and gmem are now being typed interchangeably by tired stupid me
ChatGPT, Perplexity, Gemini, etc. somehow manage to include every possible lie in the world when answering a question about Fortran+CUDA libs. That's a waste of energy.
=> "Is Mixed Precision Computing really the Top Priority?", Hartwig Anzt, TUM, WS on Approx Comp in NLA, Oct 8, 2025 sdrive.cnrs.fr/s/djQWs8W6gcdY… Bandwidth & Latency can not keep up with growth in compute power Ginkgo github.com/ginkgo-project… AI WH, Rio Yokota x.com/ogawa_tter/sta…
"Emulating High-precision Matrix Operations on Low-precision Matrix Engines" Rio Yokota, WS on Approx Comp in NLA, Oct 8, 2025 sdrive.cnrs.fr/s/djQWs8W6gcdY… <= M. Fasi, Oct 8 x.com/ogawa_tter/sta… NVIDIA, Oct 8 x.com/ogawa_tter/sta… Next-gen TPU? x.com/ogawa_tter/sta… Block Data
=> "Emulation of Complex Matrix Multiplication based on the Chinese Remainder Theorem", @uchino_error (RIKEN Kobe), et al., arXiv, Dec 9, 2025 arxiv.org/abs/2512.08321 Ozaki-II scheme ScalAH 2025 (SC25 WS) dl.acm.org/doi/10.1145/37… K. Ozaki, Jul 2 x.com/ogawa_tter/sta…
=> "Emulating Matrix Multiplication Using Mixed-Precision Computation", K. Ozaki, NGT - Openlab "Optimising Floating Point Precision" WS, Jul 2 (MP4) indico.cern.ch/event/1538409/… indico.cern.ch/event/1538409/… Ozaki Scheme II, Apr 27 (10) arxiv.org/abs/2504.08009 Aug 8 x.com/ogawa_tter/sta…
> The next major change in hardware design will be shared exponents
"Emulating High-precision Matrix Operations on Low-precision Matrix Engines" Rio Yokota, WS on Approx Comp in NLA, Oct 8, 2025 sdrive.cnrs.fr/s/djQWs8W6gcdY… <= M. Fasi, Oct 8 x.com/ogawa_tter/sta… NVIDIA, Oct 8 x.com/ogawa_tter/sta… Next-gen TPU? x.com/ogawa_tter/sta… Block Data
=> "Floating-Point Matrix Multiply with Integer Arithmetic", M. Fasi, U of Leeds, with A. Abdelfattah, J. Dongarra, M. Mikaitis & F. Tisseur, WS on Approx Comp in NLA, Oct 8, 2025 sdrive.cnrs.fr/s/djQWs8W6gcdY… arXiv. Jun 12 arxiv.org/abs/2506.11277 Sep 5, 2023 x.com/ogawa_tter/sta…
=> "DGEMM on Integer Tensor Cores", @tensorcore, NHR PerfLab Seminar, Sep 5, 2023 youtube.com/watch?v=ouK0gw… hpc.fau.de/files/2023/09/… Can DL processors be used for HPC applications? Can we emulate DGEMM in the same manner? We can! Ozaki scheme arXiv, Jun 22 arxiv.org/abs/2306.11975
United States 趨勢
- 1. The JUP 333K posts
- 2. FINALLY DID IT 567K posts
- 3. #IDontWantToOverreactBUT N/A
- 4. 60 Minutes 104K posts
- 5. Greenland 20.5K posts
- 6. NextNRG Inc. N/A
- 7. #MondayMotivation 31.9K posts
- 8. Good Monday 42.7K posts
- 9. Bobby Petrino N/A
- 10. Bari Weiss 86.3K posts
- 11. Victory Monday 2,604 posts
- 12. The Odyssey 35.9K posts
- 13. Christopher Nolan 37.5K posts
- 14. Chris Rea 8,989 posts
- 15. #MondayVibes 4,962 posts
- 16. Algorhythm Holdings N/A
- 17. #HunSen N/A
- 18. JD Vance 176K posts
- 19. Nicki Minaj 225K posts
- 20. Chapel Hill N/A
你可能會喜歡
-
Shinnosuke Furuya
@sfuruyaz -
YOSHIFUJI Naoki
@LWisteria -
herumi
@herumi -
acc-mu3n
@AcceleratedMu3n -
株式会社フィックスターズ
@Fixstars_JP -
R. Shioya
@r_shioya -
山田 ネオエクスデス てるみ
@telmin_orca -
電子計算機の沼
@Hishinuma_t -
Kuninobu SaSaki
@_ksasaki -
Ryuji Fuchikami
@Ryuz88 -
PCクラスタコンソーシアム
@PrPccc -
てらモス🌹
@termoshtt -
HPCwire Japan
@hpcwirejapan -
Rio Yokota
@rioyokota -
Keigo Nitadori
@k_nitadori
Something went wrong.
Something went wrong.