VHPCworkshop's profile picture. Workshop on Virtualization in High-Performance Cloud Computing. News and views on #HPC #virtualization #VHPC #VMs #containers #cloud #ai

VHPC

@VHPCworkshop

Workshop on Virtualization in High-Performance Cloud Computing. News and views on #HPC #virtualization #VHPC #VMs #containers #cloud #ai

VHPC '25 Program is now live: vhpc.org Featured Keynote Tutorial: "Writing a hypervisor from scratch" Join Seiya Nuta from Vercel Inc. for an hands-on tutorial covering fundamental concepts and practical implementation techniques.


VHPC 2025 : 20th Workshop on Virtualization in High-Performance Cloud Computing, Dresden, Germany Deadline Extension to May 25, 2025 The Euro-Par Dresden VHPC workshop will emphasize containers/virtualization for LLM training/inference workloads vhpc.org

VHPCworkshop's tweet image. VHPC 2025 : 20th Workshop on Virtualization in High-Performance Cloud Computing, Dresden, Germany
Deadline Extension to May 25, 2025

The Euro-Par Dresden VHPC workshop will emphasize containers/virtualization for LLM training/inference workloads vhpc.org

VHPC is now active on Mastodon as @vhpc@mastodon.social and on Bluesky under the handle @vhpc.bsky.social.


NVIDIA vGPU is going non-proprietary nouveau/nvkm which is significant. vGPU still needs more transistors so to speak on the memory-management side though. lore.kernel.org/kvm/2024092416…


VHPC reposted

HARMONY OF RESILIENCE: Recorded in space and sent to Earth via @SpaceX’s @Starlink constellation, Polaris Dawn crewmember and violinist @Gillis_SarahE invites you to enjoy this music moment in support of @StJude & @ElSistemaUSApolarisprogram.com/music


Podman 5.2.0 Linux virtiofs for podman machine VMs github.com/containers/pod… , yet has to work on RHEL-current though


EUV for cheap(er) invention without experiment - this is not mainstream but the blue LED wasn't either.

【プレスリリース】エネルギー効率を飛躍的に高める革新的なEUVリソグラフィー先端半導体製造技術を発表 従来の常識を覆し、わずか4枚の反射ミラーで構成された新型EUVリソグラフィ| 日本の研究.com research-er.jp/articles/view/… あかん!ASMLしんでまう!!!



VHPC '25 Schedule is live vhpc.github.io/schedule/


VHPC '24 Keynote Towards Using Partitioned GPU Virtual Functions for Mixture of Experts Vignesh Chander, Member of Technical Staff, AMD This talk provides an in-depth overview of the ROCm software ecosystem on MI300X platforms, with a focus on large-scale training and…


VHPC reposted

DeepSeek-Coder-V2: First Open Source Model Beats GPT4-Turbo in Coding and Math > Excels in coding and math, beating GPT4-Turbo, Claude3-Opus, Gemini-1.5Pro, Codestral. > Supports 338 programming languages and 128K context length. > Fully open-sourced with two sizes: 230B (also…

deepseek_ai's tweet image. DeepSeek-Coder-V2: First Open Source Model Beats GPT4-Turbo in Coding and Math

> Excels in coding and math, beating GPT4-Turbo, Claude3-Opus, Gemini-1.5Pro, Codestral.
> Supports 338 programming languages and 128K context length.
> Fully open-sourced with two sizes: 230B (also…

arxiv.org/abs/2405.14906 Teacher-model type stage bootstrapped by a foundation model followed by an autonomous stage. AIEV-INSTRUCT (Instruction Tuning with Agent-Interaction and Execution-Verified) describes it and shows that you don’t have to be Facebook for large scale…


multi-cloud or an extra local or hosted backup leg makes a lot of sense

Google Cloud accidentally deleted a company's entire cloud environment (Unisuper, an investment company, which manages $80B). The company had backups in another region, but GCP deleted those too. Luckily, they had yet more backups on another provider. theguardian.com/australia-news…



VHPC reposted

Ray Ozzie on the beginning of Azure


VHPC workshop submission deadline extended to May 20, updated text CfP below. vhpc.org ==================================================================== CALL FOR PAPERS 19th Workshop on Virtualization in High­-Performance Cloud Computing (VHPC '24) held in…


VHPC reposted

Highly amusing update, ~18 hours later: llm.c is now down to 26.2ms/iteration, exactly matching PyTorch (tf32 forward pass). We discovered a bug where we incorrectly called cuBLAS in fp32 mathmode 🤦‍♂️. And ademeure contributed a more optimized softmax kernel for very long rows…

karpathy's tweet image. Highly amusing update, ~18 hours later:

llm.c is now down to 26.2ms/iteration, exactly matching PyTorch (tf32 forward pass). We discovered a bug where we incorrectly called cuBLAS in fp32 mathmode 🤦‍♂️. And ademeure contributed a more optimized softmax kernel for very long rows…

A few new CUDA hacker friends joined the effort and now llm.c is only 2X slower than PyTorch (fp32, forward pass) compared to 4 days ago, when it was at 4.2X slower 📈 The biggest improvements were: - turn on TF32 (NVIDIA TensorFLoat-32) instead of FP32 for matmuls. This is a…

karpathy's tweet image. A few new CUDA hacker friends joined the effort and now llm.c is only 2X slower than PyTorch (fp32, forward pass) compared to 4 days ago, when it was at 4.2X slower 📈

The biggest improvements were:
- turn on TF32 (NVIDIA TensorFLoat-32) instead of FP32 for matmuls. This is a…


a Driver modification to re-enable P2P for 4090s over PCIe (large BAR), which is one step to PCIe n x 4090 vs. $$ DGX H100 or H100 SXM5. github.com/tinygrad/open-… Note IOMMU off which is a virtualization turn-off. Next question is if PyTorch nccl toch.distributed collective…


United States Trends

Loading...

Something went wrong.


Something went wrong.