#gpuprogramming search results
"Need better CUDA textbooks. 'Programming Massively Parallel Processors' is a good intro. I've created C/CUDA C implementations for first 3 chapters. Check book & my GitHub repo for details. #CUDA #GPUprogramming"
Day 2 of #GPUProgramming: >read an article about shared memory >learnt about registers……..Global memory >almost blacked out from elaboration of L1 & L2 iykyk >”repetition”to digest what I just learnt for about 900k milliseconds
"10 days into CUDA, and I’ve earned my first badge of honor! 🚀 From simple kernels to profiling, every day is a step closer to mastering GPU computing. Onward to 100! #CUDA #GPUProgramming #100DaysOfCUDA"
Each common operation is implemented as its own .cu file—modular. intriguing. #CUDA #NVIDIA #GPUProgramming #libcudf
Day 3 of GPU programming At this rate I'll be writing custom inference kernels for AI by next month. The gap between PyTorch abstractions and bare metal isn't as wide as it seemed. #CUDA #GPUProgramming #MachineLearning
Day 2 of GPU programming Never knew addition needs so much code 😂 Starting to get the hang of program_id. Used Gemini 3.0 to generate pseudocode since I'm new to GPU programming and didn't want full code. Lets hope this momentum continues
Day 3 of #GPUProgramming: >in-depth of what shared memory is capable of >read about techniques by which these concepts optimize their performance >synchronization of threads during matmul >realized it took ~1.5 hours to digest this stuff >tried to code matrix multiplication…
I started with @elliotarledge CUDA course, Here's the link youtu.be/86FAWCzIe_4?si… #gpu #gpuprogramming
youtube.com
YouTube
CUDA Programming Course – High-Performance Computing with GPUs
GPU CODING PLAYLIST youtube.com/playlist?list=… #gpu #gpuprogramming #vulkan #vulkanapi #computergraphics #hpc #highperformancecomputing #nvidia #intel #amd #howtocode #howtoprogram #raytracing #dataviz #infographics #art #digitalart #artist #cudaeducation
#GPUProgramming - Day 07: 🔧 #CPU Hazards 101 🚧: Ever heard of #Register Renaming & Out-of-Order Execution? They tackle structural hazards, ensuring smooth sailing for instructions. Watch out for Data Hazards (#RAW, #WAR, #WAW) in #MIPS, but fear not! #COA #LearnInPublic
#GPUProgramming - Day 02: 🔄 Exploring CPU architectures! #RISC, like #ARM & #Power, opts for efficiency with many registers. #CISC, exemplified by #Intel 8086, prioritizes simplicity, offering diverse, complex instructions. RISC excels in energy efficiency. #COA #LearnInPublic
#GPUProgramming - Day 03: 🧠 CPUs: Processors adapt with DISA. #CPU's core duo - Control Unit & Datapath. Datapath: Registers, ALU, Buses, Multiplexers – a data symphony! 🔄 Follow the Instruction Execution Cycle: Fetch ➡️Decode➡️Execute➡️Store➡️ Update PC. 🕹️ #LearnInPublic
#GPUProgramming - Day 01: 🚀 Exploring RISC architecture: Simplified, optimized instructions in one clock cycle. 🔄 Bye, CISC complexity! 🏎️ Registers rule, boosting speed. 🤖💡 Compiler-friendly design, slick pipelining for simultaneous processing! 🕵️♂️ #COA #RISC #LearnInPublic
#GPUProgramming - Day 08: 🚀 Explored #computerarchitecture today! 🖥️ Control Hazards tackle branch prediction, #Pentium FDIV bug a classic example. 💡 Memory #Hierarchy is key—#RAM, #cache levels (L1, L2, L3), and storage devices play crucial roles. 🔄🌐 #Memory #LearnInPublic
#GPUProgramming - Day 06: 🔍 Diving into computer architecture! 🖥️ Structural hazards arise when hardware resources are in high demand, causing contention among instructions. Data hazards? RAW, WAR, WAW – the battle for data paths and registers! 💡 #COA #LearnInPublic
#GPUProgramming - Day 04: 🕰️ Dive into processor architectures! 🧠 Single-cycle execution, one clock cycle per instruction, demands a versatile datapath. 🔄 Multi-cycle instructions break it down for a more intricate dance with time. ⏳ #ComputerArchitecture #LearnInPublic 🚀
🚀 Exciting Learning Opportunity! 🚀 For more details and registration: events.eurocc.lu/meluxina-intro… #GPUProgramming #CUDA #Supercomputing #ScientificComputing #MeluXina #Luxembourg
#GPUProgramming - Day 05: 🚀 Pipelining in computer architecture boosts performance by dividing instruction execution into stages. Techniques like forwarding, branch prediction, and superscalar processors enhance parallelism.💻🌐 #ComputerArchitecture #Pipelining #LearnInPublic
GPU PROGRAMMING PLAYLIST youtube.com/playlist?list=… #gpu #gpuprogramming #vulkan #vulkanapi #computergraphics #hpc #highperformancecomputing #nvidia #intel #amd #howtocode #howtoprogram #raytracing #dataviz #infographics #art #digitalart #artist #cudaeducation
Programming Tensor Cores in Unity with WMMA (Warp Matrix Multiply Accumulate) API. #gpuprogramming #unity3d I have written minimal example: github.com/przemyslawzawo…
For maximum performance, firms often develop custom CUDA kernels. This involves writing low-level code to directly program the GPU's parallel cores, squeezing out every drop of efficiency for critical tasks. #CUDA #GPUProgramming
GPU PROGRAMMING PLAYLIST youtube.com/playlist?list=… #gpu #gpuprogramming #vulkan #vulkanapi #computergraphics #hpc #highperformancecomputing #nvidia #intel #amd #howtocode #howtoprogram #raytracing #dataviz #infographics #art #digitalart #artist #cudaeducation
GPU CODING PLAYLIST youtube.com/playlist?list=… #gpu #gpuprogramming #vulkan #vulkanapi #computergraphics #hpc #highperformancecomputing #nvidia #intel #amd #howtocode #howtoprogram #raytracing #dataviz #infographics #art #digitalart #artist #cudaeducation
COMPUTER GRAPHICS | Frustum culling equation to discard object that aren't relevant youtu.be/WmQPuaj_j4k #gpu #gpuprogramming #frustumculling #computergraphics #hpc #howtocode #howtoprogram #geometry #shader #intel #amd #nvidia #dataviz #computersimulation #cudaeducation
youtube.com
YouTube
Vulkan API Discussion | Frustrum Culling + Level of Detail + Indirect...
GPU CODING PLAYLIST youtube.com/playlist?list=… #gpu #gpuprogramming #vulkan #vulkanapi #computergraphics #hpc #highperformancecomputing #nvidia #intel #amd #howtocode #howtoprogram #raytracing #dataviz #infographics #art #digitalart #artist #cudaeducation
NSIGHT GRAPHICS TUTORIAL: youtu.be/LtretfoL2tc | Vulkan, OpenGL, Direct 3D profiling and debugging | #graphicsprogramming #gpuprogramming #gpgpu #howtoprogram #howtocode #computerprogramming #howtowriteaprogram #siliconvalley #cudaeducation
youtube.com
YouTube
Nsight Graphics Tutorial | Cuda Education
@nvidia GPU bootcamp in the Claustro of @URosario. @HPCCol growing stronger. #HPC #GPUProgramming #OpenACC
Day 5 of GPU Programming -matrix transpose. -matrix prefix sum leetcode. #100DaysOfGPU #CUDA #GPUProgramming #ParallelComputing #AI #DeepLearning #100DaysOfCode #MachineLearning #NVIDIA #CodingJourney
Day 8 of GPU Programming -Count 2D Array Element -todays potd easy peasy again. #100DaysOfGPU #CUDA #GPUProgramming #ParallelComputing #AI #DeepLearning #100DaysOfCode #MachineLearning #NVIDIA #CodingJourney
Day 3 of GPU Programming -implemented ReLU activation for 1D array -Did potd leetcod -explored how GPUs handle element-wise operations in parallel #100DaysOfGPU #CUDA #GPUProgramming #ParallelComputing #AI #DeepLearning #100DaysOfCode #MachineLearning #NVIDIA #CodingJourney
Day 9 of GPU Programming -Leaky ReLU -leetcode potd easy but hard to implement #100DaysOfGPU #CUDA #GPUProgramming #ParallelComputing #AI #DeepLearning #100DaysOfCode #MachineLearning #NVIDIA #CodingJourney
Day 7 of GPU Programming -color inversion. -todays potd easy peasy. #100DaysOfGPU #CUDA #GPUProgramming #ParallelComputing #AI #DeepLearning #100DaysOfCode #MachineLearning #NVIDIA #CodingJourney
Compilation errors when using OpenACC with g++ 10 stackoverflow.com/questions/6542… #openacc #gpuprogramming #g++ #compilererrors #cpp
Day 6 of GPU Programming -matrix addition. -todays potd was on harder side. #100DaysOfGPU #CUDA #GPUProgramming #ParallelComputing #AI #DeepLearning #100DaysOfCode #MachineLearning #NVIDIA #CodingJourney pic.x.com/9qrJliv3Fdhttp…
And this was day 3 of the DL & #GPUprogramming course @LRZ_DE @HLRS_HPC @Uni_Stuttgart @NVIDIAAI DLI, with the grande finale: we learnt how to employ distributed stoch. gradient descent w/ multi-GPU, @TensorFlow & #Horovod @UberAILabs @LFAIDataFdn🎉😍highly recommended course!
Day 2 of #GPUProgramming: >read an article about shared memory >learnt about registers……..Global memory >almost blacked out from elaboration of L1 & L2 iykyk >”repetition”to digest what I just learnt for about 900k milliseconds
Day 10 of GPU Programming - Rainbow Table - 4th consecutive easy problem on LeetCode #100DaysOfGPU #CUDA #GPUProgramming #ParallelComputing #AI #DeepLearning #100DaysOfCode #MachineLearning #NVIDIA #CodingJourney pic.x.com/FYOUlLsQy3
"Need better CUDA textbooks. 'Programming Massively Parallel Processors' is a good intro. I've created C/CUDA C implementations for first 3 chapters. Check book & my GitHub repo for details. #CUDA #GPUprogramming"
Each common operation is implemented as its own .cu file—modular. intriguing. #CUDA #NVIDIA #GPUProgramming #libcudf
NSIGHT GRAPHICS TUTORIAL: amzn.to/2Qffvpl | Vulkan, OpenGL, Direct 3D profiling and debugging | #graphicsprogramming #gpuprogramming #gpgpu #howtoprogram #howtocode #computerprogramming #howtowriteaprogram #siliconvalley #nsightgraphicstutorial #videowalkthrough
#GPUProgramming - Day 07: 🔧 #CPU Hazards 101 🚧: Ever heard of #Register Renaming & Out-of-Order Execution? They tackle structural hazards, ensuring smooth sailing for instructions. Watch out for Data Hazards (#RAW, #WAR, #WAW) in #MIPS, but fear not! #COA #LearnInPublic
#GPUProgramming - Day 02: 🔄 Exploring CPU architectures! #RISC, like #ARM & #Power, opts for efficiency with many registers. #CISC, exemplified by #Intel 8086, prioritizes simplicity, offering diverse, complex instructions. RISC excels in energy efficiency. #COA #LearnInPublic
#GPUProgramming - Day 03: 🧠 CPUs: Processors adapt with DISA. #CPU's core duo - Control Unit & Datapath. Datapath: Registers, ALU, Buses, Multiplexers – a data symphony! 🔄 Follow the Instruction Execution Cycle: Fetch ➡️Decode➡️Execute➡️Store➡️ Update PC. 🕹️ #LearnInPublic
Day 3 of GPU programming At this rate I'll be writing custom inference kernels for AI by next month. The gap between PyTorch abstractions and bare metal isn't as wide as it seemed. #CUDA #GPUProgramming #MachineLearning
Day 2 of GPU programming Never knew addition needs so much code 😂 Starting to get the hang of program_id. Used Gemini 3.0 to generate pseudocode since I'm new to GPU programming and didn't want full code. Lets hope this momentum continues
#GPUProgramming - Day 01: 🚀 Exploring RISC architecture: Simplified, optimized instructions in one clock cycle. 🔄 Bye, CISC complexity! 🏎️ Registers rule, boosting speed. 🤖💡 Compiler-friendly design, slick pipelining for simultaneous processing! 🕵️♂️ #COA #RISC #LearnInPublic
Something went wrong.
Something went wrong.
United States Trends
- 1. Brian Cole 19.9K posts
- 2. #Kodezi 1,049 posts
- 3. #TrumpAffordabilityCrisis 1,572 posts
- 4. #NationalCookieDay 1,330 posts
- 5. Woodbridge 4,229 posts
- 6. Tong 18.3K posts
- 7. Walter Payton 11.2K posts
- 8. Price 268K posts
- 9. The FBI 121K posts
- 10. TPUSA 74.6K posts
- 11. Merry Christmas 68.9K posts
- 12. Rwanda 24.8K posts
- 13. #thursdayvibes 3,539 posts
- 14. All-Big 1,807 posts
- 15. #RedSeaIFFxFaye 104K posts
- 16. GLOWING FAYE AT RED SEA FILM25 110K posts
- 17. Real Steel N/A
- 18. Happy Birthday Dan 3,099 posts
- 19. Eurovision 44.5K posts
- 20. soobin 73.8K posts