ROCm for Windows Training Course

ROCm is an open-source platform designed for GPU programming, supporting AMD GPUs while also offering compatibility with CUDA and OpenCL. This platform exposes hardware details to developers, granting them complete control over the parallelization process. However, this level of control necessitates a solid understanding of device architecture, memory models, execution models, and optimization strategies.

ROCm for Windows is a recent advancement that enables users to install and operate ROCm on the Windows operating system, a system widely adopted for both personal and professional use. This version empowers users to harness the power of AMD GPUs for a variety of applications, including artificial intelligence, gaming, graphics, and scientific computing.

This instructor-led live training (available online or onsite) targets beginner to intermediate-level developers who want to install and utilize ROCm on Windows to program AMD GPUs and leverage their parallel processing capabilities.

Upon completing this training, participants will be able to:

Establish a development environment comprising the ROCm Platform, an AMD GPU, and Visual Studio Code on Windows.
Develop a fundamental ROCm program that executes vector addition on the GPU and retrieves results from GPU memory.
Utilize the ROCm API to query device information, allocate and deallocate device memory, transfer data between the host and device, launch kernels, and synchronize threads.
Employ the HIP language to write kernels that execute on the GPU and manipulate data.
Leverage HIP built-in functions, variables, and libraries to carry out common tasks and operations.
Optimize data transfers and memory access by using ROCm and HIP memory spaces, such as global, shared, constant, and local.
Control threads, blocks, and grids that define parallelism using ROCm and HIP execution models.
Debug and test ROCm and HIP programs using tools like ROCm Debugger and ROCm Profiler.
Optimize ROCm and HIP programs through techniques such as coalescing, caching, prefetching, and profiling.

Format of the Course

Interactive lectures and discussions.
Extensive exercises and practice sessions.
Hands-on implementation in a live-lab environment.

Course Customization Options

To request customized training for this course, please contact us to arrange it.

21 hours

Mexico City - Samara Shops Tower A

110,107 MXN (Online)

140,107 MXN (Classroom)

ROCm for Windows Training Course

Course Outline

Requirements

Upcoming Courses

ROCm for Windows

ROCm for Windows

ROCm for Windows

ROCm for Windows

ROCm for Windows

Related Categories

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

ROCm for Windows Training Course

Course Outline

Requirements

Upcoming Courses

ROCm for Windows

ROCm for Windows

ROCm for Windows

ROCm for Windows

ROCm for Windows

Related Courses

Developing AI Applications with Huawei Ascend and CANN

Deploying AI Models with CANN and Ascend AI Processors

AI Inference and Deployment with CloudMatrix

GPU Programming on Biren AI Accelerators

Cambricon MLU Development with BANGPy and Neuware

Introduction to CANN for AI Framework Developers

CANN for Edge AI Deployment

Understanding Huawei’s AI Compute Stack: From CANN to MindSpore

Optimizing Neural Network Performance with CANN SDK

CANN SDK for Computer Vision and NLP Pipelines

Building Custom AI Operators with CANN TIK and TVM

Migrating CUDA Applications to Chinese GPU Architectures

Performance Optimization on Ascend, Biren, and Cambricon

Related Categories

GPU

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites