Skip to content

CaoZhongZ/gpu_ipc_example

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Intel MLPerf Prototypes on Peak Performance Communication Collectives

This SYCL implementation of All-Reduce shows how to achieve peak performance on Intel PVC system with XeLinks. Single shot kernel that utilizes of all available internel (Cross Tile and XeLinks) bandwidth simultaneously is the key to achieve designed peak performance. Implementation demonstrated in half precision at the moment.

Requirements

  1. Intel SYCL Compiler
  2. Most up-to-date drivers for PVC
  3. MPI

Build the Benchmark

git submodule update --init
make main

Run

mpirun -np 8 ./main -n \<number of elements in half\> [-w sub-group] [-g group] [-a small | simple | bisect]

About

Intel Xe GPU Communication Prototypes

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages