Weekly exercises for the course in Parallel programming for HPC @ UniTS.
- Distributed parallelism (MPI)
- BLAS
- (NVIDIA) GPU programming
- CUDA
- Theory & best practices
- cuBLAS
- OpenACC
- FFTW
The following time measurements were taken on standard nodes on Marconi-100.
2500x2500
5000x5000