skill-tree:pe:4:3:b
**This is an old revision of the document!**
Table of Contents
PE4.3-B Tuning via Reprogramming
Background
The potential for tuning via reprogramming exists on the hardware as well as on the software level. At the software level, performance improvements are achievable by using more efficient algorithms. This is explained by the help of popular practice-relevant examples.
Aim
Outcomes
- Run parallel programs in an HPC environment.
- Reprogram appropriate parallel code for improved performance on the processing element level e.g. by using functional units (for executing fused multiply-add instructions and variants thereof), by using vectorization techniques with SIMD instructions, etc.
- Assess how appropriate computationally intensive functions (which have been identified earlier by profiling the parallel program) can be ported to many core archictures like GPUs to achieve further speedups.
Subskills
skill-tree/pe/4/3/b.1595151007.txt.gz · Last modified: 2020/07/19 11:30 by 127.0.0.1