skill-tree:k:4:b
Differences
This shows you the differences between two versions of the page.
Next revision | Previous revision | ||
skill-tree:k:4:b [2020/06/05 17:06] – external edit 127.0.0.1 | skill-tree:k:4:b [2025/04/16 18:30] (current) – external edit 127.0.0.1 | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | # K4-B Job Scheduling | + | # K4 Job Scheduling |
- | # Background | + | |
- | Parallel computers are operated differently than a normal PC, all users must share the system. Therefore, various operative procedures are in place. Users must understand these concepts and procedures to be able to use the available resources of a system to run a parallel application. | + | Parallel computers are operated differently than a normal PC, all users must share the system. |
+ | Therefore, various operative procedures are in place. Users must understand these concepts and procedures to be able to use the available resources of a system to run a parallel application. | ||
A workload manager/job scheduler controls how available hardware resources are distributed among the user requests (jobs). | A workload manager/job scheduler controls how available hardware resources are distributed among the user requests (jobs). | ||
- | # Aim | + | Users of computing centers typically compete for the expensive HPC resources of cluster systems. |
- | To enable practitioners to comprehend | + | HPC resources can be distinguished as |
+ | * Shared resources (e.g. a parallel file system that is often shared across all cluster nodes and therefore shared between all users), | ||
+ | * Not-shared resources (e.g. cluster nodes dedicated to a particular parallel program | ||
+ | |||
+ | The configuration of the cluster | ||
+ | |||
+ | A major aspect of job scheduling is to manage these resources in a way that users are treated fairly. | ||
+ | Accounting for users or user groups can additionally support this. | ||
+ | |||
+ | ## Learning Outcomes | ||
+ | |||
+ | * Comprehend the princimples of Job scheduling and why program managing the jobs are required. | ||
+ | * Demonstrate the SLURM workload manager. | ||
+ | * Discuss different Scheduling strategies. | ||
+ | |||
+ | ## Subskills | ||
- | # Outcomes | + | * [[skill-tree: |
- | * explain the concepts and procedures for resource allocation and job execution in an HPC environment | + | * [[skill-tree: |
- | * run interactive jobs and batch jobs | + | * [[skill-tree: |
- | * comprehend and describe the expected behavior of job scripts | + | |
- | * change provided job scripts and embed them into shell scripts to run a variety of parallel applications | + | |
- | | + | |
- | * comprehend accounting principles (billing for the jobs) | + | |
- | * comprehend scheduling strategies that increase productivity | + | |
- | # Subskills | ||
- | * [[skill-tree: | ||
- | * [[skill-tree: |
skill-tree/k/4/b.1591369587.txt.gz · Last modified: 2020/06/05 17:06 by 127.0.0.1