Performance Evaluation
Metrics
Speedup
Measures how much faster the parallel program runs compared to serial.
Efficiency
Measures how effectively processor resources are utilized.
Amdahl's Law
If a fraction
Key insight: Even with infinite processors, speedup is limited by the sequential fraction.
Example: If 10% is sequential (
Gustafson's Law
If the problem size scales with the number of processors:
where
Key insight: With scaled problem size, near-linear speedup is achievable.
Scalability
Strong Scaling
- Fixed problem size, increasing processors
- Speedup should be near-linear
- Limited by Amdahl's Law
Weak Scaling
- Problem size per processor remains constant
- Total work grows with processor count
- Limited by communication overhead
Isoefficiency
The rate at which problem size must grow to maintain constant efficiency as processors increase.
Performance Models
PRAM Model
- Parallel Random Access Machine
- Idealized model: unlimited processors, unit-time shared memory access
- Variants: EREW, CREW, CRCW (based on concurrent access policies)
BSP (Bulk Synchronous Parallel)
- Computation proceeds in supersteps
- Each superstep: local computation → global communication → barrier synchronization
- Cost model:
(computation + communication + synchronization)
LogP Model
- Parameters:
(latency), (overhead), (gap), (processors) - More realistic than PRAM for distributed memory systems