Performance GPU Utilization CPU Utilization Disk IO Network IO CPU => GPU Memory Bandwidth Distributed Training https://medium.com/huggingface/training-larger-batches-practical-tips-on-1-gpu-multi-gpu-distributed-setups-ec88c3e51255 Inference