PyTorch CUDA Allocator Fragmentation: When and Why OOMs Happen | HACKOBAR_