When does fragmentation occur in the CUDA caching allocator?