Skip to content

Investigate potential defer_temp_storage_destruction optimization #979

Open
@gevtushenko

Description

@gevtushenko

Memory deallocation in defer_temp_storage_destruction might be blocking. We should investigate if launching all the destruction kernels in sequence (or fusing destructors in a single kernel) before releasing the memory is any better to address:

// TODO Bury all the memory associated with the stream provider and then 
//      deallocate the memory

Metadata

Metadata

Assignees

No one assigned

    Labels

    P2cudarequires CUDA expertiseenhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions