Skip to content

AMD ROCm™ Software

AMD ROCm software is AMD's Open Source stack for GPU computation.

To learn more about ROCm, check out our Documentation, Examples, and Developer Hub.

If you have questions or need help, reach out to us on GitHub.

Popular repositories Loading

  1. ROCm ROCm Public

    AMD ROCm™ Software - GitHub Home

    Shell 4.9k 400

  2. HIP HIP Public

    HIP: C++ Heterogeneous-Compute Interface for Portability

    C++ 3.9k 548

  3. MIOpen MIOpen Public

    AMD's Machine Intelligence Library

    Assembly 1.1k 237

  4. tensorflow-upstream tensorflow-upstream Public

    Forked from tensorflow/tensorflow

    TensorFlow ROCm port

    C++ 690 97

  5. HIPIFY HIPIFY Public

    HIPIFY: Convert CUDA to Portable C++ Code

    C++ 542 79

  6. ROCm-docker ROCm-docker Public

    Dockerfiles for the various software layers defined in the ROCm software platform

    Shell 442 69

Repositories

Showing 10 of 300 repositories
  • pytorch Public Forked from pytorch/pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    ROCm/pytorch’s past year of commit activity
    Python 220 23,814 77 39 Updated Jan 30, 2025
  • composable_kernel Public

    Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

    ROCm/composable_kernel’s past year of commit activity
    C++ 338 141 22 (1 issue needs help) 53 Updated Jan 30, 2025
  • flash-attention Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    ROCm/flash-attention’s past year of commit activity
    Python 152 BSD-3-Clause 1,447 25 5 Updated Jan 30, 2025
  • ROCm/TransformerEngine’s past year of commit activity
    Python 15 8 9 7 Updated Jan 30, 2025
  • rpp Public

    AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/OpenCL/CPU back-ends.

    ROCm/rpp’s past year of commit activity
    C++ 57 MIT 41 0 3 Updated Jan 30, 2025
  • rocPRIM Public

    ROCm Parallel Primitives

    ROCm/rocPRIM’s past year of commit activity
    C++ 169 MIT 72 1 9 Updated Jan 30, 2025
  • AMDMIGraphX Public

    AMD's graph optimization engine.

    ROCm/AMDMIGraphX’s past year of commit activity
    C++ 196 MIT 90 349 (1 issue needs help) 48 Updated Jan 30, 2025
  • hipBLASLt Public

    hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library

    ROCm/hipBLASLt’s past year of commit activity
    Assembly 73 MIT 99 8 73 Updated Jan 30, 2025
  • rocDecode Public

    rocDecode is a high performance video decode SDK for AMD hardware

    ROCm/rocDecode’s past year of commit activity
    C++ 21 18 3 1 Updated Jan 30, 2025
  • rocprofiler-systems Public

    ROCm Systems Profiler

    ROCm/rocprofiler-systems’s past year of commit activity
    C++ 15 MIT 6 0 10 Updated Jan 30, 2025