Skip to content

Stretching GPU performance for GEMMs and tensor contractions.

License

Notifications You must be signed in to change notification settings

ROCmMathLibrariesBot/Tensile

This branch is 985 commits behind ROCm/Tensile:develop.

Folders and files

NameName
Last commit message
Last commit date
Jul 10, 2020
Oct 20, 2020
Nov 5, 2020
Nov 5, 2020
Apr 2, 2020
Jul 10, 2020
Nov 14, 2019
Oct 30, 2020
Oct 30, 2020
May 19, 2020
Jul 17, 2019
Jun 4, 2020
Aug 8, 2019
Nov 11, 2016
May 19, 2020
May 6, 2019
Jun 28, 2017
Oct 7, 2020
Jul 20, 2020
Jun 9, 2020
Aug 19, 2019
Aug 22, 2020

Repository files navigation

A tool for creating a benchmark-driven backend library for GEMMs, GEMM-like problems (such as batched GEMM), N-dimensional tensor contractions, and anything else that multiplies two multi-dimensional objects together on a GPU.

See Tensile Wiki for documentation.

About

Stretching GPU performance for GEMMs and tensor contractions.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 54.4%
  • C++ 38.9%
  • TeX 3.0%
  • CMake 1.9%
  • Shell 1.3%
  • Groovy 0.4%
  • Other 0.1%