-
Notifications
You must be signed in to change notification settings - Fork 55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CK GEMM Backend #1480
base: rocm6.3_internal_testing
Are you sure you want to change the base?
CK GEMM Backend #1480
Commits on Jun 17, 2024
-
[SOW MS3] Centos stream9 PyTorch image support (ROCm#1090)
* changes to build Centos stream 9 images * Added scripts for centos and centos stream images * Added an extra line * Add ninja installation * Optimized code * Fixes * Add comment * Optimized code * Added AMDGPU mapping for ROCm 5.2 and invalid-url for rocm_baseurl Co-authored-by: Jithun Nair <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for e023400 - Browse repository at this point
Copy the full SHA e023400View commit details
Commits on Jun 19, 2024
-
Configuration menu - View commit details
-
Copy full SHA for b8a2811 - Browse repository at this point
Copy the full SHA b8a2811View commit details -
Temporarily skip test_conv3d_64bit_indexing
- Rocblas API support is requested - SWDEV-383635 & sub task - SWDEV-390218
Configuration menu - View commit details
-
Copy full SHA for 59e9341 - Browse repository at this point
Copy the full SHA 59e9341View commit details -
Enable tensorpipe with hip_basic backend (ROCm#1135)
* Add hip_basic tensorpipe support to PyTorch * Enabling hip_basic for Tensorpipe for pyTorch * removing upstream tensorpipe module * Adding ROCm specific tensopipe submodule * tensorpipe submodule updated * Update the hip invalid device string * Added ignore for tensorpipe git submodule * Moved include of tensorpipe_cuda.h to hipify * Updates based on review comments * Defining the variable __HIP_PLATFORM_AMD__ * Enabling the UTs Co-authored-by: Ronak Malik <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 2bb132f - Browse repository at this point
Copy the full SHA 2bb132fView commit details -
- Fortran package installation moved after gcc - Update libtinfo search code in cmake1 - Install libstdc++.so
Configuration menu - View commit details
-
Copy full SHA for 0b08278 - Browse repository at this point
Copy the full SHA 0b08278View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6e7704d - Browse repository at this point
Copy the full SHA 6e7704dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 108bf57 - Browse repository at this point
Copy the full SHA 108bf57View commit details -
Configuration menu - View commit details
-
Copy full SHA for 15da21a - Browse repository at this point
Copy the full SHA 15da21aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4003496 - Browse repository at this point
Copy the full SHA 4003496View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2cfad86 - Browse repository at this point
Copy the full SHA 2cfad86View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3c19bf9 - Browse repository at this point
Copy the full SHA 3c19bf9View commit details -
Skip ddp apply_optim_in_bwd tests for gloo (ROCm#1302)
To resolve https://ontrack-internal.amd.com/browse/SWDEV-403530 and https://ontrack-internal.amd.com/browse/SWDEV-419837. For more context check upstream issue pytorch#111834
Configuration menu - View commit details
-
Copy full SHA for b7e47fa - Browse repository at this point
Copy the full SHA b7e47faView commit details -
Reversed the condition as required
Configuration menu - View commit details
-
Copy full SHA for 032320c - Browse repository at this point
Copy the full SHA 032320cView commit details -
[CS9] Updates to CentOS stream 9 build (ROCm#1326)
- Add missing common_utils.sh - Update the install vision part - Move to amdgpu rhel 9.3 builds - Update to pick python from conda path - Add a missing package - Add ROCM_PATH and magma - Updated repo radeon path
Configuration menu - View commit details
-
Copy full SHA for 50d56db - Browse repository at this point
Copy the full SHA 50d56dbView commit details -
Configuration menu - View commit details
-
Copy full SHA for 17ba54f - Browse repository at this point
Copy the full SHA 17ba54fView commit details -
Configuration menu - View commit details
-
Copy full SHA for e00045a - Browse repository at this point
Copy the full SHA e00045aView commit details -
Enable gesvda for ROCM >= 6.1 (ROCm#1339)
This also fixes a problem in gesvd driver when UV is not needed.
Configuration menu - View commit details
-
Copy full SHA for 7f3172f - Browse repository at this point
Copy the full SHA 7f3172fView commit details -
Increase lifespan of test-times files
- build_environment is hard coded to value from upstream when branch for created, since the dev/QA ENV build_environment value can be varing
Configuration menu - View commit details
-
Copy full SHA for a2d6ace - Browse repository at this point
Copy the full SHA a2d6aceView commit details
Commits on Jun 20, 2024
-
Fixes CI build script (ROCm#1350)
* Fix the parsing of /etc/os-release The old code parses OS_DISTRO as 'PRETTY_Ubuntu' on Ubuntu and thus never links to libtinfo correctly. * Configurable CMAKE_PREFIX_PATH in CI script.
Configuration menu - View commit details
-
Copy full SHA for 00307cc - Browse repository at this point
Copy the full SHA 00307ccView commit details -
[NO CP] Temporary dumping of test exec log to stderr
- This is done as per QA request, needs to be reverted and not required to be cherry-picked into later releases.
Configuration menu - View commit details
-
Copy full SHA for 3120778 - Browse repository at this point
Copy the full SHA 3120778View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9726c26 - Browse repository at this point
Copy the full SHA 9726c26View commit details -
Converted NAVI check as a function (ROCm#1364)
* Moved NAVI check to the test file * Revised NAVI check as a function
Configuration menu - View commit details
-
Copy full SHA for 91125f1 - Browse repository at this point
Copy the full SHA 91125f1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 623579f - Browse repository at this point
Copy the full SHA 623579fView commit details -
Configuration menu - View commit details
-
Copy full SHA for b39d5fa - Browse repository at this point
Copy the full SHA b39d5faView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6d3494e - Browse repository at this point
Copy the full SHA 6d3494eView commit details -
Configuration menu - View commit details
-
Copy full SHA for f02e87f - Browse repository at this point
Copy the full SHA f02e87fView commit details -
Skip test_mm_triton_kernel_benchmark (ROCm#1376)
* Running triton kernel on ROCM only has one GB/s metric reported * Update test_kernel_benchmark.py
Configuration menu - View commit details
-
Copy full SHA for c1f1f51 - Browse repository at this point
Copy the full SHA c1f1f51View commit details -
Implementation of PyTorch ut parsing script - QA helper function (ROC…
…m#1386) * Initial implementation of PyTorch ut parsing script * Extracted path variables * Use nested dict to save results * Fixes typo * Cleanup * Fixes several issues * Minor name change * Update run_pytorch_unit_tests.py * Added file banners * Supported running from API * Added more help info * Consistent naming * Format help text --------- Co-authored-by: Jithun Nair <[email protected]> Co-authored-by: Jithun Nair <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 3720952 - Browse repository at this point
Copy the full SHA 3720952View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6f65d22 - Browse repository at this point
Copy the full SHA 6f65d22View commit details -
Configuration menu - View commit details
-
Copy full SHA for 98df198 - Browse repository at this point
Copy the full SHA 98df198View commit details -
[ROCm] skip warp update to 64 for gfx10 and gfx11 (ROCm#1417)
* Warp update to 64 for NAVI3x is skipped * adding warp_size to device properties * adding warp_size to device properties
Configuration menu - View commit details
-
Copy full SHA for f18c060 - Browse repository at this point
Copy the full SHA f18c060View commit details -
Configuration menu - View commit details
-
Copy full SHA for cb0e9ad - Browse repository at this point
Copy the full SHA cb0e9adView commit details -
Configuration menu - View commit details
-
Copy full SHA for b4abc4b - Browse repository at this point
Copy the full SHA b4abc4bView commit details -
[release/2.1] Skip certificate check for CentOS7 since certificate ex…
…pired (ROCm#1399) * Skip certificate check only for CentOS7 since certificate expired * Naming
Configuration menu - View commit details
-
Copy full SHA for c716c2e - Browse repository at this point
Copy the full SHA c716c2eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 09b800a - Browse repository at this point
Copy the full SHA 09b800aView commit details -
Configuration menu - View commit details
-
Copy full SHA for a0872c0 - Browse repository at this point
Copy the full SHA a0872c0View commit details -
Change Torch extra install requirement
- PYTORCH_EXTRA_INSTALL_REQUIREMENTS is set in builder repo - Remove the PYTORCH_EXTRA_INSTALL_REQUIREMENTS step from this file
Configuration menu - View commit details
-
Copy full SHA for 700ee13 - Browse repository at this point
Copy the full SHA 700ee13View commit details -
Remove the installation of rocm-llvm-dev package
- Causing regression - SWDEV-463083
Configuration menu - View commit details
-
Copy full SHA for 8f95824 - Browse repository at this point
Copy the full SHA 8f95824View commit details -
* Fix SWDEV-459623. The Rank of logsumexp Tensor must be 3. This tensor was considered for internal use only but apparently exposed to UTs. * Fix for mGPU. The stream should be selected after picking the current device according to input tensor.
Configuration menu - View commit details
-
Copy full SHA for 5f9b3f4 - Browse repository at this point
Copy the full SHA 5f9b3f4View commit details -
Enable fp8 inductor unit tests (ROCm#1421)
* Add formal FP8 check in common_cuda.py * Enable inductor/test_valid_cast * Support for test_eager_fallback * allow fnuz types on amax test * Finalize passing tests vs failing * Fix fnuz constants in _to_fp8_saturated
Configuration menu - View commit details
-
Copy full SHA for 90df487 - Browse repository at this point
Copy the full SHA 90df487View commit details -
Enable NHWC batchnorm for miopen (ROCm#1400)
* Enable batchnorm NHWC for MIOpen * cleanup * test to compare NHWC MIOpen batchnorm with CPU * fix 'use_miopen' condition for nhwc miopen * fix includes * use native nhwc batchnorm to verify miopen * remove extra spaces * remove empty lines * set PYTORCH_MIOPEN_SUGGEST_NHWC=1 for all test_nn.py test
Configuration menu - View commit details
-
Copy full SHA for 4380b15 - Browse repository at this point
Copy the full SHA 4380b15View commit details -
Configuration menu - View commit details
-
Copy full SHA for a390471 - Browse repository at this point
Copy the full SHA a390471View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6be1d5d - Browse repository at this point
Copy the full SHA 6be1d5dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 31b3681 - Browse repository at this point
Copy the full SHA 31b3681View commit details -
Configuration menu - View commit details
-
Copy full SHA for cefda3a - Browse repository at this point
Copy the full SHA cefda3aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8068d3d - Browse repository at this point
Copy the full SHA 8068d3dView commit details -
Print consolidated log file for pytorch unit test automation scripts (R…
…OCm#1433) * Print consolidated log file for pytorch uts * Update run_entire_tests subprocess call as well * lint * Add ERROR string
Configuration menu - View commit details
-
Copy full SHA for 5187ca9 - Browse repository at this point
Copy the full SHA 5187ca9View commit details -
[ROCm] Intra-node all reduce initial implementation (ROCm#1435)
* Initial commit to port intra_node_comm to ROCm (cherry picked from commit 48d1c33) * gpt-fast running now with intra-node comm (cherry picked from commit 618c54e) --------- Co-authored-by: Prachi Gupta <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 0c2f97c - Browse repository at this point
Copy the full SHA 0c2f97cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 012c13b - Browse repository at this point
Copy the full SHA 012c13bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6e45ab1 - Browse repository at this point
Copy the full SHA 6e45ab1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3aa060d - Browse repository at this point
Copy the full SHA 3aa060dView commit details -
Sync updates from hipify_torch. (ROCm#1168)
Co-authored-by: Jithun Nair <[email protected]>
Configuration menu - View commit details
-
Copy full SHA for 0c5d257 - Browse repository at this point
Copy the full SHA 0c5d257View commit details -
Configuration menu - View commit details
-
Copy full SHA for ecf4e8d - Browse repository at this point
Copy the full SHA ecf4e8dView commit details
Commits on Jun 26, 2024
-
Merge pull request ROCm#1436 from ROCm/IFU_CP_06172024
IFU for rocm6.3_internal_testing
Configuration menu - View commit details
-
Copy full SHA for 8f19207 - Browse repository at this point
Copy the full SHA 8f19207View commit details
Commits on Jun 27, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 5de711c - Browse repository at this point
Copy the full SHA 5de711cView commit details
Commits on Jul 8, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 4459b67 - Browse repository at this point
Copy the full SHA 4459b67View commit details
Commits on Jul 10, 2024
-
Configuration menu - View commit details
-
Copy full SHA for dd43b9b - Browse repository at this point
Copy the full SHA dd43b9bView commit details
Commits on Jul 18, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 1b6b84e - Browse repository at this point
Copy the full SHA 1b6b84eView commit details