Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[Kernel][Hardware][AMD] improved rocm custom paged attention accuracy…
… and code documentation. updated its unittest to match the correct partition size based on paged attention versions as well as platform type. Signed-off-by: vllmellm <[email protected]>
- Loading branch information