Skip to content

Commit

Permalink
Using a more precise profiling on ROCm to properly account for weight…
Browse files Browse the repository at this point in the history
…s padding (#394)
  • Loading branch information
gshtras authored Jan 30, 2025
1 parent 273c949 commit 22141e7
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion vllm/platforms/rocm.py
Original file line number Diff line number Diff line change
Expand Up @@ -230,4 +230,4 @@ def get_current_memory_usage(cls,
device: Optional[torch.types.Device] = None
) -> float:
torch.cuda.reset_peak_memory_stats(device)
return torch.cuda.max_memory_allocated(device)
return torch.cuda.mem_get_info()[1] - torch.cuda.mem_get_info()[0]

0 comments on commit 22141e7

Please sign in to comment.