Skip to content

Commit

Permalink
[GPU] Fix synchronization in PagedAttention kernel when KV-cache rota…
Browse files Browse the repository at this point in the history
…tion is enabled but skipped for the current iteration
  • Loading branch information
sshlyapn committed Jan 23, 2025
1 parent 6895e1a commit c6bb2ed
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -346,7 +346,7 @@ struct paged_attention_impl : multi_stage_primitive<paged_attention> {

std::vector<event::ptr> res_events;
std::vector<event::ptr> dep_events = events;
if (has_rotated_blocks) {
if (has_rotated_blocks && !_kernels_data[Stage::KV_CACHE_ROTATE].kernels[0].skip_execution) {
execute_stage(dep_events, instance, res_events, Stage::KV_CACHE_ROTATE, is_mixed_mode);
dep_events = res_events;
}
Expand Down

0 comments on commit c6bb2ed

Please sign in to comment.