Commit 5782f8b

modify perplexity test to use sdpa
1 parent 344cda2 commit 5782f8b

1 file changed, +1 -1 lines changed

sharktank/sharktank/evaluate/perplexity_torch.py

+1 -1
@@ -111,7 +111,7 @@ def load_model(self, dataset, tokenizer, tensor_parallelism_size, attention_kern
             attention_dtype=self.attention_dtype,
             tensor_parallelism_size=tensor_parallelism_size,
         )
-
+        config.attention_kernel="torch"
         if config.tensor_parallelism_size > 1:
             dataset.root_theta = shard_theta(dataset.root_theta, config)
 
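For readers unfamiliar with the term in the commit title, here is a minimal sketch of the computation, assuming "sdpa" refers to scaled dot-product attention, softmax(Q K^T / sqrt(d)) V. It is a pure-Python reference for the math only and is not sharktank's implementation; the helper names `softmax` and `sdpa` are hypothetical, and masking, batching, and multiple heads are left out.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def sdpa(q, k, v):
    """Reference scaled dot-product attention: softmax(q k^T / sqrt(d)) v.

    q: [n_q][d], k: [n_k][d], v: [n_k][d_v], all plain nested lists.
    Hypothetical helper for illustration; no masking, dropout, or batching.
    """
    d = len(q[0])
    scale = 1.0 / math.sqrt(d)
    out = []
    for q_row in q:
        # Scaled similarity of this query against every key.
        scores = [scale * sum(a * b for a, b in zip(q_row, k_row)) for k_row in k]
        weights = softmax(scores)
        # Weighted average of the value rows.
        out.append(
            [sum(w * v_row[j] for w, v_row in zip(weights, v)) for j in range(len(v[0]))]
        )
    return out


q = [[1.0, 0.0], [0.0, 1.0]]
k = [[1.0, 0.0], [0.0, 1.0]]
v = [[1.0, 2.0], [3.0, 4.0]]
result = sdpa(q, k, v)
```

Each output row is a weighted average of the rows of `v`, so the first query, which matches the first key, lands closer to `v[0]`, and the second lands closer to `v[1]`.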
