Skip to content

Commit f42d134

Browse files
author
wangxiaoxin-sherie
committed
xx
1 parent 27b7aab commit f42d134

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

vllm_ascend/worker/model_runner_v1.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2246,6 +2246,9 @@ def _build_attention_metadata(self, create_mixed_batch, num_reqs,
22462246
self.seq_lens_np[:num_reqs] = seq_lens
22472247
self.seq_lens_np[num_reqs:] = 0
22482248

2249+
self.query_start_loc[:num_reqs + 1] = torch.arange(num_reqs + 1)
2250+
self.query_start_loc_cpu[:num_reqs + 1] = torch.arange(num_reqs + 1)
2251+
22492252
num_computed_tokens_cpu = (
22502253
self.input_batch.num_computed_tokens_cpu_tensor[:num_reqs])
22512254

0 commit comments

Comments
 (0)