Commit 8dea6e9

BSND to TND and FA_UPDATE replacement
Signed-off-by: pichangping <[email protected]>
1 parent 6ee9dc8 commit 8dea6e9

1 file changed: +2 -1 lines changed

vllm_ascend/attention/attention_v1.py

Lines changed: 2 additions & 1 deletion
@@ -909,7 +909,8 @@ def _forward_decode_pcp_dcp(self, query: torch.Tensor,
             if workspace is None:
                 workspace = torch_npu._npu_fused_infer_attention_score_get_max_workspace(
                     query, k_nope, value, **common_kwargs)
-                graph_params.workspaces[num_tokens] = weak_ref_tensors(workspace)
+                graph_params.workspaces[num_tokens] = weak_ref_tensors(
+                    workspace)
             attn_out = torch.empty_like(query)
             attn_lse = torch.empty((num_tokens, num_heads, 1, 1),
                                    dtype=torch.float,
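
The hunk above appears to only re-wrap one assignment across two lines. A minimal sketch for checking that such a change is formatting only, using Python's standard `ast` module: the two snippets are copied from the removed and added lines, dedented so they parse standalone, and `same_ast` is a hypothetical helper name rather than anything from the repository.

```python
import ast

# Removed and added lines from the hunk, dedented so each parses on its own.
OLD = "graph_params.workspaces[num_tokens] = weak_ref_tensors(workspace)\n"
NEW = (
    "graph_params.workspaces[num_tokens] = weak_ref_tensors(\n"
    "    workspace)\n"
)


def same_ast(a: str, b: str) -> bool:
    """Return True when two snippets parse to the same syntax tree.

    ast.dump omits line and column attributes by default, so differences
    that are purely about line wrapping do not affect the comparison.
    """
    return ast.dump(ast.parse(a)) == ast.dump(ast.parse(b))


if __name__ == "__main__":
    print(same_ast(OLD, NEW))
```

If the check reports that the trees match, the wrapped form calls `weak_ref_tensors` with the same argument and stores the result under the same `num_tokens` key as before.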
