extend FA2 and other cases to XPU, #42536

yao-matrix · 2025-12-01T22:00:12Z

we expect all model cases except CUDAGraph specific, CUDA compute capability specific and FA3 specific can run XPU. For FA3, we are developing.

@ydshieh, pls help review, thx very much.

…UDAGraph specific, CUDA compute capability specific and FA3 specific can run XPU. For FA3, we are develioping Signed-off-by: Yao, Matrix <[email protected]>

Signed-off-by: Yao, Matrix <[email protected]>

ydshieh · 2025-12-02T09:23:09Z

src/transformers/models/mimi/modeling_mimi.py


 MIMI_ATTENTION_CLASSES = {
    "eager": MimiAttention,
+    "kernels-community/flash-attn2": MimiFlashAttention2,


could you explain this part?

@ydshieh , sure. in latest design, when users set attn_implementation == "flash_attention_2", there will be 2 branches:

if flash_attn package is available, it will go directly to use it

else, do not fail as before, but use kernels instead, in this case, the attn_implementation will be updated to "kernels-community/flash-attn2", as in code here

For XPU, we go with the kernels path in transformers for FA support, so we need this key.

Thx very much.

I'd rather not do this, even tho you are correct here. We should rather refactor mimi here with the attention interface and not have these manual registrations. We could infinitely extend these edge cases in the future to FA3 etc which makes this not scalable (without using/refactoring to the interface).

@yao-matrix

Let's revert this line 🙏 . We can skip the relevant FA tests if necessary.

ydshieh · 2025-12-02T09:25:40Z

tests/generation/test_continuous_batching.py

+            ("xpu", None): {
+                "req_1":  " 3.5 bolts.\n\nLet's break it down step by step:\n\n- Blue fiber: 2 bolts\n- White fiber: half of 2 bolts = 1 bolt\n\nTotal = ",
+            },
+        }).get_expectation()  # fmt: skip


i need to check why this was {} before, but thank you.

ydshieh

Thank you! LGTM, but has one question

ydshieh · 2025-12-02T16:52:28Z

@yao-matrix don't forget

#42536 (comment)

🙏

yao-matrix · 2025-12-02T17:18:19Z

@yao-matrix don't forget

#42536 (comment)

🙏

Yes, done, thx very much for your always support, :).

github-actions · 2025-12-02T18:23:47Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: gemma2, gemma3, glm4v, glm4v_moe, granitemoehybrid, idefics2, kosmos2_5, longcat_flash, mimi, modernbert, musicgen, musicgen_melody, pixtral, qwen2_5_omni, qwen2_5_vl, qwen2_moe

yao-matrix added 3 commits December 1, 2025 21:56

extend FA2 and other cases to XPU, we expect all model cases except C…

8d14174

…UDAGraph specific, CUDA compute capability specific and FA3 specific can run XPU. For FA3, we are develioping Signed-off-by: Yao, Matrix <[email protected]>

Merge branch 'main' into fa2-xpu

b7a366f

fix style

d54f907

Signed-off-by: Yao, Matrix <[email protected]>

YangKai0616 mentioned this pull request Dec 2, 2025

Fixed paged|FA2 kernel loading logic and UT. #42547

Open

ydshieh reviewed Dec 2, 2025

View reviewed changes

ydshieh approved these changes Dec 2, 2025

View reviewed changes

Merge branch 'main' into fa2-xpu

40d694a

yao-matrix added 3 commits December 2, 2025 09:18

Merge branch 'main' into fa2-xpu

ac8c6b8

Merge branch 'main' into fa2-xpu

ec05093

Merge branch 'main' into fa2-xpu

e5692c7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

extend FA2 and other cases to XPU, #42536

extend FA2 and other cases to XPU, #42536

yao-matrix commented Dec 1, 2025

Uh oh!

ydshieh Dec 2, 2025

Uh oh!

yao-matrix Dec 2, 2025

Uh oh!

vasqu Dec 3, 2025

Uh oh!

ydshieh Dec 3, 2025

Uh oh!

ydshieh Dec 2, 2025

Uh oh!

ydshieh left a comment •

edited

Loading

Uh oh!

ydshieh commented Dec 2, 2025 •

edited

Loading

Uh oh!

yao-matrix commented Dec 2, 2025

Uh oh!

github-actions bot commented Dec 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

extend FA2 and other cases to XPU, #42536

Are you sure you want to change the base?

extend FA2 and other cases to XPU, #42536

Conversation

yao-matrix commented Dec 1, 2025

Uh oh!

ydshieh Dec 2, 2025

Choose a reason for hiding this comment

Uh oh!

yao-matrix Dec 2, 2025

Choose a reason for hiding this comment

Uh oh!

vasqu Dec 3, 2025

Choose a reason for hiding this comment

Uh oh!

ydshieh Dec 3, 2025

Choose a reason for hiding this comment

Uh oh!

ydshieh Dec 2, 2025

Choose a reason for hiding this comment

Uh oh!

ydshieh left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ydshieh commented Dec 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yao-matrix commented Dec 2, 2025

Uh oh!

github-actions bot commented Dec 2, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ydshieh left a comment •

edited

Loading

ydshieh commented Dec 2, 2025 •

edited

Loading