Check for WMMA instead of MFMA when assigning datatypes for attention #2125
base: develop
Conversation
@dorde-antic CI has failed on Navi3x. It is still picking up f32 attention configs.
That's odd, since it didn't on Navi4x (and the logic we use is the same).
@dorde-antic This is blocking the weekly CI for the upstream merge. Please give priority to this PR.
@umangyadav I would rather focus on merging #2123 as a temporary solution than on merging this PR. #2123 solves both the f32 attention issue and the problem of MITuna trying every possible combination.
Motivation
Resolves https://github.com/ROCm/rocMLIR-internal/issues/2142
Technical Details
Checks for WMMA in perfRunner when assigning datatypes for attention.
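A minimal sketch of the idea (not the actual perfRunner code; the helper names `supports_mfma`, `supports_wmma`, and `attention_datatypes`, and the exact chip lists, are illustrative assumptions): gate the attention datatype list on the chip's matrix-core capability instead of assuming MFMA, so f32 attention configs are never generated for WMMA-only targets.

```python
# Hypothetical sketch of capability-gated datatype selection for
# attention configs. gfx9xx CDNA chips expose MFMA; gfx11xx RDNA3
# (Navi3x) chips expose WMMA. Chip lists here are illustrative.
MFMA_CHIPS = ("gfx908", "gfx90a", "gfx940", "gfx941", "gfx942")
WMMA_CHIPS = ("gfx1100", "gfx1101", "gfx1102")


def supports_mfma(arch: str) -> bool:
    # str.startswith accepts a tuple of prefixes.
    return arch.startswith(MFMA_CHIPS)


def supports_wmma(arch: str) -> bool:
    return arch.startswith(WMMA_CHIPS)


def attention_datatypes(arch: str) -> list[str]:
    # On MFMA hardware, f32 attention configs are worth tuning; on
    # WMMA-only hardware, restrict to f16 so the tuner does not waste
    # time on f32 configs it should never pick.
    if supports_mfma(arch):
        return ["f32", "f16"]
    if supports_wmma(arch):
        return ["f16"]
    return []
```

Under these assumptions, `attention_datatypes("gfx1100")` returns only `["f16"]`, so f32 attention configs never enter the tuning space on Navi3x.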
Test Plan
Weekly CI - Tuning phase
Test Result
CI RUN
Successfully filtered out f32 attention configs on the WMMA architecture (see the attention tuning results on gfx1100).
The run failed due to other, unrelated tuning issues.
Submission Checklist