[Quantization] fix fbgemm #42561

MekkCyber · 2025-12-02T15:28:40Z

What does this PR do?

Fixes fbgemm

HuggingFaceDocBuilderDev · 2025-12-02T15:37:19Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

SunMarc

Let's go! Just a nit.

SunMarc · 2025-12-02T18:07:38Z

src/transformers/integrations/fbgemm_fp8.py

+        # Sanity checks
+        if isinstance(module, FbgemmFp8Linear):
+            if tensor_name == "weight" and value.dtype == torch.float8_e4m3fn:
+                raise ValueError("Expect unquantized weights but got a quantized weight")
+            if tensor_name == "weight_scale":
+                raise ValueError("Expect unquantized weights but got a weight_scale")
+        if isinstance(module, FbgemmFp8Llama4TextExperts):
+            if tensor_name == "gate_up_proj_scale" or tensor_name == "down_proj_scale":
+                raise ValueError("Expect unquantized weights but got a quantized weight_scale")


let's remove those checks, this shouldn't be possible here.

Suggested change

-        # Sanity checks
-        if isinstance(module, FbgemmFp8Linear):
-            if tensor_name == "weight" and value.dtype == torch.float8_e4m3fn:
-                raise ValueError("Expect unquantized weights but got a quantized weight")
-            if tensor_name == "weight_scale":
-                raise ValueError("Expect unquantized weights but got a weight_scale")
-        if isinstance(module, FbgemmFp8Llama4TextExperts):
-            if tensor_name == "gate_up_proj_scale" or tensor_name == "down_proj_scale":
-                raise ValueError("Expect unquantized weights but got a quantized weight_scale")

SunMarc · 2025-12-02T18:08:38Z

src/transformers/integrations/fbgemm_fp8.py

-    current_key_name=None,
    quantization_config=None,
    pre_quantized=False,
    config=None,


let's use model.config directly
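
One way to read this suggestion, as a rough sketch only: drop the separate `config` argument and have the helper read the configuration from the model it receives. The function name `replace_linear_sketch` and the `hidden_size` field are hypothetical stand-ins for illustration, not the actual transformers code.

```python
from types import SimpleNamespace


def replace_linear_sketch(model, quantization_config=None, pre_quantized=False):
    """Hypothetical stand-in for the replacement helper (not the real function)."""
    # Read the config from the model itself instead of a separate `config` argument.
    config = model.config
    return config


# Dummy model object carrying a config, for illustration only.
model = SimpleNamespace(config=SimpleNamespace(hidden_size=16))
resolved = replace_linear_sketch(model)
```

With this shape, callers cannot pass a `config` that disagrees with the model being modified.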

SunMarc · 2025-12-02T18:09:27Z

src/transformers/integrations/fbgemm_fp8.py

+                if tp_plan is not None:
+                    tp_key = re.sub(r"\d+", "*", f"{module_name}.down_proj_scale")
+                    tp_plan[tp_key] = None


comment this for now
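
For context, the `re.sub` call in the hunk above replaces every run of digits in the parameter path with `*`, so one tensor-parallel plan entry can cover the same parameter across all layers. A minimal sketch of that behavior follows; the helper `to_tp_key` and the example module name are assumptions for illustration, since the real module names depend on the model.

```python
import re


def to_tp_key(module_name: str, suffix: str = "down_proj_scale") -> str:
    """Collapse every run of digits in the parameter path into a '*' wildcard."""
    return re.sub(r"\d+", "*", f"{module_name}.{suffix}")


# Hypothetical module name, for illustration only.
example_name = "model.layers.12.feed_forward.experts"
tp_key = to_tp_key(example_name)
print(tp_key)  # model.layers.*.feed_forward.experts.down_proj_scale
```

Note that the pattern matches any digit run, not only layer indices, so a name such as `fc1` would become `fc*`.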

github-actions · 2025-12-03T05:11:15Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: fbgemm_fp8

initial commit

3cb1e7b

MekkCyber added 5 commits December 2, 2025 16:19

Merge remote-tracking branch 'upstream/HEAD' into fix-fbgemm

0f60898

passing tests

43e4d1d

fix replace_linear

08d6ff0

style

0a3d11b

rm list

7510720

MekkCyber requested review from ArthurZucker and SunMarc December 2, 2025 17:40

Merge branch 'main' into fix-fbgemm

494c9bd

SunMarc approved these changes Dec 2, 2025

fix

737aaa8

style

ee64fac

MekkCyber merged commit 15b79ea into main Dec 3, 2025
24 checks passed

MekkCyber deleted the fix-fbgemm branch December 3, 2025 09:14
