[Bugfix][Speculative Decoding] Fix Eagle3 quantization config inheritance #120

rahul-tuli · 2025-09-29T12:20:44Z

Eagle3 drafters were incorrectly inheriting the verifier's quantization
configuration instead of using their own, causing a KeyError when loading
unquantized drafter weights alongside a quantized verifier.

This implements a clean inheritance pattern where:

- The base LlamaDecoderLayer has a configurable get_quant_config() method
- The Eagle3 LlamaDecoderLayer overrides it to use the drafter's quantization config
- The override uses the existing VllmConfig.get_quantization_config() infrastructure
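
The pattern described above can be sketched roughly as follows. This is a simplified, self-contained illustration and not the actual vLLM code: `QuantConfig`, `ModelConfig`, `SpeculativeConfig`, `VllmConfigSketch`, `BaseDecoderLayer`, and `Eagle3DecoderLayer` are hypothetical stand-ins, and the signature of `get_quantization_config()` here is an assumption made for the example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class QuantConfig:
    """Hypothetical stand-in for a quantization config object."""
    name: str


@dataclass
class ModelConfig:
    """Hypothetical stand-in: only tracks the quantization method name."""
    quantization: Optional[str] = None


@dataclass
class SpeculativeConfig:
    """Hypothetical stand-in holding the drafter's own model config."""
    draft_model_config: ModelConfig


@dataclass
class VllmConfigSketch:
    """Hypothetical stand-in for the top-level config."""
    model_config: ModelConfig
    quant_config: Optional[QuantConfig]  # resolved for the verifier
    speculative_config: Optional[SpeculativeConfig] = None

    @staticmethod
    def get_quantization_config(model_config: ModelConfig) -> Optional[QuantConfig]:
        # Assumed behavior: return None for an unquantized model.
        if model_config.quantization is None:
            return None
        return QuantConfig(model_config.quantization)


class BaseDecoderLayer:
    def __init__(self, vllm_config: VllmConfigSketch) -> None:
        # The layer asks an overridable hook which quant config to use.
        self.quant_config = self.get_quant_config(vllm_config)

    def get_quant_config(self, vllm_config: VllmConfigSketch) -> Optional[QuantConfig]:
        # Default: use the config resolved for the main (verifier) model.
        return vllm_config.quant_config


class Eagle3DecoderLayer(BaseDecoderLayer):
    def get_quant_config(self, vllm_config: VllmConfigSketch) -> Optional[QuantConfig]:
        # Override: resolve the config from the drafter's model config instead.
        draft_config = vllm_config.speculative_config.draft_model_config
        return VllmConfigSketch.get_quantization_config(draft_config)


# Quantized verifier paired with an unquantized drafter.
verifier = ModelConfig(quantization="fp8")
drafter = ModelConfig(quantization=None)
config = VllmConfigSketch(
    model_config=verifier,
    quant_config=VllmConfigSketch.get_quantization_config(verifier),
    speculative_config=SpeculativeConfig(draft_model_config=drafter),
)

base_layer = BaseDecoderLayer(config)
eagle_layer = Eagle3DecoderLayer(config)
```

In this sketch the base layer picks up the verifier's config while the Eagle3 layer resolves to no quantization, which is the situation the fix targets: the drafter's unquantized weights are no longer loaded into layers built for the verifier's quantization scheme.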

…ance

Eagle3 drafters were incorrectly inheriting the verifier's quantization configuration instead of using their own, causing KeyError when loading unquantized drafter weights with quantized verifiers.

This implements a clean inheritance pattern where:

- Base LlamaDecoderLayer has configurable get_quant_config() method
- Eagle3 LlamaDecoderLayer overrides to use drafter's quantization config
- Uses existing VllmConfig._get_quantization_config() infrastructure

Fixes speculative decoding with quantized verifier + unquantized drafter.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <[email protected]>
Signed-off-by: [email protected]
Signed-off-by: Rahul Tuli <[email protected]>

rahul-tuli · 2025-09-29T15:58:34Z

Landed on vllm main!

rahul-tuli closed this Sep 29, 2025
