Skip to content

Conversation

@GuoliangShiIntel
Copy link
Contributor

@GuoliangShiIntel GuoliangShiIntel commented Oct 21, 2025

Details:

This PR introduces support for Eagle3 (top-1) speculative decoding in the NPUW. The main changes include:

  1. Added a new llm_eagle3_extension module to handle Eagle3-specific input/output logic, including model role detection (Draft/Target), input padding, and chunked processing.
  2. Updated LLMInferRequest to automatically detect Eagle3 models and manage Eagle3 input/output tensors during prefill and generate stages.
  3. Modified model reshaping and output redirection functions to support new Eagle3 layer names and shapes.

These changes enable integration of Eagle3 (top-1) speculative decoding models with the NPU plugin.

Tickets:

image

@github-actions github-actions bot added category: NPU OpenVINO NPU plugin category: NPUW NPUW plugin labels Oct 21, 2025
@sys-openvino-ci sys-openvino-ci added the ExternalIntelPR External contributor from Intel label Oct 21, 2025
@GuoliangShiIntel GuoliangShiIntel force-pushed the eagle3_new_input_output_support branch 3 times, most recently from 01b3462 to 6eeecc6 Compare October 21, 2025 04:28
@GuoliangShiIntel GuoliangShiIntel changed the title [NPUW] New inputs and output support for eagle3 on NPUW [NPUW] New inputs and output support for eagle3 on NPU Oct 22, 2025
@GuoliangShiIntel GuoliangShiIntel changed the title [NPUW] New inputs and output support for eagle3 on NPU [NPUW] New inputs and output support for eagle3 pipeline on NPU Oct 22, 2025
@GuoliangShiIntel GuoliangShiIntel changed the title [NPUW] New inputs and output support for eagle3 pipeline on NPU [NPUW] Add Eagle3 (top-1) pipeline support with new I/O on NPU Oct 23, 2025
@GuoliangShiIntel GuoliangShiIntel marked this pull request as ready for review October 23, 2025 06:51
@GuoliangShiIntel GuoliangShiIntel requested review from a team as code owners October 23, 2025 06:51
@GuoliangShiIntel GuoliangShiIntel changed the title [NPUW] Add Eagle3 (top-1) pipeline support with new I/O on NPU [NPUW] Add Eagle3 (top-1) pipeline support with new I/O Oct 23, 2025
@dmatveev dmatveev added this to the 2026.0 milestone Oct 31, 2025
@GuoliangShiIntel GuoliangShiIntel force-pushed the eagle3_new_input_output_support branch from 0ce76dd to cc16f13 Compare November 17, 2025 09:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: NPU OpenVINO NPU plugin category: NPUW NPUW plugin ExternalIntelPR External contributor from Intel

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants