Eagle3 - update docs, enforce limitations #3939

mzegla · 2026-01-30T12:19:20Z

No description provided.

Copilot

Pull request overview

This PR adds Eagle3 support to the codebase by introducing sequential processing enforcement and configuration options. Eagle3 is a speculative decoding variant that requires stricter limitations than standard speculative decoding, including forced greedy sampling and single-request processing.

Changes:

Added Eagle3-specific configuration fields and mutex-based sequential processing enforcement
Created new EAGLE3 decoding method with enforced greedy sampling (disabling random sampling and beam search)
Updated documentation to clarify Eagle3 limitations and enforcement mechanisms
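
The greedy-sampling enforcement described above could look roughly like the sketch below. This is an illustration only, not the code from this PR: `DecodingMethod`, `GenerationConfig`, its fields, and `enforceEagle3Limitations` are hypothetical stand-ins, and the sketch assumes that disallowed settings are silently overridden rather than rejected.

```cpp
#include <cstddef>

// Hypothetical stand-ins for illustration; not the actual OVMS or GenAI types.
enum class DecodingMethod { STANDARD, SPECULATIVE_DECODING, EAGLE3 };

struct GenerationConfig {
    bool doSample = false;               // true enables random (multinomial) sampling
    std::size_t numBeams = 1;            // values above 1 enable beam search
    std::size_t numReturnSequences = 1;  // number of sequences returned per request
};

// Assumption: Eagle3 requests are forced to greedy decoding by overriding
// any sampling-related settings; other decoding methods are left untouched.
inline void enforceEagle3Limitations(GenerationConfig& config, DecodingMethod method) {
    if (method != DecodingMethod::EAGLE3) {
        return;
    }
    config.doSample = false;        // disable random sampling
    config.numBeams = 1;            // disable beam search
    config.numReturnSequences = 1;  // greedy decoding yields a single sequence
}
```

Overriding keeps existing clients working when they send sampling parameters; returning an error for such requests would be an equally valid design, and the sketch does not claim which one the PR uses.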

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 7 comments.

File	Description
src/llm/servable.hpp	Added mutex and lock fields to enforce sequential processing for Eagle3
src/llm/servable.cpp	Added Eagle3 mode detection and decoding method assignment
src/llm/llm_calculator.proto	Added `draft_eagle3_mode` configuration option and renumbered subsequent fields
src/llm/language_model/continuous_batching/servable_initializer.cpp	Set eagle3Mode property from node options
src/llm/io_processing/base_generation_config_builder.hpp	Added EAGLE3 enum value and documentation
src/llm/io_processing/base_generation_config_builder.cpp	Implemented Eagle3 configuration enforcement (greedy sampling only)
src/llm/http_llm_calculator.cc	Added lock acquisition/release logic for Eagle3 sequential processing
demos/continuous_batching/speculative_decoding/README.md	Updated documentation with enforcement details
demos/common/export_models/export_model.py	Renamed flag and updated template to enforce max_num_seqs=1 for Eagle3

dkalinowski · 2026-01-30T12:40:08Z

src/llm/io_processing/base_generation_config_builder.hpp


 namespace ovms {

+// TODO: Monitor Eagle3 sampling support in GenAI and update this when Eagle3 supports more sampling strategies.


This does not exist. As far as I know, other sampling strategies degrade performance, so no ticket has been created to support them.

Copilot AI review requested due to automatic review settings January 30, 2026 12:19

mzegla added the 2026.0 label Jan 30, 2026

init

c2d0dbc

mzegla force-pushed the eagle3_imrov branch from 28966ed to c2d0dbc January 30, 2026 12:20

Copilot AI reviewed Jan 30, 2026

mzegla requested review from dkalinowski, dtrawins, michalkulakowski and ngrozae January 30, 2026 12:27

dkalinowski reviewed Jan 30, 2026

src/llm/http_llm_calculator.cc (outdated, resolved)

dkalinowski requested changes Jan 30, 2026

remove locking mechanism, add rest_workers recommendation

410e516

mzegla commented Jan 30, 2026

src/llm/servable.cpp (outdated, resolved)

Apply suggestion from @mzegla

3788cd1

3 participants

