Skip to content

Commit

Permalink
refactor EAGLE 2 (sgl-project#3269)
Browse files Browse the repository at this point in the history
Co-authored-by: Ying Sheng <[email protected]>
Co-authored-by: merrymercy <[email protected]>
Co-authored-by: Ying1123 <[email protected]>
  • Loading branch information
3 people authored Feb 3, 2025
1 parent 3c8ac78 commit 013021b
Show file tree
Hide file tree
Showing 9 changed files with 1,180 additions and 596 deletions.
1 change: 1 addition & 0 deletions examples/runtime/engine/EAGLE_offline_batch_inference.py
Original file line number Diff line number Diff line change
Expand Up @@ -21,6 +21,7 @@ def main():
speculative_num_steps=3,
speculative_eagle_topk=4,
speculative_num_draft_tokens=16,
cuda_graph_max_bs=8,
)

outputs = llm.generate(prompts, sampling_params)
Expand Down
Loading

0 comments on commit 013021b

Please sign in to comment.