-
Notifications
You must be signed in to change notification settings - Fork 641
Pull requests: InternLM/lmdeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[AsyncEngine Refactor 3/N] Introduce SessionManager and InstManager
#4253
opened Jan 5, 2026 by
lvhan028
Loading…
[WIP]: Support destroy deepep buffer explicitly
improvement
WIP
#4246
opened Dec 31, 2025 by
RunningLeon
Loading…
feat: implement online bf16-to-fp8 conversion and inference in TurboMind
improvement
#4237
opened Dec 25, 2025 by
43758726
Loading…
fix: Fix Guided Decoding Crashes and State Corruption Issues
#4167
opened Nov 28, 2025 by
windreamer
Loading…
[WIP]: Support fp32 head for qwen and internlm models
#4160
opened Nov 27, 2025 by
RunningLeon
•
Draft
Add step_map to track token decoding order in DLLM
#4057
opened Oct 21, 2025 by
Auraithm
Loading…
4 tasks done
quant blocked fp8
enhancement
New feature or request
#4018
opened Sep 29, 2025 by
CUHKSZzxy
Loading…
4 of 5 tasks
add ppu quick start doc
documentation
Improvements or additions to documentation
#3841
opened Aug 14, 2025 by
guozixu2001
Loading…
fix: qwen3 nonstream parse with no or uncompleted think content
#3748
opened Jul 18, 2025 by
ywx217
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.