
Pull requests: InternLM/lmdeploy

Pull requests list

add custom noaux kernel
#4345 opened Feb 10, 2026 by grimoire
Support MiniMax-M2 in TurboMind engine
#4343 opened Feb 10, 2026 by zh-nj
fix qwen3-vl-moe long context (label: Bug:P1)
#4342 opened Feb 9, 2026 by grimoire
Fix authorization
#4338 opened Feb 9, 2026 by lvhan028
[WIP]Support torch compile
#4336 opened Feb 8, 2026 by grimoire Draft
Qwen Dense/Moe model fp8 quant online
#4324 opened Feb 5, 2026 by 43758726
return BadRequest for all invalid inputs (label: Bug:P2)
#4291 opened Jan 26, 2026 by lvhan028
support repetition ngram logits processor
#4288 opened Jan 23, 2026 by grimoire
fix dllm mask on set_step
#4278 opened Jan 18, 2026 by grimoire
[ascend] fix awq and smoothq
#4277 opened Jan 16, 2026 by wanfengcxz Draft
test: add mixing guided and non-guided tests
#4267 opened Jan 12, 2026 by windreamer
Update benchmark serving script for proxy_server
#4173 opened Dec 1, 2025 by lvhan028
Update installation.md
#4095 opened Nov 3, 2025 by krescent
Add step_map to track token decoding order in DLLM
#4057 opened Oct 21, 2025 by Auraithm
4 tasks done
[POC] Encoder Disaggregation
#4047 opened Oct 17, 2025 by CUHKSZzxy Draft
2 of 7 tasks
quant blocked fp8 (label: enhancement)
#4018 opened Sep 29, 2025 by CUHKSZzxy
4 of 5 tasks