jd-opensource / xllm Public
Fork 164
Star 1.2k

Pull requests: jd-opensource/xllm

Labels 15 Milestones 0

52 Open 949 Closed

Pull requests list

perf: optimize qwen3.5 hybrid linear cache flow[4/N].

#1160 opened Apr 1, 2026 by JC-ut0

refactor: introduce MultimodalProcessor abstraction for VLM.

#1159 opened Apr 1, 2026 by wly-115 • Draft

feat: add constrained decoding for onerec.

#1158 opened Apr 1, 2026 by DragonFive

refactor: prefer lambda to bind.

#1157 opened Apr 1, 2026 by Dragonliu2018

feat: support video inference for Qwen3-VL on NPU device.

#1151 opened Mar 31, 2026 by xanecdotex

bugfix: make CP compatible with MTP

#1150 opened Mar 31, 2026 by shifengmin

feat: support FIA for qwen model on npu device.

#1147 opened Mar 31, 2026 by sanlio36

bugfix: rollback shared prefix blocks on allocate failure.

#1146 opened Mar 31, 2026 by RobbieLeung

feat: implement column parallel for lm head to improve performance.

#1145 opened Mar 31, 2026 by wxh571001500

feat: support flux model on mlu device.

#1138 opened Mar 30, 2026 by phantomlei3

feat: support embedding interface for all generate VLM models.

#1136 opened Mar 30, 2026 by xanecdotex

feat: support oxygenvlm model on mlu device.

#1131 opened Mar 30, 2026 by phantomlei3

perf: optimize qwen3.5 hybrid linear cache flow[4/N].

#1130 opened Mar 30, 2026 by yingxudeng

feat: support configurable max/min pixels for vlm image processors.

#1123 opened Mar 27, 2026 by wly-115

feat: support joyai-llm-flash model on npu device.

#1121 opened Mar 27, 2026 by longhui-z

feat: support parsing dtype field from config.

#1118 opened Mar 26, 2026 by DongheJin

refactor: simplify xllm_ops pre-build checks and marker-based rebuild logic.

#1111 opened Mar 25, 2026 by LMX-xin

feat: support pured lm head by candidate token ids.

#1071 opened Mar 17, 2026 by RobbieLeung

feat: support qwen3-omni talker and code2wav.

#1070 opened Mar 17, 2026 by ethan686

[WIP] feat: add functional MiniMax-M2.5 baseline

#1064 opened Mar 16, 2026 by QwertyJack • Draft

feat: add onerec in supported model docs and align rec utility style.

#1055 opened Mar 13, 2026 by DragonFive

feat: add onerec model implement[4/N].

#1051 opened Mar 13, 2026 by DragonFive

3 of 6 tasks

feat: add onerec model implement[3/N].

#1050 opened Mar 13, 2026 by DragonFive

5 of 9 tasks

bugfix: avoid decode instance leak if prefill instance prefill fail.

#1046 opened Mar 12, 2026 by magicheng0816

bugfix: fix decode instance infinite retry allocate.

#1045 opened Mar 12, 2026 by magicheng0816
