Skip to content

Add vLLM EP launch script and SGLang scripts for Intel irdma NICs#823

Merged
MaoZiming merged 1 commit intouccl-project:mainfrom
manojgop:intel_rdma_script_sglang_vllm
Mar 21, 2026
Merged

Add vLLM EP launch script and SGLang scripts for Intel irdma NICs#823
MaoZiming merged 1 commit intouccl-project:mainfrom
manojgop:intel_rdma_script_sglang_vllm

Conversation

@manojgop
Copy link
Contributor

  • Add launch_vllm_intel_nic.sh for multi-node vLLM Expert Parallel with Intel irdma RoCE NICs (Qwen3-30B-A3B-FP8)
  • Add common_env_intel_nic.sh with shared Intel NIC env config
  • Add Qwen3-30B_uep_intel_nic.sh for SGLang EP on Intel NICs
  • Set LOCAL_WORLD_SIZE to fix cross-node CUDA IPC handle error

Description

Please include a summary of the changes and the related issue.

Fixes # (issue)

Type of Change

  • Bug fix
  • New feature
  • Documentation update

How Has This Been Tested?

Include any tests here.

  • Unit tests
  • Integration tests
  • Manual testing

Checklist

  • I have run format.sh to follow the style guidelines.
  • I have run build.sh to verify compilation.
  • I have removed redundant variables and comments.
  • I have updated the documentation.
  • I have added tests.

@MaoZiming MaoZiming assigned MaoZiming and unassigned MaoZiming Mar 18, 2026
@MaoZiming MaoZiming self-requested a review March 18, 2026 00:09
@manojgop manojgop force-pushed the intel_rdma_script_sglang_vllm branch 3 times, most recently from 1eeb894 to 58884dc Compare March 19, 2026 15:06
@MaoZiming
Copy link
Member

@manojgop Thank you! Do you want to share the results or performance in this PR or README.md?

- Add launch_vllm_intel_nic.sh for multi-node vLLM Expert Parallel
  with Intel irdma RoCE NICs (Qwen3-30B-A3B-FP8)
- Add common_env_intel_nic.sh with shared Intel NIC env config
- Add Qwen3-30B_uep_intel_nic.sh for SGLang EP on Intel NICs
- Set LOCAL_WORLD_SIZE to fix cross-node CUDA IPC handle error

Signed-off-by: Manoj Gopalakrishnan <[email protected]>
@manojgop manojgop force-pushed the intel_rdma_script_sglang_vllm branch from 58884dc to db9a0a5 Compare March 21, 2026 09:49
@manojgop
Copy link
Contributor Author

@manojgop Thank you! Do you want to share the results or performance in this PR or README.md?

I'll raise one more PR for P-D disaggregation. Shall I update the the README.md with that PR ? This script is similar to existing script except that environment variables are Intel NIC specific. Don't want to share performance results now since this is tested with a smaller setup with only 1 GPU per node.

@MaoZiming
Copy link
Member

@manojgop Thank you! Do you want to share the results or performance in this PR or README.md?

I'll raise one more PR for P-D disaggregation. Shall I update the the README.md with that PR ? This script is similar to existing script except that environment variables are Intel NIC specific. Don't want to share performance results now since this is tested with a smaller setup with only 1 GPU per node.

I see. sounds good.

@MaoZiming MaoZiming merged commit ed395bf into uccl-project:main Mar 21, 2026
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants