[doc] feat: Add FLOPs calculator and FP8/FP4 dequantization guidance to adding-model-support skill #2934

Open
cuichenx wants to merge 1 commit into yuya/public-skills from chcui/skill-flops-dequant-guidance

Conversation

@cuichenx
Contributor

Summary

  • Adds Step 4 — Check for quantized weights (FP8 / FP4) to the Discovery phase of the adding-model-support skill. Documents the silent failure mode (the bridge loads raw quantized values → broken model with no error) and two fix approaches: a standalone dequant script and an in-bridge maybe_modify_loaded_hf_weight() hook.
  • Adds an Update FLOPs calculator for new architectural blocks section to Phase 2. Covers when and how to update flop_utils.py for new blocks (GDN, MTP, Mamba, novel MoE), referencing PR #2925 ([perf] feat: add GDN (Gated DeltaNet) FLOPs calculator) as the canonical example.
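To make the dequantization fix concrete, here is a minimal sketch of the in-bridge hook approach, using numpy with simulated quantized values. Real FP8/FP4 checkpoints store torch float8 tensors, and the hook name, signature, and `<name>_scale` key convention below are assumptions for illustration, not the bridge's actual API:

```python
import numpy as np

def maybe_modify_loaded_hf_weight(name, tensors):
    """Illustrative in-bridge hook: if a weight has a companion
    '<name>_scale' entry in the checkpoint, dequantize it to full
    precision; otherwise pass it through unchanged.

    Quantized values are simulated here as small floats; a real
    checkpoint would hold torch float8 (FP8) or packed FP4 data.
    """
    scale_key = f"{name}_scale"
    if scale_key not in tensors:
        return tensors[name]  # not quantized, pass through unchanged
    quantized = tensors[name].astype(np.float32)
    scale = tensors[scale_key].astype(np.float32)
    # Broadcasting handles per-tensor or per-channel scales alike.
    return quantized * scale

# Simulated checkpoint: quantized weight plus a per-tensor scale.
ckpt = {
    "mlp.up_proj.weight": np.array([[2.0, -4.0], [8.0, 16.0]]),
    "mlp.up_proj.weight_scale": np.array(0.5, dtype=np.float32),
}
w = maybe_modify_loaded_hf_weight("mlp.up_proj.weight", ckpt)
# Loading the raw values (no scale applied) would silently produce
# weights off by a factor of 1/scale -- the failure mode above.
```

Skipping the hook raises no error: the model loads, but every quantized matrix is wrong by its scale factor, which is exactly why the skill flags this as a silent failure.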

Test plan

…to adding-model-support skill

- Step 4 (Discovery): Check for quantized weights (FP8/FP4) that silently
  break models without dequantization. Documents standalone script and
  in-bridge hook approaches.
- Phase 2: Update FLOPs calculator when new architectural blocks (GDN, MTP,
  Mamba) differ from standard attention/MLP. References PR #2925 as example.

Signed-off-by: Chen Cui <[email protected]>
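The FLOPs-calculator point above can be sketched as follows. These are hypothetical helper functions in the spirit of flop_utils.py (the names and formulas are illustrative assumptions, not the repo's actual API), showing why a linear-time block such as GDN or Mamba needs its own term rather than reusing the standard attention formula:

```python
def attention_flops(seq_len, hidden, num_layers=1):
    """Standard self-attention forward FLOPs per layer: QKV plus output
    projections (8*s*h^2) and the score/value matmuls (4*s^2*h)."""
    return num_layers * (8 * seq_len * hidden**2 + 4 * seq_len**2 * hidden)

def mlp_flops(seq_len, hidden, ffn_hidden, num_layers=1):
    """Two dense projections (h -> ffn and ffn -> h), 2 FLOPs per MAC."""
    return num_layers * (4 * seq_len * hidden * ffn_hidden)

def linear_attention_flops(seq_len, hidden, num_layers=1):
    """Illustrative linear-time mixer (a GDN/Mamba-style recurrence):
    cost grows with s, not s^2, so the attention formula overcounts it."""
    return num_layers * (8 * seq_len * hidden**2)

s, h = 4096, 1024
print(attention_flops(s, h) > linear_attention_flops(s, h))  # prints True
```

The gap between the two formulas is the 4\*s^2\*h quadratic term, which is why a model mixing GDN or Mamba blocks with standard attention gets a misleading MFU number unless each block type contributes its own term.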
