[None][chore] Add failed cases into waives.txt #10301

xinhe-nv · 2025-12-25T16:49:55Z

waive failed cases.

Summary by CodeRabbit

Tests
- Enhanced test skip conditions to properly validate GPU architecture compatibility.
- Updated test path organization for improved test categorization.

xinhe-nv · 2025-12-30T02:53:18Z

/bot run --skip-test

coderabbitai · 2025-12-30T02:56:43Z

Walkthrough

This PR adds hardware-specific test skip decorators (skip_pre_hopper, skip_post_blackwell) to a multimodal test class and renames a waived test path with a "full:sm100/" prefix. No functional test logic is modified.

Changes

| Cohort / File(s) | Summary |
| --- | --- |
| Test Skip Decorator Updates: `tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py` | Extends imports to include `skip_post_blackwell`, `skip_pre_blackwell`, and `skip_pre_hopper` from conftest. Adds `@skip_pre_hopper` and `@skip_post_blackwell` decorators (two occurrences) before the TestGemma3_27BInstruct class to skip tests on specific GPU architectures. |
| Test Waiver Path Prefix: `tests/integration/test_lists/waives.txt` | Renames test reference path by prepending "full:sm100/" prefix to the existing test path for TestGPTOSS::test_w4_4gpus[dp4-trtllm-auto]. Test identifier and SKIP annotation remain unchanged. |
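
For readers unfamiliar with architecture-gated skips, the following is a minimal sketch of how decorators such as `skip_pre_hopper` and `skip_post_blackwell` could be built on `pytest.mark.skipif`. It is not the repository's conftest code: the SM thresholds, the helper names (`is_pre_hopper`, `is_post_blackwell`, `build_skip_markers`), and the reading of "post Blackwell" as "Blackwell and newer" are all assumptions, the last one inferred from the commit message "skip gemma 27b on ada and blackwell".

```python
# Assumed compute-capability thresholds, encoded as major * 10 + minor.
SM_HOPPER = 90
SM_BLACKWELL = 100


def is_pre_hopper(sm_version):
    """Return True when the GPU is older than Hopper (assumed SM < 90)."""
    return sm_version < SM_HOPPER


def is_post_blackwell(sm_version):
    """Return True when the GPU is Blackwell or newer (assumed SM >= 100)."""
    return sm_version >= SM_BLACKWELL


def build_skip_markers(sm_version):
    """Build two hypothetical skip markers for the given SM version.

    pytest is imported lazily so the predicates above stay usable without it.
    """
    import pytest

    skip_pre_hopper = pytest.mark.skipif(
        is_pre_hopper(sm_version), reason="requires Hopper or newer")
    skip_post_blackwell = pytest.mark.skipif(
        is_post_blackwell(sm_version), reason="not supported on Blackwell or newer")
    return skip_pre_hopper, skip_post_blackwell


# Usage sketch: stacking both markers leaves only SM 90..99 enabled.
#
#     skip_pre_hopper, skip_post_blackwell = build_skip_markers(sm_version=89)
#
#     @skip_pre_hopper
#     @skip_post_blackwell
#     class TestSomeModel:
#         ...
```

Under these assumptions, applying both markers to a class restricts it to Hopper-class GPUs, which matches the stated intent of skipping Ada and Blackwell.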

Estimated code review effort

🎯 1 (Trivial) | ⏱️ ~3 minutes

Possibly related PRs

[TRTLLM-8638][fix] Add failed cases into waives.txt #9588: Modifies waives.txt with test skip entries, related through shared test waiver configuration.
[None][chore] Add failed cases into waives.txt #10177: Modifies waives.txt with additional SKIP entries, related through shared test waiver list updates.

Suggested reviewers

crazydemo
jieli-matrix
StanleySun639
LarryXFly

Pre-merge checks and finishing touches

❌ Failed checks (1 warning, 1 inconclusive)

| Check name | Status | Explanation | Resolution |
| --- | --- | --- | --- |
| Docstring Coverage | ⚠️ Warning | Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%. | You can run `@coderabbitai generate docstrings` to improve docstring coverage. |
| Description check | ❓ Inconclusive | The PR description 'waive failed cases.' is extremely vague and lacks required template sections like detailed explanation of the issue/solution and test coverage information. | Expand the description with specific details about which test cases failed, why they are being waived, and the expected impact of these changes. |

✅ Passed checks (1 passed)

| Check name | Status | Explanation |
| --- | --- | --- |
| Title check | ✅ Passed | The title '[None][chore] Add failed cases into waives.txt' accurately summarizes the main change: adding failed test cases to the waives.txt file. |

📜 Recent review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 4944192 and 44af3d4.

📒 Files selected for processing (2)

tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py
tests/integration/test_lists/waives.txt

🧰 Additional context used

📓 Path-based instructions (2)

**/*.py

📄 CodeRabbit inference engine (CODING_GUIDELINES.md)

**/*.py: Code developed for TensorRT-LLM should conform to Python 3.8+
Indent Python code with 4 spaces. Do not use tabs
Always maintain the namespace when importing in Python, even if only one class or function from a module is used
Python files should use snake_case naming: some_file.py
Python classes should use PascalCase naming: class SomeClass
Python functions and methods should use snake_case naming: def my_awesome_function():
Python local variables should use snake_case naming: my_variable = ...
Python variable names that start with a number should be prefixed with 'k': k_99th_percentile = ...
Python global variables should use upper snake_case with prefix 'G': G_MY_GLOBAL = ...
Python constants should use upper snake_case naming: MY_CONSTANT = ...
Avoid shadowing variables declared in an outer scope in Python
Initialize all externally visible members of a Python class in the constructor
For Python interfaces that may be used outside a file, prefer docstrings over comments
Python comments should be reserved for code within a function, or interfaces that are local to a file
Use Google style docstrings in Python for classes and functions, which can be parsed by Sphinx
Python attributes and variables can be documented inline with type and description
Avoid using reflection in Python when functionality can be easily achieved without reflection
When using try-except blocks in Python, limit the except to the smallest set of errors possible
When using try-except blocks in Python to handle multiple possible variable types (duck-typing), keep the body of the try as small as possible, using the else block for logic

Files:

tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py
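
As an illustration of the last two guidelines above (narrow `except` clauses and a small `try` body with logic in `else`), here is a short sketch. The function `normalize_to_list` is a hypothetical example, not code from the repository.

```python
def normalize_to_list(value):
    """Return the input as a list, wrapping non-iterable values.

    Args:
        value: Any object, iterable or not.

    Returns:
        A list of the elements of ``value`` if it is iterable, otherwise
        a single-element list containing ``value``.
    """
    try:
        # Keep the try body minimal: only the call that may fail.
        iterator = iter(value)
    except TypeError:
        # Catch only TypeError, which iter() raises for non-iterables.
        return [value]
    else:
        # Logic that depends on the successful call lives in else.
        return list(iterator)
```

Note that strings are iterable in Python, so a string argument is split into characters by this sketch.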

**/*.{cpp,h,cu,cuh,py}

📄 CodeRabbit inference engine (CODING_GUIDELINES.md)

All TensorRT-LLM Open Source Software code should contain an NVIDIA copyright header that includes the year of its latest meaningful modification

Files:

tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py

🧠 Learnings (14)

📓 Common learnings

Learnt from: tongyuantongyu
Repo: NVIDIA/TensorRT-LLM PR: 7781
File: tests/integration/test_lists/waives.txt:313-313
Timestamp: 2025-09-17T02:48:52.732Z
Learning: In TensorRT-LLM, `tests/integration/test_lists/waives.txt` is specifically for waiving/skipping tests, while other test list files like those in `test-db/` and `qa/` directories are for different test execution contexts (pre-merge, post-merge, QA tests). The same test appearing in both waives.txt and execution list files is intentional - the test is part of test suites but will be skipped due to the waiver.

Learnt from: EmmaQiaoCh
Repo: NVIDIA/TensorRT-LLM PR: 7370
File: tests/unittest/trt/model_api/test_model_quantization.py:24-27
Timestamp: 2025-08-29T14:07:45.863Z
Learning: In TensorRT-LLM's CI infrastructure, pytest skip markers (pytest.mark.skip) are properly honored even when test files have __main__ blocks that call test functions directly. The testing system correctly skips tests without requiring modifications to the __main__ block execution pattern.

Learnt from: nvpohanh
Repo: NVIDIA/TensorRT-LLM PR: 7478
File: tests/unittest/_torch/modeling/test_modeling_llama_min_latency.py:286-308
Timestamp: 2025-09-03T13:16:38.028Z
Learning: In test files, temporary monkey-patches for upstream bugs can be kept simple when they are explicitly intended to be removed soon, rather than investing effort in making them more robust.

📚 Learning: 2025-09-17T02:48:52.732Z

Learnt from: tongyuantongyu
Repo: NVIDIA/TensorRT-LLM PR: 7781
File: tests/integration/test_lists/waives.txt:313-313
Timestamp: 2025-09-17T02:48:52.732Z
Learning: In TensorRT-LLM, `tests/integration/test_lists/waives.txt` is specifically for waiving/skipping tests, while other test list files like those in `test-db/` and `qa/` directories are for different test execution contexts (pre-merge, post-merge, QA tests). The same test appearing in both waives.txt and execution list files is intentional - the test is part of test suites but will be skipped due to the waiver.

Applied to files:

tests/integration/test_lists/waives.txt

📚 Learning: 2025-09-09T09:40:45.658Z

Learnt from: fredricz-20070104
Repo: NVIDIA/TensorRT-LLM PR: 7645
File: tests/integration/test_lists/qa/llm_function_core.txt:648-648
Timestamp: 2025-09-09T09:40:45.658Z
Learning: In TensorRT-LLM test lists, it's common and intentional for the same test to appear in multiple test list files when they serve different purposes (e.g., llm_function_core.txt for comprehensive core functionality testing and llm_function_core_sanity.txt for quick sanity checks). This duplication allows tests to be run in different testing contexts.

Applied to files:

tests/integration/test_lists/waives.txt
tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py

📚 Learning: 2025-08-26T09:49:04.956Z

Learnt from: pengbowang-nv
Repo: NVIDIA/TensorRT-LLM PR: 7192
File: tests/integration/test_lists/test-db/l0_dgx_b200.yml:56-72
Timestamp: 2025-08-26T09:49:04.956Z
Learning: In TensorRT-LLM test configuration files, the test scheduling system handles wildcard matching with special rules that prevent duplicate test execution even when the same tests appear in multiple yaml files with overlapping GPU wildcards (e.g., "*b200*" and "*gb200*").

Applied to files:

tests/integration/test_lists/waives.txt
tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py
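
The overlap mentioned in this learning can be reproduced with plain glob matching: a GPU name containing "gb200" matches both `*b200*` and `*gb200*`. The sketch below shows the overlap and one generic way to avoid running a test twice. It is not the scheduler's actual rule, which this thread does not describe; `select_tests` and the sample data are hypothetical.

```python
from fnmatch import fnmatchcase


def select_tests(gpu_name, test_lists):
    """Collect tests whose wildcard matches gpu_name, without duplicates.

    Args:
        gpu_name: Name of the GPU stage, for example "gb200-x4".
        test_lists: Mapping of wildcard pattern to a list of test names.

    Returns:
        Test names in first-seen order, each listed once.
    """
    seen = {}
    for pattern, tests in test_lists.items():
        if fnmatchcase(gpu_name, pattern):
            for test in tests:
                # setdefault keeps the first pattern that claimed the test.
                seen.setdefault(test, pattern)
    return list(seen)


# Hypothetical test lists with overlapping wildcards.
EXAMPLE_LISTS = {
    "*b200*": ["test_a", "test_b"],
    "*gb200*": ["test_b", "test_c"],
}
```

With this data, a stage named "gb200-x4" matches both patterns, yet `test_b` is selected only once.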

📚 Learning: 2025-07-22T08:33:49.109Z

Learnt from: yiqingy0
Repo: NVIDIA/TensorRT-LLM PR: 5198
File: jenkins/mergeWaiveList.py:0-0
Timestamp: 2025-07-22T08:33:49.109Z
Learning: In the TensorRT-LLM waive list merging system, removed lines are always located at the end of the merge waive lists, which is why the mergeWaiveList.py script uses reverse traversal - it's an optimization for this specific domain constraint.

Applied to files:

tests/integration/test_lists/waives.txt
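
To make the reasoning behind reverse traversal concrete: if the lines to drop sit near the end of the merged list, scanning from the end finds them early and can stop as soon as all are found. The sketch below illustrates that idea only. It is not the code in `jenkins/mergeWaiveList.py`, and `drop_removed_lines` is a hypothetical name.

```python
def drop_removed_lines(merged_lines, removed_lines):
    """Remove each line in removed_lines from merged_lines, scanning backwards.

    Args:
        merged_lines: The merged waive list, one entry per element.
        removed_lines: Entries to drop, assumed to sit near the end.

    Returns:
        A new list without the removed entries. Only the last occurrence
        of each removed entry is dropped.
    """
    pending = set(removed_lines)
    result = list(merged_lines)
    for index in range(len(result) - 1, -1, -1):
        if not pending:
            # Everything was found; the rest of the list is never scanned.
            break
        if result[index] in pending:
            pending.discard(result[index])
            # Deleting while iterating backwards keeps earlier indices valid.
            del result[index]
    return result
```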

📚 Learning: 2025-08-29T14:07:45.863Z

Learnt from: EmmaQiaoCh
Repo: NVIDIA/TensorRT-LLM PR: 7370
File: tests/unittest/trt/model_api/test_model_quantization.py:24-27
Timestamp: 2025-08-29T14:07:45.863Z
Learning: In TensorRT-LLM's CI infrastructure, pytest skip markers (pytest.mark.skip) are properly honored even when test files have __main__ blocks that call test functions directly. The testing system correctly skips tests without requiring modifications to the __main__ block execution pattern.

Applied to files:

tests/integration/test_lists/waives.txt
tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py

📚 Learning: 2025-07-28T17:06:08.621Z

Learnt from: moraxu
Repo: NVIDIA/TensorRT-LLM PR: 6303
File: tests/integration/test_lists/qa/examples_test_list.txt:494-494
Timestamp: 2025-07-28T17:06:08.621Z
Learning: In TensorRT-LLM testing, it's common to have both CLI flow tests (test_cli_flow.py) and PyTorch API tests (test_llm_api_pytorch.py) for the same model. These serve different purposes: CLI flow tests validate the traditional command-line workflow, while PyTorch API tests validate the newer LLM API backend. Both are legitimate and should coexist.

Applied to files:

tests/integration/test_lists/waives.txt
tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py

📚 Learning: 2025-08-18T08:42:02.640Z

Learnt from: samuellees
Repo: NVIDIA/TensorRT-LLM PR: 6974
File: tensorrt_llm/serve/scripts/benchmark_dataset.py:558-566
Timestamp: 2025-08-18T08:42:02.640Z
Learning: In TensorRT-LLM's RandomDataset (tensorrt_llm/serve/scripts/benchmark_dataset.py), when using --random-token-ids option, sequence length accuracy is prioritized over semantic correctness for benchmarking purposes. The encode/decode operations should use skip_special_tokens=True and add_special_tokens=False to ensure exact target token lengths.

Applied to files:

tests/integration/test_lists/waives.txt

📚 Learning: 2025-08-06T13:58:07.506Z

Learnt from: galagam
Repo: NVIDIA/TensorRT-LLM PR: 6487
File: tests/unittest/_torch/auto_deploy/unit/singlegpu/test_ad_trtllm_bench.py:1-12
Timestamp: 2025-08-06T13:58:07.506Z
Learning: In TensorRT-LLM, test files (files under tests/ directories) do not require NVIDIA copyright headers, unlike production source code files. Test files typically start directly with imports, docstrings, or code.

Applied to files:

tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py

📚 Learning: 2025-08-26T09:37:10.463Z

Learnt from: jiaganc
Repo: NVIDIA/TensorRT-LLM PR: 7031
File: tensorrt_llm/bench/dataclasses/configuration.py:90-104
Timestamp: 2025-08-26T09:37:10.463Z
Learning: In TensorRT-LLM, the `get_pytorch_perf_config()` method returns `self.pytorch_config` which can contain default `cuda_graph_config` values, so `llm_args` may already have this config before the extra options processing.

Applied to files:

tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py

📚 Learning: 2025-08-26T09:37:10.463Z

Learnt from: jiaganc
Repo: NVIDIA/TensorRT-LLM PR: 7031
File: tensorrt_llm/bench/dataclasses/configuration.py:90-104
Timestamp: 2025-08-26T09:37:10.463Z
Learning: In TensorRT-LLM's bench configuration, the `get_pytorch_perf_config()` method returns `self.pytorch_config` which is a Dict[str, Any] that can contain default values including `cuda_graph_config`, making the fallback `llm_args["cuda_graph_config"]` safe to use.

Applied to files:

tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py

📚 Learning: 2025-08-11T20:09:24.389Z

Learnt from: achartier
Repo: NVIDIA/TensorRT-LLM PR: 6763
File: tests/integration/defs/triton_server/conftest.py:16-22
Timestamp: 2025-08-11T20:09:24.389Z
Learning: In the TensorRT-LLM test infrastructure, the team prefers simple, direct solutions (like hard-coding directory traversal counts) over more complex but robust approaches when dealing with stable directory structures. They accept the maintenance cost of updating tests if the layout changes.

Applied to files:

tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py

📚 Learning: 2025-08-01T15:14:45.673Z

Learnt from: yibinl-nvidia
Repo: NVIDIA/TensorRT-LLM PR: 6506
File: examples/models/core/mixtral/requirements.txt:3-3
Timestamp: 2025-08-01T15:14:45.673Z
Learning: In TensorRT-LLM, examples directory can have different dependency versions than the root requirements.txt file. Version conflicts between root and examples dependencies are acceptable because examples are designed to be standalone and self-contained.

Applied to files:

tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py

📚 Learning: 2025-11-27T09:23:18.742Z

Learnt from: fredricz-20070104
Repo: NVIDIA/TensorRT-LLM PR: 9511
File: tests/integration/defs/examples/serve/test_serve.py:136-186
Timestamp: 2025-11-27T09:23:18.742Z
Learning: In TensorRT-LLM testing, when adding test cases based on RCCA commands, the command format should be copied exactly as it appears in the RCCA case, even if it differs from existing tests. For example, some RCCA commands for trtllm-serve may omit the "serve" subcommand while others include it.

Applied to files:

tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: Pre-commit Check

🔇 Additional comments (2)

tests/integration/test_lists/waives.txt (1)

522-522: LGTM!

The waive entry is properly formatted with a bug reference and adds a hardware-specific skip for the TestGPTOSS test on Blackwell (sm100) architecture.

Based on learnings, waives.txt is specifically for waiving/skipping tests in the TensorRT-LLM test infrastructure, and this addition follows the established pattern.
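
For illustration, a waive entry of the shape described above (optional `full:<scope>/` prefix, test identifier, `SKIP`, optional parenthesized reason) could be parsed as follows. The exact grammar of waives.txt is an assumption here, and the file path and bug URL in the example line are placeholders rather than the real entry; `parse_waive_line` is a hypothetical helper.

```python
import re

# Assumed entry shape: [full:<scope>/]<test id> SKIP [(<reason>)]
WAIVE_PATTERN = re.compile(
    r"^(?:full:(?P<scope>[^/]+)/)?(?P<test>\S+)\s+SKIP(?:\s+\((?P<reason>.*)\))?$"
)


def parse_waive_line(line):
    """Parse one waive entry into its parts.

    Args:
        line: A single line of the waive list.

    Returns:
        A dict with "scope", "test", and "reason" keys, or None when the
        line does not look like a waive entry.
    """
    match = WAIVE_PATTERN.match(line.strip())
    if match is None:
        return None
    return match.groupdict()


# Placeholder entry; the path and URL are not taken from the real file.
EXAMPLE_LINE = (
    "full:sm100/path/to/test_file.py::TestGPTOSS::test_w4_4gpus[dp4-trtllm-auto] "
    "SKIP (https://example.invalid/bug/0000)"
)
```

Under this assumed grammar, the "full:sm100/" prefix is captured as a scope, and the test identifier is left unchanged, in line with the rename described in this PR.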

tests/integration/defs/accuracy/test_llm_api_pytorch_multimodal.py (1)

6-6: LGTM!

The import additions are appropriate for the hardware-specific skip decorators being applied to the TestGemma3_27BInstruct class.

tensorrt-cicd · 2025-12-30T02:59:01Z

PR_Github #30115 [ run ] triggered by Bot. Commit: 44af3d4

tensorrt-cicd · 2025-12-30T04:32:43Z

PR_Github #30115 [ run ] completed with state SUCCESS. Commit: 44af3d4
/LLM/main/L0_MergeRequest_PR pipeline #23175 (Partly Tested) completed with status: 'SUCCESS'

xinhe-nv · 2025-12-30T05:30:55Z

/bot reuse-pipeline

tensorrt-cicd · 2025-12-30T05:36:52Z

PR_Github #30135 [ reuse-pipeline ] triggered by Bot. Commit: ec27cf5

tensorrt-cicd · 2025-12-30T05:59:08Z

PR_Github #30135 [ reuse-pipeline ] completed with state SUCCESS. Commit: ec27cf5
Reusing PR_Github #30115 (Partly Tested) for commit ec27cf5

xinhe-nv requested review from LarryXFly and crazydemo December 25, 2025 16:49

xinhe-nv force-pushed the user/qa/post_update_waive_20251226_LLM_FUNCTION_TEST_1786 branch 2 times, most recently from f7c4a65 to 44af3d4 Compare December 30, 2025 02:52

xinhe-nv marked this pull request as ready for review December 30, 2025 02:53

xinhe-nv enabled auto-merge (squash) December 30, 2025 02:53

xinhe-nv force-pushed the user/qa/post_update_waive_20251226_LLM_FUNCTION_TEST_1786 branch from 44af3d4 to a2774ec Compare December 30, 2025 03:09

xinhe-nv added 2 commits December 30, 2025 12:24

update waive list

c609a88

Signed-off-by: xinhe-nv <[email protected]>

skip gemma 27b on ada and blackwell

a8de691

Signed-off-by: Xin He (SW-GPU) <[email protected]>

xinhe-nv force-pushed the user/qa/post_update_waive_20251226_LLM_FUNCTION_TEST_1786 branch from a2774ec to a8de691 Compare December 30, 2025 04:24

xinhe-nv added 2 commits December 30, 2025 13:29

Merge branch 'main' into user/qa/post_update_waive_20251226_LLM_FUNCT…

301b1bd

…ION_TEST_1786 Signed-off-by: xinhe-nv <[email protected]>

Update waives.txt

ec27cf5

Signed-off-by: xinhe-nv <[email protected]>

LarryXFly approved these changes Dec 30, 2025

xinhe-nv merged commit 3e0344a into NVIDIA:main Dec 30, 2025
5 checks passed

xinhe-nv deleted the user/qa/post_update_waive_20251226_LLM_FUNCTION_TEST_1786 branch December 30, 2025 06:05
