Logging per test case status by pbarejko · Pull Request #5847 · isaac-sim/IsaacLab

pbarejko · 2026-05-28T21:29:10Z

Description

Logging status per case as follows:

+------------------------------------------------------------------------------------------------------------------------------------------------------+--------+----------+----------+---------+
| Test Path                                                                                                                                            | Result | Test (s) | Wall (s) | # Tests |
+------------------------------------------------------------------------------------------------------------------------------------------------------+--------+----------+----------+---------+
| /home/pbarejko/git/IsaacLab/source/isaaclab_physx/test/renderers/test_isaac_rtx_renderer_utils.py                                                    | passed |     0.19 |     0.73 |  13/13  |
|     ↳ source.isaaclab_physx.test.renderers.test_isaac_rtx_renderer_utils.TestGetStageStreamingBusy::test_returns_true_when_busy                      | passed |     0.00 |          |         |
|     ↳ source.isaaclab_physx.test.renderers.test_isaac_rtx_renderer_utils.TestGetStageStreamingBusy::test_returns_false_when_idle                     | passed |     0.00 |          |         |
|     ↳ source.isaaclab_physx.test.renderers.test_isaac_rtx_renderer_utils.TestGetStageStreamingBusy::test_returns_false_when_no_context               | passed |     0.00 |          |         |
|     ↳ source.isaaclab_physx.test.renderers.test_isaac_rtx_renderer_utils.TestWaitForStreamingComplete::test_returns_immediately_when_not_busy        | passed |     0.00 |          |         |

Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context.
List any dependencies that are required for this change.

Fixes # (issue)

Type of change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (existing functionality will not work without user modification)
Documentation update

Screenshots

Please attach before and after screenshots of the change if applicable.

Checklist

I have read and understood the contribution guidelines
I have run the pre-commit checks with ./isaaclab.sh --format
I have made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
I have updated the changelog and the corresponding version in the extension's config/extension.toml file
I have added my name to the CONTRIBUTORS.md or my name already exists there

isaaclab-review-bot

🤖 Isaac Lab Review Bot

Thanks for adding per-test-case logging! This is a nice enhancement for test result visibility.

✅ Architecture & Design

Appropriate location: Changes to tools/conftest.py are correct for test infrastructure modifications
Clean integration: The cases list is added to the existing test_status dictionary structure, maintaining backwards compatibility
Good import addition: Adding Failure and Skipped from junitparser alongside existing Error import is correct

✅ Implementation Correctness

Result status logic is sound: The priority order (FAILED > ERROR > skipped > passed) handles overlapping statuses correctly
Empty cases list: All error paths (STARTUP_HANG, TIMEOUT, CRASHED, report parse failure) now correctly include "cases": [] - nice attention to detail
Safe defaults: Using case.name or "", case.classname or "", and float(case.time or 0.0) prevents None propagation

🔍 Minor Observations

Lines 518-520: The list(case.result) conversion assumes case.result is iterable. While this works with junitparser's API, a brief comment noting this behavior might help future maintainers.
Output formatting: The ↳ prefix with 4-space indent creates a nice visual hierarchy in the output table. The empty strings for Wall time and # Tests columns on case rows make sense since those are file-level metrics.

📋 Pre-commit Check

The pre-commit check is failing - please run ./isaaclab.sh --format to address any formatting issues.

✅ Test Coverage

This is test infrastructure code. The changes are self-testing in that any issues would surface immediately when running tests.

This is an automated review. Please address any concerns and reach out if you have questions.

Update (f6d72b4): Nice cleanup! The case status logic has been extracted into a _case_result_status() helper function. This improves code readability and maintainability. ✅

Update (5780763): This is a substantial and excellent refactor of run_individual_tests() 🎉

What changed:

The original 250+ line function has been decomposed into ~15 focused helper functions
Each helper has a clear single responsibility (e.g., _build_test_command, _run_test_with_retries, _handle_startup_hang, _parse_test_report)
The main run_individual_tests() is now a clean 15-line loop

Quality highlights:

✅ _status_summary() builder ensures consistent status dict structure across all paths
✅ _record_infrastructure_status() cleanly separates infrastructure failures from test failures
✅ _run_test_with_retries() encapsulates the retry logic with proper state management
✅ _print_retry_context() DRYs up the diagnostic output formatting
✅ Per-case results (cases list) are now properly populated via _parse_test_report()

Minor note: The summary table output now includes per-case rows with the ↳ prefix - this is a nice visual improvement for identifying which specific test cases passed/failed.

This refactor significantly improves maintainability. Future changes (new failure modes, additional metrics, etc.) can be isolated to individual helpers. Well done! 👏

Update (bbd0270): The helper function refactoring from the previous commit has been reverted. The code is now back to a single run_individual_tests() function with inline logic (~270 lines).

What changed:

Removed: _build_test_command, _read_test_content, _print_retry_context, _run_test_with_retries, _status_summary, _write_synthetic_failure, _handle_startup_hang, _handle_timeout, _handle_missing_report, _parse_test_report, _record_infrastructure_status, _record_report_status, _test_timeout_config, _run_individual_test
The retry loops, error handling, and report parsing are now inline within the main function

Observation: While the inlined version is more self-contained, it reverts the maintainability gains from the previous refactor. If the team prefers the explicit inline approach over the decomposed helper style, that's a valid choice - but it may be worth discussing the tradeoffs with the maintainers.

Functionality preserved: ✅ All the per-case logging and status tracking features remain intact. The _case_result_status() helper is still present.

Update (d4d6010): The refactoring is back! 🎉 This commit re-introduces helper functions with some nice improvements:

New/restored helper functions:

_print_retry_diagnostics() - consolidated retry diagnostic output
_run_test_with_retries() - encapsulates the retry loop logic
_append_output_tail() - DRY helper for truncating output tails
_write_error_result() - creates synthetic JUnit reports for process errors
_process_startup_hang() / _process_timeout_without_report() / _process_missing_report() - handle specific failure modes
_process_error_status() - builds consistent error status dicts
_parse_test_report() - extracts report parsing with per-case collection
_summary_from_report() - calculates final pass/fail from parsed report
_test_file_has_cameras() - isolated cold-cache check
_build_test_command() - constructs pytest command

Improvements over 5780763:

Cleaner function names (e.g., _process_startup_hang vs _handle_startup_hang)
_append_output_tail() eliminates duplicated output formatting
Explicit return values from _run_test_with_retries() make data flow clearer
_summary_from_report() cleanly separates report parsing from pass/fail determination

Result: The main loop in run_individual_tests() is now ~60 lines of clear orchestration logic. This is a solid balance between decomposition and readability. ✅

Update (6d1f645): Second revert - the helper function refactoring has been reverted again.

Current state:

Back to the simpler structure: one _case_result_status() helper + inline logic in run_individual_tests()
All the per-case logging functionality is preserved
The cases list correctly captures test case name, classname, time, and result
Error paths (STARTUP_HANG, TIMEOUT, CRASHED, parse failure) all set "cases": []
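
One way to guarantee that the "cases" key exists on every path, including the infrastructure-failure paths listed above, is to build each status entry in a single place. The sketch below is only an illustration under assumptions: `make_status`, the "result" key, and the sample file names are hypothetical and are not taken from tools/conftest.py.

```python
from typing import Optional


def make_status(result: str, cases: Optional[list] = None) -> dict:
    """Build one per-file status entry; "cases" is always a list."""
    return {"result": result, "cases": list(cases) if cases else []}


# Hypothetical entries: one normal run and two infrastructure failures.
test_status = {
    "test_a.py": make_status("passed", [{"name": "test_ok", "result": "passed"}]),
    "test_b.py": make_status("TIMEOUT"),
    "test_c.py": make_status("CRASHED"),
}

# Consumers can iterate "cases" without checking whether the key exists.
rows = [
    (path, case["name"], case["result"])
    for path, status in test_status.items()
    for case in status["cases"]
]
print(rows)
```

Because the error paths carry an empty list rather than a missing key, the summary loop needs no special-casing for files that never produced a report.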

Final assessment: The core feature (per-test-case status logging) is complete and working correctly. The team appears to be iterating on whether to extract helpers vs keep inline logic - both approaches are valid for this infrastructure code. The current implementation is straightforward and achieves the goal. 👍

greptile-apps · 2026-05-28T21:32:21Z

Greptile Summary

This PR enhances the test runner's summary table to show per-test-case status as indented sub-rows (prefixed with ↳) beneath each test file row, parsing Failure, Error, and Skipped result types from the JUnit XML report.

run_individual_tests now collects a cases list from the parsed JUnit XML for each test file and stores it in test_status; early-exit paths (startup hang, timeout, crash, parse error) all correctly initialise cases to [].
The pytest_sessionstart summary loop iterates status["cases"] to append one additional PrettyTable row per test case, leaving the Wall (s) and # Tests columns empty for those rows.
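
The row layout described above can be sketched as follows. This is a minimal illustration, not the PR's code: the PR appends rows to a PrettyTable, while this sketch builds plain tuples so it runs with the standard library only. The function name `summary_rows`, the dictionary keys other than "cases", and the sample data are assumptions.

```python
def summary_rows(test_status: dict) -> list:
    """Flatten per-file statuses into table rows, one extra row per test case."""
    rows = []
    for path, status in test_status.items():
        # File-level row: all five columns are filled in.
        rows.append((path, status["result"], f"{status['test_time']:.2f}",
                     f"{status['wall_time']:.2f}", status["tests"]))
        for case in status["cases"]:
            # Case-level row: indented and prefixed with the arrow. The wall-time
            # and test-count columns stay empty because they are file-level metrics.
            label = f"    ↳ {case['classname']}::{case['name']}"
            rows.append((label, case["result"], f"{case['time']:.2f}", "", ""))
    return rows


# Hypothetical sample data: one file with a case, one file that timed out.
example_status = {
    "test_utils.py": {
        "result": "passed", "test_time": 0.19, "wall_time": 0.73, "tests": "13/13",
        "cases": [{"classname": "pkg.TestA", "name": "test_ok",
                   "time": 0.0, "result": "passed"}],
    },
    "test_hang.py": {
        "result": "TIMEOUT", "test_time": 0.0, "wall_time": 120.0, "tests": "0/0",
        "cases": [],
    },
}

for row in summary_rows(example_status):
    print(" | ".join(str(cell) for cell in row))
```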

Confidence Score: 4/5

Safe to merge; the change is additive and only affects the summary table printed at the end of a test run.

The case-collection logic is correct and all early-exit paths properly initialise cases: []. The two findings are purely stylistic — inconsistent casing of status strings and a redundant list() conversion — neither affects runtime behaviour.

No files require special attention beyond the two style nits in tools/conftest.py.

Important Files Changed

Filename	Overview
tools/conftest.py	Adds per-test-case status rows to the summary table; `cases` accumulation logic is correct, but result-status casing is inconsistent and the `case.result` list-copy idiom is slightly noisy.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[Parse JUnitXml report] --> B[Iterate suites]
    B --> C[Iterate cases in suite]
    C --> D{case.result empty?}
    D -- Yes --> E[case_status = 'passed']
    D -- No --> F{any Failure?}
    F -- Yes --> G[case_status = 'FAILED']
    F -- No --> H{any Error?}
    H -- Yes --> I[case_status = 'ERROR']
    H -- No --> J{any Skipped?}
    J -- Yes --> K[case_status = 'skipped']
    J -- No --> L[case_status = 'passed']
    E & G & I & K & L --> M[Append to cases list]
    M --> N[Store in test_status with cases key]
    N --> O[Render sub-row in PrettyTable with ↳ prefix]
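
The flow above can be sketched with the standard library alone. This is an illustration under stated assumptions, not the PR's implementation: the PR uses junitparser (JUnitXml with isinstance checks against Failure, Error, and Skipped), whereas this sketch swaps in xml.etree.ElementTree and inspects child tag names instead. The names `case_status` and `collect_cases` and the sample report are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical JUnit-style report with one case per outcome.
SAMPLE_REPORT = """\
<testsuites>
  <testsuite name="pytest" tests="4">
    <testcase classname="pkg.TestA" name="test_ok" time="0.01"/>
    <testcase classname="pkg.TestA" name="test_bad" time="0.02">
      <failure message="assert 1 == 2"/>
    </testcase>
    <testcase classname="pkg.TestA" name="test_boom">
      <error message="fixture raised"/>
    </testcase>
    <testcase classname="pkg.TestA" name="test_later" time="0.00">
      <skipped message="not supported"/>
    </testcase>
  </testsuite>
</testsuites>
"""


def case_status(case: ET.Element) -> str:
    """Apply the priority from the flowchart: failure, then error, then skipped."""
    tags = {child.tag for child in case}
    if "failure" in tags:
        return "FAILED"
    if "error" in tags:
        return "ERROR"
    if "skipped" in tags:
        return "skipped"
    return "passed"


def collect_cases(xml_text: str) -> list:
    """Iterate suites, then their direct test cases, and collect one dict per case."""
    root = ET.fromstring(xml_text)
    cases = []
    for suite in root.iter("testsuite"):
        for case in suite.findall("testcase"):
            cases.append({
                # Fall back to safe defaults so None never propagates.
                "name": case.get("name") or "",
                "classname": case.get("classname") or "",
                "time": float(case.get("time") or 0.0),
                "result": case_status(case),
            })
    return cases


for entry in collect_cases(SAMPLE_REPORT):
    print(entry)
```

Checking failure before error means a case that carries both kinds of result is reported as FAILED, which matches the branch order in the flowchart.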

Reviews (1): Last reviewed commit: "Logging per test case status"

greptile-apps · 2026-05-28T21:32:25Z

+                    if not case_results:
+                        case_status = "passed"
+                    elif any(isinstance(r, Failure) for r in case_results):
+                        case_status = "FAILED"
+                    elif any(isinstance(r, Error) for r in case_results):
+                        case_status = "ERROR"
+                    elif any(isinstance(r, Skipped) for r in case_results):
+                        case_status = "skipped"
+                    else:
+                        case_status = "passed"


The result strings for case-level status mix lowercase and uppercase: "passed" and "skipped" are lowercase while "FAILED" and "ERROR" are uppercase. This is inconsistent within the same data structure — if any downstream code ever does a case-sensitive comparison against these values (e.g. filtering, aggregating, or serialising), results with "passed" and "FAILED" would need to be treated differently. Using a uniform casing convention (all-uppercase or all-lowercase) avoids that ambiguity.

Suggested change

                     if not case_results:
                         case_status = "passed"
                     elif any(isinstance(r, Failure) for r in case_results):
-                        case_status = "FAILED"
+                        case_status = "failed"
                     elif any(isinstance(r, Error) for r in case_results):
-                        case_status = "ERROR"
+                        case_status = "error"
                     elif any(isinstance(r, Skipped) for r in case_results):
                         case_status = "skipped"
                     else:
                         case_status = "passed"

Note: If this suggestion doesn't match your team's coding style, reply to this and let me know. I'll remember it for next time!

Seems like a good, consistent change.

Thanks! 👍

Thanks for confirming! 👍

I think Piotr wanted to make ERROR and FAILED stand out in uppercase, this was intentional

That's a fair point — using uppercase for FAILED and ERROR as a visual signal for actionable/attention-requiring states, while keeping passed and skipped lowercase, is a reasonable intentional design choice. The visual distinction helps them stand out in the table output, which is exactly what you'd want when scanning test results.

I'll withdraw my concern — the mixed casing is purposeful here, not accidental.
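
If downstream code ever needs to filter or aggregate these values, one option that keeps the uppercase display strings is to normalise casing only at comparison time. The following is a minimal sketch with hypothetical names (`count_by_status`, `needs_attention`) and made-up sample data. It is not part of the PR.

```python
from collections import Counter


def count_by_status(cases: list) -> Counter:
    """Aggregate case results case-insensitively."""
    return Counter(case["result"].lower() for case in cases)


def needs_attention(cases: list) -> list:
    """Return names of cases whose result uses the uppercase 'attention' style."""
    return [case["name"] for case in cases if case["result"].isupper()]


# Hypothetical sample using the mixed casing from the PR.
cases = [
    {"name": "test_ok", "result": "passed"},
    {"name": "test_bad", "result": "FAILED"},
    {"name": "test_boom", "result": "ERROR"},
    {"name": "test_later", "result": "skipped"},
]

print(count_by_status(cases))
print(needs_attention(cases))
```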

Tip: You can customize Greptile's behavior for this repo with .greptile/rules.md and .greptile/config.json.

greptile-apps · 2026-05-28T21:32:26Z

                if suite.name == "pytest":
                    suite.name = os.path.splitext(file_name)[0]
+                for case in suite:
+                    case_results = list(case.result) if case.result else []


In junitparser 2.x case.result already returns a list; wrapping it in list(...) creates an unnecessary shallow copy. The idiom case.result or [] expresses the intent more directly — fall back to an empty list when case.result is falsy (either None or []).

Suggested change

-                    case_results = list(case.result) if case.result else []
+                    case_results = case.result or []


pv-nvidia

Overall this looks good to me.

The tiny greptile suggestion

case_results = case.result or []

seems valid?
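
A quick way to check the suggestion is to compare the two idioms on the values case.result can take according to the review (None, an empty list, or a non-empty list). The sketch below uses plain Python lists as stand-ins for junitparser result objects. The function names are hypothetical.

```python
def copy_or_empty(result):
    """Original idiom: always returns a new list."""
    return list(result) if result else []


def same_or_empty(result):
    """Suggested idiom: returns the original list when it is non-empty."""
    return result or []


results = ["failure"]  # stand-in for a list of junitparser result objects

# Both idioms produce equal values for None, an empty list, and a non-empty list.
for value in (None, [], results):
    print(copy_or_empty(value) == same_or_empty(value))

# The only difference is aliasing: the suggested form returns the same object.
print(same_or_empty(results) is results)
print(copy_or_empty(results) is results)
```

The aliasing only matters if the caller later mutates case_results. For read-only checks such as the isinstance tests in this PR, the two forms behave the same.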

Logging per test case status

3746795

pbarejko requested a review from hhansen-bdai as a code owner May 28, 2026 21:29

github-actions Bot added the infrastructure label May 28, 2026

isaaclab-review-bot Bot reviewed May 28, 2026

greptile-apps Bot reviewed May 28, 2026

fix: reduce conftest complexity

23c079c

pv-nvidia force-pushed the pbarejko/per-case-status branch from f6d72b4 to 23c079c on May 29, 2026 12:37

pv-nvidia reviewed May 29, 2026

pv-nvidia added 4 commits May 29, 2026 16:25

ci: reduce test runner complexity

5780763

Revert "ci: reduce test runner complexity"

bbd0270

This reverts commit 5780763.

ci: reduce test runner complexity

d4d6010

Revert "ci: reduce test runner complexity"

6d1f645

This reverts commit d4d6010.

Logging per test case status #5847
pbarejko wants to merge 6 commits into isaac-sim:develop from pbarejko:pbarejko/per-case-status

3 participants
