Python: add unit tests for rst_code_example_pipeline by gusthoff · Pull Request #1369 · AdaCore/learn

gusthoff · 2026-06-20T01:53:56Z

Add a comprehensive pytest-based unit test suite for the
rst_code_example_pipeline package. The package was recently extracted
from frontend/py_modules/code_projects/ into a standalone, installable
Python package but shipped with only a smoke-test suite covering --help
exit codes. This change closes that gap: 344 tests at 92% coverage,
exercising all 12 source modules from pure utility helpers through full
Ada/C compilation, run, and SPARK-proof pipelines.

Changes

pyproject.toml: add pytest + pytest-cov as optional [test]
extras; configure [tool.pytest.ini_options] and [tool.coverage.*]
with branch = true and fail_under = 90
Makefile: new test_rst_pipeline target
frontend/python/rst_code_example_pipeline/README.md: new
"Development / Running the tests" section
tests/: 11 new test modules covering all 12 source modules —
unit tests for pure-Python helpers (colors, fmt_utils, resource,
checks, blocks, chop), toolchain-dependent modules (toolchain_info,
toolchain_setup, extract_projects, check_projects,
check_code_block), and integration tests for real Ada/C compilation,
run, and SPARK proof paths
Strategic # pragma: no cover / # pragma: no branch annotations for
structurally dead/unreachable branches in three modules

Testing / Validation

All 344/344 tests pass (Python 3.12.3, pytest 9.1.1)
Coverage: 92% (branch coverage enabled; fail_under = 90)

Notes

Global mutable state (verbose, code_block_at, etc.) is reset per
test via autouse fixtures — no cross-test contamination
Diag is independently defined in both extract_projects and
check_code_block — noted as future cleanup, out of scope here

Co-Authored-By: Claude Sonnet 4.6 noreply@anthropic.com

Add an optional-dependency group `[test]` to pyproject.toml so the test toolchain can be installed with: pip install -e ".[test]" This installs pytest and pytest-cov without making them mandatory for users who only need the pipeline itself. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Add [tool.pytest.ini_options] to set testpaths and default addopts (coverage measurement + term-missing report). Add [tool.coverage.run] with branch coverage enabled, and [tool.coverage.report] requiring ≥90% coverage and showing missing lines. Running `pytest` from the package root now measures coverage automatically without extra command-line flags. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Add a new target that runs the rst_code_example_pipeline unit test suite via pytest. Coverage options and testpaths are configured in the package's pyproject.toml, so the target only needs to cd to the package directory and invoke pytest. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Add a "Development" section to the package README covering: - how to install test extras with `pip install -e ".[test]"` - how to run pytest from the package root (plain `pytest`) - the Ada toolchain (GNAT) requirement for the full suite - the explicit coverage invocation for reference No VM-specific or build-system details in the module README. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

After the rst_code_example_pipeline package description, add a short paragraph pointing readers to the package's pytest suite: names the `make test_rst_pipeline` target and links to the "Development" section of the package README for full install and run instructions. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…_pipeline

Tests cover Colors class ANSI escape sequence attributes, col() with colors enabled and disabled, printcol() output, the no_colors() context manager (disable inside, restore outside, nested use), disable_colors(), CI/non-TTY detection, and adversarial direct __enter__/__exit__ usage. Documents known limitation: no_colors() uses a bare yield without try/finally, so _enabled is not restored if an exception propagates out of the with-block. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Tests cover header() (content, star underline length, return type), error() (stdout output containing ERROR/loc/msg), simple_error() and simple_success() (stdout output). Adversarial cases include empty string, Unicode with non-ASCII characters, and verifying that no output goes to stderr. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Tests cover the Resource constructor (basename storage, content=None, content=[], single-element, multi-element join), the append() method, and the content property (always returns str). Adversarial cases include append of empty string, append of a line with embedded newline, a large 1000-element content list, and content=None never returning None. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Tests cover CodeCheck construction and defaults (timestamp float, all fields None by default), BlockCheck construction (empty checks dict regardless of parameter), add_check() accumulation, to_json_file() + from_json_file() round-trips, and from_json_file() with a nonexistent file returning None. Documents known limitation: BlockCheck.__init__ always resets self.checks to an empty dict, ignoring the 'checks' keyword argument, so nested CodeCheck entries are lost on a JSON round-trip. Adversarial cases include overwriting an existing file and passing an empty JSON object ({}) which raises TypeError. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Tests cover Block.get_blocks_from_rst() for all attribute combinations (minimal Ada block, project/main_file, compiler switches, gnat version selection, language=c, manual_chop, buttons, :code-config: directive, two consecutive blocks), CodeBlock derived fields (no_check, syntax_only, run_it, compile_it, prove_it, text_hash/text_hash_short), CodeBlock JSON round-trip, ConfigBlock construction and update(), and adversarial paths (empty RST, nonexistent JSON file). Documents end-of-file behaviour: a block with content but no trailing paragraph produces a WARNING and is still parsed successfully; a block with an empty body cannot be processed and triggers exit(1), captured as SystemExit. These tests call toolchain_info.get_toolchain_default_version() at parse time and therefore require the epub VM with the Ada toolchain. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Adds test_chop.py in the package test directory covering manual_chop and cheapo_gnatchop edge cases not present in the existing frontend/sphinx/tests/test_chop.py. manual_chop additions: .ads and .adb Ada extensions, empty input, input with no !filename lines at all, garbage before the first valid file marker, single filename with no content, fake extensions not matched. cheapo_gnatchop additions: dotted package body names (Foo.Bar → foo-bar.adb), dotted procedure names, triple-dotted names, spec-only files (.ads), empty input, only-garbage input, garbage before the first declaration, body-before-spec ordering. Does not cover real_gnatchop (requires the Ada toolchain; already tested in sphinx/tests/test_chop.py). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Tests cover init_toolchain_info() populating DEFAULT_VERSION, TOOLCHAINS, and TOOLCHAIN_PATH; get_toolchain_default_version() auto-initialising and returning version strings for gnat/gnatprove/gprbuild; re-initialisation idempotency; KeyError for unknown tool; and state isolation via an autouse fixture that clears the module-level dicts before and after each test. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Tests cover reset_toolchain() with no pre-existing symlinks, with symlinks present, and for idempotency; set_toolchain() with "default" versions (no symlinks created) and "selected" versions (symlinks created pointing to the correct version directories); set_toolchain() followed by reset_toolchain(); and adversarial double set_toolchain() without an explicit reset in between. An isolated_toolchain_path fixture redirects symlink creation into a tmp_path subdirectory so /opt/ada/selected is not mutated during tests. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Tests cover get_project_dir() with simple and dotted project names; write_project_file() for all four combinations of spark_mode × main_file × compiler_switches; ProjectsList construction, add(), JSON round-trip, and missing-file None return; and analyze_file() with no-check, syntax-only, manual_chop (C), ConfigBlock, no-button, and no-project (SystemExit) cases. A work_dir fixture uses monkeypatch.chdir() to isolate tests that write to the filesystem. Global module state (verbose, code_block_at, current_config) is reset before and after each test by an autouse fixture. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Tests cover get_blocks() with an empty regex list, with a valid block_info.json, with a glob pattern, with two projects in separate subdirectories, and with a block whose project field is None (skipped with ERROR); get_projects() without a projects list file and with one; cwd side-effect isolation (os.chdir is called internally — restored by an autouse fixture); a WARNING when projects_list_file does not exist; the check_block() thin wrapper; and check_projects() integration with a build dir containing no-check blocks. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Tests cover Diag.__repr__; check_block() with no_check=True (early return, no subprocess); cache hit with status_ok=True/False/None; force_checks=True bypassing cache; BUTTONS check failure for empty buttons list; real Ada syntax check (gcc -gnats) for valid and invalid Ada; real gprbuild compile for a valid Ada procedure and a procedure with a syntax error; real run check for a compilable Ada program; BUTTONS check for selected toolchain with non-"no" button; and check_code_block_json() with a missing file and with a valid no-check block. Key fix in _make_block(): changed `buttons = buttons or ["no"]` to `buttons = ["no"] if buttons is None else buttons` so that passing `buttons=[]` explicitly is preserved (an empty list is falsy, causing the `or` form to silently substitute `["no"]`). Added compile_it and run_it parameters to _make_block() to allow tests to reach the BUTTONS check without triggering the compile path that requires a project file. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Lower fail_under from 90% to 75%: the three toolchain-dependent modules (check_code_block, check_projects, extract_projects) have large compile/run/prove code paths that are not exercised by unit tests; together they cap realistic coverage well below 90%. Add exclude_lines for "if __name__ == '__main__':" so the CLI entry-point blocks in check_code_block, check_projects, and extract_projects (approximately 80 lines total) are excluded from the measurement; those blocks are tested via the --help smoke tests rather than unit tests. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

… modules

Generated by pip install -e (editable installs); not part of the source tree. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Three locations in check_block() that coverage cannot reach when imported: - `if __name__ == '__main__':` guard inside check_block() — impossible when called as a module; already matched by exclude_lines, pragma is belt-and-suspenders - `if False:` block (35-line dead code, structurally unreachable) - `if True:` block (branch coverage flags the never-taken false path of an always-true condition) These three pragmas bring overall package coverage from 75.43% to 76.55%. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

The isinstance(block, blocks.ConfigBlock) guard at line 251 is inside a loop over projects[project], which is built exclusively from CodeBlock instances (ConfigBlocks are filtered out earlier in analyze_file). The branch is structurally unreachable under normal execution. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

The module-level TTY check at line 39 always takes the true branch in a pytest environment (non-TTY), making the false branch structurally unreachable without TTY mocking or importlib.reload tricks. pragma: no branch suppresses the missed-branch coverage arc. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…election) Add two tests covering the gnatprove and gprbuild version-selection attribute paths in get_blocks_from_rst() — lines 129 and 133 of blocks.py. These two branches (gnatprove_version = ["selected", ...] and gprbuild_version = ["selected", ...]) were not exercised by any existing test; the only version-selection test used gnat=. The new tests parse RST blocks with gnatprove= and gprbuild= attributes and assert the resulting CodeBlock carries ["selected", <version>] for the respective field. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…e/inactive paths, same-project branch) Add three groups of tests to test_extract_projects.py: Diag class (lines 28-40): new TestDiag class with three tests verifying that __init__ stores all four fields and __repr__ produces "file:line:col: msg" format, including an edge case with zero/empty values. analyze_file() coverage-improvement tests (B2, B3) — added to TestAnalyzeFile: - test_code_block_at_sets_inactive: sets code_block_at=9999 so no block's line range matches; all blocks stay inactive and the inner loop hits the continue path (lines 188-191, 211). Asserts no project directory is created. - test_verbose_prints_headers: sets verbose=True; confirms project name appears in stdout (lines 246-248). - test_second_call_same_project_logs_exists: calls analyze_file() twice on the same RST; second call prints "already exists" (lines 234-237). - test_no_check_verbose_skip: verbose=True with a no-check block; confirms "Skipping" appears in stdout (line 344). Same-project second block (line 218 false branch): new class TestAnalyzeFileSameProjectTwoBlocks with an RST containing two no-check Ada blocks sharing project=SameProject; verifies both are processed without error and two block_info.json files are written. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…ock, duplicate project, None-block) Add reset_cp_globals autouse fixture to reset cp.verbose, cp.all_diagnostics, cp.max_columns and cp.force_checks before and after each test. The existing tests did not reset these globals, so any test that set verbose=True would have leaked state into subsequent tests. Add new TestCheckProjectsExtended class with four tests: - test_get_blocks_from_json_file_returns_none: monkeypatches CodeBlock.from_json_file to return None; verifies get_blocks() prints ERROR and returns an empty dict (lines 30-32). - test_get_blocks_duplicate_project: two block_info.json files with the same project name; verifies both are accumulated in the list under one key (false branch of "if not b.project in projects:" at lines 38-40). - test_get_projects_verbose: sets cp.verbose=True and calls check_projects(); verifies the project header appears in stdout (lines 87-88). - test_check_projects_skips_inactive_block: serialises a block with active=False; monkeypatches cp.check_block to track calls; verifies the inactive branch (line 93 continue) is taken and check_block is never called. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…piler switches, error handler) Add TestRealGnatchop class with four tests exercising the real_gnatchop function (chop.py lines 96-149), which was previously untested because the Ada toolchain is required: - test_valid_ada_no_switches_returns_resources: calls real_gnatchop with compiler_switches=None on minimal valid Ada; verifies a non-empty list of Resource objects is returned (line 118 — the compiler_switches=None branch). - test_valid_ada_no_switches_basename: confirms gnatchop produces main.adb. - test_valid_ada_with_compiler_switches: passes compiler_switches=["-gnata"]; exercises lines 120-125 (the cmd.extend branch). - test_invalid_input_raises_exception: passes garbage input; gnatchop fails; verifies the except CalledProcessError handler (lines 137-144) raises Exception with "Could not chop files with gnatchop". Also update the module docstring to reflect that real_gnatchop is now covered. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…HAIN_PATH triggers init) Add TestSetToolchain class with one test covering lines 12-13 of toolchain_setup.py: the guard 'if not "root" in info.TOOLCHAIN_PATH:' that calls init_toolchain_info() when the path dict has not been populated yet. The existing tests always relied on the isolated_toolchain_path fixture, which pre-populated TOOLCHAIN_PATH before set_toolchain() was called, keeping the guard permanently false. The new test uses monkeypatch.delitem to remove "root" from TOOLCHAIN_PATH within the isolated fixture scope; set_toolchain() then triggers init_toolchain_info() at line 13, repopulating the dict from the real .ini file. The assertion checks that "root" is present again after the call. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

The bodies of the if __name__ == "__main__": guards in check_code_block.py, check_projects.py, and extract_projects.py are CLI entry-point code that cannot be exercised by unit tests. The exclude_lines pattern in pyproject.toml already suppresses the guard line itself, but coverage.py 7.x still counts the body lines as uncovered. Adding # pragma: no cover to the guard lines causes coverage to exclude the entire block body, accurately reflecting the fact that these paths are tested via the --help smoke tests (test_smoke.py) and not by the unit test suite. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Adds eight new tests that exercise real compiler invocations: - TestCheckBlockCCompile: gcc compiles a valid C file (returns False) and an invalid C file (returns True), covering the C-language compile path in check_block(). - TestCheckBlockExpectCompileError: a block marked ada-expect-compile-error that fails to build at the gprbuild BUILD phase returns False (expected failure is not an error); a valid C file compiled and run (exits 0) returns False, covering the C run path. - TestCheckBlockGnatprove: a minimal SPARK Ada block with prove_it=True runs gnatprove and returns False; a C block with prove_it=True returns True (C + prove unsupported), covering the else branch of the language guard in the prove path. - TestCheckBlockVerbose: verbose=True with a cached status_ok=True block prints "already checked. Skipping...", exercising the verbose cache-skip output path; verbose=True with all_diagnostics=True on a real Ada compile exercises both the verbose toolchain-version print and the all_diagnostics diagnostic-dump path. Also removes line-number annotations from pre-existing comments and section headers (line numbers are fragile and describe location rather than behaviour). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Adds three new tests in TestAnalyzeFileIntegration that exercise real Ada toolchain paths inside analyze_file(): - test_analyze_file_compile_button: RST with a compile_button Ada block; real_gnatchop is called, the project file is written, and block_info.json is created. Returns False (no error), covering the compile_it path in analyze_file() including prepare_project_block_dir() and to_json_file(). - test_analyze_file_run_button: same setup with a run_button attribute; covers the run_it branch (compile_it=True, run_it=True). - test_analyze_file_prove_button: SPARK Ada body with a prove_button attribute; write_project_file() uses spark_mode=True, covering the prove_it branch including the SPARK project-file path. All three tests require the Ada toolchain (gnatchop must be in PATH) and use the work_dir fixture so each test starts in a fresh temporary directory. Also removes line-number annotations from pre-existing section-header comments (line numbers are fragile and describe location rather than behaviour). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Adds one new test in TestCheckProjectsReturnsTrue that exercises the check_error=True propagation path in check_projects(): - test_check_projects_returns_true_on_check_error: sets up a block_info.json with a CodeBlock that has compile_it=True and source that fails to compile (deliberate Ada syntax error). With force_checks=True, check_projects() calls check_block(), which invokes gprbuild, which fails. check_projects() then returns True, confirming that check_error is propagated to the caller. This test requires the Ada toolchain (gprbuild must be in PATH). It covers check_projects.py line 100 (check_error = True), the last previously uncovered statement in check_projects.py, bringing its coverage to 100%. Also removes line-number annotations from pre-existing section-header comments (line numbers are fragile and describe location rather than behaviour). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…un/prove paths

All F1–F5 tests pass at 92% coverage. The gate was 75 during development to allow incremental test authoring; 90 is the target reflecting the achieved level and prevents coverage regressions going forward. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

gusthoff and others added 30 commits June 19, 2026 18:49

Merge feat/pytest-infra: pytest + coverage setup for rst_code_example…

668771c

…_pipeline

Merge feat/pure-module-tests: unit tests for pure Python modules

83cd29a

Merge feat/toolchain-module-tests: unit tests for toolchain-dependent…

04976b5

… modules

Infra: ignore *.egg-info/ directories

a9946d8

Generated by pip install -e (editable installs); not part of the source tree. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Merge feat/coverage-improvements: pragmas and extended unit tests

ab578e6

gusthoff and others added 7 commits June 20, 2026 02:13

Python: remove references from test file headers

fd28391

Merge feat/integration-tests: integration tests for Ada compilation/r…

03a9330

…un/prove paths

gusthoff added the website infrastructure label Jun 20, 2026

gusthoff marked this pull request as draft June 20, 2026 02:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python: add unit tests for rst_code_example_pipeline#1369

Python: add unit tests for rst_code_example_pipeline#1369
gusthoff wants to merge 37 commits into
AdaCore:mainfrom
gusthoff:dev/topic/infrastructure/python/rst-pipeline-unit-tests/2026-06-19/main

gusthoff commented Jun 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

gusthoff commented Jun 20, 2026

Changes

Testing / Validation

Notes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant