fix: use parameterized SQL queries in report_digest.py by gn00295120 · Pull Request #1645 · NVIDIA/garak

gn00295120 · 2026-03-22T00:51:33Z

Summary

Replace four f-string SQL query constructions in garak/analyze/report_digest.py with parameterized queries using ? placeholders
Affected functions: _init_populate_result_db, _get_group_aggregate_score, _get_probe_result_summaries, _get_detectors_info
Addresses CWE-89 (Improper Neutralization of Special Elements used in an SQL Command)

Background

The affected queries interpolated values derived from report JSONL data directly into SQL strings via f-strings. A malformed or adversarially crafted .report.jsonl file could include SQL metacharacters in field values such as probe, detector, or taxonomy tag names, causing unintended query behavior against the in-memory SQLite database.

Parameterized queries pass values as a separate tuple argument to cursor.execute(), so the SQLite driver handles escaping unconditionally — no string manipulation required.

Changes

_init_populate_result_db (line ~131): INSERT with 7 ? placeholders
_get_group_aggregate_score (line ~159): SELECT filtered by probe_group
_get_probe_result_summaries (line ~236): SELECT filtered by probe_group
_get_detectors_info (line ~262): SELECT filtered by probe_group and probe_class

No logic, variable names, return types, or other behavior was changed.

Test plan

Run existing test suite: pytest tests/ — all tests should pass without modification
Manually verify a report digest still generates correctly against a sample .report.jsonl

github-actions · 2026-03-22T00:51:43Z

DCO Assistant Lite bot All contributors have signed the DCO ✍️ ✅

Copilot

Pull request overview

This PR hardens garak/analyze/report_digest.py against SQL injection from untrusted .report.jsonl inputs by replacing f-string SQL construction with SQLite parameterized queries.

Changes:

Parameterized the INSERT into the in-memory results table in _init_populate_result_db.
Parameterized SELECT queries filtering by probe_group in _get_group_aggregate_score and _get_probe_result_summaries.
Parameterized the detector lookup query filtering by probe_group and probe_class in _get_detectors_info.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-22T00:53:38Z

garak/analyze/report_digest.py

 def _get_probe_result_summaries(cursor, probe_group) -> List[tuple]:
    res = cursor.execute(
-        f"select probe_module, probe_class, min(score) as s from results where probe_group='{probe_group}' group by probe_class order by s asc, probe_class asc;"
+        "select probe_module, probe_class, min(score) as s from results where probe_group=? group by probe_class order by s asc, probe_class asc;",
+        (probe_group,),
    )


Consider adding a targeted regression test that exercises these queries with adversarial probe_group / probe_class / detector values containing quotes or SQL metacharacters (e.g., "x' OR 1=1 --") and asserts build_digest() still succeeds and returns expected results. Current tests run the CLI over known-good assets but don't specifically cover the injection/escaping behavior this change is meant to fix.

gn00295120 · 2026-03-22T01:15:39Z

I have read the DCO Document and I hereby sign the DCO

Add parameterized tests exercising _init_populate_result_db, _get_group_aggregate_score, _get_probe_result_summaries, and _get_detectors_info with adversarial SQL metacharacter payloads (e.g., "x' OR 1=1 --", "'; DROP TABLE results; --"). Addresses Copilot review suggestion on PR NVIDIA#1645. Signed-off-by: Lucas Wang <[email protected]>

jmartin-tech · 2026-03-23T16:22:02Z

@gn00295120 using Copilot or another assistant to help create a PR is acceptable and even encouraged if it helps you understand your contribution, however this should be confined to your private fork at this time. Asking the assistant to review and comment on the public PR after being opened upstream places a maintenance burdens on the project for tools that are not used by all contributors. In the future please keep this in mind.

jmartin-tech

@gn00295120 thank you for the contribution. This looks great and will go thru additional testing.

As an additional note to address the reference to CWE-89 in the description, this is a valid case of an improvement to follow a more secure practice in sql statement generation, the values here are only user controlled when utilizing this class as a standalone utility and the database is ephemeral to the report digest creation process. This simply mean the security evaluation of risk show no exposed vulnerable attack chain existed. This does not diminish the value of this PR in improving the codebase and reducing the possible attack surfaces.

Replace f-string interpolation with parameterized queries using ? placeholders to prevent potential SQL injection from malformed report JSONL data. Addresses CWE-89. Signed-off-by: Lucas Wang <[email protected]>

Add parameterized tests exercising _init_populate_result_db, _get_group_aggregate_score, _get_probe_result_summaries, and _get_detectors_info with adversarial SQL metacharacter payloads (e.g., "x' OR 1=1 --", "'; DROP TABLE results; --"). Addresses Copilot review suggestion on PR NVIDIA#1645. Signed-off-by: Lucas Wang <[email protected]>

gn00295120 · 2026-03-25T00:37:50Z

Thank you for the detailed review and clarification — I really appreciate it.

Copilot AI review requested due to automatic review settings March 22, 2026 00:51

Copilot started reviewing on behalf of gn00295120 March 22, 2026 00:52 View session

Copilot AI reviewed Mar 22, 2026

View reviewed changes

gn00295120 force-pushed the fix/sql-parameterized-queries branch from 93a4d84 to 0b0f1b7 Compare March 22, 2026 06:42

github-actions bot added a commit that referenced this pull request Mar 22, 2026

@gn00295120 has signed the CLA in #1645

b0c8691

jmartin-tech reviewed Mar 23, 2026

View reviewed changes

gn00295120 added 2 commits March 24, 2026 08:37

fix: use parameterized SQL queries in report_digest.py

53d0277

Replace f-string interpolation with parameterized queries using ? placeholders to prevent potential SQL injection from malformed report JSONL data. Addresses CWE-89. Signed-off-by: Lucas Wang <[email protected]>

gn00295120 force-pushed the fix/sql-parameterized-queries branch from fd3f7b5 to 4a1878a Compare March 24, 2026 00:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: use parameterized SQL queries in report_digest.py#1645

fix: use parameterized SQL queries in report_digest.py#1645
gn00295120 wants to merge 2 commits intoNVIDIA:mainfrom
gn00295120:fix/sql-parameterized-queries

gn00295120 commented Mar 22, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 22, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 22, 2026

Uh oh!

gn00295120 commented Mar 22, 2026

Uh oh!

jmartin-tech commented Mar 23, 2026

Uh oh!

jmartin-tech left a comment

Uh oh!

gn00295120 commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

gn00295120 commented Mar 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Background

Changes

Test plan

Uh oh!

github-actions bot commented Mar 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Mar 22, 2026

Choose a reason for hiding this comment

Uh oh!

gn00295120 commented Mar 22, 2026

Uh oh!

jmartin-tech commented Mar 23, 2026

Uh oh!

jmartin-tech left a comment

Choose a reason for hiding this comment

Uh oh!

gn00295120 commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

gn00295120 commented Mar 22, 2026 •

edited

Loading

github-actions bot commented Mar 22, 2026 •

edited

Loading