feat: Add "deps" command to generate a graph of rule depdendencies. by wxsBSD · Pull Request #498 · VirusTotal/yara-x

wxsBSD · 2025-11-14T20:29:16Z

This branch adds a "deps" command that generates dependency information for a set of rules. It walks the AST looking for identifiers of rules, modules and unknown identifiers (hopefully external variables) and collects information about them. For any given rule it will output either the dependencies of that rule or the reverse dependencies of that rule. The output is in the form of a graphviz file that can be piped into dot to generate a visual graph.

For example, given these rules:

rule a { condition: pe.is_dll() }
rule b { condition: a }
rule c { condition: b }
rule d { condition: false }

You can print out the dependencies of a with yr deps -r a rules/test.yara:

digraph {
  a [fillcolor=paleturquoise, style="filled"];
  pe [fillcolor=palegreen, style="filled"];
  a -> pe;
}

This can be useful if you're looking to find the minimum set of rules and modules needed to share rule a in this case. It obviously becomes harder to determine this without a dependency walker when you have more complex graphs. For example, knowing the set of rules and imports to share rule c is more complex just due to the length of the chain.

You can also get reverse dependencies, which is a nice thing to know when you want to make a change to a rule. For example, if I were to change rule a it would be nice to know that I haven't broken any of the rules that depend upon it (directly or indirectly). Assuming those rules have "expected matches" values in the metadata you can use the dependency walking code to determine what rules to test and what they should match.

yr deps -R -r a rules/test.yara:

digraph {
  a [fillcolor=paleturquoise, style="filled"];
  b [fillcolor=paleturquoise, style="filled"];
  b -> a;
  c [fillcolor=paleturquoise, style="filled"];
  c -> b;
}

Given a set of rules parse it and walk the AST to find identifiers and generate a dot file of them that can be fed into graphviz for visualization. By default it generates a graph of all the rules but you can select any number of rules with the -r argument. For example, given these rules: ``` rule a { condition: pe.is_dll() } rule b { condition: a } rule c { condition: b } rule d { condition: false } ``` And selecting using `-r b` you get output that looks like this: ``` digraph { b [fillcolor=paleturquoise, style="filled"]; a [fillcolor=paleturquoise, style="filled"]; pe [fillcolor=palegreen, style="filled"]; a -> pe; b -> a; } ``` This mode is best thought of as "what is the minimum set of rules and imports I need to execute the selected rule." Using the -R argument displays the reverse dependencies of a rule. For the same rules above the output when using -R is: ``` digraph { b [fillcolor=paleturquoise, style="filled"]; c [fillcolor=paleturquoise, style="filled"]; c -> b; } ``` This mode is best thought of as "if I change this rule, what other rules do I also need to test."

Move the dependency walking code to it's own command and make it hidden by default until it gets more testing.

wxsBSD · 2025-11-14T20:31:32Z

I don't have any tests for this yet, but I'm willing to write them if you think this is a good idea to include in yara-x. I'm just putting this out there now to get some early feedback.

I have tested this with a very complex set of rules from work and it does parse them and output graphs. However, the graphs quickly turn very hard to understand if you have exceptionally large dependency chains in your output. For smaller graphs (dozens of dependencies) it looks much better.

cli/src/commands/deps.rs

wxsBSD · 2025-11-18T21:39:02Z

I've updated the code to use the new features you've added and it works great, thanks!

I've decided to stop efforts to track unknown identifiers because it can get weird. For example, these two rules produce drastically different ASTs:

rule a { condition: pe.signatures.len() > 0 }
rule b { condition: (pe).signatures.len() > 0 }

The ASTs:

 rule a
 └─ condition
    └─ gt
       ├─ len()
       │  └─ <object>
       │     └─ field access
       │        ├─ pe
       │        └─ signatures
       └─ 0

 rule b
 └─ condition
    └─ gt
       ├─ field access
       │  ├─ pe
       │  └─ len()
       │     └─ <object>
       │        └─ signatures
       └─ 0

In the case of a it is pretty easy to ignore signatures if you only track the first identifier operand to a Expr::FieldAccess however the AST of b is different enough that you have to take a different approach. I couldn't come up with a good way to track unknown identifiers that is robust to these "less normal" ASTs. It felt too fragile to try to do this.

It is for this reason that I went ahead and removed the "unknown identifier" part of this PR and now we only track dependencies to existing rules or things that look like modules (all other identifiers are ignored).

Variables can be tracked by using a vector that behaves as a stack of defined variable identifiers, and another vector containing the indexes within this stack were each variable scope starts.

plusvic · 2025-11-19T12:34:24Z

I've updated the code to use the new features you've added and it works great, thanks!

I've decided to stop efforts to track unknown identifiers because it can get weird. For example, these two rules produce drastically different ASTs:
rule a { condition: pe.signatures.len() > 0 }
rule b { condition: (pe).signatures.len() > 0 }
The ASTs:
 rule a
 └─ condition
    └─ gt
       ├─ len()
       │  └─ <object>
       │     └─ field access
       │        ├─ pe
       │        └─ signatures
       └─ 0

 rule b
 └─ condition
    └─ gt
       ├─ field access
       │  ├─ pe
       │  └─ len()
       │     └─ <object>
       │        └─ signatures
       └─ 0
In the case of a it is pretty easy to ignore signatures if you only track the first identifier operand to a Expr::FieldAccess however the AST of b is different enough that you have to take a different approach. I couldn't come up with a good way to track unknown identifiers that is robust to these "less normal" ASTs. It felt too fragile to try to do this.

It is for this reason that I went ahead and removed the "unknown identifier" part of this PR and now we only track dependencies to existing rules or things that look like modules (all other identifiers are ignored).

Field names can't be handled as identifiers because they could cause dependencies that don't exist actually. For instance:

        import "pe"

        rule a {
        condition:
          true
        }

        rule b {
        condition:
            pe.a
        }

With the current implementation the b is reported as dependent on a, but that's not true.

I'm just thinking out loud, but I believe any identifier that is under a field access expression should be ignored, except for the first operand.

plusvic · 2025-11-19T13:52:52Z

I think that 50ba863 fixes the issue with field names. While implementing this solution I found a bug fixed in 8eaa4db.

…ors.

cli/src/commands/deps.rs

wxsBSD · 2025-11-20T19:38:35Z

I'm just thinking out loud, but I believe any identifier that is under a field access expression should be ignored, except for the first operand.

I think you're right here. I spent a bit of time trying to come up with a rule that would cause a problem here but I haven't been able to. I did find a different problem but I'll open a different issue for that.

wxsBSD added 4 commits November 9, 2025 15:45

Move deps to it's own command.

2b04adf

Move the dependency walking code to it's own command and make it hidden by default until it gets more testing.

Merge branch 'main' into deps

01be926

-r is now required, remove this check.

5513550

Requested rules is no longer optional.

6dd5bf9

wxsBSD mentioned this pull request Nov 14, 2025

Expose modules and rule dependencies after compilation #484

Open

plusvic requested changes Nov 14, 2025

View reviewed changes

cli/src/commands/deps.rs Outdated Show resolved Hide resolved

cli/src/commands/deps.rs Outdated Show resolved Hide resolved

cli/src/commands/deps.rs Outdated Show resolved Hide resolved

wxsBSD added 3 commits November 17, 2025 10:33

Merge branch 'main' into deps

e0c769d

Use new DFSContext functionality.

e8ceaf2

Remove unknowns, it's a pain to deal with.

4de55b9

plusvic added 2 commits November 19, 2025 11:01

refactor: simplify the logic for tracking variables.

8377edc

Variables can be tracked by using a vector that behaves as a stack of defined variable identifiers, and another vector containing the indexes within this stack were each variable scope starts.

refactor: make the code easier to follow.

5d25228

plusvic added 2 commits November 19, 2025 14:42

Merge branch 'main' into deps

fa40b82

fix: prevent field names from being considered variables.

50ba863

plusvic added 2 commits November 19, 2025 15:00

refactor: minor changes.

4dd19d7

fix: return a non-zero exit code when yr deps fails with syntax err…

d1cdfaf

…ors.

plusvic reviewed Nov 19, 2025

View reviewed changes

cli/src/commands/deps.rs Outdated Show resolved Hide resolved

deps: Error on duplicate rules.

a1672fb

plusvic and others added 4 commits December 5, 2025 10:04

refactor: minor fixes.

d0d5bb3

tests: remove test case that was broken after changes in the parser.

449b133

Merge branch 'main' into deps

b51f2c5

Merge branch 'deps' of github.com:wxsBSD/yara-x into deps

af5bc27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add "deps" command to generate a graph of rule depdendencies.#498

feat: Add "deps" command to generate a graph of rule depdendencies.#498
wxsBSD wants to merge 19 commits intoVirusTotal:mainfrom
wxsBSD:deps

wxsBSD commented Nov 14, 2025

Uh oh!

wxsBSD commented Nov 14, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wxsBSD commented Nov 18, 2025 •

edited

Loading

Uh oh!

plusvic commented Nov 19, 2025 •

edited

Loading

Uh oh!

plusvic commented Nov 19, 2025

Uh oh!

Uh oh!

wxsBSD commented Nov 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

wxsBSD commented Nov 14, 2025

Uh oh!

wxsBSD commented Nov 14, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wxsBSD commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

plusvic commented Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

plusvic commented Nov 19, 2025

Uh oh!

Uh oh!

wxsBSD commented Nov 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

wxsBSD commented Nov 18, 2025 •

edited

Loading

plusvic commented Nov 19, 2025 •

edited

Loading