Perf: Improve gkr-mimc memory use by Tabaie · Pull Request #1616 · Consensys/gnark

Tabaie · 2025-09-24T16:43:19Z

Improves the amount of heap allocations in the benchmark of a 2^16 BLS12-377 element long hash down to 1798.62 MB on an hpc6a.48xlarge machine. This constitutes a 20% improvement over the linea-monorepo baseline (2267.45 MB.)

Most of the improvement was achieved by introducing a reusable pool / stack for temporary field element variables, and is generic to any use of the GKR API.
A further reduction was due to a hard-coded implementation of the most commonly used MiMC gate in BLS12-377, which is (state + msg + key)^17.

Log Instance Size	computeGJ	Solve	Total
14	51.05 MB	1102.33 MB	1153.38 MB
15	103.09 MB	1244.49 MB	1347.58 MB
16	228.72 MB	1569.90 MB	1798.62 MB
17	388.39 MB	2943.39 MB	3331.78 MB
18	871.35 MB	4127.38 MB	4998.73 MB
19	1657.63 MB	6413.27 MB	8070.9 MB
20	3352.88 MB	12711.18 MB	16064.06 MB
21	6.63 GB	21.47 GB	28.1 GB
22	13.01 GB	42.69 GB	55.7 GB
23	26.06 GB	83.92 GB	109.98 GB

I believe that for small instance sizes (< 2^16) the cost is dominated by the GKR verifier and the PLONK prover itself. From that point on, the slope of the fitted line (<1) suggests an at most linear rate of growth.

Note

Improves GKR performance and memory by reducing heap allocations and optimizing common exponentiation.

Add pooled, pointer-based gateAPI with newElement, cast, and freeElements; switch all gate evaluations to &api and call freeElements in hot loops
Extend gkr.GateAPI with SumExp17 and add FrontendAPIWrapper implementation; use for (a+b+key)^17 in MiMC and generic GKR paths
Refactor MiMC registration: return curve-specific constants as frontend.Variable, change S-Box builders to accept frontend.Variable keys, and use SumExp17 where applicable
Update generator templates and per-curve backends (bls12-377/381, bls24-315/317, bn254, bw6-633/761, small_rational) to the new gateAPI and solver hints
Clean up tests/benchmarks: modernize circuits, rename hash tree benchmarks to Merkle tree, use gnark backend options

^{Written by Cursor Bugbot for commit 1676049. This will update automatically on new commits. Configure here.}

Co-authored-by: Copilot <[email protected]>

Copilot

Pull Request Overview

This PR optimizes memory usage in the GKR-MiMC implementation by introducing memory pooling for field element allocations and adding a specialized SumExp17 operation. The changes result in a 20% reduction in heap allocation (from 2267.45 MB to 1798.62 MB) for BLS12-377 benchmarks.

Memory pool implementation in gateAPI to reduce heap allocations
New SumExp17 method for optimized computation of (a+b+c)^17 operations
Refactoring from global to instance-based API usage patterns

Reviewed Changes

Copilot reviewed 33 out of 33 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
std/gkrapi/gkr/types.go	Adds SumExp17 method to GateAPI interface
std/gkrapi/compile.go	Wraps API with FrontendAPIWrapper for gate evaluation
internal/gkr/gkr.go	Adds FrontendAPIWrapper with SumExp17 implementation
std/permutation/gkr-mimc/gkr-mimc.go	Optimizes addPow17 function with BLS12-377 specific caching
Multiple internal/gkr/*/gkr.go	Implements memory pooling in gateAPI across all curve implementations
Multiple internal/gkr/*/solver_hints.go	Updates to use instance-based gateAPI
Test files	Renames hashTreeCircuit to merkleTreeCircuit and improves test configuration

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

cursor · 2026-01-09T19:06:13Z

+
+func (api *gateAPI) newElement() *{{ .ElementType }} {
+	api.nbUsed++
+	if api.nbUsed >= len(api.allocated) {


Off-by-one causes unnecessary allocations in memory pool

Medium Severity

The newElement function uses >= in its condition if api.nbUsed >= len(api.allocated) when it should use >. After incrementing nbUsed, the function returns allocated[nbUsed-1]. When nbUsed equals len(allocated), the index nbUsed-1 is still valid (within bounds), but the current condition triggers an unnecessary append. This causes an extra allocation every time the pool is reused up to its previous capacity, which undermines the PR's memory optimization goal. The condition should be api.nbUsed > len(api.allocated).

Additional Locations (2)

internal/gkr/bls12-377/gkr.go#L826-L827

internal/gkr/small_rational/gkr.go#L826-L827

cursor · 2026-01-11T23:27:29Z

+		api.allocated = append(api.allocated, new(fr.Element))
+	}
+	return api.allocated[api.nbUsed-1]
+}


Off-by-one causes unnecessary allocations in element pool

Medium Severity

The newElement function uses >= instead of > in its condition, causing an unnecessary allocation every time the pool is reused after freeElements() is called. When nbUsed equals len(api.allocated), the element at index nbUsed-1 already exists and can be returned, but the >= condition triggers an append first. Since this PR's goal is reducing memory allocations, this off-by-one error partially defeats the optimization. This pattern is replicated across all curve implementations.

Additional Locations (1)

internal/generator/backend/template/gkr/gkr.go.tmpl#L820-L827

cursor · 2026-01-11T23:27:29Z

 		}
 	}

+	var api gateAPI


Missing pool recycling in Complete function loop

Medium Severity

The Complete function calls api.evaluate inside a nested loop iterating over all instances and wires, but never calls api.freeElements() to recycle the memory pool. Unlike solver_hints.go and computeAll which properly call freeElements() after each gate evaluation, this code path allows the pool to grow unboundedly. For large circuits this defeats the memory optimization that is the primary goal of this PR. This pattern is replicated across all curve implementations via the template.

Additional Locations (1)

internal/generator/backend/template/gkr/gkr.go.tmpl#L669-L681

Tabaie · 2026-01-21T21:59:31Z

Closed in favor of #1676

Tabaie and others added 30 commits June 1, 2025 11:36

refactor: addInstance instead of series etc

609d47d

feat: check for duplicate gates, allow limiting curves for gate

79d4cbf

refactor: gkr api tests

82e33e5

refactor gkr example

ba06e5d

refactor: solve hint called per instance

700c152

revert: newPrint back in std/gkr to avoid import cycle

30f5633

refactor: remove circuit/instance rearranging

44d6b4c

chore: generify

f5f726c

fix: registry duplicate detection

644fd6a

fix: solver hint id mismatch

2001e83

remove redundant make

d3ca3c1

fix: works on plonk

6474482

refactor: solve hint for test engine

0c0b6b0

fix prove hint

478d53d

fix package tests

6c83740

refactor: remove println

d6e382f

chore: generify print removal

a8f30a4

feat: GetValue

8ac5f9f

fix all gkrapi tests pass

5bb879d

fix gkr-poseidon2

7f9a0f1

Merge branch 'master' into feat/gkr/add-instance

355277c

fix: reduce in test engine

cfdb9d3

fix: rename GkrCompressions -> GkrPermutations

12788f3

bench: gkrposeidon2

9a0bf0e

fix pad for bls12377

618beff

some more padding fixes

22b77f2

fix: padding issue in bn254

250a35b

chore generify fix

1586760

Let Uint64 panic

e593d90

Update constraint/solver/gkrgates/registry.go

9df619a

Co-authored-by: Copilot <[email protected]>

Tabaie marked this pull request as ready for review September 25, 2025 04:09

Tabaie requested a review from Copilot September 25, 2025 04:09

Copilot AI reviewed Sep 25, 2025

View reviewed changes

Comment thread std/permutation/gkr-mimc/gkr-mimc.go Outdated

Comment thread internal/gkr/small_rational/gkr.go

Tabaie requested a review from ivokub September 25, 2025 04:12

This comment was marked as outdated.

Sign in to view

ivokub added the feat: gkr PRs related to GKR label Sep 25, 2025

ivokub mentioned this pull request Sep 29, 2025

feat: BLS12-381 precompiles glue Consensys/linea-monorepo#915

Merged

3 tasks

Tabaie added 12 commits September 30, 2025 16:56

perf: store keys as fr.Elements instead of big.Int

f6613ce

Merge branch 'master' into feat/gkr/hashes

d7eec96

fix: match api changes

610d49c

fix: api use

8549a0b

remove engine_hints

8bcce8f

revert change to engine.go

a045f11

fix: mimc tests

9c35fd7

fix: error message

dd4e36a

Merge branch 'master' into feat/gkr/hashes

f2d2c2f

fix: error on empty list

560e80e

Merge branch 'master' into feat/gkr/hashes

bc1750c

Merge branch 'feat/gkr/hashes' into perf/mem/gkr-exp17

543905e

cursor bot reviewed Jan 9, 2026

View reviewed changes

revert: api in the solve hint

c80eff5

cursor bot reviewed Jan 11, 2026

View reviewed changes

fix: constant is var

1488ac9

Base automatically changed from feat/gkr/hashes to master January 12, 2026 22:38

Tabaie added 2 commits January 12, 2026 19:02

Merge branch 'master' into perf/mem/gkr-exp17

4395bd0

fix: imports

1676049

Tabaie mentioned this pull request Jan 14, 2026

feat: Compiled Gates for GKR #1676

Merged

Tabaie closed this Jan 21, 2026

Tabaie deleted the perf/mem/gkr-exp17 branch January 21, 2026 21:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Perf: Improve gkr-mimc memory use#1616

Perf: Improve gkr-mimc memory use#1616
Tabaie wants to merge 117 commits intomasterfrom
perf/mem/gkr-exp17

Tabaie commented Sep 24, 2025 •

edited by cursor bot

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

This comment was marked as outdated.

Uh oh!

cursor bot Jan 9, 2026

Uh oh!

Uh oh!

cursor bot Jan 11, 2026

Uh oh!

cursor bot Jan 11, 2026

Uh oh!

Tabaie commented Jan 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Tabaie commented Sep 24, 2025 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

This comment was marked as outdated.

Uh oh!

cursor bot Jan 9, 2026

Choose a reason for hiding this comment

Off-by-one causes unnecessary allocations in memory pool

Uh oh!

Uh oh!

cursor bot Jan 11, 2026

Choose a reason for hiding this comment

Off-by-one causes unnecessary allocations in element pool

Uh oh!

cursor bot Jan 11, 2026

Choose a reason for hiding this comment

Missing pool recycling in Complete function loop

Uh oh!

Tabaie commented Jan 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Tabaie commented Sep 24, 2025 •

edited by cursor bot

Loading