fix(flags): fix cohort version update by matheus-vb · Pull Request #51994 · PostHog/posthog

matheus-vb · 2026-03-23T19:42:48Z

Problem

A customer reported their cohort showed stale data ("no new people since a day ago") despite last_calculation being recent. Removing and re-adding a filter fixed it by forcing a version bump.

The probable root cause: calculate_people_ch updates the Postgres Cohort.version after the try/finally block. If self.save() or _safe_reset_calculating_state() in the finally block raises, the version update is silently skipped. ClickHouse has the new version's data, but Postgres still points to the old version, so all queries return stale results.

Changes

Moved the Cohort.objects.filter(...).update(version=..., count=...) call from after the try/finally block into the try block, right after recalculate_cohortpeople succeeds. This ensures the version is persisted before the finally block runs, so exceptions there can no longer silently skip it.

Renamed update_fields to version_update_fields to avoid confusion with the self.save(update_fields=...) call in finally
Added a test that mocks _safe_reset_calculating_state to raise and verifies version is still updated

The concurrency guard (version__lt=pending_version | version__isnull=True) is preserved, lower versions still can't overwrite higher ones.

Tradeoff: If the version .update() DB call itself raises, errors_calculating is incremented even though ClickHouse data was written. This is visible and self-healing (next retry succeeds), far better than the current silent split-brain.

How did you test this code?

New unit test test_calculate_people_ch_updates_version_even_when_finally_raises
Existing cohort tests verified for regressions
ruff check and ruff format pass

greptile-apps · 2026-03-23T20:31:14Z

Prompt To Fix All With AI

This is a comment left during a code review.
Path: posthog/test/test_cohort_model.py
Line: 513-533

Comment:
**Test only covers one of two `finally` failure paths**

The new test mocks `_safe_reset_calculating_state` to raise, but `self.save()` is called *before* `_safe_reset_calculating_state` in the `finally` block. If `self.save()` raises, `_safe_reset_calculating_state` is never reached yet the fix still protects the version update. Adding a second case (or parameterising the test) would give complete coverage of the original problem statement.

For example, using `@pytest.mark.parametrize` over both `("_safe_reset_calculating_state", "DB connection lost")` and `("save", "DB error")` would demonstrate the fix handles both `finally`-block failure modes, which aligns with the PR description's claim that the fix addresses `self.save()` failures too.

How can I resolve this? If you propose a fix, please make it concise.

_{Reviews (1): Last reviewed commit: "apply fmt" | Re-trigger Greptile}

posthog/test/test_cohort_model.py

call update inside try block

6882ad0

matheus-vb changed the title ~~call update inside try block~~ fix(flags): fix cohort version update Mar 23, 2026

apply fmt

695f0ec

matheus-vb marked this pull request as ready for review March 23, 2026 20:28

matheus-vb requested a review from a team March 23, 2026 20:28

posthog-project-board-bot bot moved this to In Review in Feature Flags Mar 23, 2026

posthog-project-board-bot bot added this to Feature Flags Mar 23, 2026

greptile-apps bot reviewed Mar 23, 2026

View reviewed changes

posthog/test/test_cohort_model.py Outdated Show resolved Hide resolved

matheus-vb added 2 commits March 23, 2026 17:35

test both finally-block failure modes

0273628

fix test decorator order

8aa2b8d

matheus-vb requested a review from a team March 24, 2026 00:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(flags): fix cohort version update#51994

fix(flags): fix cohort version update#51994
matheus-vb wants to merge 4 commits intomasterfrom
matheus-vb/fix-version-update

matheus-vb commented Mar 23, 2026 •

edited

Loading

Uh oh!

greptile-apps bot commented Mar 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

matheus-vb commented Mar 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Changes

How did you test this code?

Uh oh!

greptile-apps bot commented Mar 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

matheus-vb commented Mar 23, 2026 •

edited

Loading