Skip to content

Determine how release statistics should be stored #30

@preaction

Description

@preaction

Presently, the per-release summary statistics are stored in two tables: release_data and release_summary. These two tables have the exact same schema, but slightly different uses:

  • The release_data table stores one row per test report. One of the pass, fail, na, unknown columns will have a 1 in it.
  • The release_summary table stores one row per distribution version. The pass, fail, na, and unknown columns will have the count of each test report grade.

In essence, the release_summary table is the sum of all the related release_data rows (this is also technically a duplication of the cpanstats table (which, technically is a duplication of test_report table with data extracted from the JSON)).

Now that we have a dedicated database server with a few more CPU cycles than we had previously, we can look at how we store this data: Do we need the intermediate state of the release_data table, or can we just store the release_summary? Or, should we avoid the further step of summing the values and storing them in release_summary and just keep release_data? Or can we get rid of these tables entirely and just build this data on-the-fly from cpanstats?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions