Determine how release statistics should be stored

Presently, the per-release summary statistics are stored in two tables: `release_data` and `release_summary`. These two tables have the exact same schema, but slightly different uses:

* The `release_data` table stores one row per test report. One of the `pass`, `fail`, `na`, `unknown` columns will have a `1` in it.
* The `release_summary` table stores one row per distribution version. The `pass`, `fail`, `na`, and `unknown` columns will have the count of each test report grade.

In essence, the `release_summary` table is the sum of all the related `release_data` rows (this is also technically a duplication of the `cpanstats` table (which, technically is a duplication of `test_report` table with data extracted from the JSON)).

Now that we have a dedicated database server with a few more CPU cycles than we had previously, we can look at how we store this data: Do we need the intermediate state of the `release_data` table, or can we just store the `release_summary`? Or, should we avoid the further step of summing the values and storing them in `release_summary` and just keep `release_data`? Or can we get rid of these tables entirely and just build this data on-the-fly from `cpanstats`?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Determine how release statistics should be stored #30

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Determine how release statistics should be stored #30

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions