-
Notifications
You must be signed in to change notification settings - Fork 5
Description
Presently, the per-release summary statistics are stored in two tables: release_data and release_summary. These two tables have the exact same schema, but slightly different uses:
- The
release_datatable stores one row per test report. One of thepass,fail,na,unknowncolumns will have a1in it. - The
release_summarytable stores one row per distribution version. Thepass,fail,na, andunknowncolumns will have the count of each test report grade.
In essence, the release_summary table is the sum of all the related release_data rows (this is also technically a duplication of the cpanstats table (which, technically is a duplication of test_report table with data extracted from the JSON)).
Now that we have a dedicated database server with a few more CPU cycles than we had previously, we can look at how we store this data: Do we need the intermediate state of the release_data table, or can we just store the release_summary? Or, should we avoid the further step of summing the values and storing them in release_summary and just keep release_data? Or can we get rid of these tables entirely and just build this data on-the-fly from cpanstats?