insights: over-reported numbers from backend
Created by: slimsag
Whoopsies, I made a mistake here and the numbers reported by the backend are wrong in the 3.25 release - over-reported by ~2x roughly depending on the situation. They aren't recorded wrong, but are queried wrong.
I originally added the aggregation behavior in https://github.com/sourcegraph/sourcegraph/pull/18506 - and did so on the assumption that we store 1 data point every 12h and thus SUM
ing all data points over a 12h span would give the accurate count.
@efritz called out this might be wrong, and I created https://github.com/sourcegraph/sourcegraph/issues/18510 which signals an issue with this type of SUM
aggregation during periods of frequent service restarts - but I failed to realize there were two other edge cases:
- If the 12h aggregation window does not align on the dot with the interval of recorded data, then we will sometimes get 2 of the 12h data points
SUM
'd together. - The above can also be affected by whether or not the data point is recorded exactly on the 12h interval, or is delayed a bit: this could happen due to the search taking e.g. 59s to complete - or in the future by nature of how the data back-filler will operate on a different interval schedule than the regular enqueuer.
SUM
was a bad choice, likely need MAX
.