monitoring: ratios for {un}indexed_search_request_errors, frontend_internal_api_error_responses
Created by: bobheadxi
Convert more alerts to ratios, in an effort to provide more meaningful feedback on larger instances (most notably Sourcegraph Cloud). See https://github.com/sourcegraph/sourcegraph/pull/12756 for related rationale, but the tl;dr is to make these metrics easier to interpret in terms of "how much of total traffic is being affected"
This closes #12865 (closed) - there are some other noisy alerts that might be caused by a misconfiguration, addressed in https://github.com/sourcegraph/sourcegraph/pull/12869
Breakdown of current frequencies of the following alerts - these alerts are all pretty frequently firing
-
indexed_search_request_errors
- fired capture-groups-ordering times in the past 7 days. comparison -
unindexed_search_request_errors
- fired ~70 times in the past 7 days comparison -
frontend_internal_api_error_responses
- fired ~194 times in the past 7 days, though this frequency is related to an incident earlier this week where the frontend internal API was accidentally made inaccessible. comparison