Skip to content

monitoring: ratios for {un}indexed_search_request_errors, frontend_internal_api_error_responses

Administrator requested to merge monitoring/ratios into main

Created by: bobheadxi

Convert more alerts to ratios, in an effort to provide more meaningful feedback on larger instances (most notably Sourcegraph Cloud). See https://github.com/sourcegraph/sourcegraph/pull/12756 for related rationale, but the tl;dr is to make these metrics easier to interpret in terms of "how much of total traffic is being affected"

This closes #12865 - there are some other noisy alerts that might be caused by a misconfiguration, addressed in https://github.com/sourcegraph/sourcegraph/pull/12869

Breakdown of current frequencies of the following alerts - these alerts are all pretty frequently firing

  • indexed_search_request_errors - fired ~271 times in the past 7 days. comparison
  • unindexed_search_request_errors - fired ~70 times in the past 7 days comparison
  • frontend_internal_api_error_responses - fired ~194 times in the past 7 days, though this frequency is related to an incident earlier this week where the frontend internal API was accidentally made inaccessible. comparison

Merge request reports

Loading