New troubleshooting docs
Created by: beyang
I tried to catalog all the troubleshooting recipes/procedures we use to diagnose various types of issues (e.g., performance, non-performance, repro-able, non-repro-able). I'd like for this to serve 2 purposes:
- Used by Sourcegraphers: We can follow this guide ourselves, so as to be more principled/standard in our approaches. We should update this guide as we improve observability in Sourcegraph. This guide can also be used to onboard engineers who are new to dealing with customer issues.
- Used by customers: We can refer customers to these docs, so that issue reports become higher quality and we can cut down on costly back-and-forth in order to collect basic information needed to diagnose issues.
Are there any tools / procedures I missed that people have used to diagnose and fix customer issues?