Skip to content

GitLab

Explore

Sign in
Register

Improve client error tracking of the Sourcegraph application: Tracking issue

Created by: valerybugakov

Problem to solve

Today, we lack processes around client-side error monitoring, error reporting, and prioritizing and fixing runtime errors. Consequently, it can be difficult for Sourcegraph engineers to see, understand, and fix problems in the Sourcegraph application, which has a negative impact on engineering teams' effectiveness and, ultimately, on our customers. For the observability of on-prem instances, distributed traces, and other more advanced observability efforts, we'll follow the lead of the DevX team.

Measure of success

We have a robust process for runtime error monitoring on the client, and it's documented in the handbook.
Relevant services and tools are configured to support the error monitoring process documented in the handbook.
An automated notification process lets the right people know there's an issue, and we have documentation that tells them how to address it.

Solution summary

Evaluate the relative advantages and disadvantages of Sentry vs Datadog and determine the best path forward
Improve Sentry configuration to make production error debugging easier.
Collaborate with the DevX team on proxying client events through our backend.
Introduce guidelines on error handling in client applications.

What we're not doing right now:

Explore ways to allow on-prem clients to share logs with the Sourcegraph team to speed up debugging.
Extend Open Telemetry traces into the frontend, so that client teams can easily collect a sequence of events to debug client issues
Explore what src debug can do today in regard to traces, and see if it can be used to export client log events as well

Artifacts:

RFC 501 REVIEW: Runtime error monitoring

What specific customers are we iterating on the problem and solution with?

Internal Sourcegraph developers

Impact on use cases

Delivery plan

Tracked issues

@unassigned

https://github.com/sourcegraph/sourcegraph/issues/26580
https://github.com/sourcegraph/sourcegraph/issues/26583
https://github.com/sourcegraph/sourcegraph/issues/26585

@plibither8: 3.00d

https://github.com/sourcegraph/sourcegraph/issues/26579 1.00d
https://github.com/sourcegraph/sourcegraph/issues/26581 1.00d
https://github.com/sourcegraph/sourcegraph/issues/26582 1.00d

Completed

(🏁 37 days ago) https://github.com/sourcegraph/sourcegraph/issues/26578
(🏁 21 days ago) https://github.com/sourcegraph/sourcegraph/issues/39899
(🏁 20 days ago) https://github.com/sourcegraph/sourcegraph/pull/40001

@valerybugakov

Completed

(🏁 287 days ago) https://github.com/sourcegraph/sourcegraph/pull/27419
(🏁 283 days ago) https://github.com/sourcegraph/sourcegraph/pull/27582
(🏁 282 days ago) https://github.com/sourcegraph/sourcegraph/issues/27742 (PRs: ~~#27743~~)
(🏁 248 days ago) https://github.com/sourcegraph/sourcegraph/pull/29242

Assignee Loading

Time tracking Loading