Distribution: 3.19 Tracking issue
Created by: pecigonzalo
Plan
Support new and existing deployments
This is an ongoing expense; we anticipate it taking no more than 10d of work spread across the entire team.
Reduce upgrade overhead
Upgrading Kubernetes deployments requires customers to spend a lot of engineering time reconciling our released Kubernetes manifests with their fork, as documented in RFC-141.
We will finish the Dhall investigation and make a decision by the end of 3.19.
- @ggilmore
- @uwedeportivo
Increase our e2e test frequency
To increase our release cadence, we need to be able to run e2e tests more frequently. This is currently not possible because our CI infrastructure makes the tests unreliable.
- @slimsag
- @davejrt
- @pecigonzalo
Support per-team alerts
To allow teams to support and monitor the services and features they ship, we need to be able to route alerts to the relevant teams as described in RFC-189.
- @bobheadxi
- @davejrt
- @slimsag
Availability
Period is from July 20th to August 19th (23 working days). Please write the days you won't be working and the number of working days for the period.
- Dave: 22d (off July 20)
- Robert: 23d
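The working-day count for the period can be sanity-checked with a short script. This is a minimal sketch: the helper name `working_days` is hypothetical, the year 2020 is an assumption (the issue only gives month and day), and public holidays are ignored.

```python
from datetime import date, timedelta


def working_days(start: date, end: date) -> int:
    """Count Monday-Friday days from start to end, inclusive (holidays ignored)."""
    total_days = (end - start).days + 1
    return sum(
        1
        for offset in range(total_days)
        if (start + timedelta(days=offset)).weekday() < 5  # 0-4 are Mon-Fri
    )


# Assumption: the period falls in 2020; the issue itself states no year.
period = working_days(date(2020, 7, 20), date(2020, 8, 19))
print(period)      # 23
print(period - 1)  # 22, the availability of someone taking one day off
```

Subtracting each person's days off from the period total gives the per-person availability listed above.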
Workload
@bobheadxi: 2.50d
- Approved: Proposal: RFC-189: Support per-team alerts and on-call rotations #12010
- monitoring: migrate existing alert rules to generator #12117
- Dogfood the monitoring we ship with Sourcegraph #5370 2d
- cadvisor: investigate collecting IO metrics #12163
- Better account for peaks / max resource usage in monitoring #12032 0.5d
- Make license check not fail when our node version is upgraded #12318
- monitoring: generate alerts as native prometheus alerts, let alert_count depend on alert rules #12336
- sourcegraph/server:3.18 provisioning indicators panel missing #12421 🐛
- deploy-sourcegraph: prometheus missing in upmetric #12482
- monitoring: no easy way to trigger alerts for testing #12423
- monitoring: link back to grafana service dashboards in alert notifications #12235
- monitoring: better advice for alerts that do not have an entry in alert_solutions #12236
- frontend: hard errors alerts as ratio/percentage instead of absolute-value #12158
- monitoring: change relevant hard threshold alerts to ratio-based alerts #12865
- monitoring: index_queue_growth_rate firing without hitting threshold #12868 🐛
- monitoring: remove custom alertmanager from cloud #12160
- monitoring: difficult to understand 7d provisioning panel #12692

- Bare-metal Buildkite agents capable of running Docker and VMs #12101
- deploy-sourcegraph-dhall: Add CI job to generate diff between master and feature branch #12716
- baremetal buildkite agent networking / instability issues #12996
@daxmc99
- Sourcegraph.com - add redis-store & precise-code-intel-bundle-manager snapshotting #10450
@efritz
- sourcegraph/customer #72 🐛 👩
@ggilmore
- deploy-sourcegraph-dhall: incrementally migrate sourcegraph.com services over to dhall #12113
- [draft] deploy-sourcegraph-dhall: implement migrate-to-non-root overlay #12283
- deploy-sourcegraph-dhall: grafana: implement new configuration logic for k8s.sgdev.org #12109
- deploy-sourcegraph-dhall: frontend: implement new configuration logic for k8s.sgdev.org #12105
@keegancsmith
- sourcegraph/customer #69 👩
@pecigonzalo
@slimsag: 7.50d
- sourcegraph/customer #62 2d 👩
- Formalize managed instances #12495 5d
- sourcegraph/customer #73 👩
- sourcegraph/customer #53 0.5d 👩
- sourcegraph/customer #66 👩
- sourcegraph/customer #74 👩
- sourcegraph/customer #85 👩
- sourcegraph/customer #90 🐛 👩
- Run e2e tests on bare-metal Buildkite agents on every commit to master (non-blocking) #12339
- Run e2e "regression" tests on bare-metal Buildkite agents on every commit to master (non-blocking) #12340
- distribution: add monitoring architecture page #1221 :shipit:
- distribution roadmap #1104 :shipit:
- Document when to introduce new services or not #5487 :shipit:
@uwedeportivo: 8.00d
- sourcegraph/customer #65 👩
- deploy-sourcegraph-dhall: symbols: implement generate reading from config #12076 0.5d
- deploy-sourcegraph-dhall: searcher: implement generate reading from config #12075 0.5d
- deploy-sourcegraph-dhall: replacer: implement generate reading from config #12074 0.5d
- deploy-sourcegraph-dhall: repo-updater: implement generate reading from config #12073 0.5d
- deploy-sourcegraph-dhall: query-runner: implement generate reading from config #12072 0.5d
- deploy-sourcegraph-dhall: precise-code-intel: implement generate reading from config #12071 0.5d
- deploy-sourcegraph-dhall: postgres: implement generate reading from config #12070 0.5d
- deploy-sourcegraph-dhall: jaeger: implement generate reading from config #12069 0.5d
- deploy-sourcegraph-dhall: github-proxy: implement generate reading from config #12066 0.5d
- deploy-sourcegraph-dhall: cadvisor: implement generate reading from config #12065 0.5d
- ci-db-backcomp.sh is not branch-aware #12302
- sourcegraph/customer #57 3d 👩