New Relic's largest monolith handles 200k req/min and communicates with more than 40 external services and 11 MySQL databases; with that many dependencies, it should be down constantly. Being mindful about what we alert on, and alerting on the right things, has been critical for us. This talk will cover a process that has worked for us: identifying trustworthy data, refining alert conditions, and deciding which kinds of checks to page on.