Lightstep from ServiceNow Logo

Products

Solutions

Documentation

Resources

Lightstep from ServiceNow Logo
< all announcements

Lightstep adds complete system context to PagerDuty alerts

“Now developers automatically have PagerDuty on-call details inside of a pull request, alongside system health details, at their fingertips in one screen.” Steve Gross, Sr. Director, Strategic Ecosystem Development at PagerDuty

There is a lot of noise surrounding the term “Observability”. While vendors and pundits debate three pillars, Lightstep has partnered with PagerDutyPagerDuty, to ensure software teams can move from context within an incident to quickly understand and determine root cause. Together we’re augmenting incident response solutions for pre-production scenarios.

Today, when a developer gets an early-morning notification and it’s unfortunately a major incident, they immediately want to know the context surrounding that incident. Lightstep adds extensive insights and correlation detail for the production system to PagerDuty's incident response workflow. Given the rich Lightstep and PagerDuty data-sets and the context they bring, we saw an opportunity to help developers understand an incident even before opening the runbook.

The moment before you hit Merge

Both alert and observability context can also provide relevant insights just before developers make an important code change to a production system. For example, when service owners working in GitHub are about to merge a pull request that has passed code review, they are likely missing important information without switching between different solutions. They don’t have access to product health context, and they don't necessarily know who is on-call and responsible for the service in production. For Lightstep and PagerDuty, this provides an opportunity to ask and answer “Is the code ready to deploy?”

Observability - PagerDuty

Recently Lighstep published the Lightstep Pre-Deploy Check GitHub ActionLightstep Pre-Deploy Check GitHub Action, providing an opinionated view of the health of the whole system before developers merge their service’s code, inside a pull request. Automatically surfacing complementary data from Lightstep and PagerDuty, just before a merge is initiated, helps teams ship move quickly and reliably. Developers gain additional context: who owns which service, information about the on-call team, and even an immediate view of system health and performance via a Lightstep SnapshotLightstep Snapshot.

Github + Lightstep

If issues are surfaced by the Action, the developer has what’s needed to investigate before clicking the merge button. This is very different from a production issue or ongoing incident. The Action gives the developer visibility to the grey area where latency might be slightly higher although the customer experience is not adversely impacted yet. The developer now has all the context needed, including the name of the person on-call for the service, before they decide the system is all clear to deploy new code.

Adding more context with a PagerDuty change event.

Lightstep brings context to services in PagerDuty using the new Change Events APIChange Events API. The Action detects issues with the production system, and generates a Change Event. In addition to customized messages (i.e.”Lightstep Pre-deploy Check failed”), the Action attaches metadata: the pull request and a Lightstep Snapshot.

PagerDuty + Lightstep
PagerDuty to Lightstep Snapshot

The Incident Response teamIncident Response team now has real-time access to all the telemetry for a production system at the time the code merged all the traces, metrics and correlations presented in a easy-to-consume UI that includes a service diagram. With Lightstep Pre-Deploy Check and the PagerDuty Change Event, developers and Incident Response teams have more control, and a simple and clear way to see all the interactions between what they are developing, deploying, and then investigating, when something inevitably goes wrong.

How can I try out these new features?

Interested in joining our team? See our open positions herehere.

October 23, 2020
3 min read
Announcements

Share this article

About the author

Fran Thorpe
Announcements

Transform ServiceNow workflows with Service Graph Connector for Observability - Lightstep

Andrew Gardner | Dec 20, 2022

The Service Graph Connector for Observability - Lightstep is the bridge between IT Operations and DevOps teams. When combined with ITOM Visibility, it provides organizations with a complete, end-to-end view of their entire cloud estate.

Learn moreLearn more
Announcements

Evolving our incident response strategy

Lightstep | Nov 2, 2022

Lightstep’s Incident Response offering will be sunset effective January 31, 2023. Current customers may continue to use the service until then. Lightstep Observability will not be affected.

Learn moreLearn more
Announcements

Change Intelligence-In-Context

Rakesh Patel | Oct 26, 2022

Lightstep’s latest announcement reduces mean time to resolution and drives proactive performance improvements by enabling analysis - in-context - during your troubleshooting journey.

Learn moreLearn more
THE CLOUD-NATIVE RELIABILITY PLATFORM

Lightstep sounds like a lovely idea

Monitoring and observability for the world’s most reliable systems