Lightstep from ServiceNow Logo

Products

Solutions

Documentation

Resources

Lightstep from ServiceNow Logo
< all blogs

Use Span Links in Lightstep

One of the things that I’ve enjoyed as part of my personal Observability JourneyObservability Journey has been learning about different aspects of OpenTelemetry. This week, I got to dig into a concept that was totally new to me: Span LinksSpan Links. I have to admit that this one was a tricky one to wrap my head around, because the definitions out there are um…confusing. So I made it my mission to better understand Span LinksSpan Links, so that I can tell y’all what they’re all about, and how they relate to Lightstep ObservabilityLightstep Observability. Are y’all ready for this? Let’s do it!

A Tale of Two Services

To better understand Span Links, let’s start with an example.

Suppose that you have an event-drivenevent-driven application. As part of this application, you have a service that adds a job to a queue. The jobs being added to the queue are executed asynchronously. One such job is a batch commit job. The “add job to queue” service and the “batch commit” service are each represented by a TraceTrace.

Here’s a handy little visualization, for your convenience.

Two Traces part of the same transaction

We know that the “add job to queue” and the “batch commit” services are related, since the “batch commit” service was queued up by the “add job to queue” service. Unfortunately, we are faced with a problem.

Our “batch commit” service runs asynchronously. This means that when the “add job to queue” service queues up “batch commit”, the work of “add job to queue” is done. This means that “add job to queue” service has no frickin’ clue as to when the “batch commit” service actually starts. Well if that’s the case, how could you even link the two??

To add insult to injury, we’ve started seeing some performance issues in our app. (Cue panic!) And although we are getting some insights about our services from the individual Traces, we have a missing link, if you will, and without it, we don’t have all the visibility into what’s going on with our system at a higher level. Okay, we know that in principle, there’s a connection between these two Traces! Which begs the question: is there a mechanism by which we can link these two Traces together? Um…Is the sky blue? It sure is. Are llamas awesome? Yes, yes they are. Does OpenTelemetryOpenTelemetry have our backs? Most definitely yes!

And so, yes, we can link our above Traces, thanks to a lovely OpenTelemetryOpenTelemetry concept called a Span LinkSpan Link. A Span LinkSpan Link allows us to create a relationship between two SpansSpans that are causally relatedcausally related. A causal relationship is one whereby one event causes another. In our case, the “add job to queue” service spawns a “batch commit” service, which will run at some point in time.

Or, in super duper layman's terms, when you have asynchronous processing, a Span LinkSpan Link gives the impression that the asynchronous process happened synchronously.

So with Span LinksSpan Links, our two services now look like this:

Two Traces part of the same transaction are causally-linked

Yay – they’re connected! Thank you Span Links!

Okay…so Span Links come to the rescue, and all is well with the world of ObservabilityObservability and OpenTelemetryOpenTelemetry…amirite? Well...kind of?

So here’s the rub. While Span Links are a thing in OpenTelemetry, not all Observability vendors support them. This means that even if you instrument your code to use Span Links, if the vendor back-end has no way to render them, then it’s pretty much the same as not using Span Links at all, because you lose out on that added bit of visibility that Span Links give you. Kinda sucks, doesn’t it? For example, at the time of this writing, in spite of requests from users, vendors such as NewRelic (see herehere) and Datadog (see herehere, herehere, and herehere) don’t support Span Links. I did a little quickie searchquickie search on Span Links for Splunk, and couldn’t find any mention of Span Links.

Luckily, Lightstep supports Span Links. As co-creatorsco-creators and active contributorsactive contributors to OpenTelemetry, we’re "all in" on OpenTelemetry, and we want our product to reflect that by supporting OpenTelemetry features like Span Links. We also know that this is a feature that’s important to our customers, and we want to say that we’re listening.

Dr. Frasier Crane I'm Listening Meme

Okay, so let’s see what Span Links look like in LightstepLightstep. In the Before Times, Lightstep had no way of surfacing Span Links at all. Which looked something like this:

Lightstep Observability without span link support

Booo! This meant that, going back to the earlier example with our “add job to queue” and “batch commit” services, we:

  • Had no way to understand end-to-end performance of a particular request

  • In looking at a slow batch commit, we didn’t know which writes were affected

💩

Fortunately, that’s no longer the case!! Now, with Span Links enabled in our UI, it means that we now have this:

Lightstep Observability now has Span Link support

Rejoice, for we now have Span Links!

Final Thoughts

As we’ve learned today, Span LinksSpan Links are pretty handy, because they give us the ability to show the relationship between causally-related processes. This is especially handy when it comes to event-driven architectures, allowing us to relate one or more asynchronous services with the service that spawned them. Lightstep now supports Span Links in its UI, which means that you now have a way to visualize and troubleshoot these causal relationships, giving you more power and insight to help you answer the question, “Why is this happening?Why is this happening?

We invite you to give Span Links in Lightstep a try to see for yourself, and tell us what you think. Connect with us on the Lightstep Community DiscordLightstep Community Discord, or get in touch by e-maile-mail. Hope to hear from y’all!

Now, please enjoy this picture of a lovely alpaca.

Alpaca Peering Through Wooden Slats

Peace, love, and code. 🦄 🌈 💫

July 28, 2022
5 min read
OpenTelemetry

Share this article

About the author

Adriana Villela

Adriana Villela

Read moreRead more

From Day 0 to Day 2: Reducing the anxiety of scaling up cloud-native deployments

Jason English | Mar 7, 2023

The global cloud-native development community is facing a reckoning. There are too many tools, too much telemetry data, and not enough skilled people to make sense of it all.  See how you can.

Learn moreLearn more

OpenTelemetry Collector in Kubernetes: Get started with autoscaling

Moh Osman | Jan 6, 2023

Learn how to leverage a Horizontal Pod Autoscaler alongside the OpenTelemetry Collector in Kubernetes. This will enable a cluster to handle varying telemetry workloads as the collector pool aligns to demand.

Learn moreLearn more

Observability-Landscape-as-Code in Practice

Adriana Villela, Ana Margarita Medina | Oct 25, 2022

Learn how to put Observability-Landscape-as-Code in this hands-on tutorial. In it, you'll use Terraform to create a Kubernetes cluster, configure and deploy the OTel Demo App to send Traces and Metrics to Lightstep, and create dashboards in Lightstep.

Learn moreLearn more
THE CLOUD-NATIVE RELIABILITY PLATFORM

Lightstep sounds like a lovely idea

Monitoring and observability for the world’s most reliable systems