Usually, when an API call is slow, we developers blame ourselves and our code: we held a lock too long, used a blocking operation, or wrote an inefficient query. But the simple picture of latency as “the time a server takes to process a message” often hides a great deal of end-to-end complexity. Debugging tail latencies requires unpacking the abstractions we normally ignore: virtualization, hidden queues, and network behavior.
In this talk, I’ll describe how developers can diagnose more sources of delay and failure by building a broader, more realistic understanding of networked services. I’ll walk through real-world cases in which high end-to-end latency or elevated failure rates were caused by factors we ordinarily don’t even measure, including TCP SYN retransmission, virtualization on the client, and surprising behavior from AWS load balancers. Unfortunately, many measurement techniques cover only the portion of the request path most directly under developer control. Developers can do better by comparing multiple measurements, applying Little’s law, investing in eBPF probes, and paying attention to the network layer.
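To make the Little’s-law point concrete, here is a minimal sketch (all numbers are hypothetical) of comparing what the server-side measurement reports with what the edge actually observes:

```python
# Little's law: L = lambda * W
# L      = average number of requests in flight
# lambda = throughput (completed requests per second)
# W      = average time a request spends in the system
# Comparing the L implied by server-side timing with the L observed
# at the edge reveals time spent in queues the handler never measures.

throughput_rps = 2_000        # lambda, assumed for illustration
server_latency_s = 0.020      # W as measured inside the request handler (assumed)
observed_in_flight = 120      # L observed at the load balancer (assumed)

# What the server's own timing would predict.
expected_in_flight = throughput_rps * server_latency_s
print(f"in-flight implied by server timing: {expected_in_flight:.0f}")
print(f"in-flight observed at the edge:     {observed_in_flight}")

# Back out the end-to-end wait clients actually experience: W = L / lambda.
end_to_end_latency_s = observed_in_flight / throughput_rps
print(f"implied end-to-end latency: {end_to_end_latency_s * 1000:.0f} ms")
```

In this made-up scenario, the handler accounts for only 20 ms of the roughly 60 ms that requests actually spend in flight; the remaining 40 ms lives in queues that server-side timing never sees.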
Finding and fixing API performance issues faster ultimately means understanding the entire stack: the client, your code, and the underlying infrastructure.