Extreme HTTP Performance Tuning: 1.2M API req/s on a 4 vCPU EC2 Instance

In this talk I will walk you through the performance tuning steps I took to serve 1.2M JSON requests per second from a 4 vCPU c5 instance, using a simple API server written in C. At the start of the journey, the server was capable of a very respectable 224k req/s with the default configuration. Along the way I made extensive use of tools like FlameGraph and bpftrace to measure, analyze, and optimize the entire stack, from the application framework to the network driver, all the way down to the kernel. I began this wild adventure without any prior low-level performance optimization experience, but once I started going down the performance tuning rabbit hole, there was no turning back. Fueled by curiosity, a willingness to learn, and relentless persistence, I was able to boost performance by over 400% and reduce p99 latency by almost 80%.

20 Minutes
Marc Richards

Marc Richards, Performance Engineer at Talawah Solutions

Marc is the resident Jack of All Trades at Talawah Solutions, a cloud computing consultancy that has served clients across the globe. A generalist at heart, Marc has worn a number of hats over the years, ranging from Analyst to Architect. Most recently, he has been indulging his passion for performance engineering in his attempts to explore the limits of HTTP performance in the cloud.

Register for 2 days of keynotes & interactive tech talks on all things P99.

Virtual Event

October 19-20, 2022
