Virtual Event | OCTOBER 23 + 24, 2024
Explore Rust, C++, Go, event streaming architectures, distributed databases, Linux kernel, observability, K8s & more
Follow us on Twitter @p99conf for announcements!
Working on Fluke: async Rust HTTP1+2 with io_uring & kTLS, sponsored by fly.io & Shopify. Unlike others, Fluke is built from the ground up to fully leverage io_uring, minimizing syscalls with kTLS. A promising future for proxies & apps if a stable API emerges. #Rust #io_uring #kTLS
I write (and shoot) deep dives about Rust, performance, operating system stuff, videography, and whatever else I feel my wonderful patrons & sponsors will like learning about. I'm extremely lucky to do this full time for now.
Tackling “noisy neighbor” issues in multi-tenant setups! At Netflix, we use eBPF to monitor and mitigate excessive CPU usage in real-time. Learn how we instrument the Linux scheduler, optimize eBPF, and maintain high performance. Get actionable insights for your infrastructure. #DevOps #eBPF
Software engineer specializing in observability, performance, and reliability. Passionate about low-level languages, focusing on Rust and eBPF technology. Creator of bpftop.
Think Rust is just about performance and safety? Let’s talk productivity. Last year, Rust’s library ecosystem needed work. What’s changed? I’ll dive into the current state of Rust libraries for database access, focusing on ergonomics and ease of use. #RustLang #DatabaseDev
Carl Lerche is a Principal Engineer at AWS. He is best known for his open source Rust libraries, primarily Tokio, the asynchronous I/O runtime for Rust.
Data replication ensures high availability—reliable, consistent, and timely access. Dive into the tough problems often skipped: reliable backfills, schema changes, data validation, failure resistance, and parallel writes. Learn how Shopify achieves ~5M rows/sec with <500ms p99 lag. #DataReplication #TechTalk
Cameron works on databases at Shopify where he works on distributed systems and MySQL. Originally from Vancouver BC, he moved to Montreal QC to compete a BEng. Electrical Engineering at McGill University. In his free time, he tries to ski as much powder as possible.
PostgreSQL continues to evolve with vector search extensions like pgvector and pgvecto.rs. We’ll explore recent benchmarks comparing vector search performance across various datasets and configurations, highlighting PostgreSQL’s adaptability in modern use cases. #PostgreSQL #VectorSearch
Daniel started his career as PhD student in the area of cloud computing with a focus on distributed databases in the cloud. Further interests cover cloud orchestration, model-driven engineering, and performance evaluations of distributed systems. After completing his PhD, Daniel has co-founded the Benchmarking-as-a-Service platform benchANT where he is responsible for the product development.
Dive into the future of cloud dev with Distributed Async Await. Simplify your code and conquer the chaos of distributed systems. Learn the ins and outs of async-await, functions, promises, and more. Transform your approach to scalable, reliable, and delightful cloud apps. 🚀 #CloudDev #AsyncAwait
Dominik Tornow is the Founder & CEO at Resonate HQ.
Ashley Williams is a former member of the Rust Core team. She is the founder of the Rust Foundation and served as its first executive director. she has served as a leader or member of the Rust crates.io, infrastructure, and community teams, WebAssembly working group, and the RustBridge and Increasing Rust's Reach programs. Ashley has been the original author and lead maintainer of many popular developer tools in the Rust ecosystem, from Cloudflare's wrangler, to wasm-pack, cargo-generate, and several others. previously, ashley was a leader of the Node.js Community Committee, founded the NodeTogether educational initiative, and served as the first Individual Membership Director of the Node.js Foundation (now OpenJS Foundation). Ashley has worked at npm, Mozilla, and Cloudflare. ashley studied Neuroscience and Philosophy and was originally a middle school science teacher in NYC.
Database queries vary widely—from milliseconds to hours. Optimizing concurrency is a delicate balance of CPU, memory, and stability. Bad design can lead to high latency or crashes. Join us to learn how we designed ScyllaDB’s query queue to handle these challenges. #Database #ScyllaDB
Avi Kivity, CTO of ScyllaDB, is known mostly for starting the Kernel-based Virtual Machine (KVM) project, the hypervisor underlying many production clouds. He has worked for Qumranet and Red Hat as KVM maintainer until December 2012. Avi is now CTO of ScyllaDB, a company that seeks to bring the same kind of innovation to the public cloud space.
Gunnar Morling dives into the tricks that the fastest 1BRC solutions used to process the challenge’s 13 GB input file within less than 2 secs — including parallelization and efficient memory access, optimized parsing routines using SIMD and SWAR, and custom map implementations.
Gunnar Morling is a software engineer and open-source enthusiast by heart. He is leading the Debezium project, a platform for change data capture (CDC). He is a Java Champion, the spec lead for Bean Validation 2.0 (JSR 380) and has founded multiple open source projects such as Deptective and MapStruct. Prior to joining Red Hat, Gunnar worked on a wide range of Java EE projects in the logistics and retail industries. He's based in Hamburg, Germany.
Historically, business apps use a three-tier architecture. Now, cloud-native architectures and DBMS can be combined, allowing for resilient, cost-effective, and high-performance systems. Learn about DBOS and why “you are doing it all wrong.” #TechTalk
Dr. Stonebraker has been a pioneer of database research and technology for more than forty years. He was the main architect of the INGRES relational DBMS, and the object- relational DBMS, POSTGRES. These prototypes were developed at the University of California at Berkeley where Stonebraker was a Professor of Computer Science for twenty five years. More recently at M.I.T. he was a co-architect of the Aurora/Borealis stream processing engine, the C-Store column-oriented DBMS, the H-Store transaction processing engine, the SciDB array DBMS, and the Data Tamer data curation system. Presently he serves as Chief Technology Officer of DBOS, Inc., Tamr, Inc., and Paradigm4. Professor Stonebraker was awarded the ACM System Software Award in 1992 for his work on INGRES. Additionally, he was awarded the first annual SIGMOD Innovation award in 1994, and was elected to the National Academy of Engineering in 1997. He was awarded the IEEE John Von Neumann award in 2005 and the 2014 Turing Award, and is presently an Adjunct Professor of Computer Science at M.I.T, where he is co-director of the Intel Science and Technology Center focused on big data.
Queues: both a blessing and a curse in computer science. They help predict performance but also signal overload. This talk explores their role in diagnosis, capacity planning, and development using physics concepts and the “hockey-stick” curve. Master queue intuition for better programs. #DevTalk
Dave is a performance engineer and capacity planner, once upon a time for the (late, lamented) Sun Microsystems. He's working as an individual contributor these days, from his home base in Toronto.
Excited to introduce SlateDB, an open-source, cloud-native storage engine. Built as an LSM on object stores like S3/GCS/ABS, it leverages object storage benefits while tackling unique latency and cost challenges. Join us to explore our design decisions and tradeoffs. #DevTalk #SlateDB
Chris Riccomini is a software engineer, startup investor, and advisor, and author with more than 15 years of experience at major tech companies such as PayPal, LinkedIn, and WePay. He has been involved in open source throughout his career and is the author of Apache Samza. He's co-author of The Missing README: A Guide for the New Software Engineer, and writes Materialized View, a weekly infrastructure newsletter.
Building a real-time, low-latency card payments system is a challenge. Join the Amex Payments Network team to learn about their 100% containerized, globally distributed platform powered by Kubernetes. Discover how they tackled latency with HTTP/2, local affinity, and more. #DevOps #Kubernetes
Benjamin Cane is a Distinguished Engineer at American Express, where he plays a pivotal role in the architecture, design, and engineering excellence of the Acquirer and Network Payments Platforms. With a focus on cloud-native technologies and practices, Ben specializes in building mission-critical and high-performance systems. His expertise has been instrumental in driving the evolution of American Express' cloud-native payments platform. Beyond his contributions to American Express, Ben is an active open-source community member and has contributed to various projects.
Analyzing request latency is tough since it’s not always CPU-bound. Many devs give up on CPU profiling, but sampling profilers can help. This talk dives into wallclock profiling and Datadog’s innovative profiling tech to uncover off-CPU activities and unexpected issues. #DevOps #Profiling
Richard has over ten years experience working on systems written in Java, including trading systems, big data, and payments systems. Richard has made a diverse set of open source contributions, is an Apache Pinot committer, and has contributed to OpenJDK and JMC. At Datadog, he has worked on tracing and profiling of applications running on the JVM.
Optimize your services with cost-efficient observability using high-performance sketching tools. Dive into creating sketching tech for various scenarios, making the most out of your resources and enhancing performance. #DevTools #Observability #PerformanceOptimization
Distributed computing and storage engineer for HPC, microservices and ML/Data platforms for many years. Evangelist of type-driven development and category theory for developers.
Introducing Netkit: a new eBPF enhancement replacing veth connections in container networking. Say goodbye to the overhead slowing down container apps. With Netkit, container networking now matches the speed of host networking. Fast, efficient, and ready to deploy. #DevOps #eBPF
Liz Rice is Chief Open Source Officer with eBPF specialists Isovalent, now part of Cisco, who are the original creators of the Cilium project. She was chair of the CNCF's Technical Oversight Committee 2019-2022, and Co-Chair of KubeCon + CloudNativeCon in 2018. She is also the author of Container Security, published by O'Reilly. She has a wealth of software development, team, and product management experience from working on network protocols and distributed systems, and in digital technology sectors such as VOD, music, and VoIP. When not writing code, or talking about it, Liz loves riding bikes in places with better weather than her native London, competing in virtual races on Zwift, and making music under the pseudonym Insider Nine.
Building for low latency is important, but the tips and tricks are often part of developer folklore and hard to discover on your own. This talk shares some of the important latency related patterns you want to know when working on low latency apps.
Pekka Enberg is the Founder & CTO at Turso.
Bryan Cantrill is a software engineer who has spent a quarter of a century at the hardware/software interface. He is the co-founder and CTO of Oxide Computer Company, which is endeavoring to build a rack-scale computer for the post-cloud era. Prior to Oxide he spent nearly a decade at Joyent, a cloud computing pioneer; prior to Joyent, he spent fourteen years at Sun Microsystems, a now-defunct computer company that Bryan's nine-year-old daughter apparently thought was a brewery.
At last year’s P99 CONF, Tanel introduced using eBPF Task State Arrays to track Linux apps’ thread states/activity without built-in instrumentation. This year, he demos “xcapture,” showing MySQL, Postgres, and DuckDB time spent on CPU and in sleep, all without altering source code. #p99conf #eBPF
Tanel Poder is a long-time computer performance geek, working on various complex systems like (Oracle) database clusters, modern Big Data & cloud technologies and anything running on Linux/Unix. He has built and fixed enterprise data systems all around the world, this has also resulted in building a few small-but-very-fun tech companies around better tools and methods. He has two patents in data virtualization space and has realized that this is enough. In addition to his R&D, he occasionally delivers consulting, advisory and training to companies and talks about performance & troubleshooting both at public conferences and his video channels available at his website.
Matklad dives into the implementation of TigerBeetle’s JIT compaction algorithm for LSM, which is highly concurrent and uses all available IO resources without starving normal transaction processing.
Aleksei is a staff software engineer at TigerBeetle. Before that, Aleksei worked on IDE support for the Rust programming language, but TigerBeetle's deterministic, non-allocating Zig codebase proved too hard to resist.
This talk showcases a methodology with examples to break down applications to low-level primitives and identify optimizations on existing compute instances or platform or for offloading specific portions of the application to accelerators or GPU’s. With the increasing use of a combination of CPU, GPU and accelerators/ASIC’s, this methodology could prove increasingly useful to evaluate what kind of compute to use and when.
Ravi is a Sr. Principal Engineer in Intel's Datacenter and AI group leading software engineering collaboration with hyperscale cloud providers. He focuses on performance engineering to address performance bottlenecks, infrastructure overhead and efficiency. Prior to this, he was responsible for the design and implementation of Operating Systems, Configuration Management and Compute Metrics solutions for Intel's internal design computing environment which runs on millions of cores of compute. He is a Lean Six Sigma Green Belt and has approved patent filings related to Energy Efficiency, Distributed Computing and Check-pointing. Ravi has authored numerous whitepapers including IEEE publications, IT@Intel public whitepapers and internal conference papers.
We’re transforming elfo, our Rust actor system, into a distributed mesh of services. Learn how we tackled message serialization, compression, and back-pressure to optimize for high-frequency trading. Dive into our journey to make everything observable and debuggable. #RustLang
Nikita is low-level Rust engineer specialising in databases and distributed systems. Previously, they worked on MongoDB, ClickHouse and YDB, as well as in high-frequency trading sector.
ShareChat’s scaling ML Feature Store to handle 1B features/sec was just the start. Next challenge: cutting costs while keeping quality. Join Ivan & David to explore cloud cost optimization, Kubernetes waste reduction, and autoscaling Apache Flink. Perfect for #ML & #CloudDev. #P99Conf
David Malinge is a Senior Staff Software Engineer at ShareChat.
Maximizing performance goes beyond server-level tweaks. Even low level code, scaling requires more. In this session, learn about “tablets”—a dynamic sharding design at ScyllaDB that optimizes CPU, storage, and elasticity for top-notch performance. #Database #ScyllaDB
Dor Laor is the CEO of ScyllaDB. Previously, Dor was part of the founding team of the KVM hypervisor under Qumranet that was acquired by Red Hat. At Red Hat Dor was managing the KVM and Xen development for several years. Dor holds an MSc from the Technion and a Phd in snowboarding.
Join us as we tackle Gunnar Morling’s One Billion Rows Challenge in Golang! We’ll walk through optimizing a 16GB file from 6 mins to ~12 secs using Go’s performance tools. Perfect for anyone keen on Golang performance tuning! #Golang #PerformanceOptimization #DevTalk
Shraddha is a Senior Software Engineer, Ceph, IBM.
Building a real-time, low-latency card payments system is a challenge. Join the Amex Payments Network team to learn about their 100% containerized, globally distributed platform powered by Kubernetes. Discover how they tackled latency with HTTP/2, local affinity, and more. #DevOps #Kubernetes
Tyler Wedin is a Vice President overseeing Core Platforms Site Reliability Engineering at American Express. He has 15+ years of experience designing, building, and operating critical infrastructure and applications. He is currently responsible for many critical authorizations and financial systems, and he has implemented innovative solutions across release management, resiliency, and observability. Prior to this role, Tyler led the data center network architecture and engineering teams, delivering the software-defined infrastructure stack supporting the company’s hybrid cloud model, and virtualizing and scaling enterprise network services.
Excited to introduce SlateDB, an open-source, cloud-native storage engine. Built as an LSM on object stores like S3/GCS/ABS, it leverages object storage benefits while tackling unique latency and cost challenges. Join us to explore our design decisions and tradeoffs. #DevTalk #SlateDB
Rohan has been working in the software infrastructure space for over 10 years. Most recently he was at Confluent, where he led the team building the cloud platform underpinning KSQL.
Unlock the full potential of database drivers! Dive deep into their design, uncover how they work under the hood, and learn to tweak their performance to meet your needs. Perfect for developers eager to optimize their database interactions. #TechTalk #DatabaseDrivers
* Founding Engineer @poolside, hacking fun AI things at scale * Ex-maintainer of ScyllaDB and Seastar * Ex-Staff @Turso * Maintainer of libSQL * Wrote a few patches for the Linux kernel
In Part 2 of my P99 2023 talk, I’ll dive into practical strategies to enhance our problem-solving skills in the performance space. Building on the basics from Part 1, this session will cover actionable tips and methods to improve our odds. #P99 #PerformanceEngineering #TechTalk
Kerry Osborne is a database performance specialist. He is a founder of Enkitec; an expert model Oracle-focused consulting company that was acquired by Accenture in 2014 (now the Accenture Enkitec Group). He is also a founder of Gluent, a software company which provided transparent connections between many analytic engines (Oracle, Microsoft SQL, Snowflake, BigQuery, Redshift, Hadoop, Synapse, …) Gluent’s IP was acquired by Google in 2022 and the team joined Google to form a specialized group of databases experts (Database Black Belts) that focus on helping enterprise customers and improving Google’s database products by infusing Gluent IP. Kerry has co-authored two performance focused books, Pro Oracle SQL and Expert Oracle Exadata.
Configuring Kubernetes for optimal workload performance is a continuous journey. Best practices can sometimes harm performance. Join us as we share our findings on running SpiceDB, a low-latency authorization system, and what has worked best for us. #Kubernetes #DevOps #SpiceDB
Jimmy Zelinskie is a software engineer and product leader with the goal of empowering the world through the democratization of software through open source development. He's currently the CPO and cofounder of authzed where he's focused on bringing hyperscaler best-practices in authorization software to the industry at large. At CoreOS, he helped pioneer the cloud-native ecosystem by starting and contributing to many of its foundational open source projects. After being acquired by Red Hat, his focus shifted to the enablement and adoption of cloud-native technologies by mature enterprise stakeholders. To this day, he still contributes to cloud-native ecosystem by building the future on top of these technologies and maintaining standards such as Open Container Initiative (OCI).
Discover how Generational ZGC slashed Java app pause times in real-world use! Learn how Apache Pinot tackled scatter-gather tail latencies with Generational ZGC, unveiling Pinot’s low latency query potential. Dive into key configs and features that made it all possible. #Java #ZGC #ApachePinot
Working with Apache Pinot and ClickHouse at Uber.
Zoo is creating a CAD suite that runs in the cloud but feels like it’s local. How? Regional deployment, WebRTC for video streaming & UDP channels for low-latency mouse events, and Nvidia Encoder for fast video. Design big models on low-power machines without lag. #dev #cloudCAD
I've worked on infrastructure and systems across clouds but frequently with Kubernetes. I really enjoy figuring out weird bugs that span across systems and making things fast.
Discover how LinkedIn optimized Apache Pinot’s performance! By using FALF Interning, a home-grown, lock-free method, they cut JVM heap usage by 20% and improved P99 query latencies by 35%. Join us to learn about JXRAY memory analysis, design, and implementation insights. #TechTalk #ApachePinot
Vivek is a Senior Software Engineer at LinkedIn working on developing and maintaining Pinot. He is an Apache Pinot Committer. Prior to LinkedIn, he worked as a Software Engineer for Cohesity, a startup in the data management space where he worked on building the distributed SpanFS filesystem.
I optimized a Python data analytics pipeline, making it 180,000x faster with Rust! Using compiler optimizations, data structures, vectorization, parallelization, and more, I’ll share a case study on these techniques. Join me to explore the tools in a performance engineer’s toolkit.
Will Crichton is an incoming assistant professor at Brown University. His research takes a human-centered perspective on the design of programming languages and developer tools. Will's research has been widely used within the Rust community, and he has published at venues such as POPL, PLDI, OOPSLA, and CHI. Will received his PhD from Stanford University in 2022.
In this talk, we share our journey in making QuestDB, an open-source time-series database, a much faster analytical database, featuring specialized data structures, SIMD-based code, scalable aggregation algorithms, and parallel execution pipelines.
Distributed and high-performance systems gazer. Occasional tech blogger and speaker.
Discover how WebAssembly is revolutionizing cloud computing. Join Thorsten Hans to learn about building serverless apps with Spin, achieving true scale-to-zero, and running them on platforms like Kubernetes with SpinKube. No more long cold starts or over-provisioning. #Serverless #WebAssembly
Thorsten Hans is a Sr. Cloud Advocate at Fermyon. He guides developers and teams through understanding, adopting, and mastering emerging technologies to build reliable software and embrace the next wave of cloud computing. As Microsoft MVP and Docker Captain, he keeps on sharing his experiments and knowledge with the developer community.
Can a generic text encoding library be both versatile and fast? We benchmarked 6+ C, C++, and Rust libraries for Unicode tasks and found ztd.text and cuneicode can match performance titans like simdutf. Discover how to achieve top speed without sacrificing user-friendly APIs! #DevTalk
JeanHeyd "ThePhD" is a student and an organizer for Shepherd's Oasis, LLC. They are the Project Editor for the C Language, and they manage large open-source contributions -- sol2, ztd.text, ztd.cuneicode, and more -- that is used across many industries and academic disciplines. They are currently working towards earning their own nickname, climbing the academic ladder while spending as much time as possible contributing to C and C++ standardization and development. Their newest and biggest project is Unicode for C and C++. They very much love dogs and hopes to have their own in a few years or so. They also like TWRP's "Feels Pretty Good" from the album Together Through Time, and "Truth" from T-SQUARE's album TRUTH.
Optimize your Go code with the powerful pprof tool. Learn how to integrate, access, and interpret pprof metrics, plus best practices for production. Master the interplay between memory and CPU for top-notch performance in your services. #GoLang #Profiling #DevTips
Miriah earned her bachelor's degree in physics from Brigham Young University in 2017, laying a strong foundation for her multifaceted contributions to the field. As a seasoned engineer with 6 years of expertise, her professional journey includes crafting videos, tutorials, and courses, and showcasing her mastery in Go and Data Engineering. As a dynamic speaker and Google Developer Expert in Go, Miriah has delivered talks on Go, Machine Learning, and Data Engineering. As a board member of Forge Foundation Inc. and an organizer of the GoWest Conference, Utah Data Engineering, and Machine Learning Utah meetups, she actively shapes her local tech community.
ScyllaDB teamed up with Memcached to compare how caches and databases handle storage and memory across different scenarios. We’ll dive into ScyllaDB’s row-based cache vs. Memcached’s in-memory hash table and IO handling. #Databases #Caching
Alan runs Cache Forge, supporting enterprises relying on memcached. He has spent decades scaling infrastructure for startups and enterprises: an early employee of Fastly, consultant for users with massive caches, and at various social networks stretching back to LiveJournal, the originator of memcached.
Moving apps to the Edge can complicate performance due to security constraints. Learn how WebAssembly bridges the gap, enabling both speed and security. We’ll share insights from Exograph app migrations, focusing on Wasi advancements and the new component model. #EdgeComputing #WebAssembly
Ramnivas leads the development of Exograph, a declarative approach to GraphQL backend written in Rust. He has led innovation in Spring Framework and Cloud Foundry since their beginning. Ramnivas is the author of AspectJ in Action, the best-selling book on aspect-oriented programming lauded by industry experts for its practical and innovative approach to real-world problems. He has spoken at leading industry conferences, including JavaOne, ScalaDays, SpringOne, and O'Reilly OSCON.
Dive into understanding app latency by exploring the Linux scheduler with ftrace, eBPF, and Perfetto for visualization. Uncover quirks in the CFS scheduler and get a glimpse of the new EEVDF scheduler in recent kernels. #Linux #eBPF #Performance #DevOps
Jason Rahman (B.S. Computer Science Honors, Purdue University) works in the infrastructure, database, and storage space. He is employed at Microsoft where he works on the PostgreSQL team. Previously he was employed at Meta (Facebook) where he worked on large scale distributed file systems and previously on the Database Engineering team. In his spare time, he enjoys photography, road trips, and hiking.
Lock-free programming and precise ultra low latency pipelining between CPU cores.
Discover how to detect subtle memory leaks and regressions in Android apps with a production-focused approach. Learn the key metrics to collect, gather real-world data, and analyze it effectively to optimize performance during A/B testing. Safeguard your features and enhance user experience. #PerformanceTuning
Pavlo is a Google Developer Expert for Android and Kotlin.
Discover powerful search algorithms and their SIMD- and GPU-accelerated implementations for AI-powered semantic search, structure search, or exact & fuzzy full-text search. Boost your search capabilities with cutting-edge tech. #AI #SearchAlgorithms #SIMD #GPU #FullTextSearch
Ash Vardanian is a seasoned researcher in Computer Science and Artificial Intelligence with an astrophysical background. With a keen interest in High-Performance Computing, Ash has developed and maintained numerous Open-Source tools tailored for large-scale data processing, mostly focusing on GPU & SIMD-accelerated Vector and Full-Text Search.
Learned a crucial lesson on read/write latency when fixing a real ScyllaDB issue! Discover how TRIM requests impact NVMe SSDs with XFS online discard enabled. Uncover the problems and explore potential solutions. #ScyllaDB #NVMe #XFS #DevOps #DatabasePerformance
Patryk is a C++ and programming enthusiast whose experience includes the development of a compute runtime for GPUs and contributions to the 5G packet scheduler. Since January 2024, he has been rendering services to ScyllaDB as a remote software engineer.
I've been working in the software industry for many years, mostly as a engineer. More recently with a transition to leadership, I began chasing the Platform dream and started with Observability as a discipline rather than an artifact. I've had a pleasure of working with some of the very best people, who I've shared my journey of learning with.
An in-depth analysis of asynchronous function pointers in Rust, why they aren’t a real thing (compared to normal function pointers) and a performance analysis of each way of constructing them. From Boxed Async functions, to Enum dispatch to StackFutures.
Professional developer turned founder to work full-time on Open Source technology. Developing tooling for load testing and performance analysis, inspired by work in industry and a deep passion for distributed systems, reliability, and 6 years of professional Rust experience.
Join us to dive into GCC and Clang optimizations for C/C++! We’ll explore how x86-64 executes code, use assembly for speed, tackle multi-core programming, and address Spectre mitigations. Learn with microbenchmarks and get insights on when profiler optimizations might fall short. #programming #performance
Alexander is the CEO of Tempesta Technologies, Inc., and is the architect of Tempesta FW, a high-performance and secure open source hybrid of a web accelerator and firewall. Alexander is responsible for the design and performance of several products in the areas of network traffic processing and databases. He designed the core architecture and leaded development of a Web application firewall, mentioned in the Gartner Magic Quadrant '15, the MariaDB temporal data tables and ultra-scalable S3 and NFS servers. Alexander gave talks at CppCon'22, Netdev, SCALE, Linux Conf Australia, MariaDB user conferences, All Things Open, FOSDEM, SeaGL, Percona Live, IBM CASCON, and many other conferences. Alexander is also the author of a very fast lock-free MPMC ring buffer queue, published by the Linux Journal.
DBMSs struggle with OS constraints, but new tech like eBPF can change the game. Join us to explore “user-bypass” designs for high-performance DBMSs with eBPF. Learn about BPF-DB, an embedded DBMS in the Linux kernel, providing ACID transactions and multi-versioned data. #databases #eBPF
Andy Pavlo is an Associate Professor of Databaseology in the Computer Science Department at Carnegie Mellon University. His (unnatural) infatuation with database systems has inadvertently caused him to incur several distinctions, such as VLDB Early Career Award (2021), NSF CAREER (2019), Sloan Fellowship (2018), and the ACM SIGMOD Jim Gray Best Dissertation Award (2014). He was also the CEO & co-founder of the OtterTune database tuning start-up (2020).
Maestro kernel began as a C-based school project and transitioned to Rust for better code quality. Now, it’s in a clean-up and performance enhancement phase. This talk shares key lessons learned along the way. #RustLang #KernelDevelopment #Programming
Creator of Maestro, a Unix-like kernel written in Rust aiming to be compatible-enough with Linux to cover most use cases.
Zoo is creating a CAD suite that runs in the cloud but feels like it’s local. How? Regional deployment, WebRTC for video streaming & UDP channels for low-latency mouse events, and Nvidia Encoder for fast video. Design big models on low-power machines without lag. #dev #cloudCAD
I started learning Rust for fun in 2017. I've now worked full-time with Rust for five years at Cloudflare and Zoo, writing high-performance and high-availability API servers and internet proxies.
Join our session on minimizing latency in self-hosted #ML models in cloud environments. Learn strategies for deploying Deepgram’s speech-to-text models on your hardware, including concurrency limits, auto-scaling, input chunk granularity, and efficient model loading. Optimize your ML inference.
Julia Kroll is an Applied Engineer at Deepgram where she provides engineering and product expertise on speech-to-text and voice AI, enabling developers to use language as the universal interface between humans and machines. She previously worked as a Senior Machine Learning Engineer creating natural-sounding AI voices, following five years at Amazon, where she contributed to machine learning and data engineering for AWS and Alexa. She holds two computer science degrees, a master's from the University of Wisconsin-Madison and a bachelor's from Carleton College. Her interests lie at the intersection of technology, linguistics, and society.
Discover how to boost your software with lesser-known compiler flags and Profile-Guided Optimization (PGO). Learn what PGO is, how it works, its nuances, and see real benchmarks. Ready to delegate optimization to your compiler and gain performance? Join us! #dev #programming #optimization
In the past, I used to be an active C++ user and contributor, but now my coding skills are a bit Rust-y. People at work call me "Solution architect" but I prefer "Confluence && Draw.io engineer". I like performance, crazy about optimizations and making optimizations as friendly as possible for humankind.
ScyllaDB teamed up with Memcached to compare how caches and databases handle storage and memory across different scenarios. We’ll dive into ScyllaDB’s row-based cache vs. Memcached’s in-memory hash table and IO handling. #Databases #Caching
Felipe Cardeneti Mendes is a Technical Director at ScyllaDB.
Struggling with performance issues despite all green dashboards? Experts say you need special skills, but we’ll show you how to tackle these problems yourself. Join us to understand why this happens and how you can take control. #DevTips #PerformanceTuning
Cary Millsap spent the 1990s learning a lifetime’s worth of lessons about software performance as a consultant for Oracle Corporation. In his ten years at Oracle, he personally helped over a hundred customers, and he created an elite 85-person team who have helped hundreds more. He left Oracle in 1999 to grow his family, and he has been an entrepreneur ever since. Cary has educated thousands of professionals through his commitment to writing, teaching, and speaking at public events. His books “Tracing Oracle” and “Mastering Oracle Trace Data” help professionals optimize any Oracle-based application. His newest book, “How to Make Things Faster: Lessons in Performance from Technology and Everyday Life,” is for anybody who is curious about performance and how to improve it.
Moving apps to the Edge can complicate performance due to security constraints. Learn how WebAssembly bridges the gap, enabling both speed and security. We’ll share insights from Exograph app migrations, focusing on Wasi advancements and the new component model. #EdgeComputing #WebAssembly
Brian Sletten is a liberal arts-educated software engineer with a focus on forward-leaning technologies. He is the author of the O’Reilly Media book, “WebAssembly: The Definitive Guide.” His experience has spanned many industries including retail, banking, online games, defense, finance, hospitality, and health care. He has a B.S. in Computer Science from the College of William and Mary and lives in Auburn, CA. He focuses on web architecture, resource-oriented computing, social networking, the Semantic Web, data science, 3D graphics, visualization, scalable systems, security consulting, and other technologies of the late 20th and early 21st Centuries. He is also a rabid reader, a devoted foodie, and has excellent taste in music. If pressed, he might tell you about his International Pop Recording career.
ShareChat’s scaling ML Feature Store to handle 1B features/sec was just the start. Next challenge: cutting costs while keeping quality. Join Ivan & David to explore cloud cost optimization, Kubernetes waste reduction, and autoscaling Apache Flink. Perfect for #ML & #CloudDev. #P99Conf
Ivan is an experienced Software Engineer, ex-Facebook, ex-ScyllaDB, passionate about performance problems in distributed systems. He is leading the effort to build a world-class Feature Engineering framework at ShareChat.
In 2022, I began crafting a tool to visualize Go’s GC in real-time. I’ll dive into the hurdles of extracting data from Go’s runtime, the paths I explored—from internal packages to a Go fork—and the route I took. I’ll share my insights and look into other exciting options. #golang #gc
Aadhav is a software engineer who loves working with Go, databases, and distributed systems. He spends most of his time working on bringing fun ideas/experiments to life, or watching sports.
Discovering performance regressions in modern systems is tough due to inevitable noise. Change Point Detection (CPD) algorithms are gaining traction for tackling this challenge. This talk covers how CPD works and shares examples of real regressions found in open source projects. #DevOps #CPD
Matt is the co-founder and CTO of Nyrkiö, a company that wants to arm every developer with the tools to understand the performance of their code. He has spent over 15 years working on low-level, high-performance systems and was previously the maintainer for the Linux kernel EFI subsystem while at Intel and SUSE. Most recently, Matt has focused on the performance of distributed systems and databases including leading a globally distributed team of engineers that contributed significant performance changes to the Apache Cassandra 5.0 release. He has co-authored papers on performance change detection and distributed systems testing and served on the ACM/SPEC ICPE program committee. Matt can often be found on Twitter, discussing topics such as software performance, benchmarking and statistics.
Get the most out of your CUDA code by understanding how the compiler works.
* Founding engineer @poolside, writing low-level kernels. * Ph.D in Cryptography, focusing on large-scale cryptanalysis. * Ex-intern in the profiling team at Elastic.
Join Peter as he dives into Redis alternatives like Valley, DragonflyDB, and Microsoft Garnet. He’ll cover licensing, features, community support, and performance. Get clear insights into each option and strategize your approach in the evolving Redis landscape. #Redis #Databases
Peter Zaitsev is an entrepreneur and co-founder of Coroot, FerretDB, Percona. As one of the leading experts in Open Source strategy and database optimization, Peter has used his technical vision and entrepreneurial skills to grow Percona from a two-person store into one of the most respected open source companies in the business with over 350 employees. Peter now continues to serve as a board member and advisor to a range of open source startups. Peter is the co-author of the book "High Performance MySQL: Optimization, Backup, and Replication," one of the most popular books on MySQL performance.
Observability is more than metrics and logs—it’s knowing your system’s status without checking under the hood. From QA processes to daily life, learn to enhance awareness and efficiency. Join us to master the principles and see your world with new clarity. #Observability #DevOps
Leandro is a performance testing advocate with K6-Grafana helping everyone to ramp up on their performance practices. He has over 20 years of experience in IT and over 10 in the performance testing practice where he served multiple S&P500 customers all over the USA, Mexico, Canada, Brazil, India, Austria, etc. Author of the popular performance testing blog Señor Performo (www.srperf.com) where he curates a diverse set of learning material for performance testers and engineers. He is the producer and host of the Spanish language version of the PerfBytes podcast and the Señor Performo learning YouTube channels. He is an international public speaker participating in multiple conferences, events and webinars, with keynotes, workshops and multiple talks on his belt. And last, author of "The Hitchhikers Guide To Load Testing Projects", a fun walkthrough that will guide you through the phases or levels of an IT load testing project.
This talk showcases a methodology with examples to break down applications to low-level primitives and identify optimizations on existing compute instances or platform or for offloading specific portions of the application to accelerators or GPU’s. With the increasing use of a combination of CPU, GPU and accelerators/ASIC’s, this methodology could prove increasingly useful to evaluate what kind of compute to use and when.
Harshad Sane is a performance engineer in the Data Center and AI group with a deep technical expertise in system software, memory, and CPU architectures. He specializes in performance monitoring, software optimization, and tool development with focus in the cloud domain. Harshad joined Intel in 2008 after completing his undergraduate degree in Electrical and Telecommunications from College of Engineering and Technology (COET), Pune, India and M.S. in Electrical and Computer Engineering from UC Boulder.
Observability is crucial for complex LLM systems. Simple monitoring isn’t enough. Enter OpenLLMetry: extensions on OpenTelemetry for full LLM observability. Connect it to your existing stack and optimize performance with best practices and real-world case studies. #DevOps #AI #LLMs
Filipe Oliveira is a Principal Performance Engineer @ Redis, working on both the enterprise and the source available products. Before Redis, he worked both on a large scale retail company and as Grid and Distributed Computing Researcher recurring to advanced computing infrastructures listed in the TOP500 supercomputers list.
Sharing our journey to improve P99 latency in third-party APIs. From optimizing network configs to fine-tuning connection management, we aimed to cut down latency and enhance user experience. Dive into our strategies and see how we achieved a smoother, more responsive service. #DevOps #API
Cristian Velazquez is a Staff Site Reliability Engineer on the Maps Production Engineering team at Uber. He works on multiple efficiency initiatives across multiple organizations. He leads the GC tuning efforts across the company.
Arroyo is a Rust-based, distributed stream processing engine offering millisecond-latency and high-throughput. It achieves fault tolerance and exactly-once processing via a Chandy-Lamport inspired checkpointing system. Learn about its architecture, stateful windowing, and aggregate algorithms. #RustLang
Micah is the co-founder and CEO of Arroyo, a startup building a new open-source stream processing engine. He was previously tech lead for streaming compute at Splunk, where his team built infrastructure to manage hundreds of customer Flink pipelines, and Lyft, where he built real-time data infra powering Lyft's dynamic pricing, ETA, and safety features. He has worked on many big data batch and streaming systems in his decade-plus career across domains including ad-tech, fraud protection, and the on-demand economy.
Wednesday, October 23
8:00am – 1:00pm Pacific Time
16:00 – 20:00 UTC
Thursday, October 24
8:00am – 1:00pm Pacific Time
16:00 – 20:00 UTC
Follow us on Twitter @p99conf for the latest updates.
No surprise, but once again #P99CONF from the #ScyllaDB crew is showing everyone how to do it: this is THE model for what a virtual conference should be! (Importantly: FREE!) Day 1 was terrific; looking forward to kicking off Day 2! https://t.co/PUAM7fkb2E
— Bryan Cantrill (@bcantrill) October 18, 2023
The second day of #p99conf and #scylladb. One of the best tech conference nowadays. By engineers for engineers.
— Marcin Rusek (@marcin_rusek) October 20, 2022
This is why I loved the @P99CONF format. Talks were pre-recorded, discord chat with the speaker during their talk and live Q/A after. Low stress for everyone - easy to get at missed talks.
— SMT Solvers (@SMT_Solvers) October 22, 2021
Great discussion on which languages are best for coding for speed (@rustlang or @ziglang) by @glcst, @jarredsumner, and @carllerche at #P99CONF #ScyllaDBhttps://t.co/4j0BZSvHyq
— Samir Alibabic (@samiralibabic) October 18, 2023
That was fun! Great questions from the #p99conf crowd https://t.co/bSRedaDN6x
— Liz Rice 🐝 💙💛 (@lizrice) October 19, 2022
The terrific work by the @P99CONF crew has me wondering if virtual conferences might be the future: the platform fostered collaboration, the price was right (free!), and (best of all?) the videos are already out! Viz. my talk from this morning: https://t.co/qWKsqt43nq
— Bryan Cantrill (@bcantrill) October 7, 2021
True before, true this year too.
— Paul Philleo (@philpauleo) October 18, 2023
Good thing that #P99conf talks are available on demand later!#ScyllaDB #rustlang #NoSql #database #AI #SQL #opensource #memes pic.twitter.com/3qLPBc7ckV
I've been told that cognitive biases are a trend in the #P99CONF community.
— Gwen (Chen) Shapira (@gwenshap) October 19, 2023
If you are not there live, you are missing the world's best live chat. I don't think I ever got so many good insights from so many top experts at the same time. https://t.co/P659WTVyjs
There’s no other event like this — a conference for engineers by engineers, where we’ll share novel approaches for solving complex problems efficiently and at speed. Vendor and tool agnostic, this conference will be for a highly technical audience only. Your boss’s boss is not invited.