Chris Down

👥 4 conferences
🎤 4 talks
📅 Years active: 2016 to 2023

Biography

Chris Down is an engineer on Facebook's Kernel team, based in London. He works on memory management within the kernel, especially cgroups, and is also a maintainer of the systemd project. Inside Facebook, he is responsible for debugging and resolving major production issues and improving the reliability and efficiency of Facebook's systems at scale.

— biography from FOSDEM 2023
https://archive.fosdem.org/2023/schedule/speaker/chris_down/

Conferences

4 known conferences

👥 FOSDEM 2023 📅 04 Feb 2023

🎤 7 years of cgroup v2: the future of Linux resource control
04 Feb 2023

Control groups (or cgroups for short) are one of the most fundamental technologies underpinning our modern love of containerisation and resource control. Back in 2016, we released a complete overhaul of how cgroups work internally: cgroup v2, released with Linux 4.5. This brought many new and exciting possibilities to increase system stability and throughput, but with those possibilities have also come challenges of a type which we have largely not faced in Linux before.

This talk will go into some of the challenges faced in overhauling Linux's resource isolation and control capabilities, and how we've gone about fixing them. This will include some of the most complex and counter-intuitive practical effects we've seen in production, with details of how our expectations and knowledge have developed over the last 5 years using this on over a million machines in production, with insights that are immediately applicable to anyone who runs Linux at scale.

We will also go over the state-of-the-art of resource control in the "real world" outside of companies like Meta and Google, looking at how cgroup v2 is changing the technical landscape for distributions and containerisation technologies for the better.

👥 FOSDEM 2020 📅 01 Feb 2020

🎤 Linux memory management at scale
01 Feb 2020

Memory management is an extraordinarily complex and widely misunderstood topic. It is also one of the most fundamental concepts to understand in order to produce coherent, stable, and efficient systems and containers, especially at scale. In this talk, we will go over how to compose reliable memory-heavy, multi-container systems that can withstand production incidents, and go over examples of how Facebook is achieving this in production at the cutting edge. We'll also go over the open-source technologies we're building to make this work at scale in a density that has never been achieved before.

We will go over widely misunderstood Linux memory management concepts which are important to site reliability and container management with an engineer who works on the Linux kernel's memory subsystem, busting commonly held misconceptions about things like swap and memory constraints, and giving advice on key and bleeding-edge kernel concepts like PSI, cgroup v2, memory protection, and other important container-related topics along the way.

👥 FOSDEM 2017 📅 04 Feb 2017

🎤 cgroupv2: Linux's new unified control group hierarchy
05 Feb 2017

cgroupv1 (or just "cgroups") has helped revolutionise the way that we manage and use containers over the past 8 years. A complete overhaul is coming -- cgroupv2. This talk will go into why a new control group system was needed, the changes from cgroupv1, and practical uses that you can apply to improve the level of control you have over the processes on your servers.

We will go over:

Design decisions and deviations for cgroupv2 compared to v1
Pitfalls and caveats you may encounter when migrating to cgroupv2
Discussion of the internals of cgroupv2
Practical information about how we are using cgroupv2 inside Facebook

👥 FOSDEM 2016 📅 30 Jan 2016

🎤 Lessons learned running SSL at scale
30 Jan 2016

Several years ago, Facebook launched an internal initiative to integrate more encryption into its corporate infrastructure. The effort required advanced, yet highly responsive solutions in multiple areas, including vulnerability management, secure key distribution, and support for dated encryption in markets where modern encryption is still not viable. This technical talk will outline how Facebook has implemented some of these systems and provide recommendations for methodologies and open-source tools that could allow other organizations to put them into practice. It will also discuss how Facebook is addressing the challenge of serving SSL to millions of people in developing countries.

This talk will cover both technical and organisational topics. Main focuses include:

Designing a software/infrastructure ecosystem that can quickly respond to SSL security issues/other changes
Handling alerting and certificate monitoring, and where in your SSL stack to put such logic
Being able to provide both utility and maximum possible security in developing countries
Things to consider to avoid leaks with new developments in modern SSL infrastructure (for example, Certificate Transparency)
Proactively monitoring potentially malicious new certificate issuances for your domains
How we have implemented some of these systems at Facebook, with suggestions for open-source tools to help you do the same if you wish