/System Design

How To Choose The Right Database In A System Design Interview

- Ashish Pratap Singh tl;dr: “In system design interviews, the quality of your design and its ability to scale depends heavily on the database you choose. Choosing the wrong database can lead to high latency, data loss, or even system downtime. In this article, we will cover the 9 most common use cases that come up often in system design interviews and explore the best databases for each scenario.”

featured in #580


How We Built A Self-Healing System To Survive A Terrifying Concurrency Bug At Netflix

- Matthew Hawthorne tl;dr: “It was a Friday afternoon, and I heard a lot of commotion. I emerged from my cubicle to see my colleagues passionately discussing a problem: our CPU usage was slowly growing across our cluster.”

featured in #567


5 Lessons Design Systems Teams Can Learn From Open-Source Maintainers

- Nick Moore tl;dr: Design system teams constantly balance the evangelism of their standards and components while maintaining and evolving them. Turns out, the same is true of open source maintainers. This post suggests 5 ways enterprise design system teams can learn from open source to better demonstrate their value, improve the end-user experience, and reclaim time spent on maintenance for evolving and improving the system as a whole.

featured in #485


Standing On The Shoulders Of Giants: Colm On Constant Work

- Werner Vogels tl;dr: This is why many of our most reliable systems use very simple, very dumb, very reliable constant work patterns. Just like coffee urns. These patterns have three key features. (1) They don’t scale up or slow down with load or stress. (2) They don’t have modes, which means they do the same operations in all conditions. (3) If they have any variation, it’s to do less work in times of stress so they can perform better when you need them most. There’s that anti-fragility again.

featured in #466


Tumblr Shares Database Migration Strategy With 60+ Billion Rows

tl;dr: The article delves into Tumblr's database migration strategy. With a massive MySQL database spanning 21 terabytes and 60+ billion rows, Tumblr sought a migration approach that minimized user impact. Initially considering a brute force method, they later adopted the CQRS pattern, which separates database read and write operations. To combat latency issues, Tumblr introduced a database proxy in the local data center, which maintained persistent connections to the remote leader and allowed for connection pooling. This strategy ensured minimal user disruption during migration.

featured in #447


Live Comment System Design

tl;dr: The live comment is a popular system design interview question and is discussed here. The real-time platform built for publishing live comments can display Facebook reactions, likes, concurrent viewer counts, user presence status, online polls, or seen receipts.

featured in #422


Slack Architecture

tl;dr: The author guides us through the design for a system where the: (1) User can send real-time messages to another user and within a channel. (2) Chat messages should be persisted and displayed in chronological order to keep the total order of messages. (3) User can send media files such as pictures or short audio files. And more.

featured in #419


Consistent Hashing Explained

tl;dr: “Consistent hashing is a distributed systems technique that operates by assigning the data objects and nodes a position on a virtual ring structure - a hash ring. Consistent hashing minimizes the number of keys to be remapped when the total number of nodes changes.” The author dives deep into this works in the context of system design.

featured in #404


System Design Interview Cheat Sheet

tl;dr: “The system design questions are subjective. This cheat sheet is a work in progress and is written based on my research on the topic.” Topics include databases, API design, capacity planning, high level design, design deep dives, and more.

featured in #403


Tag Systems

- Hillel Wayne tl;dr: "I’ve tried to write a blog post on tag systems for years now... The problem is that there’s just so much to them, so many different approaches and models and concerns that trying to be comprehensive and rigorous is an exercise in madness... So screw it. These are my non-comprehensive, poorly-researched thoughts on tag systems. This is not about implementation of tag systems, just their design."

featured in #386