Pointer: What Current & Future Engineering Leaders Read

- Mahdi Yusuf

DistributedSystem

tl;dr: The eight fallacies of distributed computing, all of which are false assumptions: (1) The network is reliable, (2) Latency is zero, (3) Bandwidth is infinite, (4) The network is secure, (5) Topology doesn't change, (6) There is one administrator, (7) Transport cost is zero, (8) The network is homogeneous.

featured in #323

Distributed Systems Shibboleths

- Joseph Lynch

DistributedSystem

tl;dr: "Shibboleths are historically a word that indicate membership in a particular group or culture.... I have only studied and worked in the field for around a decade, but in that time I believe I have learned to recognize some key “distsys shibboleths” that help me recognize when I can trust what a vendor or other engineer is telling me."

featured in #314

Building Robust Distributed Systems

- Kislay Verma

DistributedSystem
Architecture

tl;dr: "I have written before on this blog about what distributed systems are and how they can give us tremendous scalability at the cost of having to deal with a more complicated system design. Let’s discuss how we can make a distributed system resilient to random failures which get more common as the system gets larger."

featured in #299

The Internet Was Designed With A Narrow Waist

- Andy Chu

tl;dr: A narrow waist is a concept, interface, or protocol that solves an interoperability problem. Picture an hourglass with M things on one side, N on the other, and an important concept in the middle. Andy illustrates how IP is an example and how that shapes internet architecture.

featured in #299

Caches, Modes, And Unstable Systems

- Marc Brooker

Cache
DistributedSystem

tl;dr: "Good caches have feedback loops. Like back pressure, and limited concurrency. Bad caches are typically open-loop. This starts to give us a hint about how we may use caches safely, and points to some of the safe patterns for distributed systems caching."

featured in #250

Edgar: Solving Mysteries Faster With Observability

- Elizabeth Carretto

Netflix
DistributedSystem

tl;dr: "Edgar helps Netflix teams troubleshoot distributed systems efficiently with the help of a summarized presentation of request tracing, logs, analysis, and metadata." A run-through of how it works.

featured in #204

Patterns Of Distributed Systems

- Unmesh Joshi

DistributedSystem

tl;dr: "What follows is a first set of patterns observed in mainstream open source distributed systems. I hope that these set of patterns will be useful to all developers."

featured in #199

Build More Reliable Distributed Systems By Breaking Them With Jepsen

- Kyle Kingsbury

DistributedSystem

tl;dr: Podcast where Kyle discusses his "approach to testing complex systems, common challenges that are faced by engineers who build them, and why it is important to understand their limitations."

featured in #196

Naive Retries Considered Harmful

- Evan Jones

DistributedSystem

featured in #51.1

Building Globally Distributed, Mission Critical Applications: Lessons From The Trenches

- Kris Beevers

DistributedSystem
Architecture

featured in #48.1
