Essential Reading For Engineering Leaders

My Notes On GitLab Postgres Schema Design

- Shekhar Gulati

PostgreSQL
BestPractices

tl;dr: "My motivation to understand the schema of a big project like Gitlab was to compare it against schemas I am designing and learn some best practices from their schema definition. I can surely say I learnt a lot."

featured in #338

Best Practices For Making API Calls At Scale

- David Donnelly

tl;dr: 3’s a crowd and 429’s a rate limit exceeded error. We explore best practices for making API calls at scale, the 4 most common ways rate limits are implemented, and a methodology for avoiding rate limits dynamically.

featured in #331

Best Practices For Inclusive CLIs

- Rohan Kumar

tl;dr: "This began as a reply to another article that lists practices to improve user-experience of command-line interfaces... Unfortunately, a number of its suggestions are problematic, particularly from an accessibility perspective." Rohan elaborates on these and and discusses best practices for inclusive CLIs.

featured in #325

How I Think About Code Management

- Andreas Klinger

Management
BestPractices

tl;dr: Split code management into 2 tasks: Reducing complexity & increasing confidence. (1) Reducing complexity so you have fewer parts and interconnections between them. Develop "concepts with better boundaries." (2) Increase confidence by streamlining. Minimize ways things can be done, make them obvious and make sure they work. Andreas explains light-weight processes to do both.

featured in #305

What's In A Good Error Message?

- Gunnar Morling

BestPractices

tl;dr: Three pieces of information should be conveyed by an error message: (1) Context: What led to the error? What was the code trying to do when it failed? (2) The error itself: What exactly failed? (3) Mitigation: What needs to be done in order to overcome the error? Gunnar also highlights best practices: having a uniform voice and style, consistency in terminology, and more.

featured in #282

10 Rules For Better Cloud Security

- Thomas Segura

Cloud
BestPractices

tl;dr: (1) Limit manual configurations, (2) Design a recovery program to recover from outside accounts, providers, or locales. (3) Regularly review your DNS and cloud configurations to prevent take-over situations, and more.

featured in #275

OOPS Writeups

- Lorin Hochstein

BestPractices
Netflix

tl;dr: Operational Surprises (OOPS) is when something unexpected happened in operations and presents an opportunity to discover how the observed system behavior deviated from the mental model of how the system is supposed to behave. The template shared in this post is based on the template used at Netflix.

featured in #274

Save Your Engineers' Sleep: Best Practices For On-call Processes

- James Frost

BestPractices

tl;dr: 8 best practices shared including: (1) Alerts are treated as code i.e. go through code reviews, generated from existing modules. (2) Use percentiles over averages to get a higher quality signal. (3) Use playbooks to document each alert so there is corresponding documentation that explains what is broken and how to investigate and fix it.

featured in #274

The Developer’s Guide to SSO

Tools
BestPractices

tl;dr: Single-sign-on via SAML or oAuth is often one of the first features an engineering team needs to build as a product gains traction among larger customers. We explore how it works, how to add it to your app, and best practices for implementation.

featured in #265

Engineering Onboarding - Built to Scale

- Jamshed Vesuna

Management
BestPractices

tl;dr: Management at Robinhood rebuilt their engineering onboarding process around 3 key themes: learning experience, community building, and purposeful team matching, each outlined here.

featured in #264

/Best Practices