Issue #116
Chaos is good, do you rotate ssh-keys?, immutable architectures, observability beyond keywords, taming latency variability . . .
The Best Code is No Code At All
— Jeff Atwood.
Posts
Reliability at scale doesn’t mean eliminating failure. It means better anticipating and mitigating how it impacts our users, and improving how we handle it. - #increment #reliability
How often should I rotate my ssh keys?
When they rebuilt the server cluster after the first breach, they had been super careful, constructed all the servers from scratch, avoided any possibility of installed backdoors… and then promptly dropped in the same set of ssh public keys as before. Because ssh keys are secure, right? If there’s one thing we know for sure about security, it’s that. It’s even in the name! - #tailscale #blog
State of Urgency for Distributed Systems
What was needed and what we are building is PLAN; including realtime maps, secure messaging, and spatial planning capabilities that will give community organizers the ability to rapidly mobilize to assist those in need. Employing a distributed and peer-based system like PLAN means that even when the power goes out, there is a low bandwidth local alternative to centralized servers or cell phone towers (which may not be accessible). - #plan-systems
Journey through eventsourcing: Part 5 - Support and Evolution
Eventsourcing systems come with a unique set of capabilities but bring some issues/concerns to address. These unique features make evolving eventsourcing systems a very different process compared to the classical, state-sourced systems - many sophisticated things become simple, if not trivial (such as audit, system state provenance, derived data streams, etc.); but many simple things become much more convoluted (schema evolution, persistence, etc.). - #eugenykolpakov
FOQS: Scaling a distributed priority queue
Facebook Ordered Queueing Service (FOQS) - It’s a fully managed, horizontally scalable, multitenant, persistent distributed priority queue built on top of sharded MySQL that enables developers at Facebook to decouple and scale microservices and distributed systems. - #fb #engineering
Delivering Eventual Consistency with Kafka Streams
- Microservices choose eventual consistency because strong consistency requires distributed transactions with 2PC which is prohibitively expensive for microservice based architecture
- Kafka Streams stands out in eventual consistency implementations for its simplicity, light weightiness and scalability - #medium #blackrock-engineering
Implementing Sagas using AWS Step Functions
I presented some of the challenges associated, advocated the use of Sagas to help us to bring back transaction support, and presented the two flavors of Sagas: Choreographed and Orchestrated.
In this article, I will defend the case on why AWS Step Functions (SF) can be a good option and introduce how to implement our sample domain using it. - #medium
Podcasts
Michael Perry on Immutable Architecture
The Art of Immutable Architecture, distinguishing immutable architecture from other approaches and, using familiar examples such as git and blockchain, addresses some possible misunderstandings about designing for immutability. - #podcastaddict
Distributed Systems and Careers with Shubheksha Jalan
Shubheksha talks about service observability, the importance of job titles, and the difference between sponsors and mentors. - #softwaresessions
Observability beyond buzzwords with New Relic's Tori Wieldt
Let's move beyond buzzwords and talk about observability and DevOps in large systems. Observability is the ability to measure the internal state of a system only by its outputs, but often those outputs are hundreds of log files spread across dozens of systems. The cloud has only made these large systems harder to understand and manage. Scott talks to New Relic's Tori Wieldt about the benefits of formalizing how you think about distributed systems and the tools available to make things easier. - #hanselminutes
Videos
Taming Latency Variability and Scaling Deep Learning, by Jeff Dean, Google
Hillel Wayne is Designing Distributed Systems with TLA+