Collection
A collection of interesting learning resources and tools I found.
Arthur Chiao's Blog
#kubernetes #networking #cilium
Amazing collection of write-ups related to Cilium CNI
Did Kubernetes Make My p95s Worse? - Jian Cheung & Stephen Chan, Airbnb
#learnings #kubernetes #performance analysis
A reminder to not blindly jump onto flashy tech. Evaluate clearly whether a tool is really what you need
The documentation system
Do you find yourself struggling to write a documentation? If yes, this is for you
Continuous Delivery Pipelines - How to Build Better Software Faster • Dave Farley • GOTO 2021
#infrastructure #release engineering
Insightful talk about some of the best practices when it comes to building and releasing software
Networking & Kubernetes
#recommended books #kubernetes #networking
Deep dive into Kubernetes networking stack. Highly recommended for those who no longer want to view networking as a blackbox.
Russian Doll - Extending Containers with Nested Processes - Christie Wilson & Jason Hall, Google
#kubernetes #linux #containers
Understanding How Facebook Disappeared from the Internet
#networking #site reliability #disaster recovery
Linux x86 Program Start Up or - How the heck do we get to main()?
For those who want to understand how programs get loaded under Linux
Julia Evans Blog
Amazing blog belonging to publisher of Wizardzines. Plenty of articles on Linux and more.
Creating a simple kubernetes debug pod
For moments when you wish you had a pod that isn't on an alpine diet
Tutorial: Communication Is Key - Understanding Kubernetes Networking - Jeff Poole, Vivint Smart Home
Monitoring Service Architecture and Health with BPF - Jonathan Perry, Flowmill, Inc.
#architecture #bpf #monitoring
MIT Computer Systems Security 1. Introduction, Threat Models
Channel also has lots of other very informative videos.
How to Backup and Restore Your Kubernetes Cluster - Annette Clewett & Dylan Murray, Red Hat
#kubernetes #disaster recovery
SREcon19 Asia/Pacific - Why Does My Monitoring Suck?
Amazing talk on application of Rumsfeld Quandrant in monitoring.
Unknown response + unknown detection = ALERT!
Declarative Multi-Cluster Monitoring with Prometheus - Matthias Loibl, Loodse & Frederic Branczyk
#kubernetes #monitoring #prometheus
SREcon16 - Putting Together Great SRE Teams
What makes a great SRE team? Very insightful talk. Highly recommended.
Thanos - Transforming Prometheus to a Global Scale in a Seven Simple Steps
#monitoring #prometheus #thanos
SREcon19 Europe/Middle East/Africa - The SRE I Aspire to Be
Google's definition aside, what exactly is a SRE?
Intro to Thanos: Scale Your Prometheus Monitoring With Ease - Lucas Serven & Dominic Green
#monitoring #prometheus #thanos
The Evolution of Distributed Systems on Kubernetes
Modern distributed applications have needs around lifecycle, networking, binding, and state management that cloud-native platforms must provide.
Kubernetes has great support around lifecycle management but relies on other platforms using the sidecar and operator concepts to satisfy the networking, binding, and state management primitives.
Future distributed systems on Kubernetes will be composed of multiple runtimes where the business logic forms the core of the application, and sidecar “mecha” components offer powerful out-of-the-box distributed primitives.
This decoupled mecha architecture offers the benefits of cohesive units of business logic and improves day-2 operations, such as patching, upgrades, and long-term maintainability.
Monitorama BAL 2019 - Cory Watson - Dashboard Renaissance - How dashboards work and how to improve them
If you ever struggled with building an intuitive and understandable dashboard, this is for you.