zqw's notes
Posts About
zqw's notes
Cancel
PostsAbout

Dive into K8s 02: Cilium Replaces Flannel — Goodbye VXLAN Tax, Hello eBPF Native Networking

Paul published on 2025-12-10 included in Distributed Systems

Last post, we hand-rolled Flannel in Kind cluster and witnessed the 50-byte per-packet VXLAN encapsulation overhead via tcpdump. This time, we replace Flannel with Cilium, leveraging eBPF to completely eliminate this “network tax”, and use Hubble for traffic visualization.

Read More
 Kubernetes, Cilium, EBPF

Dive into K8s: Hand-Rolling CNI with Kind, From CrashLoop to VXLAN Packet Capture

Paul published on 2025-12-07 included in Distributed Systems

This weekend, I built a “networkless” cluster from scratch on Manjaro using kind, manually debugged missing kernel modules and CNI plugins, and finally witnessed the VXLAN encapsulation process with tcpdump. This post documents the entire journey.

Read More
 Kubernetes, CNI, Networking

Temporal vs K8s Controller: Declarative vs Imperative Control Plane Paradigms

Paul published on 2025-07-15 included in Distributed Systems

K8s Controllers and Temporal are both “control plane” technologies, but they represent fundamentally different design philosophies. This post deep-dives into their core differences and explores how to combine them in AI Infra platforms.

Read More
 Temporal, Kubernetes, Control Plane

Temporal in Practice: From Saga Pattern to Self-Healing Distributed Transactions

Paul published on 2025-07-10 included in Distributed Systems

While building a trading system backend, I used Temporal to solve the classic distributed transaction consistency problem. This post breaks down Temporal’s core mechanisms and explores its applications in AI task scheduling.

Read More
 Temporal, Go, Saga

Go Lock Performance Traps: From sync.Mutex to Spinlock Tuning

Paul published on 2025-04-02 included in Go

A seemingly simple concurrent Map access, performance crashes under high concurrency. pprof shows 90% of time spent in sync.Mutex.Lock. This post documents the investigation and performance comparison of different lock strategies.

Read More
 Concurrency, Locks, Performance

Custom Linux Distro: From Yocto Build to 60% Boot Time Optimization

Paul published on 2023-06-15 included in Linux

A key user experience metric for NAS products: how long from pressing power to accessing shared folders? Our target was 30 seconds, but the original took 75 seconds. This post documents the complete journey from 75s to 30s.

Read More
 Systemd, Boot Optimization, Performance
  • 1
  • 2
  • 3
  • 4
2019 - 2026 xxxx | CC BY-NC 4.0