{"product_id":"linux-performance-observability-handbook-diagnose-and-optimize-linux-systems-with-metrics-logs-traces-and-ebpf-9798298209441","title":"Linux Performance \u0026 Observability Handbook: Diagnose and Optimize Linux Systems with Metrics, Logs, Traces, and eBPF","description":"\u003cp\u003e • Author(s): Alex Russell\u003cbr\u003e • Publisher: Independently Published\u003cbr\u003e • Publisher Imprint: Independently Published\u003cbr\u003e • BISAC: Operating Systems - Linux\u003c\/p\u003e\u003cp\u003e\u003c\/p\u003e\u003cp\u003eModern Linux systems power cloud-native apps, containers, and microservices at scale. Performance and reliability hinge on observability: \u003cb\u003emetrics, logs, traces, and eBPF\u003c\/b\u003e. With tooling like \u003cb\u003ePrometheus, Grafana, Loki, Elastic, perf, strace, ftrace, Flame Graphs, bpftrace\/BCC, XDP\u003c\/b\u003e, and Kubernetes-native signals, engineers can pinpoint latency, eliminate bottlenecks, and ship confidently whether on \u003cb\u003eUbuntu, Debian, RHEL, Alpine\u003c\/b\u003e, or across \u003cb\u003eAWS, Azure, GCP\u003c\/b\u003e.\u003c\/p\u003e\u003cp\u003eWritten in a practitioner-first style by engineers who operate production Linux daily, this handbook distills proven playbooks for \u003cb\u003eSREs, DevOps, platform engineers, site operators, and performance analysts\u003c\/b\u003e. Every technique favors \u003cb\u003eprod-safe defaults\u003c\/b\u003e, low overhead, and reproducible workflows.\u003c\/p\u003e\u003cp\u003e\u003cb\u003eLinux Performance \u0026amp; Observability Handbook: Diagnose and Optimize Linux Systems with Metrics, Logs, Traces, and eBPF\u003c\/b\u003e is a hands-on guide to \u003cb\u003efinding, fixing, and preventing\u003c\/b\u003e CPU, memory, storage, and networking issues. It unifies classic tooling (perf, ftrace, tcpdump) with modern stacks (\u003cb\u003ePrometheus, Grafana, Loki\/Promtail, Elastic\u003c\/b\u003e) and \u003cb\u003eeBPF\u003c\/b\u003e for deep visibility in containers and Kubernetes. You'll learn to go from \u003cb\u003ealert → evidence → root cause → prevention\u003c\/b\u003e with clear thresholds, lab steps, and incident playbooks.\u003c\/p\u003eWhat's Inside\u003cul\u003e\n\u003cli\u003e\u003cp\u003e\u003cb\u003eCPU \u0026amp; memory deep dives: \u003c\/b\u003e run queues, steal time, cgroups v2 quotas, PSI, major faults, reclaim, swap, OOM forensics.\u003c\/p\u003e\u003c\/li\u003e\n\u003cli\u003e\u003cp\u003e\u003cb\u003eStorage \u0026amp; filesystems: \u003c\/b\u003e queue depth, schedulers, latency histograms (\u003cb\u003ebiolatency, iosnoop\u003c\/b\u003e), ext4\/XFS hotspots, I\/O wait, async\/direct I\/O.\u003c\/p\u003e\u003c\/li\u003e\n\u003cli\u003e\u003cp\u003e\u003cb\u003eNetworking: \u003c\/b\u003e sockets, retransmits, \u003cb\u003ecwnd\u003c\/b\u003e, RTT, \u003cb\u003eqdisc\u003c\/b\u003e, NIC rings, RSS, GRO\/LRO, DNS\/TLS visibility with eBPF, \u003cb\u003esafe packet capture\u003c\/b\u003e.\u003c\/p\u003e\u003c\/li\u003e\n\u003cli\u003e\u003cp\u003e\u003cb\u003eTracing \u0026amp; profiling: \u003c\/b\u003e \u003cb\u003eperf stat\/top\/record\u003c\/b\u003e, ftrace\/tracefs, on\/off-CPU analysis, \u003cb\u003eFlame Graphs\u003c\/b\u003e, lock contention, syscall triage.\u003c\/p\u003e\u003c\/li\u003e\n\u003cli\u003e\u003cp\u003e\u003cb\u003eeBPF in production: \u003c\/b\u003e verifier safety, tracepoints\/kprobes\/uprobes, \u003cb\u003eCO-RE\u003c\/b\u003e, maps, bpftool, bpftrace\/BCC one-liners, XDP for defense.\u003c\/p\u003e\u003c\/li\u003e\n\u003cli\u003e\u003cp\u003e\u003cb\u003eKubernetes \u0026amp; containers: \u003c\/b\u003e per-pod visibility, cgroup-aware metrics, CSI volumes, SLO alerts, anti-noise alert design, runbooks.\u003c\/p\u003e\u003c\/li\u003e\n\u003cli\u003e\u003cp\u003e\u003cb\u003eOperations \u0026amp; tuning: \u003c\/b\u003e evidence-based kernel and scheduler knobs, capacity \u0026amp; steady-state profiling, security observability (\u003cb\u003eauditd, Falco, eBPF policies\u003c\/b\u003e), capstone RCA.\u003c\/p\u003e\u003c\/li\u003e\n\u003c\/ul\u003e\u003cp\u003eFor \u003cb\u003eSREs, DevOps, platform and systems engineers, performance engineers, and backend developers\u003c\/b\u003e who run Linux in production-VMs or bare metal, Docker or Kubernetes. Familiarity with Linux basics is helpful; advanced kernel knowledge is optional.\u003c\/p\u003e\u003cp\u003eDesigned for \u003cb\u003efast triage under pressure\u003c\/b\u003e and \u003cb\u003edeep analysis when time allows\u003c\/b\u003e. Open to the quick-start playbooks for a 90-second snapshot, then jump deeper with focused labs and checklists. Ideal for incident response, postmortems, and continuous tuning cycles.\u003c\/p\u003e\u003cp\u003eLevel up your \u003cb\u003eLinux performance engineering\u003c\/b\u003e today. Use actionable, low-overhead techniques to \u003cb\u003ecut tail latency, stop noisy alerts, and turn signals into fixes\u003c\/b\u003e-across \u003cb\u003ecloud, containers, and Kubernetes\u003c\/b\u003e. Add this handbook to your toolbox and ship faster with confidence, backed by \u003cb\u003emetrics, logs, traces, and eBPF\u003c\/b\u003e.\u003c\/p\u003e","brand":"Independently Published","offers":[{"title":"Paperback","offer_id":47779217866903,"sku":"9798298209441","price":2089.0,"currency_code":"INR","in_stock":false}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0666\/3471\/1191\/files\/9798298209441.webp?v=1778034426","url":"https:\/\/atlanticbooks.com\/products\/linux-performance-observability-handbook-diagnose-and-optimize-linux-systems-with-metrics-logs-traces-and-ebpf-9798298209441","provider":"Atlantic Books","version":"1.0","type":"link"}