Linux & DevOps Troubleshooting Blog
Practical guides for Linux engineers — NGINX debugging, process troubleshooting, CIS hardening, and production incident response.
Browse by category
Filter by topic
79 posts · page 7 of 9
top Command Linux: Real-World Guide to CPU and Process Monitoring
Learn how to use the top command on Linux for production troubleshooting — reading CPU, load average, memory metrics, interpreting process states, and diagnosing high CPU and stuck processes in real incidents.
strace Tutorial: Debug Linux Processes Like a Senior Engineer
A practical strace tutorial for production debugging — trace system calls, diagnose hung processes, find missing files, and debug permission errors with real command examples and incident workflows.
ps Command Linux: The Engineer's Troubleshooting Guide
How to use the ps command on Linux to debug high CPU, memory leaks, zombie processes, and unresponsive services — with real production workflows, common mistakes, and a quick-reference cheat sheet.
NGINX Troubleshooting Guide: Complete Production Reference
The complete NGINX troubleshooting reference — 502/504 errors, upstream failures, SSL issues, connection limits, keepalive misconfiguration, and production debugging workflows with real commands.
Linux Security Hardening Guide: CIS Benchmarks for Production
Complete Linux security hardening guide using CIS benchmarks — Ubuntu, RHEL, and Windows Server. SSH hardening, auditd, filesystem restrictions, firewall configuration, and production pitfalls to avoid.
Linux Process States Explained: R, S, D, Z and What They Mean for Troubleshooting
A practical guide to Linux process states — R, S, D, Z, T — what each means, how to read them in ps and top, why D state can freeze a system, and how to handle zombie processes in production.
Linux Performance Troubleshooting: Complete Engineer's Guide
The complete guide to Linux performance troubleshooting — CPU, memory, disk I/O, process states, and network — with step-by-step workflows, real production scenarios, and links to deep-dive articles on every topic.
Linux Memory Leak Troubleshooting: RSS vs VSZ Explained
How to troubleshoot memory leaks on Linux — understanding RSS vs VSZ, tracking memory growth over time, identifying the leaking process, and real debugging steps from production systems.
Linux Log Analysis: How to Debug Issues Like a Senior Engineer
A practical Linux log analysis guide — journalctl usage, grep techniques, log correlation across services, and real-world debugging workflows used in production incident response.