← All articles

Root Cause Analysis: Choosing the Right Tool

By XNM Technologies · December 10, 2022 · 5 min read
Root Cause Analysis: Choosing the Right Tool

When something goes wrong in an organisation, the instinct is to fix it and move on. Replace the part, retrain the person, update the procedure, and declare the problem solved. Most of the time, that instinct is wrong. What was fixed was the symptom. The root cause -- the underlying condition that made the failure possible -- remains untouched, and the problem recurs, sometimes in a slightly different form that fools everyone into thinking it is a new issue.

Root cause analysis (RCA) is the discipline of tracing a problem back to its origin rather than stopping at the first plausible explanation. Several structured tools exist to support this work. The challenge is not learning each tool -- most are straightforward -- but knowing which tool fits which situation. Using the wrong tool produces a superficially complete analysis that still misses the real cause.

Five Whys: Fast and Simple, With a Known Weakness

The Five Whys technique is exactly what it sounds like: you ask "why did this happen?" and then ask "why?" again about the answer, repeating until you reach a root cause -- typically in four to six iterations. It is fast, requires no special training, and can be done in a short meeting with the people closest to the problem.

The Five Whys works well for problems with a relatively simple, linear causal chain -- one thing caused another thing which caused another thing. Where it struggles is with complex problems that have multiple contributing causes, or where the causal chain branches. In those cases, stopping at the first plausible "why" answer leads you down one branch and misses the others. Teams also tend to stop when they hit a cause that feels actionable, even if deeper causes remain. The result is an analysis that is technically complete but practically shallow.

Fishbone Diagram: Broad Brainstorming for Multi-Causal Problems

The Fishbone diagram, also called the Ishikawa diagram or cause-and-effect diagram, maps potential causes across predefined categories -- commonly the 6 Ms: Man, Machine, Method, Material, Measurement, Mother Nature (environment). The problem statement sits at the head of the fish; potential causes branch off the spine in each category.

The Fishbone is best suited to situations where the root cause is genuinely unknown and the team benefits from structured brainstorming across multiple domains. It is particularly useful when a problem could plausibly originate from several different sources -- equipment, human behaviour, process design, supplier quality -- and you want to ensure none of them goes unexplored. The limitation is that the Fishbone generates hypotheses rather than confirming them. After building the diagram, teams need to validate which branches actually contributed to the problem, which requires data collection and testing.

Fault Tree Analysis: Top-Down Logic for High-Stakes Systems

Fault Tree Analysis (FTA) takes a top-down approach: you start with an undesired event (the "top event") and work backward through logical gates -- AND gates (all conditions must be present) and OR gates (any condition is sufficient) -- to identify the combinations of failures that could cause it. The output is a logical diagram, often rendered as a tree, that maps every pathway to failure.

FTA is the tool of choice for safety-critical and reliability engineering applications -- aerospace, nuclear, chemical processing, medical devices -- where the consequences of failure are severe and the analysis must be exhaustive. It is more resource-intensive than Five Whys or Fishbone and requires facilitators comfortable with logical decomposition. For everyday quality problems in business processes, FTA is usually overkill; for systems where a failure could cause injury, environmental damage, or regulatory consequences, it is often the minimum acceptable standard.

8D: Structured Team-Based Problem Solving

The Eight Disciplines (8D) methodology is a structured, team-based problem-solving process that originated in the automotive industry and became widely adopted across manufacturing and complex supply chains. The eight disciplines are: form a team, describe the problem, implement interim containment, identify and verify root causes, choose and verify corrective actions, implement permanent corrective actions, prevent recurrence, and congratulate the team.

8D is appropriate for problems that are significant enough to warrant a formal team response, particularly when the problem has affected a customer -- the framework was specifically designed for customer complaint resolution and manufacturing defects escaping to end users. Its strength is structure: the D4 step (identify and verify root causes) explicitly requires the team to test whether proposed causes actually explain the problem, rather than accepting plausible hypotheses. The 8D report also creates a documented record that can be shared with customers as evidence of systematic problem resolution.

How to Choose

A practical decision framework: use Five Whys for quick investigation of straightforward problems where the causal chain is likely linear. Use Fishbone when the cause is genuinely unknown and you want to explore multiple domains before hypothesising. Use FTA when the system is safety-critical and the analysis must be exhaustive and defensible. Use 8D when the problem is significant, customer-facing, or manufacturing-related and a structured team response with formal documentation is warranted.

In practice, these tools complement each other. A Fishbone helps identify the most likely causal branches; Five Whys then drills into each branch. An 8D investigation uses Five Whys or FTA within the D4 step. The tools are not mutually exclusive.

The Mistake That Undermines All of Them

  • Stopping at the first cause that is actionable, rather than the actual root cause.

  • Confusing "we did not follow the procedure" with a root cause -- that is still a symptom; ask why the procedure was not followed.

  • Blaming individuals rather than investigating the system conditions that made the error easy to make.

  • Skipping verification -- accepting a plausible cause without testing whether removing it would actually prevent recurrence.

  • Treating the analysis as a documentation exercise rather than a problem-solving one.

The most consequential mistake in RCA is fixing the symptom and calling it a root cause. Every tool described here is designed to prevent that -- but only if it is applied with enough rigour to push past the first comfortable answer. The question to keep asking is: if we fixed this, could the original problem still occur? If the answer is yes, you have not reached the root cause yet.

XNM Consulting helps organisations build structured quality and continuous improvement capabilities, including root cause analysis programmes that produce lasting results.