Pareto Chart 101: Visualizing the 80-20 Rule | Mode (2024)

Pareto Chart 101: Visualizing the 80-20 Rule | Mode (1)

Joel Carron, Data Scientist at Mode

You've probably heard of the Pareto principle, even if you didn't realize it at the time. Here are a few examples:

  • 80% of web traffic comes from 20% of your site's pages.

  • 80% of customer complaints are caused by 20% of product defects.

  • 80% of revenue comes from 20% of your customers.

Starting to sound familiar? The Pareto principle (also known as the 80-20 rule) applies to a wide range of fields from natural science to sports, but it's particularly suited to solving business problems, like those above. Conducting Pareto analysis allows you to discern the vital few from the trivial many and prioritize actions.

The history of the Pareto principle

In the 19th century, Vilfredo Pareto observed that 20% of the pea pods in his garden contained 80% of the peas. Pareto also happened to be an economist, and the pea pod discovery got him thinking. In 1896, he published a paper demonstrating that around 80% of the land in Italy was owned by 20% of the population.

Later, Joseph M. Juran came across Pareto's work and realized how relevant this idea was to quality management issues. As an engineer, he had noticed that a small number of defects caused the bulk of issues. He christened the pattern the Pareto principle. Since then the term has been used to describe any phenomenon where a small percentage of factors is responsible for a large percentage of the effect.

The Pareto chart

The Pareto chart is a visual representation of the 80-20 rule, featuring a bar + line chart. The bars represent the value of each item on your list (arranged in descending order), and the line indicates the cumulative percentage of those values.

Visualizing a Pareto analysis of your data allows you to quickly pick out where you should focus your time and resources. Take, for example, this dataset of consumer complaints submitted to the Consumer Financial Protection Bureau (a government agency working to protect consumers in the financial sector). Each complaint is bucketed into one of 94 distinct issue types, which might be too many for the CFPB to focus their policy efforts on all at once. Looking at a Pareto chart of consumer complaints will help them figure out where to start.

Click on the image above to see the full-size chart. To see the complete Python notebook generating this Pareto Chart, click here.

The CFPB's consumer complaint distribution follows the Pareto principle to a T. The top 20% of issues generate just over 80% of all complaints. This means that the CFPB can effectively work to address 80% of concerns by focusing on only the top 19 issues (compared to the total 94).

Not every distribution you come across will follow the 80-20 rule as closely as the CFPB's issue types. The Pareto principle assumes a long tail distribution, and not every dataset will follow that exact pattern. That said, a Pareto chart is an effective way of evaluating potential impact when selecting a subset of focus areas from a long list, regardless of the distribution.

Make your own Pareto chart

You can make your own Pareto chart for any list of things that have values associated with them (e.g. a list of blog post URLs and the number of web sessions for each, a list of accounts and the revenue they bring in). Check out the code in Mode and follow these steps:

Step 1: Pull data with SQL

Use a query like this one that pulls values for the items on your list. Note that a COUNT works here because you're counting complaints submitted. For numerical or financial data, you'll likely want to use a SUM instead.

SELECT issue, COUNT(DISTINCT complaint_id) as total FROM modeanalytics.consumer_complaintsGROUP BY 1ORDER BY 2 DESC
Step 2: Plot in Python with Plotly

Next, copy this code into your Python notebook.

Before you create the plot, you need to make a new column for the cumulative percentage, which is what you'll use for the line in your chart.

df['cumulative_sum'] = df.total.c*msum()df['cumulative_perc'] = 100*df.cumulative_sum/df.total.sum()

In this example, the DataFrame columns are named issue and total. If your columns are called something else, you'll need to update the Python code where applicable.

Optional Step 3: Customize HTML

The report page of this example also uses some custom HTML. The first code block sets the background color of the report page to match the background color used for the plot.

<style>body {background: rgb(240, 240, 240) !important;}.chart {background-color: rgb(240, 240, 240);border: none;}.mode-embed.mode-object {border: none;background: rgb(240, 240, 240);}.js-table-container {background-color: rgb(240, 240, 240);}</style>

The second code block increases the height of the chart area on the report page.

<script>$("#python_52c4404842c0 .chart-content").css("height",700)$("#python_52c4404842c0 .mode-python").css("height",700)$("#python_52c4404842c0 .mode-python").css("max-height",700)$("#python_52c4404842c0 img").css("max-height",700)</script>

Simply update the #python_ tags to match the chart id, which you'll find in a line that looks like this:

<mode-python id="python_52c4404842c0"options="python_options"></mode-python>

For more information on customizing the HTML of Mode reports, click here. And for more custom examples and inspiration, be sure to check out the Mode Gallery.

Additionally, here are some other tools and tutorials for making your own Pareto chart:

Recommended articles
  • 10 Useful Python Data Visualization Libraries for Any Discipline

  • Thinking in SQL vs Thinking in Python

  • 11 Data Experts Who Will Constantly Inspire You

Pareto Chart 101: Visualizing the 80-20 Rule | Mode (2024)

FAQs

How do you interpret the 80-20 rule in Pareto chart? ›

The Pareto principle states that for many outcomes, roughly 80% of consequences come from 20% of causes. In other words, a small percentage of causes have an outsized effect. This concept is important to understand because it can help you identify which initiatives to prioritize so you can make the most impact.

How do you visualize the 80-20 rule? ›

The Pareto chart is a visual representation of the 80-20 rule, featuring a bar + line chart. The bars represent the value of each item on your list (arranged in descending order), and the line indicates the cumulative percentage of those values.

How to find mode in Pareto chart? ›

The mode in a Pareto chart is represented by the category with the tallest bar, as it corresponds to the most frequently occurring category.

What is the 80/20 rule associated with in data visualization? ›

Data practitioners spend 80% of their valuable time finding, cleaning, and organizing the data. This leaves only 20% of their time to actually perform analysis on it – which is the most enjoyable part of the role for most. This is the 80/20 rule, also known as the Pareto principle.

What is an example of the 80-20 Pareto rule? ›

80% of crimes are committed by 20% of criminals. 80% of sales are from 20% of clients. 80% of project value is achieved with the first 20% of effort. 80% of your knowledge is used 20% of the time.

How do you interpret data in a Pareto chart? ›

The left vertical axis of the Pareto chart has "counts" or "cost" depending on the data used. Each vertical bar represents the contribution to the total from a given "problem" area. The bars are placed on the graph in rank order, that is the bar at the left has the highest contribution to counts or cost.

What is the best chart to show 80-20 rule? ›

The Pareto Chart is a very powerful tool for showing the relative importance of problems. It contains both bars and lines, where individual values are represented in descending order by bars, and the cumulative total of the sample is represented by the curved line.

What is the 80-20 rule step by step? ›

Steps to apply the 80/20 Rule
  1. Identify all your daily/weekly tasks.
  2. Identify key tasks.
  3. What are the tasks that give you more return?
  4. Brainstorm how you can reduce or transfer the tasks that give you less return.
  5. Create a plan to do more that brings you more value.
  6. Use 80/20 to prioritize any project you're working on.
Mar 29, 2020

What is the 80-20 rule illustrate? ›

Simply put, the 80/20 rule states that the relationship between input and output is rarely, if ever, balanced. When applied to work, it means that approximately 20 percent of your efforts produce 80 percent of the results.

How do you show percentages on a Pareto chart? ›

After drawing your bar graph, you can draw your cumulative percentage line graph. Label the right vertical axis with percentages, with 100% lining up with the highest frequency data point. Draw dots for each cumulative percentage value you found in step five. These will be in ascending order and eventually get to 100%.

How do I calculate my mode? ›

Definition. The mode is the most common number that appears in your set of data. To find the mode count how often each number appears and the number that appears the most times is the mode.

What is the 80-20 rule in Pareto chart analysis? ›

The Pareto Principle, also known as the 80/20 Rule, The Law of the Vital Few and The Principle of Factor Sparsity, illustrates that 80% of effects arise from 20% of the causes – or in lamens terms – 20% of your actions/activities will account for 80% of your results/outcomes.

What is 80-20 rule model? ›

The 80-20 rule, also known as the Pareto Principle, is a familiar saying that asserts that 80% of outcomes (or outputs) result from 20% of all causes (or inputs) for any given event. In business, a goal of the 80-20 rule is to identify inputs that are potentially the most productive and make them the priority.

What is the 80-20 rule design? ›

It states that the majority of your room – 80% – should be decorated in neutral colors, such as whites, creams, beige or pale pastel hues. The remaining 20% is where you can let your creativity shine, opting for statement colors and patterns. If you use this rule in your interior design, you can't go far wrong.

What are the interpretations of 80/20 rule Pareto analysis in testing? ›

According to the Pareto Principle, in any group of things that contribute to a common effect, a relatively few contributors account for the majority of the effect. Commonly, it is found that: 80% of complaints come from 20% of customers. 80% of sales come from 20% of clients.

What is the 80-20 rule in Pareto analysis inventory? ›

Imagine your facility stocks a vast range of products. According to the Pareto law, 80% of your revenue will likely come from 20% of your inventory items. This means that a small group of products are the most popular or higher in demand and, consequently, responsible for the majority of sales.

What does the Pareto Principle suggest 20% is to action as 80% is to? ›

The Pareto Principle is a concept that suggests that 80% of the end results of an action are due to 20% of causes. That's why this principle is also called the 80/20 rule.

What is the 80-20 principle summary? ›

Most motorists drive carefully, while a small minority is careless and causes the majority of accidents. This phenomenon is better known as the 80/20 principle: roughly 80 percent of work results – or output – are produced by 20 percent of the work effort, or input.

Top Articles
Latest Posts
Article information

Author: Patricia Veum II

Last Updated:

Views: 5603

Rating: 4.3 / 5 (44 voted)

Reviews: 91% of readers found this page helpful

Author information

Name: Patricia Veum II

Birthday: 1994-12-16

Address: 2064 Little Summit, Goldieton, MS 97651-0862

Phone: +6873952696715

Job: Principal Officer

Hobby: Rafting, Cabaret, Candle making, Jigsaw puzzles, Inline skating, Magic, Graffiti

Introduction: My name is Patricia Veum II, I am a vast, combative, smiling, famous, inexpensive, zealous, sparkling person who loves writing and wants to share my knowledge and understanding with you.