Empirical Rule: Definition, Formula, and Example

Empirical Rule: Definition, Formula, and Example

[ad_1]

What Is the Empirical Rule?

The empirical rule, also sometimes called the three-sigma or 68-95-99.7 rule, predicts deviations from the mean or average of data. It indicates that 68% of observations will fall within the first standard deviation (µ ± σ) in normal distributions, 95% within the first two standard deviations (µ ± 2σ), and 99.7% within the first three standard deviations (µ ± 3σ) of the mean. The rule is a vital component in quality control and risk analysis.

Key Takeaways

  • Three-sigma limits that follow the empirical rule are used to set the upper and lower control limits in statistical quality control charts and in risk analysis.
  • 68% of the data will fall within one standard deviation under the empirical rule.
  • 95% will fall within two standard deviations.
  • 99.7% will fall within three standard deviations from the mean.

Investopedia / Michela Buttignol


Understanding the Empirical Rule

The empirical rule is often used in statistics for forecasting final outcomes. After calculating the standard deviation and before collecting complete data, this rule can be used as a rough estimate of the outcome of the impending data to be collected and analyzed.

This probability distribution can be used as an evaluation technique since gathering the appropriate data may be time-consuming or even impossible in some cases. Such considerations come into play when a company reviews its quality control measures or evaluates its risk exposure. For instance, the frequently used risk tool value-at-risk (VaR) assumes that the probability of risk events follows a normal distribution.

The empirical rule is also used as a rough way to test a distribution’s “normality.” If too many data points fall outside the three standard deviation boundaries, this suggests that the distribution is not normal and may be skewed or follow some other distribution.

The empirical rule is also known as the three-sigma rule, as “three-sigma” refers to a statistical distribution of data within three standard deviations from the mean on a normal distribution (bell curve), as indicated by the figure below.

Image by Julie Bang © Investopedia 2019 

Example of the Empirical Rule

Let’s assume a population of animals in a zoo is known to be normally distributed. Each animal lives to be 13.1 years old on average (mean), and the standard deviation of the lifespan is 1.5 years. If someone wants to know the probability that an animal will live longer than 14.6 years, they could use the empirical rule. Knowing the distribution’s mean is 13.1 years old, the following age ranges occur for each standard deviation:

  • One standard deviation (µ ± σ): (13.1 – 1.5) to (13.1 + 1.5), or 11.6 to 14.6
  • Two standard deviations (µ ± 2σ): 13.1 – (2 x 1.5) to 13.1 + (2 x 1.5), or 10.1 to 16.1
  • Three standard deviations (µ ± 3σ): 13.1 – (3 x 1.5) to 13.1 + (3 x 1.5), or, 8.6 to 17.6

The person solving this problem needs to calculate the total probability of the animal living 14.6 years or longer. The empirical rule shows that 68% of the distribution lies within one standard deviation, in this case, from 11.6 to 14.6 years. Thus, the remaining 32% of the distribution lies outside this range. One half lies above 14.6 and the other below 11.6. So, the probability of the animal living for more than 14.6 is 16% (calculated as 32% divided by two).

The Empirical Rule in Investing

Most market data isn’t normally distributed, so the 68-95-99.7 rule doesn’t generally apply to investments. However, many analysts use aspects of it—such as standard deviation—to estimate volatility.

You can calculate the standard deviation of your portfolio, an index, or other investments and use it to assess volatility. Calculating a particular investment’s standard deviation is straightforward if you have access to a spreadsheet and your chosen investment’s prices or returns.

Market analysts express standard deviation in percentage form. For example, the standard deviation for the S&P 500 index from 2015 to 2025 was 15.37%.

Using the spreadsheet, you can paste the returns, prices, or values into it, find the percent change from the previous session, and use the standard deviation function:

= STDEV ( 1, 2, 3, 4, …) or = STDEV ( A1 : A200 )

Important

You’ll get more accurate results using more than one month’s trading data, such as three or more years. The example below uses the index’s daily values over one month and annualizes the standard deviation to limit the table size.

To annualize the standard deviation, multiply it by the square root of the number of trading days in one year—there are usually 252. Here’s a calculation of the standard deviation based on the closing prices of the S&P 500.

S&P 500 Standard Deviation (Annualized)
DATE CLOSE INTERDAY CHANGE
3/3/2025 5849.72
3/4/2025 5778.15 -1.22%
3/5/2025 5842.63 1.12%
3/6/2025 5738.52 -1.78%
3/7/2025 5770.2 0.55%
3/10/2025 5614.56 -2.70%
3/11/2025 5572.07 -0.76%
3/12/2025 5599.3 0.49%
3/13/2025 5521.52 -1.39%
3/14/2025 5638.94 2.13%
3/17/2025 5675.12 0.64%
3/18/2025 5614.66 -1.07%
3/19/2025 5675.29 1.08%
3/20/2025 5662.89 -0.22%
3/21/2025 5667.56 0.08%
3/24/2025 5767.57 1.76%
3/25/2025 5776.65 0.16%
3/26/2025 5712.2 -1.12%
3/27/2025 5693.31 -0.33%
3/28/2025 5580.94 -1.97%
3/31/2025 5611.85 0.55%
Standard Deviation stdev(C1:C3) 1.29%
Annualized Standard Deviation sqrt(252)*C24 20.42%
Based on closing prices in March 2025.

So the annual volatility based on the data used in the table is 20.42%. The higher the standard deviation, the more risk analysts believe the investment has.

Alternatively, you can find an investment’s standard deviation on popular investing websites. For example, Morningstar displays the S&P 500 standard deviation in three, five, and 10-year measurements.

Explain Like I’m Five

The empirical rule describes how the points in a data set are clustered around the center. It is based on the standard deviation, a measure of how widely the data points are spread out.

If the data set is normally distributed, the empirical rule predicts that 68% of the data is less than one standard deviation from the mean. 95% are within two standard deviations of the mean, and 99.7% are within three standard deviations.

What Is the Empirical Rule?

In statistics, the empirical rule states that in a normal distribution, 99.7% of observed data will fall within three standard deviations of the mean. Specifically, 68% of the observed data will occur within one standard deviation, 95% within two standard deviations, and 99.7% within three standard deviations.

How Is the Empirical Rule Used?

The empirical rule is applied to anticipate probable outcomes in a normal distribution. For instance, a statistician could use it to estimate the percentage of cases that fall in each standard deviation. Consider that the standard deviation is 3.1 and the mean equals 10. In this case, the first standard deviation would range between (10+3.2)= 13.2 and (10-3.2)= 6.8. The second deviation would fall between 10 + (2 X 3.2)= 16.4 and 10 – (2 X 3.2)= 3.6, and so forth.

What Are the Benefits of the Empirical Rule?

The empirical rule is beneficial because it serves as a means of forecasting data. This is especially true when it comes to large datasets and those where variables are unknown.

The Bottom Line

Analysts use the empirical rule to see how much data falls within a specified interval away from the data set’s mean. Investment analysts can use it to estimate the volatility of a particular investment, portfolio, or fund.

[ad_2]

Source link

Comments

No comments yet. Why don’t you start the discussion?

Leave a Reply

Your email address will not be published. Required fields are marked *