statistics Average could mean one of four things. The arithmetic mean, the median, midrange, or mode. For this reason, it is better to specify which average you're talking about.
Mean

This is what people usually intend when they say "average"

Population Mean: mu = ( sum x ) / N

Sample Mean:

Frequency Distribution:
The mean of a frequency distribution is also the weighted mean.
Median

The data must be ranked (sorted in ascending order) first. The median is the number in the middle.

To find the depth of the median, there are several formulas that could be used, the one that we will use is:
Depth of median = 0.5 * (n + 1)

Raw Data

The median is the number in the "depth of the median" position. If the sample size is even, the depth of the median will be a decimal -- you need to find the midpoint between the numbers on either side of the depth of the median.

Ungrouped Frequency Distribution

Find the cumulative frequencies for the data. The first value with a cumulative frequency greater than depth of the median is the median. If the depth of the median is exactly 0.5 more than the cumulative frequency of the previous class, then the median is the midpoint between the two classes.

Grouped Frequency Distribution

This is the tough one.

Since the data is grouped, you have lost all original information. Some textbooks have you simply take the midpoint of the class. This is an over-simplification which isn't the true value (but much easier to do). The correct process is to interpolate.

Find out what proportion of the distance into the median class the median by dividing the sample size by 2, subtracting the cumulative frequency of the previous class, and then dividing all that bay the frequency of the median class.

Multiply this proportion by the class width and add it to the lower boundary of the median class.

Mode

The mode is the most frequent data value. There may be no mode if no one value appears more than any other. There may also be two modes (bimodal), three modes (trimodal), or more than three modes (multi-modal).

For grouped frequency distributions, the modal class is the class with the largest frequency.
Midrange

The midrange is simply the midpoint between the highest and lowest values.
Summary

The Mean is used in computing other statistics (such as the variance) and does not exist for open ended grouped frequency distributions (1). It is often not appropriate for skewed distributions such as salary information.

The Median is the center number and is good for skewed distributions because it is resistant to change.

The Mode is used to describe the most typical case. The mode can be used with nominal data whereas the others can't. The mode may or may not exist and there may be more than one value for the mode (2).

Property Mean Median Mode Midrange
Always Exists No (1) Yes No (2) Yes
Uses all data values Yes No No No
Affected by extreme values Yes No No Yes

Related Posts:

  • Difference between Binomial Distribution and Poisson Distribution Binomial vs. Poisson Binomial DistributionPoisson Distribution Fixed Number of Trials (n)[10 pie throws]Infinite Number of Trials Only 2 Possible Outcomes[hit or miss]Unlimited Number of Outcomes Possible Probability of Suc… Read More
  • Difference between sample space and sample point In tossing a fair coin, there are only two possible outcomes, a Head (H) and a Tail (T).  If we let S be the set of all possible outcomes of this event, then, we write the set of possible outcomes as&nbs… Read More
  • 95% confidence interval "95% confidence interval". This means that if we used the same sampling method to select different samples and computed an interval estimate for each sample, we would expect the true population par… Read More
  • Bayes' Theorem Bayes' Theorem is a theorem of probability theory originally stated by the Reverend Thomas Bayes. It can be seen as a way of understanding how the probability that a theory is true is affected by a new piece of evidence. … Read More
  • Terminology of Correlation & Regression Terminology of Correlation & Regression Definitions Coefficient of Determination The percent of the variation that can be explained by the regression equation Correlation A method used to determine if a relationship … Read More

0 Comments:

Powered by Blogger.

Visitors

226154
Print Friendly Version of this pagePrint Get a PDF version of this webpagePDF


 download University Notes apps for android

Popular Posts

Flag Counter