| |
|
|
||||||||||||||||||||
|
||||||||||||||||||||||
|
Check for outliers, equal variance, normality, and model validity
On the boxplot, potential outliers are given the * symbol outside the box (not the two that form part of the box and that indicate the median). The boxplot at right is an example where there are no outliers.
In the example below, one potential outlier can be seen on the boxplot. This observation corresponds to the extreme value on the Stem-Leaf plot, and can be seen in the Extreme Observations table at the top of the image. The Extreme Observations table identifies the potential outlier (121.433) as observation number 11.
If you do a transformation, re-run the ANOVA and the asterisk remains in the boxplot, that is stronger evidence for an outlier. At this point, you must decide whether to delete the extreme value or not. If scientifically invalid (data entry errors, experimental procedure errors), the outlier should be deleted from the analysis. Simply replace the value with a period and SAS will treat it as missing. This process is the same whether the data are in your SAS program or being imported from an external file. If you have deleted an outlier(s), then eventually you will have to re-run your SAS analysis on the revised data set. But first proceed to the next step and check for equal variance. |
|
| H I N T S : |
| |
|
use this as distinct from an "outlier" (a real or actual outlier), which is invalid data. Do not automatically discard apparent outliers. |
| |
|
other outliers to be identified. |
| |
|
you may need to return to your analysis module and re-run SAS (maybe many times). |
Home | Contact us | Module list & summary | Glossary/Terms | About this site | Stats courses | Links | Index |