February 8th, 2002, 3:23 pm
I echo matthewcroberts points. I know of no reason to ever test for normality. What you want to test for is specific types of deviations that will invalidate your methods. In some cases skewness is most important, in other cases kurtosis and in the multivariate case it is often true that non-normality in the marginals is less important than multivariate deviations.
The first question I always ask is are you testing for normality because you hope to find it or you hope not to find it? The first case is generally that you have a technique that is optimal for normal data, and you want to make sure its safe to use. The second case is that you want to learn something from your data, and deviations from normality tell you something. Typically it is applied to residuals from prior modeling, and you want to know if you have extracted all information and reduced your data to white noise (the effect of a large number of independent factors, none of which is individually important). Although this is logically suspect, there can be important information in normal variates, and there is certainly lots of worthless non-normal data, it works surprisingly often.
In the first case, the common statistical tools come with normality tests tailored to their needs. If your tool does not have one, and it's too complicated for analysis, a bootstrap is a good general approach. Resampled normal data are also normal, so if you put them through your tool the distribution of answers should have the theoretical standard deviation computed for normal data. If it is significantly different, your data are not normal enough to use. This is simple, requires no math, and works for any distribution or tool.
In the second case, you are looking for exploitable deviations. In finance, for example, it's often interesting to compute the dollar value of knowing the exact deviations from normality in your data. For stock returns, for example, assume you create a zero-premium portfolio by buying and selling call options at different exercise prices, using the BS price from the standard deviation, but collecting payoffs based on a random draw from your data. If the best portfolio has an expected return more than a few percent of the standard deviation of your position, there are economically important non-normalities in your data. Otherwise, no.