September 2nd, 2004, 12:22 pm
QuoteOriginally posted by: r2338Aaron, I think that you're confusing in-sample fits with out of sample forecasts. In an efficient market, it is not the case that a forecast regression should not produce any statistically significant results.This is one way of putting the issue, I prefer to see it as a difference between finance and economics. Tmoi's post after mine explains the application, which is economics. Therefore, my answer is not applicable.Useful financial models usually deal with price movements, not prices. The fact that, say, the CPI and S&P500 have a high correlation is not financially meaningful. However, if changes in the CPI (the inflation rate) helped predict tomorrow's stock returns (change in the S&P500), it would be an important result for theory and practice.If you find a historical relation between price movements that would have generated excess profits had you known them in advance, you have a candidate exception to efficient markets. If the relation generates abnormal returns in the future, it is an exception, and a money-making opportunity. If it doesn't generate future profits, in finance we tend to ignore it. In economics, people continue to study it and invoke a "structural change" or something like that to explain why it stopped working.It's not that one field is right and the other wrong. In finance, we tend to assume that the people setting the prices in the past correctly anticipated that something different might have happened. The historical price is correct, there was a coin flip that caused the relation to be correct, it might have gone the other way and people who bet on the relation would have lost. Why doesn't matter, or more accurately, is too complex for useful discussion. Economists tend to assume that what happened happened, and then try to explain why. The relative values of these two approaches can be easily measured in the marketplace.QuoteOriginally posted by: tmoiAaron or anyone else reading this: Is there a way to determine when the correlation between dependant variable and independant variable x4 is affecting the model and when to drop it and switch to the model that does not use x4?This is not an easy question. The usual criterion is adjusted-R^2. That is, you ask whether the new variable reduces the residual sum of squares by more than an unrelated variable would do 1 time in 20. You can think of it as generating 40 random series (using RAND() for example) and putting them in the regression one at a time instead of x4. If two or more of them give a lower residual R^2 than x4, you throw out x4.While this approach makes some sense as a tactic, it's a bad strategy. If all you care about is whether x4 affects your dependent variable, after allowing for effects of the other independent variables, adjusted-R^2 is fine. But if you select a set of variables in a stepwise fashion using adjusted-R^2 (or t-statistics or anything else), you get bad regressions.