September 4th, 2012, 11:58 am
Dear all - Quick basic question on Artificial Neural Networks. I have gone through a handful of academic and illustrative articles on the topic online, and I understand (or at least I think so) how the NN is designed, how the weights are generated, fed forward, back propagated, the error corrections are made until the MSE (Mean Square Error) is minimized. However, inevitably every example I have seen trains the network with exactly 1 data point, probably merely to illustrate the concept. Let's assume I have 250 trading days worth of closing prices for 10 different securities, and I use 150 days worth of data to train my NN and the other 100 days to test the out-of-sample cases. The end product of the NN algorithm is to estimate and finalize on a set of optimal weights among the various neurons such that a sum product of those weights with the respective prices (or log prices, whatever the case may be) through the input and the hidden layers (after passing through a transfer function such as a sigmoid) can be used to predict the value of an endogenous variable (which will be the output of the network). As the NN is trained using each of the 150 trading days worth of data, wouldn't the optimal weights finalized for the 1st trading day have to be re-adjusted to suit the prices on the 2nd trading day, and so until the 150th trading day? And by the time the training is complete for all the 150 trading days worth of data, depending on the presence of the lack of meaningful patterns in the data, isn't there a huge risk of the optimal weights from the 250th day not even closely fitting the data from the 1st trading day? How does one overcome this? Sorry I have not been able to explain this any better. Thanks for your inputs.