<t>I have a 8800GTS (96 processors / 384MB).Doing a MC simulation of a local volatility process and putting the local vol into texture memory I found a speedup of 130. This compared to similar code on one CPU of my core2duo@2.4GHz. The code on the CPU was not performance tuned, though. Using the tex...