<r>I expect very similar trade-offs to the ones we've observed historically (e.g., for regular CPUs): Yes, optimizing compilers will miss some opportunities compared to, say, handcrafted assembly (think SSE/AVX vectorization, taking CPU-specific cache effects into account) -- but the flip side is in...