Perils of Simulation : Parallel Streams and the Case of Stata???s Rnormal Command
Large-scale simulation-based studies rely on at least three properties of pseudorandom number sequences. Since they behave like random numbers, they can be replicated and generated in parallel. However, there has been some divergence between empirical techniques employing random numbers and the standard battery of tests used to validate them. A random number generator that passes tests for any single stream of random numbers may fail the same tests when it is used to generate multiple streams in parallel. The lack of systematic testing of parallel streams leaves statistical software with important potential vulnerabilities. This paper reveals one such vulnerability in Stata's rnormal function which went unnoticed for almost four years and how this error was detected. Furthermore, the paper discusses practical implications for the use of parallel streams in existing software.