This paper by Fan et. al. at Stony Brook University presents the use of a cluster of commodity GPUs for high performance scientific computing. As an example application, they have developed a parallel flow simulation using the lattice Boltzmann model (LBM) on a GPU cluster and have simulated the dispersion of airborne contaminants in the Times Square area of New York City. Using 30 GPU nodes, their simulation can compute a 480 x 400 x 80 LBM in 0.31 second/step, a speed which is 4.6 times faster than that of their previous CPU cluster implementation. Besides the LBM, the paper also discusses other potential applications of the GPU cluster, such as cellular automata, PDE solvers, and FEM. (Zhe Fan, Feng Qiu, Arie Kaufman, Suzanne Yoakum-Stover, GPU Cluster for High Performance Computing, To Appear in Proceedings of the ACM/IEEE SuperComputing 2004 (SC’04), November, 2004)