This library defines an embedded language for regular, multi-dimensional array computations with multiple backends to facilitate high-performance implementations. Currently, there are two backends: (1) an interpreter that serves as a reference implementation of the intended semantics of the language and (2) a CUDA backend generating code for CUDA-capable NVIDIA GPUs. To use the CUDA backend, you need to have CUDA version 3.x installed. The CUDA backend currently doesn't support 'Char' and 'Bool' arrays. Known bugs: <https://github.com/mchakravarty/accelerate/issues> * New in 0.9.0.0: Streaming, precompilation, Repa-style indices, stencils, more scans, rank-polymorphic fold, generate, block I/O & many bug fixes * New in 0.8.1.0: bug fixes and some performance tweaks * New in 0.8.0.0: 'replicate', 'slice' and 'foldSeg' supported in the CUDA backend; frontend and interpreter support for 'stencil'; bug fixes * New in 0.7.1.0: the CUDA backend and a number of scalar functions For documentation, see the homepage and <https://github.com/mchakravarty/accelerate/wiki>.