GPU-Ocelot 2.0 Released

February 8th, 2011

GPU-Ocelot LogoOcelot 2.0.969 brings CUDA 3.2 and Fermi support to a stable release. Ocelot is a BSD-licensed open source implementation of the CUDA runtime, a PTX emulator, and a mid-level PTX compiler.

Here is a feature list for 2.0.969:

  • PTX 2.2 and Fermi device support: Floating point results should be within the ULP limits in the PTX ISA manual. Over 500 unit tests verify that the behaviour matches NVIDIA devices.
  • Four target device types: A functional PTX emulator. A PTX to LLVM to x86/ARM JIT. A PTX to CAL JIT for AMD devices (beta). A PTX to PTX JIT for NVIDIA devices.
  • A full-featured PTX 2.2 IR: An analysis/optimization pass interface over PTX (Control flow graph, dataflow graph, dominator/postdominator trees, structured control tree). Optimizations can be plugged in as modules.
  • Correctness checking tools: A memory checker (detects unaligned and out of bounds accesses). A race detector. An interactive debugger (allows stepping through PTX instructions).
  • An instruction trace analyzer interface: Allows user-defined modules to receive callbacks when PTX instructions are executed. Can be used to compute metrics over applications or perform correctness checks.
  • A CUDA API frontend: Existing CUDA programs can be directly linked against Ocelot. Device pointers can be shared across host threads. Multiple devices can be controlled from the same host thread (cudaSetDevice can be called multiple times).

Ocelot is available under a BSD license at http://code.google.com/p/gpuocelot.

    One Response to “GPU-Ocelot 2.0 Released”

    1. [...] This post was mentioned on Twitter by Raymond Tay, GPGPU .org. GPGPU .org said: GPU-Ocelot 2.0 Released: Ocelot 2.0.969 brings CUDA 3.2 and Fermi support to a stable release. Ocelot is a BSD-l… http://bit.ly/gfD75e [...]

    Leave a Comment