CLyther 0.1 Beta Released

April 25th, 2010

GeoSpin has released the first version of CLyther for beta testing. Please visit the CLyther SourceForge website for more information.  CLyther enables developers to seamlessly write GPGPU code completely in python with no additional syntax. CLyther’s core driver contains a python compiler to convert Python functions and types to OpenCL during runtime.

CLyther currently only supports a subset of the Python language definition but adds many new features to OpenCL such as:

  • OpenCL interface similar to PyOpenCL
  • Dynamic compilation of OpenCL code at runtime
  • Fast prototyping of OpenCL code
  • Create OpenCL code using the Python language definition
  • Passing functions as arguments to OpenCL kernels
  • Pure Python emulation mode of kernel functions

Read the rest of this entry »

Call for Authors: GPU Pro 2

April 12th, 2010

From the open call for authors:

After the tremendous success of the first seven entries to the ShaderX book series, and the upcoming success of the GPU Pro book, we are looking for authors for GPU Pro 2. The upcoming book will cover advanced rendering techniques that run on the DirectX and/or OpenGL run-time or any other run-time with any language available. It will include topics on: Geometry Manipulation; Rendering Techniques; Handheld Devices Programming; Effects in Image Space; Shadows; 3D Engine Design; Graphics Related Tools; Environmental Effects and a dedicated section on mathematics used in graphics programming.

Proposals are due by May 17th, 2010. Please send them to wolf at An example proposal, writing guidelines and a FAQ can be downloaded from

OpenCL Studio 1.0 beta released

April 5th, 2010

Geist Software Labs has released the first version of OpenCL Studio for beta testing. OpenCL Studio combines OpenCL and OpenGL into a single integrated development environment that allows you to visualize OpenCL computation using powerful 3D rendering techniques. The editor hides much of the complexity of the underlying APIs while still providing flexibility via the Lua scripting language. Integrated source code editors and debugging capabilities for OpenCL, GLSL, and Lua, as well as a toolbox of 2D user interface widgets provide a framework for a wide range of parallel programming solutions.

rCUDA 1.0 released

April 5th, 2010

The GAP (Universidad Politécnica de Valencia, Spain) and HPCA (Universidad Jaume I, Spain) research groups are proud to announce the public release of rCUDA 1.0. The rCUDA Framework enables the concurrent usage of CUDA-compatible devices remotely by employing the sockets API for communication between clients and servers. Thus, it can be useful in three different environments:

  • Clusters. To reduce the number of GPUs installed in High Performance Clusters. This leads to energy savings, as well as other related savings like acquisition costs, maintenance, space, cooling, etc.
  • Academia. In low performance networks, to offer access to a few high performance GPUs concurrently to all the students.
  • Virtual Machines. To enable the access to the CUDA facilities on the physical machine.

The current version of rCUDA (v1.0) implements all functions in the CUDA Runtime API version 2.3, excluding OpenGL and Direct3D interoperability. rCUDA 1.0 targets the Linux OS (for 32- and 64-bit architectures) on both client and server sides. The framework is free for any purpose under the terms and conditions of the GNU GPL/LGPL (where applicable) licenses.

For additional information, visit the rCUDA web page or Antonio Peña’s webpage.

CfP: Game Engines Gems 2

April 5th, 2010

After a very successful launch of the first volume of the Game Engine Gems series at GDC 2010, Jones and Bartlett Publishers is now accepting proposals for the second volume. The paper submission period for Game Engine Gems 2 is now open through June 15, 2010. To submit a proposal, please visit the official website.

As with the first volume, the theme of the book includes everything having to do with game engine design and implementation. Specific topics of interest include rendering techniques, shaders, OpenGL / DirectX, physics / collision detection, mathematics, programming techniques, engine architecture, visibility determination, audio, user interface, input devices, memory management, artificial intelligence, resource organization, and cross-platform considerations. This list is not exhaustive, and the editors are happy to evaluate any idea that pertains to making game engines.

NVIDIA Announces GPU Technology Conference 2010

March 31st, 2010

This year’s GPU Technology Conference (GTC 2010) will take place on Monday, Sept. 20 to Thursday, Sept. 23 at the San Jose Convention Center in San Jose, California.

Building on last year’s inaugural conference, GTC 2010 will feature an even broader and deeper selection of technical sessions, interactive tutorials, technology previews, and industry and academic presentations.

Three concurrent GPU-focused summits will occur under one roof:

  • Emerging Companies Summit: A showcase for innovative startups to demonstrate products and network with VC’s and other investors.
  • GPU Developers Summit: Sessions, tutorials, and presentations for developers, engineers, and scientists.
  • NVIDIA Research Summit: A unique opportunity for students, professors, and researchers to present their findings and collaborate.

For more information:

SpeedIT Toolkit 0.9.1 released

March 26th, 2010

The SpeedIT Tools library provides a set of accelerated solvers for sparse linear systems of equations. Manifold acceleration, e.g. more than an order of magnitude, is achieved with a single reasonably priced NVIDIA Graphics Processing Unit (GPU) that supports CUDA and proprietary advanced optimization techniques. The library can be used in a wide spectrum of domains arising from problems with underlying 2D and 3D geometry, such as computational fluid dynamics, electro-magnetics, thermodynamics, materials, acoustics, computer vision and graphics, robotics, semiconductor devices and structural engineering. The library can be also used for problems without defined geometry such as quantum chemistry, statistics, power networks and other graphs and chemical process simulation. All computations are performed with single or double floating point precision. Two version of SpeedIT toolkit have been released: The classic version provides a conjugate gradient solver, and the extreme edition provides optimized CG, BiCGSTAB, diagonal preconditioner, memory management, and heuristic-based analysis of input matrices.

Thrust v1.2 Released

March 23rd, 2010

Version 1.2 of Thrust, an open-source template library for developing CUDA applications, has been released. Modeled after the C++ Standard Template Library (STL), Thrust brings a familiar abstraction layer to the realm of GPU computing. This version adds several new features, including:

The Thrust web page provides a quick-start guide, online documentation, many examples and introductory slides. Thrust is open-source software distributed under the OSI-approved Apache License v2.0.

Palix Technologies launches ANDSolver beta program

March 23rd, 2010

Palix Technologies has introduced a new Computational Fluid Dynamics (CFD) product called ANDSolver that has been designed from the ground up to use Graphics Processing Units (GPUs) for fast and efficient aerodynamic analysis. Although developing and running applications to use multiple CPUs is a well established practice for high performance science and engineering simulations, a newer trend towards using GPUs for computation promises faster results with lower hardware acquisition and operating costs. ANDSolver delivers on that promise with up to a 10x speedup compared to a typical quad core CPU. This level of performance is unique in that it is achieved on unstructured meshes which have traditionally not been considered amenable to GPUs because of the memory access patterns. However, based on an innovative algorithm design to maximize the performance of the NVIDIA CUDA architecture, the ease and flexibility of unstructured meshing can now be used on high-performance, cost-effective GPUs.

A limited number of additional registrants will be accepted prior to our first production release in Q2 2010. More information can be found at for our current beta testing program.

Accelerating MATLAB Image Processing Toolbox Functions on GPUs

March 23rd, 2010


We present our effort in developing an open-source GPU (graphics processing units) code library for the MATLAB Image Processing Toolbox (IPT). We ported a dozen of representative functions from IPT and based on their inherent characteristics, we grouped these functions into four categories: data independent, data sharing, algorithm dependent and data dependent. For each category, we present a detailed case study, which reveals interesting insights on how to efficiently optimize the code for GPUs and highlight performance-critical hardware features, some of which have not been well explored in existing literature. Our results show drastic speedups for the functions in the data-independent or data-sharing category by leveraging hardware support judiciously; and moderate speedups for those in the algorithm-dependent category by careful algorithm selection and parallelization. For the functions in the last category, fine-grain synchronization and data-dependency requirements are the main obstacles to an efficient implementation on GPUs.

(J. Kong, et. al., “Accelerating MATLAB Image Processing Toolbox Functions on GPUs”, Proceedings of the Third Workshop on General-Purpose Computation on Graphics Processing Units (GPGPU-3), Pittsburgh, PA. Apr. 2010. Source code is available here.)

Page 20 of 39« First...10...1819202122...30...Last »