ViennaCL: Linear Algebra on GPUs using OpenCL

June 15th, 2010

The Vienna Computing Library (ViennaCL) is a scientific computing library written in C++ and based on OpenCL. It allows simple, high-level access to the vast computing resources available on parallel architectures such as GPUs and is primarily focused on common linear algebra operations (BLAS level 1 and 2) and the solution of large systems of equations by means of iterative methods. The following iterative solvers are implemented:

  • Conjugate Gradient (CG)
  • Stabilized BiConjugate Gradient (BiCGStab)
  • Generalized Minimum Residual (GMRES)

Read the rest of this entry »

3 New Rugged GPGPU products from GE

June 15th, 2010

GE has introduced three new rugged computing products featuring integrated GPGPU technology using NVIDIA CUDA-capable GPUs.  The first is the IPN250 Rugged 6U OpenVPX Single Board Computer (SBC).  The second is the 6U OpenVPX NPN240 multi-processor. The NPN240 features two NVIDIA® CUDA-capable GT240 96-core GPUs, enabling it to deliver up to 750 GFLOP/S peak per card slot (depending on the application). Multiple NPN240s can be linked to one or more hosts to create multi-node CUDA GPU clusters capable of thousands of GFLOP/S.  The third is the OpenVPX-compatible GRA111 high performance graphics board, which is the first rugged implementation of a CUDA-capable GPU.

Libra 1.2 includes new OpenCL back end

June 8th, 2010

GPU Systems LogoGPU Systems has added an OpenCL back end implementation to its Libra Technology compiler and runtime architecture. Libra version 1.2 now supports x86/x64, OpenGL/OpenCL and CUDA compute back ends. The OpenCL back end generates dynamic code specifically for AMD GPUs. Also, the CUDA back end generator has been enhanced with Fermi capabilities and this new release brings full BLAS 1,2,3 matrix, vector, dense, sparse, complex, single/double standard math library functionality and access through a standard C programming interface & library. The high-level approach of the Libra API enables developers to easily extend existing high-level functionality from their favorite programming language.

Read the rest of this entry »

New OpenCL back-end in CAPS HMPP 2.3 hybrid compiler

June 6th, 2010

CAPS has recently added an OpenCL code generator to the just released 2.3 version of its HMPP directive-based hybrid compiler. Also, the CUDA back-end generator has been enhanced with Fermi capabilities and this new release brings support for more native compilers with Intel ifort/icc, GNU gcc/gfortran and PGI pgcc/pgfort compilers, enabling developers to freely use their favorite compiler with HMPP 2.3.

Based on GPU programming and tuning directives, HMPP offers an incremental programming model that allows developers with different levels of expertise to fully exploit GPU hardware accelerators in their legacy code. Read the rest of this entry »

GPGPU Wrapper for R Statistical Computing Environment

June 2nd, 2010

Jaideep Singh and Ipseeta Aruni present a GPGPU wrapper for the R statistical computing environment at the R user conference 2010. Their approach is to overload datatypes using R’s simplified wrapper and the SWIG Interface Generator functionality. A full page summary of the approach is available at the conference web site (PDF link).

Mellanox and NVIDIA introduce GPUDirect Technology

June 2nd, 2010

Mellanox and NVIDIA have teamed up to create a solution that enables data sharing (without expensive memory copies) between CUDA-managed host memory and Mellanox Infiniband cards. NVIDIA GPUDirect technology allows application and middleware developers to improve performance by up to 30%, by providing a shared, RDMA-accessible address space between the GPU and the interconnect.

The full press release is available here.

Intel Releases Knights Corner

June 2nd, 2010

At ISC’10, Intel demonstrated their co-processor approach to HPC (formerly known as Larrabee, now codenamed Knights Corner). A prototype of the Intel Many Integrated Core (MIC) architecture with 32 in-order cores, each equipped with a 512-wide vector unit and connected via an on-chip coherent cache, delivered more than half a Teraflop performance for LU decomposition in a live demonstration during a keynote by Kirk Skaugen.

The full press release from ISC’10 is available here.

Australia GPU Users Groups

June 1st, 2010

The Australia GPU Users groups are informal special interest groups founded to bring together GPU users from all fields and experience levels to learn and share their ideas and creations at friendly meetings.  There are currently GPU users groups forming in Brisbane, Sydney, and Perth.

The groups will discuss general GPU computing, including GPGPU, CUDA, OpenCL, DirectCompute, DirectX and OpenGL and related technologies. There will be short presentations during the meetings, as well as informal discussions on a range of subjects, including core fundamentals, hardware architectures, parallel programming as well as specific optimisations and also examples of applications from different fields of industry, science and multimedia.

Sign up today: the meetings will allow you to meet others who share your interest in GPUs. is maintaining a list of GPU Users groups.  If you have a local GPU users group, please tell us about it!

New NVIDIA Research & Certification Progams for CUDA/GPGPU

June 1st, 2010

At the ISC 2010 conference in Hamburg, Germany, this week, NVIDIA announced new programs for the growing CUDA/GPGPU developer community:

  • CUDA Certification Program – Driven by demand for qualified GPGPU engineers, this is the first program to certify expertise in massively parallel programming on GPUs.
  • CUDA Research Centers – Recognizes institutions that embrace GPU Computing across multiple research fields.
  • CUDA Teaching Centers – Recognizes institutions that have integrated GPU Computing techniques into their mainstream computer programming curriculum.

These programs complement the existing CUDA Center of Excellence program, which has recognized 10 premier institutions around the world. More details are available here:

Introductory Tutorial to OpenCL™ for HPC at SAAHPC’10

May 30th, 2010

AMD is offering an introductory tutorial to OpenCL™ that will be held alongside the 2010 Symposium on Application Accelerators in High Performance Computing (SAAHPC’10). The tutorial is a “programmer’s introduction” which covers the ideas behind OpenCL™ and their translation to source code. Read the rest of this entry »

Page 20 of 40« First...10...1819202122...3040...Last »