<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>GPGPU &#187; Category: Developer Resources :: GPGPU.org</title>
	<atom:link href="http://gpgpu.org/category/developer-resources/feed" rel="self" type="application/rss+xml" />
	<link>http://gpgpu.org</link>
	<description>General-Purpose Computation on Graphics Hardware</description>
	<lastBuildDate>Mon, 06 Feb 2012 04:59:24 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>New CLOGS library with sort and scan primitives for OpenCL</title>
		<link>http://gpgpu.org/2012/02/05/clogs-library</link>
		<comments>http://gpgpu.org/2012/02/05/clogs-library#comments</comments>
		<pubDate>Mon, 06 Feb 2012 04:59:24 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Libraries]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[OpenCL]]></category>
		<category><![CDATA[Performance Primitives]]></category>
		<category><![CDATA[Sorting]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4451</guid>
		<description><![CDATA[CLOGS is a library for higher-level operations on top of the OpenCL C++ API. It is designed to integrate with other OpenCL code, including synchronization using OpenCL events. Currently only two operations are supported: radix sorting and exclusive scan. Radix sort supports all the unsigned integral types as keys, and all the built-in scalar and [...]]]></description>
			<content:encoded><![CDATA[<p>CLOGS is a library for higher-level operations on top of the OpenCL C++ API. It is designed to integrate with other OpenCL code, including synchronization using OpenCL events. Currently only two operations are supported: radix sorting and exclusive scan. Radix sort supports all the unsigned integral types as keys, and all the built-in scalar and vector types suitable for storage in buffers as values. Scan supports all the integral types. It also supports vector types, which allows for limited multi-scan capabilities.</p>
<p>Version 1.0 of the library has just been released. The home page is <a title="CLOGS page on sourceforge" href="http://clogs.sourceforge.net/" target="_blank">http://clogs.sourceforge.net/</a></p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/02/05/clogs-library/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New GPU &amp; HPC meetup group in Pune, India</title>
		<link>http://gpgpu.org/2012/02/01/pune-india-meetup</link>
		<comments>http://gpgpu.org/2012/02/01/pune-india-meetup#comments</comments>
		<pubDate>Wed, 01 Feb 2012 07:55:19 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[India]]></category>
		<category><![CDATA[meetups]]></category>
		<category><![CDATA[User Groups]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4444</guid>
		<description><![CDATA[A new GPU and high-performance computing meetup group has been formed in Pune, India.   The informal special interest group will bring together GPU users from all fields and experience levels in India, including academicians, researchers, scientists, device manufacturers, system integrators, service providers and all early adopters of HPC &#38; GPU computing. The group, hosted [...]]]></description>
			<content:encoded><![CDATA[<p>A new <a href="http://www.meetup.com/HPC-and-GPU-Computing-Group-India">GPU and high-performance computing meetup group</a> has been formed in Pune, India.   The informal special interest group will bring together GPU users from all fields and experience levels in India, including academicians, researchers, scientists, device manufacturers, system integrators, service providers and all early adopters of HPC &amp; GPU computing. The group, hosted on Meetup.com, will provide HPC and GPU computing enthusiasts in India a comprehensive platform to track industry trends and engage with each other, discussing the latest developments in the field.</p>
<p>The group will have a core group of key academicians to lead and moderate discussions. The site will feature a bank of research papers, case studies and posts on the latest GPU-related technological developments. The meetup group will also encourage users to engage and interact over group chats and web conferences.  You can find the group at</p>
<p><a href="http://www.meetup.com/HPC-and-GPU-Computing-Group-India">http://www.meetup.com/HPC-and-GPU-Computing-Group-India</a></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/02/01/pune-india-meetup/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CUDA 4.1 Released</title>
		<link>http://gpgpu.org/2012/01/26/cuda-4-1</link>
		<comments>http://gpgpu.org/2012/01/26/cuda-4-1#comments</comments>
		<pubDate>Fri, 27 Jan 2012 04:06:55 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Compilers]]></category>
		<category><![CDATA[Debugging]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Profiling]]></category>
		<category><![CDATA[Programming Languages]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4422</guid>
		<description><![CDATA[Today NVIDIA released CUDA 4.1, including a new CUDA Toolkit, SDK, Visual Profiler, Parallel Nsight IDE and NVIDIA device driver. CUDA 4.1 makes it easier to accelerate scientific research with GPUs with key features including a redesigned Visual Profiler with automated performance analysis and expert guidance; a new LLVM-based compiler that generates up to 10% faster [...]]]></description>
			<content:encoded><![CDATA[<p>Today NVIDIA released <a href="http://www.developer.nvidia.com/cuda-toolkit-41" target="_blank">CUDA 4.1</a>, including a new CUDA Toolkit, SDK, Visual Profiler, Parallel Nsight IDE and NVIDIA device driver.</p>
<p>CUDA 4.1 makes it easier to accelerate scientific research with GPUs with key features including</p>
<ul>
<li>a redesigned Visual Profiler with automated performance analysis and expert guidance;</li>
<li>a new LLVM-based compiler that generates up to 10% faster code; and</li>
<li>1000+ new imaging and signal processing functions in the NPP library.</li>
</ul>
<p>The CuSparse library included with CUDA 4.1 has a new tridiagonal solver and 2x faster sparse matrix-vector multiplication using the ELL hybrid format, and the CuRand library included with CUDA 4.1 has two new random number generators. <span id="more-4422"></span> The CUDA 4.1 toolkit also brings some great improvements to its debugging and performance analysis tools.</p>
<p>Sign up for a webinar to learn more about all the new features &amp; high performance GPU-accelerated libraries!</p>
<p>CUDA 4.1 Toolkit 4.1 Feature Overview Webinar</p>
<ul>
<li><a href="https://www2.gotomeeting.com/register/955690146" target="_blank">For Europe and The Americas: 10am (PST), Wednesday, Feb 1</a></li>
<li><a href="  https://www2.gotomeeting.com/register/187844386" target="_blank">For Asia-Pacific and India:  10am (IST) Friday, Feb 3</a></li>
</ul>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/01/26/cuda-4-1/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CLCC v0.3.0 now available</title>
		<link>http://gpgpu.org/2012/01/16/clcc-v0-3-0</link>
		<comments>http://gpgpu.org/2012/01/16/clcc-v0-3-0#comments</comments>
		<pubDate>Mon, 16 Jan 2012 08:20:36 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Compilers]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[OpenCL]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4393</guid>
		<description><![CDATA[CLCC, the light-weight and flexible utility for integrating OpenCL source builds into your project has just been updated to version 0.3.0. This version allows developers to save compiled binaries as object files for distribution with their programs and adds a series of options to select specific target platform/device combinations. Documentation and further information is available [...]]]></description>
			<content:encoded><![CDATA[<p>CLCC, the light-weight and flexible utility for integrating OpenCL source builds into your project has just been updated to version 0.3.0. This version allows developers to save compiled binaries as object files for distribution with their programs and adds a series of options to select specific target platform/device combinations. Documentation and further information is available at <a title="CLCC homepage" href="http://clcc.sourceforge.net/" target="_blank">http://clcc.sourceforge.net</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/01/16/clcc-v0-3-0/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Performance of SpMV in CUSPARSE, CUSP and SpeedIT</title>
		<link>http://gpgpu.org/2012/01/14/performance-of-spmv-in-cusparse-cusp-and-speedit</link>
		<comments>http://gpgpu.org/2012/01/14/performance-of-spmv-in-cusparse-cusp-and-speedit#comments</comments>
		<pubDate>Sat, 14 Jan 2012 12:43:31 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Benchmarks]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Sparse Linear Systems]]></category>
		<category><![CDATA[Tools]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4384</guid>
		<description><![CDATA[The SpeedIt team recently compared and benchmarked the SpMV performance of CUSPARSE 4.0, CUSP 0.2.0 and SpeedIT 2.0 on 23 randomly chosen matrices from University Florida Matrix Collection. Comparisons were done on a Tesla C2050 in single and double precision. The full report is available at http://wp.me/p1ZihD-1.]]></description>
			<content:encoded><![CDATA[<p>The SpeedIt team recently compared and benchmarked the SpMV performance of CUSPARSE 4.0, CUSP 0.2.0 and SpeedIT 2.0 on 23 randomly chosen matrices from University Florida Matrix Collection. Comparisons were done on a Tesla C2050 in single and double precision. The full report is available at <a title="full benchmarking report" href="http://wp.me/p1ZihD-1" target="_blank">http://wp.me/p1ZihD-1</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/01/14/performance-of-spmv-in-cusparse-cusp-and-speedit/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Acceleware 4 Day CUDA Course</title>
		<link>http://gpgpu.org/2012/01/06/acceleware-4-day-cuda-course</link>
		<comments>http://gpgpu.org/2012/01/06/acceleware-4-day-cuda-course#comments</comments>
		<pubDate>Fri, 06 Jan 2012 12:08:40 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Tutorials & Courses]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4342</guid>
		<description><![CDATA[Partnering with NVIDIA and Microsoft, this four day course is designed for Programmers who are looking to develop comprehensive skills in writing and optimizing applications that fully leverage the multi-core processing capabilities of the GPU. Delivered by Acceleware’s Developers, who provide real world experience and examples, the training comprises classroom lectures and hands-on tutorials. Each [...]]]></description>
			<content:encoded><![CDATA[<p>Partnering with NVIDIA and Microsoft, this four day course is designed for Programmers who are looking to develop comprehensive skills in writing and optimizing applications that fully leverage the multi-core processing capabilities of the GPU.</p>
<p>Delivered by Acceleware’s Developers, who provide real world experience and examples, the training comprises classroom lectures and hands-on tutorials. Each student will be supplied with a laptop equipped with NVIDIA GPUs for the duration of the course. Small class sizes maximize learning and ensure a personal educational experience.</p>
<p>Register before January 13 and receive $250 off your course fee!<br />
Enter promotional code AXTEB2012</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/01/06/acceleware-4-day-cuda-course/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ViennaCL 1.2.0 released</title>
		<link>http://gpgpu.org/2012/01/02/viennacl-1-2-0-released</link>
		<comments>http://gpgpu.org/2012/01/02/viennacl-1-2-0-released#comments</comments>
		<pubDate>Mon, 02 Jan 2012 09:51:24 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Libraries]]></category>
		<category><![CDATA[Linear Algebra]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[OpenCL]]></category>
		<category><![CDATA[Scientific Computing]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4325</guid>
		<description><![CDATA[Version 1.2.0 of the OpenCL-based C++ linear algebra library ViennaCL is now available for download! It features a high-level interface compatible with Boost.ublas, which allows for compact code and high productivity. Highlights of the new release are the following features (all experimental): Several algebraic multigrid preconditioners Sparse approximate inverse preconditioners Fast Fourier transform Structured dense [...]]]></description>
			<content:encoded><![CDATA[<p>Version 1.2.0 of the OpenCL-based C++ linear algebra library ViennaCL is now available for download! It features a high-level interface compatible with Boost.ublas, which allows for compact code and high productivity. Highlights of the new release are the following features (all experimental):</p>
<ul>
<li>Several algebraic multigrid preconditioners</li>
<li>Sparse approximate inverse preconditioners</li>
<li>Fast Fourier transform</li>
<li>Structured dense matrices (circulant, Hankel, Toeplitz, Vandermonde)</li>
<li>Reordering algorithms (Cuthill-McKee, Gibbs-Poole-Stockmeyer)</li>
<li>Proxies for manipulating subvectors and submatrices</li>
</ul>
<p>The features are expected to reach maturity in the 1.2.x branch. More information about the library including download links is available at <a title="ViennaCL on SourceForge" href="http://viennacl.sourceforge.net/" target="_blank">http://viennacl.sourceforge.net</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/01/02/viennacl-1-2-0-released/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>FortranCL: An OpenCL interface for Fortran 90</title>
		<link>http://gpgpu.org/2011/12/30/fortrancl</link>
		<comments>http://gpgpu.org/2011/12/30/fortrancl#comments</comments>
		<pubDate>Fri, 30 Dec 2011 08:55:20 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Fortran]]></category>
		<category><![CDATA[Libraries]]></category>
		<category><![CDATA[OpenCL]]></category>
		<category><![CDATA[Programming Languages]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4320</guid>
		<description><![CDATA[FortranCL is an interface to OpenCL from Fortran90 programs, and it is distributed under the LGPL free software license. It allows Fortran programmer to directly execute code on GPUs or other massively parallel processors. The interface is designed to be as close to the C OpenCL interface as possible, and it is written in native [...]]]></description>
			<content:encoded><![CDATA[<p>FortranCL is an interface to OpenCL from Fortran90 programs, and it is distributed under the LGPL free software license. It allows Fortran programmer to directly execute code on GPUs or other massively parallel processors. The interface is designed to be as close to the C OpenCL interface as possible, and it is written in native Fortran 90 with type checking. FortranCL is not complete yet, but it includes enough subroutines to write GPU accelerated code in Fortran. More information: <a title="link to googlecode" href="http://code.google.com/p/fortrancl/" target="_blank">http://code.google.com/p/fortrancl/</a></p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2011/12/30/fortrancl/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>HOOMD-blue 0.10.0 release</title>
		<link>http://gpgpu.org/2011/12/19/hoomd-blue-0-10-0-release</link>
		<comments>http://gpgpu.org/2011/12/19/hoomd-blue-0-10-0-release#comments</comments>
		<pubDate>Mon, 19 Dec 2011 07:44:41 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[High-Performance Computing]]></category>
		<category><![CDATA[Molecular Dynamics]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Open Source]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4279</guid>
		<description><![CDATA[HOOMD-blue performs general-purpose particle dynamics simulations on a single workstation, taking advantage of NVIDIA GPUs to attain a level of performance equivalent to many cores on a fast cluster. Flexible and configurable, HOOMD-blue is currently being used for coarse-grained molecular dynamics simulations of nano-materials, glasses, and surfactants, dissipative particle dynamics simulations (DPD) of polymers, and [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://codeblue.umich.edu/hoomd-blue/">HOOMD-blue</a> performs general-purpose particle dynamics simulations on a single workstation, taking advantage of NVIDIA GPUs to attain a level of performance equivalent to many cores on a fast cluster. Flexible and configurable, HOOMD-blue is currently being used for coarse-grained molecular dynamics simulations of nano-materials, glasses, and surfactants, dissipative particle dynamics simulations (DPD) of polymers, and crystallization of metals.</p>
<p>HOOMD-blue 0.10.0 adds many new features. Highlights include:<span id="more-4279"></span></p>
<ul>
<li>Added <strong>pair.dpdlj</strong> which uses the <span class="caps">DPD </span>thermostat and the Lennard-Jones potential. In previous versions, this could be accomplished by using two pair commands but at the cost of reduced performance.</li>
<li>Additional example scripts are now present in the documentation. The example scripts are cross-linked to the commands that are used in them.</li>
<li>Most dump commands now accept the form: <strong>dump.ext(filename=&#8221;filename.ext&#8221;)</strong> which immediately writes out filename.ext.</li>
<li>Specify rigid bodies in <span class="caps">XML </span>input files</li>
<li>Simulations that contain rigid body constraints applied to groups of particles in <span class="caps">BDNVT, NVE, NVT, </span>and <span class="caps">NPT </span>ensembles.</li>
<li>Energy minimization of rigid bodies ( <strong>integrate.mode_minimize_rigid_fire</strong> )</li>
<li>Existing commands are now rigid-body aware</li>
<li><span class="caps">NVT </span>integration using the Berendsen thermostat ( <strong>integrate.berendsen</strong> )</li>
<li>Bonds, angles, dihedrals, and impropers can now be created and deleted with the python data access <span class="caps">API.</span></li>
<li>and <a href="http://codeblue.umich.edu/hoomd-blue/">more</a></li>
</ul>
<p>HOOMD-blue 0.10.0 is available for <a href="http://codeblue.umich.edu/hoomd-blue/download.html">download</a> under an open source license. Check out the <a href="http://codeblue.umich.edu/hoomd-blue/doc/page_quick_start.html">quick start tutorial</a> to get started, or check out the <a href="http://codeblue.umich.edu/hoomd-blue/doc/index.html">full documentation</a> to see everything it can do.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2011/12/19/hoomd-blue-0-10-0-release/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Intel SPMD Compiler Version 1.1 Released</title>
		<link>http://gpgpu.org/2011/12/07/intel-spmd-compiler-version-1-1-released</link>
		<comments>http://gpgpu.org/2011/12/07/intel-spmd-compiler-version-1-1-released#comments</comments>
		<pubDate>Wed, 07 Dec 2011 18:25:38 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Compilers]]></category>
		<category><![CDATA[Intel]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[SPMD]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4251</guid>
		<description><![CDATA[A major new release of the Intel SPMD Program Compiler (ispc) was posted on December 5, 2011. ispc is an extended version of the C programming language with support for &#8220;single program, multiple data&#8221; (SPMD) programming on the CPU; the SPMD model makes it easy to harness the full power of both the SIMD vector [...]]]></description>
			<content:encoded><![CDATA[<p>A major new release of the Intel SPMD Program Compiler (ispc) was posted on December 5, 2011. ispc is an extended version of the C programming language with support for &#8220;single program, multiple data&#8221; (SPMD) programming on the CPU; the SPMD model makes it easy to harness the full power of both the SIMD vector units and multiple cores on modern CPUs. The major features added in the 1.1 release include:</p>
<ul>
<li>Full support for pointers, including pointer arithmetic, function pointers, and all other features of pointers in C.</li>
<li>A new parallel &#8220;foreach&#8221; statement, for more easily mapping computation to data.</li>
<li>Substantially revised documentation, including a new Performance Guide.</li>
<li>Many other small bug fixes and improvements.</li>
</ul>
<p>ispc is open-source and is licensed under the BSD license. Source and binaries are available from <a href="http://ispc.github.com" target="_blank">http://ispc.github.com</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2011/12/07/intel-spmd-compiler-version-1-1-released/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

