<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>GPGPU &#187; Category: Business :: GPGPU.org</title>
	<atom:link href="http://gpgpu.org/category/business/feed" rel="self" type="application/rss+xml" />
	<link>http://gpgpu.org</link>
	<description>General-Purpose Computation on Graphics Hardware</description>
	<lastBuildDate>Tue, 22 May 2012 08:44:05 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>NVIDIA Kepler GK110 Architecture White Paper</title>
		<link>http://gpgpu.org/2012/05/20/nvidia-kepler-gk110-paper</link>
		<comments>http://gpgpu.org/2012/05/20/nvidia-kepler-gk110-paper#comments</comments>
		<pubDate>Mon, 21 May 2012 00:35:21 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[GPUs]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4718</guid>
		<description><![CDATA[This white paper describes the new Kepler  GK110 Architecture from NVIDIA. Comprising 7.1 billion transistors, Kepler GK110 is not only the fastest, but also the most architecturally complex microprocessor ever built. Adding many new innovative features focused on compute performance, GK110 was designed to be a parallel processing powerhouse for Tesla® and the HPC market. Kepler GK110 will provide over [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_4719" class="wp-caption alignright" style="width: 160px"><a href="http://gpgpu.org/wp/wp-content/uploads/2012/05/nvidia_kepler2_die_shot.jpg"><img class="size-thumbnail wp-image-4719" title="nvidia_kepler2_die_shot" src="http://gpgpu.org/wp/wp-content/uploads/2012/05/nvidia_kepler2_die_shot-150x150.jpg" alt="" width="150" height="150" /></a><p class="wp-caption-text">NVIDIA Kepler GK110 Die Shot</p></div>
<p>This <a title="NVIDIA Kepler GK110 White Paper" href="http://www.nvidia.com/content/PDF/kepler/NVIDIA-Kepler-GK110-Architecture-Whitepaper.pdf" target="_blank">white paper</a> describes the new Kepler  GK110 Architecture from NVIDIA.</p>
<blockquote><p>Comprising 7.1 billion transistors, Kepler GK110 is not only the fastest, but also the most architecturally complex microprocessor ever built. Adding many new innovative features focused on compute performance, GK110 was designed to be a parallel processing powerhouse for Tesla® and the HPC market.</p>
<p>Kepler GK110 will provide over 1 TFlop of double precision throughput with greater than 80% DGEMM efficiency versus 60‐65% on the prior Fermi architecture.</p>
<p>In addition to greatly improved performance, the Kepler architecture offers a huge leap forward in power efficiency, delivering up to 3x the performance per watt of Fermi.</p></blockquote>
<p>The paper describes features of the Kepler GK110 architecture, including</p>
<ul>
<li>Dynamic Parallelism;</li>
<li>Hyper-Q;</li>
<li>Grid Management Unit;</li>
<li>NVIDIA GPUDirect™;</li>
<li>New SHFL instruction and atomic instruction enhancements;</li>
<li>New read-only data cache previously only accessible to texture;</li>
<li>Bindless Textures;</li>
<li>and much more.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/05/20/nvidia-kepler-gk110-paper/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CUVILib v1.2 released</title>
		<link>http://gpgpu.org/2012/05/17/cuvilib-v1-2</link>
		<comments>http://gpgpu.org/2012/05/17/cuvilib-v1-2#comments</comments>
		<pubDate>Thu, 17 May 2012 07:21:17 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Computer Vision]]></category>
		<category><![CDATA[Image Processing]]></category>
		<category><![CDATA[Libraries]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4698</guid>
		<description><![CDATA[TunaCode has released CUVILib v1.2, a library to accelerate imaging and computer vision applications. CUVILib adds acceleration to Imaging applications from Medical, Industrial and Defense domains. It delivers very high performance and supports both CUDA and OpenCL. Modules include color operations (demosaic, conversions, correction etc), linear/non-linear filtering, feature extraction &#38; tracking, motion estimation, image transforms [...]]]></description>
			<content:encoded><![CDATA[<p>TunaCode has released CUVILib v1.2, a library to accelerate imaging and computer vision applications. CUVILib adds acceleration to Imaging applications from Medical, Industrial and Defense domains. It delivers very high performance and supports both CUDA and OpenCL. Modules include color operations (demosaic, conversions, correction etc), linear/non-linear filtering, feature extraction &amp; tracking, motion estimation, image transforms and image statistics.</p>
<p>More information, including a free trial version: <a title="cuvilib home" href="http://www.cuvilib.com/" target="_blank">http://www.cuvilib.com/</a></p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/05/17/cuvilib-v1-2/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New Libra Platform version released</title>
		<link>http://gpgpu.org/2012/04/21/new-libra-version</link>
		<comments>http://gpgpu.org/2012/04/21/new-libra-version#comments</comments>
		<pubDate>Sat, 21 Apr 2012 09:05:14 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Heterogeneneous Computing]]></category>
		<category><![CDATA[MATLAB]]></category>
		<category><![CDATA[Programming Environments]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4649</guid>
		<description><![CDATA[Libra Platform is a GPGPU-Heterogeneous Compute API and runtime environment available on Windows, Mac and Linux. Libra Compute API offers performance portability and direct compute access via standard programming environments C/C++, Java, C# and Matlab to execute math operations on top of current and future compute architectures, including the latest GPUs, x86/x64 CPUs and with [...]]]></description>
			<content:encoded><![CDATA[<p>Libra Platform is a GPGPU-Heterogeneous Compute API and runtime environment available on Windows, Mac and Linux. Libra Compute API offers performance portability and direct compute access via standard programming environments C/C++, Java, C# and Matlab to execute math operations on top of current and future compute architectures, including the latest GPUs, x86/x64 CPUs and with broad support for compute devices compatible with low level specific APIs &#8211; OpenCL, CUDA, OpenGL and standard x86/x64 compute APIs.</p>
<p>Read more in the <a title="full announcements at gpusystems.com" href="http://www.gpusystems.com/doc/NewsReleaseGPGPU.pdf" target="_blank">full announcement</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/04/21/new-libra-version/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>2 Day CUDA Workshop, May 5-6 2012, Berlin, Germany</title>
		<link>http://gpgpu.org/2012/04/21/cuda-berlin-may-workshop</link>
		<comments>http://gpgpu.org/2012/04/21/cuda-berlin-may-workshop#comments</comments>
		<pubDate>Sat, 21 Apr 2012 08:59:29 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Courses]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4647</guid>
		<description><![CDATA[A 2 day CUDA workshop is taking place in Berlin, Germany on May 5 and 6 2012. Course details, outline and prices are available at http://cuda.eventbrite.com.]]></description>
			<content:encoded><![CDATA[<p>A 2 day CUDA workshop is taking place in Berlin, Germany on May 5 and 6 2012. Course details, outline and prices are available at <a title="course website" href="http://cuda.eventbrite.com/" target="_blank">http://cuda.eventbrite.com</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/04/21/cuda-berlin-may-workshop/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Acceleware OpenCL™ Training in NYC</title>
		<link>http://gpgpu.org/2012/02/28/acceleware-opencl-training-nyc</link>
		<comments>http://gpgpu.org/2012/02/28/acceleware-opencl-training-nyc#comments</comments>
		<pubDate>Wed, 29 Feb 2012 01:56:31 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Acceleware]]></category>
		<category><![CDATA[OpenCL]]></category>
		<category><![CDATA[Tutorials & Courses]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4529</guid>
		<description><![CDATA[Developed in partnership with AMD, this four day course is designed for GPU Programmers who are looking to develop comprehensive skills in writing and optimizing applications that fully leverage the multi-core processing capabilities of the GPU. Delivered by Acceleware’s Developers, who provide real world experience and examples, the training comprises classroom lectures and hands-on tutorials. [...]]]></description>
			<content:encoded><![CDATA[<p>Developed in partnership with AMD, <a href="http://acceleware.com/mar27New-york" target="_blank">this four day course</a> is designed for GPU Programmers who are looking to develop comprehensive skills in writing and optimizing applications that fully leverage the multi-core processing capabilities of the GPU.</p>
<p>Delivered by Acceleware’s Developers, who provide real world experience and examples, the training comprises classroom lectures and hands-on tutorials. Each student will be supplied with a laptop equipped with an AMD Fusion APU for the duration of the course. Small class sizes maximize learning and ensure a personal educational experience.<span id="more-4529"></span></p>
<p>Register before March 6 and receive $200 off your course fee! Enter promotional code AXTEB2012.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/02/28/acceleware-opencl-training-nyc/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SpeedIT 2.0 released</title>
		<link>http://gpgpu.org/2012/02/24/speedit-2-0</link>
		<comments>http://gpgpu.org/2012/02/24/speedit-2-0#comments</comments>
		<pubDate>Fri, 24 Feb 2012 06:55:07 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Fluid Simulation]]></category>
		<category><![CDATA[Libraries]]></category>
		<category><![CDATA[Numerical Algorithms]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4520</guid>
		<description><![CDATA[SpeedIT 2.0 and the SpeedIT plugin to OpenFOAM have been released. New features include: One of the fastest Sparse Matrix Vector Multiplication worldwide. Faster Conjugate Gradient and BiConjugate Gradient solvers. State-of-the-art CMRS format for storing sparse matrices. The format requires less memory than CRS or HYB (from CUSPARSE and CUSP). Faster acceleration in OpenFOAM (Computational [...]]]></description>
			<content:encoded><![CDATA[<p><a title="link to company web site" href="http://speed-it.vratis.com" target="_blank">SpeedIT 2.0</a> and the SpeedIT plugin to OpenFOAM have been released. New features include:</p>
<ul>
<li>One of the fastest Sparse Matrix Vector Multiplication worldwide.</li>
<li>Faster Conjugate Gradient and BiConjugate Gradient solvers.</li>
<li>State-of-the-art CMRS format for storing sparse matrices. The format requires less memory than CRS or HYB (from CUSPARSE and CUSP).</li>
<li>Faster acceleration in OpenFOAM (Computational Fluid Dynamics).</li>
</ul>
<p>More information is available at <a href="http://speed-it.vratis.com/" target="_blank">http://speed-it.vratis.com</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/02/24/speedit-2-0/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Performance of SpMV in CUSPARSE, CUSP and SpeedIT</title>
		<link>http://gpgpu.org/2012/01/14/performance-of-spmv-in-cusparse-cusp-and-speedit</link>
		<comments>http://gpgpu.org/2012/01/14/performance-of-spmv-in-cusparse-cusp-and-speedit#comments</comments>
		<pubDate>Sat, 14 Jan 2012 12:43:31 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Benchmarks]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Sparse Linear Systems]]></category>
		<category><![CDATA[Tools]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4384</guid>
		<description><![CDATA[The SpeedIt team recently compared and benchmarked the SpMV performance of CUSPARSE 4.0, CUSP 0.2.0 and SpeedIT 2.0 on 23 randomly chosen matrices from University Florida Matrix Collection. Comparisons were done on a Tesla C2050 in single and double precision. The full report is available at http://wp.me/p1ZihD-1.]]></description>
			<content:encoded><![CDATA[<p>The SpeedIt team recently compared and benchmarked the SpMV performance of CUSPARSE 4.0, CUSP 0.2.0 and SpeedIT 2.0 on 23 randomly chosen matrices from University Florida Matrix Collection. Comparisons were done on a Tesla C2050 in single and double precision. The full report is available at <a title="full benchmarking report" href="http://wp.me/p1ZihD-1" target="_blank">http://wp.me/p1ZihD-1</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/01/14/performance-of-spmv-in-cusparse-cusp-and-speedit/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Acceleware 4 Day CUDA Course</title>
		<link>http://gpgpu.org/2012/01/06/acceleware-4-day-cuda-course</link>
		<comments>http://gpgpu.org/2012/01/06/acceleware-4-day-cuda-course#comments</comments>
		<pubDate>Fri, 06 Jan 2012 12:08:40 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Tutorials & Courses]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4342</guid>
		<description><![CDATA[Partnering with NVIDIA and Microsoft, this four day course is designed for Programmers who are looking to develop comprehensive skills in writing and optimizing applications that fully leverage the multi-core processing capabilities of the GPU. Delivered by Acceleware’s Developers, who provide real world experience and examples, the training comprises classroom lectures and hands-on tutorials. Each [...]]]></description>
			<content:encoded><![CDATA[<p>Partnering with NVIDIA and Microsoft, this four day course is designed for Programmers who are looking to develop comprehensive skills in writing and optimizing applications that fully leverage the multi-core processing capabilities of the GPU.</p>
<p>Delivered by Acceleware’s Developers, who provide real world experience and examples, the training comprises classroom lectures and hands-on tutorials. Each student will be supplied with a laptop equipped with NVIDIA GPUs for the duration of the course. Small class sizes maximize learning and ensure a personal educational experience.</p>
<p>Register before January 13 and receive $250 off your course fee!<br />
Enter promotional code AXTEB2012</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/01/06/acceleware-4-day-cuda-course/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Introduction to Generic Accelerated Computing with Libra SDK</title>
		<link>http://gpgpu.org/2011/11/30/generic-accelerated-computing-libra-sdk</link>
		<comments>http://gpgpu.org/2011/11/30/generic-accelerated-computing-libra-sdk#comments</comments>
		<pubDate>Wed, 30 Nov 2011 07:35:49 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[OpenCL]]></category>
		<category><![CDATA[OpenGL]]></category>
		<category><![CDATA[Programming Environments]]></category>
		<category><![CDATA[Scientific Computing]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4230</guid>
		<description><![CDATA[Libra SDK is a sophisticated runtime including API, sample programs and documentation for massively accelerating software computations. This introduction tutorial provides an overview and usage examples of the powerful Libra API &#38; math libraries executing on x86/x64, OpenCL, OpenGL and CUDA technology. Libra API enables generic and portable CPU/GPU computing within software development without the [...]]]></description>
			<content:encoded><![CDATA[<p>Libra SDK is a sophisticated runtime including API, sample programs and documentation for massively accelerating software computations. This introduction tutorial provides an overview and usage examples of the powerful Libra API &amp; math libraries executing on x86/x64, OpenCL, OpenGL and CUDA technology. Libra API enables generic and portable CPU/GPU computing within software development without the need to create multiple, specific and optimized code paths to support x86, OpenCL, OpenGL or CUDA devices. Link to PDF: <a href="http://www.gpusystems.com/doc/LibraGenericComputing.pdf" target="_blank">www.gpusystems.com/doc/LibraGenericComputing.pdf</a></p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2011/11/30/generic-accelerated-computing-libra-sdk/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>GPU Virtualization for Dynamic GPU Provisioning</title>
		<link>http://gpgpu.org/2011/11/18/zillians-vgpu</link>
		<comments>http://gpgpu.org/2011/11/18/zillians-vgpu#comments</comments>
		<pubDate>Fri, 18 Nov 2011 08:23:53 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Press]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Libraries]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4181</guid>
		<description><![CDATA[From a recent press release: Taipei, November 18, 2011: Zillians, a leading cloud solution provider specializing in high performance computing, GPU virtualization middleware and massive multi-player online game (MMOG) platforms today announced the availability of vGPU &#8211; the world’s first commercial virtualization solution for decoupling GPU hardware from software. Traditionally, physical GPUs must reside on [...]]]></description>
			<content:encoded><![CDATA[<p>From a recent press release:</p>
<blockquote><p>Taipei, November 18, 2011: Zillians, a leading cloud solution provider specializing in high performance computing, GPU virtualization middleware and massive multi-player online game (MMOG) platforms today announced the availability of vGPU &#8211; the world’s first commercial virtualization solution for decoupling GPU hardware from software. Traditionally, physical GPUs must reside on the same machine running GPU code. This severely hampers GPU cloud deployment due to the difficulty of dynamic GPU provisioning. With vGPU technology, bulky hardware is no longer a limiting factor. vGPU introduces a thin, transparent RPC layer between local application and remote GPU, enabling existing GPU software to run without any modification on a remote GPU resource.<span id="more-4181"></span></p>
<p>Utilizing Mellanox Technologies InfiniBand RDMA I/O solutions and NVDIA GPUDirect technology, Zillians vGPU achieved peak data transfer rates between application and GPU hardware. This opens the door to a broader range of GPU applications that can take advantage of virtualized GPU resources, and also a boon for service providers or cloud operators looking to deploy next generation cloud services based on scalable GPU and Infiniband technologies. As interest in GPU-based HPC clusters continues to grow, vGPU represents a critical enabler for systems to realize resource sharing and improve system utilization across a variety of application domains.</p>
<p>“We are excited to collaborate with Zillians to provide new capabilities for high-performance and massive multi-player online gaming cloud platforms,” said Gilad Shainer, Senior Director, Market Development at Mellanox Technologies. “Mellanox InfiniBand I/O solutions with native RDMA support and GPU acceleration provide the essential technology for Zillians’ innovative vGPU solution.”</p>
<p>Availability</p>
<p>Zillians vGPU is now available for early evaluation request. Version 1.0 featuring support for NVIDIA CUDA 4.0 runtime and driver APIs will be released by the end of 2011. Version 2.0 featuring further support for OpenCL 1.1 is scheduled for release in 2012 Q1. To learn more, please visit <a href="http://www.zillians.com/vgpu" target="_blank">www.zillians.com/vgpu</a>.</p></blockquote>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2011/11/18/zillians-vgpu/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

