<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>GPGPU &#187; Tag: Sparse Linear Systems :: GPGPU.org</title>
	<atom:link href="http://gpgpu.org/tag/sparse-linear-systems/feed" rel="self" type="application/rss+xml" />
	<link>http://gpgpu.org</link>
	<description>General-Purpose Computation on Graphics Hardware</description>
	<lastBuildDate>Mon, 06 Feb 2012 04:59:24 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Performance of SpMV in CUSPARSE, CUSP and SpeedIT</title>
		<link>http://gpgpu.org/2012/01/14/performance-of-spmv-in-cusparse-cusp-and-speedit</link>
		<comments>http://gpgpu.org/2012/01/14/performance-of-spmv-in-cusparse-cusp-and-speedit#comments</comments>
		<pubDate>Sat, 14 Jan 2012 12:43:31 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Benchmarks]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Sparse Linear Systems]]></category>
		<category><![CDATA[Tools]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4384</guid>
		<description><![CDATA[The SpeedIt team recently compared and benchmarked the SpMV performance of CUSPARSE 4.0, CUSP 0.2.0 and SpeedIT 2.0 on 23 randomly chosen matrices from University Florida Matrix Collection. Comparisons were done on a Tesla C2050 in single and double precision. The full report is available at http://wp.me/p1ZihD-1.]]></description>
			<content:encoded><![CDATA[<p>The SpeedIt team recently compared and benchmarked the SpMV performance of CUSPARSE 4.0, CUSP 0.2.0 and SpeedIT 2.0 on 23 randomly chosen matrices from University Florida Matrix Collection. Comparisons were done on a Tesla C2050 in single and double precision. The full report is available at <a title="full benchmarking report" href="http://wp.me/p1ZihD-1" target="_blank">http://wp.me/p1ZihD-1</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/01/14/performance-of-spmv-in-cusparse-cusp-and-speedit/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CULA Sparse Now Available</title>
		<link>http://gpgpu.org/2011/11/10/cula-sparse-now-available</link>
		<comments>http://gpgpu.org/2011/11/10/cula-sparse-now-available#comments</comments>
		<pubDate>Thu, 10 Nov 2011 09:09:48 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Business]]></category>
		<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Libraries]]></category>
		<category><![CDATA[Numerical Algorithms]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Sparse Linear Systems]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4131</guid>
		<description><![CDATA[EM Photonics has released CULA Sparse, a ready-to-integrate package for solving sparse linear systems. Features include: Interfaces: C, C++, Fortran, Matlab, Python Platforms: all CUDA platforms. including Linux, Windows, and OS X Solvers and preconditioners: BiCG, BiCGStab, CG, GMRES, MINRES and Jacobi, ILU(0) Data formats: COO, CSR, CSC in double precision real and complex floating [...]]]></description>
			<content:encoded><![CDATA[<p>EM Photonics has released CULA Sparse, a ready-to-integrate package for solving sparse linear systems. Features include:</p>
<ul>
<li>Interfaces: C, C++, Fortran, Matlab, Python</li>
<li>Platforms: all CUDA platforms. including Linux, Windows, and OS X</li>
<li>Solvers and preconditioners: BiCG, BiCGStab, CG, GMRES, MINRES and Jacobi, ILU(0)</li>
<li>Data formats: COO, CSR, CSC in double precision real and complex floating point</li>
<li>No CUDA programming experience required.</li>
</ul>
<p>More information is available at <a href="http://www.culatools.com/sparse/" target="_blank">http://www.culatools.com/sparse</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2011/11/10/cula-sparse-now-available/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Exposing Fine-Grained Parallelism in Algebraic Multigrid Methods</title>
		<link>http://gpgpu.org/2011/08/04/algebraic-multigrid</link>
		<comments>http://gpgpu.org/2011/08/04/algebraic-multigrid#comments</comments>
		<pubDate>Thu, 04 Aug 2011 07:00:52 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Iterative Solvers]]></category>
		<category><![CDATA[Multigrid]]></category>
		<category><![CDATA[Numerical Algorithms]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Sparse Linear Systems]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=3807</guid>
		<description><![CDATA[Abstract: Algebraic multigrid methods for large, sparse linear systems are a necessity in many computational simulations, yet parallel algorithms for such solvers are generally decomposed into coarse-grained tasks suitable for distributed computers with traditional processing cores. However, accelerating multigrid on massively parallel throughput-oriented processors, such as the GPU, demands algorithms with abundant fine-grained parallelism. In [...]]]></description>
			<content:encoded><![CDATA[<p>Abstract:</p>
<blockquote><p>Algebraic multigrid methods for large, sparse linear systems are a necessity in many computational simulations, yet parallel algorithms for such solvers are generally decomposed into coarse-grained tasks suitable for distributed computers with traditional processing cores. However, accelerating multigrid on massively parallel throughput-oriented processors, such as the GPU, demands algorithms with abundant fine-grained parallelism. In this paper, we develop a parallel algebraic multigrid method which exposes substantial fine-grained parallelism in both the construction of the multigrid hierarchy as well as the cycling or solve stage. Our algorithms are expressed in terms of scalable parallel primitives that are efficiently implemented on the GPU. The resulting solver achieves an average speedup of over 2x in the setup phase and around 6x in the cycling phase when compared to a representative CPU implementation.</p></blockquote>
<p>(Nathan Bell, Steven Dalton and Luke Olson: <em>&#8220;Exposing Fine-Grained Parallelism in Algebraic Multigrid Methods&#8221;</em>, NVIDIA Technical Report NVR-2011-002, June 2011 [<a href="http://research.nvidia.com/publication/exposing-fine-grained-parallelism-algebraic-multigrid-methods" target="_blank">PDF and Sources</a>])</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2011/08/04/algebraic-multigrid/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Enhanced Parallel ILU(p)-based Preconditioners for Multi-core CPUs and GPUs &#8212; The Power(q)-pattern Method</title>
		<link>http://gpgpu.org/2011/07/08/parallel-ilup-based-preconditioners</link>
		<comments>http://gpgpu.org/2011/07/08/parallel-ilup-based-preconditioners#comments</comments>
		<pubDate>Fri, 08 Jul 2011 07:19:38 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Multicore]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Sparse Linear Systems]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=3741</guid>
		<description><![CDATA[Abstract: Application demands and grand challenges in numerical simulation require for both highly capable computing platforms and efficient numerical solution schemes. Power constraints and further miniaturization of modern and future hardware give way for multi- and manycore processors with increasing fine-grained parallelism and deeply nested hierarchical memory systems &#8212; as already exemplified by recent graphics [...]]]></description>
			<content:encoded><![CDATA[<p>Abstract:</p>
<blockquote><p>Application demands and grand challenges in numerical simulation require for both highly capable computing platforms and efficient numerical solution schemes. Power constraints and further miniaturization of modern and future hardware give way for multi- and manycore processors with increasing fine-grained parallelism and deeply nested hierarchical memory systems &#8212; as already exemplified by recent graphics processing units. Accordingly, numerical schemes need to be adapted and re-engineered in order to deliver scalable solutions across diverse processor configurations. Portability of parallel software solutions across emerging hardware platforms is another challenge. This work investigates multi-coloring and re-ordering schemes for block Gauss-Seidel methods and, in particular, for incomplete LU factorizations with and without fill-ins. We consider two matrix re-ordering schemes that deliver flexible and efficient parallel preconditioners. The general idea is to generate block decompositions of the system matrix such that the diagonal blocks are diagonal itself. In such a way, parallelism can be exploited on the block-level in a scalable manner. Our goal is to provide widely applicable, out-of-the-box preconditioners that can be used in the context of finite element solvers.</p>
<p>We propose a new method for anticipating the fill-in pattern of ILU(p) schemes which we call the power(q)-pattern method. This method is based on an incomplete factorization of the system matrix A subject to a predetermined pattern given by the matrix power |A|<sup>p+1</sup> and its associated multi-coloring permutation pi. We prove that the obtained sparsity pattern is a superset of our modified ILU(p) factorization applied to pi A p<sup>i-1</sup>. As a result, this modified ILU(p) applied to multi-colored system matrix has no fill-ins in its diagonal blocks. This leads to an inherently parallel execution of triangular ILU(p) sweeps.</p>
<p>In addition, we describe the integration of the preconditioners into the HiFlow<sup>3</sup> open-source finite element package that provides a portable software solution across diverse hardware platforms. On this basis, we conduct performance analysis across a variety of test problems on multi-core CPUs and GPUs that proves efficiency, scalability and flexibility of our approach. Our preconditioners achieve a solver acceleration by a factor of up to 1.5, 8 and 85 for three different test problems. The GPU versions of the preconditioned solver are by a factor of up to 4 faster than an OpenMP parallel version on eight cores.</p></blockquote>
<p>(Vincent Heuveline, Dimitar Lukarski and Jan-Philipp Weiss: &#8220;Enhanced Parallel ILU(p)-based Preconditioners for Multi-core CPUs and GPUs &#8212; The Power(q)-pattern Method&#8221;, EMCL Preprint Series, number 08, July 2011 [<a href="http://www.emcl.kit.edu/preprints/emcl-preprint-2011-08.pdf" target="_blank">PDF</a>])</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2011/07/08/parallel-ilup-based-preconditioners/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Parallel Solution of Sparse Triangular Linear Systems</title>
		<link>http://gpgpu.org/2011/06/26/parallel-solution-of-sparse-triangular-linear-systems</link>
		<comments>http://gpgpu.org/2011/06/26/parallel-solution-of-sparse-triangular-linear-systems#comments</comments>
		<pubDate>Sun, 26 Jun 2011 23:22:32 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Numerical Algorithms]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Sparse Linear Systems]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=3680</guid>
		<description><![CDATA[Abstract: A novel algorithm for solving in parallel a sparse triangular linear system on a graphical processing unit is proposed. It implements the solution of the triangular system in two phases. First, the analysis phase builds a dependency graph based on the matrix sparsity pattern and groups the independent rows into levels. Second, the solve [...]]]></description>
			<content:encoded><![CDATA[<p>Abstract:</p>
<blockquote><p>A novel algorithm for solving in parallel a sparse triangular linear system on a graphical processing unit is proposed. It implements the solution of the triangular system in two phases. First, the analysis phase builds a dependency graph based on the matrix sparsity pattern and groups the independent rows into levels. Second, the solve phase obtains the full solution by iterating sequentially across the constructed levels. The solution elements corresponding to each single level are obtained at once in parallel. The numerical experiments are also presented and it is shown that the incomplete-LU and Cholesky preconditioned iterative methods, using the parallel sparse triangular solve algorithm, can achieve on average more than 2x speedup on graphical processing units (GPUs) over their CPU implementation.</p></blockquote>
<p>(Maxim Naumov: <em>&#8220;Parallel Solution of Sparse Triangular Linear Systems in the Preconditioned Iterative Methods on the GPU&#8221;</em>, NVIDIA Technical Report, June 2011. [<a href="http://research.nvidia.com/publication/parallel-solution-sparse-triangular-linear-systems-preconditioned-iterative-methods-gpu" target="_blank">WWW</a>])</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2011/06/26/parallel-solution-of-sparse-triangular-linear-systems/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CUDA Libraries Performance Report Now Available</title>
		<link>http://gpgpu.org/2011/02/01/cuda-libraries-performance-report</link>
		<comments>http://gpgpu.org/2011/02/01/cuda-libraries-performance-report#comments</comments>
		<pubDate>Wed, 02 Feb 2011 00:38:13 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[FFT]]></category>
		<category><![CDATA[Linear Algebra]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Random Number Generation]]></category>
		<category><![CDATA[Sparse Linear Systems]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=3193</guid>
		<description><![CDATA[This new report covers all the performance improvements in the latest CUDA Toolkit 3.2 release, and compares CUDA parallel math library performance vs. commonly used CPU libraries. Learn about the performance advantages of using the CUDA parallel math libraries for FFT, BLAS, sparse matrix operations, and random number generation.]]></description>
			<content:encoded><![CDATA[<p><a href="http://bit.ly/ehR5az" target="_blank">This new report</a> covers all the performance improvements in the latest CUDA Toolkit 3.2 release, and compares CUDA parallel math library performance vs. commonly used CPU libraries.</p>
<p>Learn about the performance advantages of using the CUDA parallel math libraries for FFT, BLAS, sparse matrix operations, and random number generation.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2011/02/01/cuda-libraries-performance-report/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>OpenNL 3.2.1 released</title>
		<link>http://gpgpu.org/2010/09/07/opennl-3-2-1</link>
		<comments>http://gpgpu.org/2010/09/07/opennl-3-2-1#comments</comments>
		<pubDate>Tue, 07 Sep 2010 23:01:06 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Libraries]]></category>
		<category><![CDATA[Numerical Algorithms]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Sparse Linear Systems]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=2731</guid>
		<description><![CDATA[OpenNL (Open Numerical Library) is a library for solving sparse linear systems on CPUs and GPUs. Features include various preconditioned Krylov subspace solvers for several data structures. The library is explicitly designed for easy interfacing with existing codes and their storage schemes. Highlights of version 3.2.1 include: Support for double precision on the GPU Support [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://alice.loria.fr/index.php/software/4-library/23-opennl.html" target="_blank">OpenNL (Open Numerical Library)</a> is a library for solving sparse linear systems on CPUs and GPUs. Features include various preconditioned Krylov subspace solvers for several data structures. The library is explicitly designed for easy interfacing with existing codes and their storage schemes.</p>
<p>Highlights of version 3.2.1 include:</p>
<ul>
<li>Support for double precision on the GPU</li>
<li>Support for the Fermi architecture</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2010/09/07/opennl-3-2-1/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SMVM on GPU</title>
		<link>http://gpgpu.org/2010/07/29/smvm-on-gpu</link>
		<comments>http://gpgpu.org/2010/07/29/smvm-on-gpu#comments</comments>
		<pubDate>Fri, 30 Jul 2010 01:07:42 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Electromagnetics]]></category>
		<category><![CDATA[Linear Algebra]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Sparse Linear Systems]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=2617</guid>
		<description><![CDATA[From the paper&#8217;s abstract: A wide class of finite element electromagnetic applications requires computing very large sparse matrix vector multiplications (SMVM). Due to the sparsity pattern and size of the matrices, solvers can run relatively slowly. The rapid evolution of graphic processing units (GPUs) in performance, architecture and programmability make them very attractive platforms for [...]]]></description>
			<content:encoded><![CDATA[<p>From the paper&#8217;s abstract:</p>
<blockquote><p>A wide class of finite element electromagnetic applications requires computing very large sparse matrix vector multiplications (SMVM). Due to the sparsity pattern and size of the matrices, solvers can run relatively slowly. The rapid evolution of graphic processing units (GPUs) in performance, architecture and programmability make them very attractive platforms for accelerating computationally intensive kernels such as SMVM. This work presents a new algorithm to accelerate the performance of the SMVM kernel on graphic processing units.</p></blockquote>
<p>From the paper&#8217;s conclusion:</p>
<blockquote><p>We have introduced several efficient techniques to accelerate the execution of the sparse matrix vector multiplication (SMVM) on NVIDIA graphic processing units. The proposed methods increased the performance of the SMVM kernel on GT 8800 up to 18.8 times compared to the quad core CPU and 3 times compared to previous work by Bell and Garland on accelerating SMVM for GPUs.</p></blockquote>
<p>(M. Mehri Dehnavi, D. Fernandez and D. Giannacopoulos: <em>“Finite element sparse matrix vector multiplication on GPUs”</em>. IEEE Transactions on Magnetics, vol. 46, no. 8, pp. 2982-2985, August 2010. DOI <a href="http://dx.doi.org/10.1109/TMAG.2010.2043511 " target="_blank">10.1109/TMAG.2010.2043511</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2010/07/29/smvm-on-gpu/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SpeedIT Toolkit 0.9.1 released</title>
		<link>http://gpgpu.org/2010/03/26/speedit-toolkit-0-9-1-released</link>
		<comments>http://gpgpu.org/2010/03/26/speedit-toolkit-0-9-1-released#comments</comments>
		<pubDate>Fri, 26 Mar 2010 19:10:56 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Linear Algebra]]></category>
		<category><![CDATA[Numerics]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Sparse Linear Systems]]></category>
		<category><![CDATA[Tools]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=2220</guid>
		<description><![CDATA[The SpeedIT Tools library provides a set of accelerated solvers for sparse linear systems of equations. Manifold acceleration, e.g. more than an order of magnitude, is achieved with a single reasonably priced NVIDIA Graphics Processing Unit (GPU) that supports CUDA and proprietary advanced optimization techniques. The library can be used in a wide spectrum of [...]]]></description>
			<content:encoded><![CDATA[<p>The <a href="http://vratis.com/speedITblog/" target="_blank">SpeedIT Tools library</a> provides a set of accelerated solvers for sparse linear systems of equations. Manifold acceleration, e.g. more than an order of magnitude, is achieved with a single reasonably priced NVIDIA Graphics Processing Unit (GPU) that supports CUDA and proprietary advanced optimization techniques. The library can be used in a wide spectrum of domains arising from problems with underlying 2D and 3D geometry, such as computational fluid dynamics, electro-magnetics, thermodynamics, materials, acoustics, computer vision and graphics, robotics, semiconductor devices and structural engineering. The library can be also used for  problems without defined geometry such as quantum chemistry, statistics, power networks and other graphs and chemical process simulation. All computations are performed with single or double floating point precision. Two version of <a href="http://vratis.com/speedITblog/" target="_blank">SpeedIT toolkit</a> have been released: The classic version provides a conjugate gradient solver, and the extreme edition provides optimized CG, BiCGSTAB, diagonal preconditioner, memory management, and heuristic-based analysis of input matrices.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2010/03/26/speedit-toolkit-0-9-1-released/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>OpenNL 3.0: CUDA sparse linear solvers</title>
		<link>http://gpgpu.org/2010/02/14/opennl-cuda-sparse-linear-solvers</link>
		<comments>http://gpgpu.org/2010/02/14/opennl-cuda-sparse-linear-solvers#comments</comments>
		<pubDate>Sun, 14 Feb 2010 10:06:38 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Libraries]]></category>
		<category><![CDATA[Numerical Algorithms]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Sparse Linear Systems]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=2149</guid>
		<description><![CDATA[OpenNL (Open Numerical Library) is a library for solving sparse linear systems, especially designed for the Computer Graphics community. The goal of OpenNL is to be as small as possible, while offering the subset of functionalities required by this application field. The Makefiles of OpenNL can generate a single .c and .h file that make it very [...]]]></description>
			<content:encoded><![CDATA[<p><em><a href="http://alice.loria.fr/index.php/software/4-library/23-opennl.html" target="_blank">OpenNL</a></em><a href="http://alice.loria.fr/index.php/software/4-library/23-opennl.html" target="_blank"> (Open Numerical Library)</a> is a library for solving sparse linear systems, especially designed for the Computer Graphics community. The goal of <em>OpenNL</em> is to be as small as possible, while offering the subset of functionalities required by this application field. The Makefiles of <em>OpenNL</em> can generate a single .c and .h file that make it very easy to integrate into other projects. The distribution includes an implementation of a Least Squares Conformal Maps parameterization method.  The new version 3.0 of OpenNL includes support for CUDA (with <a href="http://gpgpu.org/2008/10/16/concurrent-number-cruncher-a-gpu-implementation-of-a-general-sparse-linear-solver" target="_blank">Concurrent Number Cruncher</a> and <a href="http://code.google.com/p/cusp-library/" target="_blank">CUSP</a> <a href="http://www.cs.purdue.edu/ellpack/" target="_blank">ELL</a> formats).</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2010/02/14/opennl-cuda-sparse-linear-solvers/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

