<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>GPGPU &#187; Tag: Dissertations :: GPGPU.org</title>
	<atom:link href="http://gpgpu.org/tag/dissertations/feed" rel="self" type="application/rss+xml" />
	<link>http://gpgpu.org</link>
	<description>General-Purpose Computation on Graphics Hardware</description>
	<lastBuildDate>Tue, 22 May 2012 08:44:05 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Parallel Sparse Linear Algebra for Multi-core and Many-core Platforms &#8212; Parallel Solvers and Preconditioners</title>
		<link>http://gpgpu.org/2012/03/02/lukarski-phd</link>
		<comments>http://gpgpu.org/2012/03/02/lukarski-phd#comments</comments>
		<pubDate>Fri, 02 Mar 2012 06:52:27 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Dissertations]]></category>
		<category><![CDATA[Iterative Solvers]]></category>
		<category><![CDATA[Numerical Algorithms]]></category>
		<category><![CDATA[Sparse Linear Systems]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=4545</guid>
		<description><![CDATA[Abstract: Partial differential equations are typically solved by means of finite difference, finite volume or finite element methods resulting in large, highly coupled, ill-conditioned and sparse (non-)linear systems. In order to minimize the computing time we want to exploit the capabilities of modern parallel architectures. The rapid hardware shifts from single core to multi-core and [...]]]></description>
			<content:encoded><![CDATA[<p>Abstract:</p>
<blockquote><p>Partial differential equations are typically solved by means of finite difference, finite volume or finite element methods resulting in large, highly coupled, ill-conditioned and sparse (non-)linear systems. In order to minimize the computing time we want to exploit the capabilities of modern parallel architectures. The rapid hardware shifts from single core to multi-core and many-core processors lead to a gap in the progression of algorithms and programming environments for these platforms &#8212; the parallel models for large clusters do not fully utilize the performance capability of the multi-core CPUs and especially of the GPUs. Software stack needs to run adequately on the next generation of computing devices in order to exploit the potential of these new systems. Moving numerical software from one platform to another becomes an important task since every parallel device has its own programming model and language. The greatest challenge is to provide new techniques for solving (non-)linear systems that combine scalability, portability, fine-grained parallelism and flexibility across the assortment of parallel platforms and programming models. The goal of this thesis is to provide new fine-grained parallel algorithms embedded in advanced sparse linear algebra solvers and preconditioners on the emerging multi-core and many-core technologies.</p>
<p><span id="more-4545"></span>&nbsp;</p>
<p>With respect to the mathematical methods, we focus on efficient iterative linear solvers. Here, we consider two types of solvers &#8212; out-of-the-box solvers such as preconditioned Krylov subspace solvers (e.g. CG, BiCGStab, GMRES), and problem-aware solvers such as geometric matrix-based multi-grid methods. Clearly, the majority of the solvers can be written in terms of sparse matrix-vector and vector-vector operations which can be performed in parallel. Our aim is to provide parallel, generic and portable preconditioners which are suitable for multi-core and many-core devices. We focus on additive (e.g.~Gauss-Seidel, SOR), multiplicative (ILU factorization with or without fill-ins) and approximate inverse preconditioners. The preconditioners can also be used as smoothing schemes in the multi-grid methods via a preconditioned defect correction step. We treat the additive splitting schemes by a multi-coloring technique to provide the necessary level of parallelism. For controlling the fill-in entries for the ILU factorization we propose a novel method which we call the power(q)-pattern method. We prove that this algorithm produces a new matrix structure with diagonal blocks containing only diagonal entries. This approach provides higher degrees of parallelism in comparison with the level-scheduling/topological sort algorithm. With these techniques we can perform the forward and backward substitution of the preconditioning step in parallel. By formulating the algorithm in block-matrix form we can execute the sweeps in parallel only by performing matrix-vector multiplications. Thus, we can express the data-parallelism in the sweeps without any specification of the underlying hardware or programming models.</p>
<p>In object-oriented languages, an abstraction separates the object behavior from its implementation. Based on this abstraction, we have developed a linear algebra toolbox which supports several platforms such as multi-core CPUs, GPUs and accelerators. The various backends (sequential, OpenMP, CUDA, OpenCL) consist of optimized and platform-specific matrix and vector routines. Using unified interfaces across all platforms, the library allows users to build linear solvers and preconditioners without any information about the underlying hardware. With this technique, we can write our solvers and preconditioners in a single source code for all platforms. Furthermore, we can extend the library by adding new platforms without modifying the existing solvers and preconditioners.</p>
<p>In our tests we consider two scenarios – preconditioned Krylov subspace methods and matrix-based multi-grid methods. We demonstrate speed ups in two directions: first, the preconditioners/smoothers reduce the total solution time by decreasing the number of iterations, and second, the preconditioning/smoothing phase is efficiently executed in parallel providing good scalability across several parallel architectures. We present numerical experiments and performance analysis on several platforms such as multi-core CPU and GPU devices. Furthermore, we show the viability and benefit of the proposed preconditioning schemes and software approach.</p>
</blockquote>
<p>(Dimitar Lukarski: <em>&#8220;Parallel Sparse Linear Algebra for Multi-core and Many-core Platforms : Parallel Solvers and Preconditioners&#8221;</em>, PhD thesis, Fakultät für Mathematik, KIT Karlsruhe, Germany, 2012 [<a href="http://digbib.ubka.uni-karlsruhe.de/volltexte/1000026568" target="_blank">WWW</a>])</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2012/03/02/lukarski-phd/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>PhD Thesis: Scientific Computing on Heterogeneous Architectures</title>
		<link>http://gpgpu.org/2010/12/22/phd-thesis-brodtkorb</link>
		<comments>http://gpgpu.org/2010/12/22/phd-thesis-brodtkorb#comments</comments>
		<pubDate>Wed, 22 Dec 2010 11:16:45 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Dissertations]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=3110</guid>
		<description><![CDATA[Abstract: The CPU has traditionally been the computational work horse in scientific computing, but we have seen a tremendous increase in the use of accelerators, such as Graphics Processing Units (GPUs), in the last decade. These architectures are used because they consume less power and offer higher performance than equivalent CPU solutions. They are typically [...]]]></description>
			<content:encoded><![CDATA[<p>Abstract:<strong> </strong></p>
<blockquote><p>The CPU has traditionally been the computational work horse in scientific computing, but we have seen a tremendous increase in the use of accelerators, such as Graphics Processing Units (GPUs), in the last decade. These architectures are used because they consume less power and offer higher performance than equivalent CPU solutions. They are typically also far less expensive, as more CPUs, and even clusters, are required to match their performance. Even though these accelerators are powerful in terms of floating point operations per second, they are considerably more primitive in terms of capabilities. For example, they cannot even open a file on disk without the use of the CPU. Thus, most applications can benefit from using accelerators to perform heavy computation, whilst running complex tasks on the CPU. This use of different compute resources is often referred to as heterogeneous computing, and we explore the use of heterogeneous architectures for scientific computing in this thesis. Through six papers, we present qualitative and quantitative comparisons of different heterogeneous architectures, the use of GPUs to accelerate linear algebra operations in MATLAB, and efficient shallow water simulation on GPUs. Our results show that the use of heterogeneous architectures can give large performance gains.</p></blockquote>
<p>(André R. Brodtkorb, <em>&#8220;Scientific Computing on Heterogeneous Architectures&#8221;</em>, Ph.D. thesis, University of Oslo, Faculty of Mathematics and Natural Sciences, 2010, (<a href="http://babrodtk.at.ifi.uio.no/files/publications/brodtkorb_phd_thesis.pdf" target="_blank">PDF</a>))</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2010/12/22/phd-thesis-brodtkorb/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>PhD Thesis: Fast and Accurate Finite-Element Multigrid Solvers for PDE Simulations on GPU Clusters</title>
		<link>http://gpgpu.org/2010/05/30/phd-thesis-goeddeke</link>
		<comments>http://gpgpu.org/2010/05/30/phd-thesis-goeddeke#comments</comments>
		<pubDate>Sun, 30 May 2010 21:46:25 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Dissertations]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=2337</guid>
		<description><![CDATA[Abstract: The main contribution of this thesis is to demonstrate that graphics processors (GPUs) as representatives of emerging many-core architectures are very well-suited for the fast and accurate solution of large sparse linear systems of equations, using parallel multigrid methods on heterogeneous compute clusters. Such systems arise for instance in the discretisation of (elliptic) partial [...]]]></description>
			<content:encoded><![CDATA[<p>Abstract:</p>
<blockquote><p>The main contribution of this thesis is to demonstrate that graphics processors (GPUs) as representatives of emerging many-core architectures are very well-suited for the fast and accurate solution of large sparse linear systems of equations, using parallel multigrid methods on heterogeneous compute clusters. Such systems arise for instance in the discretisation of (elliptic) partial differential equations with finite elements. We report on at least one order of magnitude speedup over highly-tuned conventional CPU implementations, without sacrificing either accuracy m<br />
or functionality. In more detail, this thesis includes the following contributions:<br />
<span id="more-2337"></span></p>
<p>Single precision floating point computations may be insufficient for the class of problems considered in this thesis. We revisit mixed precision iterative refinement techniques to not only increase the accuracy of computed results, but also to increase the efficiency of the solution process. Both on CPUs and on GPUs, we demonstrate a significant performance improvement without loss of accuracy compared to computing in high precision only.</p>
<p>We present efficient parallelisation techniques for multigrid solvers on graphics hardware, in particular for numerically strong smoothers and preconditioners that are suitable for highly anisotropic grids and operators. For instance, an efficient formulation of the cyclic reduction algorithm to solve tridiagonal systems is developed. In view of hardware-oriented numerics, we carefully analyse the trade-off between numerical and runtime performance for inexact parallelisation techniques that decouple some of the inherently sequential characteristics of strong smoothing operators.</p>
<p>For large-scale established software frameworks, the re-implementation tailored to novel hardware platforms is often prohibitively expensive. We develop a &#8216;minimally invasive&#8217; approach to integrate support for co-processor hardware like GPUs into FEAST, a finite element discretisation and solver toolbox. Our technique has the major advantage that applications built on top of the toolbox do not have to be changed at all to benefit from co-processor acceleration. The approach is evaluated for benchmark problems in linearised elasticity and stationary laminar flow computed on large-scale GPU-enhanced clusters. Good speedup factors and near-ideal weak scalability are observed. The achievable speedup is analysed and a theoretical speedup model is presented.</p>
<p>Finally, we provide a historical overview of scientific computing on graphics hardware since the early beginnings in 2001/2002, when GPGPU was an obscure research topic pursued by few, to the widespread adoption nowadays. We discuss the evolution of the hardware and the programming model, and provide a comprehensive bibliography of publications related to PDE simulations on GPUs.</p></blockquote>
<p>(<a href="http://www.mathematik.tu-dortmund.de/~goeddeke/" target="_blank">Dominik Göddeke</a>: <em>&#8220;Fast and Accurate Finite-Element Multigrid Solvers for PDE Simulations on GPU Clusters&#8221;</em>. PhD thesis, Technische Universität Dortmund, Fakultät für Mathematik, May 2010. Permalink: <a href="http://hdl.handle.net/2003/27243" target="_blank">http://hdl.handle.net/2003/27243</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2010/05/30/phd-thesis-goeddeke/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Speeding Up Molecular Docking Calculations Using Consumer Graphics Hardware</title>
		<link>http://gpgpu.org/2008/11/18/speeding-up-molecular-docking-calculations-using-consumer-graphics-hardware</link>
		<comments>http://gpgpu.org/2008/11/18/speeding-up-molecular-docking-calculations-using-consumer-graphics-hardware#comments</comments>
		<pubDate>Tue, 18 Nov 2008 09:00:32 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Dissertations]]></category>
		<category><![CDATA[Molecular Dynamics]]></category>

		<guid isPermaLink="false">http://www.gpgpu.org/newgpgpu/?p=602</guid>
		<description><![CDATA[The computer-aided prediction of protein-ligand complex conformations, i.e. docking a small ligand into the active site of a protein, is an important application in the early stages of the modern drug discovery process. For this problem a new approach called PLANTS (Protein-Ligand ANT System) is presented which is based on Ant Colony Optimization (ACO). Part [...]]]></description>
			<content:encoded><![CDATA[<p>The computer-aided prediction of protein-ligand complex conformations, i.e. docking a small ligand into the active site of a protein, is an important application in the early stages of the modern drug discovery process. For this problem a new approach called <a title="Link to PLANTS website" href="http://www.tcd.uni-konstanz.de/research/plants.php" target="_blank">PLANTS</a> (Protein-Ligand ANT System) is presented which is based on Ant Colony Optimization (ACO). Part of the work deals with the acceleration of this approach by moving the most time-consuming steps, the transformation of the protein and ligand structure and the evaluation of the objective function, to the GPU. The combined CPU-GPU approach is able to reach a speedup of 5 on average when comparing an optimized CPU-version (single core of a dual-core Pentium 4, 3 GHz) with the GPU-accelerated version (Nvidia Geforce 8800 GTX). Especially virtual screening applications, where the complex conformations of thousands to millions of ligands need to be predicted, can benefit from this speedup.</p>
<p>(<a title="Link to Oliver Korb's thesis" href="http://www.ub.uni-konstanz.de/kops/volltexte/2008/6483/" target="_blank">Efficient Ant Colony Optimization Algorithms for Structure- and Ligand-Based Drug Design</a>.  Oliver Korb, <em>PhD thesis, University of Konstanz, 2008</em>)</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2008/11/18/speeding-up-molecular-docking-calculations-using-consumer-graphics-hardware/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Case studies on GPU usage and data structure design</title>
		<link>http://gpgpu.org/2008/08/11/case-studies-on-gpu-usage-and-data-structure-design</link>
		<comments>http://gpgpu.org/2008/08/11/case-studies-on-gpu-usage-and-data-structure-design#comments</comments>
		<pubDate>Mon, 11 Aug 2008 11:27:04 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Data Structures]]></category>
		<category><![CDATA[Dissertations]]></category>

		<guid isPermaLink="false">http://gpgpu.site/?p=476</guid>
		<description><![CDATA[Abstract Big improvements in the performance of graphics processing units (GPUs) turned them into a compelling platform for high performance computing. In this thesis, we discuss the usage of NVIDIA&#8217;s CUDA in two applications &#8212; Einstein@Home, a distributed computing software, and OpenSteer, a game-like application. Our work on Einstein@Home demonstrates that CUDA can be integrated [...]]]></description>
			<content:encoded><![CDATA[<p>Abstract</p>
<blockquote><p>Big improvements in the performance of graphics processing units (GPUs) turned them into a compelling platform for high performance computing. In this thesis, we discuss the usage of NVIDIA&#8217;s CUDA in two applications &#8212; Einstein@Home, a distributed computing software, and OpenSteer, a game-like application. Our work on Einstein@Home demonstrates that CUDA can be integrated into existing applications with minimal changes, even in programs designed without considering GPU usage. However the existing data structure of Einstein@Home performs poorly when used on the GPU. We demonstrate that using a redesigned data structure improves the performance to about three times as fast as the original CPU version, even though the code executed on the device is not optimized. We further discuss the design of a novel spatial data structure called &#8220;dynamic grid&#8221; that is optimized for CUDA usage. We measure its performance by integrating it into the Boids scenario of OpenSteer. Our new concept outperforms a uniform grid by a margin of up to 15%, even though the dynamic grid still provides optimization potential.</p></blockquote>
<p style="text-align: left;">(<a title="Link to dissertation" href="http://www.plm.eecs.uni-kassel.de/plm/fileadmin/pm/publications/chuerkamp/Masterarbeiten/Jens_Breitbart_thesis.pdf" target="_blank">Case studies on gpu usage and data structure design</a>. J. Breitbart, Master&#8217;s thesis, Universität Kassel, 2008)</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2008/08/11/case-studies-on-gpu-usage-and-data-structure-design/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>GPGPU Based Image Segmentation Livewire Algorithm Implementation</title>
		<link>http://gpgpu.org/2008/04/01/gpgpu-based-image-segmentation-livewire-algorithm-implementation</link>
		<comments>http://gpgpu.org/2008/04/01/gpgpu-based-image-segmentation-livewire-algorithm-implementation#comments</comments>
		<pubDate>Tue, 01 Apr 2008 08:32:41 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Dissertations]]></category>
		<category><![CDATA[Image Processing]]></category>
		<category><![CDATA[Open Source]]></category>

		<guid isPermaLink="false">http://gpgpu.site/?p=411</guid>
		<description><![CDATA[This thesis presents a GPU implementation of the Livewire algorithm. The algorithm is divided in three phases: Sobel or Laplacian filter convolution, image modeling as a grid graph and solving the non-negative weighted edges single-source shortest path problem. In order to calculate the shortest path, an adapted version of the delta-stepping algorithm was developed for [...]]]></description>
			<content:encoded><![CDATA[<p>This thesis presents a GPU implementation of the Livewire algorithm. The algorithm is divided in three phases: Sobel or Laplacian filter convolution, image modeling as a grid graph and solving the non-negative weighted edges single-source shortest path problem. In order to calculate the shortest path, an adapted version of the delta-stepping algorithm was developed for GPUs, using CUDA. A critical result analysis shows that intense speedups are seen in image filtering algorithms. On the other hand, the wide use of dependent device memory look-ups has constrained delta-stepping algorithm from achieving higher performance than CPU implementation although a better performance is expected for wider graphs. Besides showing the viability of the Livewire algorithm implementation, this thesis makes available an open-source image segmentation GPU based application, which can be used as example for future GPU algorithm implementations at <a title="Gpuwire code" href="http://code.google.com/p/gpuwire/" target="_blank">http://code.google.com/p/gpuwire/</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2008/04/01/gpgpu-based-image-segmentation-livewire-algorithm-implementation/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>GPU++: An Embedded GPU Development System for General-Purpose Computations</title>
		<link>http://gpgpu.org/2008/01/14/gpu-an-embedded-gpu-development-system-for-general-purpose-computations</link>
		<comments>http://gpgpu.org/2008/01/14/gpu-an-embedded-gpu-development-system-for-general-purpose-computations#comments</comments>
		<pubDate>Sun, 13 Jan 2008 23:48:10 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[APIs]]></category>
		<category><![CDATA[Dissertations]]></category>
		<category><![CDATA[Programming Languages]]></category>

		<guid isPermaLink="false">http://gpgpu.site/?p=306</guid>
		<description><![CDATA[This Ph.D. thesis by Jansen describes a GPGPU development system that is embedded in the C++ programming language using ad-hoc polymorphism (i.e. operator overloading). While this technique is already known from the Sh library and the RapidMind Development Platform, GPU++ uses a more generic class interface and requires no knowledge of GPU programming at all. [...]]]></description>
			<content:encoded><![CDATA[<p>This Ph.D. thesis by Jansen describes a GPGPU development system that is embedded in the C++ programming language using ad-hoc polymorphism (i.e. operator overloading). While this technique is already known from the Sh library and the RapidMind Development Platform, GPU++ uses a more generic class interface and requires no knowledge of GPU programming at all. Furthermore, there is no separation between the different computation units of the CPU and GPU &#8211; the appropriate computation frequency is automatically chosen by the GPU++ system using several optimization algorithms. (&#8220;<a href="http://blog.beef.de/thesis/" title="GPU++" target="_blank">GPU++: An Embedded GPU Development System for General-Purpose Computations</a>&#8220;. Thomas Jansen. Ph.D. Thesis, University of Munich, Germany.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2008/01/14/gpu-an-embedded-gpu-development-system-for-general-purpose-computations/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ph.D. Dissertation: Glift Generic GPU Data Structures, by Aaron Lefohn</title>
		<link>http://gpgpu.org/2007/01/18/phd-dissertation-glift-generic-gpu-data-structures-by-aaron-lefohn</link>
		<comments>http://gpgpu.org/2007/01/18/phd-dissertation-glift-generic-gpu-data-structures-by-aaron-lefohn#comments</comments>
		<pubDate>Thu, 18 Jan 2007 13:44:00 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Data Structures]]></category>
		<category><![CDATA[Dissertations]]></category>
		<category><![CDATA[Parallel Algorithms]]></category>

		<guid isPermaLink="false">http://www.gpgpu.org/cgi-bin/blosxom.cgi/DataParallelAlgorithms/lefohnThesis06.html</guid>
		<description><![CDATA[This Ph.D. dissertation by Aaron Lefohn at the University of California, Davis describes the Glift GPU data structure abstraction and its application to both GPU-based data-parallel and interactive rendering algorithms. The applications include octree 3D painting, adaptive shadow maps, resolution matched shadow maps, heat-diffusion depth-of-field, and a GPU-based direct solver for tridiagonal linear systems. While [...]]]></description>
			<content:encoded><![CDATA[<p>This <a href="http://graphics.cs.ucdavis.edu/~lefohn/work/dissertation/">Ph.D. dissertation</a> by <a href="http://graphics.cs.ucdavis.edu/~lefohn/" title="Aaron Lefohn" target="_blank">Aaron Lefohn</a> at the<a href="http://graphics.cs.ucdavis.edu/"> University of California, Davis</a> describes the Glift GPU data structure abstraction and its application to both GPU-based data-parallel and interactive rendering algorithms. The applications include octree 3D painting, adaptive shadow maps, resolution matched shadow maps, heat-diffusion depth-of-field, and a GPU-based direct solver for tridiagonal linear systems. While much of this work has been posted previously, this dissertation contains a more in-depth discussion of the Glift data structure library and introduces several GPGPU and rendering algorithms that are not yet published. This dissertation demonstrates that a data structure abstraction for GPUs can simplify the description of new and existing data structures, stimulate development of complex GPU algorithms, and perform equivalently to hand-coded implementations. The dissertation also presents a case that future interactive rendering solutions will be an inseparable mix of general-purpose, data-parallel algorithms and traditional graphics programming. (<a href="http://graphics.cs.ucdavis.edu/~lefohn/">Aaron Lefohn</a>, <a href="http://graphics.cs.ucdavis.edu/~lefohn/work/dissertation/" target="_blank" title="Glift Dissertation">&#8220;Glift: Generic Data Structures for Graphics Hardware&#8221;</a>, Ph.D. dissertation, Computer Science Department, <a href="http://graphics.cs.ucdavis.edu/" target="_blank" title="UC Davis">University of California Davis</a>, September 2006.)</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2007/01/18/phd-dissertation-glift-generic-gpu-data-structures-by-aaron-lefohn/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ph.D. dissertation discusses GPU-accelerated advanced rendering and image processing techniques</title>
		<link>http://gpgpu.org/2006/08/10/phd-dissertation-discusses-gpu-accelerated-advanced-rendering-and-image-processing-techniques</link>
		<comments>http://gpgpu.org/2006/08/10/phd-dissertation-discusses-gpu-accelerated-advanced-rendering-and-image-processing-techniques#comments</comments>
		<pubDate>Thu, 10 Aug 2006 15:04:00 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Augmented Reality]]></category>
		<category><![CDATA[Dissertations]]></category>
		<category><![CDATA[Video Processing]]></category>

		<guid isPermaLink="false">http://www.gpgpu.org/cgi-bin/blosxom.cgi/AdvancedRendering/fischerThesis06.html</guid>
		<description><![CDATA[The Ph.D. dissertation Rendering Methods for Augmented Reality by Jan Fischer describes several GPU-based methods for artistic and illustrative rendering. A real-time video filter is described, which generates a cartoon-like version of the input video and is executed entirely on the GPU (Section 3.3). Section 4.2 of the thesis discusses a GPU-based algorithm for the [...]]]></description>
			<content:encoded><![CDATA[<p>The Ph.D. dissertation <a title="Jan Fischer Dissertation" href="http://www.janfischer.com/pub_pages/pub-fischer06-diss.html" target="_blank">Rendering Methods for Augmented Reality</a> by <a title="Jan Fischer" href="http://www.janfischer.com" target="_blank">Jan Fischer</a> describes several GPU-based methods for artistic and illustrative rendering. A real-time video filter is described, which generates a cartoon-like version of the input video and is executed entirely on the GPU (Section 3.3). Section 4.2 of the thesis discusses a GPU-based algorithm for the real-time illustrative display of hidden structures in polygonal datasets. In Section 4.3, the real-time conversion of augmented reality video streams into an illustrative style on the GPU is described. The thesis discusses the underlying image processing and rendering algorithms as well as implementation-specific aspects of the respective GPU techniques. (Jan Fischer, <a title="Jan Fischer Dissertation" href="http://www.janfischer.com/pub_pages/pub-fischer06-diss.html" target="_blank">Rendering Methods for Augmented Reality</a>, Dissertation, University of Tübingen, June 2006)</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2006/08/10/phd-dissertation-discusses-gpu-accelerated-advanced-rendering-and-image-processing-techniques/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Universal employment of modern graphics hardware by the example of the optimization of a speech recognition system</title>
		<link>http://gpgpu.org/2006/05/24/universal-employment-of-modern-graphics-hardware-by-the-example-of-the-optimization-of-a-speech-recognition-system</link>
		<comments>http://gpgpu.org/2006/05/24/universal-employment-of-modern-graphics-hardware-by-the-example-of-the-optimization-of-a-speech-recognition-system#comments</comments>
		<pubDate>Wed, 24 May 2006 20:12:00 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Developer Resources]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Dissertations]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Speech Recognition]]></category>

		<guid isPermaLink="false">http://www.gpgpu.org/cgi-bin/blosxom.cgi/Tools/fenzlThesis06.html</guid>
		<description><![CDATA[In this masters thesis by Christian Fenzl (accomplished at the University of Applied Sciences in Darmstadt), an easy to use framework is implemented with additional demos to show the main concepts of gpgpu. Furthermore, a demo implementation is included which calculates scores on feature vectors used in a speech recognition system (about 12 times faster [...]]]></description>
			<content:encoded><![CDATA[<p>In this masters thesis by Christian Fenzl (accomplished at the <a href="http://www.fbi.fh-darmstadt.de" title="University of Applied Sciences in Darmstadt" target="_blank">University of Applied Sciences in Darmstadt</a>), an easy to use framework is implemented with additional demos to show the main concepts of gpgpu. Furthermore, a demo implementation is included which calculates scores on feature vectors used in a speech recognition system (about 12 times faster than an equivalent cpu implementation). An <a href="http://www.digital-mastery.de/studium/files/Masterarbeit_Sources_Fenzl" title="Application with several demos" target="_blank">application with several demos</a> using the framework including the fully documented source code (English) and the <a href="http://www.digital-mastery.de/studium/files/Masterarbeit_Fenzl_GPGPU.z" title="German-language paper" target="_blank">paper itself</a> (German) is available. The framework code is recommended especially for gpgpu beginners to look into the OpenGL and DirectX code which shows how gpgpu programs can be developed.</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2006/05/24/universal-employment-of-modern-graphics-hardware-by-the-example-of-the-optimization-of-a-speech-recognition-system/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

