<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>GPGPU &#187; Tag: Static Program Analysis :: GPGPU.org</title>
	<atom:link href="http://gpgpu.org/tag/static-program-analysis/feed" rel="self" type="application/rss+xml" />
	<link>http://gpgpu.org</link>
	<description>General-Purpose Computation on Graphics Hardware</description>
	<lastBuildDate>Tue, 22 May 2012 08:44:05 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>FCUDA: Enabling efficient compilation of CUDA kernels onto FPGAs</title>
		<link>http://gpgpu.org/2010/02/02/fcuda-sasp</link>
		<comments>http://gpgpu.org/2010/02/02/fcuda-sasp#comments</comments>
		<pubDate>Wed, 03 Feb 2010 00:31:15 +0000</pubDate>
		<dc:creator>dom</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[FPGAs]]></category>
		<category><![CDATA[NVIDIA CUDA]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Parallel Programming]]></category>
		<category><![CDATA[Static Program Analysis]]></category>

		<guid isPermaLink="false">http://gpgpu.org/?p=2105</guid>
		<description><![CDATA[Abstract: As growing power dissipation and thermal effects disrupted the rising clock frequency trend and threatened to annul Moore&#8217;s law, the computing industry has switched its route to higher performance through parallel processing. The rise of multi-core systems in all domains of computing has opened the door to heterogeneous multi-processors, where processors of different compute [...]]]></description>
			<content:encoded><![CDATA[<p>Abstract:</p>
<blockquote><p>As growing power dissipation and thermal effects disrupted the rising clock frequency trend and threatened to annul Moore&#8217;s law, the computing industry has switched its route to higher performance through parallel processing. The rise of multi-core systems in all domains of computing has opened the door to heterogeneous multi-processors, where processors of different compute characteristics can be combined to effectively boost the performance per watt of different application kernels. GPUs and FPGAs are becoming very popular in PC-based heterogeneous systems for speeding up compute intensive kernels of scientific, imaging and simulation applications. GPUs can execute hundreds of concurrent threads, while FPGAs provide customized concurrency for highly parallel kernels. However, exploiting the parallelism available in these applications is currently not a push-button task. Often the programmer has to expose the application&#8217;s fine and coarse grained parallelism by using special APIs. CUDA is such a parallel-computing API that is driven by the GPU industry and is gaining significant popularity. In this work, we adapt the CUDA programming model into a new FPGA design flow called FCUDA, which efficiently maps the coarse and fine grained parallelism exposed in CUDA onto the reconfigurable fabric. Our CUDA-to-FPGA flow employs AutoPilot, an advanced high-level synthesis tool which enables high-abstraction FPGA programming. FCUDA is based on a source-to-source compilation that transforms the SPMD CUDA thread blocks into parallel C code for AutoPilot. We describe the details of our CUDA-to-FPGA flow and demonstrate the highly competitive performance of the resulting customized FPGA multi-core accelerators. To the best of our knowledge, this is the first CUDA-to-FPGA flow to demonstrate the applicability and potential advantage of using the CUDA programming model for high-performance computing in FPGAs.</p></blockquote>
<p>(Alexandros Papakonstantinou, Karthik Gururaj, John A. Stratton, Deming Chen, Jason Cong and Wen-Mei W. Hwu, <em>FCUDA: Enabling efficient compilation of CUDA kernels onto FPGAs</em>, Proceedings of the 7th Symposium on Application Specific Processors, pp.35-42, July 2009. DOI: <a href="http://dx.doi.org/10.1109/SASP.2009.5226333" target="_blank">10.1109/SASP.2009.5226333</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2010/02/02/fcuda-sasp/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A Fast Implementation of the Octagon Abstract Domain on Graphics Hardware</title>
		<link>http://gpgpu.org/2007/07/14/a-fast-implementation-of-the-octagon-abstract-domain-on-graphics-hardware</link>
		<comments>http://gpgpu.org/2007/07/14/a-fast-implementation-of-the-octagon-abstract-domain-on-graphics-hardware#comments</comments>
		<pubDate>Sat, 14 Jul 2007 21:03:00 +0000</pubDate>
		<dc:creator>Mark Harris</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Abstract Interpretation]]></category>
		<category><![CDATA[Mathematics]]></category>
		<category><![CDATA[Papers]]></category>
		<category><![CDATA[Static Program Analysis]]></category>

		<guid isPermaLink="false">http://www.gpgpu.org/cgi-bin/blosxom.cgi/ScientificComputing/Mathematics/banterleOAD07.html</guid>
		<description><![CDATA[This paper by Banterle and Giacobazzi at UniversitÃ  degli Studi di Verona presents an efficient implementation of the Octagon Abstract Domain (OAD) on graphics hardware. OAD is a relational numerical abstract domain which approximates invariants as conjunctions of constraints of the form +/- x +/- y &#60;= c, where x and y are program variables [...]]]></description>
			<content:encoded><![CDATA[<p>This paper by <a href="http://profs.sci.univr.it/%7Egiaco/abstracts/sas07.abstract.html">Banterle and Giacobazzi</a> at UniversitÃ  degli Studi di Verona presents an efficient  implementation of the Octagon Abstract Domain (OAD) on graphics hardware.  OAD is a relational numerical abstract domain which approximates invariants as conjunctions of constraints of the form +/- x +/- y &lt;= c, where x and y are program variables and c is a constant which can be an integer, rational or real. OAD has been used with success in the aerospace industry for analyzing C programs such as the flight control software for the Airbus A340 fly-by-wire system. (<a href="http://profs.sci.univr.it/%7Egiaco/abstracts/sas07.abstract.html"> A Fast Implementation of the Octagon Abstract Domain on Graphics Hardware</a>. Francesco Banterle and Roberto Giacobazzi. <em>Proceeding of  The 14th  International Static Analysis Symposium (SAS). 2007</em>)</p>
]]></content:encoded>
			<wfw:commentRss>http://gpgpu.org/2007/07/14/a-fast-implementation-of-the-octagon-abstract-domain-on-graphics-hardware/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

