4.5 Article

Performance Comparison of Graphics Processors to Reconfigurable Logic: A Case Study

Journal

IEEE TRANSACTIONS ON COMPUTERS
Volume 59, Issue 4, Pages 433-448

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TC.2009.179

Keywords

Graphics processors; reconfigurable hardware; real-time and embedded systems; signal processing systems; performance measures; video

Funding

  1. Sony Broadcast & Professional Europe
  2. Donal Morphy Scholarship
  3. UK Engineering and Physical Sciences Research Council [EP/C549481/1]

Ask authors/readers for more resources

A systematic approach to the comparison of the graphics processor (GPU) and reconfigurable logic is defined in terms of three throughput drivers. The approach is applied to five case study algorithms, characterized by their arithmetic complexity, memory access requirements, and data dependence, and two target devices: the nVidia GeForce 7900 GTX GPU and a Xilinx Virtex-4 field programmable gate array (FPGA). Two orders of magnitude speedup, over a general-purpose processor, is observed for each device for arithmetic intensive algorithms. An FPGA is superior, over a GPU, for algorithms requiring large numbers of regular memory accesses, while the GPU is superior for algorithms with variable data reuse. In the presence of data dependence, the implementation of a customized data path in an FPGA exceeds GPU performance by up to eight times. The trends of the analysis to newer and future technologies are analyzed.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available