Article

A dedicated hardware accelerator for real-time acceleration of YOLOv2

Journal

JOURNAL OF REAL-TIME IMAGE PROCESSING
Volume 18, Issue 3, Pages 481-492

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s11554-020-00977-w

Keywords

Hardware accelerator; FPGA; Convolutional neural network; Object detection; YOLOv2

Funding

  1. NNSF of China [61574013, 61532005]
  2. Beijing Natural Science Foundation [4202063]
  3. National Key R&D Program of China [2019YFB2204200]
  4. BJTU-ZTE Industry University Research Cooperation Project
  5. BJTU-Kuaishou Research Grant


This paper presents a high-throughput, OpenCL-based FPGA accelerator for the YOLOv2 object detection algorithm. The accelerator uses a scalable pipeline design and layer fusion to increase processing speed while improving hardware resource utilization.
In recent years, dedicated hardware accelerators for convolutional neural networks (CNNs) have been extensively studied. Although many studies have presented efficient FPGA designs for image classification models such as AlexNet and VGG, there are still few implementations for CNN-based object detection applications. This paper presents an OpenCL-based high-throughput FPGA accelerator for the YOLOv2 object detection algorithm on an Arria-10 GX1150 FPGA. The proposed hardware architecture adopts a scalable pipeline design to support multi-resolution input images and a full 8-bit fixed-point datapath to improve hardware resource utilization. A layer fusion technique that merges convolution, batch normalization and Leaky-ReLU is also developed to avoid transferring intermediate data between the FPGA and external memory. Experimental results show that the final design achieves a peak throughput of 566 GOP/s at a working frequency of 190 MHz. The accelerator can execute YOLOv2 inference (288x288 resolution) and tiny YOLOv2 inference (416x416 resolution) at 35 and 71 FPS, respectively.
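
The layer fusion mentioned in the abstract can be illustrated with the standard inference-time trick of folding batch normalization into the convolution's weights and bias, so that convolution, batch normalization and Leaky-ReLU become a single pass with no intermediate tensor. The sketch below is only an illustration under stated assumptions: it uses floating point rather than the paper's 8-bit fixed-point datapath, it treats one flattened input patch rather than a full feature map, the Leaky-ReLU slope of 0.1 is assumed, and all function names are hypothetical. It is not the authors' OpenCL implementation.

```python
import math

def leaky_relu(x, slope=0.1):
    # Slope 0.1 is an assumption for this sketch.
    return x if x > 0 else slope * x

def fold_batch_norm(weights, bias, gamma, beta, mean, var, eps=1e-5):
    """Fold per-channel batch-norm parameters into conv weights and bias.

    weights: one flattened kernel (list of floats) per output channel.
    The other arguments hold one value per output channel.
    """
    folded_w, folded_b = [], []
    for w_c, b_c, g, bt, m, v in zip(weights, bias, gamma, beta, mean, var):
        scale = g / math.sqrt(v + eps)
        folded_w.append([w * scale for w in w_c])
        folded_b.append((b_c - m) * scale + bt)
    return folded_w, folded_b

def unfused_layer(patch, weights, bias, gamma, beta, mean, var, eps=1e-5):
    # Three separate steps: convolution, batch normalization, activation.
    out = []
    for w_c, b_c, g, bt, m, v in zip(weights, bias, gamma, beta, mean, var):
        conv = sum(w * x for w, x in zip(w_c, patch)) + b_c
        bn = g * (conv - m) / math.sqrt(v + eps) + bt
        out.append(leaky_relu(bn))
    return out

def fused_layer(patch, folded_w, folded_b):
    # One multiply-accumulate pass followed directly by the activation.
    return [
        leaky_relu(sum(w * x for w, x in zip(w_c, patch)) + b_c)
        for w_c, b_c in zip(folded_w, folded_b)
    ]

# Illustrative values only: one 3-element input patch, two output channels.
patch = [1.0, -2.0, 0.5]
weights = [[0.2, -0.1, 0.4], [-0.3, 0.5, 0.1]]
bias = [0.0, 0.1]
gamma, beta = [1.5, 0.8], [0.1, -0.2]
mean, var = [0.05, -0.1], [0.9, 1.2]

folded_w, folded_b = fold_batch_norm(weights, bias, gamma, beta, mean, var)
reference = unfused_layer(patch, weights, bias, gamma, beta, mean, var)
fused = fused_layer(patch, folded_w, folded_b)
```

Because the folding happens once, offline, the fused layer needs only the adjusted weights and biases at run time. That is what allows a hardware pipeline to apply the activation as results leave the multiply-accumulate stage instead of writing them back to external memory first.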
