4.7 Article

Low-Complexity Approximate Convolutional Neural Networks

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2018.2815435

Keywords

Approximation; convolutional neural networks (ConvNets); numerical computation; optimization

Funding

  1. CNPq, Brazil

Ask authors/readers for more resources

In this paper, we present an approach for minimizing the computational complexity of the trained convolutional neural networks (ConvNets). The idea is to approximate all elements of a given ConvNet and replace the original convolutional filters and parameters (pooling and bias coefficients; and activation function) with an efficient approximations capable of extreme reductions in computational complexity. Low-complexity convolution filters are obtained through a binary (zero and one) linear programming scheme based on the Frobenius norm over sets of dyadic rationals. The resulting matrices allow for multiplication-free computations requiring only addition and bit-shifting operations. Such low-complexity structures pave the way for low power, efficient hardware designs. We applied our approach on three use cases of different complexities: 1) a light but efficient ConvNet for face detection (with around 1000 parameters); 2) another one for hand-written digit classification (with more than 180000 parameters); and 3) a significantly larger ConvNet: AlexNet with approximate to 1.2 million matrices. We evaluated the overall performance on the respective tasks for different levels of approximations. In all considered applications, very low-complexity approximations have been derived maintaining an almost equal classification performance.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Engineering, Electrical & Electronic

Signal detection and inference based on the beta binomial autoregressive moving average model

Bruna G. Palm, Fabio M. Bayer, Renato J. Cintra

Summary: This paper introduces a new beta binomial autoregressive moving average model (BBARMA) for modeling quantized amplitude data and bounded count data, which estimates the conditional mean through a dynamic structure and provides parameter estimation, detection tools, forecasting scheme, and diagnostic measures. Simulation results show that the proposed model outperforms traditional detectors in sinusoidal signal detection.

DIGITAL SIGNAL PROCESSING (2021)

Article Statistics & Probability

Prediction intervals in the beta autoregressive moving average model

Bruna Gregory Palm, Fabio M. Bayer, Renato J. Cintra

Summary: This article proposes five prediction intervals for the beta autoregressive moving average model and evaluates their performance through Monte Carlo simulations. The bias-corrected and acceleration (BCa) prediction interval shows the best performance among the evaluated intervals.

COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION (2023)

Article Mathematics, Applied

Goodness-of-fit measures based on the Mellin transform for beta generalized lifetime data

Josimar M. Vasconcelos, Renato J. Cintra, Abraao D. C. Nascimento

Summary: In this paper, new qualitative and quantitative GoF tools for model selection within the beta-G class are provided by combining probability weighted moments (PWMs) and the Mellin transform (MT). PWMs for the Frechet and Kumaraswamy distributions are derived, along with expressions for the MT and log-cumulants (LC) of various beta distributions. LC diagrams are constructed and confidence ellipses for the LCs are derived based on the Hotelling's T-2 statistic. The proposed GoF measures are applied on five real data sets to demonstrate their applicability.

MATHEMATICAL METHODS IN THE APPLIED SCIENCES (2021)

Article Computer Science, Interdisciplinary Applications

2-D Rayleigh autoregressive moving average model for SAR image modeling

Bruna G. Palm, Fabio M. Bayer, Renato J. Cintra

Summary: The application of a two-dimensional ARMA model tailored for Rayleigh-distributed data is studied. The model is derived and conditional likelihood inferences are discussed. Monte Carlo simulations and numerical experiments are conducted to evaluate the model's performance and results compared to traditional 2-D ARMA models and competing methods in the literature.

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2022)

Article Biology

Low-complexity three-dimensional discrete Hartley transform approximations for medical image compression

Vitor A. Coutinho, Renato J. Cintra, Fabio M. Bayer

Summary: The study introduces multiplierless DHT approximations for medical image compression, offering significant computational resource savings and suitable for devices with limited resources. These methods achieve the same level of visual quality while reducing computational effort. By implementing the proposed transforms on an ARM Cortex-M0+ processor, the execution time is significantly reduced.

COMPUTERS IN BIOLOGY AND MEDICINE (2021)

Article Geochemistry & Geophysics

Improved Point Estimation for the Rayleigh Regression Model

Bruna G. Palm, Fabio M. Bayer, Renato J. Cintra

Summary: This letter introduces bias-adjusted estimators tailored for the Rayleigh regression model and presents numerical experiments on synthetic and actual SAR data sets. The results show that the bias-adjusted estimators yield nearly unbiased estimates and accurate modeling results.

IEEE GEOSCIENCE AND REMOTE SENSING LETTERS (2022)

Article Geochemistry & Geophysics

Robust Rayleigh Regression Method for SAR Image Processing in Presence of Outliers

Bruna G. Palm, Fabio M. Bayer, Renato Machado, Mats Pettersson, Viet T. Vu, Renato J. Cintra

Summary: This article proposes a Rayleigh regression model based on robust estimation to address outliers in SAR data. Through numerical experiments, the robust estimator is found to outperform traditional estimators in corrupted signals, demonstrating better performance in the presence of anomalies.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2022)

Article Engineering, Electrical & Electronic

Data-independent low-complexity KLT approximations for image and video coding

Anabeth P. Radunz, Thiago L. T. da Silveira, Fabio M. Bayer, Renato J. Cintra

Summary: This work proposes low-computational cost approximations for the KLT that are suitable for image and video compression. Extensive computational experiments on blocklengths of 4, 8, 16, and 32 demonstrate the effectiveness of these low-complexity transforms.

SIGNAL PROCESSING-IMAGE COMMUNICATION (2022)

Article Computer Science, Artificial Intelligence

Low-complexity rounded KLT approximation for image compression

Anabeth P. Radunz, Fabio M. Bayer, Renato J. Cintra

Summary: The paper introduces a new class of low-complexity transforms obtained through applying the round function to KLT matrix elements, which perform well in image compression with low implementation cost.

JOURNAL OF REAL-TIME IMAGE PROCESSING (2022)

Article Engineering, Electrical & Electronic

A Class of Low-Complexity DCT-Like Transforms for Image and Video Coding

Thiago L. T. da Silveira, Diego Ramos Canterle, Diego F. G. Coelho, Vitor A. Coutinho, Fabio M. Bayer, Renato J. Cintra

Summary: The discrete cosine transform (DCT) is a relevant tool in signal processing applications, known for its good decorrelation properties. Recent research has focused on low-complexity approximations of the DCT, which are important for real-time computation and low-power consumption applications. This paper presents a new multiparametric transform class and its associated fast algorithm. By solving an optimization problem, four novel low-complexity DCT approximations are obtained. Experimental results show that these new transforms perform as well as or better than current state-of-the-art DCT approximations.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Article Computer Science, Information Systems

Extensions on Low-Complexity DCT Approximations for Larger Blocklengths Based on Minimal Angle Similarity

Anabeth P. P. Radunz, Luan Portella, R. S. Oliveira, Fabio M. Bayer, Renato J. J. Cintra

Summary: In this paper, low-complexity approximations of the discrete cosine transform (DCT) with blocklengths of 16, 32, and 64 are introduced by minimizing the angle between the rows of the exact DCT matrix and the matrix induced by the approximate transforms. These proposed transforms outperform existing DCT approximations in terms of performance, as judged by classical figures of merit. Fast algorithms are also developed for these low-complexity transforms, striking a good balance between performance and computational cost. Experimental results in image encoding demonstrate the relevance of these transforms, showing superior performance compared to known approximations for blocklengths of 16, 32, and 64.

JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY (2023)

Article Computer Science, Information Systems

Analog Current-Mode 8-Point Approximate-DFT Multi-Beamformer With 4.7 Gbps Channel Capacity

Haixiang Zhao, Arjuna Madanayake, Renato J. Cintra, Soumyajit Mandal

Summary: This paper presents a current-mode on-chip analog architecture for multi-beamforming of IF band signals from small- to moderate-sized antenna arrays. The architecture utilizes an approximate discrete Fourier transform (a-DFT) algorithm to achieve the required linear transformation for multi-beamforming. The proposed integrated multi-beamformer demonstrates lower power consumption and supports high-performance beamforming with a wide dynamic range.

IEEE ACCESS (2023)

Article Engineering, Electrical & Electronic

Radix- N Algorithm for Computing N2n -Point DFT Approximations

Luan Portella, Diego F. G. Coelho, Fabio M. Bayer, Arjuna Madanayake, Renato J. Cintra

Summary: This article presents a variant of transform scaling method to obtain DFT approximations of large blocksize. The proposed method scales up a given transformation successively, leading to different sizes of transformations. The authors fully introduce a 32(4)-point DFT approximation and provide a fast algorithm, arithmetic complexity assessment, and error analysis.

IEEE SIGNAL PROCESSING LETTERS (2022)

Article Engineering, Electrical & Electronic

Low-Complexity Scaling Methodsfor DCT-II Approximations

Diego F. G. Coelho, Renato J. Cintra, Arjuna Madanayake, Sirani M. Perera

Summary: This paper introduces a collection of scaling methods for generating 2N-point DCT-II approximations based on N-point low-complexity transformations. The proposed techniques outperform the transforms resulting from the JAM scaling method in terms of total error energy and mean squared error, as demonstrated by extensive error analysis based on statistical simulation. Additionally, a hardware implementation shows the competitiveness of the proposed methods compared to the JAM scaling method.

IEEE TRANSACTIONS ON SIGNAL PROCESSING (2021)

Article Mathematics, Applied

The compound truncated Poisson Cauchy model: A descriptor for multimodal data

Josimar M. Vasconcelos, Renato J. Cintra, Abraao D. C. Nascimento, Leandro C. Rego

JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS (2020)

No Data Available