4.7 Article

Quantized CNN: A Unified Approach to Accelerate and Compress Convolutional Networks

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2017.2774288

Keywords

Acceleration and compression; convolutional neural network (CNN); mobile devices; product quantization

Funding

  1. National Natural Science Foundation of China [61332016]
  2. Scientific Research Key Program of Beijing Municipal Commission of Education [KZ201610005012]
  3. Fund of Hubei Key Laboratory of Transportation Internet of Things
  4. Fund of Jiangsu Key Laboratory of Big Data Analysis Technology

Ask authors/readers for more resources

We are witnessing an explosive development and widespread application of deep neural networks (DNNs) in various fields. However, DNN models, especially a convolutional neural network (CNN), usually involve massive parameters and are computationally expensive, making them extremely dependent on high-performance hardware. This prohibits their further extensions, e.g., applications on mobile devices. In this paper, we present a quantized CNN, a unified approach to accelerate and compress convolutional networks. Guided by minimizing the approximation error of individual layer's response, both fully connected and convolutional layers are carefully quantized. The inference computation can be effectively carried out on the quantized network, with much lower memory and storage consumption. Quantitative evaluation on two publicly available benchmarks demonstrates the promising performance of our approach: with comparable classification accuracy, it achieves 4 to 6x acceleration and 15 to 20x compression. With our method, accurate image classification can even be directly carried out on mobile devices within 1 s.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Oncology

Special issue The advance of solid tumor research in China: Presurgical therapy in the management of local retroperitoneal recurrence of renal cell carcinoma after radical nephrectomy

Yueming Wang, Jiwei Huang, Cuijian Zhang, Xiaoyi Hu, Ping Wang, Guohai Shi, Jin Zhang, Wen Kong, Yonghui Chen, Yiran Huang, Dingwei Ye, Dan Xia, Jianming Guo, Zhisong He, Wei Xue

Summary: This retrospective multicenter study analyzed the clinical outcomes and prognostic factors of 85 patients who underwent targeted therapy for local retroperitoneal recurrence (RPR) after radical nephrectomy (RN). The study found that presurgical targeted therapy followed by surgical resection was associated with better cancer-specific survival (CSS) compared to targeted therapy alone. Factors such as risk classification, number of recurrence lesions, and surgical resection were independent predictors of CSS.

INTERNATIONAL JOURNAL OF CANCER (2023)

Article Automation & Control Systems

A self-triggered stochastic model predictive control for uncertain networked control system

Hongyuan Wang, Jingcheng Wang, Haotian Xu, Shangwei Zhao

Summary: This paper proposes a self-triggered stochastic model predictive control (SMPC) algorithm for networked control systems with disturbances and communication frequency restrictions. The algorithm integrates the design of control inputs and sampling interval to reduce communication while ensuring performance. Chance constraints are transformed into deterministic ones, and the nonconvex terms in covariance propagation are avoided using an upper bound control approach. The proposed algorithm is shown to be effective and achieves the desired performance through numerical simulations.

INTERNATIONAL JOURNAL OF CONTROL (2023)

Article Chemistry, Physical

Nonradical-dominated peroxymonosulfate activation through bimetallic Fe/Mn-loaded hydroxyl-rich biochar for efficient degradation of tetracycline

Yihui Li, Deying Lin, Yongfu Li, Peikun Jiang, Xiaobo Fang, Bing Yu

Summary: In this study, bimetallic Fe/Mn-loaded hydroxyl-rich biochar (FeMn-OH-BC) is synthesized to activate peroxymonosulfate (PMS) through nonradical-dominated pathways for degrading organic pollutants. The FeMn-OH-BC exhibits excellent catalytic activity at a wide range of pH conditions and is not inhibited by various anions. Electron paramagnetic resonance measurements and quenching tests confirm that O-1(2) is the major reactive oxygen species generated from FeMn-OH-BC based PMS activation.

NANO RESEARCH (2023)

Article Dermatology

Relationships among social support, coping style, self-stigma, and quality of life in patients with diabetic foot ulcer: A multicentre, cross-sectional study

Ruzhen Luo, Yunan Ji, Yan-hui Liu, Hongyu Sun, Siyuan Tang, Xuechun Li

Summary: This study examines the relationships between social support, decision regret, self-stigma, and quality of life in patients with diabetic foot ulcers. The results show that quality of life is negatively correlated with self-stigma, positively correlated with social support and giving up coping, and not significantly correlated with confrontation coping and avoidance coping.

INTERNATIONAL WOUND JOURNAL (2023)

Article Clinical Neurology

Increased Free Water in the Substantia Nigra in Asymptomatic LRRK2 G2019S Mutation Carriers

Dongling Zhang, Liche Zhou, Yuting Shi, Jun Liu, Hongjiang Wei, Qiqi Tong, Hongjian He, Tao Wu

Summary: The study found that the free water (FW) values in the posterior substantia nigra (pSN) were significantly elevated and continued to increase in asymptomatic LRRK2 G2019S mutation carriers. There was also a negative correlation between FW changes in the left pSN and striatal binding ratio (SBR) changes in the left putamen.

MOVEMENT DISORDERS (2023)

Article Optics

Readout-efficient superconducting nanowire single-photon imager with orthogonal time-amplitude multiplexing by hotspot quantization

Ling-Dong Kong, Hui Wang, Qing-Yuan Zhao, Jia-Wei Guo, Yang-Hui Huang, Hao Hao, Shi Chen, Xue-Cou Tu, La-Bao Zhang, Xiao-Qing Jia, Lin Kang, Jian Chen, Pei-Heng Wu

Summary: A kilopixel imager is developed by introducing an orthogonal time-amplitude-multiplexing method in superconducting nanowire single-photon detectors. The readout is solely built in the nanowire structure to manipulate the hotspot growth and microwave propagation after photon detection. The experiment results show that the imager has high pixel fidelity and temporal resolution, making it suitable for single-photon imaging experiments in photon-starved conditions. This research is of great importance for the development of large-scale single-photon imagers in fields such as quantum measurements, remote sensing, and astronomical telescopes.

NATURE PHOTONICS (2023)

Article Public, Environmental & Occupational Health

Trends in dietary carbohydrates, protein and fat intake and diet quality among Chinese adults, 1991-2015: results from the China Health and Nutrition Survey

Lina Huang, Liusen Wang, Hongru Jiang, Huijun Wang, Zhihong Wang, Bing Zhang, Gangqiang Ding

Summary: There have been significant changes in the macronutrient composition of the Chinese adult diet over the past few decades, with a decrease in the percentage of energy consumed from carbohydrates and an increase in the percentage of energy consumed from total fat. The diet quality remains suboptimal.

PUBLIC HEALTH NUTRITION (2023)

Article Environmental Sciences

Sex-specific and dose-response relationships of urinary cobalt and molybdenum levels with glucose levels and insulin resistance in US adults

Jingli Yang, Yongbin Lu, Yana Bai, Zhiyuan Cheng

Summary: Growing studies have shown a potential link between metal exposure and diabetes risk. This study used a multiple linear regression model to examine the associations between urinary metals (cobalt and molybdenum) and diabetes-related indicators in a cross-sectional study. The results demonstrated significant sex-specific and dose-response relationships between urinary metals and diabetes-related indicators, suggesting the need for further investigation into the underlying mechanisms.

JOURNAL OF ENVIRONMENTAL SCIENCES (2023)

Article Plant Sciences

Zhenwu decoction ameliorates cardiac hypertrophy through activating sGC (soluble guanylate cyclase) - cGMP (cyclic guanosine monophosphate) - PKG (protein kinase G) pathway

Liqian Chen, Xinghong Zhou, Yijian Deng, Ying Yang, Xiaohu Chen, Qinghong Chen, Yanyan Liu, Xiuqiong Fu, Hiu Yee Kwan, Yanting You, Wen Jin, Xiaoshan Zhao

Summary: This study aimed to confirm the protective effects of ZWD on cardiac hypertrophy and explore the underlying mechanisms. The results showed that ZWD reduces oxidative stress and inflammation and exerts cardioprotective effects by activating the sGC-cGMP-PKG signaling pathway.

JOURNAL OF ETHNOPHARMACOLOGY (2023)

Article Genetics & Heredity

Detecting genomic mosaicism in de novo genetic epilepsy by amplicon-based deep sequencing

Jiaoyang Chen, Yi Chen, Ying Yang, Xueyang Niu, Jing Zhang, Qi Zeng, Aijie Liu, Xiaojing Xu, Xiaoxu Yang, Shupin Li, Xiaoling Yang, Yi Wang, Yuehua Zhang

Summary: This study investigated the occurrence of mosaicism in epilepsy probands and their parents, and found that mosaic phenomenon is not rare in families with epilepsy. The phenotypes of mosaic parents were milder or normal, and ADS is a reliable way of mosaicism detection for clinical application.

JOURNAL OF HUMAN GENETICS (2023)

Article Computer Science, Artificial Intelligence

ECBC: Efficient Convolution via Blocked Columnizing

Tianli Zhao, Qinghao Hu, Xiangyu He, Weixiang Xu, Jiaxing Wang, Cong Leng, Jian Cheng

Summary: This article introduces an efficient convolution algorithm called ECBC, which achieves high computation performance by utilizing highly optimized subroutines in matrix multiplication and dividing the convolution computation into blocks using a block algorithm. This approach helps reduce the memory overhead significantly.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Pharmacology & Pharmacy

TRIB3 promotes pulmonary fibrosis through inhibiting SLUG degradation by physically interacting with MDM2

Xiaoxi Lv, Shanshan Liu, Chang Liu, Yunxuan Li, Tingting Zhang, Jie Qi, Ke Li, Fang Hua, Bing Cui, Xiaowei Zhang, Yuxin Liu, Jiaojiao Yu, Jinmei Yu, Li Li, Xia Li, Zhigang Yao, Bo Huang

Summary: This study reveals that repetitive lung damage leads to the accumulation of the transcriptional repressor SLUG, inhibiting lung alveolar regeneration and causing the failure of pulmonary fibrosis. The elevated SLUG represses the expression of phosphate transporter SLC34A2, reducing intracellular phosphate and inhibiting key kinases JNK and P38 MAPK, resulting in LAR failure. Targeting the TRIB3/MDM2 interaction can restore LAR capacity and provide potential therapeutic efficacy against fibroproliferative lung diseases.

ACTA PHARMACEUTICA SINICA B (2023)

Article Engineering, Biomedical

Seamless and early gap healing of osteochondral defects by autologous mosaicplasty combined with bioactive supramolecular nanofiber-enabled gelatin methacryloyl (BSN-GelMA) hydrogel

Hongwei Wu, Yuna Shang, Wei Sun, Xinyi Ouyang, Wenyan Zhou, Jieji Lu, Shuhui Yang, Wei Wei, Xudong Yao, Xiaozhao Wang, Xianzhu Zhang, Yishan Chen, Qiulin He, Zhimou Yang, Hongwei Ouyang

Summary: This study developed a strategy to enhance integration of the gap region following mosaicplasty by applying BSN-GelMA hydrogel. The results showed that BSN-GelMA achieved seamless osteochondral healing in the gap region, with improved cartilage repair scores, glycosaminoglycan content, subchondral bone volume, and collagen II expression. This study provides a powerful approach to improve gap integration after autologous mosaicplasty and offers a promising bioactive material for cell-free tissue regeneration.

BIOACTIVE MATERIALS (2023)

Article Computer Science, Artificial Intelligence

Extremely Sparse Networks via Binary Augmented Pruning for Fast Image Classification

Peisong Wang, Fanrong Li, Gang Li, Jian Cheng

Summary: In this article, the advantages of extremely sparse networks with binary connections for image classification through software-hardware codesign are investigated. A binary augmented extremely pruning method is proposed to achieve high sparsity with minimal accuracy degradation, and a hardware architecture based on the resulting sparse and binary networks is designed to explore the benefits of extreme sparsity with negligible consumption. Experiments on ImageNet classification and FPGA demonstrate a significant tradeoff between accuracy and efficiency in the proposed software-hardware architecture.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

GPENs: Graph Data Learning With Graph Propagation-Embedding Networks

Bo Jiang, Leiling Wang, Jian Cheng, Jin Tang, Bin Luo

Summary: Compact representation of graph data is a fundamental problem in pattern recognition and machine learning. Graph neural networks (GNNs) have been widely studied for graph-structured data representation and learning. In this article, a new model called graph propagation-embedding networks (GPENs) is proposed, which integrates feature propagation and low-dimensional embedding into a unified network. GPENs have good motivation and can be well-explained, and they guarantee both compactness and efficiency. Experiments demonstrate the effectiveness and benefits of GPENs and M-GPENs.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

No Data Available