4.7 Article

MPCA SGD - A Method for Distributed Training of Deep Learning Models on Spark

Journal

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS
Volume 29, Issue 11, Pages 2540-2556

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TPDS.2018.2833074

Keywords

Deep learning; distributed computing; machine learning; neural networks; spark; stochastic gradient descent

Many distributed deep learning systems have been published over the past few years, often accompanied by impressive performance claims. In practice, these figures are often achieved in high-performance computing (HPC) environments with fast InfiniBand network connections. For the average deep learning practitioner this is usually an unrealistic scenario, since such facilities are unaffordable. Simple re-implementations of algorithms such as EASGD [1] for standard Ethernet environments often fail to replicate the scalability and performance of the original works [2]. In this paper, we explore this problem domain and present MPCA SGD, a method for distributed training of deep neural networks that is specifically designed to run in low-budget environments. MPCA SGD tries to make the best possible use of available resources and operates well even when network bandwidth is constrained. Furthermore, MPCA SGD runs on top of the popular Apache Spark [3] framework, so it can easily be deployed in existing data centers and office environments where Spark is already used. When training large deep learning models in a gigabit Ethernet cluster, MPCA SGD achieves significantly faster convergence rates than many popular alternatives. For example, MPCA SGD can train ResNet-152 [4] up to 5.3x faster than state-of-the-art systems like MXNet [5], bulk-synchronous systems like SparkNet [6], and decentralized asynchronous systems like EASGD [1].

Recommended

Article Computer Science, Hardware & Architecture

Finding lowest-cost paths in settings with safe and preferred zones

Saad Aljubayrin, Jianzhong Qi, Christian S. Jensen, Rui Zhang, Zhen He, Yuan Li

VLDB JOURNAL (2017)

Article Computer Science, Information Systems

GCOTraj: A storage approach for historical trajectory data sets using grid cells ordering

Shengxun Yang, Zhen He, Yi-Ping Phoebe Chen

INFORMATION SCIENCES (2018)

Article Substance Abuse

How much are we exposed to alcohol in electronic media? Development of the Alcoholic Beverage Identification Deep Learning Algorithm (ABIDLA)

Emmanuel Kuntsche, Abraham Albert Bonela, Gabriel Caluzzi, Mia Miller, Zhen He

DRUG AND ALCOHOL DEPENDENCE (2020)

Article Computer Science, Theory & Methods

Distributed Training of Deep Learning Models: A Taxonomic Perspective

Matthias Langer, Zhen He, Wenny Rahayu, Yanbo Xue

IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS (2020)

Article Computer Science, Artificial Intelligence

The detection, tracking, and temporal action localisation of swimmers for automated analysis

Ashley Hall, Brandon Victor, Zhen He, Matthias Langer, Marc Elipot, Aiden Nibali, Stuart Morgan

Summary: It is crucial for swimming coaches to analyze swimmers' performance for strategy adjustment, relying on statistics derived from time-consuming manual video annotations. A two-phased deep learning approach called DeepDASH and a hierarchical tracking algorithm called HISORT are proposed to solve computer vision tasks in swimming videos, achieving significant improvements in swimmer head detection, tracking, and stroke detection.

NEURAL COMPUTING & APPLICATIONS (2021)

Article Biochemical Research Methods

Wheat physiology predictor: predicting physiological traits in wheat from hyperspectral reflectance measurements using deep learning

Robert T. Furbank, Viridiana Silva-Perez, John R. Evans, Anthony G. Condon, Gonzalo M. Estavillo, Wennan He, Saul Newman, Richard Poire, Ashley Hall, Zhen He

Summary: The study demonstrates that the accuracy of predicting wheat photosynthetic and leaf traits using deep learning and ensemble models can be improved compared to PLSR without overfitting. These models can be flexibly applied across different spectral ranges without compromising accuracy.

PLANT METHODS (2021)

Article Biology

Class activation attention transfer neural networks for MCI conversion prediction

Min Luo, Zhen He, Hui Cui, Yi-Ping Phoebe Chen, Phillip Ward

Summary: We propose a novel attention transfer method for accurately predicting the progression of Alzheimer's disease (AD) in patients with mild cognitive impairment (MCI). Our method trains a 3D convolutional neural network to automatically learn regions of interest (ROI) from images and transfer attention maps instead of model weights. Our method outperformed traditional transfer learning and methods using expert knowledge to define ROI, and the attention map revealed Alzheimer's pathology.

COMPUTERS IN BIOLOGY AND MEDICINE (2023)

Proceedings Paper Engineering, Electrical & Electronic

3D Human Pose Estimation with 2D Marginal Heatmaps

Aiden Nibali, Zhen He, Stuart Morgan, Luke Prendergast

2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV) (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Extraction and Classification of Diving Clips from Continuous Video Footage

Aiden Nibali, Zhen He, Stuart Morgan, Daniel Greenwood

2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW) (2017)
