☆ 4.7 Article

Converting tabular data into images for deep learning with convolutional neural networks

SCIENTIFIC REPORTS (2021)

Journal

SCIENTIFIC REPORTS

Volume 11, Issue 1, Pages -

Publisher

NATURE PORTFOLIO

DOI: 10.1038/s41598-021-90923-y

Keywords

Funding

Joint Design of Advanced Computing Solutions for Cancer (JDACS4C) program by the U.S. Department of Energy (DOE)
National Cancer Institute (NCI) of the National Institutes of Health
U.S. Department of Energy by Argonne National Laboratory [DE-AC02-06-CH11357]
U.S. Department of Energy by Lawrence Livermore National Laboratory [DE-AC52-07NA27344]
U.S. Department of Energy by Los Alamos National Laboratory [DE-AC5206NA25396]
U.S. Department of Energy by Oak Ridge National Laboratory [DE-AC05-00OR22725]
National Cancer Institute, National Institutes of Health [HHSN261200800001E]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

The IGTD algorithm transforms tabular data into images by assigning similar features close to each other in the image through optimized allocation of features to pixel positions. Compared to existing methods, IGTD generates compact image representations with better preservation of feature neighborhood structure.

Convolutional neural networks (CNNs) have been successfully used in many applications where important information about data is embedded in the order of features, such as speech and imaging. However, most tabular data do not assume a spatial relationship between features, and thus are unsuitable for modeling using CNNs. To meet this challenge, we develop a novel algorithm, image generator for tabular data (IGTD), to transform tabular data into images by assigning features to pixel positions so that similar features are close to each other in the image. The algorithm searches for an optimized assignment by minimizing the difference between the ranking of distances between features and the ranking of distances between their assigned pixels in the image. We apply IGTD to transform gene expression profiles of cancer cell lines (CCLs) and molecular descriptors of drugs into their respective image representations. Compared with existing transformation methods, IGTD generates compact image representations with better preservation of feature neighborhood structure. Evaluated on benchmark drug screening datasets, CNNs trained on IGTD image representations of CCLs and drugs exhibit a better performance of predicting anti-cancer drug response than both CNNs trained on alternative image representations and prediction models trained on the original tabular data.

Converting tabular data into images for deep learning with convolutional neural networks

Journal

SCIENTIFIC REPORTS

Publisher

NATURE PORTFOLIO

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Converting tabular data into images for deep learning with convolutional neural networks

Journal

SCIENTIFIC REPORTS

Publisher

NATURE PORTFOLIO

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper