4.7 Article

Fast Parallel Markov Clustering in Bioinformatics Using Massively Parallel Computing on GPU with CUDA and ELLPACK-R Sparse Format

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2011.68

关键词

Markov clustering; graphs and networks; GPU computing; PPI networks; CUDA; ELLPACK-R sparse format; scalable parallel programming; parallelism and concurrency; performance evaluation; bioinformatics

资金

  1. AUSAID
  2. ARC Center of Excellence in Bioinformatics at IMB the University of Queensland

向作者/读者索取更多资源

Markov clustering (MCL) is becoming a key algorithm within bioinformatics for determining clusters in networks. However, with increasing vast amount of data on biological networks, performance and scalability issues are becoming a critical limiting factor in applications. Meanwhile, GPU computing, which uses CUDA tool for implementing a massively parallel computing environment in the GPU card, is becoming a very powerful, efficient, and low-cost option to achieve substantial performance gains over CPU approaches. The use of on-chip memory on the GPU is efficiently lowering the latency time, thus, circumventing a major issue in other parallel computing environments, such as MPI. We introduce a very fast Markov clustering algorithm using CUDA (CUDA-MCL) to perform parallel sparse matrix-matrix computations and parallel sparse Markov matrix normalizations, which are at the heart of MCL. We utilized ELLPACK-R sparse format to allow the effective and fine-grain massively parallel processing to cope with the sparse nature of interaction networks data sets in bioinformatics applications. As the results show, CUDA-MCL is significantly faster than the original MCL running on CPU. Thus, large-scale parallel computation on off-the-shelf desktop-machines, that were previously only possible on supercomputing architectures, can significantly change the way bioinformaticians and biologists deal with their data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Chemistry, Multidisciplinary

Comparison of Dengue Predictive Models Developed Using Artificial Neural Network and Discriminant Analysis with Small Dataset

Permatasari Silitonga, Alhadi Bustamam, Hengki Muradi, Wibowo Mangunwardoyo, Beti E. Dewi

Summary: The study developed models using Artificial Neural Network (ANN) and Discriminant Analysis (DA) to predict the severity level of dengue based on laboratory test results, achieving high accuracy of 90.91%, sensitivity of 91.11%, and specificity of 95.51%. The proposed model can assist physicians in timely predicting and treating dengue patients to prevent fatal cases.

APPLIED SCIENCES-BASEL (2021)

Article Computer Science, Artificial Intelligence

Interpretable deep learning systems for multi-class segmentation and classification of non-melanoma skin cancer

Simon M. Thomas, James G. Lefevre, Glenn Baxter, Nicholas A. Hamilton

Summary: This study applies interpretable deep learning methods to analyze the most common skin cancers in a histological setting, demonstrating the potential for automatic machine analysis of dermatopathology work. By characterizing tissue into meaningful dermatological classes, the research aims to pave the way for future computer aided diagnosis systems with human interpretable outcomes.

MEDICAL IMAGE ANALYSIS (2021)

Article Biochemical Research Methods

LLAMA: a robust and scalable machine learning pipeline for analysis of large scale 4D microscopy data: analysis of cell ruffles and filopodia

James G. Lefevre, Yvette W. H. Koh, Adam A. Wall, Nicholas D. Condon, Jennifer L. Stow, Nicholas A. Hamilton

Summary: This study introduces LLAMA, a platform for systematic analysis of terabyte-scale 4D microscopy datasets. The system utilizes machine learning for semantic segmentation and object analysis and tracking algorithms, running on high-performance computing to achieve high throughput. LLAMA provides detailed numerical and visual outputs for effective statistical analysis and has the capacity to screen large datasets for specific structural configurations.

BMC BIOINFORMATICS (2021)

Article Biology

Virtual screening of dipeptidyl peptidase-4 inhibitors using quantitative structure-activity relationship-based artificial intelligence and molecular docking of hit compounds

Oky Hermansyah, Alhadi Bustamam, Arry Yanuar

Summary: In this study, a virtual screening workflow was developed using quantitative structure-activity relationship (QSAR) strategy based on artificial intelligence to identify DPP-4-inhibitor hit compounds selective against DPP-8 and DPP-9. The study utilized regression and classification machine learning algorithms to build the virtual screening workflows, resulting in the identification of potential hit compounds with high inhibitory potential against DPP-4 and low inhibitory potential against DPP-8 and DPP-9. This technique showed effectiveness in identifying DPP-4-inhibitor hit compounds and has potential applications for discovering hit compounds of other targets.

COMPUTATIONAL BIOLOGY AND CHEMISTRY (2021)

Article Biology

Quantitative analysis of tumour spheroid structure

Alexander P. Browning, Jesse A. Sharp, Ryan J. Murphy, Gency Gunasingh, Brodie Lawson, Kevin Burrage, Nikolas K. Haass, Matthew Simpson

Summary: Tumour spheroids are common experimental models in vitro, closely mimicking avascular tumour growth. Research suggests that spheroids have a limiting structure and the mature structure is independent of seeding density. Additionally, comparing spheroid structure rather than size produces more accurate results.
Article Computer Science, Information Systems

Comparative Analysis of Performance between Multimodal Implementation of Chatbot Based on News Classification Data Using Categories

Prasnurzaki Anki, Alhadi Bustamam, Rinaldi Anwar Buyung

Summary: This study focuses on the application of sentence classification using the News Aggregator Dataset to create a chatbot program. Results show that the 1D CNN Transpose model achieves the highest accuracy in testing. Through testing four models via multimodal implementation, accurate sentence prediction and detection are expected for both types of chatbot.

ELECTRONICS (2021)

Article Computer Science, Information Systems

Deep Feature Vectors Concatenation for Eye Disease Detection Using Fundus Image

Radifa Hilya Paradisa, Alhadi Bustamam, Wibowo Mangunwardoyo, Andi Arus Victor, Anggun Rama Yudantha, Prasnurzaki Anki

Summary: Fundus image is crucial for disease detection, especially for early diagnosis of diabetic retinopathy. The demand for ophthalmologists who can read fundus images is increasing, leading to a need for an automated diagnostic system. This study proposes a deep learning approach using a concatenate model for fundus image classification, yielding improved accuracy and F1-score compared to a single model.

ELECTRONICS (2022)

Article Multidisciplinary Sciences

Non-melanoma skin cancer segmentation for histopathology dataset

Simon M. Thomas, James G. Lefevre, Glenn Baxter, Nicholas A. Hamilton

Summary: The dataset includes 290 hand-annotated histopathology tissue sections of the three most common skin cancers, each with a segmentation mask dividing the tissue into 12 types. It also provides cancer margin measurements for automated assessment, allowing researchers to build upon recent work in skin cancer image analysis.

DATA IN BRIEF (2021)

Article Mathematics, Applied

Effective numerical methods for simulating diffusion on a spherical surface in three dimensions

Kevin Burrage, Pamela M. Burrage, Grant Lythe

Summary: This paper presents an algorithm for homogeneous diffusive motion on a sphere by considering the equivalent process of a randomly rotating spin vector. By introducing appropriate sets of random variables, families of methods are constructed that effectively preserve the spin modulus for every realization, achieved by exponentiating an antisymmetric matrix.

NUMERICAL ALGORITHMS (2022)

Article Multidisciplinary Sciences

Analysis of sloppiness in model simulations: Unveiling parameter uncertainty when mathematical models are fitted to data

Gloria M. Monsalve-Bravo, Brodie A. J. Lawson, Christopher Drovandi, Kevin Burrage, Kevin S. Brown, Christopher M. Baker, Sarah A. Vollert, Kerrie Mengersen, Eve McDonald-Madden, Matthew P. Adams

Summary: This work introduces a comprehensive approach to assess the sensitivity of model outputs to changes in parameter values, constrained by the combination of prior beliefs and data. It identifies stiff parameter combinations affecting the model-data fit, and reveals which of these combinations are primarily influenced by the data or the priors. The technique is beneficial in contexts where data is limited compared to the number of model parameters, and has applications in biochemistry, ecology, and cardiac electrophysiology. It also helps uncover controlling mechanisms and guide parameter prioritization for improved parameter inference.

SCIENCE ADVANCES (2022)

Article Mathematics, Applied

A meshfree radial basis function method for simulation of multi-dimensional conservation problems

Brody H. H. Foy, Kevin Burrage, Ian Turner

Summary: This study proposes a meshfree numerical scheme based on strong-form finite volume style formulations. The technique uses radial basis functions to interpolate the problem domain and approximate fluxes in a disjoint finite volume scheme, eliminating the reliance on a mesh structure. The method shows potential for applications in porous media modeling and computational fluid dynamics.

NUMERICAL METHODS FOR PARTIAL DIFFERENTIAL EQUATIONS (2023)

Article Mathematics, Applied

Stability Switching in Lotka-Volterra and Ricker-Type Predator-Prey Systems with Arbitrary Step Size

Shamika Kekulthotuwage Don, Kevin Burrage, Kate J. Helmstedt, Pamela M. Burrage

Summary: We investigated the dynamical properties of discrete systems under two settings: discrete and continuous. By discretizing time, we obtained stability conditions that maintain the characteristics of continuous models and their numerical approximations. We found that small changes in model parameters can alter system dynamics unless an appropriate time discretization is chosen. We also observed similar dynamical properties in Ricker-type predator-prey systems under certain conditions. Our results highlight the importance of preliminary analysis in determining agreement or disagreement between the dynamical properties of approximated discrete systems and their continuous counterparts.

AXIOMS (2023)

Article Multidisciplinary Sciences

Implications of different membrane compartmentalization models in particle-based in silico studies

Philipp Henning, Till Koester, Fiete Haack, Kevin Burrage, Adelinde M. Uhrmacher

Summary: Studying membrane dynamics is crucial for understanding cellular response to environmental stimuli. The plasma membrane's compartmental structure, created by actin-based membrane-skeleton and anchored transmembrane proteins, plays an important role in this process. Particle-based reaction-diffusion simulation offers a suitable approach for analyzing the membrane's stochastic and spatially heterogeneous dynamics. However, different methods for modeling the compartmental structure have their own constraints and impact on simulation results and performance.

ROYAL SOCIETY OPEN SCIENCE (2023)

Review Multidisciplinary Sciences

Implementation and acceleration of optimal control for systems biology

Jesse A. Sharp, Kevin Burrage, Matthew J. Simpson

Summary: This review discusses the application of Pontryagin's maximum principle (PMP) in optimal control and the implementation of the forward-backward sweep method (FBSM). By conceptualizing FBSM as a fixed point iteration process and adapting existing acceleration techniques, the rate of convergence can be improved without costly tuning. Moreover, these methods can induce convergence in cases where the FBSM fails to converge.

JOURNAL OF THE ROYAL SOCIETY INTERFACE (2021)

Article Computer Science, Theory & Methods

Artificial intelligence paradigm for ligand-based virtual screening on the drug discovery of type 2 diabetes mellitus

Alhadi Bustamam, Haris Hamzah, Nadya A. Husna, Sarah Syarofina, Nalendra Dwimantara, Arry Yanuar, Devvi Sarwinda

Summary: This study aims to develop new DPP-4 inhibitors for the treatment of type 2 diabetes with low adverse effects using QSAR models built with Rotation Forest and Deep Neural Network. K-modes clustering and CatBoost are utilized for molecule selection and feature selection, resulting in QSAR models with high performance metrics. The study concludes that feature selection using CatBoost before building QSAR models is essential for accurate predictions.

JOURNAL OF BIG DATA (2021)

暂无数据