4.7 Article

RF-MaloSite and DL-Malosite: Methods based on random forest and deep learning to identify malonylation sites

Journal

Publisher

ELSEVIER
DOI: 10.1016/j.csbj.2020.02.012

Keywords

Malonylation; Post-translational Modification Sites; Random forest; Deep learning; Convolutional neural network

Funding

  1. National Science Foundation (NSF) [2021734, 1564606, 1901793]
  2. HBCU-UP Excellence in Research Award from NSF [1901793]
  3. SC1 Award from the National Institutes of Health National Institute of General Medical Science [5SC1GM130545]
  4. JSPS KAKENHI [JP18H01762, JP19H04176]
  5. Direct For Biological Sciences
  6. Div Of Biological Infrastructure [1564606, 2021734] Funding Source: National Science Foundation
  7. Direct For Biological Sciences
  8. Div Of Biological Infrastructure [1901793] Funding Source: National Science Foundation

Ask authors/readers for more resources

Malonylation, which has recently emerged as an important lysine modification, regulates diverse biological activities and has been implicated in several pervasive disorders, including cardiovascular disease and cancer. However, conventional global proteomics analysis using tandem mass spectrometry can be time-consuming, expensive and technically challenging. Therefore, to complement and extend existing experimental methods for malonylation site identification, we developed two novel computational methods for malonylation site prediction based on random forest and deep learning machine learning algorithms, RF-MaloSite and DL-MaloSite, respectively. DL-MaloSite requires the primary amino acid sequence as an input and RF-MaloSite utilizes a diverse set of biochemical, physiochemical and sequence-based features. While systematic assessment of performance metrics suggests that both 'RFMaloSite' and `DL-MaloSite' perform well in all metrics tested, our methods perform particularly well in the areas of accuracy, sensitivity and overall method performance (assessed by the Matthew's Correlation Coefficient). For instance, RF-MaloSite exhibited MCC scores of 0.42 and 0.40 using 10-fold cross-validation and an independent test set, respectively. Meanwhile, DL-MaloSite was characterized by MCC scores of 0.51 and 0.49 based on 10-fold cross-validation and an independent set, respectively. Importantly, both methods exhibited efficiency scores that were on par or better than those achieved by existing malonylation site prediction methods. The identification of these sites may also provide important insights into the mechanisms of crosstalk between malonylation and other lysine modifications, such as acetylation, glutarylation and succinylation. To facilitate their use, both methods have been made freely available to the research community at Lps://github.comjdukkakcIDL-MaloSite-and-RF-MaloSite. (C) 2020 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Software Engineering

Understanding movie poster: transfer-deep learning approach for graphic-rich text recognition

Mridul Ghosh, Sayan Saha Roy, Himadri Mukherjee, Sk Md Obaidullah, K. C. Santosh, Kaushik Roy

Summary: Graphic-rich texts in posters, especially in movie posters, play a vital role in conveying information and genre sentiments. Recognizing and localizing these texts require specific techniques. This paper introduces a transfer learning-based approach that achieved high accuracy on a newly developed dataset, outperforming previous tools relying on handcrafted features.

VISUAL COMPUTER (2022)

Article Computer Science, Information Systems

A study of the performance of embedding methods for Arabic short-text sentiment analysis using deep learning approaches

Ali Alwehaibi, Marwan Bikdash, Mohammad Albogmi, Kaushik Roy

Summary: This paper proposes an optimized sentiment classification method based on deep learning for dialectal Arabic short text at the document level. The research results show significant performance improvement in Arabic text classification.

JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES (2022)

Article Computer Science, Information Systems

CNN based recognition of handwritten multilingual city names

Ramit Kumar Roy, Himadri Mukherjee, Kaushik Roy, Umapada Pal

Summary: Accurately recognizing destination city names is crucial for postal documents to reach their intended addresses. In India, people often mix up scripts when writing addresses due to the country's multilingual and multi script nature. This paper presents a Convolutional Neural Network (CNN) based approach for recognizing handwritten multilingual multiscript Indian city names. The proposed scheme achieves high accuracy in both single script and multi script scenarios, with a maximum accuracy of 98.01%.

MULTIMEDIA TOOLS AND APPLICATIONS (2022)

Article Computer Science, Artificial Intelligence

Non-volume preserving-based fusion to group-level emotion recognition on crowd videos

Kha Gia Quach, Ngan Le, Chi Nhan Duong, Ibsa Jalata, Kaushik Roy, Khoa Luu

Summary: Group-level emotion recognition is a growing research area that is becoming increasingly important for assessing crowds of all sizes in the security and social media domains. This work extends previous research on group-level emotion recognition from single images or videos to fully investigate expression recognition in crowd videos through an effective deep feature level fusion mechanism.

PATTERN RECOGNITION (2022)

Article Business

Evaluation of Diagnostic Performance of Machine Learning Algorithms to Classify the Fetal Heart Rate Baseline From Cardiotocograph

Sahana Das, Sk Md Obaidullah, Kaushik Roy, Chanchal Kumar Saha

Summary: Cardiotocography (CTG) is a widely used technique to monitor fetal health. This study uses machine learning algorithms to accurately classify the baseline and compares the results with visual estimation by obstetricians, with FURIA algorithm achieving the highest accuracy.

INTERNATIONAL JOURNAL OF BUSINESS ANALYTICS (2022)

Article Computer Science, Information Systems

Comparative study on the performance of the state-of-the-art CNN models for handwritten Bangla character recognition

Payel Rakshit, Somnath Chatterjee, Chayan Halder, Shibaprasad Sen, Sk Md Obaidullah, Kaushik Roy

Summary: This paper discusses the application of popular Convolutional Neural Networks (CNNs) in Bangla handwritten character recognition and evaluates the performance of each network. The study shows the superior performance of CNN models in Bangla handwritten character recognition.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Review Computer Science, Information Systems

Advances in online handwritten recognition in the last decades

Trishita Ghosh, Shibaprasad Sen, Sk. Md. Obaidullah, K. C. Santosh, Kaushik Roy, Umapada Pal

Summary: The easy availability and rapid use of online devices have increased the demand for online handwriting recognition. This paper discusses various machine learning and deep learning approaches for recognizing online handwritten characters, words, and texts. The advantages and challenges of online handwriting recognition are also addressed.

COMPUTER SCIENCE REVIEW (2022)

Article Computer Science, Artificial Intelligence

A generalized line segmentation method for multi-script handwritten text documents

Payel Rakshit, Chayan Halder, Sk Md Obaidullah, Kaushik Roy

Summary: This paper presents a multi-script text line segmentation algorithm based on newly developed light projection, start point detection, and boundary tracking methods. The proposed approach overcomes the hindrance faced by state-of-the-art methods and achieves promising results on various public handwritten datasets.

EXPERT SYSTEMS WITH APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

Scene text understanding: recapitulating the past decade

Mridul Ghosh, Himadri Mukherjee, Sk Md Obaidullah, Xiao-Zhi Gao, Kaushik Roy

Summary: Computational perception has experienced a significant transformation from handcrafted feature-based techniques to deep learning in the field of scene text identification and recognition. Over the past decade, there have been important developments and advancements in this area. The traditional handcrafted feature-based techniques have been replaced by deep learning-based techniques, leading to a new stage in scene text identification.

ARTIFICIAL INTELLIGENCE REVIEW (2023)

Article Computer Science, Information Systems

Hybrid approach for text categorization: A case study with Bangla news article

Ankita Dhar, Himadri Mukherjee, Kaushik Roy, K. C. Santosh, Niladri Sekhar Dash

Summary: This article introduces a hybrid approach that combines text-based and graph-based features to showcase the effectiveness of an automatic text categorization system. The approach was applied on 14,373 Bangla articles, collected from various online news corpora covering nine categories. The experiments also include the application of the features on two popular English datasets to test the system's robustness and language independency.

JOURNAL OF INFORMATION SCIENCE (2023)

Article Medicine, General & Internal

Fetal Health Classification from Cardiotocograph for Both Stages of Labor-A Soft-Computing-Based Approach

Sahana Das, Himadri Mukherjee, Kaushik Roy, Chanchal Kumar Saha

Summary: Cardiotocography (CTG) is currently the only non-invasive and cost-effective tool for continuous fetal health monitoring. Automated analysis of CTG remains challenging due to the complex and dynamic patterns of fetal heart, which are poorly interpreted. In this study, a machine-learning-based model using SVM, RF, MLP, and bagging was proposed, achieving high accuracy and showing potential for integration into an automated decision support system.

DIAGNOSTICS (2023)

Article Computer Science, Information Systems

A bi-stage approach to North Indian raga distinction

Debjyoti Basu, Himadri Mukherjee, Matteo Marciano, Shibaprasad Sen, Sajai Vir Singh, Sk Md Obaidullah, Kaushik Roy

Summary: This research proposes a machine learning-based approach to classify the dawn and dusk time ragas in music. Mel-frequency cepstral coefficients are used for feature extraction, and a two-stage classification technique is employed, achieving promising results.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Computer Science, Information Systems

BWordDeepNet: a novel deep learning architecture for the recognition of online handwritten Bangla words

Ankan Bhattacharyya, Somnath Chatterjee, Shibaprasad Sen, S. K. M. D. Obaidullah, Kaushik Roy

Summary: Online handwritten word recognition is still a challenging task, especially for low-resource languages like Bangla. This study explores the use of different recurrent neural network architectures to recognize online handwritten Bangla words. The challenge lies in the variable number of strokes used to write words. The developed segmentation-free recognition module achieves high accuracy by leveraging stroke features and outperforms existing techniques.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Computer Science, Information Systems

City name recognition for Indian postal automation: Exploring script dependent and independent approach

Somnath Chatterjee, Himadri Mukherjee, Shibaprasad Sen, Sk Md Obaidullah, Kaushik Roy

Summary: Postal documents are commonly used for official communication and online shopping. Delivery delays can occur due to various handwritten scripts, necessitating the use of postal sorting facilities. To address this problem, a Deep Learning-based system is proposed to recognize handwritten city names written in six major scripts. Experimental results show high accuracy rates in both script-dependent and independent approaches.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Proceedings Paper Computer Science, Theory & Methods

Secondhand Smart IoT Devices Data Recovery and Digital Investigation

Taiwo Ojo, Hongmei Chi, Janei Elliston, Kaushik Roy

Summary: The probability of retrieving sensitive information from secondhand IoT devices has increased due to advancements in flash memory storage technology. This study investigates data retrieval methods from secondhand memory cards and finds that utilizing software tools is the best way to prevent data leakage.

SOUTHEASTCON 2022 (2022)

No Data Available