4.5 Article

Handwritten Digit Classification in Bangla and Hindi Using Deep Learning

期刊

APPLIED ARTIFICIAL INTELLIGENCE
卷 34, 期 14, 页码 1074-1099

出版社

TAYLOR & FRANCIS INC
DOI: 10.1080/08839514.2020.1804228

关键词

-

向作者/读者索取更多资源

Handwritten digit classification is a well-known and important problem in the field of optical character recognition (OCR). The primary challenge is correctly classifying digits which are highly varied in their visual characteristics primarily due to the writing styles of different individuals. In this paper, we propose the use of Convolutional Neural Networks (CNN) for the purpose of classifying handwritten Bangla and Hindi numerals. The major advantage that we face by using a CNN-based classifier is that no prior hand-crafted feature needs to be extracted from the images for efficient and accurate classification. An added benefit of a CNN classifier is that it provides translational invariance and a certain extent of rotational invariance during recognition. Applications can be found in real-time OCR systems where input images are often not perfectly oriented along a vertical axis. In this work, we use modified versions of the well-known LeNet CNN architecture. Extensive experiments have revealed a best-case classification accuracy of 98.2% for Bangla and 98.8% for Hindi numerals outperforming competitive models in the literature.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Artificial Intelligence

A novel meta-heuristic approach for influence maximization in social networks

Bitanu Chatterjee, Trinav Bhattacharyya, Kushal Kanti Ghosh, Agneet Chatterjee, Ram Sarkar

Summary: This article presents a framework for maximizing influence propagation in a social network, which includes community detection and the utilization of the Shuffled Frog Leaping algorithm. Experimental results show that our method performs well compared to other algorithms.

EXPERT SYSTEMS (2023)

Article Computer Science, Information Systems

A feature selection model for speech emotion recognition using clustering-based population generation with hybrid of equilibrium optimizer and atom search optimization algorithm

Soham Chattopadhyay, Arijit Dey, Pawan Kumar Singh, Ali Ahmadian, Ram Sarkar

Summary: Speech is crucial in human communication and human-computer interaction. In the field of AI and ML, it has been extensively studied to recognize human emotions from speech signals. To address the challenge of large feature dimension, a hybrid feature selection algorithm called CEOAS is proposed. By extracting LPC and LPCC features, the proposed model reduces feature dimension and improves classification accuracy. Impressive recognition accuracies have been achieved on four benchmark datasets, surpassing state-of-the-art algorithms.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

Inverted bell-curve-based ensemble of deep learning models for detection of COVID-19 from chest X-rays

Ashis Paul, Arpan Basu, Mufti Mahmud, M. Shamim Kaiser, Ram Sarkar

Summary: This article discusses the use of deep learning models and an inverted bell-curve weighted ensemble method to assist in the detection of COVID-19 in CXR images. By using transfer learning and retraining models pretrained on the ImageNet dataset, as well as performing weighted average predictions, the accuracy of COVID-19 identification in CXR images can be improved.

NEURAL COMPUTING & APPLICATIONS (2023)

Article Computer Science, Software Engineering

Handwritten Arabic and Roman word recognition using holistic approach

Samir Malakar, Samanway Sahoo, Anuran Chakraborty, Ram Sarkar, Mita Nasipuri

Summary: Handwritten word recognition is an open research problem due to variations in writing style and degraded images. This paper proposes a holistic approach combined with distance calculation and feature descriptors to address the problem. The experimental results demonstrate the effectiveness of the proposed method on standard databases compared to deep learning models.

VISUAL COMPUTER (2023)

Article Computer Science, Artificial Intelligence

A new population initialization approach based on Metropolis-Hastings (MH) method

Erik Cuevas, Hector Escobar, Ram Sarkar, Heba F. Eid

Summary: This paper proposes a new population initialization method for metaheuristic algorithms, where the initial set of candidate solutions is obtained through the sampling of the objective function. The method aims to find initial solutions that are close to the prominent values of the objective function, and these initial points represent promising regions of the search space. The proposed approach shows faster convergence and improved quality of solutions compared to other similar approaches.

APPLIED INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

Breast cancer detection in thermograms using a hybrid of GA and GWO based deep feature selection method

Rishav Pramanik, Payel Pramanik, Ram Sarkar

Summary: Breast cancer is a leading cause of premature death among women globally, but early detection and diagnosis can save lives. Hence, computer scientists are working to develop reliable models to tackle this disease. A proposed lightweight model combines transfer learning-based deep learning (DL) with feature selection to detect abnormalities in breast thermograms. This model performs well in detecting and differentiating malignant and healthy breasts.

EXPERT SYSTEMS WITH APPLICATIONS (2023)

Article Computer Science, Information Systems

Copy-move forgery detection using local tetra pattern based texture descriptor

Sagnik Ganguly, Sanmit Mandal, Samir Malakar, Ram Sarkar

Summary: This paper introduces a new copy-move image forgery detection technique which relies on a texture feature descriptor called Local Tetra Pattern (LTrP) for block level image comparison used to localize tampered region(s). Experimental results demonstrate that the proposed technique has been able to detect the forged regions with higher accuracy as compared to many state-of-the-art copy-move forgery detection methods.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

Deep feature selection using local search embedded social ski-driver optimization algorithm for breast cancer detection in mammograms

Payel Pramanik, Souradeep Mukhopadhyay, Seyedali Mirjalili, Ram Sarkar

Summary: Breast cancer is a common malignancy in women, and early detection is crucial. In this research, a method for classifying breast masses using mammograms is proposed. Deep features are extracted using the VGG16 model with an attention mechanism, and an optimal features subset is obtained using a meta-heuristic algorithm. The proposed model shows successful identification and differentiation of malignant and healthy breasts.

NEURAL COMPUTING & APPLICATIONS (2023)

Correction Computer Science, Artificial Intelligence

Human activity recognition from sensor data using spatial attention-aided CNN with genetic algorithm (Oct, 10.1007/s00521-022-07911-0, 2022)

Apu Sarkar, S. K. Sabbir Hossain, Ram Sarkar

NEURAL COMPUTING & APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

A hierarchical feature selection strategy for deepfake video detection

Sk Mohiuddin, Khalid Hassan Sheikh, Samir Malakar, Juan D. Velasquez, Ram Sarkar

Summary: Digital face manipulation has become a significant concern recently due to its harmful effects on society, particularly for high-profile celebrities who can easily be targeted using apps like FaceSwap and FaceApp. Detecting deepfake images or videos is challenging, and existing models often fail to check for irrelevant or redundant features. In this study, a hierarchical feature selection (HFS) method using a hybrid population-based meta-heuristic model and a single solution-based meta-heuristic model was proposed. The model achieved high AUC scores on three publicly available datasets and outperformed most state-of-the-art methods.

NEURAL COMPUTING & APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

Gamma function based ensemble of CNN models for breast cancer detection in histopathology images

Samriddha Majumdar, Payel Pramanik, Ram Sarkar

Summary: Breast cancer is the second deadliest disease among women globally. Histopathology image analysis is an effective method for detecting tumor malignancies. Computer-aided diagnosis (CAD) using convolutional neural network (CNN) models has shown potential in breast histopathological image classification, but there is room for improvement. This paper proposes a novel rank-based ensemble method that combines multiple CNN models to enhance classification accuracy.

EXPERT SYSTEMS WITH APPLICATIONS (2023)

Article Computer Science, Information Systems

An ensemble approach to detect copy-move forgery in videos

S. k Mohiuddin, Samir Malakar, Ram Sarkar

Summary: Video forgery has become more common due to the easy availability of tools. This study proposes an ensemble based method to detect duplicate frames in a video. By extracting different types of features and applying lexicographical sorting, the method achieves high detection accuracy and outperforms state-of-the-art methods.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Computer Science, Information Systems

A comprehensive survey on state-of-the-art video forgery detection techniques

Sk Mohiuddin, Samir Malakar, Munish Kumar, Ram Sarkar

Summary: Video plays a critical role in conveying authenticity in various fields such as surveillance, medicine, journalism, and social media. However, the trust in videos is diminishing due to the ease of video forgery using accessible editing tools. This article comprehensively discusses the initiatives and recent trends in video forgery detection research worldwide.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Computer Science, Information Systems

JUVDsi v1: developing and benchmarking a new still image database in Indian scenario for automatic vehicle detection

Avirup Bhattacharyya, Avigyan Bhattacharya, Sourajit Maity, Pawan Kumar Singh, Ram Sarkar

Summary: Designing an automatic vehicle detection system that caters to the requirements of the traffic management system is important. This research develops a still image database, JUVDsi v1, for designing an automated traffic management system in India. The database addresses the shortcomings of existing databases and is evaluated using state-of-the-art deep learning architectures.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Computer Science, Interdisciplinary Applications

Discrete equilibrium optimizer combined with simulated annealing for feature selection

Ritam Guha, Kushal Kanti Ghosh, Suman Kumar Bera, Ram Sarkar, Seyedali Mirjalili

Summary: This paper proposes a binary adaptation of Equilibrium Optimizer (EO) called Discrete EO (DEO) for solving binary optimization problems. DEOSA algorithm, combining DEO with Simulated Annealing (SA) as a local search procedure, is applied to various datasets and outperforms other algorithms. The scalability and robustness of DEOSA are also tested on high-dimensional Microarray datasets and Knapsack problems, showing its superiority.

JOURNAL OF COMPUTATIONAL SCIENCE (2023)

暂无数据