☆ 4.6 Article

A Hybrid Swarm and Gravitation-based feature selection algorithm for handwritten Indic script classification problem

COMPLEX & INTELLIGENT SYSTEMS (2021)

期刊

COMPLEX & INTELLIGENT SYSTEMS

卷 7, 期 2, 页码 823-839

出版社

SPRINGER HEIDELBERG

DOI: 10.1007/s40747-020-00237-1

关键词

Feature selection; Hybrid Swarm and Gravitation-based Feature Selection; Particle swarm optimization; Gravitational search algorithm; Handwritten script classification; Indic script

类别

Computer Science, Artificial Intelligence

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

In this study, a new feature selection algorithm, HSGFS, is introduced to reduce dimensionality and improve accuracy of handwritten script classification. Experimental results demonstrate an average improvement in classification accuracy of 2-5% when using 75-80% of the original feature vectors. The proposed method also outperforms some popular FS models in terms of performance.

In any multi-script environment, handwritten script classification is an unavoidable pre-requisite before the document images are fed to their respective Optical Character Recognition (OCR) engines. Over the years, this complex pattern classification problem has been solved by researchers proposing various feature vectors mostly having large dimensions, thereby increasing the computation complexity of the whole classification model. Feature Selection (FS) can serve as an intermediate step to reduce the size of the feature vectors by restricting them only to the essential and relevant features. In the present work, we have addressed this issue by introducing a new FS algorithm, called Hybrid Swarm and Gravitation-based FS (HSGFS). This algorithm has been applied over three feature vectors introduced in the literature recently-Distance-Hough Transform (DHT), Histogram of Oriented Gradients (HOG), and Modified log-Gabor (MLG) filter Transform. Three state-of-the-art classifiers, namely, Multi-Layer Perceptron (MLP), K-Nearest Neighbour (KNN), and Support Vector Machine (SVM), are used to evaluate the optimal subset of features generated by the proposed FS model. Handwritten datasets at block, text line, and word level, consisting of officially recognized 12 Indic scripts, are prepared for experimentation. An average improvement in the range of 2-5% is achieved in the classification accuracy by utilizing only about 75-80% of the original feature vectors on all three datasets. The proposed method also shows better performance when compared to some popularly used FS models. The codes used for implementing HSGFS can be found in the following Github link: https://github.com/Ritam-Guha/HSGFS.

作者

我是这篇论文的作者

点击您的名字以认领此论文并将其添加到您的个人资料中。

主要评分

4.6

评分不足

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

Improved binary particle swarm optimization for feature selection with new initialization and search space reduction strategies

An-Da Li, Bing Xue, Mengjie Zhang

Summary: This paper proposes an improved sticky binary PSO algorithm for feature selection problems, which aims to enhance evolutionary performance through new mechanisms such as an initialization strategy, dynamic bits masking, and genetic operations. Experimental results show that ISBPSO achieves higher accuracy with fewer features and reduces computation time compared to benchmark PSO-based FS methods.

APPLIED SOFT COMPUTING (2021)