4.7 Article

Filtering segmentation cuts for digit string recognition

Journal

PATTERN RECOGNITION
Volume 41, Issue 10, Pages 3044-3053

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2008.03.019

Keywords

handwriting recognition; segmentation; filtering

Ask authors/readers for more resources

In this paper we propose a method to evaluate segmentation cuts for handwritten touching digits. The idea of this method is to work as a filter in segmentation-based recognition system. This kind of system usually rely on over-segmentation methods, where several segmentation hypotheses are created for each touching group of digits and then assessed by a general-purpose classifier. The novelty of the proposed methodology lies in the fact that unnecessary segmentation cuts can be identified without any attempt of classification by a general-purpose classifier, reducing the number of paths in a segmentation graph, what can consequently lead to a reduction in computational cost. An cost-based approach using ROC (receiver operating characteristics) was deployed to optimize the filter. Experimental results show that the filter can eliminate up to 83% of the unnecessary segmentation hypothesis and increase the overall performance of the system. (C) 2008 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

Unsupervised feature learning for environmental sound classification using Weighted Cycle-Consistent Generative Adversarial Network

Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

APPLIED SOFT COMPUTING (2020)

Review Computer Science, Information Systems

Machine Learning Methods for Histopathological Image Analysis: A Review

Jonathan de Matos, Steve Tsham Mpinda Ataky, Alceu de Souza Britto, Luiz Eduardo Soares de Oliveira, Alessandro Lameiras Koerich

Summary: This paper reviews machine learning methods for histopathological image analysis, including shallow and deep learning methods, covering common tasks and datasets used in HI research.

ELECTRONICS (2021)

Article Computer Science, Artificial Intelligence

Tensor analysis with n-mode generalized difference subspace

Bernardo B. Gatto, Eulanda M. dos Santos, Alessandro L. Koerich, Kazuhiro Fukui, Waldir S. S. Junior

Summary: This paper introduces a new method for multi-dimensional data classification, utilizing tensor representation and subspace concept to enhance classification accuracy. The use of generalized difference subspace (GDS) and n-mode GDS for data dimensionality reduction and discriminative feature extraction, along with the introduction of n-mode Fisher score and an improved metric based on geodesic distance for better tensor data classification performance.

EXPERT SYSTEMS WITH APPLICATIONS (2021)

Article Computer Science, Artificial Intelligence

Two-view fine-grained classification of plant species

Voncarlos M. Araujo, Alceu S. Britto Jr, Luiz S. Oliveira, Alessandro L. Koerich

Summary: This study proposed a novel method based on a two-view leaf image representation and a hierarchical classification strategy for fine-grained plant species recognition, achieving effective results in identifying plant genus and species by using botanical taxonomy as a basis.

NEUROCOMPUTING (2022)

Article Chemistry, Analytical

Impact of Lung Segmentation on the Diagnosis and Explanation of COVID-19 in Chest X-ray Images

Lucas O. Teixeira, Rodolfo M. Pereira, Diego Bertolini, Luiz S. Oliveira, Loris Nanni, George D. C. Cavalcanti, Yandre M. G. Costa

Summary: The study demonstrated the impact of lung segmentation in COVID-19 identification using CXR images, achieving good Jaccard distance and Dice coefficient for segmentation. It investigated the generalization of COVID-19 from images created from different sources, finding a strong bias introduced by underlying factors from different sources even after segmentation.

SENSORS (2021)

Article Chemistry, Multidisciplinary

Continuous Emotion Recognition with Spatiotemporal Convolutional Neural Networks

Thomas Teixeira, Eric Granger, Alessandro Lameiras Koerich

Summary: This paper investigates the use of deep learning architectures for continuous emotion recognition, extending 2D CNN models to learn spatiotemporal information from videos. Experimental results on the SEWA-DB dataset show that these architectures can effectively encode spatiotemporal information and achieve state-of-the-art results.

APPLIED SCIENCES-BASEL (2021)

Article Computer Science, Artificial Intelligence

A novel bio-inspired texture descriptor based on biodiversity and taxonomic measures

Steve Tsham Mpinda Ataky, Alessandro Lameiras Koerich

Summary: This paper proposes a novel approach to quantifying complex systems of diverse patterns in texture, using species diversity, richness, and taxonomic distinctiveness. The method takes advantage of ecological patterns' invariance to build a permutation, rotation, and translation invariant descriptor. Experimental results show the advantages of this method.

PATTERN RECOGNITION (2022)

Review Chemistry, Analytical

COVID-19 Detection on Chest X-ray and CT Scan: A Review of the Top-100 Most Cited Papers

Yandre M. G. Costa, Sergio A. Silva, Lucas O. Teixeira, Rodolfo M. Pereira, Diego Bertolini, Alceu S. Britto Jr, Luiz S. Oliveira, George D. C. Cavalcanti

Summary: This article reviews the top-100 most cited papers in the field of COVID-19 detection from thoracic medical imaging, discussing important aspects such as the type of medical imaging explored, learning settings, segmentation strategy, explainable artificial intelligence (XAI), and the availability of datasets and codes.

SENSORS (2022)

Article Computer Science, Artificial Intelligence

A human-in-the-loop recommendation-based framework for reconstruction of mechanically shre dde d documents

Thiago M. Paixao, Rodrigo F. Berriel, Maria C. S. Boeres, Alessandro L. Koerich, Claudine Badue, Alberto F. De Souza, Thiago Oliveira-Santos

Summary: Advances in machine learning, especially deep learning, have improved the accuracy of automatically reconstructing shredded documents. However, there is still room for improvement in fully automatic reconstruction. To address this issue, we propose a human-in-the-loop reconstruction framework that allows users to verify the adjacency of adjacent shreds in the solution. Introducing human involvement can reduce errors by over 40%.

PATTERN RECOGNITION LETTERS (2022)

Article Computer Science, Theory & Methods

Multidiscriminator Sobolev Defense-GAN Against Adversarial Attacks for End-to-End Speech Systems

Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

Summary: This paper introduces a defense approach against adversarial attacks on speech-to-text systems. The proposed algorithm utilizes short-time Fourier transform, spectrogram subspace projection, and a novel GAN architecture trained with Sobolev integral probability metric. Experimental results demonstrate that it outperforms other defense algorithms in terms of accuracy and signal quality.

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY (2022)

Proceedings Paper Acoustics

CLASS-CONDITIONAL DEFENSE GAN AGAINST END-TO-END SPEECH ATTACKS

Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

Summary: This paper introduces a novel defense approach against end-to-end adversarial attacks by finding the optimal input vector through minimizing the relative chordal distance adjustment and reconstructing the signal. Experimental results show that this approach significantly outperforms conventional defense algorithms.

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) (2021)

Article Engineering, Electrical & Electronic

Cyclic Defense GAN Against Speech Adversarial Attacks

Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

Summary: This letter introduces a new defense approach utilizing a cyclic generative adversarial network to reconstruct signals for countering state-of-the-art white and black-box adversarial attack algorithms. Experimental results show the effectiveness of this defense method in various adversarial attack scenarios.

IEEE SIGNAL PROCESSING LETTERS (2021)

Article Computer Science, Artificial Intelligence

Self-supervised deep reconstruction of mixed strip-shredded text documents

Thiago M. Paixao, Rodrigo F. Berriel, Maria C. S. Boeres, Alessandro L. Koerich, Claudine Badue, Alberto F. De Souza, Thiago Oliveira-Santos

PATTERN RECOGNITION (2020)

Article Computer Science, Theory & Methods

A Robust Approach for Securing Audio Classification Against Adversarial Attacks

Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Koerich

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY (2020)

Article Computer Science, Artificial Intelligence

Exploiting sublimated deep features for image retrieval

Guang-Hai Liu, Zuo-Yong Li, Jing-Yu Yang, David Zhang

Summary: This article introduces a novel image retrieval method that improves retrieval performance by using sublimated deep features. The method incorporates orientation-selective features and color perceptual features, effectively mimicking these mechanisms to provide a more discriminating representation.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Region-adaptive and context-complementary cross modulation for RGB-T semantic segmentation

Fengguang Peng, Zihan Ding, Ziming Chen, Gang Wang, Tianrui Hui, Si Liu, Hang Shi

Summary: RGB-Thermal (RGB-T) semantic segmentation is an emerging task that aims to improve the robustness of segmentation methods under extreme imaging conditions by using thermal infrared modality. The challenges of foreground-background distinguishment and complementary information mining are addressed by proposing a cross modulation process with two collaborative components. Experimental results show that the proposed method achieves state-of-the-art performances on current RGB-T segmentation benchmarks.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

F-SCP: An automatic prompt generation method for specific classes based on visual language pre-training models

Baihong Han, Xiaoyan Jiang, Zhijun Fang, Hamido Fujita, Yongbin Gao

Summary: This paper proposes a novel automatic prompt generation method called F-SCP, which focuses on generating accurate prompts for low-accuracy classes and similar classes. Experimental results show that our approach outperforms state-of-the-art methods on six multi-domain datasets.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Residual Deformable Convolution for better image de-weathering

Huikai Liu, Ao Zhang, Wenqian Zhu, Bin Fu, Bingjian Ding, Shengwu Xiong

Summary: Adverse weather conditions present challenges for computer vision tasks, and image de-weathering is an important component of image restoration. This paper proposes a multi-patch skip-forward structure and a Residual Deformable Convolutional module to improve feature extraction and pixel-wise reconstruction.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

A linear transportation LP distance for pattern recognition

Oliver M. Crook, Mihai Cucuringu, Tim Hurst, Carola-Bibiane Schonlieb, Matthew Thorpe, Konstantinos C. Zygalakis

Summary: The transportation LP distance (TLP) is a generalization of the Wasserstein WP distance that can be applied directly to color or multi-channelled images, as well as multivariate time-series. TLP interprets signals as functions, while WP interprets signals as measures. Although both distances are powerful tools in modeling data with spatial or temporal perturbations, their computational cost can be prohibitively high for moderate pattern recognition tasks. The linear Wasserstein distance offers a method for projecting signals into a Euclidean space, and in this study, we propose linear versions of the TLP distance (LTLP) that show significant improvement over the linear WP distance in signal processing tasks while being several orders of magnitude faster to compute than the TLP distance.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Learning a target-dependent classifier for cross-domain semantic segmentation: Fine-tuning versus meta-learning

Haitao Tian, Shiru Qu, Pierre Payeur

Summary: This paper proposes a method of target-dependent classifier, which optimizes the joint hypothesis of domain adaptation into a target-dependent hypothesis that better fits with the target domain clusters through an unsupervised fine-tuning strategy and the concept of meta-learning. Experimental results demonstrate that this method outperforms existing techniques in synthetic-to-real adaptation and cross-city adaptation benchmarks.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

KGSR: A kernel guided network for real-world blind super-resolution

Qingsen Yan, Axi Niu, Chaoqun Wang, Wei Dong, Marcin Wozniak, Yanning Zhang

Summary: Deep learning-based methods have achieved remarkable results in the field of super-resolution. However, the limitation of paired training image sets has led researchers to explore self-supervised learning. However, the assumption of inaccurate downscaling kernel functions often leads to degraded results. To address this issue, this paper introduces KGSR, a kernel-guided network that trains both upscaling and downscaling networks to generate high-quality high-resolution images even without knowing the actual downscaling process.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Gait feature learning via spatio-temporal two-branch networks

Yifan Chen, Xuelong Li

Summary: Gait recognition is a popular technology for identification due to its ability to capture gait features over long distances without cooperation. However, current methods face challenges as they use a single network to extract both temporal and spatial features. To solve this problem, we propose a two-branch network that focuses on spatial and temporal feature extraction separately. By combining these features, we can effectively learn the spatio-temporal information of gait sequences.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

PAMI: Partition Input and Aggregate Outputs for Model Interpretation

Wei Shi, Wentao Zhang, Wei-shi Zheng, Ruixuan Wang

Summary: This article proposes a simple yet effective visualization framework called PAMI, which does not require detailed model structure and parameters to obtain visualization results. It can be applied to various prediction tasks with different model backbones and input formats.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Disturbance rejection with compensation on features

Xiaobo Hu, Jianbo Su, Jun Zhang

Summary: This paper reviews the latest technologies in pattern recognition, highlighting their instabilities and failures in practical applications. From a control perspective, the significance of disturbance rejection in pattern recognition is discussed, and the existing problems are summarized. Finally, potential solutions related to the application of compensation on features are discussed to emphasize future research directions.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

ECLAD: Extracting Concepts with Local Aggregated Descriptors

Andres Felipe Posada-Moreno, Nikita Surya, Sebastian Trimpe

Summary: Convolutional neural networks are widely used in critical systems, and explainable artificial intelligence has proposed methods for generating high-level explanations. However, these methods lack the ability to determine the location of concepts. To address this, we propose a novel method for automatic concept extraction and localization based on pixel-wise aggregations, and validate it using synthetic datasets.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

Dynamic Graph Contrastive Learning via Maximize Temporal Consistency

Peng Bao, Jianian Li, Rong Yan, Zhongyi Liu

Summary: In this paper, a novel Dynamic Graph Contrastive Learning framework, DyGCL, is proposed to capture the temporal consistency in dynamic graphs and achieve good performance in node representation learning.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

ConvGeN: A convex space learning approach for deep-generative oversampling and imbalanced classification of small tabular datasets

Kristian Schultz, Saptarshi Bej, Waldemar Hahn, Markus Wolfien, Prashant Srivastava, Olaf Wolkenhauer

Summary: Research indicates that deep generative models perform poorly compared to linear interpolation-based methods for synthetic data generation on small, imbalanced tabular datasets. To address this, a new approach called ConvGeN, combining convex space learning with deep generative models, has been proposed. ConvGeN improves imbalanced classification on small datasets while remaining competitive with existing linear interpolation methods.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

H-CapsNet: A capsule network for hierarchical image classification

Khondaker Tasrif Noor, Antonio Robles-Kelly

Summary: In this paper, the authors propose H-CapsNet, a capsule network designed for hierarchical image classification. The network effectively captures hierarchical relationships using dedicated capsules for each class hierarchy. A modified hinge loss is utilized to enforce consistency among the involved hierarchies. Additionally, a strategy for dynamically adjusting training parameters is presented to achieve better balance between the class hierarchies. Experimental results demonstrate that H-CapsNet outperforms competing hierarchical classification networks.

PATTERN RECOGNITION (2024)

Article Computer Science, Artificial Intelligence

CS-net: Conv-simpleformer network for agricultural image segmentation

Lei Liu, Guorun Li, Yuefeng Du, Xiaoyu Li, Xiuheng Wu, Zhi Qiao, Tianyi Wang

Summary: This study proposes a new agricultural image segmentation model called CS-Net, which uses Simple-Attention Block and Simpleformer to improve accuracy and inference speed, and addresses the issue of performance collapse of Transformers in agricultural image processing.

PATTERN RECOGNITION (2024)