4.6 Article

Music genre classification using LBP textural features

Journal

SIGNAL PROCESSING
Volume 92, Issue 11, Pages 2723-2737

Publisher

ELSEVIER
DOI: 10.1016/j.sigpro.2012.04.023

Keywords

Music genre; Texture; Image processing; Pattern recognition

Funding

  1. The National Council for Scientific and Technological Development (CNPq) [301653/2011-9]
  2. CAPES [BEX 5779/11-1, 223/09-FCT595-2009]
  3. Araucaria Foundation [16767-424/2009]
  4. European Commission
  5. FP7 (Seventh Framework Programme)
  6. ICT-2011.1.5 Networked Media and Search Systems [287711]
  7. European Regional Development Fund through the Programme COMPETE
  8. National Funds through the Portuguese Foundation for Science and Technology [PTDC/EAT-MMU/112255/2009, PTDC/EIA-CCO/111050/2009]
  9. Fundação para a Ciência e a Tecnologia [PTDC/EAT-MMU/112255/2009, PTDC/EIA-CCO/111050/2009] Funding Source: FCT

In this paper we present an approach to music genre classification that converts an audio signal into spectrograms and extracts texture features from these time-frequency images, which are then used to model music genres in a classification system. The texture features are based on the Local Binary Pattern (LBP), a structural texture operator that has been successful in recent image classification research. Experiments are performed with two well-known datasets: the Latin Music Database (LMD) and the ISMIR 2004 dataset. The proposed approach considers several zoning mechanisms to perform local feature extraction, and results obtained with and without local feature extraction are compared. We compare the performance of texture features with that of commonly used audio content-based features (i.e., from the MARSYAS framework) and show that texture features always outperform the audio content-based features. We also compare our results with results from the literature. On the LMD, the performance of our approach reaches about 82.33%, above the best result obtained in the MIREX 2010 competition on that dataset. On the ISMIR 2004 database, the best result obtained is about 80.65%, i.e., below the best result on that dataset found in the literature. (c) 2012 Elsevier B.V. All rights reserved.
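
The pipeline described in the abstract (audio signal, then spectrogram, then zoned LBP texture features) can be illustrated with a minimal sketch. This is not the authors' implementation: the frame length, hop size, number of zones, the basic 3x3 LBP variant with a 256-bin histogram, and all function names below are assumptions chosen for illustration, and the synthetic tone stands in for a real music clip.

```python
import numpy as np


def spectrogram(signal, frame_len=256, hop=128):
    """Magnitude spectrogram in dB, shape (frequency bins, time frames)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    mag = np.abs(np.fft.rfft(frames, axis=1)).T
    return 20.0 * np.log10(mag + 1e-10)


def lbp_codes(image):
    """Basic 3x3 LBP: one 8-bit code per interior pixel.

    Each of the 8 neighbours contributes a 1 bit when it is >= the centre.
    """
    h, w = image.shape
    center = image[1:-1, 1:-1]
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    codes = np.zeros(center.shape, dtype=np.int32)
    for bit, (dy, dx) in enumerate(offsets):
        neighbour = image[1 + dy : h - 1 + dy, 1 + dx : w - 1 + dx]
        codes += (neighbour >= center).astype(np.int32) << bit
    return codes


def zoned_lbp_features(image, n_zones=3):
    """Split the image into frequency zones (assumed zoning scheme) and
    concatenate one normalised 256-bin LBP histogram per zone."""
    features = []
    for zone in np.array_split(image, n_zones, axis=0):
        hist = np.bincount(lbp_codes(zone).ravel(), minlength=256).astype(float)
        features.append(hist / hist.sum())
    return np.concatenate(features)


# Synthetic one-second clip (assumed 22050 Hz sampling rate): a noisy 440 Hz tone.
rng = np.random.default_rng(0)
t = np.arange(22050) / 22050.0
clip = np.sin(2 * np.pi * 440 * t) + 0.1 * rng.standard_normal(t.size)

features = zoned_lbp_features(spectrogram(clip), n_zones=3)
print(features.shape)  # (768,) = 3 zones x 256 bins
```

The resulting vector would then be passed to a classifier of choice. Zoning keeps the texture statistics of different frequency bands separate instead of pooling them into a single histogram, which is the intuition behind local feature extraction.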

Recommended

Article Computer Science, Information Systems

Incremental and decremental fuzzy bounded twin support vector machine

Alexandre R. Mello, Marcelo R. Stemmer, Alessandro L. Koerich

INFORMATION SCIENCES (2020)

Article Computer Science, Artificial Intelligence

A database for automatic classification of gender in Araucaria angustifolia plants

Jefferson G. Martins, Luiz E. S. Oliveira, Daniel Weingaertner, Andersson Barison, Gerlon A. R. Oliveira, Luciano M. Liao

Summary: Forests are being exploited in a disorderly manner and many species are endangered, prompting the need for a spatial distribution plan. Researchers facing a lack of representative databases can benefit from new databases and from selection strategies that improve outcomes.

SOFT COMPUTING (2021)

Review Computer Science, Information Systems

Machine Learning Methods for Histopathological Image Analysis: A Review

Jonathan de Matos, Steve Tsham Mpinda Ataky, Alceu de Souza Britto, Luiz Eduardo Soares de Oliveira, Alessandro Lameiras Koerich

Summary: This paper reviews machine learning methods for histopathological image analysis, including shallow and deep learning methods, covering common tasks and datasets used in HI research.

ELECTRONICS (2021)

Article Computer Science, Artificial Intelligence

Tensor analysis with n-mode generalized difference subspace

Bernardo B. Gatto, Eulanda M. dos Santos, Alessandro L. Koerich, Kazuhiro Fukui, Waldir S. S. Junior

Summary: This paper introduces a new method for multi-dimensional data classification that uses tensor representation and the subspace concept to enhance classification accuracy. It applies the generalized difference subspace (GDS) and n-mode GDS for dimensionality reduction and discriminative feature extraction, and introduces an n-mode Fisher score and an improved metric based on geodesic distance to improve tensor data classification performance.

EXPERT SYSTEMS WITH APPLICATIONS (2021)

Article Engineering, Marine

On the Importance of Passive Acoustic Monitoring Filters

Rafael Aguiar, Gianluca Maguolo, Loris Nanni, Yandre Costa, Carlos Silla

Summary: Passive acoustic monitoring (PAM) is a noninvasive technique for wildlife surveillance, where machine learning is useful for identifying species based on audio recordings. The experimental protocols using PAM filters were not intended to improve accuracy rates, but rather to provide more reliable results in the classification system.

JOURNAL OF MARINE SCIENCE AND ENGINEERING (2021)

Article Computer Science, Artificial Intelligence

Two-view fine-grained classification of plant species

Voncarlos M. Araujo, Alceu S. Britto Jr, Luiz S. Oliveira, Alessandro L. Koerich

Summary: This study proposed a novel method based on a two-view leaf image representation and a hierarchical classification strategy for fine-grained plant species recognition, achieving effective results in identifying plant genus and species by using botanical taxonomy as a basis.

NEUROCOMPUTING (2022)

Article Chemistry, Analytical

Impact of Lung Segmentation on the Diagnosis and Explanation of COVID-19 in Chest X-ray Images

Lucas O. Teixeira, Rodolfo M. Pereira, Diego Bertolini, Luiz S. Oliveira, Loris Nanni, George D. C. Cavalcanti, Yandre M. G. Costa

Summary: The study demonstrated the impact of lung segmentation in COVID-19 identification using CXR images, achieving good Jaccard distance and Dice coefficient for segmentation. It investigated the generalization of COVID-19 from images created from different sources, finding a strong bias introduced by underlying factors from different sources even after segmentation.

SENSORS (2021)

Article Chemistry, Multidisciplinary

Continuous Emotion Recognition with Spatiotemporal Convolutional Neural Networks

Thomas Teixeira, Eric Granger, Alessandro Lameiras Koerich

Summary: This paper investigates the use of deep learning architectures for continuous emotion recognition, extending 2D CNN models to learn spatiotemporal information from videos. Experimental results on the SEWA-DB dataset show that these architectures can effectively encode spatiotemporal information and achieve state-of-the-art results.

APPLIED SCIENCES-BASEL (2021)

Article Computer Science, Artificial Intelligence

A novel bio-inspired texture descriptor based on biodiversity and taxonomic measures

Steve Tsham Mpinda Ataky, Alessandro Lameiras Koerich

Summary: This paper proposes a novel approach to quantifying complex systems of diverse patterns in texture, using species diversity, richness, and taxonomic distinctiveness. The method takes advantage of ecological patterns' invariance to build a permutation, rotation, and translation invariant descriptor. Experimental results show the advantages of this method.

PATTERN RECOGNITION (2022)

Article Computer Science, Artificial Intelligence

Cancer Identification in Walker 256 Tumor Model Exploring Texture Properties Taken from Microphotograph of Rats Liver

Mateus F. T. Carvalho, Sergio A. Silva Jr, Carla Cristina O. Bernardo, Franklin Cesar Flores, Juliana Vanessa C. M. Perles, Jacqueline Nelisis Zanoni, Yandre M. G. Costa

Summary: This study investigates automatic detection of cancer in laboratory animals using preclinical microphotograph images of liver tissue. Two different texture descriptors were explored to capture texture properties and their complementarity was evaluated. Results showed that both descriptors performed well in this scenario.

ALGORITHMS (2022)

Article Computer Science, Artificial Intelligence

A human-in-the-loop recommendation-based framework for reconstruction of mechanically shredded documents

Thiago M. Paixao, Rodrigo F. Berriel, Maria C. S. Boeres, Alessandro L. Koerich, Claudine Badue, Alberto F. De Souza, Thiago Oliveira-Santos

Summary: Advances in machine learning, especially deep learning, have improved the accuracy of automatically reconstructing shredded documents, but fully automatic reconstruction still leaves room for improvement. The paper proposes a human-in-the-loop reconstruction framework that lets users verify the adjacency of shreds in the solution. Human involvement can reduce errors by over 40%.

PATTERN RECOGNITION LETTERS (2022)

Article Computer Science, Theory & Methods

Multidiscriminator Sobolev Defense-GAN Against Adversarial Attacks for End-to-End Speech Systems

Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

Summary: This paper introduces a defense approach against adversarial attacks on speech-to-text systems. The proposed algorithm utilizes short-time Fourier transform, spectrogram subspace projection, and a novel GAN architecture trained with Sobolev integral probability metric. Experimental results demonstrate that it outperforms other defense algorithms in terms of accuracy and signal quality.

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY (2022)

Proceedings Paper Acoustics

CLASS-CONDITIONAL DEFENSE GAN AGAINST END-TO-END SPEECH ATTACKS

Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

Summary: This paper introduces a novel defense approach against end-to-end adversarial attacks by finding the optimal input vector through minimizing the relative chordal distance adjustment and reconstructing the signal. Experimental results show that this approach significantly outperforms conventional defense algorithms.

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) (2021)

Article Engineering, Electrical & Electronic

Cyclic Defense GAN Against Speech Adversarial Attacks

Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

Summary: This letter introduces a new defense approach utilizing a cyclic generative adversarial network to reconstruct signals for countering state-of-the-art white and black-box adversarial attack algorithms. Experimental results show the effectiveness of this defense method in various adversarial attack scenarios.

IEEE SIGNAL PROCESSING LETTERS (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Continuous Emotion Recognition via Deep Convolutional Autoencoder and Support Vector Regressor

Sevegni Odilon Clement Allognon, Alceu de S. Britto Jr, Alessandro L. Koerich

2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) (2020)

Article Engineering, Electrical & Electronic

Robust registration and learning using multi-radii spherical polar Fourier transform

Alam Abbas Syed, Hassan Foroosh

Summary: This paper presents effective methods using spherical polar Fourier transform data for two different applications: 3D volumetric registration and machine learning classification network. The proposed method for registration offers unique and effective techniques, handling arbitrary large rotation angles and showing robustness. The modified classification network achieves robust classification results in processing spherical data.

SIGNAL PROCESSING (2024)

Article Engineering, Electrical & Electronic

TVRPCA+: Low-rank and sparse decomposition based on spectral norm and structural sparsity-inducing norm

Ruibo Fan, Mingli Jing, Jingang Shi, Lan Li, Zizhao Wang

Summary: In this study, a new low-rank sparse decomposition algorithm named TVRPCA+ is proposed for foreground-background separation. The algorithm combines spectral norm, structured sparse norm, and total variation regularization to suppress noise and obtain cleaner foregrounds. Experimental results demonstrate that TVRPCA+ achieves high performance in complex backgrounds and noise scenarios.

SIGNAL PROCESSING (2024)

Article Engineering, Electrical & Electronic

LFM signal parameter estimation in the fractional Fourier domains: Analytical models and a high-performance algorithm

Omair Aldimashki, Ahmet Serbes

Summary: This paper proposes a coarse-to-fine FrFT-based algorithm for chirp-rate estimation of multi-component LFM signals, which achieves improved performance and a reduced signal-to-noise breakdown threshold by utilizing mathematical models for coarse estimation and a refined estimate-and-subtract strategy. Extensive simulation results demonstrate that the proposed algorithm performs very close to the Cramer-Rao lower bound, with the advantages of eliminating leakage effect, avoiding error propagation, and maintaining acceptable computational cost compared to other state-of-the-art methods.

SIGNAL PROCESSING (2024)

Article Engineering, Electrical & Electronic

Multiple sources localization with 2D-DFT under distributed massive antenna arrays

Xinlei Shi, Xiaofei Zhang, Yuxin Sun, Yang Qian, Jinke Cao

Summary: In this paper, a low-complexity localization approach for multiple sources using two-dimensional discrete Fourier transform (2D-DFT) is proposed. The method computes the cross-covariance and utilizes phase offset method and total least square solution to obtain accurate position estimates.

SIGNAL PROCESSING (2024)

Article Engineering, Electrical & Electronic

Extended target tracking under multitarget tracking framework for convex polytope shapes

Prabhanjan Mannari, Ratnasingham Tharmarasa, Thiagalingam Kirubarajan

Summary: This paper discusses the problem of extended target tracking for a single 2D extended target with a known convex polytope shape and dynamics. It proposes a framework based on the existing point multitarget tracking framework to address the challenges of uncertainty in shape and kinematics, as well as self-occlusion. The algorithm developed using this framework is capable of dynamically changing the number of parameters used to describe the shape and estimating the whole target shape even when different parts of the target are visible at different frames.

SIGNAL PROCESSING (2024)

Article Engineering, Electrical & Electronic

Robust small infrared target detection using weighted adaptive ring top-hat transformation

Yongsong Li, Zhengzhou Li, Jie Li, Junchao Yang, Abubakar Siddique

Summary: This paper proposes a weighted adaptive ring top-hat transformation (WARTH) for extracting infrared small targets in complex backgrounds. The WARTH method effectively measures local and global feature information using an adaptive ring-shaped structural element and a target awareness indicator, resulting in accurate detection of small targets with minimized false alarms.

SIGNAL PROCESSING (2024)

Article Engineering, Electrical & Electronic

Variable step-size convex regularized PRLS algorithms

Yu Wang, Zhen Qin, Jun Tao, Yili Xia

Summary: In this paper, an enhanced sparsity-aware recursive least squares (RLS) algorithm is proposed, which combines the proportionate updating (PU) and zero-attracting (ZA) mechanisms, and introduces a general convex regularization (CR) function and variable step-size (VSS) technique to improve performance.

SIGNAL PROCESSING (2024)

Article Engineering, Electrical & Electronic

Analysis of the Least Mean Square algorithm with processing delays in the adaptive arm for Gaussian inputs for system identification

Neil J. Bershad, Jose C. M. Bermudez

Summary: This paper analyzes the impact of processing delay on the Least Mean Squares (LMS) algorithm in system identification, highlighting bias issues in the resulting weight vector.

SIGNAL PROCESSING (2024)

Article Engineering, Electrical & Electronic

A single dwell velocity estimation method for pulse Doppler radar using multicarrier signals

Kanghui Jiang, Defu Jiang, Mingxing Fu, Yan Han, Song Wang, Chao Zhang, Jingyu Shi

Summary: In this paper, a novel method for velocity estimation using multicarrier signals in a single dwell is proposed, which effectively addresses the issue of Doppler ambiguity in pulse Doppler radars.

SIGNAL PROCESSING (2024)

Article Engineering, Electrical & Electronic

Long-time adaptive coherent detection of small targets in sea clutter by fast inversion algorithm of block tridiagonal speckle covariance matrices

Xiao-Jun Zhang, Peng-Lang Shui, Yu-Fan Xue

Summary: This paper proposes a method for low-velocity small target detection in maritime surveillance radars. It models sea clutter sequences using the spherical invariant random vector (SIRV) model with block tridiagonal speckle covariance matrix and inverse Gamma distributed texture. The proposed detector, which is a long-time adaptive generalized likelihood ratio test with linear threshold detector (GLRT-LTD), shows competitive detection performance in experiments.

SIGNAL PROCESSING (2024)

Article Engineering, Electrical & Electronic

Adaptive weighted robust data recovery with total variation for hyperspectral image

Aiyi Zhang, Fulai Liu, Ruiyan Du

Summary: This paper proposes an adaptive weighted robust data recovery method with total variation regularization for hyperspectral images (HSI). The method models the HSI recovery problem as a tensor robust principal component analysis optimization problem, decomposing the data into low-rank HSI data, outliers, and a noise component. An adaptive weighting strategy is then applied to the tensor nuclear norm and the outliers, using prior information about the singular values and strengthening the sparsity of the outliers.

SIGNAL PROCESSING (2024)

Article Engineering, Electrical & Electronic

Model order estimation based on the correntropy of observation eigenvalues

Hamid Asadi, Babak Seyfe

Summary: This paper presents a novel approach for estimating the model order in the presence of observation errors. The proposed method is based on correntropy estimation of eigenvalues in the observation space, which is further enhanced by resampling the observations using the bootstrap method. The algorithm partitions the observation space into signal and noise subspaces using the covariance matrix of mixtures, and determines the model order based on a correntropy estimator with kernel functions. Theoretical analysis and comparative evaluations demonstrate the superiority of this information-theoretic approach.

SIGNAL PROCESSING (2024)

Article Engineering, Electrical & Electronic

A novel family of online censoring based complex-valued least mean kurtosis algorithms

Buket Colak Guvenc, Engin Cemal Menguc

Summary: In this paper, a novel family of online censoring based complex-valued least mean kurtosis (CLMK) algorithms is proposed. The algorithms censor less informative complex-valued data streams and reduce the costs of data processing without affecting accuracy. Robust algorithms are also developed to handle outliers. The simulation results confirm the attractive features of the proposed algorithms in large-scale system identification and regression scenarios.

SIGNAL PROCESSING (2024)

Article Engineering, Electrical & Electronic

Enhancing concealed object detection in Active Millimeter Wave Images using wavelet transform

Yun Su, Weixian Tan, Yifan Dong, Wei Xu, Pingping Huang, Jianxin Zhang, Diankun Zhang

Summary: In this study, a novel method for detecting low-resolution and small targets in millimeter wave radar images is proposed. The Wavelet-Conv structure and Wavelet-Attention mechanism are introduced to overcome the limitations of existing detectors. Experimental results demonstrate that the proposed method improves recall and mean average precision while maintaining competitive inference speed.

SIGNAL PROCESSING (2024)

Article Engineering, Electrical & Electronic

Spectral structure inducing efficient variational model for enhancing bearing fault feature

Xin Wang, Xingxing Jiang, Qiuyu Song, Jie Liu, Jianfeng Guo, Zhongkui Zhu

Summary: This study proposes a variational mode extraction (VME) method for extracting specific modes from complicated signals. By exploring the convergence property of VME, strategies for identifying ICF and determining the balance parameter are designed, and a bandwidth estimation strategy is constructed. The effectiveness of the proposed method for bearing fault diagnosis is verified and compared with other methods.

SIGNAL PROCESSING (2024)