4.6 Article

Sparse nonnegative matrix factorization with l0-constraints

期刊

NEUROCOMPUTING
卷 80, 期 -, 页码 38-46

出版社

ELSEVIER SCIENCE BV
DOI: 10.1016/j.neucom.2011.09.024

关键词

NMF; Sparse coding; Nonnegative least squares

资金

  1. Austrian Science Fund [P22488-N23]
  2. Austrian Science Fund (FWF) [P22488] Funding Source: Austrian Science Fund (FWF)

向作者/读者索取更多资源

Although nonnegative matrix factorization (NMF) favors a sparse and part-based representation of nonnegative data, there is no guarantee for this behavior. Several authors proposed NMF methods which enforce sparseness by constraining or penalizing the l(1)-norm of the factor matrices. On the other hand, little work has been done using a more natural sparseness measure, the l(0)-pseudo-norm. In this paper, we propose a framework for approximate NMF which constrains the l(0)-norm of the basis matrix, or the coefficient matrix, respectively. For this purpose, techniques for unconstrained NMF can be easily incorporated, such as multiplicative update rules, or the alternating nonnegative least-squares scheme. In experiments we demonstrate the benefits of our methods, which compare to, or outperform existing approaches. (C) 2011 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Multidisciplinary Sciences

Novel AI driven approach to classify infant motor functions

Simon Reich, Dajie Zhang, Tomas Kulvicius, Sven Boelte, Karin Nielsen-Saines, Florian B. Pokorny, Robert Peharz, Luise Poustka, Florentin Woergoetter, Christa Einspieler, Peter B. Marschik

Summary: This study introduces a novel machine learning algorithm to detect age-specific movement patterns in infants. By analyzing skeletal information from video data, the algorithm is able to accurately discriminate fidgety movements with a classification accuracy of 88%, potentially becoming a universally accessible tool in clinical practice.

SCIENTIFIC REPORTS (2021)

Article Computer Science, Artificial Intelligence

Conditional sum-product networks: Modular probabilistic circuits via gate functions

Xiaoting Shao, Alejandro Molina, Antonio Vergari, Karl Stelzner, Robert Peharz, Thomas Liebig, Kristian Kersting

Summary: This paper introduces a new type of conditional SPNs model, which can be used as tractable building blocks of deep probabilistic models and shows competitive performance in various tasks. The research demonstrates that CSPNs can outperform other probabilistic models and have the potential to improve performance in different tasks.

INTERNATIONAL JOURNAL OF APPROXIMATE REASONING (2022)

Article Acoustics

Blind Speech Separation and Dereverberation using neural beamforming

Lukas Pfeifenberger, Franz Pernkopf

Summary: The paper introduces the BSSD network, which achieves speaker separation, dereverberation, and speaker identification simultaneously. Various techniques like predefined spatial cues, neural beamforming, embedding vectors, and triplet mining are utilized for these tasks. The system is evaluated based on SI-SDR, WER, and EER metrics.

SPEECH COMMUNICATION (2022)

Article Engineering, Biomedical

Lung Sound Classification Using Co-Tuning and Stochastic Normalization

Truc Nguyen, Franz Pernkopf

Summary: Computational methods for lung sound analysis are used for computer-aided diagnosis support, storage, and monitoring in critical care. This paper proposes a method that uses pre-trained ResNet models for the classification of adventitious lung sounds and respiratory diseases. The method incorporates techniques such as fine-tuning, co-tuning, stochastic normalization, and data augmentation to improve performance. The experimental results show that the proposed systems outperform state-of-the-art lung sound classification systems.

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING (2022)

Article Physics, Nuclear

Explainable machine learning for breakdown prediction in high gradient rf cavities

Christoph Obermair, Thomas Cartier-Michaud, Andrea Apollonio, William Millar, Lukas Felsberger, Lorenz Fischl, Holger Severin Bovbjerg, Daniel Wollmann, Walter Wuensch, Nuria Catalan-Lasheras, Marca Boronat, Franz Pernkopf, Graeme Burt

Summary: This paper explores the occurrence of vacuum arcs or radio frequency breakdowns in particle accelerators, which limit the high-gradient performance of normal conducting rf cavities. Using a machine learning strategy and explainable artificial intelligence, the authors reverse-engineer physical properties to derive fast and reliable rule-based models. The results show the correlation between the field emitted current following an initial breakdown and the probability of another breakdown occurring shortly thereafter.

PHYSICAL REVIEW ACCELERATORS AND BEAMS (2022)

Article Computer Science, Artificial Intelligence

Self-Guided Belief Propagation - A Homotopy Continuation Method

Christian Knoll, Adrian Weller, Franz Pernkopf

Summary: Self-guided belief propagation (SBP) is an enhanced version of belief propagation that gradually incorporates pairwise potentials, increasing accuracy without increasing computational burden. SBP finds the global optimum of the Bethe approximation for attractive models and obtains a unique, stable, and accurate solution when BP does not converge.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Proceedings Paper Acoustics

Homophone Disambiguation Profits from Durational Information

Barbara Schuppler, Emil Berger, Xenia Kogler, Franz Pernkopf

Summary: The high degree of segmental reduction in conversational speech leads to a large number of words becoming homophones, which poses a challenge for automatic speech recognition. This study proposes two approaches, one based on prosodic and spectral features and the other based on a convolutional neural network, to disambiguate homophones. The results show potential for both approaches, especially when combined with a stochastic language model as part of an ASR system.

INTERSPEECH 2022 (2022)

Proceedings Paper Engineering, Electrical & Electronic

Fast and resource-efficient CNNs for Radar Interference Mitigation on Embedded Hardware

Michael Hirschmugl, Johanna Rock, Paul Meissner, Franz Pernkopf

Summary: In this paper, the authors propose a scalable and resource-efficient accelerator for interference mitigation in automotive radar. They compare different implementations and conclude that hardware acceleration significantly improves speed without much increase in power consumption, and the use of integer arithmetic does not compromise performance. The parallelization of layers in programmable logic further reduces latency.

2022 19TH EUROPEAN RADAR CONFERENCE (EURAD) (2022)

Proceedings Paper Acoustics

END-TO-END KEYWORD SPOTTING USING NEURAL ARCHITECTURE SEARCH AND QUANTIZATION

David Peter, Wolfgang Roth, Franz Pernkopf

Summary: This paper introduces a neural architecture search (NAS) approach for automatically discovering end-to-end keyword spotting (KWS) models in limited resource environments. The authors optimize the structure of convolutional neural networks (CNNs) operating on raw audio waveforms using a differentiable NAS method. After finding a suitable KWS model with NAS, weight and activation quantization is performed to reduce memory usage. Extensive experiments are conducted on the Google speech commands dataset, comparing the end-to-end models to mel-frequency cepstral coefficient (MFCC) based CNNs. Using only NAS, a highly efficient model with an accuracy of 95.55% is obtained, using 75.7k parameters and 13.6M operations. By using trained bit-width quantization, the same model achieves a test accuracy of 93.76% while utilizing on average only 2.91 bits per activation and 2.51 bits per weight.

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) (2022)

Proceedings Paper Audiology & Speech-Language Pathology

Acoustic Echo Cancellation with Cross-Domain Learning

Lukas Pfeifenberger, Matthias Zoehrer, Franz Pernkopf

Summary: The paper introduces the CDEC system for the Interspeech 2021 AEC-Challenge, which includes TDC module, frequency-domain AEC, and time-domain neural network, achieving an overall MOS score of 3.80.

INTERSPEECH 2021 (2021)

Proceedings Paper Audiology & Speech-Language Pathology

Autonomous Robot for Measuring Room Impulse Responses

Stefan Fragner, Tobias Topar, Maximilian Giller, Lukas Pfeifenberger, Franz Pernkopf

Summary: This paper introduces an autonomous robot for recording a database of Room Impulse Responses (RIRs) at a high spatial resolution, which can be used to create realistic simulation environments. These RIRs can be exploited to generate multi-channel speech mixtures of static or moving speakers for various applications.

INTERSPEECH 2021 (2021)

Proceedings Paper Engineering, Biomedical

Crackle Detection In Lung Sounds Using Transfer Learning And Multi-Input Convolutional Neural Networks

Truc Nguyen, Franz Pernkopf

Summary: This study utilizes transfer learning to address the issue of mismatched recording setups in small non-public data, transferring knowledge from one dataset to another for crackle detection in lung sounds and achieving significant performance improvements.

2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

PS3: Partition-Based Skew-Specialized Sampling for Batch Mode Active Learning in Imbalanced Text Data

Ricky Maulana Fajri, Samaneh Khoshrou, Robert Peharz, Mykola Pechenizkiy

Summary: As social media plays a fixed role in our daily life, the issue of hostile contents and hate-speech is exacerbated, necessitating automatic hate-speech detection. A novel partition-based batch mode active learning framework is proposed to address the challenges of high class-skew, demonstrating substantial improvements in detection performance through extensive experiments.

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2020, PT V (2021)

Proceedings Paper Engineering, Electrical & Electronic

Complex-valued Convolutional Neural Networks for Enhanced Radar Signal Denoising and Interference Mitigation

Alexander Fuchs, Johanna Rock, Mate Toth, Paul Meissner, Franz Pernkopf

Summary: Autonomous driving relies heavily on sensors to perceive the environment and deliver information to control systems. Radar sensors play a vital role in providing high resolution range and velocity measurements. However, the increased use of radar sensors in road traffic leads to mutual interference, which can be mitigated using Complex-Valued Convolutional Neural Networks (CVCNNs) to improve data efficiency and phase information conservation.

2021 IEEE RADAR CONFERENCE (RADARCONF21): RADAR ON THE MOVE (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Resource-efficient DNNs for Keyword Spotting using Neural Architecture Search and Quantization

David Peter, Wolfgang Roth, Franz Pernkopf

Summary: This paper introduces the use of neural architecture search (NAS) to automatically discover small models for keyword spotting (KWS) in limited resource environments. By employing a differentiable NAS approach, the study achieves a highly efficient model with 95.4% accuracy on the Google speech commands dataset, with memory usage of 494.8 kB and 19.6 million operations. Additionally, weight quantization is utilized to reduce memory consumption further and improve model performance.

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) (2021)

Article Computer Science, Artificial Intelligence

3D-KCPNet: Efficient 3DCNNs based on tensor mapping theory

Rui Lv, Dingheng Wang, Jiangbin Zheng, Zhao-Xu Yang

Summary: In this paper, the authors investigate tensor decomposition for neural network compression. They analyze the convergence and precision of tensor mapping theory, validate the rationality of tensor mapping and its superiority over traditional tensor approximation based on the Lottery Ticket Hypothesis. They propose an efficient method called 3D-KCPNet to compress 3D convolutional neural networks using the Kronecker canonical polyadic (KCP) tensor decomposition. Experimental results show that 3D-KCPNet achieves higher accuracy compared to the original baseline model and the corresponding tensor approximation model.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Personalized robotic control via constrained multi-objective reinforcement learning

Xiangkun He, Zhongxu Hu, Haohan Yang, Chen Lv

Summary: In this paper, a novel constrained multi-objective reinforcement learning algorithm is proposed for personalized end-to-end robotic control with continuous actions. The approach trains a single model using constraint design and a comprehensive index to achieve optimal policies based on user-specified preferences.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Overlapping community detection using expansion with contraction

Zhijian Zhuo, Bilian Chen, Shenbao Yu, Langcai Cao

Summary: In this paper, a novel method called Expansion with Contraction Method for Overlapping Community Detection (ECOCD) is proposed, which utilizes non-negative matrix factorization to obtain disjoint communities and applies expansion and contraction processes to adjust the degree of overlap. ECOCD is applicable to various networks with different properties and achieves high-quality overlapping community detection.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

High-compressed deepfake video detection with contrastive spatiotemporal distillation

Yizhe Zhu, Chunhui Zhang, Jialin Gao, Xin Sun, Zihan Rui, Xi Zhou

Summary: In this work, the authors propose a Contrastive Spatio-Temporal Distilling (CSTD) approach to improve the detection of high-compressed deepfake videos. The approach leverages spatial-frequency cues and temporal-contrastive alignment to fully exploit spatiotemporal inconsistency information.

NEUROCOMPUTING (2024)

Review Computer Science, Artificial Intelligence

A review of coverless steganography

Laijin Meng, Xinghao Jiang, Tanfeng Sun

Summary: This paper provides a review of coverless steganographic algorithms, including the development process, known contributions, and general issues in image and video algorithms. It also discusses the security of coverless steganography from theoretical analysis to actual investigation for the first time.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Confidence-based interactable neural-symbolic visual question answering

Yajie Bao, Tianwei Xing, Xun Chen

Summary: Visual question answering requires processing multi-modal information and effective reasoning. Neural-symbolic learning is a promising method, but current approaches lack uncertainty handling and can only provide a single answer. To address this, we propose a confidence based neural-symbolic approach that evaluates NN inferences and conducts reasoning based on confidence.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

A framework-based transformer and knowledge distillation for interior style classification

Anh H. Vo, Bao T. Nguyen

Summary: Interior style classification is an interesting problem with potential applications in both commercial and academic domains. This project proposes a method named ISC-DeIT, which combines data-efficient image transformer architectures and knowledge distillation, to address the interior style classification problem. Experimental results demonstrate a significant improvement in predictive accuracy compared to other state-of-the-art methods.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Improving robustness for vision transformer with a simple dynamic scanning augmentation

Shashank Kotyan, Danilo Vasconcellos Vargas

Summary: This article introduces a novel augmentation technique called Dynamic Scanning Augmentation to improve the accuracy and robustness of Vision Transformer (ViT). The technique leverages dynamic input sequences to adaptively focus on different patches, resulting in significant changes in ViT's attention mechanism. Experimental results demonstrate that Dynamic Scanning Augmentation outperforms ViT in terms of both robustness to adversarial attacks and accuracy against natural images.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Introducing shape priors in Siamese networks for image classification

Hiba Alqasir, Damien Muselet, Christophe Ducottet

Summary: The article proposes a solution to improve the learning process of a classification network by providing shape priors, reducing the need for annotated data. The solution is tested on cross-domain digit classification tasks and a video surveillance application.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Neural dynamics solver for time-dependent infinity-norm optimization based on ACP framework with robot application

Dexiu Ma, Mei Liu, Mingsheng Shang

Summary: This paper proposes a method using neural dynamics solvers to solve infinity-norm optimization problems. Two improved solvers are constructed and their effectiveness and superiority are demonstrated through theoretical analysis and simulation experiments.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

cpp-AIF: A multi-core C plus plus implementation of Active Inference for Partially Observable Markov Decision Processes

Francesco Gregoretti, Giovanni Pezzulo, Domenico Maisto

Summary: Active Inference is a computational framework that uses probabilistic inference and variational free energy minimization to describe perception, planning, and action. cpp-AIF is a header-only C++ library that provides a powerful tool for implementing Active Inference for Partially Observable Markov Decision Processes through multi-core computing. It is cross-platform and improves performance, memory management, and usability compared to existing software.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Predicting stock market trends with self-supervised learning

Zelin Ying, Dawei Cheng, Cen Chen, Xiang Li, Peng Zhu, Yifeng Luo, Yuqi Liang

Summary: This paper proposes a novel stock market trends prediction framework called SMART, which includes a self-supervised stock technical data sequence embedding model S3E. By training with multiple self-supervised auxiliary tasks, the model encodes stock technical data sequences into embeddings and uses the learned sequence embeddings for predicting stock market trends. Extensive experiments on China A-Shares market and NASDAQ market prove the high effectiveness of our model in stock market trends prediction, and its effectiveness is further validated in real-world applications in a leading financial service provider in China.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

DHGAT: Hyperbolic representation learning on dynamic graphs via attention networks

Hao Li, Hao Jiang, Dongsheng Ye, Qiang Wang, Liang Du, Yuanyuan Zeng, Liu Yuan, Yingxue Wang, C. Chen

Summary: DHGAT1, a dynamic hyperbolic graph attention network, utilizes hyperbolic metric properties to embed dynamic graphs. It employs a spatiotemporal self-attention mechanism and weighted node representations, resulting in excellent performance in link prediction tasks.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Progressive network based on detail scaling and texture extraction: A more general framework for image deraining

Jiehui Huang, Zhenchao Tang, Xuedong He, Jun Zhou, Defeng Zhou, Calvin Yu-Chian Chen

Summary: This study proposes a progressive learning multi-scale feature blending model for image deraining tasks. The model utilizes detail dilation and texture extraction to improve the restoration of rainy images. Experimental results show that the model achieves near state-of-the-art performance in rain removal tasks and exhibits better rain removal realism.

NEUROCOMPUTING (2024)

Article Computer Science, Artificial Intelligence

Stabilization and synchronization control for discrete-time complex networks via the auxiliary role of edges subsystem

Lizhi Liu, Zilin Gao, Yinhe Wang, Yongfu Li

Summary: This paper proposes a novel discrete-time interconnected model for depicting complex dynamical networks. The model consists of nodes and edges subsystems, which consider the dynamic characteristic of both nodes and edges. By designing control strategies and coupling modes, the stabilization and synchronization of the network are achieved. Simulation results demonstrate the effectiveness of the proposed methods.

NEUROCOMPUTING (2024)