Article

Co-Clustering via Information-Theoretic Markov Aggregation

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2018.2846252

Keywords

Co-clustering; information-theoretic cost function; clustering; Markov chains

Funding

  1. German Ministry of Education and Research
  2. Erwin Schrödinger Fellowship of the Austrian Science Fund [J 3765]
  3. Austrian COMET Program - Competence Centers for Excellent Technologies - under the Austrian Federal Ministry of Transport, Innovation and Technology
  4. Austrian Federal Ministry of Economy, Family and Youth
  5. State of Styria

We present an information-theoretic cost function for co-clustering, i.e., for simultaneous clustering of two sets based on similarities between their elements. By constructing a simple random walk on the corresponding bipartite graph, our cost function is derived from a recently proposed generalized framework for information-theoretic Markov chain aggregation. The goal of our cost function is to minimize relevant information loss, hence it connects to the information bottleneck formalism. Moreover, via the connection to Markov aggregation, our cost function is not ad hoc, but inherits its justification from the operational qualities associated with the corresponding Markov aggregation problem. We furthermore show that, for appropriate parameter settings, our cost function is identical to well-known approaches from the literature, such as Information-Theoretic Co-Clustering by Dhillon et al. Hence, understanding the influence of this parameter admits a deeper understanding of the relationship between previously proposed information-theoretic cost functions. We highlight some strengths and weaknesses of the cost function for different parameters. We also illustrate the performance of our cost function, optimized with a simple sequential heuristic, on several synthetic and real-world data sets, including the Newsgroup20 and the MovieLens100k data sets.
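The abstract does not state the generalized cost function itself, so the following is only a minimal sketch of the two ingredients it names: a simple random walk on the bipartite graph, and the special case credited to Dhillon et al., in which the loss is the drop in mutual information I(X;Y) - I(X̂;Ŷ) caused by merging rows and columns into clusters. All function names are hypothetical, and the similarity matrix is assumed to be nonnegative with no all-zero row or column.

```python
import numpy as np


def bipartite_random_walk(W):
    """Transition matrix of a simple random walk on the bipartite graph
    whose n row nodes and m column nodes are linked by weights W (n x m).
    Assumes every row and column of W has a positive sum."""
    W = np.asarray(W, dtype=float)
    n, m = W.shape
    T = np.zeros((n + m, n + m))
    T[:n, n:] = W / W.sum(axis=1, keepdims=True)        # row node -> column node
    T[n:, :n] = W.T / W.sum(axis=0)[:, np.newaxis]      # column node -> row node
    return T


def mutual_information(P):
    """Mutual information (in bits) of a joint distribution P."""
    px = P.sum(axis=1, keepdims=True)
    py = P.sum(axis=0, keepdims=True)
    mask = P > 0                                         # 0 * log 0 is taken as 0
    return float(np.sum(P[mask] * np.log2(P[mask] / (px @ py)[mask])))


def aggregate(P, row_labels, col_labels):
    """Joint distribution of the cluster variables after merging rows
    and columns according to the given integer labels."""
    row_labels = np.asarray(row_labels)
    col_labels = np.asarray(col_labels)
    R = np.eye(row_labels.max() + 1)[row_labels]         # n x k one-hot
    C = np.eye(col_labels.max() + 1)[col_labels]         # m x l one-hot
    return R.T @ P @ C


def coclustering_information_loss(W, row_labels, col_labels):
    """I(X;Y) - I(X_hat;Y_hat): the mutual-information loss of a co-clustering.
    This is the special case only, not the paper's parameterized cost."""
    P = np.asarray(W, dtype=float)
    P = P / P.sum()                                      # similarities -> joint distribution
    return mutual_information(P) - mutual_information(aggregate(P, row_labels, col_labels))


# Toy similarity matrix with two clean blocks.
W = np.array([[5, 5, 0, 0],
              [5, 5, 0, 0],
              [0, 0, 5, 5],
              [0, 0, 5, 5]], dtype=float)

good = coclustering_information_loss(W, [0, 0, 1, 1], [0, 0, 1, 1])
bad = coclustering_information_loss(W, [0, 1, 0, 1], [0, 1, 0, 1])
print(good, bad)
```

On this toy matrix, the co-clustering that matches the two blocks loses no information, while the one that mixes the blocks loses the full 1 bit of I(X;Y). By the data processing inequality the loss is never negative, which is why it can serve as a cost to minimize.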

Recommended

Article Energy & Fuels

Detection of Knocking Combustion Using the Continuous Wavelet Transformation and a Convolutional Neural Network

Achilles Kefalas, Andreas B. Ofner, Gerhard Pirker, Stefan Posch, Bernhard C. Geiger, Andreas Wimmer

Summary: The study proposes an efficient method for detecting knocking combustion in internal combustion engines using continuous wavelet transformation and convolutional neural network. The approach outperformed existing methods, improving accuracy by 6.15 percentage points. The CWT + CNN method does not require calibrating threshold values for different engines or operating conditions, as long as diverse data is used for training.

ENERGIES (2021)

Article Computer Science, Artificial Intelligence

Synwalk: community detection via random walk modelling

Christian Toth, Denis Helic, Bernhard C. Geiger

Summary: This paper presents Synwalk, a random walk-based community detection method, which detects communities in networks by synthesizing random walks. The results indicate that Synwalk performs robustly in various network scenarios.

DATA MINING AND KNOWLEDGE DISCOVERY (2022)

Article Automation & Control Systems

Knock Detection in Combustion Engine Time Series Using a Theory-Guided 1-D Convolutional Neural Network Approach

Andreas Benjamin Ofner, Achilles Kefalas, Stefan Posch, Bernhard Claus Geiger

Summary: This study utilizes a 1-D convolutional neural network to detect knocking occurrences in an internal combustion engine. The network achieves an accuracy of over 92% in distinguishing between knocking and non-knocking cycles in tenfold cross-validation. The network outperforms existing methods and demonstrates remarkable generalization ability.

IEEE-ASME TRANSACTIONS ON MECHATRONICS (2022)

Article Chemistry, Multidisciplinary

Gaussian Process Surrogates for Modeling Uncertainties in a Use Case of Forging Superalloys

Johannes G. Hoffer, Bernhard C. Geiger, Roman Kern

Summary: The avoidance of scrap and adherence to tolerances are important goals in manufacturing. Researchers propose a simulation method using a Gaussian Process surrogate model that accounts for real manufacturing process uncertainties. The surrogate substitutes for expensive, computationally intensive finite element method (FEM) simulations, yielding a fast and robust method that adequately depicts reality.

APPLIED SCIENCES-BASEL (2022)

Article Chemistry, Analytical

Estimation of Combustion Parameters from Engine Vibrations Based on Discrete Wavelet Transform and Gradient Boosting

Achilles Kefalas, Andreas B. Ofner, Gerhard Pirker, Stefan Posch, Bernhard C. Geiger, Andreas Wimmer

Summary: The study investigates the potential of using a virtual sensor based on vibration signals acquired by a knock sensor for controlling the combustion process. A data-driven approach utilizing discrete wavelet transform as a preprocessing step and extreme gradient boosting regression models for regression tasks of combustion parameters is introduced. The methodology will be applied to data from two different spark-ignited, single cylinder gas engines, with analysis to identify important features based on the model's decisions.

SENSORS (2022)

Review Metallurgy & Metallurgical Engineering

Theory-inspired machine learning-towards a synergy between knowledge and data

Johannes G. Hoffer, Andreas B. Ofner, Franz M. Rohrhofer, Mario Lovric, Roman Kern, Stefanie Lindstaedt, Bernhard C. Geiger

Summary: This article introduces the application of theory-inspired machine learning in engineering fields, which combines the advantages of traditional models and modern data-driven methods. This approach can often result in models that are more accurate, simpler, better at extrapolating, and allow for faster model training or inference.

WELDING IN THE WORLD (2022)

Article Environmental Sciences

Machine Learning and Meteorological Normalization for Assessment of Particulate Matter Changes during the COVID-19 Lockdown in Zagreb, Croatia

Mario Lovric, Mario Antunovic, Iva Sunic, Matej Vukovic, Simonas Kecorius, Mark Kroell, Ivan Beslic, Ranka Godec, Gordana Pehnec, Bernhard C. Geiger, Stuart K. Grange, Iva Simic

Summary: The authors investigated the changes in particulate matter (PM) concentration during the COVID-19 lockdown in Zagreb, Croatia. The results showed that there were no significant differences in PM concentration during the lockdown compared to pre-lockdown and new normal periods.

INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH (2022)

Article Engineering, Mechanical

In-cylinder pressure reconstruction from engine block vibrations via a branched convolutional neural network

Andreas B. Ofner, Achilles Kefalas, Stefan Posch, Gerhard Pirker, Bernhard C. Geiger

Summary: This study introduces a novel approach to reconstructing in-cylinder pressure trace using vibration signals recorded by knock sensors. The proposed data-driven methodology employs a convolutional neural network with two distinct branches. The model architecture incentivizes each branch to learn low-frequency and high-frequency contents of the pressure trace. The reconstruction achieves high correlation and small errors, and can also extract peak firing pressure and peak pressure position.

MECHANICAL SYSTEMS AND SIGNAL PROCESSING (2023)

Review Computer Science, Artificial Intelligence

On Information Plane Analyses of Neural Network Classifiers--A Review

Bernhard C. Geiger

Summary: This article reviews the literature on information plane analysis of neural network classifiers, discussing the causal relationship between information bottleneck theory and generalization. The research found conflicting empirical evidence in IP analysis and emphasized the importance of detailed estimation of information quantities. It suggests that compression visualized in IPs may not necessarily be information-theoretic, but often compatible with geometric compression of latent representations.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Understanding Neural Networks and Individual Neuron Importance via Information-Ordered Cumulative Ablation

Rana Ali Amjad, Kairen Liu, Bernhard C. Geiger

Summary: In this study, three information-theoretic quantities were used to analyze the behavior of trained neural networks, revealing that class selectivity is not a reliable indicator for classification performance. However, when examining individual layers, mutual information and class selectivity show a positive correlation with classification performance.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Article Computer Science, Interdisciplinary Applications

Robust Bayesian target value optimization

J. G. Hoffer, S. Ranftl, B. C. Geiger

Summary: This article discusses how to find an input such that the output of a stochastic black box function is as close as possible to a target value. It fills the gap in current approaches by deriving acquisition functions for common criteria and demonstrating their compatibility with certain extensions of Gaussian processes. The experiments show that these derived acquisition functions can outperform classical Bayesian optimization.

COMPUTERS & INDUSTRIAL ENGINEERING (2023)

Review Computer Science, Information Systems

Reconsidering Read and Spontaneous Speech: Causal Perspectives on the Generation of Training Data for Automatic Speech Recognition

Philipp Gabler, Bernhard C. Geiger, Barbara Schuppler, Roman Kern

Summary: Superficially, read and spontaneous speech are two main types of training data in automatic speech recognition, but they are fundamentally different due to the way the audio signal is generated. This review introduces causal reasoning into automatic speech recognition, highlighting the impact of data generation processes on inference and performance. By applying a causal perspective, this work discusses the relationship between data generation mechanisms, learning, and prediction in speech data. Furthermore, the authors argue that a causal perspective can enhance the understanding of models in speech processing.

INFORMATION (2023)

Article Computer Science, Information Systems

Data vs. Physics: The Apparent Pareto Front of Physics-Informed Neural Networks

Franz M. M. Rohrhofer, Stefan Posch, Clemens Gossnitzer, Bernhard C. Geiger

Summary: This study investigates the impact of system parameters on multi-objective optimization in PINNs and the selection of loss weights. The authors find that system parameters effectively scale loss residuals and cause imbalances in MO optimization. However, they demonstrate that loss weights can compensate for the scaling of system parameters and enable the selection of an optimal solution on the Pareto front.

IEEE ACCESS (2023)

Article Automation & Control Systems

Knock Detection in Combustion Engine Time Series Using a Theory-Guided 1-D Convolutional Neural Network Approach

Andreas Benjamin Ofner, Achilles Kefalas, Stefan Posch, Bernhard Claus Geiger

Summary: This study demonstrates the effective detection of knocking events in an internal combustion engine using a 1-D convolutional neural network. The model is capable of accurately distinguishing between knocking and non-knocking cycles, showing remarkable generalization ability to unseen operating points. By training on a small number of non-knocking cycles, the model also achieved increased accuracy in classifying knocking cycles in unseen engines.

IEEE-ASME TRANSACTIONS ON MECHATRONICS (2022)