4.7 Article

New insights into churn prediction in the telecommunication sector: A profit driven data mining approach

Journal

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH
Volume 218, Issue 1, Pages 211-229

Publisher

ELSEVIER
DOI: 10.1016/j.ejor.2011.09.031

Keywords

Data mining; Churn prediction; Profit; Input selection; Oversampling; Telecommunication sector

Funding

  1. Flemish Research Council (FWO) [B.0915.09]
  2. National Bank of Belgium [NBB/11/004]

Ask authors/readers for more resources

Customer churn prediction models aim to indicate the customers with the highest propensity to attrite, allowing to improve the efficiency of customer retention campaigns and to reduce the costs associated with churn. Although cost reduction is their prime objective, churn prediction models are typically evaluated using statistically based performance measures, resulting in suboptimal model selection. Therefore, in the first part of this paper, a novel, profit centric performance measure is developed, by calculating the maximum profit that can be generated by including the optimal fraction of customers with the highest predicted probabilities to attrite in a retention campaign. The novel measure selects the optimal model and fraction of customers to include, yielding a significant increase in profits compared to statistical measures. In the second part an extensive benchmarking experiment is conducted, evaluating various classification techniques applied on eleven real-life data sets from telecom operators worldwide by using both the profit centric and statistically based performance measures. The experimental results show that a small number of variables suffices to predict churn with high accuracy, and that oversampling generally does not improve the performance significantly. Finally, a large group of classifiers is found to yield comparable performance. (C) 2011 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Management

Cost-sensitive learning for profit-driven credit scoring

George Petrides, Darie Moldovan, Lize Coenen, Tias Guns, Wouter Verbeke

Summary: This study evaluates the effectiveness of cost-sensitive learning methods in improving the profitability of credit scorecards. The results show that these methods can increase profitability, especially for channels with higher default rates.

JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY (2022)

Article Management

Machine learning methods for short-term probability of default: A comparison of classification, regression and ranking methods

Lize Coenen, Wouter Verbeke, Tias Guns

Summary: This study investigates the use of machine learning for a finer-grained risk estimation task in spot factoring. The results show that regression models can lead to higher profits and better risk distribution in spot factoring.

JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY (2022)

Article Computer Science, Artificial Intelligence

Weight-of-evidence through shrinkage and spline binning for interpretable nonlinear classification

Jakob Raymaekers, Wouter Verbeke, Tim Verdonck

Summary: This paper proposes a formalized, data-driven method for increasing classification precision while maintaining model interpretability and reducing the risk of overfitting. By exploring the discretization of continuous variables through spline functions, the method captures nonlinear effects in predictor variables and yields interpretable predictors with a small number of discrete values. Furthermore, the extension of the weight-of-evidence approach with shrinkage estimators improves the ability to utilize both nonlinear and categorical predictors effectively.

APPLIED SOFT COMPUTING (2022)

Article Computer Science, Artificial Intelligence

Cost-sensitive ensemble learning: a unifying framework

George Petrides, Wouter Verbeke

Summary: The study introduces a unified framework for cost-sensitive ensemble methods, categorizing and comparing them, including extensions and generalizations for methods like AdaBoost, Bagging, and Random Forest.

DATA MINING AND KNOWLEDGE DISCOVERY (2022)

Article Management

Instance-dependent cost-sensitive learning for detecting transfer fraud

Sebastiaan Hoppner, Bart Baesens, Wouter Verbeke, Tim Verdonck

Summary: Credit card transaction fraud is a global issue, and financial institutions are increasingly relying on data-driven methods to develop fraud detection systems for detecting and preventing fraudulent transactions. The article introduces two novel classifiers that minimize the instance-dependent cost measure when learning a classification model, highlighting the potential to reduce fraud losses.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2022)

Article Computer Science, Information Systems

Predict-then-optimize or predict-and-optimize? An empirical evaluation of cost-sensitive learning strategies

Toon Vanderschueren, Tim Verdonck, Bart Baesens, Wouter Verbeke

Summary: Predictive models are increasingly used to optimize decision-making and minimize costs. This work compared the predict-then-optimize approach with the predict-and-optimize approach in cost-sensitive classification. The key finding was that the decision-making strategy was generally more effective than training with a task-specific loss or their combination.

INFORMATION SCIENCES (2022)

Article Computer Science, Artificial Intelligence

Data misrepresentation detection for insurance underwriting fraud prevention

Felix Vandervorst, Wouter Verbeke, Tim Verdonck

Summary: This paper proposes a novel approach to evaluate the risk of underwriting premium fraud by using conditional density estimates. The approach supports insurance companies in identifying fraudulent applications and can adapt to changes in pricing policy. It can also detect outliers and predict underwriting fraud.

DECISION SUPPORT SYSTEMS (2022)

Article Management

To do or not to do? Cost-sensitive causal classification with individual treatment effect estimates

Wouter Verbeke, Diego Olaya, Marie-Anne Guerry, Jente Van Belle

Summary: Individual treatment effect models optimize decision-making by predicting the effect of treatment on specific outcomes of individual instances. This article introduces a cost-sensitive causal classification approach based on predictions of individual treatment effects, which allows selecting instances to treat and maximize expected causal profit.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2023)

Article Computer Science, Artificial Intelligence

HydaLearn Highly Dynamic Task Weighting for Multitask Learning with Auxiliary Tasks

Sam Verboven, Muhammad Hafeez Chaudhary, Jeroen Berrevoets, Vincent Ginis, Wouter Verbeke

Summary: Multitask learning can enhance the performance of a task by sharing representations with related auxiliary tasks. However, static loss weights often lead to poor results. This paper introduces an intelligent weighting algorithm called HydaLearn that addresses the shortcomings of static loss weights by connecting the main-task gain to individual task gradients, allowing for dynamic loss weighting at the minibatch level. Experiments show significant performance improvements on synthetic and real-world datasets.

APPLIED INTELLIGENCE (2023)

Article Economics

Improving forecast stability using deep learning

Jente Van Belle, Ruben Crevits, Wouter Verbeke

Summary: In this paper, the authors define forecast (in)stability as the variability in forecasts caused by updating them over time. They propose an extension to the N-BEATS deep learning architecture for time series forecasting, optimizing forecasts for both accuracy and stability. Experimental results show that the proposed extension improves both accuracy and stability compared to the original N-BEATS architecture, suggesting that including forecast instability in the loss function can serve as a regularization mechanism.

INTERNATIONAL JOURNAL OF FORECASTING (2023)

Article Computer Science, Artificial Intelligence

Learning to Rank for Uplift Modeling

Floris Devriendt, Jente Van Belle, Tias Guns, Wouter Verbeke

Summary: This article investigates the application of learning to rank techniques in causal classification and its combination with uplift modeling. A unified formalization model is proposed, and a new performance metric is introduced. The effectiveness of this method is demonstrated through experiments, but the results are only applicable in specific scenarios.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2022)

Article Engineering, Industrial

Optimizing the preventive maintenance frequency with causal machine

Toon Vanderschueren, Robert Boute, Tim Verdonck, Bart Baesens, Wouter Verbeke

Summary: This study uses causal inference to learn the effect of preventive maintenance frequency on asset overhaul and failure rates, based on observational data. The learned outcomes are used to optimize maintenance schedules and minimize the combined cost of failures, overhauls, and preventive maintenance.

INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS (2023)

Article Statistics & Probability

Robust instance-dependent cost-sensitive classification

Simon De Vos, Toon Vanderschueren, Tim Verdonck, Wouter Verbeke

Summary: In this paper, the authors demonstrate that instance-dependent cost-sensitive (IDCS) learning methods are sensitive to noise and outliers in relation to instance-dependent misclassification costs. They propose a three-step framework to enhance the robustness of IDCS methods by automatically detecting outliers, correcting outlying cost information, and constructing an IDCS learning method. The newly proposed r-cslogit method, tested on synthetic and semi-synthetic data, shows superior savings compared to its non-robust counterpart for different levels of noise and outliers.

ADVANCES IN DATA ANALYSIS AND CLASSIFICATION (2023)

Article Computer Science, Artificial Intelligence

Fraud analytics: A decade of research Organizing challenges and solutions in the field

Christopher Bockel-Rickermann, Tim Verdonck, Wouter Verbeke

Summary: The literature on fraud analytics and detection has experienced a significant increase in the past decade, resulting in a wide range of research topics and a lack of overall organization. This paper provides an overview of fraud analytics, analyzes published records, identifies prominent domains, challenges, performance metrics, and methods, and proposes a framework and keywording strategy for future research. Additionally, the paper addresses the challenge of accessing public datasets and offers requirements for suitable data sets, while providing an online database for fellow researchers to explore and build upon.

EXPERT SYSTEMS WITH APPLICATIONS (2023)

Article Mathematics, Interdisciplinary Applications

Treatment effect optimisation in dynamic environments

Jeroen Berrevoets, Sam Verboven, Wouter Verbeke

Summary: Applying causal methods in fields like healthcare, marketing, and economics is gaining increasing interest. The research on individual-treatment-effect optimization, also known as uplift modelling, has reached its peak in precision medicine and targeted advertising. Existing techniques have shown their utility in many scenarios, but they suffer from limitations in dynamic environments. To address this, the researchers propose a novel optimization target called uplifted contextual multi-armed bandit, which effectively improves upon the state-of-the-art according to experiments on real and simulated data.

JOURNAL OF CAUSAL INFERENCE (2022)

Review Management

Survey of optimization models for power system operation and expansion planning with demand response

Vinicius N. Motta, Miguel F. Anjos, Michel Gendreau

Summary: This survey presents a review of optimization approaches for the integration of demand response in power systems planning and highlights important future research directions.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)

Article Management

R-SALSA: A branch, bound, and remember algorithm for the workload smoothing problem on simple assembly lines

Philipp Schulze, Armin Scholl, Rico Walter

Summary: This paper proposes an improved branch-and-bound algorithm, R-SALSA, for solving the simple assembly line balancing problem, which performs well in balancing workloads and providing initial solutions.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)

Article Management

Adaptive scheduling in service systems: A Dynamic programming approach

Roshan Mahes, Michel Mandjes, Marko Boon, Peter Taylor

Summary: This paper discusses appointment scheduling and presents a phase-type-based approach to handle variations in service times. Numerical experiments with dynamic scheduling demonstrate the benefits of rescheduling.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)

Article Management

Discrete scheduling and critical utilization

Oleg S. Pianykh, Sebastian Perez, Chengzhao Richard Zhang

Summary: Efficient scheduling is crucial for optimizing resource allocation and system performance. This study focuses on critical utilization and efficient scheduling in discrete scheduling systems, and compares the results with classical queueing theory.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)

Review Management

Supply chain network design with financial considerations: A comprehensive review

Hamed Jahani, Babak Abbasi, Jiuh-Biing Sheu, Walid Klibi

Summary: Supply chain network design is a large and growing area of research. This study comprehensively surveys and analyzes articles published from 2008 to 2021 to detect and report financial perspectives in SCND models. The study also identifies research gaps and offers future research directions.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)

Article Management

A branch-and-cut algorithm for the connected max- k-cut problem

Patrick Healy, Nicolas Jozefowiez, Pierre Laroche, Franc Marchetti, Sebastien Martin, Zsuzsanna Roka

Summary: The Connected Max-k-Cut Problem is an extension of the well-known Max-Cut Problem, where the objective is to partition a graph into k connected subgraphs by maximizing the cost of inter-partition edges. The researchers propose a new integer linear program and a branch-and-cut algorithm for this problem, and also use graph isomorphism to structure the instances and facilitate their resolution. Extensive computational experiments show that, if k > 2, their approach outperforms existing algorithms in terms of quality.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)

Article Management

Estimating production functions through additive models based on regression splines

Victor J. Espana, Juan Aparicio, Xavier Barber, Miriam Esteve

Summary: This paper introduces a new methodology based on the machine learning technique MARS for estimating production functions that satisfy classical production theory axioms. The new approach overcomes the overfitting problem of DEA through generalized cross-validation and demonstrates better performance in reducing mean squared error and bias compared to DEA and C2NLS methods.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)

Article Management

Time-flexible min completion time variance in a single machine by quadratic programming

Stefano Nasini, Rabia Nessah

Summary: In this paper, the authors investigate the impact of time flexibility in job scheduling, showing that it can significantly affect operators' ability to solve the problem efficiently. They propose a new methodology based on convex quadratic programming approaches that allows for optimal solutions in large-scale instances.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)

Article Management

Convex support vector regression

Zhiqiang Liao, Sheng Dai, Timo Kuosmanen

Summary: Nonparametric regression subject to convexity or concavity constraints is gaining popularity in various fields. The conventional convex regression method often suffers from overfitting and outliers. This paper proposes the convex support vector regression method to address these issues and demonstrates its advantages in prediction accuracy and robustness through numerical experiments.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)

Article Management

A simulation evacuation framework for effective disaster preparedness strategies and response decision making

Kuo-Hao Chang, Ying-Zheng Wu, Wen-Ray Su, Lee-Yaw Lin

Summary: The damage and destruction caused by earthquakes necessitates the evacuation of affected populations. Simulation models, such as the Stochastic Pedestrian Cell Transmission Model (SPCTM), can be utilized to enhance disaster and evacuation management. The analysis of SPCTM provides insights for government officials to formulate effective evacuation strategies.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)

Article Management

An effective hybrid evolutionary algorithm for the clustered orienteering problem

Qinghua Wu, Mu He, Jin-Kao Hao, Yongliang Lu

Summary: This paper studies a variant of the orienteering problem known as the clustered orienteering problem. In this problem, customers are grouped into clusters and a profit is associated with each cluster, collected only when all customers in the cluster are served. The proposed evolutionary algorithm, incorporating a backbone-based crossover operator and a destroy-and-repair mutation operator, outperforms existing algorithms on benchmark instances and sets new records on some instances. It also demonstrates scalability on large instances and has shown superiority over three state-of-the-art COP algorithms. The algorithm is also successfully applied to a dynamic version of the COP considering stochastic travel time.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)

Article Management

Improving uplift model evaluation on randomized controlled trial data

Bjorn Bokelmann, Stefan Lessmann

Summary: Estimating treatment effects is an important task for data analysts, and uplift models provide support for efficient allocation of treatments. However, evaluating uplift models is challenging due to variance issues. This paper theoretically analyzes the variance of uplift evaluation metrics, proposes variance reduction methods based on statistical adjustment, and demonstrates their benefits on simulated and real-world data.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)

Article Management

Newsvendor conditional value-at-risk minimisation: A feature-based approach under adaptive data selection

Congzheng Liu, Wenqi Zhu

Summary: This paper proposes a feature-based non-parametric approach to minimizing the conditional value-at-risk in the newsvendor problem. The method is able to handle both linear and nonlinear profits without prior knowledge of the demand distribution. Results from numerical and real-life experiments demonstrate the robustness and effectiveness of the approach.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)

Article Management

Right-left asymmetry of the eigenvector method: A simulation study

Laszlo Csato

Summary: This paper compares the performance of the eigenvalue method and the row geometric mean as two weighting procedures. Through numerical experiments, it is found that the priorities derived from the two eigenvectors in the eigenvalue method do not always agree, while the row geometric mean serves as a compromise between them.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)

Article Management

Compete or cooperate? Effects of channel relationships on government policies for sustainability

Guowei Dou, Tsan-Ming Choi

Summary: This study investigates the impact of channel relationships between manufacturers on government policies and explores the effectiveness of positive incentives versus taxes in increasing social welfare. The findings suggest that competition may be more effective in improving sustainability and social welfare. Additionally, government incentives for green technology may not necessarily enhance sustainability.

EUROPEAN JOURNAL OF OPERATIONAL RESEARCH (2024)