Article

Imbalance-XGBoost: leveraging weighted and focal losses for binary label-imbalanced classification with XGBoost

Journal

PATTERN RECOGNITION LETTERS
Volume 136, Pages 190-197

Publisher

ELSEVIER
DOI: 10.1016/j.patrec.2020.05.035

Keywords

Imbalanced classification; XGBoost; Python package

Funding

  1. National Natural Science Foundation of China [81872719, 81803337]
  2. Provincial Natural Science Foundation of Shandong Province [ZR201807090257]
  3. National Bureau of Statistics Foundation Project [2018LY79]

The paper presents Imbalance-XGBoost, a Python package that combines the XGBoost software with weighted and focal losses to tackle binary label-imbalanced classification tasks. Though small in size, the package is, to the best of our knowledge, the first of its kind that provides an integrated implementation of the two loss functions on XGBoost and brings a general-purpose extension to XGBoost for label-imbalanced scenarios. In this paper, the design and usage of the package are discussed and illustrated with examples. Furthermore, because the first- and second-order derivatives of the loss functions are essential to the implementation, their algebraic derivation is discussed and can be regarded as a separate contribution. The performance of the methods implemented in the package is extensively evaluated on a Parkinson's disease classification dataset, and multiple competitive results are presented with ROC and Precision-Recall (PR) curves. To further demonstrate the superiority of the methods, performance on four other benchmark datasets from the UCI machine learning repository is additionally reported. Given the scalable nature of XGBoost, the package has great potential to be broadly applied to real-life binary classification tasks, which are usually large-scale and label-imbalanced. © 2020 Elsevier B.V. All rights reserved.
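
The abstract stresses that the first- and second-order derivatives of the two losses are what the implementation depends on, since a custom boosting objective has to supply a gradient and a Hessian for each sample with respect to the raw score. The following is a minimal scalar sketch in plain Python, not the package's own code. The function names, the symbols `alpha` (positive-class weight) and `gamma` (focusing parameter), and the exact loss forms (a standard weighted cross-entropy and an unweighted focal loss) are assumptions made for illustration.

```python
import math


def sigmoid(z):
    # Numerically stable logistic function mapping a raw score to a probability.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def weighted_loss(z, y, alpha):
    # Assumed form: L = -[alpha * y * log(p) + (1 - y) * log(1 - p)], p = sigmoid(z).
    p = sigmoid(z)
    return -(alpha * y * math.log(p) + (1 - y) * math.log(1 - p))


def weighted_grad_hess(z, y, alpha):
    # Derivatives of weighted_loss with respect to the raw score z.
    p = sigmoid(z)
    w = alpha * y + (1 - y)  # alpha for positives, 1 for negatives
    grad = w * (p - y)
    hess = w * p * (1 - p)
    return grad, hess


def focal_loss(z, y, gamma):
    # Assumed form: L = -[y * (1-p)^gamma * log(p) + (1-y) * p^gamma * log(1-p)].
    p = sigmoid(z)
    return -(y * (1 - p) ** gamma * math.log(p)
             + (1 - y) * p ** gamma * math.log(1 - p))


def focal_grad_hess(z, y, gamma):
    # Derivatives of focal_loss with respect to the raw score z.
    # pt is the probability assigned to the true class; the y = 0 case is the
    # mirror image of the y = 1 case (p replaced by 1 - p, gradient sign flipped).
    p = sigmoid(z)
    pt = p if y == 1 else 1 - p
    log_pt = math.log(pt)
    g = gamma * pt * (1 - pt) ** gamma * log_pt - (1 - pt) ** (gamma + 1)
    hess = pt * (1 - pt) ** gamma * (
        gamma * (1 - pt) * log_pt
        - gamma ** 2 * pt * log_pt
        + (2 * gamma + 1) * (1 - pt)
    )
    grad = g if y == 1 else -g
    return grad, hess


if __name__ == "__main__":
    # Print the derivatives for a positive and a negative sample at a few scores.
    for y in (0, 1):
        for z in (-1.0, 0.0, 1.0):
            print(y, z, weighted_grad_hess(z, y, alpha=2.0),
                  focal_grad_hess(z, y, gamma=2.0))
```

With `gamma = 0` the focal derivatives reduce to those of ordinary cross-entropy, which is a convenient sanity check. To use formulas like these as a custom objective, they would need to be vectorized over all samples and returned as gradient and Hessian arrays.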

Recommended

Article Computer Science, Artificial Intelligence

Scalar Quantization as Sparse Least Square Optimization

Chen Wang, Xiaomei Yang, Shaomin Fei, Kai Zhou, Xiaofeng Gong, Miao Du, Ruisen Luo

Summary: This paper investigates the application of scalar quantization based on sparse least square optimization, proposing multiple quantization algorithms based on different regularization criteria, and comparing and testing them through iterative methods and clustering-based methods.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2021)

Article Computer Science, Artificial Intelligence

Adaptive ensemble of classifiers with regularization for imbalanced data classification

Chen Wang, Chengyuan Deng, Zhoulu Yu, Dafeng Hui, Xiaofeng Gong, Ruisen Luo

Summary: The study introduces a novel dynamic ensemble method AER to address the overfitting issue in binary imbalanced data classification through regularization and utilizing global geometry of data, demonstrating superior performance in experiments.

INFORMATION FUSION (2021)

Proceedings Paper Computer Science, Theory & Methods

Exploration with Limited Memory: Streaming Algorithms for Coin Tossing, Noisy Comparisons, and Multi-armed Bandits

Sepehr Assadi, Chen Wang

PROCEEDINGS OF THE 52ND ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '20) (2020)

Article Computer Science, Information Systems

Integrating Wildfires Propagation Prediction Into Early Warning of Electrical Transmission Line Outages

Songyi Dian, Peng Cheng, Qiang Ye, Jirong Wu, Ruisen Luo, Chen Wang, Dafeng Hui, Ning Zhou, Dong Zou, Qin Yu, Xiaofeng Gong

IEEE ACCESS (2019)

Proceedings Paper Automation & Control Systems

Bagging of Xgboost Classifiers with Random Under-sampling and Tomek Link for Noisy Label-imbalanced Data

Luo Ruisen, Dian Songyi, Wang Chen, Cheng Peng, Tang Zuodong, Yu YanMei, Wang Shixiong

3RD INTERNATIONAL CONFERENCE ON AUTOMATION, CONTROL AND ROBOTICS ENGINEERING (CACRE 2018) (2018)

Article Computer Science, Hardware & Architecture

The definition and numerical method of final value problem and arbitrary value problem

Shixiong Wang, Jianhua He, Chen Wang, Xitong Li

COMPUTER SYSTEMS SCIENCE AND ENGINEERING (2018)

Article Computer Science, Information Systems

Feature Learning With a Divergence-Encouraging Autoencoder for Imbalanced Data Classification

Ruisen Luo, Qian Feng, Chen Wang, Xiaomei Yang, Haiyan Tu, Qin Yu, Shaomin Fei, Xiaofeng Gong

IEEE ACCESS (2018)
