Article

Effective detection of sophisticated online banking fraud on extremely imbalanced data

Journal

Publisher

SPRINGER
DOI: 10.1007/s11280-012-0178-0

Keywords

fraud detection; online banking; contrast pattern; neural network; data mining

Funding

  1. Australian Research Council [DP1096218, LP100200774]

Sophisticated online banking fraud reflects the integrative abuse of resources in the social, cyber and physical worlds. Its detection is a typical use case of the broad-based Wisdom Web of Things (W2T) methodology. However, very limited information is available to distinguish dynamic fraud from genuine customer behavior in such an extremely sparse and imbalanced data environment, which makes instant and effective detection increasingly important and challenging. In this paper, we propose an effective online banking fraud detection framework that synthesizes relevant resources and incorporates several advanced data mining techniques. By building a contrast vector for each transaction based on its customer's historical behavior sequence, we profile the differentiating rate of each current transaction against the customer's behavior preference. A novel algorithm, ContrastMiner, is introduced to efficiently mine contrast patterns and distinguish fraudulent from genuine behavior, followed by effective pattern selection and risk scoring that combine predictions from different models. Results from experiments on large-scale real online banking data demonstrate that our system can achieve substantially higher accuracy and lower alert volume than the latest benchmarking fraud detection system, which incorporates domain knowledge and traditional fraud detection methods.
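
The two ideas named in the abstract, a per-transaction contrast vector and contrast patterns that separate fraud from genuine behavior, can be illustrated with a minimal sketch. This is not the paper's ContrastMiner algorithm, which the abstract does not specify. It assumes a simple frequency-based contrast vector and the generic growth-rate measure from the contrast (emerging) pattern literature. All function names, attribute names and sample data below are hypothetical.

```python
def contrast_vector(history, current):
    """Per attribute: 1.0 minus the share of past transactions with the same value.

    Assumption: this frequency-based definition is illustrative only; the
    abstract does not say how the paper builds its contrast vector.
    """
    n = len(history)
    vec = {}
    for attr, value in current.items():
        matches = sum(1 for past in history if past.get(attr) == value)
        vec[attr] = 1.0 - (matches / n if n else 0.0)
    return vec


def support(pattern, transactions):
    """Fraction of transactions (sets of items) that contain every item in pattern."""
    if not transactions:
        return 0.0
    hits = sum(1 for t in transactions if pattern <= t)
    return hits / len(transactions)


def growth_rate(pattern, fraud, genuine):
    """Ratio of the pattern's support in fraud to its support in genuine data.

    A large ratio marks a candidate contrast pattern; infinity means the
    pattern occurs in fraud but never in genuine transactions.
    """
    s_fraud = support(pattern, fraud)
    s_genuine = support(pattern, genuine)
    if s_genuine == 0.0:
        return float("inf") if s_fraud > 0.0 else 0.0
    return s_fraud / s_genuine


# Hypothetical customer history and a new transaction to profile.
history = [
    {"payee": "A", "hour": "day"},
    {"payee": "A", "hour": "day"},
    {"payee": "B", "hour": "day"},
    {"payee": "A", "hour": "night"},
]
current = {"payee": "C", "hour": "night"}
vec = contrast_vector(history, current)

# Hypothetical labeled transactions, each encoded as a set of discretized items.
fraud = [frozenset({"new_payee", "night"}), frozenset({"new_payee", "high_amount"})]
genuine = [
    frozenset({"new_payee"}),
    frozenset({"day"}),
    frozenset({"day"}),
    frozenset({"night"}),
]
rate_single = growth_rate(frozenset({"new_payee"}), fraud, genuine)
rate_pair = growth_rate(frozenset({"new_payee", "night"}), fraud, genuine)
```

In this toy data the never-seen payee gets the maximum contrast value, and the item pair that appears only among fraud cases gets an unbounded growth rate. With fraud as rare as the abstract describes, supports on the fraud side rest on very few samples, so a real system would need a minimum-support threshold and a pattern selection step before scoring.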

Recommended

Article Computer Science, Information Systems

Recurrent Coupled Topic Modeling over Sequential Documents

Jinjin Guo, Longbing Cao, Zhiguo Gong

Summary: This work introduces a dynamic topic modeling method based on multi-topic-thread evolution. It disentangles the multiple couplings between evolving topics through data augmentation techniques, improving the effectiveness and efficiency of the inference technique.

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA (2022)

Article Computer Science, Artificial Intelligence

Unsupervised Heterogeneous Coupling Learning for Categorical Representation

Chengzhang Zhu, Longbing Cao, Jianping Yin

Summary: This paper introduces a shallow but powerful unsupervised learning method called UNTIE for representing coupled categorical data. It reveals heterogeneous distributions between couplings and achieves significant performance improvement on multiple categorical datasets.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Multidisciplinary Sciences

Personalized next-best action recommendation with multi-party interaction learning for automated decision-making

Longbing Cao, Chengzhang Zhu

Summary: Automated next-best action recommendation for each customer in a sequential, dynamic and interactive context is widely needed in decision-making. However, existing modeling theories and tools cannot quantify such complex decision-making on a personal level. This study proposes a data-driven approach using a reinforced coupled recurrent neural network to learn personalized next-best actions, demonstrating the potential of personalized deep learning and automated dynamic intervention for personalized decision-making in complex systems.

PLOS ONE (2022)

Article Biology

BiT-MAC: Mortality prediction by bidirectional time and multi-feature attention coupled network on multivariate irregular time series

Qinfen Wang, Geng Chen, Xuting Jin, Siyuan Ren, Gang Wang, Longbing Cao, Yong Xia

Summary: Mortality prediction is crucial in evaluating illness severity and improving patient prognosis. Existing methods for analyzing multivariate time series (MTSs) suffer from sparse and incomplete data. We propose a BiT-MAC network that captures both intra-time series coupling and inter-time series coupling to estimate missing values and improve MTS-based prediction. Extensive experiments on clinical datasets demonstrate the superiority of BiT-MAC and the interpretability of its features.

COMPUTERS IN BIOLOGY AND MEDICINE (2023)

Article Computer Science, Artificial Intelligence

An Efficient Method for Modeling Nonoccurring Behaviors by Negative Sequential Patterns With Loose Constraints

Ping Qiu, Yongshun Gong, Yuhai Zhao, Longbing Cao, Chengqi Zhang, Xiangjun Dong

Summary: This article explores an efficient method for mining negative sequential patterns (NSPs) using temporal point processes (TPPs) to model frequently occurring and nonoccurring events and behaviors. By loosening constraints, a new definition of negative containment is provided, and an efficient method for calculating the supports of negative sequences is proposed. Finally, a novel and efficient algorithm is presented to identify valuable NSPs.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Editorial Material Computer Science, Artificial Intelligence

AI and data science for smart emergency, crisis and disaster resilience

Longbing Cao

Summary: The uncertain world faces increasing emergencies, crises and disasters (ECDs), including the COVID-19 pandemic, Hurricane Ian, global financial inflation and recession, misinformation disasters, and cyberattacks. AI for smart disaster resilience (AISDR) transforms traditional reactive and scripted disaster management into proactive and intelligent resilience in the face of diverse ECDs. This article provides a systematic overview of various ECDs, conventional ECD management, ECD data complexities, and the research landscape of AISDR. Translational disaster AI is crucial in enabling smart disaster resilience.

INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS (2023)

Article Computer Science, Artificial Intelligence

Trans-AI/DS: transformative, transdisciplinary and translational artificial intelligence and data science

Longbing Cao

Summary: After years of development, a new generation of AI and data science has emerged, based on the integration of science, technology, and engineering. This new generation embraces Trans-AI/DS thinking, which combines AI and data science to promote transformative, transdisciplinary, and translational approaches. These paradigm shifts encourage innovative thinking beyond traditional AI and data-driven methods, and focus on the complexities of human intelligence, nature, society, and their creations.

INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS (2023)

Article Computer Science, Artificial Intelligence

Trans-AI/DS: transformative, transdisciplinary and translational artificial intelligence and data science

Longbing Cao

Summary: After 70 years of AI and 50 years of DS, AI/DS have entered a new age, where they are built upon the integration of science, technology, and engineering. This integration has resulted in Trans-AI/DS, which promote transformative, transdisciplinary, and translational thinking, methodologies, and practices in AI/DS.

INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS (2023)

Article Computer Science, Artificial Intelligence

Modeling User Demand Evolution for Next-Basket Prediction

Shoujin Wang, Yan Wang, Liang Hu, Xiuzhen Zhang, Qi Zhang, Quan Z. Sheng, Mehmet A. Orgun, Longbing Cao, Defu Lian

Summary: Users' purchase behaviors are complex and dynamic, driven by personal demands evolving over time. Predicting the next basket involves tracking demand changes and satisfying the current demand. The EvoDESA model predicts the next basket by learning demand dynamics and effectively packing item combinations to best satisfy the user, showing considerable superiority over existing approaches.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

Copula Variational LSTM for High-Dimensional Cross-Market Multivariate Dependence Modeling

Jia Xu, Longbing Cao

Summary: This paper proposes a method that combines deep variational sequential learning with copula-based statistical dependence modeling to address the challenging problem of modeling high-dimensional, long-range dependencies between nonnormal multivariates. The method can characterize both the temporal dependence degrees and structures between the hidden variables representing the nonnormal multivariates, and it outperforms benchmarks in terms of both technical significance and portfolio forecasting performance.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Revealing the Distributional Vulnerability of Discriminators by Implicit Generators

Zhilin Zhao, Longbing Cao, Kun-Yu Lin

Summary: In deep neural learning, training a discriminator on in-distribution samples may lead to misclassification of out-of-distribution samples, which poses a significant challenge for robust and safe deep learning. To address this issue, we propose a general approach called Fine-tuning Discriminators by Implicit Generators (FIG) that enhances the discriminatory power of standard discriminators in distinguishing in-distribution and out-of-distribution samples. FIG leverages information theory to infer an energy-based implicit generator from a discriminator and uses a Langevin dynamic sampler to draw specific out-of-distribution samples. Experimental results demonstrate that FIG achieves state-of-the-art out-of-distribution detection performance.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

Out-of-Distribution Detection by Cross-Class Vicinity Distribution of In-Distribution Data

Zhilin Zhao, Longbing Cao, Kun-Yu Lin

Summary: Deep neural networks for image classification only learn to map in-distribution inputs to their corresponding ground-truth labels in training, without differentiating out-of-distribution samples from in-distribution ones. To address this issue, we draw out-of-distribution samples from the vicinity distribution of training in-distribution samples to learn to reject predictions on out-of-distribution inputs. Experiments show that the proposed method significantly outperforms existing methods in improving the capacity to discriminate between in- and out-of-distribution samples.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Gray Learning From Non-IID Data With Out-of-Distribution Samples

Zhilin Zhao, Longbing Cao, Chang-Dong Wang

Summary: The integrity of training data is uncertain, especially for non-IID datasets. Experts may misclassify samples, leading to unreliable labels. This study proposes a gray learning (GL) method that leverages both ground-truth and complementary labels to improve the robustness of neural networks.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Theory & Methods

AI in Finance: Challenges, Techniques, and Opportunities

Longbing Cao

Summary: This article provides an overview of the application of artificial intelligence and data science (AIDS) techniques in the finance industry. It offers a comprehensive and dense landscape of the challenges, techniques, and opportunities of AIDS research in finance over the past decades. The article outlines the challenges of financial businesses and data, categorizes the decades of AIDS research in finance, illustrates data-driven analytics and learning in financial businesses, compares classic and modern AIDS techniques, and discusses future opportunities for AIDS-empowered finance and finance-motivated AIDS research.

ACM COMPUTING SURVEYS (2023)

Proceedings Paper Computer Science, Information Systems

Mining Contextual Item Similarity without Concept Hierarchy

Md Fahim Arefin, Chowdhury Farhan Ahmed, Redwan Ahmed Rizvee, Carson K. Leung, Longbing Cao

Summary: In this paper, a novel measure of similarity is proposed to evaluate contextual similarity between items without using any additional metadata. An optimal algorithm and a heuristic algorithm are proposed to calculate this measure. The experimental results confirm the effectiveness and versatility of this measure in data of varying nature.

PROCEEDINGS OF THE 2022 16TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2022) (2022)
