4.7 Article

A hybrid model based on rough sets theory and genetic algorithms for stock price forecasting

期刊

INFORMATION SCIENCES
卷 180, 期 9, 页码 1610-1629

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2010.01.014

关键词

Rough set theory; Genetic algorithms; Cumulative probability distribution approach; Minimize entropy principle approach; Technical indicators

向作者/读者索取更多资源

In the stock market, technical analysis is a useful method for predicting stock prices. Although, professional stock analysts and fund managers usually make subjective judgments, based on objective technical indicators, it is difficult for non-professionals to apply this forecasting technique because there are too many complex technical indicators to be considered. Moreover, two drawbacks have been found in many of the past forecasting models: (1) statistical assumptions about variables are required for time series models, such as the autoregressive moving average model (ARMA) and the autoregressive conditional heteroscedasticity (ARCH), to produce forecasting models of mathematical equations, and these are not easily understood by stock investors; and (2) the rules mined from some artificial intelligence (AI) algorithms, such as neural networks (NN), are not easily realized. In order to overcome these drawbacks, this paper proposes a hybrid forecasting model, using multi-technical indicators to predict stock price trends. Further, it includes four proposed procedures in the hybrid model to provide efficient rules for forecasting, which are evolved from the extracted rules with high support value, by using the toolset based on rough sets theory (RST): (1) select the essential technical indicators, which are highly related to the future stock price, from the popular indicators based on a correlation matrix; (2) use the cumulative probability distribution approach (CDPA) and minimize the entropy principle approach (MEPA) to partition technical indicator value and daily price fluctuation into linguistic values, based on the characteristics of the data distribution; (3) employ a RST algorithm to extract linguistic rules from the linguistic technical indicator dataset; and (4) utilize genetic algorithms (GAs) to refine the extracted rules to get better forecasting accuracy and stock return. The effectiveness of the proposed model is verified with two types of performance evaluations, accuracy and stock return, and by using a six-year period of the TAIEX (Taiwan Stock Exchange Capitalization Weighted Stock Index) as the experiment dataset. The experimental results show that the proposed model is superior to the two listed forecasting models (RST and GAs) in terms of accuracy, and the stock return evaluations have revealed that the profits produced by the proposed model are higher than the three listed models (Buy-and-Hold, RST and GAs). (C) 2010 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Multidisciplinary Sciences

A Clinical Decision-Support System Based on Three-Stage Integrated Image Analysis for Diagnosing Lung Disease

Ching-Hsue Cheng, Hsien-Hsiu Chen, Tai-Liang Chen

SYMMETRY-BASEL (2020)

Article Biology

A novel weighted distance threshold method for handling medical missing values

Ching-Hsue Cheng, Jing-Rong Chang, Hao-Hsuan Huang

COMPUTERS IN BIOLOGY AND MEDICINE (2020)

Review Computer Science, Artificial Intelligence

A systematic review to identify the effects of tea by integrating an intelligence-based hybrid text mining and topic model

You-Shyang Chen, Ching-Hsue Cheng, Wei-Lun Hung

Summary: This study addresses the emerging issue of tea and health by proposing a hybrid method of intelligent text mining and topic modeling. It fills the gap in knowledge on the curative effects of tea against fatal diseases, particularly cancer, and offers eight beneficial directions for future research and applications.

SOFT COMPUTING (2021)

Article Computer Science, Artificial Intelligence

A financial statement fraud model based on synthesized attribute selection and a dataset with missing values and imbalanced classes

Ching-Hsue Cheng, Yung-Fu Kao, Hsien-Ping Lin

Summary: The study establishes a model for detecting financial statement fraud, addresses missing values and imbalanced classes, proposes useful rules through various methods, utilizes a random forest model, and demonstrates the robustness of ensemble learning in this research.

APPLIED SOFT COMPUTING (2021)

Article Biology

A multiple combined method for rebalancing medical data with class imbalances

Yun-Chun Wang, Ching-Hsue Cheng

Summary: This study proposes a multiple combined method to address class imbalances in medical data, utilizing resampling, particle swarm optimization, and MetaCost. Experimental results demonstrate improvement in various evaluation metrics, suggesting the effectiveness of the proposed approach in comparison to traditional methods.

COMPUTERS IN BIOLOGY AND MEDICINE (2021)

Article Computer Science, Artificial Intelligence

A novel clustering-based purity and distance imputation for handling medical data with missing values

Ching-Hsue Cheng, Shu-Fen Huang

Summary: In the field of medical data imputation methods, a clustering-based purity and distance imputation method is proposed to improve the handling of missing values. Experimental results indicate that this method can enhance imputation performance in terms of accuracy, AUC, and RMSE for different missing degrees and types.

SOFT COMPUTING (2021)

Article Chemistry, Multidisciplinary

An Intelligent Time-Series Model for Forecasting Bus Passengers Based on Smartcard Data

Ching-Hsue Cheng, Ming-Chi Tsai, Yi-Chen Cheng

Summary: This study used smartcard data to forecast bus passenger flow and established an integrated-weight time-series forecast model. The lag period was found to significantly affect the forecast results, and the proposed model was more effective than other individual intelligent forecast models in improving passenger flow forecasting.

APPLIED SCIENCES-BASEL (2022)

Article Environmental Sciences

An Intelligent Time Series Model Based on Hybrid Methodology for Forecasting Concentrations of Significant Air Pollutants

Ching-Hsue Cheng, Ming-Chi Tsai

Summary: In this study, a hybrid methodology was used to forecast the concentrations of air pollutants, showing that random forest and intelligent time series support vector regression performed well in classification and prediction. These research results are of great reference value for addressing air quality issues.

ATMOSPHERE (2022)

Article Multidisciplinary Sciences

Rule-based classifier based on accident frequency and three-stage dimensionality reduction for exploring the factors of road accident injuries

Ching-Hsue Cheng, Jun-He Yang, Po-Chien Liu

Summary: This study analyzes traffic accident data in Taoyuan, Taiwan to identify key attributes related to accident severity. The findings provide insights for governments and stakeholders to reduce road accident risk factors.

PLOS ONE (2022)

Article Computer Science, Artificial Intelligence

A weighted-link graph neural network for lung cancer knowledge classification

Ching-Hsue Cheng, Zheng-Ting Ji

Summary: This study utilized visualized knowledge representation to analyze lung cancer literature, employing natural language processing and latent Dirichlet allocation method for topic modeling and classification. A new weighted knowledge graph construction method was proposed and trained using graph neural network algorithms. The results showed improved classification performance and effective reduction of edges on the knowledge graphs.

APPLIED INTELLIGENCE (2023)

Article Social Sciences, Interdisciplinary

A Time Series Model Based on Deep Learning and Integrated Indicator Selection Method for Forecasting Stock Prices and Evaluating Trading Profits

Ching-Hsue Cheng, Ming-Chi Tsai, Chin Chang

Summary: A stock forecasting and trading system is complex, and this study proposes an effective time series model for predicting stock prices by integrating various models and methods, demonstrating its advantages over traditional models.

SYSTEMS (2022)

Article Computer Science, Information Systems

Double-weight LDA extracting keywords for financial fraud detection system

Ching-Hsue Cheng, Wen-Hong Cai

Summary: This study proposes an intelligent financial fraud detection system using a double-weight latent Dirichlet allocation (DW-LDA) algorithm to extract keywords and build an intelligent text fraud detection model. In addition, it uses SMOTE and random undersampling to handle imbalanced datasets. The results show that the proposed algorithm outperforms existing topic models in terms of performance.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Computer Science, Artificial Intelligence

Linguistic multi-criteria decision-making aggregation model based on situational ME-LOWA and ME-LOWGA operators

Ching-Hsue Cheng, Mu-Yen Chen, Jing-Rong Chang

Summary: This paper presents a linguistic MCDM aggregation model, which can handle problems under different decision situations based on the decision-maker's preference.

GRANULAR COMPUTING (2023)

Article Environmental Studies

An Intelligent Homogeneous Model Based on an Enhanced Weighted Kernel Self-Organizing Map for Forecasting House Prices

Ching-Hsue Cheng, Ming-Chi Tsai

Summary: This study proposed an intelligent homogeneous model based on an enhanced weighted kernel self-organizing map (EW-KSOM) for forecasting house prices, and found that the best prediction algorithm is the combination of EW-KSOM and random forest. The top five key factors influencing house prices include transferred land area, house age, building transfer total area, population percentage, and the total number of floors.
Article Education & Educational Research

Investigating the impacts of using a mobile interactive English learning system on the learning achievements and learning perceptions of student with different backgrounds

Ching-Hsue Cheng, Chung-Hsi Chen

Summary: This research explores the impact of a mobile-assisted English learning system on elementary school students' learning achievement, finding that the system benefits students' learning outcomes. Additionally, it highlights that lower levels of English anxiety and higher levels of perceived usefulness lead to better learning achievement.

COMPUTER ASSISTED LANGUAGE LEARNING (2022)

Article Computer Science, Information Systems

A consensus model considers managing manipulative and overconfident behaviours in large-scale group decision-making

Xia Liang, Jie Guo, Peide Liu

Summary: This paper investigates a novel consensus model based on social networks to manage manipulative and overconfident behaviors in large-scale group decision-making. By proposing a novel clustering model and improved methods, the consensus reaching is effectively facilitated. The feedback mechanism and management approach are employed to handle decision makers' behaviors. Simulation experiments and comparative analysis demonstrate the effectiveness of the model.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

CGN: Class gradient network for the construction of adversarial samples

Xiang Li, Haiwang Guo, Xinyang Deng, Wen Jiang

Summary: This paper proposes a method based on class gradient networks for generating high-quality adversarial samples. By introducing a high-level class gradient matrix and combining classification loss and perturbation loss, the method demonstrates superiority in the transferability of adversarial samples on targeted attacks.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Distinguishing latent interaction types from implicit feedbacks for recommendation

Lingyun Lu, Bang Wang, Zizhuo Zhang, Shenghao Liu

Summary: Many recommendation algorithms only rely on implicit feedbacks due to privacy concerns. However, the encoding of interaction types is often ignored. This paper proposes a relation-aware neural model that classifies implicit feedbacks by encoding edges, thereby enhancing recommendation performance.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Proximity-based density description with regularized reconstruction algorithm for anomaly detection

Jaehong Yu, Hyungrok Do

Summary: This study discusses unsupervised anomaly detection using one-class classification, which determines whether a new instance belongs to the target class by constructing a decision boundary. The proposed method uses a proximity-based density description and a regularized reconstruction algorithm to overcome the limitations of existing one-class classification methods. Experimental results demonstrate the superior performance of the proposed algorithm.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Non-iterative border-peeling clustering algorithm based on swap strategy

Hui Tu, Shifei Ding, Xiao Xu, Haiwei Hou, Chao Li, Ling Ding

Summary: Border-Peeling algorithm is a density-based clustering algorithm, but its complexity and issues on unbalanced datasets restrict its application. This paper proposes a non-iterative border-peeling clustering algorithm, which improves the clustering performance by distinguishing and associating core points and border points.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A two-stage denoising framework for zero-shot learning with noisy labels

Long Tang, Pan Zhao, Zhigeng Pan, Xingxing Duan, Panos M. Pardalos

Summary: In this work, a two-stage denoising framework (TSDF) is proposed for zero-shot learning (ZSL) to address the issue of noisy labels. The framework includes a tailored loss function to remove suspected noisy-label instances and a ramp-style loss function to reduce the negative impact of remaining noisy labels. In addition, a dynamic screening strategy (DSS) is developed to efficiently handle the nonconvexity of the ramp-style loss.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Selection of a viable blockchain service provider for data management within the internet of medical things: An MCDM approach to Indian healthcare

Raghunathan Krishankumar, Sundararajan Dhruva, Kattur S. Ravichandran, Samarjit Kar

Summary: Health 4.0 is gaining global attention for better healthcare through digital technologies. This study proposes a new decision-making framework for selecting viable blockchain service providers in the Internet of Medical Things (IoMT). The framework addresses the limitations in previous studies and demonstrates its applicability in the Indian healthcare sector. The results show the top ranking BSPs, the importance of various criteria, and the effectiveness of the developed model.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Q-learning with heterogeneous update strategy

Tao Tan, Hong Xie, Liang Feng

Summary: This paper proposes a heterogeneous update idea and designs HetUp Q-learning algorithm to enlarge the normalized gap by overestimating the Q-value corresponding to the optimal action and underestimating the Q-value corresponding to the other actions. To address the limitation, a softmax strategy is applied to estimate the optimal action, resulting in HetUpSoft Q-learning and HetUpSoft DQN. Extensive experimental results show significant improvements over SOTA baselines.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Dyformer: A dynamic transformer-based architecture for multivariate time series classification

Chao Yang, Xianzhi Wang, Lina Yao, Guodong Long, Guandong Xu

Summary: This paper proposes a dynamic transformer-based architecture called Dyformer for multivariate time series classification. Dyformer captures multi-scale features through hierarchical pooling and adaptive learning strategies, and improves model performance by introducing feature-map-wise attention mechanisms and a joint loss function.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

ESSENT: an arithmetic optimization algorithm with enhanced scatter search strategy for automated test case generation

Xiguang Li, Baolu Feng, Yunhe Sun, Ammar Hawbani, Saeed Hammod Alsamhi, Liang Zhao

Summary: This paper proposes an enhanced scatter search strategy, using opposition-based learning, to solve the problem of automated test case generation based on path coverage (ATCG-PC). The proposed ESSENT algorithm selects the path with the lowest path entropy among the uncovered paths as the target path and generates new test cases to cover the target path by modifying the dimensions of existing test cases. Experimental results show that the ESSENT algorithm outperforms other state-of-the-art algorithms, achieving maximum path coverage with fewer test cases.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

An attention based approach for automated account linkage in federated identity management

Shirin Dabbaghi Varnosfaderani, Piotr Kasprzak, Aytaj Badirova, Ralph Krimmel, Christof Pohl, Ramin Yahyapour

Summary: Linking digital accounts belonging to the same user is crucial for security, user satisfaction, and next-generation service development. However, research on account linkage is mainly focused on social networks, and there is a lack of studies in other domains. To address this, we propose SmartSSO, a framework that automates the account linkage process by analyzing user routines and behavior during login processes. Our experiments on a large dataset show that SmartSSO achieves over 98% accuracy in hit-precision.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A memetic algorithm with fuzzy-based population control for the joint order batching and picker routing problem

Renchao Wu, Jianjun He, Xin Li, Zuguo Chen

Summary: This paper proposes a memetic algorithm with fuzzy-based population control (MA-FPC) to solve the joint order batching and picker routing problem (JOBPRP). The algorithm incorporates batch exchange crossover and a two-level local improvement procedure. Experimental results show that MA-FPC outperforms existing algorithms in terms of solution quality.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Refining one-class representation: A unified transformer for unsupervised time-series anomaly detection

Guoxiang Zhong, Fagui Liu, Jun Jiang, Bin Wang, C. L. Philip Chen

Summary: In this study, we propose the AMFormer framework to address the problem of mixed normal and anomaly samples in deep unsupervised time-series anomaly detection. By refining the one-class representation and introducing the masked operation mechanism and cost sensitive learning theory, our approach significantly improves anomaly detection performance.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A data-driven optimisation method for a class of problems with redundant variables and indefinite objective functions

Jin Zhou, Kang Zhou, Gexiang Zhang, Ferrante Neri, Wangyang Shen, Weiping Jin

Summary: In this paper, the authors focus on the issue of multi-objective optimisation problems with redundant variables and indefinite objective functions (MOPRVIF) in practical problem-solving. They propose a dual data-driven method for solving this problem, which consists of eliminating redundant variables, constructing objective functions, selecting evolution operators, and using a multi-objective evolutionary algorithm. The experiments conducted on two different problem domains demonstrate the effectiveness, practicality, and scalability of the proposed method.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A Monte Carlo fuzzy logistic regression framework against imbalance and separation

Georgios Charizanos, Haydar Demirhan, Duygu Icen

Summary: This article proposes a new fuzzy logistic regression framework that addresses the problems of separation and imbalance while maintaining the interpretability of classical logistic regression. By fuzzifying binary variables and classifying subjects based on a fuzzy threshold, the framework demonstrates superior performance on imbalanced datasets.

INFORMATION SCIENCES (2024)