4.7 Article

A time optimized scheme for top-k list maintenance over incomplete data streams

Journal

INFORMATION SCIENCES
Volume 311, Issue -, Pages 59-73

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2015.03.035

Keywords

Data streams; Top-k list; Top-k list maintenance; Optimal stopping theory

Ask authors/readers for more resources

A large number of contemporary research efforts focus on incomplete data streams handling. These efforts, usually, focus on the creation and maintenance of top-k lists useful to provide efficient responses to top-k queries. In case of large volumes of data accumulated at high rates the problem becomes more intense as an efficient method for maintaining the top-k list is considered imperative. In this paper, we focus on the behavior of an Observer Entity (OE) responsible to observe the incoming data and initiate the maintenance process of the top-k list. The maintenance process involves the calculation of a score for each object and the update of the top-k list. We adopt the principles of Optimal Stopping Theory (OST) and introduce a scheme that determines, in the appropriate time, when the OE decides to initiate the maintenance process. Due to the incomplete data setting, the high rate of the incoming data and the large volume of data, the maintenance process is initiated only when the OE has the appropriate amount of information for providing a top-k list. In contrast to other research approaches, we do not maintain any additional sets of objects. Instead, we attempt to minimize the number of the necessary processing tasks over the list of objects. We present a mathematical analysis for our scheme and an extensive experimental evaluation. The comparison with other models shows that the proposed model provides efficiency in the list management and minimizes the required time to result the final top-k list. (c) 2015 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Theory & Methods

A probabilistic model for assigning queries at the edge

Kostas Kolomvatsos, Christos Anagnostopoulos

COMPUTING (2020)

Article Computer Science, Artificial Intelligence

Predictive intelligence of reliable analytics in distributed computing environments

Yiannis Kathidjiotis, Kostas Kolomvatsos, Christos Anagnostopoulos

APPLIED INTELLIGENCE (2020)

Article Computer Science, Information Systems

To Transmit or Not to Transmit: Controlling Communications in the Mobile IoT Domain

K. Panagidi, C. Anagnostopoulos, A. Chalvatzaras, S. Hadjiefthymiades

ACM TRANSACTIONS ON INTERNET TECHNOLOGY (2020)

Article Computer Science, Theory & Methods

Adaptive learning of aggregate analytics under dynamic workloads

Fotis Savva, Christos Anagnostopoulos, Peter Triantafillou

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE (2020)

Article Computer Science, Hardware & Architecture

Edge-centric inferential modeling & analytics

Christos Anagnostopoulos

JOURNAL OF NETWORK AND COMPUTER APPLICATIONS (2020)

Article Computer Science, Theory & Methods

Data quality-aware task offloading in Mobile Edge Computing: An Optimal Stopping Theory approach

Ibrahim Alghamdi, Christos Anagnostopoulos, Dimitrios P. Pezaros

Summary: An important use case of Mobile Edge Computing is task and data offloading, which is beneficial for a variety of mobile applications. By adopting Optimal Stopping Theory, the data quality-aware offloading sequential decision making problem can be addressed to minimize the expected processing time. The proposed OST models show significant improvement in minimizing the expected processing time for analytics task execution.

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE (2021)

Article Computer Science, Theory & Methods

Proactive, uncertainty-driven queries management at the edge

Kostas Kolomvatsos, Christos Anagnostopoulos

Summary: The research community has identified challenges in data processing at the Cloud, particularly related to latency and waiting times for data to travel from collection points. An Edge Computing approach is proposed as a solution, where data can be processed closer to their source. This approach aims to efficiently allocate queries to available edge nodes, utilizing Fuzzy Logic theory and a Support Vector Machine-based machine learning model. The aim is to manage uncertainties in the problem and enhance decision making in processing queries.

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE (2021)

Article Computer Science, Hardware & Architecture

Dynamic UPF placement and chaining reconfiguration in 5G networks

Irian Leyva-Pupo, Cristina Cervello-Pastor, Christos Anagnostopoulos, Dimitrios P. Pezaros

Summary: This paper addresses the problem of dynamic user plane function placement and chaining reconfiguration (UPCR) in a MEC environment to cope with user mobility while guaranteeing cost reductions and acceptable quality of service (QoS). The proposed heuristic algorithm, dynamic priority and cautious UPCR (DPC-UPCR), provides near-optimal solutions within significantly shorter times than the mathematical model. Additionally, the scheduling mechanism based on optimal stopping theory outperforms baseline solutions in terms of the number of reconfiguration events and QoS levels.

COMPUTER NETWORKS (2022)

Article Computer Science, Theory & Methods

Local & Federated Learning at the network edge for efficient predictive analytics

Natascha Harth, Christos Anagnostopoulos, Hans-Joerg Voegel, Kostas Kolomvatsos

Summary: This study addresses the importance of utilizing device computational capacity for machine learning and analytics tasks in edge computing and IoT environments. A methodology based on federated learning principles is proposed to ensure the generalization capability of locally trained models, incorporating adaptive weighting for optimal model selection decision making. Comparative and performance evaluation demonstrate significant improvement in prediction quality compared to existing approaches.

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE (2022)

Article Computer Science, Hardware & Architecture

Knowledge reuse in edge computing environments

Qianyu Long, Kostas Kolomvatsos, Christos Anagnostopoulos

Summary: This paper introduces a novel paradigm in edge computing where edge nodes reuse locally completed computations to reduce the burden on resources. The paradigm is enhanced with lightweight monitoring mechanisms to forecast violations and updates, and feasibility is studied through similarity metrics among datasets.

JOURNAL OF NETWORK AND COMPUTER APPLICATIONS (2022)

Article Computer Science, Information Systems

A Proactive Statistical Model Supporting Services and Tasks Management in Pervasive Applications

Kostas Kolomvatsos, Christos Anagnostopoulos

Summary: This study proposes a proactive scheme to support collaborative activities and makes decisions on the presence of services through statistical inference of service demand and node performance, filling a research gap in this field.

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Query Driven Data Subspace Mapping

Panagiotis Fountas, Maria Papathanasaki, Kostas Kolomvatsos, Christos Anagnostopoulos

Summary: This paper presents a hierarchical query-driven clustering approach for efficient data mapping in remote datasets. By combining a Query-Based Learning (QBL) model with hierarchical clustering, the proposed method achieves fast response to a large number of query sets.

ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2022, PART II (2022)

Article Engineering, Electrical & Electronic

Data-Driven Analytics Task Management Reasoning Mechanism in Edge Computing

Christos Anagnostopoulos, Tahani Aladwani, Ibrahim Alghamdi, Konstantinos Kolomvatsos

Summary: In this paper, the authors study computational offloading techniques in autonomous computing nodes at the network edge and propose a task-management mechanism based on the popularity of tasks and data availability. The mechanism utilizes a fuzzy inference system to determine the probability of executing tasks locally, offloading them to peer computing nodes, or offloading them to the Cloud.

SMART CITIES (2022)

Article Computer Science, Artificial Intelligence

Intelligent tasks allocation at the edge based on machine learning and bio-inspired algorithms

Madalena Soula, Anna Karanika, Kostas Kolomvatsos, Christos Anagnostopoulos, George Stamoulis

Summary: This paper discusses the role of edge computing in IoT and cloud computing, explores research on task allocations in the EC ecosystem, and proposes a batch processing model and two allocation models. The study demonstrates the advantages and limitations of the models in task allocations.

EVOLVING SYSTEMS (2022)

Article Computer Science, Artificial Intelligence

Large-scale predictive modeling and analytics through regression queries in data management systems

Christos Anagnostopoulos, Peter Triantafillou

INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS (2020)

Article Computer Science, Information Systems

A consensus model considers managing manipulative and overconfident behaviours in large-scale group decision-making

Xia Liang, Jie Guo, Peide Liu

Summary: This paper investigates a novel consensus model based on social networks to manage manipulative and overconfident behaviors in large-scale group decision-making. By proposing a novel clustering model and improved methods, the consensus reaching is effectively facilitated. The feedback mechanism and management approach are employed to handle decision makers' behaviors. Simulation experiments and comparative analysis demonstrate the effectiveness of the model.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

CGN: Class gradient network for the construction of adversarial samples

Xiang Li, Haiwang Guo, Xinyang Deng, Wen Jiang

Summary: This paper proposes a method based on class gradient networks for generating high-quality adversarial samples. By introducing a high-level class gradient matrix and combining classification loss and perturbation loss, the method demonstrates superiority in the transferability of adversarial samples on targeted attacks.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Distinguishing latent interaction types from implicit feedbacks for recommendation

Lingyun Lu, Bang Wang, Zizhuo Zhang, Shenghao Liu

Summary: Many recommendation algorithms only rely on implicit feedbacks due to privacy concerns. However, the encoding of interaction types is often ignored. This paper proposes a relation-aware neural model that classifies implicit feedbacks by encoding edges, thereby enhancing recommendation performance.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Proximity-based density description with regularized reconstruction algorithm for anomaly detection

Jaehong Yu, Hyungrok Do

Summary: This study discusses unsupervised anomaly detection using one-class classification, which determines whether a new instance belongs to the target class by constructing a decision boundary. The proposed method uses a proximity-based density description and a regularized reconstruction algorithm to overcome the limitations of existing one-class classification methods. Experimental results demonstrate the superior performance of the proposed algorithm.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Non-iterative border-peeling clustering algorithm based on swap strategy

Hui Tu, Shifei Ding, Xiao Xu, Haiwei Hou, Chao Li, Ling Ding

Summary: Border-Peeling algorithm is a density-based clustering algorithm, but its complexity and issues on unbalanced datasets restrict its application. This paper proposes a non-iterative border-peeling clustering algorithm, which improves the clustering performance by distinguishing and associating core points and border points.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A two-stage denoising framework for zero-shot learning with noisy labels

Long Tang, Pan Zhao, Zhigeng Pan, Xingxing Duan, Panos M. Pardalos

Summary: In this work, a two-stage denoising framework (TSDF) is proposed for zero-shot learning (ZSL) to address the issue of noisy labels. The framework includes a tailored loss function to remove suspected noisy-label instances and a ramp-style loss function to reduce the negative impact of remaining noisy labels. In addition, a dynamic screening strategy (DSS) is developed to efficiently handle the nonconvexity of the ramp-style loss.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Selection of a viable blockchain service provider for data management within the internet of medical things: An MCDM approach to Indian healthcare

Raghunathan Krishankumar, Sundararajan Dhruva, Kattur S. Ravichandran, Samarjit Kar

Summary: Health 4.0 is gaining global attention for better healthcare through digital technologies. This study proposes a new decision-making framework for selecting viable blockchain service providers in the Internet of Medical Things (IoMT). The framework addresses the limitations in previous studies and demonstrates its applicability in the Indian healthcare sector. The results show the top ranking BSPs, the importance of various criteria, and the effectiveness of the developed model.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Q-learning with heterogeneous update strategy

Tao Tan, Hong Xie, Liang Feng

Summary: This paper proposes a heterogeneous update idea and designs HetUp Q-learning algorithm to enlarge the normalized gap by overestimating the Q-value corresponding to the optimal action and underestimating the Q-value corresponding to the other actions. To address the limitation, a softmax strategy is applied to estimate the optimal action, resulting in HetUpSoft Q-learning and HetUpSoft DQN. Extensive experimental results show significant improvements over SOTA baselines.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Dyformer: A dynamic transformer-based architecture for multivariate time series classification

Chao Yang, Xianzhi Wang, Lina Yao, Guodong Long, Guandong Xu

Summary: This paper proposes a dynamic transformer-based architecture called Dyformer for multivariate time series classification. Dyformer captures multi-scale features through hierarchical pooling and adaptive learning strategies, and improves model performance by introducing feature-map-wise attention mechanisms and a joint loss function.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

ESSENT: an arithmetic optimization algorithm with enhanced scatter search strategy for automated test case generation

Xiguang Li, Baolu Feng, Yunhe Sun, Ammar Hawbani, Saeed Hammod Alsamhi, Liang Zhao

Summary: This paper proposes an enhanced scatter search strategy, using opposition-based learning, to solve the problem of automated test case generation based on path coverage (ATCG-PC). The proposed ESSENT algorithm selects the path with the lowest path entropy among the uncovered paths as the target path and generates new test cases to cover the target path by modifying the dimensions of existing test cases. Experimental results show that the ESSENT algorithm outperforms other state-of-the-art algorithms, achieving maximum path coverage with fewer test cases.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

An attention based approach for automated account linkage in federated identity management

Shirin Dabbaghi Varnosfaderani, Piotr Kasprzak, Aytaj Badirova, Ralph Krimmel, Christof Pohl, Ramin Yahyapour

Summary: Linking digital accounts belonging to the same user is crucial for security, user satisfaction, and next-generation service development. However, research on account linkage is mainly focused on social networks, and there is a lack of studies in other domains. To address this, we propose SmartSSO, a framework that automates the account linkage process by analyzing user routines and behavior during login processes. Our experiments on a large dataset show that SmartSSO achieves over 98% accuracy in hit-precision.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A memetic algorithm with fuzzy-based population control for the joint order batching and picker routing problem

Renchao Wu, Jianjun He, Xin Li, Zuguo Chen

Summary: This paper proposes a memetic algorithm with fuzzy-based population control (MA-FPC) to solve the joint order batching and picker routing problem (JOBPRP). The algorithm incorporates batch exchange crossover and a two-level local improvement procedure. Experimental results show that MA-FPC outperforms existing algorithms in terms of solution quality.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

Refining one-class representation: A unified transformer for unsupervised time-series anomaly detection

Guoxiang Zhong, Fagui Liu, Jun Jiang, Bin Wang, C. L. Philip Chen

Summary: In this study, we propose the AMFormer framework to address the problem of mixed normal and anomaly samples in deep unsupervised time-series anomaly detection. By refining the one-class representation and introducing the masked operation mechanism and cost sensitive learning theory, our approach significantly improves anomaly detection performance.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A data-driven optimisation method for a class of problems with redundant variables and indefinite objective functions

Jin Zhou, Kang Zhou, Gexiang Zhang, Ferrante Neri, Wangyang Shen, Weiping Jin

Summary: In this paper, the authors focus on the issue of multi-objective optimisation problems with redundant variables and indefinite objective functions (MOPRVIF) in practical problem-solving. They propose a dual data-driven method for solving this problem, which consists of eliminating redundant variables, constructing objective functions, selecting evolution operators, and using a multi-objective evolutionary algorithm. The experiments conducted on two different problem domains demonstrate the effectiveness, practicality, and scalability of the proposed method.

INFORMATION SCIENCES (2024)

Article Computer Science, Information Systems

A Monte Carlo fuzzy logistic regression framework against imbalance and separation

Georgios Charizanos, Haydar Demirhan, Duygu Icen

Summary: This article proposes a new fuzzy logistic regression framework that addresses the problems of separation and imbalance while maintaining the interpretability of classical logistic regression. By fuzzifying binary variables and classifying subjects based on a fuzzy threshold, the framework demonstrates superior performance on imbalanced datasets.

INFORMATION SCIENCES (2024)