4.5 Article Proceedings Paper

Interest-based recommendations for business intelligence users

期刊

INFORMATION SYSTEMS
卷 86, 期 -, 页码 79-93

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.is.2018.08.004

关键词

User interest; Feature construction; Clustering; BI analyses; Collaborative recommender systems

向作者/读者索取更多资源

It is quite common these days for experts, casual analysts, executives and data enthusiasts, to analyze large datasets through user-friendly interfaces on top of Business Intelligence (BI) systems. However, current BI systems do not adequately detect and characterize user interests, which may lead to tedious and unproductive interactions. In this paper, we propose a collaborative recommender system for BI interactions, specifically designed to take advantage of identified user interests. Such user interests are discovered by characterizing the intent of the interaction with the BI system. Building on user modeling for proactive search systems, we identify a set of features for an adequate description of intents, and a similarity measure for grouping intents into coherent clusters. On top of these automatically identified interests, we build a collaborative recommender system based on a Markov model that represents the probability for a user to switch from one interest to another. We validate our approach experimentally with an in-depth user study, where we analyze traces of BI navigation. Our results are two-fold. First, we show that our similarity measure outperforms a state-of-the-art query similarity measure and yields a very good precision with respect to expressed user interests. Second, we compare our recommender system to two state-of-the-art systems to demonstrate the benefit of relying on user interests. (C) 2018 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Information Systems

Detecting coherent explorations in SQL workloads

Veronika Peralta, Patrick Marcel, Willeme Verdeaux, Aboubakar Sidikhy Diakhaby

INFORMATION SYSTEMS (2020)

Article Multidisciplinary Sciences

PRYNT: a tool for prioritization of disease candidates from proteomics data using a combination of shortest-path and random walk algorithms

Franck Boizard, Benedicte Buffin-Meyer, Julien Aligon, Olivier Teste, Joost P. Schanstra, Julie Klein

Summary: PRYNT, a new prioritization strategy based on urinary proteome data, outperformed other available methods in prioritizing kidney disease candidates. It can indirectly predict key proteins and has the potential for application in other biofluids and diseases.

SCIENTIFIC REPORTS (2021)

Article Computer Science, Information Systems

Enhancing Cubes with Models to Describe Multidimensional Data

Matteo Francia, Patrick Marcel, Veronika Peralta, Stefano Rizzi

Summary: IAM is a new model paradigm that allows users to explore data by expressing analysis intentions and returns annotated multidimensional data. Research challenges include automatically tuning model size, estimating model component interestingness, selecting effective visualizations, and designing visual metaphors for interaction. Method effectiveness is evaluated based on user effort, efficiency, and scalability.

INFORMATION SYSTEMS FRONTIERS (2022)

Article Computer Science, Information Systems

Coalitional Strategies for Efficient Individual Prediction Explanation

Gabriel Ferrettini, Elodie Escriva, Julien Aligon, Jean-Baptiste Excoffier, Chantal Soule-Dupuy

Summary: With the increasing application of machine learning in various fields, there is a growing demand for understanding the internal operations of models. This paper proposes a method based on attribute coalition detection, which proves to be more efficient than existing methods, reducing computation time while maintaining acceptable accuracy.

INFORMATION SYSTEMS FRONTIERS (2022)

Article Health Care Sciences & Services

Systemic Periodontal Risk Score Using an Innovative Machine Learning Strategy: An Observational Study

Paul Monsarrat, David Bernard, Mathieu Marty, Chiara Cecchin-Albertoni, Emmanuel Doumard, Laure Gez, Julien Aligon, Jean-Noel Vergnes, Louis Casteilla, Philippe Kemoun

Summary: This study proposes a personalized explainable machine learning algorithm for identifying individuals at risk of developing periodontal diseases. By analyzing the data of 532 subjects, the most contributive variables for periodontal health prediction were found to be age, BMI, smoking habits, systemic pathologies, diet, alcohol, educational level, and hormonal status. The algorithm clearly shows different risk profiles before and after a certain age, providing new strategies for periodontal health prediction.

JOURNAL OF PERSONALIZED MEDICINE (2022)

Article Computer Science, Information Systems

Highlighting the Importance of Intentional Aspects in Data Narrative Crafting Processes

Faten El Outa, Patrick Marcel, Veronika Peralta, Panos Vassiliadis

Summary: Data narration is the process of crafting narratives using facts extracted from data exploration and analysis, presented through interactive visualizations. This article introduces a comprehensive and well-founded process that supports the entire cycle of data narration, from exploring the data to visualizing the narrative. It also covers a wide range of crafting practices and is based on a conceptual model of the domain.

INFORMATION SYSTEMS FRONTIERS (2023)

Article Computer Science, Information Systems

Explanations as a New Metric for Feature Selection: A Systematic Approach

Haomiao Wang, Emmanuel Doumard, Chantal Soule-Dupuy, Philippe Kemoun, Julien Aligon, Paul Monsarrat

Summary: With the increasing use of Machine Learning in the biomedical field, there is a growing need for Explainable Artificial Intelligence (XAI) to improve transparency and reveal complex relationships between variables for medical practitioners. Feature Selection (FS) is commonly used to reduce the number of variables while preserving information, but there is limited research on the relationship between FS and model explanations. This study demonstrates the complementary nature of explanation-based metrics, accuracy, and retention rate in selecting the most appropriate FS/ML models, providing a framework for offering healthcare professionals the appropriate FS technique based on their preferences.

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS (2023)

Proceedings Paper Computer Science, Artificial Intelligence

Data Narrative Crafting via a Comprehensive and Well-Founded Process

Faten El Outa, Patrick Marcel, Veronika Peralta, Raphael da Silva, Marie Chagnoux, Panos Vassiliadis

Summary: This article proposes a comprehensive and well-founded process for crafting data narratives, covering the complete cycle from data exploration to visual rendering, and based on a conceptual model.

ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2022 (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Modeling Lifelong Pathway Co-construction

Nicolas Ringuet, Patrick Marcel, Nicolas Labroche, Thomas Devogele, Christophe Bortolaso

Summary: While personal coach applications are widely used, personal lifelong pathway co-construction is often overlooked. This paper presents a generic model that supports interaction between advisees and advisors and includes key aspects of lifelong pathways. The importance of this model is illustrated through its application in a guidance system.

CONCEPTUAL MODELING (ER 2022) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Modeling Context for Data Quality Management

Flavia Serra, Veronika Peralta, Adriana Marotta, Patrick Marcel

Summary: This paper highlights the importance of context for data quality and proposes a context model specifically designed for data quality management.

CONCEPTUAL MODELING (ER 2022) (2022)

Proceedings Paper Computer Science, Artificial Intelligence

A Chain Composite Item Recommender for Lifelong Pathways

Alexandre Chanson, Thomas Devogele, Nicolas Labroche, Patrick Marcel, Nicolas Ringuet, Vincent T'Kindt

Summary: This work addresses the problem of recommending lifelong pathways, modeling them as particular chain composite items and formalizing the recommendation problem as an orienteering problem. The approach is experimented with artificial and real datasets, showing promising results as a building block for an interactive lifelong pathways recommender system.

BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY (DAWAK 2021) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Analysis-oriented Metadata for Data Lakes

Yan Zhao, Julien Aligon, Gabriel Ferrettini, Imen Megdiche, Franck Ravat, Chantal Soule-Dupuy

Summary: The aim of this paper is to establish an easily accessible, reusable data lake by proposing an analysis-oriented metadata model. This model includes descriptive information of datasets and attributes, as well as all metadata related to machine learning analyzes on these datasets. The implementation of a data lake metadata management application allows users to search for and use existing data, processes, and analyses by finding relevant metadata stored in a NoSQL data store within the data lake.

IDEAS 2021: 25TH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (2021)

Proceedings Paper Computer Science, Theory & Methods

Explaining Single Predictions: A Faster Method

Gabriel Ferrettini, Julien Aligon, Chantal Soule-Dupuy

SOFSEM 2020: THEORY AND PRACTICE OF COMPUTER SCIENCE (2020)

Article Computer Science, Information Systems

Measuring rule-based LTLf process specifications: A probabilistic data-driven approach

Alessio Cecconi, Luca Barbaro, Claudio Di Ciccio, Arik Senderovich

Summary: This paper introduces a framework for designing probabilistic measures for declarative process specifications, which can assess the degree of compliance between process data and specifications. Through experiments, the applicability of the approach for various process mining tasks is demonstrated.

INFORMATION SYSTEMS (2024)

Article Computer Science, Information Systems

A Value Co-Creation Perspective on Data Labeling in Hybrid Intelligence Systems: A Design Study

Mahei Manhai Li, Philipp Reinhard, Christoph Peters, Sarah Oeste-Reiss, Jan Marco Leimeister

Summary: This article introduces a novel human-in-the-loop (HIL) design for ITSM support ticket recommendations by incorporating a value co-creation perspective. The design incentivizes ITSM agents to provide labels during their everyday ticket-handling procedures, and the evaluation shows that recommendations after label improvement have increased user ratings.

INFORMATION SYSTEMS (2024)

Article Computer Science, Information Systems

A survey of approaches for event sequence analysis and visualization

Anton Yeshchenko, Jan Mendling

Summary: This paper presents the development of event sequence data analysis techniques in different fields and proposes an integrated framework to facilitate collaboration and research synergy across various domains.

INFORMATION SYSTEMS (2024)

Article Computer Science, Information Systems

Adoption of IT solutions: A data-driven analysis approach

Iris Reinhartz-Berger, Alan Hartman, Doron Kliger

Summary: Many IT departments provide solutions that partially meet the needs of business units. This research aims to develop a data-driven analysis method to support the selection of solutions with higher prospects of adoption and identify design gaps and barriers.

INFORMATION SYSTEMS (2024)

Article Computer Science, Information Systems

Discovery, simulation, and optimization of business processes with differentiated resources

Orlenys Lopez-Pintado, Marlon Dumas, Jonas Berx

Summary: Business process simulation is a versatile technique that predicts the impact of changes on process performance. However, previous approaches have limitations due to their treatment of resources as undifferentiated entities. This article addresses this issue by proposing a new simulation approach that treats each resource as an individual entity with its own performance and availability. The article also presents methods for discovering simulation models with differentiated resources and optimizing resource availability calendars. Empirical evaluation demonstrates that differentiated resource models better replicate cycle time distributions and work rhythm, and iterative optimization of resource allocations and calendars leads to improved cost-time tradeoffs.

INFORMATION SYSTEMS (2024)