4.7 Article

Relation Extraction Using Distant Supervision: A Survey

Journal

ACM COMPUTING SURVEYS
Volume 51, Issue 5, Pages -

Publisher

ASSOC COMPUTING MACHINERY
DOI: 10.1145/3241741

Keywords

Relation extraction; distant supervision; knowledge graph

Funding

  1. European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme [683253/GraphInt]

Ask authors/readers for more resources

Relation extraction is a subtask of information extraction where semantic relationships are extracted from natural language text and then classified. In essence, it allows us to acquire structured knowledge from unstructured text. In this article, we present a survey of relation extraction methods that leverage pre-existing structured or semi-structured data to guide the extraction process. We introduce a taxonomy of existing methods and describe distant supervision approaches in detail. We describe, in addition, the evaluation methodologies and the datasets commonly used for quality assessment. Finally, we give a high-level outlook on the field, highlighting open problems as well as the most promising research directions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Automation & Control Systems

Playing Atari with few neurons Improving the efficacy of reinforcement learning by decoupling feature extraction and decision making

Giuseppe Cuccu, Julian Togelius, Philippe Cudre-Mauroux

Summary: The proposed method allows for learning compact state representations and policies separately and simultaneously for policy approximation in vision-based applications. By separating image processing from action selection, better understanding of each task and potentially finding smaller policy representations is possible. The study uses a compact encoder and two novel algorithms for state representation learning, along with a new evolutionary algorithm to address the issue of producing large inputs as the dictionary size increases.

AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS (2021)

Article Computer Science, Artificial Intelligence

Event Detection on Microposts: A Comparison of Four Approaches

Akansha Bhardwaj, Albert Blarer, Philippe Cudre-Mauroux, Vincent Lenders, Boris Motik, Axel Tanner, Alberto Tonon

Summary: Microblogging services like Twitter are important sources of information. We proposed a successful system ArmaTweet for semantic event detection on Twitter streams, and explored and empirically evaluated various approaches for event detection on microposts, with results showing ArmaTweet outperforming other methods in most cases and a combined approach offering highest recall without adversely affecting precision.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2021)

Article Computer Science, Information Systems

Managing Big Interval Data with CINTIA: The Checkpoint INTerval Array

Ruslan Mavlyutov, Philippe Cudre-Mauroux

Summary: This paper introduces CINTIA, an efficient data structure for storing and querying interval data, achieving high memory locality and outperforming current solutions. Additionally, a low-latency Big Data system is proposed to implement CINTIA on popular distributed file systems and manage large interval data effectively in clusters of commodity machines.

IEEE TRANSACTIONS ON BIG DATA (2021)

Article Computer Science, Artificial Intelligence

Streaming Graph Embeddings via Incremental Neighborhood Sketching

Dingqi Yang, Bingqing Qu, Jie Yang, Liang Wang, Philippe Cudre-Mauroux

Summary: SGSketch is a highly-efficient streaming graph embedding technique that can generate high-quality node embeddings from a streaming graph by gradually forgetting outdated streaming edges, and efficiently update the generated node embeddings via an incremental embedding updating mechanism.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)

Editorial Material Computer Science, Hardware & Architecture

Leveraging Social Context for Fake News Detection

Philippe Cudre-Mauroux

COMMUNICATIONS OF THE ACM (2022)

Article Computer Science, Information Systems

DBMS Annihilator: A High-Performance Database Workload Generator in Action

Alberto Lerner, Matthias Jasny, Theo Jepsen, Carsten Binnig, Philippe Cudre-Mauroux

Summary: Modern DBMS engines achieve unprecedented transaction processing speeds, but benchmarking networked clients remains challenging. This demo presents a new framework that leverages hardware-software co-design for benchmarking.

PROCEEDINGS OF THE VLDB ENDOWMENT (2022)

Proceedings Paper Computer Science, Cybernetics

DiBB: Distributing Black-Box Optimization

Giuseppe Cuccu, Luca Rolshoven, Fabien Vorpe, Philippe Cudre-Mauroux, Tobias Glasmachers

Summary: DiBB is a meta-algorithm and framework that addresses the scalability issue of Black-Box Optimization (BBO). It creates a Partially Separable (PS) version of any existing black-box algorithm and distributes the computation to a set of machines while retaining the advanced features of the underlying algorithm. The performance of DiBB scales with the number of parameter-blocks defined.

PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'22) (2022)

Article Computer Science, Artificial Intelligence

LBSN2Vec++: Heterogeneous Hypergraph Embedding for Location-Based Social Networks

Dingqi Yang, Bingqing Qu, Jie Yang, Philippe Cudre-Mauroux

Summary: In this paper, the authors propose LBSN2Vec++, a heterogeneous hypergraph embedding approach designed specifically for LBSN data. By automatically learning features, LBSN2Vec++ outperforms other methods and handcrafted features in friendship and location prediction tasks.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2022)

Proceedings Paper Computer Science, Artificial Intelligence

City-Stories: Combining Entity Linking, Multimedia Retrieval, and Crowdsourcing to Make Historical Data Accessible

Laura Rettig, Shaban Shabani, Loris Sauter, Philippe Cudre-Mauroux, Maria Sokhn, Heiko Schuldt

Summary: The City-Stories system combines entity linking, multimedia retrieval, and crowdsourcing to make historical images searchable even across collections.

WEB ENGINEERING, ICWE 2021 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

ConvTab: A Context-Preserving, Convolutional Model for Ad-Hoc Table Retrieval

Vibhav Agarwal, Akansha Bhardwaj, Paolo Rosso, Philippe Cudre-Mauroux

Summary: Ad-hoc table retrieval involves finding tables relevant to a search query, with existing research utilizing various methods to address this issue, but most methods face challenges in constructing semantic representations of tabular data. Researchers propose the ConvTab method based on Convolutional Neural Networks, which achieves remarkable results in table retrieval tasks.

2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Wiki2Prop: A Multimodal Approach for Predicting Wikidata Properties from Wikipedia

Michael Luggen, Julien Audiffren, Djellel Difallah, Philippe Cudre-Mauroux

Summary: Wikidata is becoming an important resource for various online tasks, and the Wiki2Prop method proposed in this work significantly outperforms competitors in predicting missing properties for entities, with the ability to incorporate multilingual and multimodal data to further improve prediction accuracy. This system provides a valuable tool for filling knowledge gaps in Wikidata and can be used as a property recommender system directly on Wikidata entity pages.

PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Peer Grading the Peer Reviews: A Dual-Role Approach for Lightening the Scholarly Paper Review Process

Ines Arous, Jie Yang, Mourad Khayati, Philippe Cudre-Mauroux

Summary: The increase in paper submissions challenges the effectiveness of scientific peer review, prompting the proposal of a human-AI approach to estimate review conformity. Through a large-scale crowdsourced study, this approach has shown superior performance and easy integration into existing peer review systems, reducing the burden on meta-reviewers.

PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

RETA: A Schema-Aware, End-to-End Solution for Instance Completion in Knowledge Graphs

Paolo Rosso, Dingqi Yang, Natalia Ostapuk, Philippe Cudre-Mauroux

Summary: This paper focuses on an instance completion task of suggesting relation and tail entities for a given head entity, proposing an end-to-end solution called RETA which outperforms existing techniques. RETA-Filter generates high-quality candidate pairs, while RETA-Grader significantly outperforms state-of-the-art link prediction techniques in the instance completion task.

PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

MARTA: Leveraging Human Rationales for Explainable Text Classification

Ines Arous, Ljiljana Dolamic, Jie Yang, Akansha Bhardwaj, Giuseppe Cuccu, Philippe Cudre-Mauroux

Summary: The study introduces a hybrid human-AI approach that incorporates human rationales into attention-based text classification models to enhance the explainability of classification results. MARTA framework jointly learns model and worker reliability, leading to improved classification explainability and accuracy.

THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE (2021)

Article Computer Science, Information Systems

In-Network Support for Transaction Triaging

Theo Jepsen, Alberto Lerner, Fernando Pedone, Robert Soule, Philippe Cudre-Mauroux

Summary: Transaction Triaging is a set of techniques that manipulate streams of transaction requests and responses to improve performance when reaching the database server. The algorithms do not interfere with transaction execution and can be easily ported across different database systems. Leveraging programmable networking hardware, triaging brings significant throughput improvements in high-overhead network stacks.

PROCEEDINGS OF THE VLDB ENDOWMENT (2021)

No Data Available