☆ 4.6 Article

Using Open Geographic Data to Generate Natural Language Descriptions for Hydrological Sensor Networks

SENSORS (2015)

期刊

SENSORS

卷 15, 期 7, 页码 16009-16026

出版社

MDPI

DOI: 10.3390/s150716009

关键词

sensor network; natural language generation; open geographic data

类别

Chemistry, Analytical Engineering, Electrical & Electronic Instruments & Instrumentation

资金

Ministry of Environment of Spain Direccion General del Agua, Ministerio de Medio Ambiente, Medio Rural y Marino)
Ministry of Science and Innovation of Spain within the VIOMATICA project [TIN200805837/TIN]

向作者/读者索取更多资源

Protocol

Reagent

摘要

Providing descriptions of isolated sensors and sensor networks in natural language, understandable by the general public, is useful to help users find relevant sensors and analyze sensor data. In this paper, we discuss the feasibility of using geographic knowledge from public databases available on the Web (such as OpenStreetMap, Geonames, or DBpedia) to automatically construct such descriptions. We present a general method that uses such information to generate sensor descriptions in natural language. The results of the evaluation of our method in a hydrologic national sensor network showed that this approach is feasible and capable of generating adequate sensor descriptions with a lower development effort compared to other approaches. In the paper we also analyze certain problems that we found in public databases (e.g., heterogeneity, non-standard use of labels, or rigid search methods) and their impact in the generation of sensor descriptions.

作者

我是这篇论文的作者

点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6

评分不足

次要评分

新颖性

-

重要性

-

科学严谨性

-

评价这篇论文

推荐

Article Computer Science, Hardware & Architecture

Improving open data web API documentation through interactivity and natural language generation

Cesar Gonzalez-Mora, Cristina Barros, Irene Garrig, Jose Zubcoff, Elena Lloret, Jose -Norberto Maz

Summary: This paper proposes a novel approach to automatically generate interactive Web API documentation. By applying natural language processing techniques, the API documentation is transformed into easily understandable natural language descriptions. Through a web interface, the documentation becomes interactive, enhancing the usability and reusability of the Web API.

COMPUTER STANDARDS & INTERFACES (2023)

添加到收藏夹

Article Biotechnology & Applied Microbiology

Next-generation cell line selection methodology leveraging data lakes, natural language generation and advanced data analytics

Stephen Goldrick, Haneen Alosert, Clare Lovelady, Nicholas J. Bond, Tarik Senussi, Diane Hatton, John Klein, Matthew Cheeks, Richard Turner, James Savery, Suzanne S. Farid

Summary: Cell line development is a crucial stage in biopharmaceutical development, and failure to fully characterize the lead clone during initial screening can lead to delays and compromise manufacturing success. This study proposes a novel cell line development methodology called CLD (4), which uses four steps to autonomously select the lead clone based on data. CLD (4) incorporates digitalization, machine learning, and natural language generation to generate an automated report summarizing relevant statistics.

FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY (2023)

添加到收藏夹

Article Computer Science, Interdisciplinary Applications

An Open Data Repository for Engineering Design: Using Text Mining with Open Government Data

Vito Giordano, Elena Coli, Antonella Martini

Summary: This paper aims to demonstrate that text mining can help make a complex open database more effective for the engineering design process, using the U.S. Open Government Data repository as a case study. It explores the expertise and data science methods required for processing different data formats, and presents significant implications and challenges for researchers, practitioners, and policy makers in the fields of engineering design and data science.

COMPUTERS IN INDUSTRY (2022)

添加到收藏夹

Article Computer Science, Information Systems

Keyword-Aware Transformers Network for Chinese Open-Domain Conversation Generation

Yang Zhou, Chenjiao Zhi, Feng Xu, Weiwei Cui, Huaqiong Wang, Aihong Qin, Xiaodiao Chen, Yaqi Wang, Xingru Huang

Summary: This paper proposes a method based on Keyword-Aware Transformers Network (KAT) to fuse contextual keywords, enabling keyword semantic enhancement. Experimental results on two Chinese open-domain dialogue datasets show that our model outperforms existing methods in both semantic and non-semantic evaluation metrics, improving Coherence, Fluency, and Informativeness in manual evaluation.

ELECTRONICS (2023)

添加到收藏夹

Article Chemistry, Multidisciplinary

Quality Control for Distantly-Supervised Data-to-Text Generation via Meta Learning

Heng Gong, Xiaocheng Feng, Bing Qin

Summary: Data-to-text generation is important in natural language processing for generating user-friendly descriptive text from structured data. Distantly-supervised data-to-text generation has been proposed to overcome the lack of annotated training corpus, but it suffers from over-generation due to the inclusion of hallucinated expressions. We address this issue by empowering the neural data-to-text model with meta learning to assign higher weights to well-aligned instances and rewrite low-quality texts. Experiments show that our model outperforms the state-of-the-art model in terms of automatic evaluation metrics and human evaluation. We also introduce a new dataset, DIST-ToTTo, for distantly-supervised data-to-text generation.

APPLIED SCIENCES-BASEL (2023)

添加到收藏夹

Article Engineering, Multidisciplinary

Sign annotation generation to alphabets via integrating visual data with somatosensory data from flexible strain sensor-based data glove

Yangyang Zhang, Weijun Xu, Xia Zhang, Liping Li

Summary: This paper proposes a sign language recognition system that combines a data glove and a camera to detect and recognize gestures using deep learning. The system is able to recognize letters in sign language in a short amount of time, aiding in the understanding of gestures as words and texts.

MEASUREMENT (2022)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Improving semantic coverage of data-to-text generation model using dynamic memory networks

Elham Seifossadat, Hossein Sameti

Summary: This paper proposes a sequence-to-sequence model called DM-NLG for data-to-text generation, which can generate natural language text from structured nonlinguistic input. By adding a dynamic memory module to the attention-based sequence-to-sequence model, it can store the information leading to the generation of previous output words and use it for generating the next word, thus preventing the generation of duplicate words or incomplete semantic concepts. Experiments on five different datasets show that the proposed DM-NLG model can reduce the slot error rate by 50% and improve BLEU by 10% compared to state-of-the-art models.

NATURAL LANGUAGE ENGINEERING (2023)

添加到收藏夹

Article Computer Science, Information Systems

DiffuD2T: Empowering Data-to-Text Generation with Diffusion

Heng Gong, Xiaocheng Feng, Bing Qin

Summary: Data-to-text generation is an important task in natural language processing, aiming to provide user-friendly text to help people understand structured data. Existing methods have shown promising results in addressing content planning and surface realization challenges. However, they lack an iterative refinement process for text generation. This paper explores enhancing data-to-text generation with an iterative refinement process via diffusion.

ELECTRONICS (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

A divide-and-conquer approach to neural natural language generation from structured data

Nina Dethlefs, Annika Schoene, Heriberto Cuayahuitl

Summary: This article proposes a novel divide-and-conquer approach to automatically induce a hierarchy of generation spaces for concept-to-text generation, which performs well in experiments and overcomes issues of data sparsity. The approach outperforms flat baselines and previous work by up to 30%.

NEUROCOMPUTING (2021)

添加到收藏夹

Review Computer Science, Artificial Intelligence

Summarization, simplification, and generation: The case of patents

Silvia Casola, Alberto Lavelli

Summary: This paper surveys Natural Language Processing (NLP) approaches in summarizing, simplifying, and generating patent text. It highlights the challenges posed by the unique characteristics of patents to the current state of NLP research, presents previous work and its evolution critically, and draws attention to areas where further research is needed. To the best of the authors' knowledge, this is the first survey of generative approaches in the patent domain.

EXPERT SYSTEMS WITH APPLICATIONS (2022)

添加到收藏夹

Article Computer Science, Theory & Methods

Entity-aware capsule network for multi-class classification of big data: A deep learning approach

Amit Kumar Jaiswal, Prayag Tiwari, Sahil Garg, M. Shamim Hossain

Summary: Researchers propose a deep learning approach based on capsule networks for named entity recognition, which can accurately predict entities in a corpus and serve as input for subsequent NLP tasks.

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE (2021)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Generative adversarial network for Table-to-Text generation

Jianyu Zhao, Zhiqiang Zhan, Tong Li, Rang Li, Changjian Hu, Siyun Wang, Yang Zhang

Summary: Table-to-Text generation models suffer from issues like nonfluency and divergence. To address these problems, a novel GAN-based model is proposed, which trains a generative model and a discriminative model simultaneously. The model achieves significant improvements in fluency and information consistency, outperforming baselines on various evaluation metrics while also constructing a new dataset to advance research in the field.

NEUROCOMPUTING (2021)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Stochastic Data-to-Text Generation Using Syntactic Dependency Information

Elham Seifossadat, Hossein Sameti

Summary: Data-to-Text Generation (D2T) is a sub-field of Natural Language Generation that transcribes structured data into natural language text. This work proposes a stochastic corpus-based model that generates tree-structured sentences based on dependency information, resulting in fluent sentences with correct grammatical structures. The model improves BLEU scores and generates high-quality utterances in various domains.

COMPUTER SPEECH AND LANGUAGE (2022)

添加到收藏夹

Article Computer Science, Theory & Methods

A Survey of Knowledge-enhanced Text Generation

Wenhao Yu, Chenguang Zhu, Zaitang Li, Zhiting Hu, Qingyun Wang, Heng Ji, Meng Jiang

Summary: The goal of text-to-text generation is to make machines express like a human in various applications. However, current text generation models often lack sufficient knowledge to generate desirable outputs. To address this issue, researchers have explored integrating internal and external knowledge into text generation systems. This survey provides a comprehensive review of the research on knowledge-enhanced text generation, including general methods and architectures, as well as specific techniques and applications.

ACM COMPUTING SURVEYS (2022)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Navigating the text generation revolution: Traditional data-to-text NLG companies and the rise of ChatGPT

Robert Dale

Summary: Since the release of ChatGPT at the end of November 2022, there has been extensive discussion about generative AI in both the technical and mainstream media. The impact of large language model technology has been debated, from disrupting search engines to affecting student essays and spreading disinformation. This article examines how major players in the field are reacting and explores potential future developments.

NATURAL LANGUAGE ENGINEERING (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

TheyBuyForYou platform and knowledge graph: Expanding horizons in public procurement with open linked data

Ahmet Soylu, Oscar Corcho, Brian Elvesaeter, Carlos Badenes-Olmedo, Tom Blount, Francisco Yedro Martinez, Matej Kovacic, Matej Posinkovic, Ian Makgill, Chris Taggart, Elena Simperl, Till C. Lech, Dumitru Roman

Summary: Public procurement is a significant market that impacts various organizations and individuals, requiring governments to ensure transparency, efficiency, and healthy competition. Initiatives like open data and cross-border data integration have the potential to enhance competition and lower barriers for smaller suppliers, but existing challenges include technical heterogeneity, data quality issues, and insufficient metadata.

SEMANTIC WEB (2022)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Handling qualitative preferences in SPARQL over virtual ontology-based data access

Marlene Goncalves, David Chaves-Fraga, Oscar Corcho

Summary: This paper presents a framework called Morph-Skyline++ for processing SPARQL qualitative preference queries by directly querying relational databases. The framework achieves high performance and accurately identifies the result set.

SEMANTIC WEB (2022)

添加到收藏夹

Article Geography

Annotating OGC web feature services automatically for generating geospatial knowledge graphs

Victor Saquicela, Luis M. Vilches-Blazquez, Renan Freire, Oscar Corcho

Summary: This article presents an approach for automatically generating semantic annotations of Web Feature Services (WFS) at different request levels to generate knowledge graphs, using external services, ontological resources, and knowledge bases. The approach allows for validating the annotations and demonstrates its feasibility through an application case.

TRANSACTIONS IN GIS (2022)

添加到收藏夹

Article Computer Science, Information Systems

Data Quality Barriers for Transparency in Public Procurement

Ahmet Soylu, Oscar Corcho, Brian Elvesaeter, Carlos Badenes-Olmedo, Francisco Yedro-Martinez, Matej Kovacic, Matej Posinkovic, Mitja Medvescek, Ian Makgill, Chris Taggart, Elena Simperl, Till C. Lech, Dumitru Roman

Summary: Open data and knowledge graph can be powerful tools for governments to prevent fraud and corruption, improve transparency, and enhance policy making. This article presents a case study in Slovenia where anomaly detection techniques were successfully applied to detect fraud and uncompetitive markets using integrated open data sets and a linked data-based platform called TheyBuyForYou. The article also provides guidelines for publishing high quality procurement data and emphasizes the importance of data quality in procurement analytics.

INFORMATION (2022)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Rule extraction in unsupervised anomaly detection for model explainability: Application to OneClass SVM

Alberto Barbado, Oscar Corcho, Richard Benjamins

Summary: This paper focuses on the black box problem in unsupervised learning, evaluating rule extraction techniques from OneClass SVM models and proposing algorithms for computing XAI-related metrics. The research evaluates the proposals with different data sets, including real-world data, aiming to extend XAI techniques to unsupervised machine learning models.

EXPERT SYSTEMS WITH APPLICATIONS (2022)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Balancing coverage and specificity for semantic labelling of subject columns

Ahmad Alobaid, Oscar Corcho

Summary: This paper presents a novel approach to automatically assigning ontology classes to entity columns in tabular data, without the need for external linguistic resources, lookup services, model training, building a model of the knowledge graph beforehand, or human involvement.

KNOWLEDGE-BASED SYSTEMS (2022)

添加到收藏夹

Article Chemistry, Analytical

Web and MATLAB-Based Platform for UAV Flight Management and Multispectral Image Processing

Nourdine Aliane, Carlos Quiterio Gomez Munoz, Javier Sanchez-Soriano

Summary: This paper proposes the development of a Web and MATLAB-based application that integrates several services in the same environment, solving the issues of task development and software integration in precision agriculture.

SENSORS (2022)

添加到收藏夹

Article Automation & Control Systems

Interpretable machine learning models for predicting and explaining vehicle fuel consumption anomalies

Alberto Barbado, Oscar Corcho

Summary: This study combines unsupervised anomaly detection techniques, domain knowledge, and interpretable machine learning models to explain abnormal fuel consumption in vehicle fleets. Results evaluated on real-world data show that this approach provides recommendations for fuel optimization adjusted to different user profiles.

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE (2022)

添加到收藏夹

Article Environmental Sciences

Overcoming Domain Shift in Neural Networks for Accurate Plant Counting in Aerial Images

Javier Rodriguez-Vazquez, Miguel Fernandez-Cortizas, David Perez-Saura, Martin Molina, Pascual Campoy

Summary: This paper proposes a novel semi-supervised approach for accurate counting and localization of tropical plants in aerial images without labeled data. The approach utilizes deep learning and domain adaptation to handle the domain shifts between training and test data, which is a common challenge in agricultural applications. By using unsupervised domain alignment and pseudolabeling, the method adapts a model trained on a labeled source dataset to an unlabeled target dataset. Experimental results demonstrate the effectiveness of this approach in counting pineapple plants in aerial images under significant domain shifts, achieving a reduction in counting error of up to 97% (1.42 in absolute count) compared to the supervised baseline (48.6 in absolute count).

REMOTE SENSING (2023)

添加到收藏夹

Article Remote Sensing

Automatic Real-Time Creation of Three-Dimensional (3D) Representations of Objects, Buildings, or Scenarios Using Drones and Artificial Intelligence Techniques

Jorge Cujo Blasco, Sergio Bemposta Rosende, Javier Sanchez-Soriano

Summary: This study presents a real-time 3D reconstruction system using drones, which leverages innovative AI techniques to achieve accurate and efficient reconstruction of 3D environments. By integrating vision, navigation, and 3D reconstruction subsystems, the system overcomes the limitations of existing applications and software in terms of speed and accuracy. The proposed system outperforms traditional software by more than 90 times and contributes to the advancement in the field of 3D reconstruction using drones.

DRONES (2023)

添加到收藏夹

Article Computer Science, Information Systems

Optimization Algorithm to Reduce Training Time for Deep Learning Computer Vision Algorithms Using Large Image Datasets With Tiny Objects

Sergio Bemposta Rosende, Javier Fernandez-Andres, Javier Sanchez-Soriano

Summary: This study proposes a technique to reduce CNN training time by using an algorithm to partition the dataset and discard unnecessary objects. The average reduction in training time is 75% without altering the training results. This tool is particularly effective for sequential images, large images, and images with small targets.

IEEE ACCESS (2023)

添加到收藏夹

Proceedings Paper Computer Science, Artificial Intelligence

Extending Ontology Engineering Practices to Facilitate Application Development

Paola Espinoza-Arias, Daniel Garijo, Oscar Corcho

Summary: This paper introduces a method for API generation based on ontologies. The authors also developed a tool to support the process and address limitations. Future work is needed to further exploit the potential of KGs and ontologies.

KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, EKAW 2022 (2022)

添加到收藏夹

Article Remote Sensing

Medium-Scale UAVs: A Practical Control System Considering Aerodynamics Analysis

Mohammad Sadeq Ale Isaac, Marco Andres Luna, Ahmed Refaat Ragab, Mohammad Mehdi Ale Eshagh Khoeini, Rupal Kalra, Pascual Campoy, Pablo Flores Pena, Martin Molina

Summary: This paper introduces a medium-scale hexacopter, called the Fan Hopper, which investigates the optimum control possibilities for a fully autonomous mission carrying a heavy payload. The research reveals that tuned Electric Ducted Fan (EDF) engines function dramatically for large payloads.

DRONES (2022)

添加到收藏夹

Article Computer Science, Information Systems

Dataset: Variable Message Signal Annotated Images for Object Detection

Enrique Puertas, Gonzalo De-Las-Heras, Javier Sanchez-Soriano, Javier Fernandez-Andres

Summary: This dataset consists of Spanish road images with annotations of Variable Message Signals, which can be used for training computer vision algorithms. It contains 1216 instances for research in road computer vision.

DATA (2022)

添加到收藏夹

Article Computer Science, Information Systems

Dataset: Roundabout Aerial Images for Vehicle Detection

Enrique Puertas, Gonzalo De-Las-Heras, Javier Fernandez-Andres, Javier Sanchez-Soriano

Summary: This article introduces a dataset of Spanish roundabouts aerial images with vehicle position annotations. The dataset contains 985,260 instances and can be used for training computer vision models, such as convolutional neural networks.

DATA (2022)

添加到收藏夹

暂无数据

© Peeref 2019-2024. All rights reserved.