4.6 Article

Event detection and evolution in multi-lingual social streams

Journal

FRONTIERS OF COMPUTER SCIENCE
Volume 14, Issue 5, Pages -

Publisher

HIGHER EDUCATION PRESS
DOI: 10.1007/s11704-019-8201-6

Keywords

event detection; event evolution; stream processing; multi-lingual anomaly detection

Funding

  1. NSFC program [61872022, 61421003, U1636123]
  2. Beijing Advanced Innovation Center for Big Data and Brain Computing
  3. [SKLSDE-2018ZX-16]

Ask authors/readers for more resources

Real-life events are emerging and evolving in social and news streams. Recent methods have succeeded in capturing designed features of monolingual events, but lack of interpretability and multi-lingual considerations. To this end, we propose a multi-lingual event mining model, namely MLEM, to automatically detect events and generate evolution graph in multilingual hybrid-length text streams including English, Chinese, French, German, Russian and Japanese. Specially, we merge the same entities and similar phrases and present multiple similarity measures by incremental word2vec model. We propose an 8-tuple to describe event for correlation analysis and evolution graph generation. We evaluate the MLEM model using a massive human-generated dataset containing real world events. Experimental results show that our new model MLEM outperforms the baseline method both in efficiency and effectiveness.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

Multi-View Tensor Graph Neural Networks Through Reinforced Aggregation

Xusheng Zhao, Qiong Dai, Jia Wu, Hao Peng, Mingsheng Liu, Xu Bai, Jianlong Tan, Senzhang Wang, Philip S. Yu

Summary: In this paper, a novel Reinforced Tensor Graph Neural Network (RTGNN) framework is proposed to effectively learn multi-view graph representation by reinforcing inter- and intra-graph aggregation. The framework utilizes tensor decomposition to extract the graph structure features (GSFs) of each view, which contain the correlation information among multiple views. Furthermore, a reinforcement learning guided scheme is developed to automatically calculate the optimal filtering threshold for each view, improving the performance of intra-graph aggregation.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

Reinforced, Incremental and Cross-Lingual Event Detection From Social Messages

Hao Peng, Ruitong Zhang, Shaoning Li, Yuwei Cao, Shirui Pan, Philip S. Yu

Summary: This paper presents a novel reinforced, incremental, and cross-lingual social event detection architecture, FinEvent, which models social messages into heterogeneous graphs and uses reinforcement learning algorithm to select optimal aggregation thresholds. It addresses the challenges of ambiguous event features, dispersive text contents, and multiple languages in existing event detection methods for streaming social messages, thereby improving accuracy and generalization ability.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Biochemistry & Molecular Biology

Morphological and physiological mechanism of cytoplasmic inheritance stigma exsertion trait expression in tobacco (Nicotiana tabacum)

Juxu Li, Man Li, Weimin Wang, Dong Wang, Yuwei Hu, Yunyun Zhang, Xiaoquan Zhang

Summary: In this study, the morphological and physiological mechanisms of cytoplasmic inheritance stigma exsertion trait expression in a tobacco line (MSK326SE) were investigated. It was found that the exserted stigma phenotype of MSK326SE was mainly caused by corolla shortening, which was stable under different environmental temperature. The difference in cell division and expansion in different flower organs, as well as the decrease in corolla epidermal cell size, were the main factors contributing to the stigma exsertion of MSK326SE. Exogenous JA could further enhance the stigma exsertion degree.

PLANT SCIENCE (2023)

Article Chemistry, Multidisciplinary

Donor Extension on Spiro-Acridine Enables Highly Efficient TADF-OLEDs with Relieved Efficiency Roll-Off

Hao Peng, Yulin Xu, Changjiang Zhou, Ranran Pei, Jingsheng Miao, He Liu, Chuluo Yang

Summary: This study develops three thermally activated delayed fluorescent (TADF) emitters based on fluorene-spiro-acridine derivatives as donors, which exhibit high external quantum efficiency (EQE) and low efficiency roll-off. The emitters achieve high photoluminescence quantum yields (PLQYs) of 96-99% due to the rigid donor and acceptor composition. The enhancement of reverse intersystem crossing (k(RISC)) in IA-TRZ and the acceleration of radiative decay rate (k(r)) in 2S-TRZ and IT-TRZ contribute to their high electroluminescent performance.

ADVANCED FUNCTIONAL MATERIALS (2023)

Article Chemistry, Multidisciplinary

A Tetrahedral Bisacridine Donor Enables Fast Radiative Decay in Thermally Activated Delayed Fluorescence Emitter

Ranran Pei, Yulin Xu, Jingsheng Miao, Hao Peng, Zhanxiang Chen, Changjiang Zhou, He Liu, Chuluo Yang

Summary: Simultaneously achieving high efficiency and low efficiency roll-off is crucial for the further application of thermally activated delayed fluorescent (TADF) emitters. This study proposes a quinolino-acridine (QAc) donor, composed of two acridine units with planar (pAc) and bended (bAc) geometries, combined with triazine to assemble a TADF emitter QAc-TRZ. The pAc ensures decent TADF behavior through interaction with triazine, while the bAc enhances radiative decay through delocalization of the highest occupied molecular orbital (HOMO). Notably, QAc-TRZ enables a highly efficient organic light emitting diode (OLED) with a maximum external quantum efficiency (EQE) of 37.3%, and maintains efficiencies of 36.3% and 31.7% under 100/1000 cd m(-2), respectively, even at 10,000 cd m(-2).

ANGEWANDTE CHEMIE-INTERNATIONAL EDITION (2023)

Article Thermodynamics

Combustion Characteristics and NO Emissions during Co-Combustion of Coal Gangue and Coal Slime in O2/CO2 Atmospheres

Hao Peng, Baofeng Wang, Wenxiu Li, Fengling Yang, Fangqin Cheng

Summary: Coal slime added during coal gangue combustion affects combustion characteristics and leads to NO emissions, causing environmental pollution. Reducing NO emissions is possible in O-2/CO2 atmospheres. This study investigates the co-combustion of coal gangue and coal slime in O-2/CO2 atmospheres, finding that coal slime enhances combustion activity and co-combustion shows synergistic effects. The proportion of pyrrole and nitrogen oxide in the ashes increases with temperature, while pyridine and quaternary nitrogen disappear, and total nitrogen content in ash decreases. These findings contribute to understanding the co-combustion process and provide data for practical industrial applications.

JOURNAL OF THERMAL SCIENCE (2023)

Article Computer Science, Artificial Intelligence

Adaptive curvature exploration geometric graph neural network

Xingcheng Fu, Jianxin Li, Jia Wu, Jiawen Qin, Qingyun Sun, Cheng Ji, Senzhang Wang, Hao Peng, Philip S. Yu

Summary: Current graph neural networks (GNNs) based on Euclidean space embedding struggle to represent diverse graph geometric properties. To address this issue, we propose the Adaptive Curvature Exploration Geometric Graph Neural Network, which automatically learns high-quality graph representations and explores embedding spaces with optimal curvature.

KNOWLEDGE AND INFORMATION SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

Domain-Invariant Feature Progressive Distillation with Adversarial Adaptive Augmentation for Low-Resource Cross-Domain NER

Tao Zhang, Congying Xia, Zhiwei Liu, Shu Zhao, Hao Peng, Philip Yu

Summary: This article proposes an adversarial adaptive augmentation method to address the problems in cross-domain named entity recognition. It integrates adversarial strategy into a multi-task learner to improve the quality of domain adaptive data generation and extract domain-invariant features to solve the domain shift and label-sparsity problems. The article also introduces a progressive domain-invariant feature distillation framework that uses multi-grained Maximum Mean Discrepancy approach to extract multi-level domain invariant features for knowledge transfer across domains. Extensive comparative experiments on four English and two Chinese benchmarks demonstrate the importance of adversarial augmentation and effective adaptation from high-resource domains to low-resource target domains. Comparison with two vanilla and four latest baselines shows state-of-the-art performance and superiority in both zero-resource and minimal-resource scenarios.

ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING (2023)

Article Biodiversity Conservation

Conservation and sustainable development of coastal species of horticultural importance: insights from genetic and environmental patterns at spatio-temporal scale

Achyut Kumar Banerjee, Jiakai Wang, Hui Feng, Yuting Lin, Xinru Liang, Minghui Yin, Hao Peng, Weixi Li, Tengjiao Li, Wuxia Guo, Yelin Huang

Summary: Understanding genetic patterns and environmental influence on Thespesia populnea is crucial for its conservation and sustainable use. This study focused on the species' phylogeographic pattern, genetic diversity distribution, and range dynamics under changing environmental conditions. Findings suggested sustainable use of genetically diverse populations, measures for preserving lineages' evolutionary potential, and prioritized conservation areas.

BIODIVERSITY AND CONSERVATION (2023)

Article Computer Science, Artificial Intelligence

Heterogeneous graph neural network with semantic-aware differential privacy guarantees

Yuecen Wei, Xingcheng Fu, Dongqi Yan, Qingyun Sun, Hao Peng, Jia Wu, Jinyan Wang, Xianxian Li

Summary: Most social networks can be modeled as heterogeneous graphs, but existing privacy-preserving methods neglect the significance of high-order semantic information. To address this issue, we propose HeteSDG, a novel heterogeneous graph neural network that provides double privacy guarantee and performance trade-off. We reveal the privacy leakage in heterogeneous graphs and design a two-stage mechanism based on differential privacy to defend against privacy attacks and provide stronger interpretation.

KNOWLEDGE AND INFORMATION SYSTEMS (2023)

Article Chemistry, Physical

The enhanced photocatalytic and photothermal effects of Ti3C2 Mxene quantum dot/macroscopic porous graphitic carbon nitride heterojunction for Hydrogen Production

Jinghua Li, Hao Peng, Bing Luo, Jiamei Cao, Lijing Ma, Dengwei Jing

Summary: A new heterostructure between Ti3C2 MXene quantum dot and 3D macroscopic porous graphitic carbon nitride (PGCN) was successfully obtained using in situ electrostatic self-assembly techniques. The optimized Ti3C2 QD/PGCN composites exhibit significantly enhanced photocatalytic H2 evolution rate compared to pristine CN and PGCN. The improved performance is attributed to the synergistic effects of Ti3C2 quantum dots and the good photothermal conversion ability of PGCN.

JOURNAL OF COLLOID AND INTERFACE SCIENCE (2023)

Article Computer Science, Artificial Intelligence

Higher-Order Attribute-Enhancing Heterogeneous Graph Neural Networks

Jianxin Li, Hao Peng, Yuwei Cao, Yingtong Dou, Hekai Zhang, Philip S. Yu, Lifang He

Summary: Graph neural networks (GNNs) are widely used in deep learning for graph analysis tasks. However, current methods ignore heterogeneity in real-world graphs and fail to capture content-based correlations between nodes. In this paper, we propose a novel HAE framework and a HAE(GNN) model that incorporates meta-paths and meta-graphs for rich, heterogeneous semantics and leverages self-attention mechanism for exploring content-based interactions between nodes.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)

Article Computer Science, Artificial Intelligence

Lifelong Property Price Prediction: A Case Study for the Toronto Real Estate Market

Hao Peng, Jianxin Li, Zheng Wang, Renyu Yang, Mingsheng Liu, Mingming Zhang, Philip S. Yu, Lifang He

Summary: Luce is a life-long predictive model that addresses the lack of recent sold prices and sparsity of house data in property valuation. It utilizes a heterogeneous information network (HIN) to organize house data and employs a Graph Convolutional Network (GCN) and Long Short Term Memory (LSTM) network to extract spatial and temporal information for accurate valuation.

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2023)

Article Engineering, Electrical & Electronic

Low-Cost Substrate Integrated Waveguide Equalizer Based on the Indium Tin Oxides Conductive Film

Jun Dong, Fan Yin, Taixing Jiang, Xiang Zhong, Yang Yang, Hao Peng

Summary: This work presents and studies a low-cost and mechanical reconfigurable substrate integrated waveguide (SIW) equalizer. Indium tin oxides (ITO) are introduced instead of Tantalum Nitride (TaN) or absorbing material as the resistive element in SIW equalizer, providing more structural stability, excellent high frequency characteristic, and easy integration with traditional printed circuit boards. The proposed equalizer based on ITO Conductive Film demonstrates mechanical reconfigurability, low cost, and high stability, making ITO a good candidate for high millimeter-wave band equalizer design.

PROGRESS IN ELECTROMAGNETICS RESEARCH LETTERS (2023)

Article Chemistry, Physical

Techno-economic assessment of a chemical looping splitting system for H2 and CO Co-generation

Hao Peng, Zichen Di, Pan Gong, Fengling Yang, Fangqin Cheng

Summary: A novel double chemical looping process is proposed for the co-production of high purity hydrogen and carbon monoxide. The system achieves high efficiency and economic competitiveness, making it a promising method for hydrogen production.

GREEN ENERGY & ENVIRONMENT (2023)

No Data Available