4.5 Article

Semantic-aware blind image quality assessment

Journal

SIGNAL PROCESSING-IMAGE COMMUNICATION
Volume 60, Issue -, Pages 237-252

Publisher

ELSEVIER
DOI: 10.1016/j.image.2017.10.009

Keywords

Blind image quality assessment; No-reference image quality metrics (NR-IQM); Quality of experience (QoE); Image semantics; Subjective quality datasets

Funding

  1. SURF Cooperative

Ask authors/readers for more resources

Many studies have indicated that predicting users' perception of visual quality depends on various factors other than artifact visibility alone, such as viewing environment, social context, or user personality. Exploiting information on these factors, when applicable, can improve users' quality of experience while saving resources. In this paper, we improve the performance of existing no-reference image quality metrics (NR-IQM) using image semantic information (scene and object categories), building on our previous findings that image scene and object categories influence user judgment of visual quality. We show that adding scene category features, object category features, or the combination of both to perceptual quality features results in significantly higher correlation with user judgment of visual quality. We also contribute a new publicly available image quality dataset which provides subjective scores on images that cover a wide range of scene and object category evenly. As most public image quality datasets so far span limited semantic categories, this new dataset opens new possibilities to further explore image semantics and quality of experience. (C) 2017 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

One deep music representation to rule them all? A comparative analysis of different representation learning strategies

Jaehun Kim, Julian Urbano, Cynthia C. S. Liem, Alan Hanjalic

NEURAL COMPUTING & APPLICATIONS (2020)

Article Computer Science, Artificial Intelligence

Unified Binary Generative Adversarial Network for Image Retrieval and Compression

Jingkuan Song, Tao He, Lianli Gao, Xing Xu, Alan Hanjalic, Heng Tao Shen

INTERNATIONAL JOURNAL OF COMPUTER VISION (2020)

Article Chemistry, Analytical

CorrNet: Fine-Grained Emotion Recognition for Video Watching Using Wearable Physiological Sensors

Tianyi Zhang, Abdallah El Ali, Chen Wang, Alan Hanjalic, Pablo Cesar

Summary: The study proposes a correlation-based emotion recognition algorithm (CorrNet) to recognize the valence and arousal of each instance using wearable physiological signals, achieving promising recognition accuracies in indoor and outdoor environments. Results show that instance segment lengths between 1-4s result in the highest recognition accuracies, and large amounts of neutral V-A labels affect the recognition performance. The study also found that the accuracies between laboratory-grade and wearable sensors are comparable, even under low sampling rates.

SENSORS (2021)

Article Computer Science, Information Systems

Accuracy-diversity trade-off in recommender systems via graph convolutions

Elvin Isufi, Matteo Pocchiari, Alan Hanjalic

Summary: The study introduces a joint graph convolutional model that balances accuracy and diversity in recommender systems by learning convolutions from nearest neighbor and furthest neighbor graphs, with the information between the two modules balanced in training through a regularizer inspired by multi-kernel learning. The proposed method can significantly improve catalog coverage or diversity within the list, with diversity gains up to seven times by trading as little as 1% in accuracy.

INFORMATION PROCESSING & MANAGEMENT (2021)

Article Computer Science, Artificial Intelligence

Radial Graph Convolutional Network for Visual Question Generation

Xing Xu, Tan Wang, Yang Yang, Alan Hanjalic, Heng Tao Shen

Summary: This article introduces an innovative answer-centric approach called radial graph convolutional network (Radial-GCN) for visual question generation (VQG). Experimental results demonstrate the superiority of this method over reference methods on three benchmark datasets, and even boost the performance of state-of-the-art VQA methods significantly in the challenging zero-shot VQA task.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2021)

Article Computer Science, Information Systems

Cross-Modal Hybrid Feature Fusion for Image-Sentence Matching

Xing Xu, Yifan Wang, Yixuan He, Yang Yang, Alan Hanjalic, Heng Tao Shen

Summary: In this study, a novel CMHF framework is proposed for directly learning the image-sentence similarity by fusing multimodal features with inter- and intra-modality relations incorporated. The framework utilizes flexible attention mechanisms to generate effective attention flows within and across the modalities of images and sentences, capturing high-level interactions between visual regions in images and words in sentences. The structured objective with ranking loss constraint in CMHF is demonstrated to effectively learn the image-sentence similarity based on the fused fine-grained features of different modalities, achieving state-of-the-art matching performance.

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS (2021)

Article Computer Science, Information Systems

Towards user-oriented privacy for recommender system data: A personalization-based approach to gender obfuscation for user profiles

Manel Slokom, Alan Hanjalic, Martha Larson

Summary: This paper introduces a new privacy solution called PerBlur for protecting user privacy while training a recommender system, by adding and removing items from user profiles to generate obfuscated user-item matrix. Results show that gender obfuscation impacts the fairness and diversity of recommender system results, highlighting the importance of maintaining fairness and enhancing diversity for user recommendations.

INFORMATION PROCESSING & MANAGEMENT (2021)

Article Computer Science, Artificial Intelligence

Joint Feature Synthesis and Embedding: Adversarial Cross-Modal Retrieval Revisited

Xing Xu, Kaiyi Lin, Yang Yang, Alan Hanjalic, Heng Tao Shen

Summary: This article proposes a novel method called Joint Feature Synthesis and Embedding (JFSE), which utilizes two coupled conditional Wassertein GAN modules to synthesize meaningful and correlated multimodal features, and employs advanced distribution alignment schemes and cycle-consistency constraints to preserve semantic compatibility and enable knowledge transfer in a shared embedding space. Experimental results show that the JFSE method achieves significant accuracy improvement in standard retrieval and newly explored zero-shot and generalized zero-shot retrieval tasks.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Engineering, Multidisciplinary

Temporal Network Prediction and Interpretation

Li Zou, Xiu-Xiu Zhan, Jie Sun, Alan Hanjalic, Huijuan Wang

Summary: This study focuses on predicting temporal networks using interpretable learning algorithms like Lasso Regression and Random Forest. The results show that the next step activity of a particular link is mainly influenced by its current activity and links strongly correlated in the time series and close in distance in the aggregated network.

IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING (2022)

Article Engineering, Electrical & Electronic

Task-Aware Connectivity Learning for Incoming Nodes Over Growing Graphs

Bishwadeep Das, Alan Hanjalic, Elvin Isufi

Summary: This paper discusses the importance of connectivity information for data processing on expanding graphs with new nodes. By modeling the attachment of new nodes without connectivity information and showing implicit constraints on spectral perturbation, the paper provides a task-driven data processing approach. Numerical results confirm the superior performance of the proposed approach in the absence of connectivity information.

IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS (2022)

Article Computer Science, Theory & Methods

Influence of clustering coefficient on network embedding in link prediction

Omar F. Robledo, Xiu-Xiu Zhan, Alan Hanjalic, Huijuan Wang

Summary: This paper investigates the impact of network topology on the performance of network embedding algorithms in link prediction. The results show that a higher clustering coefficient leads to better link prediction performance, except for Matrix Factorisation which is not sensitive to changes in clustering coefficient. The study found that the algorithms tend to assign a higher likelihood of connection to node pairs with a higher number of common neighbors, regardless of the clustering coefficient. The predicted networks have more triangles and higher clustering coefficient as a result. The findings suggest that increasing the clustering coefficient improves link prediction performance, except for Matrix Factorisation.

APPLIED NETWORK SCIENCE (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Leave No User Behind: Towards Improving the Utility of Recommender Systems for Non-mainstream Users

Roger Zhe Li, Julian Urbano, Alan Hanjalic

Summary: This paper presents a method to address mainstream bias by adding an autoencoder layer, improving recommendations for nonmainstream users.

WSDM '21: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (2021)

Proceedings Paper Computer Science, Information Systems

New Insights into Metric Optimization for Ranking-based Recommendation

Roger Zhe Li, Julian Urbano, Alan Hanjalic

Summary: Direct optimization of IR metrics in ranking-based recommender systems may not necessarily lead to the best performance, as shown in an experimental study comparing the relative merits of different IR metrics. RBP-inspired losses offer consistent and clear benefits, especially for more active users, challenging the current research practice of optimizing and evaluating the same metric in recommendation systems.

SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (2021)

Article Acoustics

Generating Images From Spoken Descriptions

Xinsheng Wang, Tingting Qiao, Jihua Zhu, Alan Hanjalic, Odette Scharenborg

Summary: This paper introduces a new speech technology task - speech-to-image generation framework, showcasing its potential applications in unwritten languages. Through experiments, the effectiveness of S2IGAN in synthesizing high-quality and semantically-consistent images has been demonstrated.

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING (2021)

Article Mathematics, Interdisciplinary Applications

Are Nearby Neighbors Relatives? Testing Deep Music Embeddings

Jaehun Kim, Julian Urbano, Cynthia C. S. Liem, Alan Hanjalic

FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS (2019)

Article Engineering, Electrical & Electronic

Image tone mapping based on clustering and human visual system models

Xueyu Han, Ishtiaq Rasool Khan, Susanto Rahardja

Summary: This paper proposes a clustering-based TMO method by embedding human visual system models to adapt to different HDR scenes. The method reduces computational complexity using a hierarchical scheme for clustering and enhances local contrast by superimposing details and controlling color saturation by limiting the adaptive saturation parameter. Experimental results show that the proposed method achieves improvements in generating high quality tone-mapped images compared to competing methods.

SIGNAL PROCESSING-IMAGE COMMUNICATION (2024)

Article Engineering, Electrical & Electronic

YOLO-PAI: Real-time handheld call behavior detection algorithm and embedded application

Zuopeng Zhao, Tianci Zheng, Kai Hao, Junjie Xu, Shuya Cui, Xiaofeng Liu, Guangming Zhao, Jie Zhou, Chen He

Summary: The research team developed a handheld phone detection network called YOLO-PAI, which successfully achieved real-time detection and underwent testing under various conditions. Experimental results show that YOLO-PAI reduces network structure parameters and computational costs while maintaining accuracy, outperforming other popular networks in terms of speed and accuracy.

SIGNAL PROCESSING-IMAGE COMMUNICATION (2024)

Article Engineering, Electrical & Electronic

ClGanNet: A novel method for maize leaf disease identification using ClGan and deep CNN

Vivek Sharma, Ashish Kumar Tripathi, Purva Daga, M. Nidhi, Himanshu Mittal

Summary: In this study, a novel ClGan method is proposed for automated plant disease detection. The method reduces the number of parameters and addresses the issues of vanishing gradients, training instability, and non-convergence by using an encoder-decoder network. Additionally, an improved loss function is introduced to stabilize the learning process and optimize weights effectively. Furthermore, a new plant leaf classification method called ClGanNet is introduced, achieving 99.97% training accuracy and 99.04% testing accuracy using the least number of parameters.

SIGNAL PROCESSING-IMAGE COMMUNICATION (2024)

Article Engineering, Electrical & Electronic

Individual tooth segmentation in human teeth images using pseudo edge-region obtained by deep neural networks

Seongeun Kim, Chang-Ock Lee

Summary: This article introduces a method for segmenting individual teeth in human teeth images by using deep neural networks to obtain pseudo edge-regions and applying active contour models for segmentation.

SIGNAL PROCESSING-IMAGE COMMUNICATION (2024)