4.3 Article

SVD-Based Quality Metric for Image and Video Using Machine Learning

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TSMCB.2011.2163391

关键词

Image structure; singular value decomposition (SVD); support vector regression (SVR); visual quality assessment

资金

  1. Singapore Ministry of Education [T208B1218]

向作者/读者索取更多资源

We study the use of machine learning for visual quality evaluation with comprehensive singular value decomposition (SVD)-based visual features. In this paper, the two-stage process and the relevant work in the existing visual quality metrics are first introduced followed by an in-depth analysis of SVD for visual quality assessment. Singular values and vectors form the selected features for visual quality assessment. Machine learning is then used for the feature pooling process and demonstrated to be effective. This is to address the limitations of the existing pooling techniques, like simple summation, averaging, Minkowski summation, etc., which tend to be ad hoc. We advocate machine learning for feature pooling because it is more systematic and data driven. The experiments show that the proposed method outperforms the eight existing relevant schemes. Extensive analysis and cross validation are performed with ten publicly available databases (eight for images with a total of 4042 test images and two for video with a total of 228 videos). We use all publicly accessible software and databases in this study, as well as making our own software public, to facilitate comparison in future research.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Engineering, Electrical & Electronic

AFD-Former: A Hybrid Transformer With Asymmetric Flow Division for Synthesized View Quality Enhancement

Xu Zhang, Nian Cai, Huan Zhang, Yun Zhang, Jianglei Di, Weisi Lin

Summary: This paper presents a novel U-shaped hybrid transformer called AFD-former for Synthesized View Quality Enhancement (SVQE). It combines the advantages of transformers and CNNs to capture global and local information collaboratively. By using the Asymmetric Flow Division Unit (AFDU), the model assigns different contributions of global-local information to the transformer and CNN branches across different layers, resulting in enhanced perceptual quality of synthesized views.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Engineering, Electrical & Electronic

Camera Contrast Learning for Unsupervised Person Re-Identification

Guoqing Zhang, Hongwei Zhang, Weisi Lin, Arun Kumar Chandran, Xuan Jing

Summary: Unsupervised person re-identification aims to find informative features from unlabeled person datasets. This research proposes a camera contrast learning framework, which selects camera centroids as proxies for each cluster based on time contrast principle to reduce the correlation between features and cameras. It also utilizes a 3-dimensional attention module to reduce intra-ID discrepancies caused by background shifts. Experimental results show that this method outperforms existing unsupervised person re-identification approaches on popular datasets.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Geochemistry & Geophysics

Lightweight Salient Object Detection in Optical Remote-Sensing Images via Semantic Matching and Edge Alignment

Gongyang Li, Zhi Liu, Xinpeng Zhang, Weisi Lin

Summary: In this article, a lightweight network called SeaNet is proposed for salient object detection in optical remote-sensing images (ORSI-SOD) based on semantic matching and edge alignment. SeaNet uses a lightweight MobileNet-V2 for feature extraction, a dynamic semantic matching module (DSMM) for high-level features, an edge self-alignment module (ESAM) for low-level features, and a portable decoder for inference. Experimental results demonstrate that SeaNet outperforms state-of-the-art lightweight methods and achieves comparable accuracy with conventional methods.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

Article Computer Science, Artificial Intelligence

A Thorough Benchmark and a New Model for Light Field Saliency Detection

Wei Gao, Songlin Fan, Ge Li, Weisi Lin

Summary: Compared with current RGB or RGB-D saliency detection datasets, light field saliency detection datasets suffer from defects such as insufficient data amount and diversity, incomplete data formats, and rough annotations. To address these issues, a large-scale light field dataset called PKU-LF is constructed, which includes 5,000 light fields and covers diverse indoor and outdoor scenes. PKU-LF provides inclusive representation formats of light fields and a unified platform for comparing algorithms with different input formats.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Engineering, Civil

Retrieval of total suspended solids concentration from hyperspectral sensing using hierarchical Bayesian model aggregation for optimal multiple band ratio analysis

Hui Ying Pak, Adrian Wing-Keung Law, Weisi Lin

Summary: Water quality monitoring is essential for water resource management and governance. Remote sensing with UAVs and hyperspectral sensors has shown promise as a cost-effective and efficient method. This study has developed a new method called HBMA-OMBRA for estimating TSS concentrations from hyperspectral data, which has been verified through laboratory investigations.

JOURNAL OF HYDRO-ENVIRONMENT RESEARCH (2023)

Article Automation & Control Systems

Adjacent Context Coordination Network for Salient Object Detection in Optical Remote Sensing Images

Gongyang Li, Zhi Liu, Dan Zeng, Weisi Lin, Haibin Ling

Summary: In this article, a novel adjacent context coordination network (ACCoNet) is proposed for salient object detection (SOD) in optical remote sensing images (RSIs). ACCoNet improves the performance of SOD by exploring the coordination of adjacent features in an encoder-decoder architecture and introduces local and adjacent branches to handle multilevel features. Additionally, a bifurcation-aggregation block (BAB) is introduced to capture contextual information by extending the capabilities of the classic decoder block.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

Article Engineering, Electrical & Electronic

Minimum Noticeable Difference-Based Adversarial Privacy Preserving Image Generation

Wen Sun, Jian Jin, Weisi Lin

Summary: Deep learning models are vulnerable to adversarial examples and existing methods focus on attacking models without considering perceptual quality. This research proposes a framework based on the MND concept for generating adversarial privacy preserving images that have minimum perceptual difference while attacking deep learning models.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Materials Science, Multidisciplinary

Highly sensitive liquid refractometric sensing based on a square coreless fiber functionalized with few-layer Ti3C2Tx Mxene

Qianying Feng, Jixuan Wu, Hua Bai, Binbin Song, Cheng Zhang, Wei Lin, Haifeng Liu, Shaoxiang Duan

Summary: In this study, a square coreless fiber functionalized with a Ti3C2Tx MXene layer is proposed for highly sensitive refractometric measurement. The sensitivity of the refractometric sensor is improved by more than 12% compared to the pristine fiber. The Ti3C2Tx modified square coreless fiber provides a promising platform for general ultra-low concentration analytical detection.

OPTICAL MATERIALS EXPRESS (2023)

Article Engineering, Electrical & Electronic

DisCoVQA: Temporal Distortion-Content Transformers for Video Quality Assessment

Haoning Wu, Chaofeng Chen, Liang Liao, Jingwen Hou, Wenxiu Sun, Qiong Yan, Weisi Lin

Summary: Compared with existing works, temporal relationships between frames and their influences on video quality assessment (VQA) are relatively under-studied. This study proposes a Transformer-based VQA method to tackle these issues. The method extracts spatial-temporal features and handles temporal quality attention, achieving state-of-the-art performance on multiple benchmarks.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Engineering, Electrical & Electronic

Real-World Non-Homogeneous Haze Removal by Sliding Self-Attention Wavelet Network

Yuxin Feng, Xiaozhe Meng, Fan Zhou, Weisi Lin, Zhuo Su

Summary: This paper proposes a sliding self-attention wavelet network for image haze removal in complex natural haze scenes. The method uses a sliding self-attention module to identify haze regions and uses discrete wavelet transform and inverse transform to construct a hierarchical encoder-decoder structure for gradually recovering sharp edges and precise texture details. Experimental results demonstrate that the proposed algorithm achieves favorable dehazing performance on relevant benchmark datasets.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2023)

Article Computer Science, Information Systems

Efficient Geometry Surface Coding in V-PCC

Jian Xiong, Hao Gao, Miaohui Wang, Hongliang Li, King Ngi Ngan, Weisi Lin

Summary: This paper proposes an efficient geometry surface coding (EGSC) method for video point cloud compression, which improves the compression of geometry information by establishing an error projection model and using an EP-based rate-distortion optimization method. It also introduces an occupancy-map driven scheme for merge mode prediction to enhance prediction accuracy.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Article Computer Science, Artificial Intelligence

KSS-ICP: Point Cloud Registration Based on Kendall Shape Space

Chenlei Lv, Weisi Lin, Baoquan Zhao

Summary: In this paper, a new registration method called KSS-ICP is proposed for rigid registration in Kendall shape space (KSS) with Iterative Closest Point (ICP). The KSS is a quotient space that removes influences of translations, scales, and rotations for shape feature-based analysis. The KSS-ICP achieves accurate registration from point clouds and outperforms the state-of-the-art.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

Article Computer Science, Information Systems

Auto-Weighted Layer Representation Based View Synthesis Distortion Estimation for 3-D Video Coding

Jian Jin, Xingxing Zhang, Lili Meng, Weisi Lin, Jie Liang, Huaxiang Zhang, Yao Zhao

Summary: In this paper, an auto-weighted layer representation based view synthesis distortion estimation model is proposed, which calculates sub-synthesis distortion and learns a nonlinear mapping function to obtain the associated weights. It can efficiently and accurately estimate the view synthesis distortion.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Article Computer Science, Information Systems

Interaction-Matrix Based Personalized Image Aesthetics Assessment

Jingwen Hou, Weisi Lin, Guanghui Yue, Weide Liu, Baoquan Zhao

Summary: Personalized image aesthetics assessment aims to estimate aesthetic experiences based on individual preferences. This research proposes a method that directly estimates personalized aesthetic experiences from the interaction between image contents and user preferences, without the need for prior knowledge on generic aesthetics assessment. Extensive experiments show that the proposed method outperforms previous personalized methods and generic methods in terms of both personalized and generic aesthetics assessment.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Article Geochemistry & Geophysics

Uplink-Assist Downlink Remote-Sensing Image Compression via Historical Referencing

Huiwen Wang, Liang Liao, Jing Xiao, Weisi Lin, Mi Wang

Summary: This article proposes an enhanced remote-sensing image compression approach that utilizes uplink assistance to improve compression efficiency. By leveraging historical images from ground stations as reference images for on-orbit compression, spatiotemporal redundancy in remote-sensing images can be effectively eliminated. The proposed dual-end referencing downsampling-based coding framework effectively mitigates fake texture generation and achieves significant bitrate savings compared to standard compression baselines.

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING (2023)

暂无数据