4.7 Letter

Semi-Supervised Billinear Subspace Learning

期刊

IEEE TRANSACTIONS ON IMAGE PROCESSING
卷 18, 期 7, 页码 1671-1676

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIP.2009.2018015

关键词

Adaptive regularization; dimensionality reduction; face recognition; semi-supervised learning

向作者/读者索取更多资源

Recent research has demonstrated the success of tensor based subspace learning in both unsupervised and supervised configurations (e.g., 2-D PCA, 2-D LDA, and DATER). In this correspondence, we present a new semi-supervised subspace learning algorithm by integrating the tensor representation and the complementary information conveyed by unlabeled data. Conventional semi-supervised algorithms mostly impose a regularization term based on the data representation in the original feature space. Instead, we utilize graph Laplacian regularization based on the low-dimensional feature space. An iterative algorithm, referred to as adaptive regularization based semi-supervised discriminant analysis with tensor representation (ARSDA/T), is also developed to compute the solution. In addition to handling tensor data, a vector-based variant (ARSDA/V) is also presented, in which the tensor data are converted into vectors before subspace learning. Comprehensive experiments on the CMU PIE and YALE-B databases demonstrate that ARSDA/T brings significant improvement in face recognition accuracy over both conventional supervised and semi-supervised subspace learning algorithms.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Engineering, Electrical & Electronic

Transformer3D-Det: Improving 3D Object Detection by Vote Refinement

Lichen Zhao, Jinyang Guo, Dong Xu, Lu Sheng

Summary: In this work, a simple but effective 3D object detection method called Transformer3D-Det (T3D) is proposed, which introduces a transformer based vote refinement module to refine the voting results and significantly improve performance.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2021)

Article Engineering, Electrical & Electronic

GeometryMotion-Net: A Strong Two-Stream Baseline for 3D Action Recognition

Jiaheng Liu, Dong Xu

Summary: In this work, a strong two-stream baseline method called GeometryMotion-Net is proposed for 3D action recognition. By utilizing new modules and virtual point clouds in both geometry and motion streams, discriminant geometry and bidirectional motion features are extracted, demonstrating effectiveness and efficiency on mainstream datasets.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2021)

Article Engineering, Electrical & Electronic

Sequential Point Cloud Upsampling by Exploiting Multi-Scale Temporal Dependency

Kaisiyuan Wang, Lu Sheng, Shuhang Gu, Dong Xu

Summary: The proposed SPU method effectively upsamples sparse, non-uniform, and orderless point cloud sequences by exploiting rich temporal dependencies from multiple inputs. The use of a new temporal alignment module and gating mechanism allows for effective aggregation of transformed features into a fused feature, which can be fed into existing frame-based point cloud upsampling methods to generate dense point clouds. Comprehensive experiments demonstrate the effectiveness of the method on benchmark datasets.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2021)

Article Computer Science, Artificial Intelligence

Exploiting Intra-Slice and Inter-Slice Redundancy for Learning-Based Lossless Volumetric Image Compression

Zhenghao Chen, Shuhang Gu, Guo Lu, Dong Xu

Summary: This work proposes an end-to-end optimized learning framework for losslessly compressing 3D volumetric data. The framework utilizes hierarchical compression scheme, intra-slice auxiliary features, and estimation of entropy model based on intra-slice and inter-slice information. Experimental results show that the framework outperforms existing lossless volumetric image codecs in compressing 3D Medical Images and Hyper-Spectral Images.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Article Computer Science, Artificial Intelligence

Progressive Cross-Stream Cooperation in Spatial and Temporal Domain for Action Localization

Rui Su, Dong Xu, Luping Zhou, Wanli Ouyang

Summary: The proposed progressive cross-stream cooperation (PCSC) framework improves spatial and temporal action localization by utilizing spatial region and features from one stream to help another stream iteratively generate better bounding boxes. This approach not only enhances the accuracy of action localization, but also improves the prediction of action classes.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2021)

Article Computer Science, Artificial Intelligence

P-CNN: Part-Based Convolutional Neural Networks for Fine-Grained Visual Categorization

Junwei Han, Xiwen Yao, Gong Cheng, Xiaoxu Feng, Dong Xu

Summary: This paper proposes a new end-to-end fine-grained visual categorization system called P-CNN, which consists of three modules: SE block for recalibrating feature responses, PLN for locating object parts, and PCN for part classification. The paper also introduces new metric learning and part classification techniques. Experimental results demonstrate the effectiveness of this approach.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2022)

Article Engineering, Electrical & Electronic

VDM-DA: Virtual Domain Modeling for Source Data-Free Domain Adaptation

Jiayi Tian, Jing Zhang, Wen Li, Dong Xu

Summary: In this work, we propose a novel approach referred to as Virtual Domain Modeling for Domain Adaptation (VDM-DA) to tackle the source data-free unsupervised domain adaptation problem. By generating virtual domain samples and designing an effective distribution alignment method, we successfully achieve the goal of distribution alignment between the source and target domains when training deep networks.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Article Engineering, Electrical & Electronic

3D-Pruning: A Model Compression Framework for Efficient 3D Action Recognition

Jinyang Guo, Jiaheng Liu, Dong Xu

Summary: This paper proposes a fully automatic model compression framework called 3D-Pruning (3DP) for efficient 3D action recognition. The compressed model can process different frames and points based on their importance values, allowing for different computational complexities. Experimental results demonstrate the effectiveness of the 3DP framework for model compression.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Article Engineering, Electrical & Electronic

Human-Centric Spatio-Temporal Video Grounding With Visual Transformers

Zongheng Tang, Yue Liao, Si Liu, Guanbin Li, Xiaojie Jin, Hongxu Jiang, Qian Yu, Dong Xu

Summary: In this work, a novel task called Human-centric Spatio-Temporal Video Grounding (HC-STVG) is introduced. HC-STVG aims to localize a spatio-temporal tube of the target person from an untrimmed video based on a given textual description, focusing on humans. This task is useful for healthcare and security applications.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Complexity-guided Slimmable Decoder for Efficient Deep Video Compression

Zhihao Hu, Dong Xu

Summary: In this paper, we propose a combination of complexity-guided slimmable decoder (cgSlimDecoder) and skip-adaptive entropy coding (SaEC) for efficient deep video compression. The cgSlimDecoder automatically determines the optimal channel width for each slimmable convolution layer and allocates the optimal number of parameters for different modules, supporting multiple complexity levels. The SaEC further speeds up the decoding process by skipping the entropy coding for well-predicted elements. Experimental results demonstrate that the proposed methods significantly improve coding efficiency with minimal performance drop.

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2023)

Proceedings Paper Computer Science, Artificial Intelligence

SketchSampler: Sketch-Based 3D Reconstruction via View-Dependent Depth Sampling

Chenjian Gao, Qian Yu, Lu Sheng, Yi-Zhe Song, Dong Xu

Summary: This paper introduces a method for reconstructing a 3D shape based on a single sketch image. By analyzing the 3D-to-2D projection process, the density map is used as a proxy to facilitate the reconstruction process. Experimental results show that this method outperforms other baseline methods in both quantitative and qualitative aspects.

COMPUTER VISION - ECCV 2022, PT I (2022)

Proceedings Paper Computer Science, Artificial Intelligence

Improving RGB-D Point Cloud Registration by Learning Multi-scale Local Linear Transformation

Ziming Wang, Xiaoliang Huo, Zhenghao Chen, Jing Zhang, Lu Sheng, Dong Xu

Summary: In this work, a new Geometry-Aware Visual Feature Extractor (GAVE) is proposed for point cloud registration, which effectively fuses geometric and visual information for more reliable correspondence estimation, outperforming existing methods.

COMPUTER VISION - ECCV 2022, PT XXXII (2022)

Article Computer Science, Artificial Intelligence

APSNet: Toward Adaptive Point Sampling for Efficient 3D Action Recognition

Jiaheng Liu, Jinyang Guo, Dong Xu

Summary: This paper investigates the trade-off between accuracy and efficiency in 3D action recognition. A simple and efficient backbone network structure and an adaptive point sampling network are proposed to achieve this trade-off. Experimental results demonstrate the effectiveness and efficiency of the proposed approach for 3D action recognition.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Article Computer Science, Artificial Intelligence

VPU: A Video-Based Point Cloud Upsampling Framework

Kaisiyuan Wang, Lu Sheng, Shuhang Gu, Dong Xu

Summary: In this work, we propose a new patch-based framework called VPU for video-based point cloud upsampling. By effectively exploiting temporal dependency among multiple consecutive point cloud frames, our method extracts and aggregates rich local geometric clues to infer the local geometry distributions at the current frame. Our framework achieves substantial performance improvement over its single frame-based counterparts on multiple point cloud sequence datasets.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

Article Computer Science, Artificial Intelligence

Structure Guided Deep Neural Network for Unsupervised Active Learning

Changsheng Li, Handong Ma, Ye Yuan, Guoren Wang, Dong Xu

Summary: The study presents a novel deep unsupervised active learning framework, which can learn nonlinear embedding and select representative samples using a self-supervised learning strategy, preserving both the global and cluster structure, addressing data imbalance issues, and improving model performance.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2022)

暂无数据