4.7 Article

Video Colorization Using Parallel Optimization in Feature Space

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCSVT.2013.2276702

关键词

Gabor feature space; parallel optimization; user strokes; video colorization

资金

  1. National Natural Science Foundation of China [61202154, 61133009]
  2. National Basic Research Project of China [2011CB302203]
  3. Shanghai Pujiang Program [13PJ1404500]
  4. RGC [416311]
  5. UGC Direct Grants for Research [2050454, 2050485]
  6. European Union [256941]
  7. Reality CG
  8. Open Projects Program of the National Laboratory of Pattern Recognition
  9. Open Project Program of the State Key Laboratory of CADCG [A1206]
  10. Zhejiang University

向作者/读者索取更多资源

We present a new scheme for video colorization using optimization in rotation-aware Gabor feature space. Most current methods of video colorization incur temporal artifacts and prohibitive processing costs, while this approach is designed in a spatiotemporal manner to preserve temporal coherence. The parallel implementation on graphics hardware is also facilitated to achieve realtime performance of color optimization. By adaptively clustering video frames and extending Gabor filtering to optical flow computation, we can achieve real-time color propagation within and between frames. Temporal coherence is further refined through user scribbles in video frames. The experimental results demonstrate that our proposed approach is efficient in producing high-quality colorized videos.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Information Systems

PhotoHelper: Portrait Photographing Guidance Via Deep Feature Retrieval and Fusion

Nan Jiang, Bin Sheng, Ping Li, Tong-Yee Lee

Summary: We introduce a new photographing guidance (PhotoHelper) that enhances the portrait photo quality for amateur photographers using deep feature retrieval and fusion. Our model combines empirical aesthetic rules, traditional machine learning algorithms, and deep neural networks to extract various features in color and space aspects. Through a series of processes, our approach provides users with professional reference photographs and automatically generates spatial composition guidance, resulting in significantly improved aesthetic quality of portrait photos.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Article Computer Science, Artificial Intelligence

FSAD-Net: Feedback Spatial Attention Dehazing Network

Yu Zhou, Zhihua Chen, Ping Li, Haitao Song, C. L. Philip Chen, Bin Sheng

Summary: In this article, a novel end-to-end dehazing method called feedback spatial attention dehazing network (FSAD-Net) is proposed. FSAD-Net improves dehazing performance by utilizing feedback connections and attention mechanisms, and it shows superiority in both quantitative and qualitative experiments.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Engineering, Electrical & Electronic

Coupled Solution of Volume Integral and Hydrodynamic Equations to Analyze Electromagnetic Scattering From Composite Nanostructures

Doolos Aibek Uulu, Rui Chen, Liang Chen, Ping Li, Hakan Bagci

Summary: This study solves a coupled system of volume integral and hydrodynamic equations to analyze electromagnetic scattering from nanostructures consisting of metallic and dielectric parts. The hydrodynamic equation relates the free electron polarization current to the electric flux, while the volume integral equation relates the electric flux and the free electron polarization current to the scattered electric field. An efficient two-level iterative solver is proposed to solve the matrix system generated by the expansions of the unknown electric flux and free electron polarization current. Numerical experiments demonstrate the accuracy, efficiency, and applicability of the proposed method.

IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION (2023)

Article Computer Science, Information Systems

EAPT: Efficient Attention Pyramid Transformer for Image Processing

Xiao Lin, Shuzhou Sun, Wei Huang, Bin Sheng, Ping Li, David Dagan Feng

Summary: Recent transformer-based models, especially patch-based methods, have shown great potential in vision tasks. However, fixed-size patch division ignores the diversity of visual elements and hinders global attention information extraction. To address these issues, an Efficient Attention Pyramid Transformer (EAPT) is proposed, which includes Deformable Attention for non-fixed attention information and an Encode-Decode Communication module for complete global attention information. A specifically designed position encoding for vision transformers is also introduced. Extensive experiments and ablation studies demonstrate the effectiveness and key components of the proposed model.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Article Computer Science, Artificial Intelligence

Mobile game props recommendation for machine learning

Can Yang, Ping Li

Summary: Mobile game providers benefit from selling virtual items, and by utilizing multi-instance multi-label learning algorithm and semi-supervised learning, the recommendation performance of game props is improved, leading to a significant increase in game revenue, as confirmed by offline data sets and online game experimental results.

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS (2023)

Article Materials Science, Characterization & Testing

Feature-Sensitive Ring Artifact Correction for Computed Tomography

Markus Wedekind, Susana Castillo, Marcus Magnor

Summary: This article introduces a method for correcting ring artifacts in computed tomography (CT) reconstruction. The method compensates for errors in the gain and offset values of each pixel and reduces blur by inferring information from neighboring pixels. Experimental results show that this method effectively mitigates the shortcomings of purely offset-based approaches and approaches using all projections, and can be efficiently implemented.

JOURNAL OF NONDESTRUCTIVE EVALUATION (2023)

Article Computer Science, Information Systems

Multi-stage feature-fusion dense network for motion deblurring

Cai Guo, Qian Wang, Hong-Ning Dai, Ping Li

Summary: In this paper, a multi-stage feature-fusion dense network (MFFDNet) is proposed for motion deblurring, which improves the deblurring effect by reusing the extracted features and reducing computation costs. Experimental results show that MFFDNet outperforms state-of-the-art methods while maintaining a relatively small computing cost.

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION (2023)

Article Engineering, Electrical & Electronic

A Wave Equation-Based Hybridizable Discontinuous Galerkin-Robin Transmission Condition Algorithm for Electromagnetic Problems Analyzing

Xuan Zhang, Shi Min Liu, Ran Zhao, Xiao Chun Li, Jun Fa Mao, Li Jun Jiang, Ping Li

Summary: In this work, a wave-equation-based discontinuous Galerkin (DG) method hybridized with the Robin transmission condition (DG-RTC) is developed to solve frequency-domain electromagnetic (EM) problems. The proposed method discretizes the vector electric field wave equation in each subdomain and introduces numerical flux at the subdomain interfaces. An auxiliary equation based on the tangential continuity of EM fields is introduced to solve for the magnetic field H at the interfaces. A finite-element tearing and interconnecting (FETI)-like approach is used to reduce the computational cost. Several examples are presented to validate the proposed DG method.

IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES (2023)

Article Engineering, Biomedical

Fine-Grained and Multiple Classification for Alzheimer's Disease With Wavelet Convolution Unit Network

Jinyu Wen, Yang Li, Meie Fang, Lei Zhu, David Dagan Feng, Ping Li

Summary: The article introduces a novel wavelet convolution unit for image-oriented neural networks, combining wavelet analysis with convolution operator to extract deep abstract features efficiently. By fusing traditional convolution and wavelet decomposition, as well as incorporating multi-scale decompositions, it aims to improve classification accuracy.

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING (2023)

Article Engineering, Electrical & Electronic

Localization and Identification of EMI Sources in Shielding Enclosures Based on a Two-Step Source Reconstruction Method

Zi An Wang, Jun Fa Mao, Li Jun Jiang, Ping Li

Summary: This paper proposes a two-step source reconstruction method to address the localization and identification of unknown electromagnetic interference (EMI) sources in shielding enclosures. The actual EMI sources are initially modeled by equally distributed equivalent electric dipoles and the effects of surrounding environments are incorporated using the numerical Green's function. The parameters of equivalent dipoles are then updated to improve the precision of the equivalent model by introducing additional magnetic dipoles.

IEEE TRANSACTIONS ON ELECTROMAGNETIC COMPATIBILITY (2023)

Article Engineering, Electrical & Electronic

Efficient Thermal Analysis of Integrated Circuits and Packages With Microchannel Cooling Using Laguerre-Based Layered Finite Element Method

Bo Li, Min Tang, Ping Li, Junfa Mao

Summary: In this article, an efficient approach named Laguerre-based layered finite element method (LB-LFEM) is presented for transient thermal analysis of integrated circuits (ICs) and packages with microchannel cooling. The LB-LFEM utilizes a marching-on-in-order scheme based on weighted Laguerre polynomials to eliminate the time variables and reduce the system matrix. By solving the Laguerre coefficients recursively, the computational efficiency is significantly improved. The validity and high efficiency of LB-LFEM are demonstrated by several examples.

IEEE JOURNAL ON MULTISCALE AND MULTIPHYSICS COMPUTATIONAL TECHNIQUES (2023)

Article Computer Science, Information Systems

S 3 Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection

Lei Zhu, Xiaoqiang Wang, Ping Li, Xin Yang, Qing Zhang, Weiming Wang, Carola-Bibiane Schonlieb, C. L. Philip Chen

Summary: RGB-D salient object detection aims to detect visually distinctive objects or regions from a pair of the RGB image and the depth image. In this work, we propose a self-supervised self-ensembling network (S-3 Net) for semi-supervised RGB-D salient object detection by leveraging the unlabeled data and exploring a self-supervised learning mechanism. Experimental results demonstrate that our network outperforms the state-of-the-art methods on seven widely-used benchmark datasets.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Article Computer Science, Artificial Intelligence

A New Framework of Collaborative Learning for Adaptive Metric Distillation

Hao Liu, Mang Ye, Yan Wang, Sanyuan Zhao, Ping Li, Jianbing Shen

Summary: This article presents a new adaptive metric distillation approach that significantly improves the backbone features of student networks and achieves better classification results. Previous knowledge distillation methods focused on transferring knowledge across classifier logits or feature structure, neglecting the excessive sample relations in the feature space. The proposed collaborative adaptive metric distillation (CAMD) optimizes the relationship between key pairs, adapts the metric for student embeddings using teacher embeddings as supervision, and employs a collaborative scheme for knowledge aggregation. Extensive experiments show that CAMD outperforms other cutting-edge distillers and sets a new state-of-the-art in both classification and retrieval tasks.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

Article Computer Science, Artificial Intelligence

MNGNAS: Distilling Adaptive Combination of Multiple Searched Networks for One-Shot Neural Architecture Search

Zhihua Chen, Guhao Qiu, Ping Li, Lei Zhu, Xiaokang Yang, Bin Sheng

Summary: Recently, neural architecture search (NAS) has attracted attention from academia and industry. However, it remains a challenging problem due to the large search space and computational costs. In this study, a multi-teacher-guided NAS approach was proposed to improve the search efficiency and precision. Experimental results show that this approach achieves significant improvements in both accuracy and efficiency.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

Article Education & Educational Research

GAN-based pencil drawing learning system for art education on large-scale image datasets with learning analytics

Yuxi Jin, Ping Li, Wenxiao Wang, Suiyun Zhang, Di Lin, Chengjiu Yin

Summary: We have designed a generative adversarial network (GAN)-based pencil drawing learning system for art education on large image datasets. Students can learn how to draw pencil drawings for natural scenes by uploading an image and getting a pencil drawing example. Teachers can conveniently assign homework and understand the learning demands of students using the system.

INTERACTIVE LEARNING ENVIRONMENTS (2023)

暂无数据