☆ 4.7 Article

Video Colorization Using Parallel Optimization in Feature Space

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2014)

期刊

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

卷 24, 期 3, 页码 407-417

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TCSVT.2013.2276702

关键词

Gabor feature space; parallel optimization; user strokes; video colorization

类别

Engineering, Electrical & Electronic

资金

National Natural Science Foundation of China [61202154, 61133009]
National Basic Research Project of China [2011CB302203]
Shanghai Pujiang Program [13PJ1404500]
RGC [416311]
UGC Direct Grants for Research [2050454, 2050485]
European Union [256941]
Reality CG
Open Projects Program of the National Laboratory of Pattern Recognition
Open Project Program of the State Key Laboratory of CADCG [A1206]
Zhejiang University

向作者/读者索取更多资源

Protocol

Reagent

摘要

We present a new scheme for video colorization using optimization in rotation-aware Gabor feature space. Most current methods of video colorization incur temporal artifacts and prohibitive processing costs, while this approach is designed in a spatiotemporal manner to preserve temporal coherence. The parallel implementation on graphics hardware is also facilitated to achieve realtime performance of color optimization. By adaptively clustering video frames and extending Gabor filtering to optical flow computation, we can achieve real-time color propagation within and between frames. Temporal coherence is further refined through user scribbles in video frames. The experimental results demonstrate that our proposed approach is efficient in producing high-quality colorized videos.

作者

我是这篇论文的作者

点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7

评分不足

次要评分

新颖性

-

重要性

-

科学严谨性

-

评价这篇论文

推荐

Article Computer Science, Information Systems

VCGAN: Video Colorization With Hybrid Generative Adversarial Network

Yuzhi Zhao, Lai-Man Po, Wing-Yin Yu, Yasar Abbas Ur Rehman, Mengyang Liu, Yujia Zhang, Weifeng Ou

Summary: We propose VCGAN, an improved approach to video colorization addressing the issues of temporal consistency and unification of colorization network and refinement network. VCGAN utilizes additional networks to enhance colorization quality and spatiotemporal consistency. Experimental results show that VCGAN produces higher-quality and temporally more consistent colorful videos than existing approaches.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

添加到收藏夹

Article Computer Science, Software Engineering

Reference-Based Deep Line Art Video Colorization

Min Shi, Jia-Qi Zhang, Shu-Yu Chen, Lin Gao, Yu-Kun Lai, Fang-Lue Zhang

Summary: This paper presents a deep architecture for automatically coloring line art videos with the same color style as the reference images. The proposed framework consists of a color transform network and a temporal refinement network. Experimental results show that our method achieves the best performance in line art video coloring.

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS (2023)

添加到收藏夹

Article Mathematics

Parallel Dense Video Caption Generation with Multi-Modal Features

Xuefei Huang, Ka-Hou Chan, Wei Ke, Hao Sheng

Summary: This work proposes a parallel-based dense video captioning method that can address the mutual constraint between event proposals and captions. It introduces a deformable Transformer framework to reduce or eliminate manual threshold of hyperparameters. Experimental results show that the proposed method outperforms other methods in this area, providing competitive results on the ActivityNet Caption dataset.

MATHEMATICS (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Broad Colorization

Yuxi Jin, Bin Sheng, Ping Li, C. L. Philip Chen

Summary: This automatic colorization method combines local and global features for coloring without user input and with short training time. The system uses local and global broad learning systems for obtaining and refining chrominance values, significantly reducing training time compared to traditional methods.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2021)

添加到收藏夹

Article Computer Science, Software Engineering

Mood-Driven Colorization of Virtual Indoor Scenes

Michael Solah, Haikun Huang, Jiachuan Sheng, Tian Feng, Marc Pomplun, Lap-Fai Yu

Summary: One of the challenges in virtual scene design is invoking a specific mood in viewers, and we propose a novel approach to automatically adjust the colors of textures in a virtual indoor scene to match a target mood. Our approach was tested in different indoor scenes and showed efficacy in a user study using VR headsets.

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS (2022)

添加到收藏夹

Article Computer Science, Artificial Intelligence

SVCNet: Scribble-Based Video Colorization Network With Temporal Aggregation

Yuzhi Zhao, Lai-Man Po, Kangcheng Liu, Xuehui Wang, Wing-Yin Yu, Pengfei Xian, Yujia Zhang, Mengyang Liu

Summary: In this paper, a scribble-based video colorization network called SVCNet is proposed for colorizing monochrome videos based on different user-given color scribbles. It addresses common issues in the colorization area and utilizes two sequential sub-networks for precise colorization and temporal smoothing. The experimental results demonstrate that SVCNet produces higher-quality and more temporally consistent videos than other well-known video colorization approaches, as evaluated on DAVIS and Videvo benchmarks.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2023)

添加到收藏夹

Article Computer Science, Hardware & Architecture

Video colorization with semantic preservation and consistency

Xiangjun Zhao, Zhenlong Du

Summary: This paper proposes a text-driven video colorization method which incorporates a hybrid deep network for image object segmentation, foreground object colorization, background completion, text to image object linkage, and spatiotemporal preservation. The network, composed of a two-layer LSTM and mask RCNN, adopts a flexible training strategy to avoid the high cost of separate training while benefiting from text instruction, foreground object recognition, and location. Experiments show that the method outperforms state-of-the-art techniques in quantitative criteria and user studies confirm the preserved color naturalness, saturation, and richness in the colorized videos.

COMPUTERS & ELECTRICAL ENGINEERING (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Structure-preserving feature alignment for old photo colorization

Yingxue Pang, Xin Jin, Jun Fu, Zhibo Chen

Summary: Deep learning techniques trained on large-scale datasets have made significant advancements in reference-based colorization. However, directly applying these methods to old photo colorization is challenging due to the lack of ground truth and domain gap. To tackle this problem, we propose a CNN-based algorithm called SFAC that trains on only two images for old photo colorization, overcoming the domain gap issue.

PATTERN RECOGNITION (2024)

添加到收藏夹

Article Computer Science, Artificial Intelligence

STDAN: Deformable Attention Network for Space-Time Video Super-Resolution

Hai Wang, Xiaoyu Xiang, Yapeng Tian, Wenming Yang, Qingmin Liao

Summary: This research proposes a deformable attention network called STDAN for increasing the spatial-temporal resolution of low-resolution and low frame rate videos. It includes a LSTFI module for excavating content from neighboring input frames and a STDFA module for capturing and aggregating spatial and temporal contexts in dynamic video frames. Experimental results show that this approach outperforms existing STVSR methods.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

CPNet: Continuity Preservation Network for infrared video colorization

Cheng Cheng, Hang Wang, Xiang Liao, Gang Cheng, Hongbin Sun

Summary: This paper proposes a infrared video colorization network CPNet, which aims to generate visually plausible and spatial-temporal consistent colorized videos. Experimental results demonstrate the superiority of CPNet in producing more authentic colorized videos compared to state-of-the-art methods.

COMPUTER VISION AND IMAGE UNDERSTANDING (2023)

添加到收藏夹

Article Engineering, Electrical & Electronic

Adaptively Meshed Video Stabilization

Minda Zhao, Qiang Ling

Summary: This paper proposes an adaptively meshed method to stabilize shaky videos, addressing issues in current video stabilization methods. By utilizing all feature trajectories and an adaptive blocking strategy, the method improves camera motion estimation performance and produces visually pleasing stabilization effects in challenging videos.

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2021)

添加到收藏夹

Article Computer Science, Information Systems

Background-Aware Colorization Technique for Augmented Reality Applications

Emanuele Marino, Fabio Bruno, Loris Barbieri, Maurizio Muzzupappa, Fotis Liarokapis

Summary: A major challenge in Augmented Reality (AR) is presenting augmented information effectively in various uncontrollable environmental conditions. The selection of colors plays a crucial role in blending virtual objects into the real environment, and different coloring strategies should be chosen based on the specific scenario to ensure optimal visibility and integration.

IEEE ACCESS (2021)

添加到收藏夹

Article Computer Science, Interdisciplinary Applications

An evolutionary parallel multiobjective feature selection framework

Hakan Ezgi Kiziloz, Ayca Deniz

Summary: In this study, a robust framework for feature selection is built leveraging the multi-core nature of a regular PC. Multiple execution settings are facilitated through the use of two multiobjective selection algorithms, four initial population generation methods, and five machine learning techniques. Extensive experiments on 11 UCI benchmark datasets show remarkable improvement in terms of maximum accuracy.

COMPUTERS & INDUSTRIAL ENGINEERING (2021)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Parallel binary arithmetic optimization algorithm and its application for feature selection

Zhongjie Zhuang, Jeng-Shyang Pan, Junbao Li, Shu-Chuan Chu

Summary: Arithmetic Optimization Algorithm (AOA) is a simple and easy to implement algorithm with few parameters. It utilizes the distribution behavior of arithmetic operators in mathematics. In this manuscript, AOA algorithm is converted into binary form with improved exploration using Multiplication Mathematical Optimizer Operator (MOO). Four families of transfer functions are used in the binary AOA (BAOA). Parallel mechanism is introduced to further enhance performance and proposed the Parallel Binary AOA (PBAOA) algorithm. Experimental results show that the proposed BAOA and PBAOA algorithms outperform classical and state-of-the-art algorithms in feature selection problems on low-dimensional and high-dimensional datasets.

KNOWLEDGE-BASED SYSTEMS (2023)

添加到收藏夹

Article Chemistry, Multidisciplinary

Video Forensics: Identifying Colorized Images Using Deep Learning

Carlos Ulloa, Dora M. Ballesteros, Diego Renza

Summary: This study presents two CNN-based models for classifying colorized and original images, evaluating the impact of hyperparameters. The transfer-learning-based model performs better in accuracy but has slower inference times compared to the custom model.

APPLIED SCIENCES-BASEL (2021)

添加到收藏夹

Article Computer Science, Information Systems

PhotoHelper: Portrait Photographing Guidance Via Deep Feature Retrieval and Fusion

Nan Jiang, Bin Sheng, Ping Li, Tong-Yee Lee

Summary: We introduce a new photographing guidance (PhotoHelper) that enhances the portrait photo quality for amateur photographers using deep feature retrieval and fusion. Our model combines empirical aesthetic rules, traditional machine learning algorithms, and deep neural networks to extract various features in color and space aspects. Through a series of processes, our approach provides users with professional reference photographs and automatically generates spatial composition guidance, resulting in significantly improved aesthetic quality of portrait photos.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

FSAD-Net: Feedback Spatial Attention Dehazing Network

Yu Zhou, Zhihua Chen, Ping Li, Haitao Song, C. L. Philip Chen, Bin Sheng

Summary: In this article, a novel end-to-end dehazing method called feedback spatial attention dehazing network (FSAD-Net) is proposed. FSAD-Net improves dehazing performance by utilizing feedback connections and attention mechanisms, and it shows superiority in both quantitative and qualitative experiments.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

添加到收藏夹

Article Engineering, Electrical & Electronic

Coupled Solution of Volume Integral and Hydrodynamic Equations to Analyze Electromagnetic Scattering From Composite Nanostructures

Doolos Aibek Uulu, Rui Chen, Liang Chen, Ping Li, Hakan Bagci

Summary: This study solves a coupled system of volume integral and hydrodynamic equations to analyze electromagnetic scattering from nanostructures consisting of metallic and dielectric parts. The hydrodynamic equation relates the free electron polarization current to the electric flux, while the volume integral equation relates the electric flux and the free electron polarization current to the scattered electric field. An efficient two-level iterative solver is proposed to solve the matrix system generated by the expansions of the unknown electric flux and free electron polarization current. Numerical experiments demonstrate the accuracy, efficiency, and applicability of the proposed method.

IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION (2023)

添加到收藏夹

Article Computer Science, Information Systems

EAPT: Efficient Attention Pyramid Transformer for Image Processing

Xiao Lin, Shuzhou Sun, Wei Huang, Bin Sheng, Ping Li, David Dagan Feng

Summary: Recent transformer-based models, especially patch-based methods, have shown great potential in vision tasks. However, fixed-size patch division ignores the diversity of visual elements and hinders global attention information extraction. To address these issues, an Efficient Attention Pyramid Transformer (EAPT) is proposed, which includes Deformable Attention for non-fixed attention information and an Encode-Decode Communication module for complete global attention information. A specifically designed position encoding for vision transformers is also introduced. Extensive experiments and ablation studies demonstrate the effectiveness and key components of the proposed model.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

Mobile game props recommendation for machine learning

Can Yang, Ping Li

Summary: Mobile game providers benefit from selling virtual items, and by utilizing multi-instance multi-label learning algorithm and semi-supervised learning, the recommendation performance of game props is improved, leading to a significant increase in game revenue, as confirmed by offline data sets and online game experimental results.

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS (2023)

添加到收藏夹

Article Materials Science, Characterization & Testing

Feature-Sensitive Ring Artifact Correction for Computed Tomography

Markus Wedekind, Susana Castillo, Marcus Magnor

Summary: This article introduces a method for correcting ring artifacts in computed tomography (CT) reconstruction. The method compensates for errors in the gain and offset values of each pixel and reduces blur by inferring information from neighboring pixels. Experimental results show that this method effectively mitigates the shortcomings of purely offset-based approaches and approaches using all projections, and can be efficiently implemented.

JOURNAL OF NONDESTRUCTIVE EVALUATION (2023)

添加到收藏夹

Article Computer Science, Information Systems

Multi-stage feature-fusion dense network for motion deblurring

Cai Guo, Qian Wang, Hong-Ning Dai, Ping Li

Summary: In this paper, a multi-stage feature-fusion dense network (MFFDNet) is proposed for motion deblurring, which improves the deblurring effect by reusing the extracted features and reducing computation costs. Experimental results show that MFFDNet outperforms state-of-the-art methods while maintaining a relatively small computing cost.

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION (2023)

添加到收藏夹

Article Engineering, Electrical & Electronic

A Wave Equation-Based Hybridizable Discontinuous Galerkin-Robin Transmission Condition Algorithm for Electromagnetic Problems Analyzing

Xuan Zhang, Shi Min Liu, Ran Zhao, Xiao Chun Li, Jun Fa Mao, Li Jun Jiang, Ping Li

Summary: In this work, a wave-equation-based discontinuous Galerkin (DG) method hybridized with the Robin transmission condition (DG-RTC) is developed to solve frequency-domain electromagnetic (EM) problems. The proposed method discretizes the vector electric field wave equation in each subdomain and introduces numerical flux at the subdomain interfaces. An auxiliary equation based on the tangential continuity of EM fields is introduced to solve for the magnetic field H at the interfaces. A finite-element tearing and interconnecting (FETI)-like approach is used to reduce the computational cost. Several examples are presented to validate the proposed DG method.

IEEE TRANSACTIONS ON MICROWAVE THEORY AND TECHNIQUES (2023)

添加到收藏夹

Article Engineering, Biomedical

Fine-Grained and Multiple Classification for Alzheimer's Disease With Wavelet Convolution Unit Network

Jinyu Wen, Yang Li, Meie Fang, Lei Zhu, David Dagan Feng, Ping Li

Summary: The article introduces a novel wavelet convolution unit for image-oriented neural networks, combining wavelet analysis with convolution operator to extract deep abstract features efficiently. By fusing traditional convolution and wavelet decomposition, as well as incorporating multi-scale decompositions, it aims to improve classification accuracy.

IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING (2023)

添加到收藏夹

Article Engineering, Electrical & Electronic

Localization and Identification of EMI Sources in Shielding Enclosures Based on a Two-Step Source Reconstruction Method

Zi An Wang, Jun Fa Mao, Li Jun Jiang, Ping Li

Summary: This paper proposes a two-step source reconstruction method to address the localization and identification of unknown electromagnetic interference (EMI) sources in shielding enclosures. The actual EMI sources are initially modeled by equally distributed equivalent electric dipoles and the effects of surrounding environments are incorporated using the numerical Green's function. The parameters of equivalent dipoles are then updated to improve the precision of the equivalent model by introducing additional magnetic dipoles.

IEEE TRANSACTIONS ON ELECTROMAGNETIC COMPATIBILITY (2023)

添加到收藏夹

Article Engineering, Electrical & Electronic

Efficient Thermal Analysis of Integrated Circuits and Packages With Microchannel Cooling Using Laguerre-Based Layered Finite Element Method

Bo Li, Min Tang, Ping Li, Junfa Mao

Summary: In this article, an efficient approach named Laguerre-based layered finite element method (LB-LFEM) is presented for transient thermal analysis of integrated circuits (ICs) and packages with microchannel cooling. The LB-LFEM utilizes a marching-on-in-order scheme based on weighted Laguerre polynomials to eliminate the time variables and reduce the system matrix. By solving the Laguerre coefficients recursively, the computational efficiency is significantly improved. The validity and high efficiency of LB-LFEM are demonstrated by several examples.

IEEE JOURNAL ON MULTISCALE AND MULTIPHYSICS COMPUTATIONAL TECHNIQUES (2023)

添加到收藏夹

Article Computer Science, Information Systems

S 3 Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection

Lei Zhu, Xiaoqiang Wang, Ping Li, Xin Yang, Qing Zhang, Weiming Wang, Carola-Bibiane Schonlieb, C. L. Philip Chen

Summary: RGB-D salient object detection aims to detect visually distinctive objects or regions from a pair of the RGB image and the depth image. In this work, we propose a self-supervised self-ensembling network (S-3 Net) for semi-supervised RGB-D salient object detection by leveraging the unlabeled data and exploring a self-supervised learning mechanism. Experimental results demonstrate that our network outperforms the state-of-the-art methods on seven widely-used benchmark datasets.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

A New Framework of Collaborative Learning for Adaptive Metric Distillation

Hao Liu, Mang Ye, Yan Wang, Sanyuan Zhao, Ping Li, Jianbing Shen

Summary: This article presents a new adaptive metric distillation approach that significantly improves the backbone features of student networks and achieves better classification results. Previous knowledge distillation methods focused on transferring knowledge across classifier logits or feature structure, neglecting the excessive sample relations in the feature space. The proposed collaborative adaptive metric distillation (CAMD) optimizes the relationship between key pairs, adapts the metric for student embeddings using teacher embeddings as supervision, and employs a collaborative scheme for knowledge aggregation. Extensive experiments show that CAMD outperforms other cutting-edge distillers and sets a new state-of-the-art in both classification and retrieval tasks.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023)

添加到收藏夹

Article Computer Science, Artificial Intelligence

MNGNAS: Distilling Adaptive Combination of Multiple Searched Networks for One-Shot Neural Architecture Search

Zhihua Chen, Guhao Qiu, Ping Li, Lei Zhu, Xiaokang Yang, Bin Sheng

Summary: Recently, neural architecture search (NAS) has attracted attention from academia and industry. However, it remains a challenging problem due to the large search space and computational costs. In this study, a multi-teacher-guided NAS approach was proposed to improve the search efficiency and precision. Experimental results show that this approach achieves significant improvements in both accuracy and efficiency.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2023)

添加到收藏夹

Article Education & Educational Research

GAN-based pencil drawing learning system for art education on large-scale image datasets with learning analytics

Yuxi Jin, Ping Li, Wenxiao Wang, Suiyun Zhang, Di Lin, Chengjiu Yin

Summary: We have designed a generative adversarial network (GAN)-based pencil drawing learning system for art education on large image datasets. Students can learn how to draw pencil drawings for natural scenes by uploading an image and getting a pencil drawing example. Teachers can conveniently assign homework and understand the learning demands of students using the system.

INTERACTIVE LEARNING ENVIRONMENTS (2023)

添加到收藏夹

暂无数据

© Peeref 2019-2024. All rights reserved.