A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets
Published 2021 View Full Article
- Home
- Publications
- Publication Search
- Publication Details
Title
A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets
Authors
Keywords
-
Journal
VISUAL COMPUTER
Volume -, Issue -, Pages -
Publisher
Springer Science and Business Media LLC
Online
2021-06-11
DOI
10.1007/s00371-021-02166-7
References
Ask authors/readers for more resources
Related references
Note: Only part of the references are listed.- TICS: text–image-based semantic CAPTCHA synthesis via multi-condition adversarial learning
- (2021) Xinkang Jia et al. VISUAL COMPUTER
- DTR-HAR: deep temporal residual representation for human activity recognition
- (2021) Hend Basly et al. VISUAL COMPUTER
- Combining CNN streams of dynamic image and depth data for action recognition
- (2020) Roshan Singh et al. MULTIMEDIA SYSTEMS
- Recent advances in deep learning for object detection
- (2020) Xiongwei Wu et al. NEUROCOMPUTING
- Passion fruit detection and counting based on multiple scale faster R-CNN using RGB-D images
- (2020) Shuqin Tu et al. PRECISION AGRICULTURE
- Multiple answers to a question: a new approach for visual question answering
- (2020) Sayedshayan Hashemi Hosseinabad et al. VISUAL COMPUTER
- A single-shot multi-level feature reused neural network for object detection
- (2020) Lixin Wei et al. VISUAL COMPUTER
- Multi-level progressive parallel attention guided salient object detection for RGB-D images
- (2020) Zhengyi Liu et al. VISUAL COMPUTER
- Face–Iris Multimodal Biometric Identification System
- (2020) Basma Ammour et al. Electronics
- A survey of the recent architectures of deep convolutional neural networks
- (2020) Asifullah Khan et al. ARTIFICIAL INTELLIGENCE REVIEW
- AI-Driven Tools for Coronavirus Outbreak: Need of Active Learning and Cross-Population Train/Test Models on Multitudinal/Multimodal Data
- (2020) K. C. Santosh JOURNAL OF MEDICAL SYSTEMS
- Back-projection-based progressive growing generative adversarial network for single image super-resolution
- (2020) Tingsong Ma et al. VISUAL COMPUTER
- Efficient object tracking using hierarchical convolutional features model and correlation filters
- (2020) Mohammed Y. Abbass et al. VISUAL COMPUTER
- A survey on online learning for visual tracking
- (2020) Mohammed Y. Abbass et al. VISUAL COMPUTER
- Online multi-object tracking with pedestrian re-identification and occlusion processing
- (2020) Xueqin Zhang et al. VISUAL COMPUTER
- Transfer learning based hybrid 2D-3D CNN for traffic sign recognition and semantic road detection applied in advanced driver assistance systems
- (2020) Khaled Bayoudh et al. APPLIED INTELLIGENCE
- Real-time multimodal ADL recognition using convolution neural networks
- (2020) Danushka Madhuranga et al. VISUAL COMPUTER
- Paradigm shifts in super-resolution techniques for remote sensing applications
- (2020) G. Rohith et al. VISUAL COMPUTER
- Fine-grained talking face generation with video reinterpretation
- (2020) Xin Huang et al. VISUAL COMPUTER
- $$\hbox {S}^2\hbox {RGAN}$$: sonar-image super-resolution based on generative adversarial network
- (2020) Hongtao Song et al. VISUAL COMPUTER
- A Novel Multi-Stage Training Approach for Human Activity Recognition From Multimodal Wearable Sensor Data Using Deep Neural Network
- (2020) Tanvir Mahmud et al. IEEE SENSORS JOURNAL
- Modality-transfer generative adversarial network and dual-level unified latent representation for visible thermal Person re-identification
- (2020) Xing Fan et al. VISUAL COMPUTER
- Multimodal End-to-End Autonomous Driving
- (2020) Yi Xiao et al. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS
- Semantic embeddings of generic objects for zero-shot learning
- (2019) Tristan Hascoet et al. EURASIP Journal on Image and Video Processing
- Fusing Multimodal Video Data for Detecting Moving Objects/Targets in Challenging Indoor and Outdoor Scenes
- (2019) Zacharias Kandylakis et al. Remote Sensing
- A patch-based super resolution algorithm for improving image resolution in clinical mass spectrometry
- (2019) Klára Ščupáková et al. Scientific Reports
- Exploring RGBDepth Fusion for Real-Time Object Detection
- (2019) Tanguy Ophoff et al. SENSORS
- Representation Learning Using Step-based Deep Multi-modal Autoencoders
- (2019) Gaurav Bhatt et al. PATTERN RECOGNITION
- Transformation of portraits to Picasso’s cubism style
- (2019) Guanyu Lian et al. VISUAL COMPUTER
- Video Question Answering with Spatio-Temporal Reasoning
- (2019) Yunseok Jang et al. INTERNATIONAL JOURNAL OF COMPUTER VISION
- A review of monocular visual odometry
- (2019) Ming He et al. VISUAL COMPUTER
- 4D facial expression recognition using multimodal time series analysis of geometric landmark-based deformations
- (2019) Payam Zarbakhsh et al. VISUAL COMPUTER
- Integrating global and local image features for enhanced loop closure detection in RGB-D SLAM systems
- (2019) Oguzhan Guclu et al. VISUAL COMPUTER
- Deep Learning for Generic Object Detection: A Survey
- (2019) Li Liu et al. INTERNATIONAL JOURNAL OF COMPUTER VISION
- A survey of deep learning techniques for autonomous driving
- (2019) Sorin Grigorescu et al. Journal of Field Robotics
- 3D-SSD: Learning hierarchical features from RGB-D images for amodal 3D object detection
- (2019) Qianhui Luo et al. NEUROCOMPUTING
- Deep learning in video multi-object tracking: A survey
- (2019) Gioele Ciaparrone et al. NEUROCOMPUTING
- Accurate and Robust Monocular SLAM with Omnidirectional Cameras
- (2019) Liu et al. SENSORS
- DRCDN: learning deep residual convolutional dehazing networks
- (2019) Shengdong Zhang et al. VISUAL COMPUTER
- Memory influences haptic perception of softness
- (2019) Anna Metzger et al. Scientific Reports
- An integrated approach for medical abnormality detection using deep patch convolutional neural networks
- (2019) Pengcheng Xi et al. VISUAL COMPUTER
- A comprehensive survey on the biometric recognition systems based on physiological and behavioral modalities
- (2019) Shaveta Dargan et al. EXPERT SYSTEMS WITH APPLICATIONS
- Unsupervised Deep Visual-Inertial Odometry with Online Error Correction for RGB-D Imagery
- (2019) E. Jared Shamwell et al. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
- Multimedia Big Data Analytics
- (2018) Samira Pouyanfar et al. ACM COMPUTING SURVEYS
- Generative Adversarial Networks: An Overview
- (2018) Antonia Creswell et al. IEEE SIGNAL PROCESSING MAGAZINE
- CCL: Cross-modal Correlation Learning With Multigrained Fusion by Hierarchical Network
- (2018) Yuxin Peng et al. IEEE TRANSACTIONS ON MULTIMEDIA
- Multimodal Recurrent Neural Networks With Information Transfer Layers for Indoor Scene Labeling
- (2018) Abrar H. Abdulnabi et al. IEEE TRANSACTIONS ON MULTIMEDIA
- Multi-modal gated recurrent units for image description
- (2018) Xuelong Li et al. MULTIMEDIA TOOLS AND APPLICATIONS
- Autonomous vehicle perception: The technology of today and tomorrow
- (2018) Jessica Van Brummelen et al. TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES
- A survey on deep neural network-based image captioning
- (2018) Xiaoxiao Liu et al. VISUAL COMPUTER
- A Survey of Deep Learning: Platforms, Applications and Emerging Research Trends
- (2018) William Grant Hatcher et al. IEEE Access
- A Review of Point Feature Based Medical Image Registration
- (2018) Shao-Ya Guan et al. Chinese Journal of Mechanical Engineering
- CNN-Based Multimodal Human Recognition in Surveillance Environments
- (2018) Ja Koo et al. SENSORS
- Multimodal Ambulatory Sleep Detection Using LSTM Recurrent Neural Networks
- (2018) Akane Sano et al. IEEE Journal of Biomedical and Health Informatics
- Complete 3D Scene Parsing from an RGBD Image
- (2018) Chuhang Zou et al. INTERNATIONAL JOURNAL OF COMPUTER VISION
- Modality-correlation-aware sparse representation for RGB-infrared object tracking
- (2018) Xiangyuan Lan et al. PATTERN RECOGNITION LETTERS
- Photographic style transfer
- (2018) Li Wang et al. VISUAL COMPUTER
- Joint learning of image detail and transmission map for single image dehazing
- (2018) Shengdong Zhang et al. VISUAL COMPUTER
- A multimodal fusion approach for image captioning
- (2018) Dexin Zhao et al. NEUROCOMPUTING
- Deep Multimodal Learning: A Survey on Recent Advances and Trends
- (2017) Dhanesh Ramachandram et al. IEEE SIGNAL PROCESSING MAGAZINE
- ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras
- (2017) Raul Mur-Artal et al. IEEE Transactions on Robotics
- Multi-modal multiple kernel learning for accurate identification of Tourette syndrome children
- (2017) Hongwei Wen et al. PATTERN RECOGNITION
- Multimodal vehicle detection: fusing 3D-LIDAR and color camera data
- (2017) Alireza Asvadi et al. PATTERN RECOGNITION LETTERS
- The sketchy database
- (2016) Patsorn Sangkloy et al. ACM TRANSACTIONS ON GRAPHICS
- A Neural Algorithm of Artistic Style
- (2016) Leon Gatys et al. JOURNAL OF VISION
- Correlational Neural Networks
- (2016) Sarath Chandar et al. NEURAL COMPUTATION
- Deep learning for visual understanding: A review
- (2016) Yanming Guo et al. NEUROCOMPUTING
- Multimodal video classification with stacked contractive autoencoders
- (2016) Yanan Liu et al. SIGNAL PROCESSING
- Robust Face Recognition via Multimodal Deep Face Representation
- (2015) Changxing Ding et al. IEEE TRANSACTIONS ON MULTIMEDIA
- Deep learning
- (2015) Yann LeCun et al. NATURE
- EasyMKL: a scalable multiple kernel learning algorithm
- (2015) Fabio Aiolli et al. NEUROCOMPUTING
- Multimodal Data Fusion: An Overview of Methods, Challenges, and Prospects
- (2015) Dana Lahat et al. PROCEEDINGS OF THE IEEE
- Transfer Learning for Visual Categorization: A Survey
- (2015) Ling Shao et al. IEEE Transactions on Neural Networks and Learning Systems
- Representation Learning: A Review and New Perspectives
- (2013) Y. Bengio et al. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
- Selective Search for Object Recognition
- (2013) J. R. R. Uijlings et al. INTERNATIONAL JOURNAL OF COMPUTER VISION
- Review on Methods to Fix Number of Hidden Neurons in Neural Networks
- (2013) K. Gnana Sheela et al. MATHEMATICAL PROBLEMS IN ENGINEERING
Add your recorded webinar
Do you already have a recorded webinar? Grow your audience and get more views by easily listing your recording on Peeref.
Upload NowCreate your own webinar
Interested in hosting your own webinar? Check the schedule and propose your idea to the Peeref Content Team.
Create Now