☆ 4.7 Article

Audiovisual Information Fusion in Human-Computer Interfaces and Intelligent Environments: A Survey

PROCEEDINGS OF THE IEEE (2010)

期刊

PROCEEDINGS OF THE IEEE

卷 98, 期 10, 页码 1692-1715

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/JPROC.2010.2057231

关键词

Audiovisual fusion; dynamic Bayesian networks (DBNs); hidden Markov models; human activity analysis; human activity modeling; information fusion; machine learning; multimodal systems

类别

Engineering, Electrical & Electronic

资金

University of California at San Diego (UCSD)
National Science Foundation (NSF
University of California

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Microphones and cameras have been extensively used to observe and detect human activity and to facilitate natural modes of interaction between humans and intelligent systems. Human brain processes the audio and video modalities, extracting complementary and robust information from them. Intelligent systems with audiovisual sensors should be capable of achieving similar goals. The audiovisual information fusion strategy is a key component in designing such systems. In this paper, we exclusively survey the fusion techniques used in various audiovisual information fusion tasks. The fusion strategy used tends to depend mainly on the model, probabilistic or otherwise, used in the particular task to process sensory information to obtain higher level semantic information. The models themselves are task oriented. In this paper, we describe the fusion strategies and the corresponding models used in audiovisual tasks such as speech recognition, tracking, biometrics, affective state recognition, and meeting scene analysis. We also review the challenges and existing solutions and also unresolved or partially resolved issues in these fields. Specifically, we discuss established and upcoming work in hierarchical fusion strategies and cross-modal learning techniques, identifying these as critical areas of research in the future development of intelligent systems.

Audiovisual Information Fusion in Human-Computer Interfaces and Intelligent Environments: A Survey

期刊

PROCEEDINGS OF THE IEEE

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Audiovisual Information Fusion in Human-Computer Interfaces and Intelligent Environments: A Survey

期刊

PROCEEDINGS OF THE IEEE

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文