期刊
MULTIMEDIA TOOLS AND APPLICATIONS
卷 73, 期 1, 页码 273-289出版社
SPRINGER
DOI: 10.1007/s11042-013-1608-4
关键词
Multimodal joint multimedia processing; Crowd counting; Ordinary depth camera; Scene-adaptive scheme; Real time system
类别
资金
- China National Funds for Distinguished Young Scientists [60925010]
- Natural Science Foundation of China [61272517]
- Research Fund for the Doctoral Program of Higher Education of China [20120005130002]
- Beijing Committee of Education
- Funds for Creative Research Groups of China [61121001]
- Program for Changjiang Scholars and Innovative Research Team in University [IRT1049]
Reliable and real-time crowd counting is one of the most important tasks in intelligent visual surveillance systems. Most previous works only count passing people based on color information. Owing to the restrictions of color information influences themselves for multimedia processing, they will be affected inevitably by the unpredictable complex environments (e.g. illumination, occlusion, and shadow). To overcome this bottleneck, we propose a new algorithm by multimodal joint information processing for crowd counting. In our method, we use color and depth information together with a ordinary depth camera (e.g. Microsoft Kinect). Specifically, we first detect each head of the passing or still person in the surveillance region with adaptive modulation ability to varying scenes on depth information. Then, we track and count each detected head on color information. The characteristic advantage of our algorithm is that it is scene adaptive, which means the algorithm can be applied into all kinds of different scenes directly without additional conditions. Based on the proposed approach, we have built a practical system for robust and fast crowd counting facing complicated scenes. Extensive experimental results show the effectiveness of our proposed method.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据