4.5 Article

Interactive visualization of video content and associated description for semantic annotation

期刊

SIGNAL IMAGE AND VIDEO PROCESSING
卷 3, 期 2, 页码 183-196

出版社

SPRINGER LONDON LTD
DOI: 10.1007/s11760-008-0071-6

关键词

Semantic video annotation; Visualization; Content organization; Content description; Browsing; Low-level features

向作者/读者索取更多资源

In this paper, we present an intuitive graphic framework introduced for the effective visualization of video content and associated audio-visual description, with the aim to facilitate a quick understanding and annotation of the semantic content of a video sequence. The basic idea consists in the visualization of a 2D feature space in which the shots of the considered video sequence are located. Moreover, the temporal position and the specific content of each shot can be displayed and analysed in more detail. The selected features are decided by the user, and can be updated during the navigation session. In the main window, shots of the considered video sequence are displayed in a Cartesian plane, and the proposed environment offers various functionalities for automatically and semi-automatically finding and annotating the shot clusters in such feature space. With this tool the user can therefore explore graphically how the basic segments of a video sequence are distributed in the feature space, and can recognize and annotate the significant clusters and their structure. The experimental results show that browsing and annotating documents with the aid of the proposed visualization paradigms is easy and quick, since the user has a fast and intuitive access to the audio-video content, even if he or she has not seen the document yet.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Telecommunications

A Note on Probabilistic and Geometric Shaping for the AWGN Channel

Roberto Modonesi, Marco Dalai, Pierangelo Migliorati, Riccardo Leonardi

IEEE COMMUNICATIONS LETTERS (2020)

Article Computer Science, Theory & Methods

A wavelet filter comparison on multiple datasets for signal compression and denoising

Alessandro Gnutti, Fabrizio Guerrini, Nicola Adami, Pierangelo Migliorati, Riccardo Leonardi

Summary: This paper explicitly analyzes the performance effects of various wavelet families in compression and denoising tasks, highlighting the significant impact of wavelet parameters. The study serves as a valuable benchmark for future research in this area.

MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING (2021)

Article Multidisciplinary Sciences

Automatic classification of mice vocalizations using Machine Learning techniques and Convolutional Neural Networks

Marika Premoli, Daniele Baggi, Marco Bianchetti, Alessandro Gnutti, Marco Bondaschi, Andrea Mastinu, Pierangelo Migliorati, Alberto Signoroni, Riccardo Leonardi, Maurizio Memo, Sara Anna Bonini

Summary: The study proposed a method for automatic classification of ultrasonic vocalizations (USVs) using supervised learning. By utilizing manually built datasets and algorithms like Convolutional Neural Network, it showed that analyzing the full time/frequency information of the USVs spectrograms led to higher performance compared to considering a subset of numerical features.

PLOS ONE (2021)

Article Computer Science, Artificial Intelligence

BS-Net: Learning COVID-19 pneumonia severity on a large chest X-ray dataset

Alberto Signoroni, Mattia Savardi, Sergio Benini, Nicola Adami, Riccardo Leonardi, Paolo Gibellini, Filippo Vaccher, Marco Ravanelli, Andrea Borghesi, Roberto Maroldi, Davide Farina

Summary: This study presents an end-to-end deep learning architecture for predicting the degree of lung compromise in COVID-19 patients on Chest X-ray images. The proposed semi-quantitative scoring system shows significant prognostic value and outperforms human annotators. The BS-Net demonstrates high accuracy and self-attentive behavior, showcasing its potential for computer-assisted monitoring in clinical settings.

MEDICAL IMAGE ANALYSIS (2021)

Article Engineering, Electrical & Electronic

Head pose estimation: A survey of the last ten years

Khalil Khan, Rehan Ullah Khan, Riccardo Leonardi, Pierangelo Migliorati, Sergio Benini

Summary: This paper surveys the development of head pose estimation methods in both constrained and unconstrained conditions over the past decade. It highlights the ongoing open research topic in this field and summarizes the advantages and disadvantages of existing algorithms, while also suggesting promising directions for future research.

SIGNAL PROCESSING-IMAGE COMMUNICATION (2021)

Article Computer Science, Information Systems

Image forgery detection: a survey of recent deep-learning approaches

Marcello Zanardelli, Fabrizio Guerrini, Riccardo Leonardi, Nicola Adami

Summary: In recent years, there has been a proliferation of fake and altered images due to the availability and ease of use of image editing tools. This paper conducts a survey of the latest image forgery detection methods based on Deep Learning (DL) techniques, focusing on copy-move and splicing attacks. The survey discusses the key aspects of these methods, the datasets used for training and validation, as well as their performance. The paper also addresses future research trends and directions in deep learning architectures and evaluation approaches, as well as dataset building for easy methods comparison.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Article Engineering, Electrical & Electronic

The Mirror Transform

Fabrizio Guerrini, Alessandro Gnutti, Riccardo Leonardi

Summary: This paper explains a unique representation method for describing any finite-energy signal, which consists of an ordered set of positions and a sparse set of signals. By designing and implementing an iterative decomposition approach using mirror operations, the method achieves maximum decoupling and sparsity in the signal representation. Experimental simulations demonstrate the superior approximation capabilities of this approach, making it potentially useful in various domains such as approximation and coding.

IEEE TRANSACTIONS ON SIGNAL PROCESSING (2022)

Proceedings Paper Computer Science, Artificial Intelligence

SYMMETRY-BASED GRAPH FOURIER TRANSFORMS: ARE THEY OPTIMAL FOR IMAGE COMPRESSION?

Alessandro Gnutti, Fabrizio Guerrini, Riccardo Leonardi, Antonio Ortega

Summary: The Symmetry-Based Graph Fourier Transforms (SBGFTs) have shown promising performance in energy compaction and improvement of HEVC intra coding, making them a potential central role in image compression. Comparisons with other orthogonal transforms such as KLT and SOT further confirm the superior representation ability of SBGFTs in multiple transforms and non-linear approximation perspective.

2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) (2021)

Proceedings Paper Computer Science, Artificial Intelligence

2D Discrete Mirror Transform for Image Non-Linear Approximation

Alessandro Gnutti, Fabrizio Guerrini, Riccardo Leonardi

Summary: This paper presents a new 2D transform called Discrete Mirror Transform (DMT), which decomposes a signal into even and odd parts around an optimal location to maximize signal energy split. An optimized version of DMT (ODMT) is also introduced, showing superior performance compared to DCT and DWT in terms of non-linear approximation when applied on images.

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) (2021)

Article Computer Science, Artificial Intelligence

Combining Appearance and Gradient Information for Image Symmetry Detection

Alessandro Gnutti, Fabrizio Guerrini, Riccardo Leonardi

Summary: This work addresses the challenging problem of reflection symmetry detection in unconstrained environments by proposing a two-stage solution, which establishes a better correspondence between the outcomes of the algorithm and a human-constructed ground truth, achieves significant performance gains compared to recent symmetry detection competitions, and further validates the approach through perceptual validation experiments with users on a newly built dataset.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2021)

Article Computer Science, Artificial Intelligence

Learnable Descriptors for Visual Search

Andrea Migliorati, Attilio Fiandrotti, Gianluca Francini, Riccardo Leonardi

Summary: LDVS is a learnable binary local descriptor designed for matching natural images within the MPEG CDVS framework. Through experiments, LDVS descriptors have shown favorable performance in image patch matching and have a moderate parameters count for operations on mobile devices.

IEEE TRANSACTIONS ON IMAGE PROCESSING (2021)

Proceedings Paper Marine & Freshwater Biology

Distribution of Sabellaria alveolata (Polychaeta Sabellariidae) in the Mediterranean Sea: update and new findings

Rossana Sanfilippo, A. Guido, G. Insacco, C. Deias, G. Catania, A. Reitano, R. Leonardi, A. Rosso

13TH INTERNATIONAL POLYCHAETE CONFERENCE (IPC13) (2020)

Article Computer Science, Theory & Methods

Minimal Information Exchange for Secure Image Hash-Based Geometric Transformations Estimation

Fabrizio Guerrini, Marco Dalai, Riccardo Leonardi

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY (2020)

Proceedings Paper Acoustics

ITERATIVE MIRROR DECOMPOSITION FOR SIGNAL REPRESENTATION

Fabrizio Guerrini, Alessandro Gnutti, Riccardo Leonardi

2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) (2019)

Article Multidisciplinary Sciences

FASSEG: A FAce semantic SEGmentation repository for face image analysis

Sergio Benini, Khalil Khan, Riccardo Leonardi, Massimo Mauro, Pierangelo Migliorati

DATA IN BRIEF (2019)

暂无数据