The aim of this work consists in the development of automatic techniques for the extraction of content-based information from audiovisual data. The focus has been placed on providing tools for analyzing both audio and visual streams, for translating the signal samples into sequences of indices. The signal classification are performed by means of Hidden Markov Models (HMM), used in an innovative approach: the input signal is considered as a non-stationary stochastic process, modeled by a HMM in which each state stands for a different class of the signal. This defines an adaptive classification scheme for which a set of new training algorithms has been developed. Several samples from the MPEG-7 content set have been analyzed using the proposed classification scheme, demonstrating the performance of the overall approach to provide insights of the content of the audio-visual material.

Audio-Visual Pattern Recognition using HMM for Content-Based Multimedia Indexing

LEONARDI, Riccardo
2000-01-01

Abstract

The aim of this work consists in the development of automatic techniques for the extraction of content-based information from audiovisual data. The focus has been placed on providing tools for analyzing both audio and visual streams, for translating the signal samples into sequences of indices. The signal classification are performed by means of Hidden Markov Models (HMM), used in an innovative approach: the input signal is considered as a non-stationary stochastic process, modeled by a HMM in which each state stands for a different class of the signal. This defines an adaptive classification scheme for which a set of new training algorithms has been developed. Several samples from the MPEG-7 content set have been analyzed using the proposed classification scheme, demonstrating the performance of the overall approach to provide insights of the content of the audio-visual material.
File in questo prodotto:
File Dimensione Formato  
OL_PV-2000_pre-print.pdf

accesso aperto

Descrizione: OL_PV-2000_pre-print
Tipologia: Documento in Pre-print
Licenza: PUBBLICO - Creative Commons 3.6
Dimensione 552.07 kB
Formato Adobe PDF
552.07 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11379/15081
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact