Audio as a Support to Scene Change Detection and Characterization of Video Sequences

LEONARDI Riccardo (Supervision)
1997-01-01

Abstract

A challenging problem in the construction of video databases is the organization of video information. Algorithms able to organize video information according to the semantic content of the data are becoming increasingly important, since they allow tasks such as indexing and retrieval to work more efficiently. Until now, attempts to extract semantic information have relied on video information alone. Because a video sequence is a 2-D projection of a 3-D scene, video processing has shown its limitations, especially in problems such as object identification and object tracking, which reduces the ability to extract semantic characteristics. One way to overcome this problem is to use additional information, and the associated audio signal is the most natural source for it. This paper presents a technique that combines video and audio information for classification and indexing purposes. The classification is performed on the audio signal; a general framework that uses the results of this classification is then proposed for organizing video information.
1997
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 97)
MIUR (including PRIN, FIRB, FISR)
PE6_11 Machine learning, statistical data processing and applications using signal processing (eg. speech, image, video)
PE7_7 Signal processing
PE6_8 Computer graphics, computer vision, multi media, computer games
Anonymous experts
English
no
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 97)
Apr. 1997
Munich, DE
International
PRINT
IV
2597
2600
4
0818679190
IEEE
Audio-visual indexing; multimedia information retrieval
Home university
no
restricted
Leonardi, Riccardo; Saraceno, Caterina
273
info:eu-repo/semantics/conferenceObject
2
4 Contribution in Conference Proceedings::4.1 Contribution in conference proceedings
Files in this record:
SL_ICASSP-1997_full-text.pdf (authorized users only)
Description: SL_ICASSP-1997_full-text
Type: Full Text
License: NOT PUBLIC - Private/restricted access
Size: 620.92 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11379/3829
Warning: the data displayed have not been validated by the university.

Citations
  • PMC: N/A
  • Scopus: 49
  • ISI: 24