Audio as a Support to Scene Change Detection and Characterization of Video Sequences

IRIS Institutional Research Information System - OPENBS Open Archive UniBS

A challenging problem to construct video databases is the organization of video information. The development of algorithms able to organize video information according to semantic content of the data is getting more and more important. This will allow algorithms such as indexing and retrieval to work more efficiently. Until now, an attempt to extract semantic information has been performed using only video information. As a video sequence is constructed from a 2-D projection of a 3-D scene, video processing has shown its limitations especially in solving problems such as object identification or object tracking, reducing the ability to extract semantic characteristics. A possibility to overcome the problem is to use additional information. The associated audio signal is then the most natural way to obtain this information. This paper presents a technique which combines video and audio information together for classification and indexing purposes. The classification is performed on the audio signal; a general framework that uses the results of such classification is then proposed for organizing video information.

Audio as a Support to Scene Change Detection and Characterization of Video Sequences

LEONARDI Riccardo^Supervision;SARACENO Caterina^Methodology

1997-01-01

Abstract

A challenging problem to construct video databases is the organization of video information. The development of algorithms able to organize video information according to semantic content of the data is getting more and more important. This will allow algorithms such as indexing and retrieval to work more efficiently. Until now, an attempt to extract semantic information has been performed using only video information. As a video sequence is constructed from a 2-D projection of a 3-D scene, video processing has shown its limitations especially in solving problems such as object identification or object tracking, reducing the ability to extract semantic characteristics. A possibility to overcome the problem is to use additional information. The associated audio signal is then the most natural way to obtain this information. This paper presents a technique which combines video and audio information together for classification and indexing purposes. The classification is performed on the audio signal; a general framework that uses the results of such classification is then proposed for organizing video information.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				1997
			
	Titolo del volume
	
				IEEE International Conference on Speech, Acoustics and Signal Processing (ICASSP 97)
			
	Fonte principale del progetto
	
				MIUR (compresi PRIN FIRB,FISR)
			
	Aree tematiche (ERC)
	
				PE6_11 Machine learning, statistical data processing and applications using signal processing (eg. speech, image, video)
PE7_7 Signal processing
PE6_8 Computer graphics, computer vision, multi media, computer games
			
	Rivista su cui è pubblicata l'opera
	
				PROCEEDINGS OF THE ... IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING
			
	Codice ISI
	
				WOS:A1997BH95E00651
			
	Codice Scopus
	
				2-s2.0-0030702213
			
	Referee
	
				Esperti anonimi
			
	Lingua/e
	
				Inglese
			
	Su invito
	
				no
			
	Titolo del convegno
	
				IEEE International Conference on Speech, Acoustics and Signal Processing (ICASSP 97)
			
	Periodo del Convegno
	
				Apr. 1997
			
	Luogo del Convegno
	
				Muenich, DE
			
	Rilevanza del Convegno
	
				Internazionale
			
	Formato
	
				STAMPA
			
	Volume
	
				IV
			
	Da pagina
	
				2597
			
	A pagina
	
				2600
			
	Numero di pagine
	
				4
			
	Codice ISBN
	
				0818679190
			
	Codice DOI
	
				https://dx.doi.org/10.1109/icassp.1997.595320
			
	Nome editore
	
				IEEE
			
	Parole chiave
	
				Audio-visual indexing; multimedia information retrieval
			
	Eventuale altra fonte di finanziamento
	
				Ateneo di appartenenza
			
	Presenza di coautori internazionali
	
				no
			
	Fulltext
	
				restricted
			
	Tutti gli autori
	
						Leonardi, Riccardo; Saraceno, Caterina
					
	Tipologia sito docente
	
				273
			
	Tipologia
	
				info:eu-repo/semantics/conferenceObject
			
	Numero autori
	
				2
			
	Tipologia
	
				4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
SL_ICASSP-1997_full-text.pdf solo utenti autorizzati Descrizione: SL_ICASSP-1997_full-text Tipologia: Full Text Licenza: NON PUBBLICO - Accesso privato/ristretto Dimensione 620.92 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	620.92 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11379/3829

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

49

24

social impact