Method and System for Detecting Audio and Video Scene Changes

BENINI, Sergio
2005-01-01

Abstract

This invention introduces a method and system for segmenting a video sequence and its accompanying audio sequence into video scenes and audio scenes, respectively. To allow ready integration of the audio and video scene segmentation information, the concept of an audio shot is introduced, corresponding to readily identifiable video shots. Synchronising the audio stream with the video stream on the basis of corresponding audio shots and video shots makes it relatively straightforward to integrate the semantic scene information determined for each stream (a scene comprising one or more shots), giving a richer semantic understanding of the content. Fusing the audio and visual analysis results through heuristic rules provides further advantages. The invention is industrially applicable to automating the time-consuming and laborious process of organising and indexing increasingly large video databases, so that they can be easily browsed and searched using natural query structures that are close to human concepts.
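The shot-level synchronisation and rule-based fusion described in the abstract can be illustrated with a minimal sketch. It is not the patented procedure: the abstract does not state the heuristic rules, so the three labels below, the function names, and the representation of scene boundaries as shot indices are all assumptions made for illustration only.

```python
def audio_shots_from_video_shots(video_shots, sample_rate):
    """Cut the audio timeline at the video shot boundaries.

    video_shots: list of (start_seconds, end_seconds) for each video shot.
    Returns one (start_sample, end_sample) range per shot, so audio shot i
    covers the same time interval as video shot i (assumed representation).
    """
    return [(round(start * sample_rate), round(end * sample_rate))
            for start, end in video_shots]


def fuse_scene_boundaries(video_boundaries, audio_boundaries):
    """Combine per-modality scene boundaries with a simple rule set.

    Both inputs are shot indices at which a scene change was detected.
    Because audio and video shots share the same intervals, the indices
    are directly comparable. The labels are hypothetical, not taken from
    the patent text.
    """
    video, audio = set(video_boundaries), set(audio_boundaries)
    fused = {}
    for index in sorted(video | audio):
        if index in video and index in audio:
            fused[index] = "audio-visual scene change"
        elif index in video:
            fused[index] = "video-only scene change"
        else:
            fused[index] = "audio-only scene change"
    return fused


if __name__ == "__main__":
    shots = [(0.0, 2.5), (2.5, 4.0)]
    print(audio_shots_from_video_shots(shots, sample_rate=16000))
    print(fuse_scene_boundaries([2, 5], [5, 7]))
```

Because both streams are indexed by the same shots, the fusion step reduces to set operations on shot indices, which is what makes the integration "relatively straightforward" in the sense the abstract describes.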
Files in this item:
File: Patent WO2005093752A1 - Method and system for detecting audio and video scene changes - Google Patents.pdf
Access: authorised users only
Licence: Creative Commons
Size: 1.2 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11379/34576