Many post-production videos such as movies and cartoons present well structured story-lines organized in separated visual scenes. Accurate grouping of shots into these logical segments could lead to semantic indexing of scenes for interactive multimedia retrieval and video summaries. In this paper we introduce a novel shot-based analysis approach which aims to cluster together shots with similar visual content. We demonstrate how the use of codebooks of visual codewords (generated by a vector quantization process) represents an effective method to identify clusters containing shots with similar long-term consistency of chromatic compositions. The clusters, obtained by a single-link clustering algorithm, allow the further use of the well-known scene transition graph framework for logical story unit detection and pattern investigation.

Identifying Video Content Consistency by Vector Quantization

BENINI, Sergio;LEONARDI, Riccardo
2005-01-01

Abstract

Many post-production videos such as movies and cartoons present well structured story-lines organized in separated visual scenes. Accurate grouping of shots into these logical segments could lead to semantic indexing of scenes for interactive multimedia retrieval and video summaries. In this paper we introduce a novel shot-based analysis approach which aims to cluster together shots with similar visual content. We demonstrate how the use of codebooks of visual codewords (generated by a vector quantization process) represents an effective method to identify clusters containing shots with similar long-term consistency of chromatic compositions. The clusters, obtained by a single-link clustering algorithm, allow the further use of the well-known scene transition graph framework for logical story unit detection and pattern investigation.
2005
Proceedings of the 2005 Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS 2005)
Ateneo di appartenenza
PE6_11 Machine learning, statistical data processing and applications using signal processing (eg. speech, image, video)
PE7_7 Signal processing
Esperti anonimi
Inglese
no
2005 Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS 2005)
13-15 April 2005
Montreux, Switzerland
Internazionale
ELETTRONICO
UNICO
1
4
4
283990067X
Ecole Polytechnique Fédérale de Lausanne
Video indexing; Vector Quantization; Visual codewords
UE
open
Benini, Sergio; Xu, L. Q.; Leonardi, Riccardo
273
info:eu-repo/semantics/conferenceObject
3
4 Contributo in Atti di Convegno (Proceeding)::4.1 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
BXL_WIAMIS-2005_post-print.pdf

accesso aperto

Descrizione: BXL_WIAMIS-2005_post-print
Tipologia: Documento in Post-print
Licenza: Creative commons
Dimensione 454.83 kB
Formato Adobe PDF
454.83 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11379/14942
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact