In this contribution, a fast search motion estimation algorithm for H.264/AVC SVC (scalable video coding) [2] base layer with hierarchical B-frame structure for temporal decomposition is presented and compared with fast search motion estimation algorithm in JSVM software [1], that is the reference software for H.264/AVC SVC. The proposed technique is a block-matching based motion estimation algorithm working in two steps, called Coarse search and Fine search. The Coarse search is performed for each frame in display order, and for each 16x16 macroblock chooses the best motion vector at half pel accuracy. Fine search is performed for each frame in encoding order and finds the best prediction for each block type, reference frame and direction, choosing the best motion vector at quarter pel accuracy using R-D optimization. Both Coarse and Fine Search test 3 spatial and 3 temporal predictors, and add to the best one a set of updates. The spatial predictors for the fine search are the result of the Fine search already performed for the previous blocks, while the temporal predictors are the results of Coarse Search scaled by an appropriate coefficient. This scaling is performed since in the Coarse search each picture is always estimated with respect to the previous one, while in the Fine Search the temporal distance between the current picture and its references depend on the temporaldecomposition level. Moreover in Fine search the number and the value of the updates tested depend on the distance between the current picture and its references. These sets of updates are the result of a huge number of simulations on test sequences with different motion features. The proposed algorithm has been tested on the set of test sequences proposed by JVT group, using different resolutions and temporal decomposition structures. The proposed method can reduce the average coding complexity in terms of motion vector tested from 70 to 90 percent with respect to the Fast-ME JVT method, while the quality loss depends on the GOP dimension, that is the most critical parameter for the performance of the algorithm. In fact for small GOP dimensions (4 or 8) the algorithm has the same quality at equal bit-rate respect to the Fast-ME JVT method for almost all the sequences and better quality for some sequences. For medium and long GOP dimensions (16-32) the algorithm has a quality loss lower than 0.5 dB for all the tested sequences.

New Fast Search Algorithm for Base Layer of H.264 Scalable Video Coding Extension

LIMA, LIVIO;LEONARDI, Riccardo
2007-01-01

Abstract

In this contribution, a fast search motion estimation algorithm for H.264/AVC SVC (scalable video coding) [2] base layer with hierarchical B-frame structure for temporal decomposition is presented and compared with fast search motion estimation algorithm in JSVM software [1], that is the reference software for H.264/AVC SVC. The proposed technique is a block-matching based motion estimation algorithm working in two steps, called Coarse search and Fine search. The Coarse search is performed for each frame in display order, and for each 16x16 macroblock chooses the best motion vector at half pel accuracy. Fine search is performed for each frame in encoding order and finds the best prediction for each block type, reference frame and direction, choosing the best motion vector at quarter pel accuracy using R-D optimization. Both Coarse and Fine Search test 3 spatial and 3 temporal predictors, and add to the best one a set of updates. The spatial predictors for the fine search are the result of the Fine search already performed for the previous blocks, while the temporal predictors are the results of Coarse Search scaled by an appropriate coefficient. This scaling is performed since in the Coarse search each picture is always estimated with respect to the previous one, while in the Fine Search the temporal distance between the current picture and its references depend on the temporaldecomposition level. Moreover in Fine search the number and the value of the updates tested depend on the distance between the current picture and its references. These sets of updates are the result of a huge number of simulations on test sequences with different motion features. The proposed algorithm has been tested on the set of test sequences proposed by JVT group, using different resolutions and temporal decomposition structures. The proposed method can reduce the average coding complexity in terms of motion vector tested from 70 to 90 percent with respect to the Fast-ME JVT method, while the quality loss depends on the GOP dimension, that is the most critical parameter for the performance of the algorithm. In fact for small GOP dimensions (4 or 8) the algorithm has the same quality at equal bit-rate respect to the Fast-ME JVT method for almost all the sequences and better quality for some sequences. For medium and long GOP dimensions (16-32) the algorithm has a quality loss lower than 0.5 dB for all the tested sequences.
File in questo prodotto:
File Dimensione Formato  
LAPL_DCC-2007_pre-print.pdf

accesso aperto

Descrizione: LAPL_DCC-2007_pre-print
Tipologia: Documento in Pre-print
Licenza: Creative commons
Dimensione 37.57 kB
Formato Adobe PDF
37.57 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11379/14939
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 1
social impact