Intermedia Synchronization for Videoconference over IP

Riccardo LEONARDI (Methodology); Pierangelo MIGLIORATI (Member of the Collaboration Group)
1999-01-01

Abstract

This paper addresses audio and video synchronization for real-time audio-visual communication over IP-based networks. Starting from the real-time transport protocol (RTP) specifications (Schulzrinne, 1995), it describes in detail how to recover a reliable absolute time reference for the audio and video signals from header information in RTP and RTP control protocol (RTCP) packets. This temporal information allows both media to be synchronized within acceptable perceptual bounds for reconstruction at any receiver, possibly in a multi-point videoconference. Synchronization is achieved even if some audio or video packets fail to reach their destination, or if duplicate copies of such packets arrive. The proposed solution does not take into account the different delays that may be introduced between the different communication points, which may eventually result in jitter in the reconstruction of the audio-visual streams between different receivers. Each receiver handles its reconstruction independently of any transmission or processing delay. To ensure better quality of the reconstructed material, priority is given to audio. If the audio stream is ahead of the video stream, the receiver simply discards video packets. Conversely, when video is ahead of audio, the video decoding stage is suspended until the audio information arrives. Experimental simulations over a LAN have demonstrated the validity of the proposed approach.
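The two mechanisms named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It relies on one known fact, that an RTCP sender report pairs an NTP wall-clock timestamp with the RTP timestamp of the same instant. Everything else is an assumption made for illustration: the names `SenderReport`, `absolute_time` and `video_action`, the 80 ms tolerance, and the exact comparison rule.

```python
from dataclasses import dataclass

NTP_FRAC = 1 << 32  # the NTP timestamp has a 32-bit fractional part


@dataclass(frozen=True)
class SenderReport:
    """The (NTP, RTP) timestamp pair carried by one RTCP sender report."""
    ntp_seconds: int     # integer part of the NTP timestamp
    ntp_fraction: int    # fractional part, in units of 1/2**32 s
    rtp_timestamp: int   # the same instant, in the stream's RTP clock units


def absolute_time(rtp_ts: int, sr: SenderReport, clock_rate: int) -> float:
    """Map a 32-bit RTP timestamp to sender wall-clock seconds."""
    # Signed difference modulo 2**32, so RTP clock wrap-around is handled.
    delta = (rtp_ts - sr.rtp_timestamp) & 0xFFFFFFFF
    if delta >= 0x80000000:
        delta -= 1 << 32
    sr_time = sr.ntp_seconds + sr.ntp_fraction / NTP_FRAC
    return sr_time + delta / clock_rate


def video_action(audio_time: float, video_time: float,
                 tolerance: float = 0.08) -> str:
    """Audio-priority policy; the tolerance value is an assumed bound."""
    skew = video_time - audio_time
    if skew < -tolerance:
        return "discard"  # audio is ahead: drop the late video packet
    if skew > tolerance:
        return "wait"     # video is ahead: hold decoding until audio arrives
    return "play"


# Hypothetical streams: 8 kHz audio clock, 90 kHz video clock.
audio_sr = SenderReport(3_000_000_000, 0, 160_000)
video_sr = SenderReport(3_000_000_000, 1 << 31, 900_000)
audio_t = absolute_time(168_000, audio_sr, 8_000)
video_t = absolute_time(945_000, video_sr, 90_000)
decision = video_action(audio_t, video_t)
```

Because each stream is mapped to the sender's wall clock through its own sender report, the comparison works even though audio and video use different RTP clock rates and random initial timestamps. A lost or duplicated packet does not affect the mapping, since every packet carries its own timestamp.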
Use this identifier to cite or link to this document: https://hdl.handle.net/11379/27291