This paper presents a comparison between different techniques for audio classification into homogeneous segments of speech and music. The first method is based on Zero Crossing Rate and Bayesian Classification (ZB), and it is very simple from a computational point of view. The second approach uses a Multi Layer Perceptron network (MLP) and requires therefore more computations. The performance of the proposed algorithms has been evaluated in terms of misclassification errors and precision in music-speech change detection. Both the proposed algorithms give good results, even if the MLP shows the best performance.
AUDIO CLASSIFICATION IN SPEECH AND MUSIC: A COMPARISON OF DIFFERENT APPROACHES
BUGATTI, Alessandro;FLAMMINI, Alessandra;LEONARDI, Riccardo;MARIOLI, Daniele;MIGLIORATI, Pierangelo;
2001-01-01
Abstract
This paper presents a comparison between different techniques for audio classification into homogeneous segments of speech and music. The first method is based on Zero Crossing Rate and Bayesian Classification (ZB), and it is very simple from a computational point of view. The second approach uses a Multi Layer Perceptron network (MLP) and requires therefore more computations. The performance of the proposed algorithms has been evaluated in terms of misclassification errors and precision in music-speech change detection. Both the proposed algorithms give good results, even if the MLP shows the best performance.File in questo prodotto:
File | Dimensione | Formato | |
---|---|---|---|
BFLMMP_WIAMIS-2001_post-print.pdf
accesso aperto
Descrizione: BFLMMP_WIAMIS-2001_post-print
Tipologia:
Documento in Post-print
Licenza:
Creative commons
Dimensione
70.76 kB
Formato
Adobe PDF
|
70.76 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.