A Pilot Study on Detecting Violence in Videos Fusing Proxy Models

Cerutti F.
2019-01-01

Abstract

We propose an approach to detecting violence in CCTV feeds that is robust to new datasets and situations. The approach breaks with the traditional assumption that large amounts of representative training data are available. Detecting violence in CCTV feeds is a hard problem, and solving it is of paramount importance for effective situational understanding. Violence spans a broad spectrum of activities, from abuse to fighting to road accidents, and can therefore take place in completely different environments: public buildings, underground stations, or roads, by day or by night. This breadth of activities and environments makes violence detection a hard classification task for machines. We show that video feeds have specific, detectable, and measurable features that correlate with violence (among other things) and that, by fusing such features with semantic knowledge, we can in principle provide estimates of which video sequences correlate with violence.

Use this identifier to cite or link to this document: https://hdl.handle.net/11379/529013