Automotive DNN-Based Object Detection in the Presence of Lens Obstruction and Video Compression

IRIS

Recent advances in sensing, processing, machine learning, and communication technologies are accelerating assisted and automated functions development for commercial vehicles. Environmental perception sensor data streams are processed to generate a correct and complete situational awareness. It is of utmost importance to assess the robustness of the sensor data pipeline, particularly in the case of data degradation in a noisy and variable environment. Sensor data reduction and compression techniques are key for higher levels of driving automation, as there is an expectation that traditional automotive vehicle wired networks will not be able to support the needed sensor datarates (i.e. more than 10 perception sensors, including cameras, LiDARs, and RADARs, generating tens of Gb/s of data). This work proposes for the first time to consider video compression for camera data transmission on vehicle wired networks in the presence of highly noisy data, e.g. partially obstructed camera field of view. The effects are discussed in terms of machine learning vehicle detection accuracy drop, and also visualising how detection performance spatially varies on the frames using the recently introduced metric, the Spatial Recall Index (SRI). The presented parametric obstruction noise model is generated to emulate real-world patterns, whereas compression is based on the well-established AVC/H.264. While Deep Neural Networks’ (DNNs’) performance is stable with lossy compression (up to 70:1) of ‘ideal’ data, when noise is added there is a significant accuracy degradation, in the range of a 7%-90% decrease. The proposed compression and noise tuning of the DNN training improves the performance up to 35%, enhancing the noise and compression robustness of the system. However, in the presence of compression combined with extreme levels of noise (i.e. more than 80% of pixels affected), DNN performance significantly degrades, up to a 90% decrease, even with re-training. This issue needs to be carefully considered in the design phase of perception and communication networks used to transmit sensor data.

Automotive DNN-Based Object Detection in the Presence of Lens Obstruction and Video Compression

Baris G.;Li B.;Chan P. H.;Avizzano C. A.;Donzella V.

2025-01-01

Abstract

Recent advances in sensing, processing, machine learning, and communication technologies are accelerating assisted and automated functions development for commercial vehicles. Environmental perception sensor data streams are processed to generate a correct and complete situational awareness. It is of utmost importance to assess the robustness of the sensor data pipeline, particularly in the case of data degradation in a noisy and variable environment. Sensor data reduction and compression techniques are key for higher levels of driving automation, as there is an expectation that traditional automotive vehicle wired networks will not be able to support the needed sensor datarates (i.e. more than 10 perception sensors, including cameras, LiDARs, and RADARs, generating tens of Gb/s of data). This work proposes for the first time to consider video compression for camera data transmission on vehicle wired networks in the presence of highly noisy data, e.g. partially obstructed camera field of view. The effects are discussed in terms of machine learning vehicle detection accuracy drop, and also visualising how detection performance spatially varies on the frames using the recently introduced metric, the Spatial Recall Index (SRI). The presented parametric obstruction noise model is generated to emulate real-world patterns, whereas compression is based on the well-established AVC/H.264. While Deep Neural Networks’ (DNNs’) performance is stable with lossy compression (up to 70:1) of ‘ideal’ data, when noise is added there is a significant accuracy degradation, in the range of a 7%-90% decrease. The proposed compression and noise tuning of the DNN training improves the performance up to 35%, enhancing the noise and compression robustness of the system. However, in the presence of compression combined with extreme levels of noise (i.e. more than 80% of pixels affected), DNN performance significantly degrades, up to a 90% decrease, even with re-training. This issue needs to be carefully considered in the design phase of perception and communication networks used to transmit sensor data.

Scheda breve

Scheda completa

Scheda completa (DC)

Anno del prodotto

2025

Appare nelle tipologie:

1.1 Articolo su Rivista/Article

File in questo prodotto:

File	Dimensione	Formato
Automotive_DNN-Based_Object_Detection_in_the_Presence_of_Lens_Obstruction_and_Video_Compression.pdf embargo fino al 01/01/2031 Tipologia: PDF Editoriale Licenza: Copyright dell'editore Dimensione 2.57 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	2.57 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11382/587775

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

5

social impact