MDPI, World Electric Vehicle Journal, 15(1), 20, 2024
DOI: 10.3390/wevj15010020
The pursuit of autonomous driving relies on developing perception systems capable of making accurate, robust, and rapid decisions to interpret the driving environment effectively. At the core of these systems, object detection is crucial for understanding the environment. While 2D object detection and classification have advanced significantly with the advent of deep learning (DL) in computer vision (CV) applications, they fall short of providing the depth information that is essential for comprehending driving environments. Consequently, 3D object detection becomes a cornerstone for autonomous driving and robotics, offering precise estimates of object locations and enhancing environmental comprehension. The CV community's growing interest in 3D object detection is fueled by the evolution of DL models, including Convolutional Neural Networks (CNNs) and Transformer networks. Despite these advances, challenges such as varying object scales, limited 3D sensor data, and occlusions persist in 3D object detection. To address these challenges, researchers are exploring multimodal techniques that combine information from multiple sensors, such as cameras, radar, and LiDAR, to enhance the performance of perception systems. This survey provides an exhaustive review of multimodal fusion-based 3D object detection methods, focusing on CNN- and Transformer-based models. It underscores the necessity of equipping fully autonomous vehicles with diverse sensors to ensure robust and reliable operation. The survey examines the advantages and drawbacks of camera, LiDAR, and radar sensors, summarizes autonomous driving datasets, and reviews the latest advances in multimodal fusion-based methods. It concludes by highlighting ongoing challenges, open issues, and potential directions for future research.
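To make the camera–LiDAR fusion idea concrete, the sketch below shows the geometric step that many fusion pipelines share: projecting LiDAR points into the image plane so that image features can be associated with 3D points. This is an illustrative sketch, not a method from the survey; the projection matrix, image dimensions, and function name are hypothetical placeholders, and real pipelines read calibrated matrices from dataset calibration files (e.g., KITTI's).

import numpy as np

# Hypothetical 3x4 camera projection matrix P = K [R | t] (placeholder values;
# in practice this comes from the dataset's sensor calibration).
P = np.array([
    [700.0,   0.0, 620.0, 0.0],
    [  0.0, 700.0, 190.0, 0.0],
    [  0.0,   0.0,   1.0, 0.0],
])

def project_lidar_to_image(points_xyz, P, img_w, img_h):
    """Project Nx3 LiDAR points (already in the camera frame) to pixels.

    Returns (u, v) pixel coordinates and a mask of points that lie in front
    of the camera and inside the image bounds.
    """
    n = points_xyz.shape[0]
    pts_h = np.hstack([points_xyz, np.ones((n, 1))])    # homogeneous, Nx4
    uvw = pts_h @ P.T                                   # Nx3: (u*w, v*w, w)
    in_front = uvw[:, 2] > 1e-6                         # positive depth only
    uv = uvw[:, :2] / np.clip(uvw[:, 2:3], 1e-6, None)  # perspective divide
    in_image = (
        in_front
        & (uv[:, 0] >= 0) & (uv[:, 0] < img_w)
        & (uv[:, 1] >= 0) & (uv[:, 1] < img_h)
    )
    return uv, in_image

# Toy usage: two points in front of the camera, one behind it.
points = np.array([[0.0, 0.0, 5.0], [1.0, 0.5, 10.0], [-2.0, 0.0, -3.0]])
uv, mask = project_lidar_to_image(points, P, img_w=1242, img_h=375)
print(uv[mask])  # pixel coordinates of the points visible to the camera

Once each visible point has a pixel coordinate, an early-fusion method can sample image features (or raw RGB) at those pixels and concatenate them with the point's geometric features before detection; late-fusion methods instead combine per-sensor detections downstream.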