Preprint / Version 1

Improving Safety in Autonomous Driving with the Use of AI for Object Detection Prediction

##article.authors##

  • Shahbaz Satti Polygence

DOI:

https://doi.org/10.58445/rars.2131

Keywords:

Machine Learning, Artificial Intelligence, Autonomous Vehicles, YOLO

Abstract

In the last decade, object detection and machine learning-based algorithms have enhanced significantly, making self-driving vehicles a reality rather than a vision. Our research explores the advancements in machine learning-based object detection methods and their application to autonomous vehicle systems, specifically using the YOLO (You Only Look Once) algorithm. We begin with an overview of the YOLO algorithm, covering the YOLO architecture and comparing YOLO to alternatives like SSD and LiDAR. We then describe our case study in which we trained our own YOLO model using a custom dataset. Next, we analyze the results from the trained YOLO model to make conclusions about the YOLO algorithm. After conducting a literature review of dozens of old experiments to compare the YOLO alternatives to YOLO itself, we presented how YOLO is a better option for real-time driving scenarios in comparison to SSD (Single Shot Detector) and LiDAR (Light Detection and Ranging). In our case study of training a YOLOv8 model with our manually crafted dataset, the object detection accuracy for the model went from 20% to about 90% in only 50 epochs. We concluded that our research has highlighted YOLO’s power and high speed when it comes to driverless vehicle object detection, but we also acknowledged the room available for future improvements to make roads safer.

References

Redmon et al. (2016). You Only Look Once: Unified, Real-Time Object Detection. CVPR.

Liu et al. (2016). SSD: Single Shot MultiBox Detector. ECCV.

Ren et al. (2015). Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. NIPS.

Redmon et al. (2017). YOLO9000: Better, Faster, Stronger. CVPR.

Huang et al. (2017). Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors. CVPR.

Lee et al. (2017). DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents. CVPR.

"YOLOv8: A New Era for Real-Time Object Detection" by Alexey Bochkovskiy, Chien-Yao Wang, and Hong-Yuan Mark Liao (2022) - [arXiv:2207.02696]

"SSD: Single Shot MultiBox Detector" by Liu et al. (2016) - [arXiv:1512.023

"Object Detection Benchmarking: A Comprehensive Review" by Zhang et al. (2022) - [arXiv:2204.03590]

Bochkovskiy, Alexey, et al. "YOLOv8: A New Era for Real-Time Object Detection." GitHub, 2022.

Rosebrock, Adrian. "YOLOv8 Tutorial: Real-Time Object Detection." PyImageSearch, 2022.

Bochkovskiy, Alexey, et al. "YOLOv8: A New Era for Real-Time Object Detection." arXiv, 2022.

Kumar, et al. "A Comprehensive Review of YOLO Variants." International Journal of Computer Vision, vol. 128, no. 10, 2022, pp. 2538-2556.

Lin, Tsung-Yi, et al. "Microsoft COCO: Common Objects in Context." Proceedings of the European Conference on Computer Vision, Springer, Cham, 2014, pp. 740-755, doi: 10.1007/978-3-319-10602-1_48

Zhang, Y., et al. "Deep Learning for LiDAR Point Clouds: A Review." IEEE Transactions on Neural Networks and Learning Systems, vol. 33, no. 1, 2022, pp. 3-14.

Chen, X., et al. "LiDAR-based 3D Object Detection for Autonomous Vehicles." IEEE Transactions on Intelligent Transportation Systems, vol. 20, no. 10, 2019, pp. 2732-2742.

Templeton, Brad. “Former Head of Tesla AI Explains Why They’ve Removed Sensors; Others Differ.” Forbes, 1 Nov. 2022, www.forbes.com/sites/bradtempleton/2022/10/31/former-head-of-tesla-ai-explains-why-theyve-removed-sensors-others-differ.

Geiger, A., et al. "LiDAR Point Clouds for Object Detection and Tracking." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019, pp. 10153-10162.

Murphy, Mike. “128-laser LiDAR Sensor Significantly Sharpens Autonomous Cars’ Vision.” New Atlas, 4 Dec. 2017, newatlas.com/velodyne-lidar-vls-128-sensor/52453.

Liu, Wei, et al. "SSD: Single Shot MultiBox Detector." Proceedings of the European Conference on Computer Vision, Springer, 2016, pp. 21-37.

Redmon, Joseph, et al. "You Only Look Once: Unified, Real-Time Object Detection." Journal of Computer Vision, vol. 120, no. 1, 2016, pp. 1-14.

Redmon, Joseph, and Ali Farhadi. "YOLO9000: Better, Faster, Stronger." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 6517-6525.

Lin, Tsung-Yi, et al. Feature Pyramid Networks for Object Detection, 19 Apr. 2017, arxiv.org/pdf/1612.03144.

Keita, Zoumana. “Yolo Object Detection Explained: A Beginner’s Guide.” DataCamp, DataCamp, 28 Sept. 2024, www.datacamp.com/blog/yolo-object-detection-explained.

Abdullah, S. M., et al. "Self-Driving Cars: A Survey of Major Developments and Challenges." IEEE Transactions on Intelligent Transportation Systems, vol. 23, no. 4, 2022, pp. 3133-3146.

Boesch, Gaudenz. “Yolov8: A Complete Guide [2025 Update].” Viso.Ai, 17 Oct. 2024, viso.ai/deep-learning/yolov8-guide/#:~:text=The%20neck%20merges%20these%20feature,%2Dshot%20Detector%20(SSD).

Lemay, Andréanne. "Kidney Recognition in CT Using YOLOv3." arXiv, 3 Oct. 2019, https://arxiv.org/abs/1910.01268.

Lawal, M.O. Tomato detection based on modified YOLOv3 framework. Sci Rep 11, 1447 (2021). https://doi.org/10.1038/s41598-021-81216-5.

Ahmed, Imran et al. “A deep learning-based social distance monitoring framework for COVID-19.” Sustainable cities and society vol. 65 (2021): 102571. https://doi.org/10.1016/j.scs.2020.102571.

Zhao, S., Hao, G., Zhang, Y., & Wang, S. (2021). “A real-time classification and detection method for mutton parts based on single shot multi-box detector.” Journal of Food Process Engineering, 44(8), e13749. https://doi.org/10.1111/jfpe.13749.

Downloads

Posted

2025-01-08