Leveraging Multimodal Large Language Models (MLLMs) for Enhanced Object Detection and Scene Understanding in Thermal Images for Autonomous Driving Systems
The integration of thermal imaging data with multimodal large language models (MLLMs) offers promising advancements for enhancing the safety and functionality of autonomous driving systems (ADS) and intelligent transportation systems (ITS). This study investigates the potential of MLLMs, specificall...
Saved in:
| Main Authors: | Huthaifa I. Ashqar, Taqwa I. Alhadidi, Mohammed Elhenawy, Nour O. Khanfar |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2024-10-01
|
| Series: | Automation |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2673-4052/5/4/29 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
A Survey on MLLMs in Education: Application and Future Directions
by: Weicheng Xing, et al.
Published: (2024-12-01) -
Multimodal Trajectory Prediction for Diverse Vehicle Types in Autonomous Driving with Heterogeneous Data and Physical Constraints
by: Maoning Ge, et al.
Published: (2024-11-01) -
An Algorithmic Study of Transformer-Based Road Scene Segmentation in Autonomous Driving
by: Hao Cui, et al.
Published: (2024-11-01) -
EdgeNet: An End-to-End Deep Neural Network Pretrained with Synthetic Data for a Real-World Autonomous Driving Application
by: Leanne Miller, et al.
Published: (2024-12-01) -
GPT-4 enhanced multimodal grounding for autonomous driving: Leveraging cross-modal attention with large language models
by: Haicheng Liao, et al.
Published: (2024-12-01)