InVDriver: Intra-instance aware vectorized query-based autonomous driving transformer

End-to-end autonomous driving, with its holistic optimization capabilities, has gained increasing traction in academia and industry. Vectorized representations, which preserve instance-level topological information while reducing computational overhead, have emerged as promising paradigms. However,...

Full description

Saved in:
Bibliographic Details
Main Authors: Bo Zhang, Heye Huang, Chunyang Liu, Yaqin Zhang, Zhenhua Xu
Format: Article
Language:English
Published: Tsinghua University Press 2025-06-01
Series:Journal of Intelligent and Connected Vehicles
Subjects:
Online Access:https://www.sciopen.com/article/10.26599/JICV.2025.9210060
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:End-to-end autonomous driving, with its holistic optimization capabilities, has gained increasing traction in academia and industry. Vectorized representations, which preserve instance-level topological information while reducing computational overhead, have emerged as promising paradigms. However, existing vectorized query-based frameworks often overlook the inherent spatial correlations among intra-instance points, resulting in geometrically inconsistent outputs (e.g., fragmented HD map elements or oscillatory trajectories). To address these limitations, we propose intra-instance vectorized driving transformer (InVDriver), a novel vectorized query-based system that systematically models intra-instance spatial dependencies through masked self-attention layers, thereby enhancing planning accuracy and trajectory smoothness. Across all core modules, i.e., perception, prediction, and planning, InVDriver incorporates masked self-attention mechanisms that restrict attention to intra-instance point interactions, enabling coordinated refinement of structural elements while suppressing irrelevant inter-instance noise. The experimental results on the nuScenes benchmark demonstrate that InVDriver achieves state-of-the-art performance, surpassing prior methods in both accuracy and safety, while maintaining high computational efficiency.
ISSN:2399-9802