Text this: Driving-Related Cognitive Abilities Prediction Based on Transformer’s Multimodal Fusion Framework