Fusing Visual Quantified Features for Heterogeneous Traffic Flow Prediction