Text this: Lightweight Transformer traffic scene semantic segmentation algorithm integrating multi-scale depth convolution