Text this: Combining convolutional neural network with transformer to improve YOLOv7 for gas plume detection and segmentation in multibeam water column images