Selective Auditory Attention Detection Using Combined Transformer and Convolutional Graph Neural Networks