Text this: Music source feature extraction based on improved attention mechanism and phase feature