Text this: Fast deep reinforcement learning anti-jamming algorithm based on similar sample generation