Text this: Offline prompt reinforcement learning method based on feature extraction