Specialized curricula for training vision language models in retinal image analysis