Text this: Potential of multimodal large language models for data mining of medical images and free-text reports