Text this: Large language model-based multimodal system for detecting and grading ocular surface diseases from smartphone images