Multi-stage framework using transformer models, feature fusion and ensemble learning for enhancing eye disease classification