Text this: A multimodal framework for pepper diseases and pests detection