Text this: Multimodal individual emotion recognition with joint labeling based on integrated learning and clustering