Multi-label text classification via secondary use of large clinical real-world data sets