Semi-supervised learning by constructing query-document heterogeneous information network