Text this: Assessing Similarity Between Datasets Using Vector Representations