Accurate localization of indoor high similarity scenes using visual SLAM combined with loop closure detection algorithm