Text this: Analysis of super-long and sparse feature in pseudo-random sequence based on similarity