A large-scale risk assessment and classification model for pneumococcus using Finnish national health data