Overview of Finnish national patient data repository for research on medical risk assessment

The Kanta Patient Data Repository (PDR) contains healthcare data from the population of Finland for more than a decade. The repository is a continuously expanding real world dataset produced by many information systems and healthcare service providers. Kanta data has been available for secondary us...

Full description

Saved in:
Bibliographic Details
Main Authors: Viljami Männikkö, Klaus Förger, Henna Kujanen, Jani Tikkanen, Simo Antikainen, Joona Munukka
Format: Article
Language:English
Published: Finnish Social and Health Informatics Association 2024-10-01
Series:Finnish Journal of eHealth and eWelfare
Subjects:
Online Access:https://journal.fi/finjehew/article/view/146124
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846111769646333952
author Viljami Männikkö
Klaus Förger
Henna Kujanen
Jani Tikkanen
Simo Antikainen
Joona Munukka
author_facet Viljami Männikkö
Klaus Förger
Henna Kujanen
Jani Tikkanen
Simo Antikainen
Joona Munukka
author_sort Viljami Männikkö
collection DOAJ
description The Kanta Patient Data Repository (PDR) contains healthcare data from the population of Finland for more than a decade. The repository is a continuously expanding real world dataset produced by many information systems and healthcare service providers. Kanta data has been available for secondary uses such as scientific research since 2019. The data can be requested from the Finnish authority Findata. However, before a request has been accepted, it is difficult to assess if the accumulated data allows answering a specific research question. Publicly available descriptions of data structures in the Kanta PDR do not tell how much they are used in practice. This publication enables future data use cases by providing a view on the overall availability of types of structured health data in the Kanta PDR based on a sample of 96 200 medical histories of over 18-year-old patients. We conclude that the Kanta PDR is a promising source of real world data for development and evaluation of medical risk calculators within the Finnish population. The wide coverage of the Finnish population and timeliness of the data are its strengths as a source of research data also outside of Finnish context. However, the limitations on data availability in variable level need to be considered on a case-by-case basis. Main challenges in the use of data in the Kanta PDR are multiple code systems for laboratory results, short durations of recorded data for specific data types, and missing or very rarely used structured format e.g., in cases of tobacco and alcohol use.
format Article
id doaj-art-7b686811bdce46e9871a8a2f2287441f
institution Kabale University
issn 1798-0798
language English
publishDate 2024-10-01
publisher Finnish Social and Health Informatics Association
record_format Article
series Finnish Journal of eHealth and eWelfare
spelling doaj-art-7b686811bdce46e9871a8a2f2287441f2024-12-23T04:51:37ZengFinnish Social and Health Informatics AssociationFinnish Journal of eHealth and eWelfare1798-07982024-10-01163Overview of Finnish national patient data repository for research on medical risk assessmentViljami Männikkö0Klaus Förger1Henna Kujanen2Jani Tikkanen3Simo Antikainen4Joona Munukka5Tampere University (TUNI), Tampere; Atostek Oy, TampereAtostek Oy, TampereAtostek Oy, TampereOulu University Hospital, OuluAtostek Oy, TampereAtostek Oy, Tampere The Kanta Patient Data Repository (PDR) contains healthcare data from the population of Finland for more than a decade. The repository is a continuously expanding real world dataset produced by many information systems and healthcare service providers. Kanta data has been available for secondary uses such as scientific research since 2019. The data can be requested from the Finnish authority Findata. However, before a request has been accepted, it is difficult to assess if the accumulated data allows answering a specific research question. Publicly available descriptions of data structures in the Kanta PDR do not tell how much they are used in practice. This publication enables future data use cases by providing a view on the overall availability of types of structured health data in the Kanta PDR based on a sample of 96 200 medical histories of over 18-year-old patients. We conclude that the Kanta PDR is a promising source of real world data for development and evaluation of medical risk calculators within the Finnish population. The wide coverage of the Finnish population and timeliness of the data are its strengths as a source of research data also outside of Finnish context. However, the limitations on data availability in variable level need to be considered on a case-by-case basis. Main challenges in the use of data in the Kanta PDR are multiple code systems for laboratory results, short durations of recorded data for specific data types, and missing or very rarely used structured format e.g., in cases of tobacco and alcohol use. https://journal.fi/finjehew/article/view/146124big datahealth and wellness sectorhealth datarisk assessmentstatistics
spellingShingle Viljami Männikkö
Klaus Förger
Henna Kujanen
Jani Tikkanen
Simo Antikainen
Joona Munukka
Overview of Finnish national patient data repository for research on medical risk assessment
Finnish Journal of eHealth and eWelfare
big data
health and wellness sector
health data
risk assessment
statistics
title Overview of Finnish national patient data repository for research on medical risk assessment
title_full Overview of Finnish national patient data repository for research on medical risk assessment
title_fullStr Overview of Finnish national patient data repository for research on medical risk assessment
title_full_unstemmed Overview of Finnish national patient data repository for research on medical risk assessment
title_short Overview of Finnish national patient data repository for research on medical risk assessment
title_sort overview of finnish national patient data repository for research on medical risk assessment
topic big data
health and wellness sector
health data
risk assessment
statistics
url https://journal.fi/finjehew/article/view/146124
work_keys_str_mv AT viljamimannikko overviewoffinnishnationalpatientdatarepositoryforresearchonmedicalriskassessment
AT klausforger overviewoffinnishnationalpatientdatarepositoryforresearchonmedicalriskassessment
AT hennakujanen overviewoffinnishnationalpatientdatarepositoryforresearchonmedicalriskassessment
AT janitikkanen overviewoffinnishnationalpatientdatarepositoryforresearchonmedicalriskassessment
AT simoantikainen overviewoffinnishnationalpatientdatarepositoryforresearchonmedicalriskassessment
AT joonamunukka overviewoffinnishnationalpatientdatarepositoryforresearchonmedicalriskassessment