Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements
Given close relationships between ocular structure and ophthalmic disease, ocular biometry measurements (including axial length, lens thickness, anterior chamber depth, and keratometry values) may be leveraged as features in the prediction of eye diseases. However, ocular biometry measurements are o...
Saved in:
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2025-01-01
|
Series: | Frontiers in Artificial Intelligence |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/frai.2024.1428716/full |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841558655871221760 |
---|---|
author | Anish Salvi Leo Arnal Kevin Ly Gabriel Ferreira Sophia Y. Wang Sophia Y. Wang Curtis Langlotz Curtis Langlotz Vinit Mahajan Vinit Mahajan Chase A. Ludwig Chase A. Ludwig |
author_facet | Anish Salvi Leo Arnal Kevin Ly Gabriel Ferreira Sophia Y. Wang Sophia Y. Wang Curtis Langlotz Curtis Langlotz Vinit Mahajan Vinit Mahajan Chase A. Ludwig Chase A. Ludwig |
author_sort | Anish Salvi |
collection | DOAJ |
description | Given close relationships between ocular structure and ophthalmic disease, ocular biometry measurements (including axial length, lens thickness, anterior chamber depth, and keratometry values) may be leveraged as features in the prediction of eye diseases. However, ocular biometry measurements are often stored as PDFs rather than as structured data in electronic health records. Thus, time-consuming and laborious manual data entry is required for using biometry data as a disease predictor. Herein, we used two separate models, PaddleOCR and Gemini, to extract eye specific biometric measurements from 2,965 Lenstar, 104 IOL Master 500, and 3,616 IOL Master 700 optical biometry reports. For each patient eye, our text extraction pipeline, referred to as Ocular Biometry OCR, involves 1) cropping the report to the biometric data, 2) extracting the text via the optical character recognition model, 3) post-processing the metrics and values into key value pairs, 4) correcting erroneous angles within the pairs, 5) computing the number of errors or missing values, and 6) selecting the window specific results with fewest errors or missing values. To ensure the models’ predictions could be put into a machine learning-ready format, artifacts were removed from categorical text data through manual modification where necessary. Performance was evaluated by scoring PaddleOCR and Gemini results. In the absence of ground truth, higher scoring indicated greater inter-model reliability, assuming an equal value between models indicated an accurate result. The detection scores, measuring the number of valid values (i.e., not missing or erroneous), were Lenstar: 0.990, IOLM 500: 1.000, and IOLM 700: 0.998. The similarity scores, measuring the number of equal values, were Lenstar: 0.995, IOLM 500: 0.999, and IOLM 700: 0.999. The agreement scores, combining detection and similarity scores, were Lenstar: 0.985, IOLM 500: 0.999, and IOLM 700: 0.998. IOLM 500 was annotated for ground truths; in this case, higher scoring indicated greater model-to-annotator accuracy. PaddleOCR-to-Annotator achieved scores of detection: 1.000, similarity: 0.999, and agreement: 0.999. Gemini-to-Annotator achieved scores of detection: 1.000, similarity: 1.000, and agreement: 1.000. Scores range from 0 to 1. While PaddleOCR and Gemini demonstrated high agreement, PaddleOCR offered slightly better performance upon reviewing quantitative and qualitative results. |
format | Article |
id | doaj-art-c80ef81229bd44cabc7fac7c53169c93 |
institution | Kabale University |
issn | 2624-8212 |
language | English |
publishDate | 2025-01-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Artificial Intelligence |
spelling | doaj-art-c80ef81229bd44cabc7fac7c53169c932025-01-06T06:59:39ZengFrontiers Media S.A.Frontiers in Artificial Intelligence2624-82122025-01-01710.3389/frai.2024.14287161428716Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurementsAnish Salvi0Leo Arnal1Kevin Ly2Gabriel Ferreira3Sophia Y. Wang4Sophia Y. Wang5Curtis Langlotz6Curtis Langlotz7Vinit Mahajan8Vinit Mahajan9Chase A. Ludwig10Chase A. Ludwig11School of Medicine, Stanford University, Palo Alto, CA, United StatesSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Medicine, Chicago Medical School at Rosalind Franklin University, North Chicago, IL, United StatesDepartment of Medicine, Varzea Grande University Center, Várzea Grande, BrazilSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, CA, United StatesSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Radiology, Stanford University, Palo Alto, CA, United StatesSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, CA, United StatesSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, CA, United StatesGiven close relationships between ocular structure and ophthalmic disease, ocular biometry measurements (including axial length, lens thickness, anterior chamber depth, and keratometry values) may be leveraged as features in the prediction of eye diseases. However, ocular biometry measurements are often stored as PDFs rather than as structured data in electronic health records. Thus, time-consuming and laborious manual data entry is required for using biometry data as a disease predictor. Herein, we used two separate models, PaddleOCR and Gemini, to extract eye specific biometric measurements from 2,965 Lenstar, 104 IOL Master 500, and 3,616 IOL Master 700 optical biometry reports. For each patient eye, our text extraction pipeline, referred to as Ocular Biometry OCR, involves 1) cropping the report to the biometric data, 2) extracting the text via the optical character recognition model, 3) post-processing the metrics and values into key value pairs, 4) correcting erroneous angles within the pairs, 5) computing the number of errors or missing values, and 6) selecting the window specific results with fewest errors or missing values. To ensure the models’ predictions could be put into a machine learning-ready format, artifacts were removed from categorical text data through manual modification where necessary. Performance was evaluated by scoring PaddleOCR and Gemini results. In the absence of ground truth, higher scoring indicated greater inter-model reliability, assuming an equal value between models indicated an accurate result. The detection scores, measuring the number of valid values (i.e., not missing or erroneous), were Lenstar: 0.990, IOLM 500: 1.000, and IOLM 700: 0.998. The similarity scores, measuring the number of equal values, were Lenstar: 0.995, IOLM 500: 0.999, and IOLM 700: 0.999. The agreement scores, combining detection and similarity scores, were Lenstar: 0.985, IOLM 500: 0.999, and IOLM 700: 0.998. IOLM 500 was annotated for ground truths; in this case, higher scoring indicated greater model-to-annotator accuracy. PaddleOCR-to-Annotator achieved scores of detection: 1.000, similarity: 0.999, and agreement: 0.999. Gemini-to-Annotator achieved scores of detection: 1.000, similarity: 1.000, and agreement: 1.000. Scores range from 0 to 1. While PaddleOCR and Gemini demonstrated high agreement, PaddleOCR offered slightly better performance upon reviewing quantitative and qualitative results.https://www.frontiersin.org/articles/10.3389/frai.2024.1428716/fulloptical character recognitionPaddleOCRGeminitext extractionLenstarIOL master 500 |
spellingShingle | Anish Salvi Leo Arnal Kevin Ly Gabriel Ferreira Sophia Y. Wang Sophia Y. Wang Curtis Langlotz Curtis Langlotz Vinit Mahajan Vinit Mahajan Chase A. Ludwig Chase A. Ludwig Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements Frontiers in Artificial Intelligence optical character recognition PaddleOCR Gemini text extraction Lenstar IOL master 500 |
title | Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements |
title_full | Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements |
title_fullStr | Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements |
title_full_unstemmed | Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements |
title_short | Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements |
title_sort | ocular biometry ocr a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements |
topic | optical character recognition PaddleOCR Gemini text extraction Lenstar IOL master 500 |
url | https://www.frontiersin.org/articles/10.3389/frai.2024.1428716/full |
work_keys_str_mv | AT anishsalvi ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT leoarnal ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT kevinly ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT gabrielferreira ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT sophiaywang ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT sophiaywang ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT curtislanglotz ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT curtislanglotz ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT vinitmahajan ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT vinitmahajan ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT chasealudwig ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT chasealudwig ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements |