Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements

Given close relationships between ocular structure and ophthalmic disease, ocular biometry measurements (including axial length, lens thickness, anterior chamber depth, and keratometry values) may be leveraged as features in the prediction of eye diseases. However, ocular biometry measurements are o...

Full description

Saved in:
Bibliographic Details
Main Authors: Anish Salvi, Leo Arnal, Kevin Ly, Gabriel Ferreira, Sophia Y. Wang, Curtis Langlotz, Vinit Mahajan, Chase A. Ludwig
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-01-01
Series:Frontiers in Artificial Intelligence
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/frai.2024.1428716/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841558655871221760
author Anish Salvi
Leo Arnal
Kevin Ly
Gabriel Ferreira
Sophia Y. Wang
Sophia Y. Wang
Curtis Langlotz
Curtis Langlotz
Vinit Mahajan
Vinit Mahajan
Chase A. Ludwig
Chase A. Ludwig
author_facet Anish Salvi
Leo Arnal
Kevin Ly
Gabriel Ferreira
Sophia Y. Wang
Sophia Y. Wang
Curtis Langlotz
Curtis Langlotz
Vinit Mahajan
Vinit Mahajan
Chase A. Ludwig
Chase A. Ludwig
author_sort Anish Salvi
collection DOAJ
description Given close relationships between ocular structure and ophthalmic disease, ocular biometry measurements (including axial length, lens thickness, anterior chamber depth, and keratometry values) may be leveraged as features in the prediction of eye diseases. However, ocular biometry measurements are often stored as PDFs rather than as structured data in electronic health records. Thus, time-consuming and laborious manual data entry is required for using biometry data as a disease predictor. Herein, we used two separate models, PaddleOCR and Gemini, to extract eye specific biometric measurements from 2,965 Lenstar, 104 IOL Master 500, and 3,616 IOL Master 700 optical biometry reports. For each patient eye, our text extraction pipeline, referred to as Ocular Biometry OCR, involves 1) cropping the report to the biometric data, 2) extracting the text via the optical character recognition model, 3) post-processing the metrics and values into key value pairs, 4) correcting erroneous angles within the pairs, 5) computing the number of errors or missing values, and 6) selecting the window specific results with fewest errors or missing values. To ensure the models’ predictions could be put into a machine learning-ready format, artifacts were removed from categorical text data through manual modification where necessary. Performance was evaluated by scoring PaddleOCR and Gemini results. In the absence of ground truth, higher scoring indicated greater inter-model reliability, assuming an equal value between models indicated an accurate result. The detection scores, measuring the number of valid values (i.e., not missing or erroneous), were Lenstar: 0.990, IOLM 500: 1.000, and IOLM 700: 0.998. The similarity scores, measuring the number of equal values, were Lenstar: 0.995, IOLM 500: 0.999, and IOLM 700: 0.999. The agreement scores, combining detection and similarity scores, were Lenstar: 0.985, IOLM 500: 0.999, and IOLM 700: 0.998. IOLM 500 was annotated for ground truths; in this case, higher scoring indicated greater model-to-annotator accuracy. PaddleOCR-to-Annotator achieved scores of detection: 1.000, similarity: 0.999, and agreement: 0.999. Gemini-to-Annotator achieved scores of detection: 1.000, similarity: 1.000, and agreement: 1.000. Scores range from 0 to 1. While PaddleOCR and Gemini demonstrated high agreement, PaddleOCR offered slightly better performance upon reviewing quantitative and qualitative results.
format Article
id doaj-art-c80ef81229bd44cabc7fac7c53169c93
institution Kabale University
issn 2624-8212
language English
publishDate 2025-01-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Artificial Intelligence
spelling doaj-art-c80ef81229bd44cabc7fac7c53169c932025-01-06T06:59:39ZengFrontiers Media S.A.Frontiers in Artificial Intelligence2624-82122025-01-01710.3389/frai.2024.14287161428716Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurementsAnish Salvi0Leo Arnal1Kevin Ly2Gabriel Ferreira3Sophia Y. Wang4Sophia Y. Wang5Curtis Langlotz6Curtis Langlotz7Vinit Mahajan8Vinit Mahajan9Chase A. Ludwig10Chase A. Ludwig11School of Medicine, Stanford University, Palo Alto, CA, United StatesSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Medicine, Chicago Medical School at Rosalind Franklin University, North Chicago, IL, United StatesDepartment of Medicine, Varzea Grande University Center, Várzea Grande, BrazilSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, CA, United StatesSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Radiology, Stanford University, Palo Alto, CA, United StatesSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, CA, United StatesSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, CA, United StatesGiven close relationships between ocular structure and ophthalmic disease, ocular biometry measurements (including axial length, lens thickness, anterior chamber depth, and keratometry values) may be leveraged as features in the prediction of eye diseases. However, ocular biometry measurements are often stored as PDFs rather than as structured data in electronic health records. Thus, time-consuming and laborious manual data entry is required for using biometry data as a disease predictor. Herein, we used two separate models, PaddleOCR and Gemini, to extract eye specific biometric measurements from 2,965 Lenstar, 104 IOL Master 500, and 3,616 IOL Master 700 optical biometry reports. For each patient eye, our text extraction pipeline, referred to as Ocular Biometry OCR, involves 1) cropping the report to the biometric data, 2) extracting the text via the optical character recognition model, 3) post-processing the metrics and values into key value pairs, 4) correcting erroneous angles within the pairs, 5) computing the number of errors or missing values, and 6) selecting the window specific results with fewest errors or missing values. To ensure the models’ predictions could be put into a machine learning-ready format, artifacts were removed from categorical text data through manual modification where necessary. Performance was evaluated by scoring PaddleOCR and Gemini results. In the absence of ground truth, higher scoring indicated greater inter-model reliability, assuming an equal value between models indicated an accurate result. The detection scores, measuring the number of valid values (i.e., not missing or erroneous), were Lenstar: 0.990, IOLM 500: 1.000, and IOLM 700: 0.998. The similarity scores, measuring the number of equal values, were Lenstar: 0.995, IOLM 500: 0.999, and IOLM 700: 0.999. The agreement scores, combining detection and similarity scores, were Lenstar: 0.985, IOLM 500: 0.999, and IOLM 700: 0.998. IOLM 500 was annotated for ground truths; in this case, higher scoring indicated greater model-to-annotator accuracy. PaddleOCR-to-Annotator achieved scores of detection: 1.000, similarity: 0.999, and agreement: 0.999. Gemini-to-Annotator achieved scores of detection: 1.000, similarity: 1.000, and agreement: 1.000. Scores range from 0 to 1. While PaddleOCR and Gemini demonstrated high agreement, PaddleOCR offered slightly better performance upon reviewing quantitative and qualitative results.https://www.frontiersin.org/articles/10.3389/frai.2024.1428716/fulloptical character recognitionPaddleOCRGeminitext extractionLenstarIOL master 500
spellingShingle Anish Salvi
Leo Arnal
Kevin Ly
Gabriel Ferreira
Sophia Y. Wang
Sophia Y. Wang
Curtis Langlotz
Curtis Langlotz
Vinit Mahajan
Vinit Mahajan
Chase A. Ludwig
Chase A. Ludwig
Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements
Frontiers in Artificial Intelligence
optical character recognition
PaddleOCR
Gemini
text extraction
Lenstar
IOL master 500
title Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements
title_full Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements
title_fullStr Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements
title_full_unstemmed Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements
title_short Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements
title_sort ocular biometry ocr a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements
topic optical character recognition
PaddleOCR
Gemini
text extraction
Lenstar
IOL master 500
url https://www.frontiersin.org/articles/10.3389/frai.2024.1428716/full
work_keys_str_mv AT anishsalvi ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements
AT leoarnal ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements
AT kevinly ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements
AT gabrielferreira ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements
AT sophiaywang ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements
AT sophiaywang ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements
AT curtislanglotz ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements
AT curtislanglotz ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements
AT vinitmahajan ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements
AT vinitmahajan ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements
AT chasealudwig ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements
AT chasealudwig ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements