Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements

Given close relationships between ocular structure and ophthalmic disease, ocular biometry measurements (including axial length, lens thickness, anterior chamber depth, and keratometry values) may be leveraged as features in the prediction of eye diseases. However, ocular biometry measurements are o...

Full description

Saved in:

Bibliographic Details
Main Authors:	Anish Salvi, Leo Arnal, Kevin Ly, Gabriel Ferreira, Sophia Y. Wang, Curtis Langlotz, Vinit Mahajan, Chase A. Ludwig
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2025-01-01
Series:	Frontiers in Artificial Intelligence
Subjects:	optical character recognition PaddleOCR Gemini text extraction Lenstar IOL master 500
Online Access:	https://www.frontiersin.org/articles/10.3389/frai.2024.1428716/full
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841558655871221760
author	Anish Salvi Leo Arnal Kevin Ly Gabriel Ferreira Sophia Y. Wang Sophia Y. Wang Curtis Langlotz Curtis Langlotz Vinit Mahajan Vinit Mahajan Chase A. Ludwig Chase A. Ludwig
author_facet	Anish Salvi Leo Arnal Kevin Ly Gabriel Ferreira Sophia Y. Wang Sophia Y. Wang Curtis Langlotz Curtis Langlotz Vinit Mahajan Vinit Mahajan Chase A. Ludwig Chase A. Ludwig
author_sort	Anish Salvi
collection	DOAJ
description	Given close relationships between ocular structure and ophthalmic disease, ocular biometry measurements (including axial length, lens thickness, anterior chamber depth, and keratometry values) may be leveraged as features in the prediction of eye diseases. However, ocular biometry measurements are often stored as PDFs rather than as structured data in electronic health records. Thus, time-consuming and laborious manual data entry is required for using biometry data as a disease predictor. Herein, we used two separate models, PaddleOCR and Gemini, to extract eye specific biometric measurements from 2,965 Lenstar, 104 IOL Master 500, and 3,616 IOL Master 700 optical biometry reports. For each patient eye, our text extraction pipeline, referred to as Ocular Biometry OCR, involves 1) cropping the report to the biometric data, 2) extracting the text via the optical character recognition model, 3) post-processing the metrics and values into key value pairs, 4) correcting erroneous angles within the pairs, 5) computing the number of errors or missing values, and 6) selecting the window specific results with fewest errors or missing values. To ensure the models’ predictions could be put into a machine learning-ready format, artifacts were removed from categorical text data through manual modification where necessary. Performance was evaluated by scoring PaddleOCR and Gemini results. In the absence of ground truth, higher scoring indicated greater inter-model reliability, assuming an equal value between models indicated an accurate result. The detection scores, measuring the number of valid values (i.e., not missing or erroneous), were Lenstar: 0.990, IOLM 500: 1.000, and IOLM 700: 0.998. The similarity scores, measuring the number of equal values, were Lenstar: 0.995, IOLM 500: 0.999, and IOLM 700: 0.999. The agreement scores, combining detection and similarity scores, were Lenstar: 0.985, IOLM 500: 0.999, and IOLM 700: 0.998. IOLM 500 was annotated for ground truths; in this case, higher scoring indicated greater model-to-annotator accuracy. PaddleOCR-to-Annotator achieved scores of detection: 1.000, similarity: 0.999, and agreement: 0.999. Gemini-to-Annotator achieved scores of detection: 1.000, similarity: 1.000, and agreement: 1.000. Scores range from 0 to 1. While PaddleOCR and Gemini demonstrated high agreement, PaddleOCR offered slightly better performance upon reviewing quantitative and qualitative results.
format	Article
id	doaj-art-c80ef81229bd44cabc7fac7c53169c93
institution	Kabale University
issn	2624-8212
language	English
publishDate	2025-01-01
publisher	Frontiers Media S.A.
record_format	Article
series	Frontiers in Artificial Intelligence
spelling	doaj-art-c80ef81229bd44cabc7fac7c53169c932025-01-06T06:59:39ZengFrontiers Media S.A.Frontiers in Artificial Intelligence2624-82122025-01-01710.3389/frai.2024.14287161428716Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurementsAnish Salvi0Leo Arnal1Kevin Ly2Gabriel Ferreira3Sophia Y. Wang4Sophia Y. Wang5Curtis Langlotz6Curtis Langlotz7Vinit Mahajan8Vinit Mahajan9Chase A. Ludwig10Chase A. Ludwig11School of Medicine, Stanford University, Palo Alto, CA, United StatesSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Medicine, Chicago Medical School at Rosalind Franklin University, North Chicago, IL, United StatesDepartment of Medicine, Varzea Grande University Center, Várzea Grande, BrazilSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, CA, United StatesSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Radiology, Stanford University, Palo Alto, CA, United StatesSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, CA, United StatesSchool of Medicine, Stanford University, Palo Alto, CA, United StatesDepartment of Ophthalmology, Byers Eye Institute, Stanford University, Palo Alto, CA, United StatesGiven close relationships between ocular structure and ophthalmic disease, ocular biometry measurements (including axial length, lens thickness, anterior chamber depth, and keratometry values) may be leveraged as features in the prediction of eye diseases. However, ocular biometry measurements are often stored as PDFs rather than as structured data in electronic health records. Thus, time-consuming and laborious manual data entry is required for using biometry data as a disease predictor. Herein, we used two separate models, PaddleOCR and Gemini, to extract eye specific biometric measurements from 2,965 Lenstar, 104 IOL Master 500, and 3,616 IOL Master 700 optical biometry reports. For each patient eye, our text extraction pipeline, referred to as Ocular Biometry OCR, involves 1) cropping the report to the biometric data, 2) extracting the text via the optical character recognition model, 3) post-processing the metrics and values into key value pairs, 4) correcting erroneous angles within the pairs, 5) computing the number of errors or missing values, and 6) selecting the window specific results with fewest errors or missing values. To ensure the models’ predictions could be put into a machine learning-ready format, artifacts were removed from categorical text data through manual modification where necessary. Performance was evaluated by scoring PaddleOCR and Gemini results. In the absence of ground truth, higher scoring indicated greater inter-model reliability, assuming an equal value between models indicated an accurate result. The detection scores, measuring the number of valid values (i.e., not missing or erroneous), were Lenstar: 0.990, IOLM 500: 1.000, and IOLM 700: 0.998. The similarity scores, measuring the number of equal values, were Lenstar: 0.995, IOLM 500: 0.999, and IOLM 700: 0.999. The agreement scores, combining detection and similarity scores, were Lenstar: 0.985, IOLM 500: 0.999, and IOLM 700: 0.998. IOLM 500 was annotated for ground truths; in this case, higher scoring indicated greater model-to-annotator accuracy. PaddleOCR-to-Annotator achieved scores of detection: 1.000, similarity: 0.999, and agreement: 0.999. Gemini-to-Annotator achieved scores of detection: 1.000, similarity: 1.000, and agreement: 1.000. Scores range from 0 to 1. While PaddleOCR and Gemini demonstrated high agreement, PaddleOCR offered slightly better performance upon reviewing quantitative and qualitative results.https://www.frontiersin.org/articles/10.3389/frai.2024.1428716/fulloptical character recognitionPaddleOCRGeminitext extractionLenstarIOL master 500
spellingShingle	Anish Salvi Leo Arnal Kevin Ly Gabriel Ferreira Sophia Y. Wang Sophia Y. Wang Curtis Langlotz Curtis Langlotz Vinit Mahajan Vinit Mahajan Chase A. Ludwig Chase A. Ludwig Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements Frontiers in Artificial Intelligence optical character recognition PaddleOCR Gemini text extraction Lenstar IOL master 500
title	Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements
title_full	Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements
title_fullStr	Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements
title_full_unstemmed	Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements
title_short	Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements
title_sort	ocular biometry ocr a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements
topic	optical character recognition PaddleOCR Gemini text extraction Lenstar IOL master 500
url	https://www.frontiersin.org/articles/10.3389/frai.2024.1428716/full
work_keys_str_mv	AT anishsalvi ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT leoarnal ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT kevinly ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT gabrielferreira ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT sophiaywang ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT sophiaywang ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT curtislanglotz ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT curtislanglotz ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT vinitmahajan ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT vinitmahajan ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT chasealudwig ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements AT chasealudwig ocularbiometryocramachinelearningalgorithmleveragingopticalcharacterrecognitiontoextractintraocularlensbiometrymeasurements

Ocular Biometry OCR: a machine learning algorithm leveraging optical character recognition to extract intra ocular lens biometry measurements

Similar Items