Identifying Persons of Interest in Digital Forensics Using NLP-Based AI

The field of digital forensics relies on expertise from multiple domains, including computer science, criminology, and law. It also relies on different toolsets and an analyst’s expertise to parse enormous amounts of user-generated data to find clues that help crack a case. This process of investiga...

Full description

Saved in:
Bibliographic Details
Main Authors: Jonathan Adkins, Ali Al Bataineh, Majd Khalaf
Format: Article
Language:English
Published: MDPI AG 2024-11-01
Series:Future Internet
Subjects:
Online Access:https://www.mdpi.com/1999-5903/16/11/426
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846153457039310848
author Jonathan Adkins
Ali Al Bataineh
Majd Khalaf
author_facet Jonathan Adkins
Ali Al Bataineh
Majd Khalaf
author_sort Jonathan Adkins
collection DOAJ
description The field of digital forensics relies on expertise from multiple domains, including computer science, criminology, and law. It also relies on different toolsets and an analyst’s expertise to parse enormous amounts of user-generated data to find clues that help crack a case. This process of investigative analysis is often done manually. Artificial Intelligence (AI) can provide practical solutions to efficiently mine enormous amounts of data to find useful patterns that can be leveraged to investigate crimes. Natural Language Processing (NLP) is a subdomain of research under AI that deals with problems involving unstructured data, specifically language. The domain of NLP includes several tools to parse text, including topic modeling, pairwise correlation, word vector cosine distance measurement, and sentiment analysis. In this research, we propose a digital forensic investigative technique that uses an ensemble of NLP tools to identify a person of interest list based on a corpus of text. Our proposed method serves as a type of human feature reduction, where a total pool of suspects is filtered down to a short list of candidates who possess a higher correlation with the crime being investigated.
format Article
id doaj-art-a98569cd8b1e437a981abfbd54469639
institution Kabale University
issn 1999-5903
language English
publishDate 2024-11-01
publisher MDPI AG
record_format Article
series Future Internet
spelling doaj-art-a98569cd8b1e437a981abfbd544696392024-11-26T18:05:17ZengMDPI AGFuture Internet1999-59032024-11-01161142610.3390/fi16110426Identifying Persons of Interest in Digital Forensics Using NLP-Based AIJonathan Adkins0Ali Al Bataineh1Majd Khalaf2Senator Patrick Leahy School of Cybersecurity and Advanced Computing, Norwich University, Northfield, VT 05663, USAArtificial Intelligence Center, Norwich University, Northfield, VT 05663, USAArtificial Intelligence Center, Norwich University, Northfield, VT 05663, USAThe field of digital forensics relies on expertise from multiple domains, including computer science, criminology, and law. It also relies on different toolsets and an analyst’s expertise to parse enormous amounts of user-generated data to find clues that help crack a case. This process of investigative analysis is often done manually. Artificial Intelligence (AI) can provide practical solutions to efficiently mine enormous amounts of data to find useful patterns that can be leveraged to investigate crimes. Natural Language Processing (NLP) is a subdomain of research under AI that deals with problems involving unstructured data, specifically language. The domain of NLP includes several tools to parse text, including topic modeling, pairwise correlation, word vector cosine distance measurement, and sentiment analysis. In this research, we propose a digital forensic investigative technique that uses an ensemble of NLP tools to identify a person of interest list based on a corpus of text. Our proposed method serves as a type of human feature reduction, where a total pool of suspects is filtered down to a short list of candidates who possess a higher correlation with the crime being investigated.https://www.mdpi.com/1999-5903/16/11/426Artificial Intelligence (AI)criminologydigital forensicsNatural Language Processing (NLP)sentiment analysistopic modeling
spellingShingle Jonathan Adkins
Ali Al Bataineh
Majd Khalaf
Identifying Persons of Interest in Digital Forensics Using NLP-Based AI
Future Internet
Artificial Intelligence (AI)
criminology
digital forensics
Natural Language Processing (NLP)
sentiment analysis
topic modeling
title Identifying Persons of Interest in Digital Forensics Using NLP-Based AI
title_full Identifying Persons of Interest in Digital Forensics Using NLP-Based AI
title_fullStr Identifying Persons of Interest in Digital Forensics Using NLP-Based AI
title_full_unstemmed Identifying Persons of Interest in Digital Forensics Using NLP-Based AI
title_short Identifying Persons of Interest in Digital Forensics Using NLP-Based AI
title_sort identifying persons of interest in digital forensics using nlp based ai
topic Artificial Intelligence (AI)
criminology
digital forensics
Natural Language Processing (NLP)
sentiment analysis
topic modeling
url https://www.mdpi.com/1999-5903/16/11/426
work_keys_str_mv AT jonathanadkins identifyingpersonsofinterestindigitalforensicsusingnlpbasedai
AT alialbataineh identifyingpersonsofinterestindigitalforensicsusingnlpbasedai
AT majdkhalaf identifyingpersonsofinterestindigitalforensicsusingnlpbasedai