Ecotoxicity prediction of chemical compounds using machine learning and different molecular structure representations

Advancements in computational tools have facilitated interdisciplinary approaches in toxicology, enabling chemists to explore the toxicity and ecotoxicity of chemical compounds while minimizing ethically questionable or hazardous methods. This paper presents the development of models for predicting...

Full description

Saved in:
Bibliographic Details
Main Authors: Michał Marek, Rafał Kurczab
Format: Article
Language:English
Published: Elsevier 2025-06-01
Series:Green Analytical Chemistry
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2772577425000692
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Advancements in computational tools have facilitated interdisciplinary approaches in toxicology, enabling chemists to explore the toxicity and ecotoxicity of chemical compounds while minimizing ethically questionable or hazardous methods. This paper presents the development of models for predicting chemical ecotoxicity (HC50) based on machine learning algorithms and different molecular representations. A comprehensive set of descriptors was employed, including 100 molecular descriptors calculated using RDKit, 15 molecular connectivity (Chi) indices combined with shape (Kappa) indices, as well as MACCS and ECFP4 binary molecular fingerprints. The best model achieved an average RMSE of 0.740, an R² of 0.708, and an MAE of 0.546 through ten-fold cross-validation. The analysis of critical molecular descriptors identified logP, molar mass, heavy atom molar mass, Ipc, and the number of valence electrons as significant contributors to prediction of chemical ecotoxicity. This model not only facilitates ecotoxicity prediction but also provides valuable insights into the physicochemical properties influencing a molecule's ecotoxic profile, highlighting the potential of in silico approaches for ethical and efficient toxicology research.
ISSN:2772-5774