Pharmaceutical cocrystals are crystalline materials composed of at least two molecules, i.e., an active pharmaceutical ingredient (API) and a coformer, assembled by noncovalent forces. Cocrystallization is successfully applied to improve the physicochemical properties of APIs, such as solubility, dissolution profile, pharmacokinetics, and stability. However, choosing the ideal coformer is a challenging task in terms of time, efforts, and laboratory resources. Several computational tools and machine learning (ML) models have been proposed to mitigate this problem. However, the challenge of achieving a robust and generalizable predictive method is still open. In this study, we propose a new approach to quickly predict the formation of cocrystals, employing partial least squares-discriminant analysis, random forest, and neural networks. The models were based on the data sets of 13 structurally different APIs with both positive and negative cocrystallization outcomes. At the same time, the features were specially selected from a variety of molecular descriptors to explain the phenomenon of the cocrystallization. All of the proposed ML models showed a cross-validation accuracy higher than 83%. Furthermore, this approach was successfully applied to drive the cocrystallization experimental tests of 2-phenylpropionic acid, showcasing the high potential of the ML models in practice.

Speeding Up the Cocrystallization Process: Machine Learning-Combined Methods for the Prediction of Multicomponent Systems

Birolo R.;Bravetti F.;Alladio E.;Priola E.;Gobetto R.
;
Chierotti M. R.
2023-01-01

Abstract

Pharmaceutical cocrystals are crystalline materials composed of at least two molecules, i.e., an active pharmaceutical ingredient (API) and a coformer, assembled by noncovalent forces. Cocrystallization is successfully applied to improve the physicochemical properties of APIs, such as solubility, dissolution profile, pharmacokinetics, and stability. However, choosing the ideal coformer is a challenging task in terms of time, efforts, and laboratory resources. Several computational tools and machine learning (ML) models have been proposed to mitigate this problem. However, the challenge of achieving a robust and generalizable predictive method is still open. In this study, we propose a new approach to quickly predict the formation of cocrystals, employing partial least squares-discriminant analysis, random forest, and neural networks. The models were based on the data sets of 13 structurally different APIs with both positive and negative cocrystallization outcomes. At the same time, the features were specially selected from a variety of molecular descriptors to explain the phenomenon of the cocrystallization. All of the proposed ML models showed a cross-validation accuracy higher than 83%. Furthermore, this approach was successfully applied to drive the cocrystallization experimental tests of 2-phenylpropionic acid, showcasing the high potential of the ML models in practice.
2023
23
11
7898
7911
https://pubs.acs.org/doi/full/10.1021/acs.cgd.3c00696
cocrystal, crystallization, artificial intelligence, cocrystal formation prediction
Birolo R.; Bravetti F.; Alladio E.; Priola E.; Bianchini G.; Novelli R.; Aramini A.; Gobetto R.; Chierotti M.R.
File in questo prodotto:
File Dimensione Formato  
cryst growth des2023, 23,7898_completo.pdf

Accesso aperto

Tipo di file: PDF EDITORIALE
Dimensione 7.33 MB
Formato Adobe PDF
7.33 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1947516
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 2
social impact