Developing Web Application for Virtual Screening and Prediction of Breast Cancer Drugs: Target Interaction and Drug Approval Using Machine Learning

Conference proceedings article


Authors/Editors


Strategic Research Themes


Publication Details

Author listChumsang, Jantharat; Phangsee, Thanaphon; Wiriyaprapanont, Nattaporn; Arromrit, Trirat; Prom-On, Santitham;
Mahikul, Wiriya

PublisherInstitute of Electrical and Electronics Engineers Inc.

Publication year2023

Start page196

End page201

Number of pages6

ISBN9798350333862

URLhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85166381870&doi=10.1109%2fICEIB57887.2023.10170500&partnerID=40&md5=7975f512f4f1af8c17716071931d66bb

LanguagesEnglish-Great Britain (EN-GB)


View on publisher site


Abstract

Breast cancer is the most prevalent cancer globally with a high patient mortality rate. As a result, researchers are trying to find new effective drugs to treat the disease. However, drug discovery is costly and time-consuming, making it difficult to succeed. Virtual screening is a tool for drug discovery as it can quickly assess the potential of many drug candidates, reducing time and cost. The study aims to develop a web application for the virtual screening of breast cancer drugs using a machine-learning approach. The data for this study was collected from the ChEMBL database and contained 8 human breast cancer target proteins, including mTOR, Trop-2, PI3K, CDK4/6, HER2, Aromatase, Akt, and Estrogen Receptor. The data is preprocessed with Feature Engineering using Lipinski's rule with the SMILE structure of a compound. The data is also applied in developing machine learning algorithms with two main objectives: predicting Drug-Target interaction and Drug Approvals with a classification model. Results of the study show that the SMILE structure and Lipinski's rule with machine learning are used for the virtual screening of many compounds from new drug discovery. The Gradient boosting Classifier is found to be the most suitable algorithm for predicting drug approval with the geometric mean, F1-score, and AUC of 0.95, 0.96, and 0.98, while the Random Forest Classifier is the best for drug target interactive with accuracy, F-1 score, and AUC of 0.75, 0.74, and 0.80, respectively. Moreover, we deploy machine learning with a web application and reduce time and cost in finding new drugs, increasing the chances of success in finding new treatments to reduce the mortality rate of breast cancer patients. © 2023 IEEE.


Keywords

Drug discovery


Last updated on 2024-14-03 at 23:05