Skip to main content

Research Repository

Advanced Search

Data-driven solution to identify sentiments from online drug reviews.

Haque, Rezaul; Laskar, Saddam Hossain; Khushbu, Katura Gania; Hasan, Md Junayed; Uddin, Jia

Authors

Rezaul Haque

Saddam Hossain Laskar

Katura Gania Khushbu

Jia Uddin



Abstract

With the proliferation of the internet, social networking sites have become a primary source of user-generated content, including vast amounts of information about medications, diagnoses, treatments, and disorders. Comments on previously used medicines, contained within these data, can be leveraged to identify crucial adverse drug reactions, and machine learning (ML) approaches such as sentiment analysis (SA) can be employed to derive valuable insights. However, given the sheer volume of comments, it is often impractical for consumers to manually review all of them before determining a purchase decision. Therefore, drug assessments can serve as a valuable source of medical information for both healthcare professionals and the general public, aiding in decision making and improving public monitoring systems by revealing collective experiences. Nonetheless, the unstructured and linguistic nature of the comments poses a significant challenge for effective categorization, with previous studies having utilized machine and deep learning (DL) algorithms to address this challenge. Despite both approaches showing promising results, DL classifiers outperformed ML classifiers in previous studies. Therefore, the objective of our study was to improve upon earlier research by applying SA to medication reviews and training five ML algorithms on two distinct feature extractions and four DL classifiers on two different word-embedding approaches to obtain higher categorization scores. Our findings indicated that the random forest trained on the count vectorizer outperformed all other ML algorithms, achieving an accuracy and F1 score of 96.65% and 96.42%, respectively. Furthermore, the bidirectional LSTM (Bi-LSTM) model trained on GloVe embedding resulted in an even better accuracy and F1 score, reaching 97.40% and 97.42%, respectively. Hence, by utilizing appropriate natural language processing and ML algorithms, we were able to achieve superior results compared to earlier studies.

Citation

HAQUE, R., LASKAR, S.H., KHUSHBU, K.G., HASAN, M.J. and UDDIN, J. 2023. Data-driven solution to identify sentiments from online drug reviews. Computers [online], 12(4), article 87. Available from: https://doi.org/10.3390/computers12040087

Journal Article Type Article
Acceptance Date Apr 19, 2023
Online Publication Date Apr 21, 2023
Publication Date Apr 30, 2023
Deposit Date Apr 30, 2023
Publicly Available Date May 16, 2023
Journal Computers
Electronic ISSN 2073-431X
Publisher MDPI
Peer Reviewed Peer Reviewed
Volume 12
Issue 4
Article Number 87
DOI https://doi.org/10.3390/computers12040087
Keywords Deep learning; Word embedding; Bi-LSTM; GloVe; Drug sentiment analysis; Drug discovery
Public URL https://rgu-repository.worktribe.com/output/1947901

Files





You might also like



Downloadable Citations