TY - JOUR
T1 - Analysis of the effect of sentiment analysis on extracting adverse drug reactions from tweets and forum posts
AU - Korkontzelos, Ioannis
AU - Nikfarjam, Azadeh
AU - Shardlow, Matthew
AU - Sarker, Abeed
AU - Ananiadou, Sophia
AU - Gonzalez, Graciela H.
N1 - Funding Information:
This work was funded by the UK Medical Research Council (project reference: MR/L01078X/1 – Supporting Evidence-based Public Health Interventions using Text Mining) and by the European Community’s Horizon 2020 Program (project reference: 654021 – OpenMinted).
Publisher Copyright:
© 2016 The Authors
Copyright:
Copyright 2018 Elsevier B.V., All rights reserved.
PY - 2016/8/1
Y1 - 2016/8/1
N2 - Objective
The abundance of text available in social media and health related forums along with the rich expression of public opinion have recently attracted the interest of the public health community to use these sources for pharmacovigilance. Based on the intuition that patients post about Adverse Drug Reactions (ADRs) expressing negative sentiments, we investigate the effect of sentiment analysis features in locating ADR mentions.
Methods
We enrich the feature space of a state-of-the-art ADR identification method with sentiment analysis features. Using a corpus of posts from the DailyStrength forum and tweets annotated for ADR and indication mentions, we evaluate the extent to which sentiment analysis features help in locating ADR mentions and distinguishing them from indication mentions.
Results
Evaluation results show that sentiment analysis features marginally improve ADR identification in tweets and health related forum posts. Adding sentiment analysis features achieved a statistically significant F-measure increase from 72.14% to 73.22% in the Twitter part of an existing corpus using its original train/test split. Using stratified 10 × 10-fold cross-validation, statistically significant F-measure increases were shown in the DailyStrength part of the corpus, from 79.57% to 80.14%, and in the Twitter part of the corpus, from 66.91% to 69.16%. Moreover, sentiment analysis features are shown to reduce the number of ADRs being recognized as indications.
Conclusion
This study shows that adding sentiment analysis features can marginally improve the performance of even a state-of-the-art ADR identification method. This improvement can be of use to pharmacovigilance practice, due to the rapidly increasing popularity of social media and health forums.
AB - Objective
The abundance of text available in social media and health related forums along with the rich expression of public opinion have recently attracted the interest of the public health community to use these sources for pharmacovigilance. Based on the intuition that patients post about Adverse Drug Reactions (ADRs) expressing negative sentiments, we investigate the effect of sentiment analysis features in locating ADR mentions.
Methods
We enrich the feature space of a state-of-the-art ADR identification method with sentiment analysis features. Using a corpus of posts from the DailyStrength forum and tweets annotated for ADR and indication mentions, we evaluate the extent to which sentiment analysis features help in locating ADR mentions and distinguishing them from indication mentions.
Results
Evaluation results show that sentiment analysis features marginally improve ADR identification in tweets and health related forum posts. Adding sentiment analysis features achieved a statistically significant F-measure increase from 72.14% to 73.22% in the Twitter part of an existing corpus using its original train/test split. Using stratified 10 × 10-fold cross-validation, statistically significant F-measure increases were shown in the DailyStrength part of the corpus, from 79.57% to 80.14%, and in the Twitter part of the corpus, from 66.91% to 69.16%. Moreover, sentiment analysis features are shown to reduce the number of ADRs being recognized as indications.
Conclusion
This study shows that adding sentiment analysis features can marginally improve the performance of even a state-of-the-art ADR identification method. This improvement can be of use to pharmacovigilance practice, due to the rapidly increasing popularity of social media and health forums.
KW - Adverse drug reactionsSocial mediaSentiment analysisText mining
KW - Text mining
KW - Sentiment analysis
KW - Social media
KW - Adverse drug reactions
KW - Public Health
KW - Social Media
KW - Drug-Related Side Effects and Adverse Reactions
KW - Humans
KW - Internet
KW - Pharmacovigilance
UR - http://www.journals.elsevier.com/journal-of-biomedical-informatics/
UR - http://www.scopus.com/inward/record.url?scp=84978034203&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84978034203&partnerID=8YFLogxK
UR - https://www.mendeley.com/catalogue/c21c125a-16c8-3650-bb94-e45f412613ce/
U2 - 10.1016/j.jbi.2016.06.007
DO - 10.1016/j.jbi.2016.06.007
M3 - Article (journal)
C2 - 27363901
SN - 1532-0464
VL - 62
SP - 148
EP - 158
JO - Journal of Biomedical Informatics
JF - Journal of Biomedical Informatics
ER -