Automatic Extraction of Adverse Drug Reactions from Summary of Product Characteristics
Publication date
2021-03-02
Editors
Advisors
Supervisors
Document Type
Article
Metadata
Show full item recordCollections
License
cc_by
Abstract
The summary of product characteristics from the European Medicines Agency is a reference document on medicines in the EU. It contains textual information for clinical experts on how to safely use medicines, including adverse drug reactions. Using natural language processing (NLP) techniques to automatically extract adverse drug reactions from such unstructured textual information helps clinical experts to effectively and efficiently use them in daily practices. Such techniques have been developed for Structured Product Labels from the Food and Drug Administration (FDA), but there is no research focusing on extracting from the Summary of Product Characteristics. In this work, we built a natural language processing pipeline that automatically scrapes the summary of product characteristics online and then extracts adverse drug reactions from them. Besides, we have made the method and its output publicly available so that it can be reused and further evaluated in clinical practices. In total, we extracted 32,797 common adverse drug reactions for 647 common medicines scraped from the Electronic Medicines Compendium. A manual review of 37 commonly used medicines has indicated a good performance, with a recall and precision of 0.99 and 0.934, respectively.
Keywords
Adverse drug reactions, Information extraction, Natural language processing, Summary of product characteristics, General Materials Science, Instrumentation, General Engineering, Process Chemistry and Technology, Computer Science Applications, Fluid Flow and Transfer Processes
Citation
Shen, Z & Spruit, M 2021, 'Automatic Extraction of Adverse Drug Reactions from Summary of Product Characteristics', Applied Sciences, vol. 11, no. 6, 2663, pp. 1-11. https://doi.org/10.3390/app11062663