Improving RNN with Attention and Embedding for Adverse Drug Reactions

Chandra Pandey; Zina Ibrahim; Honghan Wu; Ehtesham Iqbal; Richard Dobson

doi:10.1145/3079452.3079501

Improving RNN with Attention and Embedding for Adverse Drug Reactions

Chandra Pandey, Zina Ibrahim, Honghan Wu, Ehtesham Iqbal, Richard Dobson

King's College London

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

25 Citations (Scopus)

1 Downloads (Pure)

Abstract

Electronic Health Records (EHR) narratives are a rich source of information, embedding high-resolution information of value to secondary research use. However, because the EHRs are mostly in natural language free-text and highly ambiguity-ridden, many natural language processing algorithms have been devised around them to extract meaningful structured information about clinical entities. The performance of the algorithms however, largely varies depending on the training dataset as well as the e effectiveness of the use of background knowledge to steer the learning process.

In this paper we study the impact of initializing the training of a neural network natural language processing algorithm with pre-de ned clinical word embeddings to improve feature extraction and relationship classification between entities. We add our embedding framework to a bi-directional long short-term memory (Bi-LSTM) neural network, and further study the effect of using attention weights in neural networks for sequence labelling tasks to extract knowledge of Adverse Drug Reactions (ADRs). We incorporate unsupervised word embeddings using Word2Vec and GloVe from widely available medical resources such as Multiparameter Intelligent Monitoring in Intensive Care (MIMIC) II corpora, Uni- ed Medical Language System (UMLS) as well as embed pharmaco lexicon from available EHRs. Our algorithm, implemented using two datasets, shows that our architecture outperforms baseline Bi-LSTM or Bi-LSTM networks using linear chain and Skip-Chain conditional random fields (CRF).

Original language	English
Title of host publication	Proceedings of the ACM International Conference on Digital Health
Publisher	Association for Computing Machinery
Pages	67-71
Number of pages	5
Volume	Part F128634
ISBN (Electronic)	9781450352499
DOIs	https://doi.org/10.1145/3079452.3079501
Publication status	Published - 2 Jul 2017
Event	7th International Conference on Digital Health, DH 2017 - London, United Kingdom Duration: 2 Jul 2017 → 5 Jul 2017

Conference

Conference	7th International Conference on Digital Health, DH 2017
Country/Territory	United Kingdom
City	London
Period	2/07/2017 → 5/07/2017

Keywords

Adverse drug reactions
Named entity recognition
Recurrent neural networks

Access to Document

10.1145/3079452.3079501

Cite this

@inbook{62a94e5b247847889421f7f92bf3030f,

title = "Improving RNN with Attention and Embedding for Adverse Drug Reactions",

abstract = "Electronic Health Records (EHR) narratives are a rich source of information, embedding high-resolution information of value to secondary research use. However, because the EHRs are mostly in natural language free-text and highly ambiguity-ridden, many natural language processing algorithms have been devised around them to extract meaningful structured information about clinical entities. The performance of the algorithms however, largely varies depending on the training dataset as well as the e effectiveness of the use of background knowledge to steer the learning process.In this paper we study the impact of initializing the training of a neural network natural language processing algorithm with pre-de ned clinical word embeddings to improve feature extraction and relationship classification between entities. We add our embedding framework to a bi-directional long short-term memory (Bi-LSTM) neural network, and further study the effect of using attention weights in neural networks for sequence labelling tasks to extract knowledge of Adverse Drug Reactions (ADRs). We incorporate unsupervised word embeddings using Word2Vec and GloVe from widely available medical resources such as Multiparameter Intelligent Monitoring in Intensive Care (MIMIC) II corpora, Uni- ed Medical Language System (UMLS) as well as embed pharmaco lexicon from available EHRs. Our algorithm, implemented using two datasets, shows that our architecture outperforms baseline Bi-LSTM or Bi-LSTM networks using linear chain and Skip-Chain conditional random fields (CRF).",

keywords = "Adverse drug reactions, Named entity recognition, Recurrent neural networks",

author = "Chandra Pandey and Zina Ibrahim and Honghan Wu and Ehtesham Iqbal and Richard Dobson",

year = "2017",

month = jul,

day = "2",

doi = "10.1145/3079452.3079501",

language = "English",

volume = "Part F128634",

pages = "67--71",

booktitle = "Proceedings of the ACM International Conference on Digital Health",

publisher = "Association for Computing Machinery",

note = "7th International Conference on Digital Health, DH 2017 ; Conference date: 02-07-2017 Through 05-07-2017",

}

Pandey, C, Ibrahim, Z , Wu, H , Iqbal, E & Dobson, R 2017, Improving RNN with Attention and Embedding for Adverse Drug Reactions. in Proceedings of the ACM International Conference on Digital Health. vol. Part F128634, Association for Computing Machinery, pp. 67-71, 7th International Conference on Digital Health, DH 2017, London, United Kingdom, 2/07/2017. https://doi.org/10.1145/3079452.3079501

TY - CHAP

T1 - Improving RNN with Attention and Embedding for Adverse Drug Reactions

AU - Pandey, Chandra

AU - Ibrahim, Zina

AU - Wu, Honghan

AU - Iqbal, Ehtesham

AU - Dobson, Richard

PY - 2017/7/2

Y1 - 2017/7/2

N2 - Electronic Health Records (EHR) narratives are a rich source of information, embedding high-resolution information of value to secondary research use. However, because the EHRs are mostly in natural language free-text and highly ambiguity-ridden, many natural language processing algorithms have been devised around them to extract meaningful structured information about clinical entities. The performance of the algorithms however, largely varies depending on the training dataset as well as the e effectiveness of the use of background knowledge to steer the learning process.In this paper we study the impact of initializing the training of a neural network natural language processing algorithm with pre-de ned clinical word embeddings to improve feature extraction and relationship classification between entities. We add our embedding framework to a bi-directional long short-term memory (Bi-LSTM) neural network, and further study the effect of using attention weights in neural networks for sequence labelling tasks to extract knowledge of Adverse Drug Reactions (ADRs). We incorporate unsupervised word embeddings using Word2Vec and GloVe from widely available medical resources such as Multiparameter Intelligent Monitoring in Intensive Care (MIMIC) II corpora, Uni- ed Medical Language System (UMLS) as well as embed pharmaco lexicon from available EHRs. Our algorithm, implemented using two datasets, shows that our architecture outperforms baseline Bi-LSTM or Bi-LSTM networks using linear chain and Skip-Chain conditional random fields (CRF).

AB - Electronic Health Records (EHR) narratives are a rich source of information, embedding high-resolution information of value to secondary research use. However, because the EHRs are mostly in natural language free-text and highly ambiguity-ridden, many natural language processing algorithms have been devised around them to extract meaningful structured information about clinical entities. The performance of the algorithms however, largely varies depending on the training dataset as well as the e effectiveness of the use of background knowledge to steer the learning process.In this paper we study the impact of initializing the training of a neural network natural language processing algorithm with pre-de ned clinical word embeddings to improve feature extraction and relationship classification between entities. We add our embedding framework to a bi-directional long short-term memory (Bi-LSTM) neural network, and further study the effect of using attention weights in neural networks for sequence labelling tasks to extract knowledge of Adverse Drug Reactions (ADRs). We incorporate unsupervised word embeddings using Word2Vec and GloVe from widely available medical resources such as Multiparameter Intelligent Monitoring in Intensive Care (MIMIC) II corpora, Uni- ed Medical Language System (UMLS) as well as embed pharmaco lexicon from available EHRs. Our algorithm, implemented using two datasets, shows that our architecture outperforms baseline Bi-LSTM or Bi-LSTM networks using linear chain and Skip-Chain conditional random fields (CRF).

KW - Adverse drug reactions

KW - Named entity recognition

KW - Recurrent neural networks

UR - http://www.scopus.com/inward/record.url?scp=85025443946&partnerID=8YFLogxK

U2 - 10.1145/3079452.3079501

DO - 10.1145/3079452.3079501

M3 - Conference paper

AN - SCOPUS:85025443946

VL - Part F128634

SP - 67

EP - 71

BT - Proceedings of the ACM International Conference on Digital Health

PB - Association for Computing Machinery

T2 - 7th International Conference on Digital Health, DH 2017

Y2 - 2 July 2017 through 5 July 2017

ER -

Improving RNN with Attention and Embedding for Adverse Drug Reactions

Abstract

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this