Abstract
This work-in-progress article discusses DILIPAD (Digging into Linked Parliamentary Data), a project funded under the Digging Into Data Challenge. DILIPAD aims to create an extensive corpus of structured XML data of parliamentary proceedings from three countries (the United Kingdom, the Netherlands and Canada) in order to enable large-scale diachronic analyses of their content. The corpora integrate the textual data of the proceedings with contextual metadata encoded in the XML schema Parliamentary Metadata Language (PML). The article discusses the background to the project and the construction of the corpora, and highlights the ways in which they may be used for quantitative and qualitative analysis.
Original language | English |
---|---|
Title of host publication | Proceedings - 2014 IEEE International Conference on Big Data, IEEE Big Data 2014 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 47-50 |
Number of pages | 4 |
ISBN (Print) | 9781479956654 |
Publication status | Published - 7 Jan 2015 |
Event | 2nd IEEE International Conference on Big Data, IEEE Big Data 2014 - Washington, United States. Duration: 27 Oct 2014 → 30 Oct 2014 |
Conference
Conference | 2nd IEEE International Conference on Big Data, IEEE Big Data 2014 |
---|---|
Country/Territory | United States |
City | Washington |
Period | 27/10/2014 → 30/10/2014 |
Keywords
- corpus analysis
- metadata
- parliamentary history
- XML