TY - JOUR
T1 - Linking archival data to location - A case study at the UK National Archives
AU - Clough, P D
AU - Tang, Jaiyu
AU - Hall, Mark
AU - Warner, Amy
PY - 2011
Y1 - 2011
N2 - Purpose - The National Archives (TNA) is the UK government's official archive. It stores and maintains records spanning over a thousand years in both physical and digital form. Much of the information held by TNA includes references to place and frequently user queries to TNA’s online catalogue involve searches for location. The purpose of this paper is to illustrate how TNA have extracted the geographic references in their historic data to improve access to the archives.
Approach - To be able to quickly enhance the existing archival data with geographic information, existing technologies from Natural Language Processing (NLP) and Geographical Information Retrieval (GIR) have been utilised and adapted to historical archives.
Findings - Enhancing the archival records with geographic information has enabled TNA to quickly develop a number of case-studies highlighting how geographic information can improve access to large-scale archival collections. The use of existing methods from the GIR domain and technologies, such as OpenLayers, enabled us to quickly implement this process in a way that is easily transferrable to other institutions.
Practical implications - The methods and technologies described in this paper can be adapted by other archives to similarly enhance access to their historic data. Also the data-sharing methods described can be used to enable the integration of knowledge held at different archival institutions.
Value - Place is one of the core dimensions for TNA’s archival data. Many of the records that we hold make reference to place data (wills, legislation, court cases…) and approximately one fifth of users’ searches involve place names. However, there are still a number of open questions regarding the adaptation of existing GIR methods to the history domain. This paper presents an overview over available GIR methods and the challenges in applying them to historical data.
AB - Purpose - The National Archives (TNA) is the UK government's official archive. It stores and maintains records spanning over a thousand years in both physical and digital form. Much of the information held by TNA includes references to place and frequently user queries to TNA’s online catalogue involve searches for location. The purpose of this paper is to illustrate how TNA have extracted the geographic references in their historic data to improve access to the archives.
Approach - To be able to quickly enhance the existing archival data with geographic information, existing technologies from Natural Language Processing (NLP) and Geographical Information Retrieval (GIR) have been utilised and adapted to historical archives.
Findings - Enhancing the archival records with geographic information has enabled TNA to quickly develop a number of case-studies highlighting how geographic information can improve access to large-scale archival collections. The use of existing methods from the GIR domain and technologies, such as OpenLayers, enabled us to quickly implement this process in a way that is easily transferrable to other institutions.
Practical implications - The methods and technologies described in this paper can be adapted by other archives to similarly enhance access to their historic data. Also the data-sharing methods described can be used to enable the integration of knowledge held at different archival institutions.
Value - Place is one of the core dimensions for TNA’s archival data. Many of the records that we hold make reference to place data (wills, legislation, court cases…) and approximately one fifth of users’ searches involve place names. However, there are still a number of open questions regarding the adaptation of existing GIR methods to the history domain. This paper presents an overview over available GIR methods and the challenges in applying them to historical data.
U2 - 10.1108/00012531111135628
DO - 10.1108/00012531111135628
M3 - Article (journal)
SN - 2050-3806
VL - 63
SP - 127
EP - 147
JO - Aslib Journal of Information Management
JF - Aslib Journal of Information Management
IS - 2/3
ER -