Early market engagement |
We have run a discovery phase leading to plans for an alpha using AWS Neptune and Elasticsearch. You can read more about the project: https://www.nationalarchives.gov.uk/about/our-role/plans-policies-performance-and-projects/our-plans/our-digital-cataloguing-practices/project-omega/ The discovery produced a proposal for a new Catalogue Data Model using RDF, a new identifier scheme, and transformation routines to convert the existing data to the new model. We have held workshops to identify the key ways that staff managing the catalogue work with the data and what they would like in future. The archivist needs to search, analyse, add to, correct, edit, enrich, and enhance record descriptions so that the catalogue is properly maintained. The archivist needs to work with catalogue entries individually or as large sets, making (or reversing) bulk changes, so they can work efficiently. The archivist needs to understand the version history of the catalogue so they can be confident about where the information originated. We have investigated all the current databases that hold catalogue data and how they inter-relate. We have investigated a wide range of existing data standards and ontologies. We have documented all the findings in a detailed published report. |
Who the specialist will work with |
The specialist will work with an RDF Developer (another specialist being recruited at the same time). The core in-house team consists of a data analyst, two senior archivists and the Head of Cataloguing, Taxonomy and Data. Both specialists will also work with a wider group of users: archivists across the organisation who are responsible for managing the catalogue. |
What the specialist will work on |
We are developing a pan-archival catalogue, bringing together record descriptions from multiple catalogues into a single new system. We are looking for a technical architect and developer to lead the development work on an alpha catalogue management system. This work will involve developing API functions to search, select, add, export, edit, import and delete catalogue data; developing search for use by expert users (using SPARQL in combination with Elasticsearch); and developing an Extract, Transform, Load (ETL) process to migrate The National Archives catalogue data from multiple relational (SQL Server) and RDF databases to a cloud-based native RDF database (AWS Neptune). |