Q. The pages of the 1911 Census have already been digitally scanned, so what are the next steps that need to happen, to make the files into an online searchable database?
A. The images we received from The National Archives (TNA), owners of the original 1911 Census collection for England and Wales, were 'raw' files. This meant that they needed to be processed to go online so that the files were small enough for users to download.
We have a two-step process unique to our site that means for the first time users will be able to view a quick loading file and have the option of viewing a high-resolution version, with more quality and twice the resolution than has previously been available online.
At the same time, our transcription department tackles the task of keying in each individual's details, which are then fed into our large transcript database. The next step is to add the transcript records so that they become searchable using our unique tools:
- the keyword master search
- the family forename search
and the house and street search.
We then begin the alpha-testing stage in-house. This is followed by beta-testing with some of our users.
Q. How many bytes will the entire 1911 Census take up?
A. Including multiple versions (the quick-loading file, the high-resolution version and back-ups), arouns 200 terabytes*.
(*A terabyte is 1,048,576 megabytes - so the 1911 Census will take up 209,715,200 megabytes!)
Q. Was it difficult to decide on the level of detail and the number of search fields to include in your search window?
A. Not really, we pride ourselves on providing full transcripts to make it easier and quicker for our users to find their ancestors, so there was very little that we decided not to transcribe. |
Q. Will the transcriptions be double-checked to help make for more accurate search results?
A. All transcriptions go through two levels of quality assurance before going online.
Q. Is this the biggest single project that you have ever taken on?
A. Pretty much, and it's definitely bigger than any single year of the census we have done before.
Q. What has been the most challenging aspect?
A. The sheer volume of images, as there are now two images per household, totalling more than 16 million images. With earlier census returns, however, a single page image could list several households, maybe even half a street.
Q. How many people are working on the project?
A. Hundreds.
Q. When will the redacted information in the disability column become publicly accessible?
A. The Information Commissioner's Office rules that the disability column had to be hidden for the full 100 years. TNA will be sending us unredacted images early next year and we shall be processing them and uploading them to the website as soon as we get them.
Q. What's your favourite aspect to the project?
A. Finding my ancestors when doing the beta-testing. It's always satisfying seeing my ancestors there in high-resolution colour and reading their handwriting for the first time.
Q. What is the order of release for the counties?
A. The order has not yet been finalised, but the next area to be released is going to be London, and Buckinghamshire is already available on the website. |