Millions of sources for family history are made available online every day. The providers of these sources often gather them into collections or databases, to help people like us access them. Unfortunately, the ease of accessing databases can fool us into thinking we can use them quickly and not spend time analysing them.
Anyone can place a database of family history sources on the internet, without peer review or quality control. Some repositories have considerable funds to manage their databases, while others are managed by individuals. This means that the quality of databases varies considerably, as does the method by which they make them available.
Context and provenance
The first step with any source is to examine the context and provenance. For an online database this means we ask questions such as: Where is the database hosted? Who created the website? Who created the database? Why did they create it, when and how? Who is the audience? Does the website clearly identify which sources are contained in the database and where they came from?
To find out more about an online database, read any available bios about the creator(s), descriptions of the database, introductions and explanatory notes, and lists of abbreviations. If this information is not available, you will need to be more cautious when using the database as it will be more difficult to assess the reliability of the information.
Databases contain sources, but each database is unique in content and format. They also change over time, so this analysis may need to be conducted more than once.
After identifying the types of sources in the database (for example, birth records, baptism records), there are two more critical steps. First you need to establish whether they are complete sources, image extracts of the sources, or information that has been transcribed from the sources.
Next you need to determine the sources on which the database is based – I call these the ‘originating sources’. Sources in online databases may be based on original sources (the first version of a source), derivative sources (sources derived from other sources), or a mixture of both.
Next, examine the coverage of the database. Which places are included, which time periods and are there some categories excluded?
Do not just rely on the title of the database for this information. For example, a collection called ‘Australian Birth Records Index 1788-1922’ is misleading. Civil registration did not commence until 1856 in New South Wales (and on other dates in the other states), so there cannot possibly be any 1788 birth records in the database. In addition, just because a database title or description says that it includes a particular date range, it does not mean that it actually includes all the sources for that date range. A good quality online repository will provide specific information about the coverage if you make the effort to read the description or explanatory notes.
Another thing to check for is whether the repository is updating and adding to the collection. If that is the case, it is worth checking back again later to search the database again.
If the coverage of a database does not meet your needs, always check other repositories for similar collections as they may have a wider coverage.
Many databases have been indexed and some databases are comprised solely of an index without copies of the actual sources. This is what I was alluding to above when I suggested you check if the database contains information that has been transcribed from the sources. Some genealogists suggest that an index is just a finding aid, not a source and that it should never be cited. I disagree. If you extract information from it then it is a source and you should cite it. How else are you or another researcher going to track down the originating source if you do not do so? However, I do agree that an index is not the best source and you should always endeavour to track down the originating source and use that.
Information and evidence
Now you can examine the information within the source and the evidence it provides. The approach to doing so is the same as for any other source. Key questions: What information is provided, how is it expressed and how is it formatted? Are there gaps in the information? Are there any explanatory notes, what do the headings mean, do you understand all the terminology and abbreviations? Who were the informants, is the information primary or secondary, what is the quality? Does the information provide direct, indirect or negative evidence?
The key thing to remember for online databases is that this information may not necessarily be visible when you are viewing the source or entry – you may have to go searching elsewhere for it, either within that database, elsewhere on the website or even in another place.
Using the source
There are different ways to extract information from an online database. Usually this involves taking a copy, transcribing the information or attaching the source to your online family tree. There are a few things to watch out for here.
If you are making a copy or transcribing the information, you should also create a source citation and make a copy of the description of the database as these will be helpful for your analysis. If available, it is a good idea to make copies of the cover pages and any explanatory pages.
Check for any copyright or usage conditions that may apply, which will largely be dependent on whether you are planning to publish or just using the information for your own research. If you are transcribing the information, it is a good idea to make an image copy as well, in case you make errors in your transcription.
Do not just examine the entry that came up in your search. Always look at the surrounding entries in the database and you may wish to make copies of those as well. You might find related entries or entries with additional information that will assist you in the interpretation of your entry.
When you find an entry within a database, some sites provide hints to other entries or other databases for a person with the same or similar name. These can be useful research leads but take care to analyse the other entry carefully to establish for yourself whether they are actually the same person.
Not all online databases can be searched. With some you need to browse the entries. Browsing can also be useful for searchable databases, because the effectiveness of searching is limited by the search engine and the search criteria that you enter.
Citing online databases
You must cite the source that you used, which means the database. Do not just cite the website where you found the database, as that will not help you or others find it again; nor will it help anyone evaluate the reliability of the information extracted from the database. However, the website does need to be mentioned in the citation.
As our purpose of citing a source includes tracking down the originating sources, information about the originating source(s) needs to be included as well.
Website, name of database, website address, date accessed, description of entry; details to help you find it again; originating source
FamilySearch, “England Bishop’s transcripts,” database (www.FamilySearch.org: accessed 21 Jun 2016), entry for Frances, daughter of Peter & Frances Hawkins; St Nicholas Church, Brighton, Sussex, FHL film no. 1,468,821, page 186, no. 1481; citing West Sussex Record Office, Chichester, no.: EP II/16/27A-M.