Genealogical Workflow: Step 3 of 5 – Analyze
9 Mar 2010 by Dan LawyerOnce information has been collected during the gather phase, it must be analyzed. The reality is that the Gather and Analyze phases are often very intermingled. The researcher does micro-bounces back and forth between gathering and analyzing. They find a relevant record perform some analysis on it, decide whether there is anything that should be captured or if there should be any adjustments to their research plan, update their research log and move on. It is easy to tell when someone hits this phase – they look like the image below.
Photo by SAN DRINO
At times the analysis becomes more prolonged than these brief sessions interspersed throughout the gather phase. An individual pulls together all of the information they have gathered and analyzes it together to see what new clues, insights, learnings or conclusions can be gleaned. Whether it is a more formal analysis session or smaller analysis steps during the gather phase, the following questions are usually the focus of the anaylisis.
- Is this information helpful?
- Is this about my ancestor?
- Are these the same people?
- Does this conflict with other information?
- What information is missing and why?
One of the biggest challenges to the analysis phase is that it requires lots of knowledge and context to adequately perform an analysis. An individual often has to know the history, culture and jurisdictions of the places related to the information being analyzed. The individual has to understand the specifics of the particular artifact being analyzed – how, when, why and by whom it was created. One must have all known information about the person or family being researched. This is a lot of information to keep in context while performing an analysis. The complex context keeping problem is further exacerbated by common challenges with records that are hard to read because of fading, older or poor penmanship and different languages.
To overcome these challenges an individual uses a variety of different tools or seeks help from others with greater knowledge. It is common for an individual to plot out information on maps or use gazetteers. Timelines are often used to help identify unlikely differences between events. Family group sheets or other ways of depicting and comparing individuals, families and their relationships are brought to bear. Date calculators come in handy for calculating ages or specific dates of events. There are also many different forms that get used to try and capture, compare and analyze information.
While the context keeping concern is one of the main challenges in the Analyze phase, other challenges exist. For example, an extension of the context problem is that even after analysis has been performed it is common for an individual to need to refer back to earlier analysis efforts. It is extremely hard to keep all of the context brought to bear in the previous analysis. We’ve also observed that it is hard for an individual to do date calculations. Often one doesn’t know what information in a record is important or what it means so it is overlooked. The list of challenges goes on.
Several years ago I was involved with a study of how novices do family history. One of the interesting charts created as part of this study was an affinity diagram of all the challenges observed as novices attempted to do genealogy. When we had completed the diagram we had clustered together into related groupings all of the problems observed. While some of the problems were unique to novices, most of the problems observed were just inherent to the domain of genealogical research. After looking at the diagram for a while we decided to read through each of the nodes in the diagram and flag it if the node was an analysis step. We were surprised to discover that about 80% of the problems encountered were characterized as analysis. 80% of the challenges we observed while watching these novices do family history occurred during the analysis phase. Much of it was due to the complex context keeping conundrum described above.
As a result of this study we came to believe that there was a lack of meaningful analysis tools available to genealogists. We began a subsequent effort to prototype analysis tools. The first of these was an attempt to create a paper prototype of a tool that would help an ordinary person decide if two people (for example, an ancestor in his or her pedigree and an individual in a historical record) were the same. The tool basically helped the user compare relationships, events on a timeline and locations on a map for the persons in question. Our goal was to help an ordinary person make this determination within 30 seconds with high reliability. We created an initial mock-up and tested it with about five individuals. Images of these mockups follow. The initial test results were very good but unfortunately we were redirected onto higher priority efforts and have never returned to this effort.
Relationship Comparison
Timeline Comparison
Location Comparison (example 1)
Location Comparison (example 2)
It is my personal belief that next to availability of records, analysis tools (or the lack thereof) are the largest problem and opportunity in the domain of genealogy. What are your thoughts? Has this problem already been solved and I just don’t know it?












