Skip to content

Latest commit

 

History

History
20 lines (11 loc) · 1.19 KB

README.md

File metadata and controls

20 lines (11 loc) · 1.19 KB

Family Tree

Find family connections between people born thousands of years apart. Information from over 600,000 individuals has been pulled from Wikidata using Beautiful Soup 4 and requests in python.

From Arnie to Liz, via JFK! alt text

Person.py

The path() method in the Person object uses breadth-first search to find the shortest path between 2 people, and prints out all individuals in the path.

tree_builder.py

This is the script used to scrape wikidata.org. I let it run on a AWS Micro server, which took about 12 days to pull all 600,000 people

ancestorsFinal.txt

ancestorsFinal.txt holds all the data that was extracted by running the tree_builder.py script. One person per line, and columns are separated with an asterisk character (*) due to issues with comma in peoples names causing parsing issues

Disclaimer: As the data is from Wikidata, which is a collaboratively edited knowledge base, not all connections can be verified. References are included on the individuals wikidata page where available.