The Population Project has temporarily suspended its activities. Our database of about 700 million and 200 million dead remains freely accessible. Read more.

It has been a busy summer

AB

Antoine Bello · September 19, 2021

It has been a busy summer at the Population Project. After the initial sprint, we’re settling into a marathon pace more suited to our time horizon, which is measured in years, if not decades. We have more clarity on the challenges ahead:

Gathering lists is relatively easy in some countries (the US, France), much more difficult in others (Germany, Peru). Some countries have the good sense of separating first, middle, and last names, while others only display a full name that is sometimes difficult to break down. Hiring and coordinating volunteers has proved trying, leading us to target fewer countries where we hire highly productive freelance web researchers. It’s still a bit early to discuss metrics as numbers vary greatly from one week to the next. We can find the names of 20 million Mexican public employees on Monday and collect only 500,000 names the rest of the week. Overall though, we think we’ve only tapped the tip of the proverbial iceberg.

Processing lists is less rewarding than finding them but just as important. Most lists come in PDF format. They need to be converted and purged from any unnecessary or redondant information - a process that take anywhere between two minutes and one hour per list. As we can’t keep up with all the lists we receive, we’ve decided to outsource part of our workflow to a BPO (business process outsourcing) firm based in the Philippines and Madagascar. The first tests will begin next week. Being able to count on an apt and experienced partner should definitely free up some of our resources.

IT remains the biggest obstacle we face. The size of the database we plan on building doesn’t allow for shortcuts. Slight conception mistakes might prove absolutely catastrophic when we have one or five billion records. After an initial development in Postgres, we’ve scrapped our entire code and started anew on 4D, a powerful yet agile relational database system. The problems (comparing humans, merging records, etc.) haven’t gone away but they’re easier to frame. We won’t publish a beta version of the site as originally planned but hope to have a prototype some time during the first half of 2022.

As always, we welcome your feedback. If you feel like helping in any capacity - be it to collect lists, process them, or design our website - please contact me. One last ask: please share this post or your LinkedIn page or Facebook page - we haven’t had time to hire a PR firm yet!

A nonprofit organization striving to compile a list of every living person’s full name and place and date of birth.
The Population Project relies heavily on the work and contributions of volunteers. We believe that information-gathering and use should go hand-in-hand with transparency. This Privacy Policy explains how the Population Project, the non-profit organization that hosts this site, collects, uses, and shares information we receive from you through your use of the Population Project Site. It is essential to understand that, by using the Population Project Site, you consent to the collection, transfer, processing, storage, disclosure, and use of your information as described in this Privacy Policy. That means that reading this Policy carefully is important. We believe that you shouldn't have to provide nonpublic Personal Information to participate to the Population Project. You do not have to provide things like your real name, address, or country to sign up for a standard account or contribute content to the Population Project Site. We do not sell or rent your Personal Information, nor do we give it to others to sell you anything. We use it to figure out how to make the Population Project Site more engaging and accessible. Put simply: we use this information to make the Population Project Site better for you.