The Population Project has temporarily suspended its activities. Our database of about 700 million and 200 million dead remains freely accessible. Read more.

Populating the United States of America

How we quickly acquired 70% of the US population.

AB

Antoine Bello · June 6, 2021

Six months ago, when we started to think about what would become the Population Project, we decided to begin our investigations with the two countries we know best: France and the United States. Granted, these countries can hardly be viewed as representative of the world population. Already highly digital, they could be expected to post more information online than the average country. But that was precisely the point: if we couldn’t quickly account for a significant proportion of the French and American population, then the notion of one day mapping India or the African continent was outright delusional.

The US quickly proved to be a public data Eldorado. For starters, voters’ lists are available to the public in every state. Sometimes you can download them on the state or county electoral commission’s site, sometimes you have to order a disk. Each record contains the first and last name, the middle initial, the sex and the date (or only the year) of birth. In a little more than a month, we were sitting on 200 million records!

It took us a bit longer to discover a second gold mine: lists of public employees along with their salaries and pensions. Some states go to great lengths to conceal their data but with a bit of tenacity, we were able to get ahold of most of it. Needless to say, we have no interest in how much civil servants make in Minnesota. We only store a handful of data points for each human and salary isn’t one of them.

With something like 80% of the American adult population in hand, we shifted our efforts to smaller datasets. For instance, there are several million inmates and sex offenders in the US. Over a million students graduate from college every year, their names printed in universities commencement programs. This made us realize that if we didn’t know the graduate’s dates of birth, we could still guess their ages with a two or three-year confidence margin.

We then branched into sports. Individual sports federations typically publish rankings of their discipline. Swimming, track & field, ski proved the most fruitful avenues, with chess, bridge and Rubik’s cube (50,000 names!) not far behind.

It became obvious at this point that younger generations would be the most difficult ones to come by. Except for 12-year-old swimmers and the occasional chess prodigy, we are desperately lacking in the under 18 category.

Overall, we consider our American campaign an unmitigated success. It has left us with hundreds of millions of records, including lots of overlapping ones that enable us to corroborate data and to gradually enrich profiles.

Next week: France!

A nonprofit organization striving to compile a list of every living person’s full name and place and date of birth.
The Population Project relies heavily on the work and contributions of volunteers. We believe that information-gathering and use should go hand-in-hand with transparency. This Privacy Policy explains how the Population Project, the non-profit organization that hosts this site, collects, uses, and shares information we receive from you through your use of the Population Project Site. It is essential to understand that, by using the Population Project Site, you consent to the collection, transfer, processing, storage, disclosure, and use of your information as described in this Privacy Policy. That means that reading this Policy carefully is important. We believe that you shouldn't have to provide nonpublic Personal Information to participate to the Population Project. You do not have to provide things like your real name, address, or country to sign up for a standard account or contribute content to the Population Project Site. We do not sell or rent your Personal Information, nor do we give it to others to sell you anything. We use it to figure out how to make the Population Project Site more engaging and accessible. Put simply: we use this information to make the Population Project Site better for you.