IPUMS.org Home Page

BIBLIOGRAPHY

Publications, working papers, and other research using data resources from IPUMS.

Full Citation

Title: Jane, John ... Leslie? A Historical Method for Algorithmic Gender Prediction.

Citation Type: Journal Article

Publication Year: 2015

Abstract: This article describes a new method for inferring the gender of personal names using large historical datasets. In contrast to existing methods of gender prediction that treat names as if they are timelessly associated with one gender, this method uses a historical approach that takes into account how naming practices change over time. It uses historical data to measure the likelihood that a name was associated with a particular gender based on the time or place under study. This approach generates more accurate results for sources that encompass changing periods of time, providing digital humanities scholars with a tool to estimate the gender of names across large textual collections. The article first describes the methodology as implemented in the gender package for the R programming language. It goes on to apply the method to a case study in which we examine gender and gatekeeping in the American historical profession over the past half-century. The gender package illustrates the importance of incorporating historical approaches into computer science and related fields. Please see the lmullen/gender-article GitHub repository for the code used to create this article.

Url: http://www.digitalhumanities.org/dhq/vol/9/3/000223/000223.html

User Submitted?: No

Authors: Blevins, Cameron; Mullen, Lincoln

Periodical (Full): Digital Humanities Quarterly

Issue: 3

Volume: 9

Pages:

Data Collections: IPUMS USA

Topics: Gender, Methodology and Data Collection

Countries:

IPUMS NHGIS NAPP IHIS ATUS Terrapop