Full Citation
Title: Jane, John ... Leslie? A Historical Method for Algorithmic Gender Prediction.
Citation Type: Journal Article
Publication Year: 2015
ISBN:
ISSN:
DOI:
NSFID:
PMCID:
PMID:
Abstract: This article describes a new method for inferring the gender of personal names using large historical datasets. In contrast to existing methods of gender prediction that treat names as if they are timelessly associated with one gender, this method uses a historical approach that takes into account how naming practices change over time. It uses historical data to measure the likelihood that a name was associated with a particular gender based on the time or place under study. This approach generates more accurate results for sources that encompass changing periods of time, providing digital humanities scholars with a tool to estimate the gender of names across large textual collections. The article first describes the methodology as implemented in the gender package for the R programming language. It goes on to apply the method to a case study in which we examine gender and gatekeeping in the American historical profession over the past half-century. The gender package illustrates the importance of incorporating historical approaches into computer science and related fields. Please see the lmullen/gender-article GitHub repository for the code used to create this article.
Url: http://www.digitalhumanities.org/dhq/vol/9/3/000223/000223.html
User Submitted?: No
Authors: Blevins, Cameron; Mullen, Lincoln
Periodical (Full): Digital Humanities Quarterly
Issue: 3
Volume: 9
Pages:
Data Collections: IPUMS USA
Topics: Gender, Methodology and Data Collection
Countries: