
BIBLIOGRAPHY

Publications, working papers, and other research using data resources from IPUMS.

Full Citation

Title: Record Linkage for Character-Based Surnames: Evidence from Chinese Exclusion

Citation Type: Journal Article

Publication Year: 2023

ISSN: 1090-2457

DOI: 10.1016/J.EEH.2022.101493

Abstract: This paper proposes a novel pre-processing technique to improve record linkage for historical Chinese populations. Current matching approaches are relatively ineffective due to Chinese-specific naming conventions and enumeration errors. This paper develops a three-step process that both triples the match rate over baseline and improves match accuracy. The procedures developed in this paper can be applied in part or in full to other sources of historical data, and/or modified for use with other character-based languages such as Japanese. More broadly, this approach suggests the promise of language-specific linkage procedures to boost match rates for ethnic minority groups.

Url: https://www.sciencedirect.com/science/article/pii/S0014498322000717?via%3Dihub

User Submitted?: No

Authors: Postel, Hannah M.

Periodical (Full): Explorations in Economic History

Issue:

Volume: 87

Pages: 1-6

Data Collections: IPUMS USA - Ancestry Full Count Data

Topics: Race and Ethnicity

Countries: China
