Contains total count and racial breakdown data for surnames in the United States.

cnames

Format

A data frame with 151671 rows and 7 variables:

surname

character: surname

count

integer: number of individuals with a specific surname

pctwhite

numeric: percentage of non-Hispanic whites among those who have a specific surname

pctblack

numeric: percentage of non-Hispanic blacks among those who have a specific surname

pctapi

numeric: percentage of non-Hispanic Asians and Pacific Islanders among those who have a specific surname

pcthispanic

numeric: percentage of Hispanic origin among those who have a specific surname

pctothers

numeric: percentage of the other racial groups among those who have a specific surname

Details

See QSS Table 6.3.

References

  • Imai, Kosuke. 2017. Quantitative Social Science: An Introduction. Princeton University Press. URL.

  • Kosuke Imai and Kabir Khanna (2016) “Improving ecological inference by predicting individual ethnicity from voter registration records.” Political Analysis, vol. 24, no. 2 (Spring), pp. 263–272. doi: https://doi.org/10.1093/pan/mpw001