Highlights
- •Two efficient computational methods were developed to infer surnames from Y-STR profiles.
- •More than 19,000 men bearing 266 surnames were typed for 17 Y-STR loci to demonstrate the performance of the methods.
- •The possibility of inferring surnames from Y-STR profiles reliably enables promising applications in forensics.
Abstract
Co-ancestry of human surnames and Y-chromosomes in most human populations and social
groups suggests the possibility of inferring one from the other. However, such an
intuitive perspective remains to be formally explored. In the present study, we develop
two computational methods, based on cosine distance (dcos) and coalescence distance (dcoal) respectively, to infer surnames from Y-STR profiles. We also survey Y-STR variations
at 15 loci for 19,009 individuals of Shandong Province in China. For a total of 266
surnames included in the data set, our methods can pinpoint to a single surname with
an average accuracy of 65%, and with an average accuracy higher than 80% when providing
>4 candidate surnames. We also demonstrate that increasing the sample size of surnames
and the number of STR loci improves the accuracy of surname inference. Our results
indicate that the 15 non-duplicated Y-STR loci contain information from which surname
can be reliably inferred for Chinese populations, showing a promising application
in forensics.
Keywords
To read this article in full you will need to make a payment
Purchase one-time access:
Academic & Personal: 24 hour online accessCorporate R&D Professionals: 24 hour online accessOne-time access price info
- For academic or personal research use, select 'Academic and Personal'
- For corporate R&D use, select 'Corporate R&D Professionals'
Subscribe:
Subscribe to Forensic Science International: GeneticsAlready a print subscriber? Claim online access
Already an online subscriber? Sign in
Register: Create an account
Institutional Access: Sign in to ScienceDirect
References
- Genetic signatures of coancestry within surnames.Curr. Biol. 2006; 16: 384-388
- Identifying personal genomes by surname inference.Science. 2013; 339: 321-324
- The relationship between surname frequency and Y chromosome variation in Spain.Eur. J. Hum. Genet. 2016; 24: 120-128
- Identification of the remains of the Romanov family by DNA analysis.Nat. Genet. 1994; 6: 130-135
- Improving human forensics through advances in genetics, genomics and molecular biology.Nat. Rev. Genet. 2012; 12: 179-192
- Microsatellites and kinship.Trends Ecol. Evol. 1993; 8: 285-288
- Forensic DNA Typing: Biology, Technology, and Genetics of STR Markers.2nd ed. Elsevier, 2005
- DNA commission of the International Society of Forensic Genetics: recommendations on forensic analysis using Y-chromosome short tandem repeats.Leg. Med. 2001; 3: 252-257
- Chinese surnames and the genetic differences between North and South China.J. Chin. Ling. Monogr. Ser. 1992; 5
- In the name of the father: surnames and genetics.Trends Genet. 2001; 17: 353-357
- A study of surnames in China through isonymy.Am. J. Phys. Anthropol. 2012; 148: 341-350
- What’s in a name? Y chromosomes, surnames and the genetic genealogy revolution.Trends Genet. 2009; 25: 351-360
- Chinese Surnames: Community Heredity and Population Distribution.East China Normal University Press, Shanghai2002 (in Chinese)
- Population genetics of Chinese surnames I. Surname frequency distribution and genetic diversity in Chinese.Acta Genet. Sin. 2000; 27 (in Chinese): 471-476
- Genetic structure of the Han Chinese population revealed by genome-wide SNP variation.Am. J. Hum. Genet. 2009; 85: 775-785
- New method for surname studies of ancient patrilineal population structures, and possible application to improvement of Y-chromosome sampling.Am. J. Hum. Phys. Anthrop. 2005; 126: 214-228
- Surnames and Y-chromosomal markers reveal low relationship in Southern Spain.PLoS One. 2015; 10: e0123098
- People of the British Isles: preliminary analysis of genotypes and surnames in a UK-control population.Eur. J. Hum. Genet. 2012; 20: 203-210
- Family name distributions: master equation approach.Phys. Rev. E. 2007; 76: 046113
- Chelex 100 as a medium for simple extraction of DNA for PCR-based typing from forensic material.Biotechniques. 1991; 10: 506-513
- DNA commission of the International Society of Forensic Genetics (ISFG): an update of the recommendations on the use of Y-STRs in forensic analysis.Forensic Sci. Int. 2006; 157: 187-197
- Estimating the time to the most recent common ancestor for the Y chromosome or mitochondrial DNA for a pair of individuals.Genetics. 2001; 158: 897-912
- Mutation rates at Y chromosome specific microsatellites.Hum. Mutat. 2005; 26: 520-528
- Comprehensive mutation analysis of 17 Y-chromosomal short tandem repeat polymorphisms included in the AmpFlSTR® Yfiler® PCR amplification kit.Int. J. Legal Med. 2009; 123: 471-482
- Mutability of Y-chromosomal microsatellites: rates, characteristics, molecular bases, and forensic implication.Am. J. Hum. Genet. 2010; 87: 341-353
- Genomic divergences between humans and other hominoids and the effective population size of the common ancestor of humans and chimpanzees.Am. J. Hum. Genet. 2001; 68: 444-456
- A model of mutation appropriate to estimate the number of electrophoretically detectable alleles in a finite population.Genet. Res. 1973; 22: 201-204
- Bessel functions of integer order.in: Abramowitz M. Stegun I.A. Handbook of Mathematical Functions. National Bureau of Standards, Washington, DC1964: 355-434
- Arlequin suite ver 3.5: A new series of programs to perform population genetics analyses under Linux and Windows.Mol. Ecol. Resour. 2010; 10: 564-567
- Median-joining networks for inferring intraspecific phylogenies.Mol. Biol. Evol. 1999; 16: 37-48
- Estimating per-locus mutation rates.J. Genet. Gen. 2006; 2: 27-33
- GEP-ISFG, population and segregation data on 17 Y-STRs: results of a GEP-ISFG collaborative study.Int. J. Legal Med. 2008; 122: 529-533
- R: A Language and Environment for Statistical Computing.R Foundation for Statistical Computing, Vienna Austria2016URL https://www.R-project.org/
- Surname in Taiwan: interpretations based on geography and history.Hum. Biol. 1983; 55: 367-374
Article info
Publication history
Published online: November 24, 2017
Accepted:
November 23,
2017
Received in revised form:
November 21,
2017
Received:
June 11,
2017
Identification
Copyright
© 2017 Elsevier B.V. All rights reserved.