Highlights
- 226 ancestry (BGA) markers compiled for VISAGE Enhanced Tool (ET) combined with 184 appearance markers in one MPS assay.
- Autosomal BGA SNP number in ET reduced to allow inclusion of 85 Y-SNPs, 16 X-SNPs and 21 Microhaplotypes (MHs).
- Extra BGA markers give enhanced detail of co-ancestry patterns in admixed males, and MH loci allow ancestry-based deconvolution of mixed DNA.
- Comprehensive reference population datasets and analyses of global distribution of variation in the ET BGA markers outlined.
- Expanded Middle East-informative SNPs enhance differentiation of these populations, particularly when combined with nested K:5 STRUCTURE runs.
Abstract
Keywords
1. Introduction
2. Materials and methods
2.1 Selection of ancestry markers for ET
2.1.1 Autosomal BGA SNPs

2.1.2 Y-SNPs
No. | Marker name | SNP-ID | Position GRCh37 | Position GRCh38 | Substitution | ISOGG Nomenclature | Geographic distribution | No. | Marker name | SNP-ID | Position GRCh37 | Position GRCh38 | Substitution | ISOGG Nomenclature | Geographic distribution |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | V148 | rs181335666 | 6788191 | 6920150 | G->A | A0 | Central Africa, West Africa | 43 | M522 | rs9786714 | 7173143 | 7305102 | G->A | IJK | |
2 | L1086 | NA | 2826312 | 2958271 | A->T | A00 | Central Africa | 44 | M304 | rs13447352 | 22749853 | 20587967 | A->C | J | W Asia, North Africa, Horn of Africa, S Europe, Central Asia, South Asia |
3 | V168 | rs191505182 | 17947672 | 15835792 | G->A | A1 | | 45 | M267 | rs9341313 | 22741818 | 20579932 | T->G | J1 | Northern Africa, Horn of Africa, West Asia, South Asia |
4 | M31 | rs369315948 | 21739754 | 19577868 | G->C | A1a | West Africa, North Africa | 46 | M172 | rs2032604 | 14969634 | 12857709 | T->G | J2 | Southern Europe, West Asia |
5 | V50 | rs189205028 | 6845936 | 6977895 | T->C | A1b1a | Southern Africa, Central Africa | 47 | M9 | rs3900 | 21730257 | 19568371 | C->G | K | |
6 | M32 | rs558241924 | 21740436 | 19578550 | T->C | A1b1b | East Africa, Southern Africa | 48 | M526 | rs2033003 | 23550924 | 21389038 | A->C | K2 | |
7 | M13 | rs3904 | 21722098 | 19560212 | G->C | A1b1b2b | Central Africa, East Africa | 49 | M20 | rs3911 | 21733454 | 19571568 | A->G | L | South Asia, West Asia |
8 | M42 | rs2032630 | 21866840 | 19704954 | A->T | BT | | 50 | P326 | rs372687543 | 8467290 | 8599249 | T->C | LT [K1] | |
9 | M181 | rs2032599 | 14851554 | 12739620 | T->C | B | Central Africa, Southern Africa, East Africa | 51 | P256 | P256 | 8685231 | 8817190 | G->A | M or K2b1b | Near Oceania, Wallacea, Australia, Remote Oceania ??? |
10 | M168 | rs2032595 | 14813991 | 12702062 | C->T | CT | | 52 | M231 | rs9341278 | 15469724 | 13357844 | G->A | N | Northern Asia, Central Asia, Americas |
11 | M145 | rs3848982 | 21717208 | 19555322 | C->T | DE | | 53 | M46 | rs34442126 | 14922583 | 12810648 | T->C | N1a1 | Siberia / East Asia |
12 | M174 | rs2032602 | 14954280 | 12842354 | T->C | D | East Asia | 54 | VL29 | rs752512309 | 14570424 | 12458624 | T->C | N1a1a1a1a1a | NE Europe, Eastern Europe, Central Asia |
13 | F6251 | NA | 7681275 | 7813234 | C->T | D1a | East Asia, Central Asia | 55 | B479 | NA | 26271075 | 24124928 | C->A | N1a1a1a1a1c∼ | East Asia |
14 | M55 | rs2032621 | 21872738 | 19710852 | T->C | D1b | Japan | 56 | Z1936 | rs774008164 | 21463326 | 19301440 | C->T | N1a1a1a1a2 | NE Europe, Eastern Europe, Central Asia |
15 | L1378 | rs893924838 | 2828140 | 2960099 | C->T | D2 | SE Asia | 57 | F4205 | rs1028202961 | 16331432 | 14219552 | A->G | N1a1a1a1a3a | Mongolia |
16 | M96 | rs9306841 | 21778998 | 19617112 | C->G | E | Africa, West Asia, Southern Europe | 58 | B202 | NA | 2880546 | 3012505 | T->C | N1a1a1a1a3b | Russian Far East |
17 | M33 | rs368762706 | 21740450 | 19578564 | A->C | E1a | West Africa | 59 | M2118 | rs571876713 | 23259624 | 21097738 | A->G | N1a1a1a1a4 | Russian Far East |
18 | V38 | rs768983 | 6818291 | 6950250 | C->T | E1b1a | Sub Saharan Africa | 60 | F2930 | rs528311746 | 19080602 | 16968722 | G->A | N1b | East Asia |
19 | M215 | rs2032654 | 15467824 | 13355944 | A->G | E1b1b | | 61 | P186 | rs16981290 | 7568568 | 7700527 | C->A | O | East Asia, SE Asia, South Asia, Oceania |
20 | V32 | rs371254614 | 6932821 | 7064780 | G->C | E1b1b1a1a1b | East Africa | 62 | M119 | rs72613040 | 21762685 | 19600799 | T->G | O1a | SE Asia, East Asia, Oceania |
21 | V13 | rs368031074 | 6842263 | 6974222 | G->A | E1b1b1a1b1a | Southern Europe | 63 | P31 | rs200861659 | 14495243 | 12383440 | T->C | O1b | South Asia, SE Asia |
22 | M81 | rs2032640 | 21892572 | 19730686 | C->T | E1b1b1b1a | Northern Africa | 64 | M176 | rs11575897 | 2655180 | 2787139 | G->A | O1b2 | East Asia |
23 | M123 | rs371143248 | 21764586 | 19602700 | C->T | E1b1b1b2a1 | East Africa, West Asia | 65 | M122 | rs78149062 | 21764674 | 19602788 | A->G | O2 | East Asia, Oceania |
24 | M75 | rs2032639 | 21890177 | 19728291 | G->A | E2 | Sub Saharan Africa | 66 | JST-002611 | rs2075181 | 7546726 | 7678685 | G->A | O2a1b | East Asia |
25 | P143 | rs4141886 | 14197867 | 12077161 | G->A | CF | | 67 | P201 | rs2267801 | 2828196 | 2960155 | T->C | O2a2 | Oceania, East Asia |
26 | M130 | rs35284970 | 2734854 | 2866813 | C->T | C | Central, North & SE Asia, N America, East Asia, Near Oceania, Australia, Remote Oceania | 68 | P295 | rs895530 | 7963031 | 8094990 | T->G | P or K2b2 | |
27 | M38 | rs369611932 | 21742158 | 19580272 | T->G | C1b3a | Oceania / Indonesia | 69 | M242 | rs8179021 | 15018582 | 12906671 | C->T | Q | Northern Asia, Central Asia, America |
28 | M347 | rs868363758 | 2877479 | 3009438 | A->G | C1b3b | Australia | 70 | M3 | rs3894 | 19096363 | 16984483 | G->A | Q1b1a1a | America |
29 | M217 | rs2032668 | 15437333 | 13325453 | A->C | C2 | South Asia, Southern East Asia, Northern East Asia | 71 | M207 | rs2032658 | 15581983 | 13470103 | A->G | R | Europe, West Asia, Central Asia, South Asia, North Africa, Central Africa |
30 | P39 | rs887450245 | 14484581 | 12363850 | G->A | C2b1a1a1 | Northern America | 72 | M173 | rs2032624 | 15026424 | 12914512 | A->C | R1 | |
31 | M48 | rs373681213 | 21749881 | 19587995 | A->G | C2b1a1b | Siberia / Northern East Asia | 73 | M420 | rs17250535 | 23473201 | 21311315 | T->A | R1a | |
32 | M89 | rs2032652 | 21917313 | 19755427 | C->T | F | | 74 | Z282 | rs112563127 | 15588401 | 13476521 | T->C | R1a1a1b1a | Eastern Europe, Balkan |
33 | M201 | rs2032636 | 15027529 | 12915617 | G->T | G | West Asia, South-West Asia, Europe, Central Asia | 75 | Z284 | rs767265794 | 8717196 | 8849155 | C->G | R1a1a1b1a3a | Northern Europe |
34 | M285 | rs13447378 | 22741740 | 20579854 | G->C | G1 | South-West Asia, Central Asia | 76 | Z93 | rs566323605 | 7552356 | 7684315 | G->A | R1a1a1b2 | South Asia, Middle East, Central Asia |
35 | P287 | rs4116820 | 22072097 | 19910211 | G->T | G2 | West Asia, South-West Asia, Europe, Central Asia | 77 | M343 | rs9786184 | 2887824 | 3019783 | C->A | R1b | Western Europe |
36 | L901 | rs567848586 | 17844304 | 15732424 | C->T | H | South Asia, Eastern Europe, South-West Europe, Western Europe | 78 | U106 | rs16981293 | 8796078 | 8928037 | C->T | R1b1a1b1a1a1 | Western Europe |
37 | P96 | rs1027017284 | 14869743 | 12757813 | C->A | H2 | Eastern Europe, South-West Europe, Western Europe | 79 | P312 | rs34276300 | 22157311 | 19995425 | C->A | R1b1a1b1a1a2 | Western Europe |
38 | M170 | rs2032597 | 14847792 | 12735858 | A->C | I | Europe, West Asia | 80 | L21 | rs11799226 | 15654428 | 13542548 | C->G | R1b1a1b1a1a2c1 | Western Europe |
39 | M253 | rs9341296 | 15022707 | 12910796 | C->T | I1 | North-Europe, West Europe | 81 | CTS1078 | rs567703217 | 7186135 | 7318094 | G->C | R1b1a1b1b | Caucasus, Balkan, Middle East |
40 | M438 | rs17307294 | 16638804 | 14526924 | A->G | I2 | South Europe, Central Europe, East Europe | 82 | V88 | rs180946844 | 4862861 | 4994820 | C->T | R1b1b | Sub Saharan Africa |
41 | M436 | rs17315680 | 18747493 | 16635613 | G->C | I2a1b | North-Europe, West Europe | 83 | M479 | rs372157627 | 20834667 | 18672781 | C->T | R2 | South Asia |
42 | M429 | rs17306671 | 14031334 | 11910628 | T->A | IJ | | 84 | B254 | rs372295336 | 14102580 | 11981874 | C->A | S | Oceania, East Asia, Australia |
85 | M184 | rs20320 | 14898163 | 12786229 | G->A | T | West Asia, Horn of Africa, North Africa, Southern Europe, South Asia |
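As an illustration of how the Substitution and ISOGG Nomenclature columns of the Y-SNP table could be used together, the following minimal sketch classifies observed bases as ancestral or derived for a handful of markers copied from the table. It assumes that a substitution written `X->Y` denotes ancestral allele X and derived allele Y; that reading, the function names and the example profile are illustrative assumptions and are not taken from the paper's analysis pipeline.

```python
# Illustrative subset of the Y-SNP table: marker -> (substitution, ISOGG label).
Y_SNPS = {
    "M168": ("C->T", "CT"),
    "M89": ("C->T", "F"),
    "M9": ("C->G", "K"),
    "M207": ("A->G", "R"),
    "M343": ("C->A", "R1b"),
}


def call_state(marker, observed):
    """Classify an observed base as 'ancestral', 'derived' or 'unexpected'.

    Assumption: 'X->Y' in the table means ancestral X and derived Y.
    """
    substitution, _ = Y_SNPS[marker]
    ancestral, derived = substitution.split("->")
    if observed == derived:
        return "derived"
    if observed == ancestral:
        return "ancestral"
    return "unexpected"


def derived_haplogroups(profile):
    """Return ISOGG labels of all listed markers observed in the derived state."""
    return [
        Y_SNPS[marker][1]
        for marker, base in profile.items()
        if marker in Y_SNPS and call_state(marker, base) == "derived"
    ]


# Hypothetical male profile: derived at M168, M89, M9 and M207, ancestral at M343.
profile = {"M168": "T", "M89": "T", "M9": "G", "M207": "G", "M343": "C"}
print(derived_haplogroups(profile))
```

For the hypothetical profile above, the derived markers trace the path CT, F, K, R and stop before R1b, which is the kind of nested read-out the hierarchical marker selection is designed to support.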
2.1.3 X-SNPs
2.1.4 Microhaplotypes
Principal SNPs in the haplotype | Extra SNPs in MPS output | Internal MH name | Original MH nomenclature | Principal component SNPs | Extra SNPs in Ion S5 MPS sequence output | 5′ coordinate: GRCh37 | 3′ coordinate: GRCh37 | 5′ coordinate: GRCh38 | 3′ coordinate: GRCh38 | MH span in nucleotides | Original MH span |
---|---|---|---|---|---|---|---|---|---|---|---|
4 | 1 | 1pA | | rs28503881-rs4648788-rs72634811-rs28689700 | rs532405039 | 1529950 | 1529998 | 1594570 | 1594618 | 48 | |
3 | 2 | MH01 | mh01KK-01 | rs6663840-rs58111155-rs6688969 | rs199565833 / rs548721351 | 3743319 | 3743391 | 3826755 | 3826827 | 72 | 259 |
3 | - | 1pD | | rs6702428-rs12031966-rs6687440 | - | 106770076 | 106770110 | 106227454 | 106227488 | 34 | |
3 | - | MH03 | mh02KK-134 | rs12469721-rs3101043-rs3111398 | - | 161079411 | 161079450 | 160222900 | 160222939 | 39 | 103 |
3 | 2 | MH04 | mh02KK-136 | rs6714835-rs6756898-rs12617010 | rs530973697 / rs546011313 | 228092389 | 228092459 | 227227673 | 227227743 | 70 | 70 |
5 | 1 | 3pB | | rs11129981-rs11129982-rs75361533-rs11129983-rs1896565 | rs528474614 | 42924625 | 42924691 | 42883133 | 42883199 | 66 | |
5 | 5 | 3qC | | rs6583335-rs9848767-rs843520-rs9833841-rs965140 | rs559681042 / rs552643442 / rs550318827 / rs60667153 / rs183434367 | 196379897 | 196379993 | 196653026 | 196653122 | 96 | |
4 | 1 | 4qD | | rs34521178-rs4533811-rs4450974-rs61132367 | rs531239419 | 182795889 | 182795939 | 181874736 | 181874786 | 50 | |
4 | 3 | 7pB | | rs6951954-rs6969555-rs2158900-rs73080042 | rs139000977 / rs185814343 / rs552428908 | 25447589 | 25447640 | 25407970 | 25408021 | 51 | |
4 | 1 | 8pA | | rs10097211-rs80063668-rs73660014-rs7007616 | rs538206051 | 3306430 | 3306458 | 3448908 | 3448936 | 28 | |
5 | 6 | 8pB | | rs34821009-rs7822905-rs7836134-rs7822909-rs6474278 | rs577517386 / rs539800640 / rs113457629 / rs188201066 / rs113010596 / rs565537969 | 40664194 | 40664243 | 40806675 | 40806724 | 49 | |
5 | 1 | 9pA | | rs1408329-rs11789647-rs12555748-rs1535838-rs1408330 | rs567753466 | 2288647 | 2288718 | 2288647 | 2288718 | 71 | |
3 | 4 | 10pB | | rs11816330-rs10828819-rs4749046 | rs570240814 / rs536076967 / rs555668598 / rs572123381 | 25839394 | 25839446 | 25550465 | 25550517 | 52 | |
3 | 2 | MH11 | mh11KK-180 | rs4752778-rs74047734-rs7112918-rs4752777 | rs140892495 / rs555496836 | 1690950 | 1690984 | 1669720 | 1669754 | 34 | 193 |
4 | 1 | 12qB | | rs11177060-rs2111058-rs10878750-rs11835920 | rs571889826 | 68508276 | 68508353 | 68114496 | 68114573 | 77 | |
4 | - | 15qD | | rs1816771-rs74033914-rs5007156-rs4965040 | - | 98255928 | 98255978 | 97712698 | 97712748 | 50 | |
4 | 2 | MH18 | mh16KK-255 | rs16956011-rs3934954-rs3934955-rs3934956 | rs576469239 / rs184092108 | 81970353 | 81970407 | 81936748 | 81936802 | 54 | 142 |
4 | 1 | MH20 | mh18KK-293 | rs621320-rs621340-rs678179-rs621766 | rs80093367 | 76089886 | 76089968 | 78329886 | 78329968 | 82 | 82 |
3 | 2 | MH21 | mh21KK-315 | rs6517970-rs202132081-rs8131148-rs6517971 | rs533846035 / rs538072435 | 21880158 | 21880231 | 20507846 | 20507919 | 73 | 145 |
3 | 2 | MH22 | mh21KK-324 | rs2838868-rs7279250-rs8133697 | rs537553521 / rs567533147 | 46714641 | 46714707 | 45294726 | 45294792 | 66 | 158 |
5 | 2 | 22qB | | rs4925431-rs4925399-rs4925432-rs4925400-rs77899570 | rs192804904 / rs537823715 | 49060976 | 49061028 | 48665164 | 48665216 | 52 | |
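The "MH span in nucleotides" column of the Microhaplotype table appears to equal the 3′ coordinate minus the 5′ coordinate in both genome builds. The short sketch below checks that relationship for a few rows copied from the table; the tuple layout and the function name are illustrative assumptions, and the check is only a consistency test on the tabulated values, not part of the published workflow.

```python
# Rows copied from the Microhaplotype table:
# (internal MH name, 5' GRCh37, 3' GRCh37, 5' GRCh38, 3' GRCh38, tabulated span)
MH_ROWS = [
    ("1pA", 1529950, 1529998, 1594570, 1594618, 48),
    ("MH01", 3743319, 3743391, 3826755, 3826827, 72),
    ("3qC", 196379897, 196379993, 196653026, 196653122, 96),
    ("9pA", 2288647, 2288718, 2288647, 2288718, 71),
    ("MH20", 76089886, 76089968, 78329886, 78329968, 82),
]


def span_mismatches(rows):
    """Return names of MHs whose tabulated span differs from 3' minus 5'
    in either GRCh37 or GRCh38 coordinates."""
    mismatched = []
    for name, start37, end37, start38, end38, span in rows:
        if end37 - start37 != span or end38 - start38 != span:
            mismatched.append(name)
    return mismatched


# An empty list means every checked row is internally consistent.
print(span_mismatches(MH_ROWS))
```

The same check can be extended to all 21 loci by adding the remaining rows; an identical span in both builds also indicates that no insertion or deletion between GRCh37 and GRCh38 falls inside the checked intervals.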
2.2 Reference and test population data
2.2.1 Public population data from human genome sequencing projects
2.2.2 VISAGE in-house study populations
2.2.3 Compilation of standardised reference population datasets
2.3 Evaluation of ancestry and co-ancestry analysis using ET BGA SNPs
Available online: http://mathgene.usc.es/Snipper/ Multiple profiles classifier at: 〈http://mathgene.usc.es/snipper/analysismultipleprofiles.html〉 (both accessed 1st February 2023).
2.4 Microhaplotype reconstruction from ET data and pilot experiments to evaluate ancestry-based deconvolution of simple mixed DNA
N. Thomas, R Package - Microhaplot, (2019) 〈https://github.com/ngthomas/microhaplot〉. (Accessed 1st February 2023).
3. Results
3.1 Characteristics of autosomal BGA SNPs selected for ET
Group | No | SNP | Source | Chr | GRCh37 coordinate | GRCh38 coordinate | Group | No | SNP | Source | Chr | GRCh37 coordinate | GRCh38 coordinate |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
African | 1 | rs2814778 | BT | 1 | 159174683 | 159204893 | American | 1 | rs12498138 | BT | 7 | 83533047 | 83903731 |
African | 2 | rs1871534 | Novel | 8 | 145639681 | 144414297 | American | 2 | rs12594144 | BT | 20 | 62157718 | 63526365 |
African | 3 | rs2789823 | BT | 9 | 136769888 | 133904766 | American | 3 | rs7151991 | Novel | 3 | 121459589 | 121740742 |
African | 4 | rs1197062 | BT | 17 | 58641118 | 60563757 | American | 4 | rs17130385 | BT | 14 | 32635572 | 32166366 |
African | 5 | rs9479657 | Novel | 6 | 153928396 | 153607261 | American | 5 | rs3737576 | BT | 10 | 115196019 | 113436260 |
European | 1 | rs16891982 | EVC BT | 5 | 33951693 | 33951588 | American | 6 | rs6088466 | Novel | 1 | 101709563 | 101244007 |
European | 2 | rs1426654 | EVC BT | 15 | 48426484 | 48134287 | American | 7 | rs9847307 | Novel | 20 | 32913534 | 34325728 |
European | 3 | rs12913832 | EVC BT | 15 | 28365618 | 28120472 | American | 8 | rs11960137 | Novel | 3 | 64525713 | 64540037 |
European | 4 | rs12142199 | BT | 1 | 1249187 | 1313807 | American | 9 | rs2024566 | Novel | 5 | 155338081 | 155911071 |
European | 5 | rs8072587 | BT | 17 | 19211073 | 19307760 | South Asian | 1 | rs182857716 | Novel | 22 | 41697338 | 41301334 |
European | 6 | rs10962599 | BT | 9 | 16795286 | 16795288 | South Asian | 2 | rs367953206 | Novel | 16 | 48221771 | 48187860 |
European | 7 | rs9522149 | BT | 13 | 111827167 | 111174820 | South Asian | 3 | rs3857620 | Novel | 6 | 57496076 | 57629240 |
European | 8 | rs2196051 | BT | 8 | 122124302 | 121112062 | South Asian | 4 | rs1757928 | BT | 4 | 130022161 | 129101006 |
European | 9 | rs1924381 | BT | 13 | 72321856 | 71747724 | South Asian | 5 | rs2472304 | BT | 15 | 75044238 | 74751897 |
European | 10 | rs2715883 | BT | 11 | 120133494 | 120262785 | South Asian | 6 | rs12405776 | Novel | 1 | 242431557 | 242268255 |
European | 11 | rs1592672 | Novel | 12 | 80128593 | 79734813 | South Asian | 7 | rs2026999 | BT | 9 | 103140157 | 100377875 |
East Asian | 1 | rs3827760 | BT | 2 | 109513601 | 108897145 | South Asian | 8 | rs3844336 | BT | 8 | 62214766 | 61302207 |
East Asian | 2 | rs1545397 | Novel | 15 | 28187772 | 27942626 | South Asian | 9 | rs1796048 | BT | 2 | 97643576 | 96977839 |
East Asian | 3 | rs1229984 | BT | 4 | 100239319 | 99318162 | South Asian | 10 | rs1567803 | Novel | 2 | 101343018 | 100726556 |
East Asian | 4 | rs6437783 | Novel | 3 | 108172817 | 108453970 | South Asian | 11 | rs6754311 | Novel | 2 | 136707982 | 135950412 |
East Asian | 5 | rs1371048 | BT | 15 | 64161351 | 63869152 | South Asian | 12 | rs13280988 | BT | 8 | 112370516 | 111358287 |
East Asian | 6 | rs881929 | Novel | 2 | 145753166 | 144995599 | South Asian | 13 | rs17625895 | BT | 16 | 25775102 | 25763781 |
East Asian | 7 | rs4657449 | BT | 16 | 31079371 | 31068050 | South Asian | 14 | rs10764919 | BT | 10 | 131663651 | 129865387 |
Oceanian | 1 | rs4471745 | Novel | 1 | 165465281 | 165496044 | South Asian | 15 | rs1040934 | BT | 10 | 78066260 | 76306502 |
Oceanian | 2 | rs3751050 | BT | 17 | 53568884 | 55491523 | Middle East | 1 | rs1024124 | Novel | 15 | 33617064 | 33324863 |
Oceanian | 3 | rs10954737 | Novel | 11 | 9091244 | 9069697 | Middle East | 2 | rs12880237 | Novel | 14 | 68621818 | 68155101 |
Tri-allelic SNPs | 1 | rs1074689 | Novel | 16 | 52216074 | 52182162 | Middle East | 3 | rs1317026 | Novel | 6 | 161154955 | 160733923 |
Tri-allelic SNPs | 2 | rs1150911 | Novel | 1 | 228494382 | 228306681 | Middle East | 4 | rs1495085 | BT | 8 | 15298515 | 15441006 |
Tri-allelic SNPs | 3 | rs12629397 | Novel | 3 | 65814779 | 65829104 | Middle East | 5 | rs166054 | Novel | 16 | 11285202 | 11191345 |
Tri-allelic SNPs | 4 | rs1382568 | Novel | 8 | 11351220 | 11493711 | Middle East | 6 | rs17086288 | Novel | 6 | 124210612 | 123889467 |
Tri-allelic SNPs | 5 | rs1398461 | BT | 13 | 83839778 | 83265643 | Middle East | 7 | rs2156208 | Novel | 18 | 60131306 | 62464073 |
Tri-allelic SNPs | 6 | rs17287498 | Novel | 10 | 54530788 | 52771028 | Middle East | 8 | rs234623 | Novel | 20 | 57488964 | 58913909 |
Tri-allelic SNPs | 7 | rs2375771 | Novel | 4 | 187371930 | 186450776 | Middle East | 9 | rs262037 | Novel | 5 | 177990886 | 178563885 |
Tri-allelic SNPs | 8 | rs2387842 | Novel | 12 | 38736442 | 38342640 | Middle East | 10 | rs2835133 | Novel | 21 | 37133457 | 35761159 |
Tri-allelic SNPs | 9 | rs2585339 | BT | 14 | 49134978 | 48665775 | Middle East | 11 | rs310362 | Novel | 8 | 59925618 | 59013059 |
Tri-allelic SNPs | 10 | rs2605361 | BT | 12 | 74903531 | 74509751 | Middle East | 12 | rs3852253 | Novel | 7 | 18866190 | 18826567 |
Tri-allelic SNPs | 11 | rs2737126 | BT | 17 | 3618815 | 3715521 | Middle East | 13 | rs3862700 | Novel | 18 | 67862224 | 70194988 |
Tri-allelic SNPs | 12 | rs392461 | Novel | 5 | 81720271 | 82424452 | Middle East | 14 | rs4308478 | BT | 5 | 136334314 | 136998625 |
Tri-allelic SNPs | 13 | rs393953 | Novel | 21 | 43389036 | 41968927 | Middle East | 15 | rs4465645 | Novel | 17 | 50832843 | 52755483 |
Tri-allelic SNPs | 14 | rs408046 | Novel | 15 | 80031510 | 79739168 | Middle East | 16 | rs4737753 | BT | 8 | 54701811 | 53789251 |
Tri-allelic SNPs | 15 | rs4540055 | BT | 4 | 38803255 | 38801634 | Middle East | 17 | rs487750 | Novel | 9 | 138603740 | 135711894 |
Tri-allelic SNPs | 16 | rs5030240 | Novel | 11 | 32424389 | 32402843 | Middle East | 18 | rs6496996 | Novel | 15 | 93402496 | 92859266 |
Tri-allelic SNPs | 17 | rs556365 | Novel | 16 | 65927802 | 65893899 | Middle East | 19 | rs6701640 | Novel | 1 | 170696474 | 170727333 |
Tri-allelic SNPs | 18 | rs6588145 | Novel | 1 | 65859784 | 65394101 | Middle East | 20 | rs6894681 | Novel | 5 | 127218995 | 127883303 |
Tri-allelic SNPs | 19 | rs6933094 | BT | 6 | 150297603 | 149976467 | Middle East | 21 | rs7252391 | Novel | 19 | 44142771 | 43638619 |
Tri-allelic SNPs | 20 | rs7171818 | Novel | 15 | 58855169 | 58562970 | Middle East | 22 | rs7594173 | Novel | 2 | 32900330 | 32675263 |
Tri-allelic SNPs | 21 | rs776912 | BT | 1 | 10847784 | 10787727 | Middle East | 23 | rs7816786 | Novel | 8 | 101349662 | 100337434 |
Tri-allelic SNPs | 22 | rs7989291 | Novel | 13 | 57572989 | 56998855 | Middle East | 24 | rs7975017 | Novel | 12 | 26428793 | 26275860 |
Tri-allelic SNPs | 23 | rs809540 | Novel | 2 | 7879001 | 7738870 | Middle East | 25 | rs848461 | Novel | 7 | 77582265 | 77952948 |
Tri-allelic SNPs | 24 | rs914468 | Novel | 20 | 62100463 | 63469110 | Middle East | 26 | rs9467370 | Novel | 6 | 24968682 | 24968454 |
Tri-allelic SNPs | 25 | rs9845503 | Novel | 3 | 59700977 | 59715251 | Middle East | 27 | rs9817359 | Novel | 3 | 76473163 | 76424012 |
Tri-allelic SNPs | 26 | rs6504633 | Novel | 17 | 48112927 | 50035563 | Middle East | 28 | rs9899480 | Novel | 17 | 36185665 | 37826045 |
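The per-group row counts in the autosomal SNP table can be tallied to confirm the marker totals quoted elsewhere in the article (104 autosomal BGA SNPs and 226 BGA markers overall). The sketch below does that arithmetic; the group sizes were counted from the table rows, the Y-SNP, X-SNP and Microhaplotype numbers come from the Highlights, and the variable names are illustrative.

```python
# Number of autosomal BGA SNPs listed per group in the table above.
AUTOSOMAL_GROUP_COUNTS = {
    "African": 5,
    "European": 11,
    "East Asian": 7,
    "Oceanian": 3,
    "Tri-allelic SNPs": 26,
    "American": 9,
    "South Asian": 15,
    "Middle East": 28,
}

# Other ET ancestry marker classes, as stated in the Highlights.
OTHER_MARKER_COUNTS = {"Y-SNPs": 85, "X-SNPs": 16, "Microhaplotypes": 21}

autosomal_total = sum(AUTOSOMAL_GROUP_COUNTS.values())
bga_total = autosomal_total + sum(OTHER_MARKER_COUNTS.values())

print(f"Autosomal BGA SNPs: {autosomal_total}")
print(f"All ET BGA markers: {bga_total}")
```

The tally gives 104 autosomal SNPs, matching the "104 BGA SNPs" named in Section 3.5.3, and 226 markers in total, matching the first Highlight.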
3.2 Characteristics of X-SNPs selected for ET

3.2.1 16 X-SNPs

Admixed population samples | Total no. of samples | Undetermined PCA position | X Ancestry Inference Success Rate | % AFR X chromosomes | % EUR X chromosomes | % AMR X chromosomes |
---|---|---|---|---|---|---|
ACB Males | 47 | 2 | 96% | 90% | 6% | - |
ACB Females | 49 | 2 | 96% | 94% | 2% | - |
ASW Males | 26 | 4 | 85% | 65% | 20% | - |
ASW Females | 35 | 8 | 77% | 60% | 11% | 6% |
Urban Brazil Males* | 16 | 2 | 87% | 19% | 62% | 6% |
Rural Brazil Males* | 18 | 3 | 83% | 61% | 17% | 6% |
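The "X Ancestry Inference Success Rate" column can be reproduced, to within rounding, as the share of samples whose PCA position was not undetermined. The sketch below applies that reading to the rows of the table; the formula is inferred from the tabulated numbers rather than quoted from the text, so it should be treated as an assumption, and the names used are illustrative.

```python
# Rows copied from the table:
# (population sample, total samples, undetermined PCA position, tabulated success %)
X_ROWS = [
    ("ACB Males", 47, 2, 96),
    ("ACB Females", 49, 2, 96),
    ("ASW Males", 26, 4, 85),
    ("ASW Females", 35, 8, 77),
    ("Urban Brazil Males", 16, 2, 87),
    ("Rural Brazil Males", 18, 3, 83),
]


def success_rate(total, undetermined):
    """Percentage of samples with a determined PCA position.

    Assumed definition: 100 * (total - undetermined) / total.
    """
    return 100.0 * (total - undetermined) / total


# Largest gap between the computed and tabulated rates, in percentage points.
max_gap = max(abs(success_rate(t, u) - pct) for _, t, u, pct in X_ROWS)

for name, total, undetermined, tabulated in X_ROWS:
    computed = success_rate(total, undetermined)
    print(f"{name}: computed {computed:.1f}%, tabulated {tabulated}%")
```

Under this assumed definition every computed rate falls within one percentage point of the tabulated value, and the AFR, EUR and AMR percentages in each row sum to approximately the same figure.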
3.2.2 The 5-SNP X centromere haplotype


3.3 Characteristics of Microhaplotypes selected for ET and mixed DNA sequence analysis
3.3.1 Patterns of variation in the 21 Microhaplotypes
3.3.2 Pilot studies to evaluate ancestry-based deconvolution of simple mixtures using Microhaplotypes

3.4 Y-SNP genotypes in VISAGE Study populations
3.5 STRUCTURE analysis of ET BGA SNP data
3.5.1 Worldwide population structure patterns inferred from the autosomal BGA SNPs of ET

3.5.2 Analysing co-ancestry in admixed population samples with STRUCTURE
3.5.3 Comparisons of STRUCTURE analyses using 104 BGA SNPs vs combined 104 BGA plus 184 autosomal EVC SNPs
4. Discussion
Acknowledgments
Appendix A
Appendix B. Supplementary material
Supplementary material.
References
- Forensic genetic analysis of bio-geographical ancestry. Forensic Sci. Int. Genet. 2015; 18: 49-65
- Forensic DNA Phenotyping: Predicting human appearance from crime scene material for investigative purposes. Forensic Sci. Int. Genet. 2015; 18: 33-48
- Forensic individual age estimation with DNA: from initial approaches to methylation tests. Forensic Sci. Rev. 2017; 29: 121-144
- Development and evaluation of the ancestry informative marker panel of the VISAGE basic tool. Genes. 2021; 12: 1284
- Development and validation of the VISAGE AmpliSeq basic tool to predict appearance and ancestry from DNA. Forensic Sci. Int. Genet. 2020; 48: 102336
- VISAGE consortium, evaluation of the VISAGE basic tool for appearance and ancestry prediction using PowerSeq chemistry on the MiSeq FGx system. Genes. 2020; 11: 708
- VISAGE consortium, development and optimization of the VISAGE basic prototype tool for forensic age estimation. Forensic Sci. Int. Genet. 2020; 48: 102322
- Development of the VISAGE enhanced tool and statistical models for epigenetic age estimation in blood, buccal cells and bones. Aging. 2021; 13: 6459-6484
- Epigenetic age prediction in semen - marker selection and model development. Aging. 2021; 13: 19145-19164
- Development and inter-laboratory validation of the VISAGE enhanced tool for age estimation from semen using quantitative DNA methylation analysis. Forensic Sci. Int. Genet. 2020; 56: 102596
- Broadening the applicability of a custom multi-platform panel of Microhaplotypes: Bio-geographical ancestry inference and expanded reference data. Front. Genet. 2020; 11: 581041
- Development and validation of the EUROFORGEN NAME (North African and Middle Eastern) ancestry panel. Forensic Sci. Int. Genet. 2019; 42: 260-267
- Building a forensic ancestry panel from the ground up: the EUROFORGEN Global AIM-SNP set. Forensic Sci. Int. Genet. 2014; 11: 13-25
- Development of a panel of genome-wide ancestry informative markers to study admixture throughout the Americas. PLoS Genet. 2012; 8: e1002554
- Eurasiaplex: a forensic SNP assay for differentiating European and South Asian ancestries. Forensic Sci. Int. Genet. 2013; 7: 359-366
- Pacifiplex: An ancestry-informative SNP panel centred on Australia and the Pacific region. Forensic Sci. Int. Genet. 2016; 20: 71-80
- PIMA: A population informative multiplex for the Americas. Forensic Sci. Int. Genet. 2020; 44: 102200
- A global reference for human genetic variation. Nature. 2015; 526: 68-74
- The SNPforID browser: an online tool for query and display of frequency data from the SNPforID project. Int. J. Leg. Med. 2008; 122: 435-440
- Insights into human genetic variation and population history from 929 diverse genomes. Science. 2020; 367: 1339-1349
- High coverage whole-genome-sequencing of the expanded 1000 Genomes Project cohort including 602 trios. Cell. 2022; 185: 3426-3440 (VCF data available online: https://www.internationalgenome.org/dataportal/data-collection/30x-grch38)
- The genomic history of the Middle East. Cell. 2021; 184: 4612-4625
- A compilation of tri-allelic SNPs from 1000 Genomes and use of the most polymorphic loci for a large-scale human identification panel. Forensic Sci. Int. Genet. 2020; 46: 102232
- Forensic Y-SNP analysis beyond SNaPshot: High-resolution Y-chromosomal haplogrouping from low quality and quantity DNA using Ion AmpliSeq and targeted massively parallel sequencing. Forensic Sci. Int. Genet. 2019; 41: 93-106
- Worldwide human relationships inferred from genome-wide patterns of variation. Science. 2008; 319: 1100-1104
- The recombination landscape around forensic STRs: accurate measurement of genetic distances between syntenic STR pairs using HapMap high density SNP data. Forensic Sci. Int. Genet. 2012; 6: 345-365
- MAPlex - A massively parallel sequencing ancestry analysis multiplex for Asia-Pacific populations. Forensic Sci. Int. Genet. 2019; 42: 213-226
- Performance of ancestry-informative SNP and microhaplotype markers. Forensic Sci. Int. Genet. 2019; 43: 102141
- Evaluating 130 microhaplotypes across a global set of 83 populations. Forensic Sci. Int. Genet. 2017; 6: 29-37
- The Simons Genome Diversity Project: 300 genomes from 142 diverse populations. Nature. 2016; 538: 201-206
- Genomic analyses inform on migration events during the peopling of Eurasia. Nature. 2016; 538: 238-242
- Online population data resources for forensic SNP analysis with Massively Parallel Sequencing: An overview of online population data for forensic purposes. In: Pilli E., Berti A. (Eds.), Forensic DNA Analysis: Technological Development and Innovative Applications. CRC Press, Boca Raton, FL, USA, 2021
Available online: http://mathgene.usc.es/Snipper/ Multiple profiles classifier at: 〈http://mathgene.usc.es/snipper/analysismultipleprofiles.html〉 (both accessed 1st February 2023).
- Inference of population structure using multilocus genotype data. Genetics. 2000; 155: 945-959
- Clumpak: a program for identifying clustering modes and packaging population structure inferences across K. Mol. Ecol. Resour. 2015; 15: 1179-1191
- Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol. Ecol. 2005; 14: 2611-2620
- Inference of ancestry in forensic analysis II: analysis of genetic data. Methods Mol. Biol. 2016; 1420: 255-285
- Building a custom large-scale panel of novel microhaplotypes for forensic identification using MiSeq and Ion S5 massively parallel sequencing systems. Forensic Sci. Int. Genet. 2020; 48: 102213
- Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009; 25: 1754-1760
- The sequence Alignment/Map format and SAMtools. Bioinformatics. 2009; 25: 2078-2079
N. Thomas, R Package - Microhaplot, (2019) 〈https://github.com/ngthomas/microhaplot〉. (Accessed 1st February 2023).
- Tetra-allelic SNPs: Informative forensic markers compiled from public whole-genome sequence data. Forensic Sci. Int. Genet. 2015; 19: 100-106
- Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016; 536: 285-291
〈http://www.ensembl.org/Homo_sapiens/Variation/Population?db=core;r=6:60527829–60528829;v=rs3857620;vdb=variation;vf=169483878〉, (Accessed 1st February 2023).
- Open source software EuroForMix can be used to analyse complex SNP mixtures. Forensic Sci. Int. Genet. 2017; 31: 105-110
- Development and inter-laboratory evaluation of the VISAGE Enhanced Tool for appearance and ancestry inference from DNA. Forensic Sci. Int. Genet. 2022; 61: 102779
- Ancestry analysis in rural Brazilian populations of African descent. Forensic Sci. Int. Genet. 2018; 36: 160-166
- Genetic structure of human populations. Science. 2002; 298: 2381-2385
- Mixture deconvolution by massively parallel sequencing of microhaplotypes. Int. J. Leg. Med. 2019; 133: 719-729
Footnotes
☆Dedication: This paper is dedicated to co-author Peter Matthias Schneider, our esteemed scientific colleague and friend, who sadly died during its submission.
User license
Creative Commons Attribution – NonCommercial – NoDerivs (CC BY-NC-ND 4.0)