Advertisement
Forensic Population Genetics – Letter to the Editor| Volume 7, ISSUE 1, e21-e22, January 2013

Analysis of Investigator HDplex markers in Swedish and Somali populations

      Dear Editor,
      In this study we have performed population analyses of a Swedish and a Somali population for the twelve short tandem repeat (STR) markers included in the Investigator HDplex kit (Qiagen). The set of markers tested, was also evaluated for their efficiency in forensic casework.
      A total of 217 unrelated Swedish individuals and 211 Somali male individuals were selected for the population analyses. The Swedes were selected on their Swedish surnames and the Somalis were collected from immigration casework performed at the Department of Forensic Genetics and Forensic Toxicology, Linköping, Sweden. The Swedish population samples consisted of blood samples extracted as earlier described [
      • Lindblom B.
      • Holmlund G.
      Rapid DNA purification for restriction fragment length polymorphism analysis.
      ]. The samples from the Somali individuals consisted of buccal cells on FTA-cards. For the blood samples, approximately 0.5 ng of purified DNA was used for the PCR amplification which was performed according to the manufacturer's protocol except for a reduced total reaction volume (total amount of 10 μl) and reduced number of PCR-cycles (from 30 to 27 cycles). The FTA samples were washed in water prior to the PCR. The PCR products were prepared for capillary electrophoresis by adding 1 μl PCR product to 8 μl of a HiDi Formamide/DNA size standard 550 BTO mix and separation and detection were performed on a ABI 3500XL genetic analyser with 50 cm capillary array and POP7 polymer (Applied Biosystems).
      The Swedish population samples were also typed for the 15 loci included in the AmpFlSTR® Identifiler® PCR Amplification Kit (Applied Biosystems) and the 17 loci included in the PowerPlex® ESI 17 System kit (Promega) according the manufacturers’ protocols, except for a reduced total PCR sample volume (10 μl).
      The internal validation of the Investigator HDplex kit revealed one notable observation. The mean peak height balance for marker D21S2055 was estimated to 0.67 (±0.31), which was too low to be acceptable. Further analysis of this phenomenon showed that the peak height imbalance was restricted to heterozygous genotypes for which one allele was larger than allele 30.
      Allele frequencies for the twelve STRs are shown in Supplementary Data 1 for the Swedish and the Somali populations. SE33 was the locus with the highest power of discrimination for both tested populations. For the Swedish population the power of exclusion ranged from 0.51 to 0.90 for the different markers, while the corresponding values varied between 0.48 and 0.82 for the Somali population. The overall match probability for the Swedish population was 1 in 7 × 1016, and 1 in 2 × 1017 for the Somali population. These figures are in the same magnitude as for the Identifiler kit in both the Swedish and Somali populations [
      • Montelius K.
      • Karlsson A.O.
      • Holmlund G.
      STR data for the AmpFlSTR Identifiler loci from Swedish population in comparison to European, as well as with non-European population.
      ,
      • Tillmar A.O.
      • Backstrom G.
      • Montelius K.
      Genetic variation of 15 autosomal STR loci in a Somali population.
      ]. No significant departure, tested by the exact test using genetic data analysis (GDA) [
      • Lewis P.O.
      • Zaykin D.
      Genetic Data Analysis: Computer Program for the Analysis of Allelic Data.
      ], from Hardy Weinberg equilibrium (HWE) was detected for any of the markers (Supplementary Data 1).
      The allele frequencies for the Swedish and Somali populations were compared with frequencies from a German population [
      • Schmid D.
      • Anslinger K.
      • Rolf B.
      Allele frequencies of the ACTBP2 (=SE33), D18S51, D8S1132, D12S391, D2S1360, D3S1744, D5S2500, D7S1517, D10S2325 and D21S2055 loci in a German population sample.
      ], a Polish population [
      • Kuzniar P.
      • Jastrzebska E.
      • Ploski R.
      Validation of nine non-CODIS STR loci for forensic use in a population from Central Poland.
      ] and a Lithuanian population [
      • Pepinski W.
      • Niemcunowicz-Janica A.
      • Skawronska M.
      • Janica J.
      Polymorphism of 11 non-CODIS STRs in a population sample of Lithuanian minority residing in northeastern Poland.
      ] using the exact test of population differentiation and were carried out using the Arlequin Software version 3.5 [
      • Excoffier L.
      • Laval G.
      • Schneider S.
      Arlequin (version 3.0): an integrated software package for population genetics data analysis.
      ]. The test showed that the frequency distributions for the Somali population were significantly different from the Swedish population for all 12 STRs (Supplementary Data 2). No significant P-values were found (after correction for multiple testing [
      • Holm S.
      A simple sequentially rejective multiple test procedure.
      ,
      • Sidak Z.
      Rectangular confidence regions for the means of multivariate normal distributions.
      ]) between the Swedish and the German populations or between the Swedish and the Polish population. There were, however, differences between the Swedish and the Lithuanian population for three loci (D21S2055, D2S1360 and SE33).
      To test the usefulness of the STR loci in paternity testing, 28 paternity trio cases were typed. 14 of these represented cases with a previous confirmed true father (paternity index (PI) > 100,000) and 14 trios represented cases where the alleged father had, by previous DNA-typing, been excluded as the true father of the child. The analysis of the paternity trios resulted in high PIs (1.9 × 105–5.0 × 1010, median: 1.5 × 108) for the earlier confirmed true trios, and numerous genetic inconsistencies (6–12, median: 8) for the 14 non-paternity trios.
      Investigator HDplex has been suggested to be suitable as a supplementary marker kit. Since many of the loci included in the kit are located on the same chromosomes as many of the core STRs, allelic association (or linkage disequilibrium, LD) and the effect of linkage need to be tested for. When it comes to linkage Phillips et al. [
      • Phillips C.
      • Ballard D.
      • Gill P.
      • Court D.S.
      • Carracedo A.
      • Lareu M.V.
      The recombination landscape around forensic STRs: accurate measurement of genetic distances between syntenic STR pairs using HapMap high density SNP data.
      ] provided a map of the genetic distances between the commonly used STRs, including the markers in the Investigator HDplex kit. The effect of linkage is, apart from the genetic distance, also dependent on the pedigree and the homozygous/heterozygous status of the involved individuals, etc. [
      • Kling D.
      • Egeland T.
      • Tillmar A.O.
      FamLink – a user friendly software for linkage calculations in family genetics.
      ,
      • O’Connor K.L.
      • Tillmar A.O.
      Effect of linkage between vWA and D12S391 in kinship analysis.
      ].
      No significant P-values were found when allelic association between alleles at the different loci in the Investigator HDplex kit was tested, using GDA [
      • Lewis P.O.
      • Zaykin D.
      Genetic Data Analysis: Computer Program for the Analysis of Allelic Data.
      ], in the Swedish and in the Somali population (data not shown). No significant P-values were found when allelic association between pairs of loci were tested for all 30 STRs included in the Investigator HDplex kit, the Identifiler kit and the ESI 17 kit for the Swedish population (data not shown).
      Simulations were performed to demonstrate the informative value and the effect of linkage for the combined use of the 30 STRs included in the Investigator HDplex kit, the Identifiler kit and the ESI 17 kit for a case scenario with a question of sibship. FamLink [
      • Kling D.
      • Egeland T.
      • Tillmar A.O.
      FamLink – a user friendly software for linkage calculations in family genetics.
      ] was used for the simulation study and the hypotheses were H1: individual A and individual B are full siblings and H2: individual A and individual B are maternal half siblings. Data were available from individual A and B and their mother in common. 10,000 simulations were performed with Swedish allele frequencies and recombination rates from Phillips et al. [
      • Phillips C.
      • Ballard D.
      • Gill P.
      • Court D.S.
      • Carracedo A.
      • Lareu M.V.
      The recombination landscape around forensic STRs: accurate measurement of genetic distances between syntenic STR pairs using HapMap high density SNP data.
      ]. The results from the simulations shown that the median likelihood ratio (LR) from such analysis can be expected around 105 and that the LR is, in general, only slightly overestimated when linkage is ignored (Supplementary Data 3). However, as shown in the box plot in Supplementary Data 3, the case specific effect of linkage can be considerable, resulting in up to over 100 times overestimation of the true LR when linkage is ignored.
      In summary, the Investigator HDplex markers have shown to be highly informative in both the Swedish and the Somali populations and are well suitable for use as supplementary markers in relationship testing. Although no signs of allelic association, linkage should be accounted for when the Investigator HDplex markers are used in combination with closely located core STRs.
      This paper follows the ISFG guidelines for publication of population data requested by the journal [
      • Carracedo A.
      • Butler J.M.
      • Gusmao L.
      • Parson W.
      • Roewer L.
      • Schneider P.M.
      Publication of population data for forensic purposes.
      ].

      Appendix A. Supplementary data

      References

        • Lindblom B.
        • Holmlund G.
        Rapid DNA purification for restriction fragment length polymorphism analysis.
        Gene Anal. Tech. 1988; 5: 97-101
        • Montelius K.
        • Karlsson A.O.
        • Holmlund G.
        STR data for the AmpFlSTR Identifiler loci from Swedish population in comparison to European, as well as with non-European population.
        Forensic Sci. Int. Genet. 2008; 2: e49-e52
        • Tillmar A.O.
        • Backstrom G.
        • Montelius K.
        Genetic variation of 15 autosomal STR loci in a Somali population.
        Forensic Sci. Int. Genet. 2009; 4: e19-e20
        • Lewis P.O.
        • Zaykin D.
        Genetic Data Analysis: Computer Program for the Analysis of Allelic Data.
        2001
        • Schmid D.
        • Anslinger K.
        • Rolf B.
        Allele frequencies of the ACTBP2 (=SE33), D18S51, D8S1132, D12S391, D2S1360, D3S1744, D5S2500, D7S1517, D10S2325 and D21S2055 loci in a German population sample.
        Forensic Sci. Int. 2005; 151: 303-305
        • Kuzniar P.
        • Jastrzebska E.
        • Ploski R.
        Validation of nine non-CODIS STR loci for forensic use in a population from Central Poland.
        Forensic Sci. Int. 2006; 159: 258-260
        • Pepinski W.
        • Niemcunowicz-Janica A.
        • Skawronska M.
        • Janica J.
        Polymorphism of 11 non-CODIS STRs in a population sample of Lithuanian minority residing in northeastern Poland.
        Forensic Sci. Int. Genet. 2011; 5: e37
        • Excoffier L.
        • Laval G.
        • Schneider S.
        Arlequin (version 3.0): an integrated software package for population genetics data analysis.
        Evol. Bioinform. Online. 2005; 1: 47-50
        • Holm S.
        A simple sequentially rejective multiple test procedure.
        Scand. J. Stat. 1979; 6: 65-70
        • Sidak Z.
        Rectangular confidence regions for the means of multivariate normal distributions.
        J. Am. Stat. Assoc. 1967; 62: 626-633
        • Phillips C.
        • Ballard D.
        • Gill P.
        • Court D.S.
        • Carracedo A.
        • Lareu M.V.
        The recombination landscape around forensic STRs: accurate measurement of genetic distances between syntenic STR pairs using HapMap high density SNP data.
        Forensic Sci. Int. Genet. 2012; 6: 354-365
        • Kling D.
        • Egeland T.
        • Tillmar A.O.
        FamLink – a user friendly software for linkage calculations in family genetics.
        Forensic Sci. Int. Genet. 2012; 6: 616-620
        • O’Connor K.L.
        • Tillmar A.O.
        Effect of linkage between vWA and D12S391 in kinship analysis.
        Forensic Sci. Int. Genet. 2012; https://doi.org/10.1016/j.fsigen.2012.03.008
        • Carracedo A.
        • Butler J.M.
        • Gusmao L.
        • Parson W.
        • Roewer L.
        • Schneider P.M.
        Publication of population data for forensic purposes.
        Forensic Sci. Int. Genet. 2010; 4: 145-147