Advertisement
Research Article| Volume 10, P55-63, May 2014

Download started.

Ok

Comparing six commercial autosomal STR kits in a large Dutch population sample

Open AccessPublished:February 06, 2014DOI:https://doi.org/10.1016/j.fsigen.2014.01.008

      Abstract

      Regularly, STR results obtained with different PCR amplification kits are compared, for instance with old cases, when revisiting cold cases or when addressing cross-border crimes. It is known that differences in primer design for the same loci in different kits may give rise to null alleles or shifted alleles. In this study, the genotyping results of 2085 Dutch male samples were compared for six autosomal STR kits (Promega's PowerPlex® 16, ESX-16 and ESI-17 Systems, Qiagen's Investigator® ESSplex Kit and Applied Biosystems’ AmpFlSTR® Identifiler and NGM PCR Amplification Kits). A total of 19 discordant autosomal genotyping results were obtained that were examined by sequence analysis using Roche-454 next generation sequencing and/or Sanger sequencing. A further 25 discordances were found and sequenced for the Amelogenin locus. The 24 samples showing the same primer binding site mutation at the Amelogenin locus were subjected to X-STR analysis in order to assess whether they could share a common origin, which appeared not to be the case. Based on the sequencing results, we set the final genotypes and determined the allele frequencies of 23 autosomal STRs for the Dutch reference database.

      Keywords

      1. Introduction

      After expanding the European Standard Set (ESS) of markers used for autosomal forensic short tandem repeat (STR) genotyping [
      • Gill P.
      • Fereday L.
      • Morling N.
      • Schneider P.M.
      New multiplexes for Europe-amendments and clarification of strategic development.
      ,
      • Gill P.
      • Fereday L.
      • Morling N.
      • Schneider P.M.
      The evolution of DNA databases – recommendations for new European STR loci.
      ], a number of companies developed commercially available kits incorporating these loci [
      • Westen A.A.
      • Haned H.
      • Grol L.J.
      • Harteveld J.
      • van der Gaag K.J.
      • de Knijff P.
      • Sijen T.
      Combining results of forensic STR kits: HDplex validation including allelic association and linkage testing with NGM and identifiler loci.
      ]. Examples of these kits are the PowerPlex ESX and ESI Systems (Promega Corporation (Promega), Madison, WI, USA), the Investigator ESSplex (Plus) Kit (Qiagen Benelux B.V. (Qiagen), Venlo, The Netherlands) and the AmpFlSTR NGM™ (SElect) PCR Amplification Kit (Applied Biosystems (AB), Foster City, CA, USA). When comparing genotyping results of the same donor obtained with different PCR amplification kits, differences such as null (a.k.a. silent) alleles and shifted alleles have been observed, primarily due to differences in primer design (e.g. [
      • Hill C.R.
      • Duewer D.L.
      • Kline M.C.
      • Sprecher C.J.
      • McLaren R.S.
      • Rabbach D.R.
      • Krenke B.E.
      • Ensenberger M.G.
      • Fulmer P.M.
      • Storts D.R.
      • Butler J.M.
      Concordance and population studies along with stutter and peak height ratio analysis for the PowerPlex (R) ESX 17 and ESI 17 Systems.
      ,
      • Kline M.C.
      • Hill C.R.
      • Decker A.E.
      • Butler J.M.
      STR sequence analysis for characterizing normal, variant, and null alleles.
      ,
      • Welch L.A.
      • Gill P.
      • Phillips C.
      • Ansell R.
      • Morling N.
      • Parson W.
      • Palo J.U.
      • Bastisch I.
      European Network of Forensic Science Institutes (ENFSI): evaluation of new commercial STR multiplexes that include the European Standard Set (ESS) of markers.
      ]). With an increasing number of forensically available kits and a growing number of (international) DNA profile comparisons worldwide, for instance under the European Prüm Treaty, it is informative to know the extent of discordances at specific loci. When certain discordances occur regularly, this information may stimulate companies to include primer adjustments, such as degenerated primers, in newly developed forensic kits.
      In this study, 2085 Dutch male samples were typed for six autosomal STR kits: the PowerPlex® 16 (PP16), ESX-16 and ESI-17 Systems (Promega), the Investigator® ESSplex Kit (ESS, Qiagen) and the AmpFlSTR® Identifiler and NGM PCR Amplification Kits (AB). We evaluated the concordancy of the genotyping results obtained with these kits. Discordant allele calls were examined using Roche-454 next generation sequencing (NGS) and/or Sanger sequencing in order to identify the causal mutations. Additional analyses with X-STRs were performed to analyse whether a set of 24 donors having the same Amelogenin primer binding site mutation could share a common origin. Based on the sequencing results, the final genotypes were set and used to create a new Dutch allele frequency database.

      2. Material and methods

      2.1 DNA samples, extraction and quantification

      A total of 2085 male blood donors with self-declared Dutch ancestry were sampled from 99 locations across The Netherlands, while excluding major cities to avoid very recent admixture effects. All volunteers had given their informed consent, and a detailed description of the samples is given in [
      • Lao O.
      • Altena E.
      • Becker C.
      • Brauer S.
      • Kraaijenbrink T.
      • van O.M.
      • Nurnberg P.
      • de K.P.
      • Kayser M.
      Clinal distribution of human genomic diversity across the Netherlands despite archaeological evidence for genetic discontinuities in Dutch population history.
      ]. After anonymising the samples, DNA was robotically extracted either by the Autopure LS® system using the Gentra Puregene Blood Kit (Gentra Systems, Minneapolis, MN) or by the QIAcube using the QIAamp DNA Mini Kit (Qiagen Benelux B.V. (Qiagen), Venlo, The Netherlands). The samples were quantified with the Quantifiler® Duo DNA Quantification Kit on a 7500 Real-Time PCR System (Applied Biosystems (AB), Foster City, CA, USA).

      2.2 PCR, capillary electrophoresis and DNA profile analysis

      DNA amplifications were performed using six different autosomal STR kits: PP16, ESX-16 and ESI-17 (Promega), ESS (Qiagen) and Identifiler and NGM (AB). PCR amplification programs were performed according to the manufacturer's protocols, except for the reaction volumes (Table 1). The PCR input and reaction volume, the capillary electrophoresis instrument, setup and settings, and the DNA profile analysis software and detection threshold are shown in Table 1 for the six different STR kits.
      Table 1Characteristics of the PCR amplification, capillary electrophoresis and DNA profile analysis for the six autosomal PCR kits analysed in this study.
      PCRCapillary electrophoresisDNA profile analysis
      KitInput (ng)Reaction volume (μL)Instrument ABI Prism®ddH2O (μL)Hi-Di formamide (μL)Size standardSampleSettingsSoftwareDetection threshold (rfu)
      PP161.512.5310011.40.6 μL ILS 6001.0 μL3 kV/11 sGeneMarker v. 1.7530
      ESX-161.57.5310011.50.5 μL ILS 5001.0 μL
      30x diluted PCR product.
      3 kV/5 sGeneMarker v. 1.7530
      ESI-170.512.53130xl10.01.0 μL CC5 ILS 5001.0 μL1.5 kV/10 sGeneMapper ID-X v.1.1.150
      ESS1.57.5310011.50.5 μL DSS 5501.0 μL3 kV/10 sGeneMarker v. 1.7530
      Identifiler4.012.5310011.60.4 μL LIZ 6001.0 μL1 kV/22 sGeneMarker v. 1.7530
      NGM0.512.53130xl8.70.3 μL LIZ 5001.0 μL3 kV/15 sGeneMapper ID-X v.1.1.150
      a 30x diluted PCR product.

      2.3 Roche-454 next generation sequencing, Sanger sequencing and X-STR analysis

      Supplementary Text 1 describes the material and methods for Roche-454 NGS, analysis of NGS data [
      • Anvar S.Y.
      • van der Gaag K.J.
      • van der Heijden J.W.F.
      • Veltrop M.H.A.M.
      • Vossen R.H.A.M.
      • de Leeuw R.H.
      • Breukel C.
      • Buermans H.P.J.
      • Verbeek J.S.
      • de Knijff P.
      • den Dunnen J.T.
      • Laros J.F.J.
      TSSV: A tool for characterization of complex allelic variants in pure and mixed genomes.
      ,
      • Thompson J.D.
      • Higgins D.G.
      • Gibson T.J.
      CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.
      ,
      • Hall T.A.
      BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT.
      ], Sanger sequencing and X-STR analysis. Primers used during Sanger sequencing and X-STR analysis are presented in Supplementary Table 1.

      2.4 Concordancy testing, allele frequencies and statistical analyses

      The complete data set was tested for the presence of population substructure as described in [
      • Lao O.
      • Altena E.
      • Becker C.
      • Brauer S.
      • Kraaijenbrink T.
      • van O.M.
      • Nurnberg P.
      • de K.P.
      • Kayser M.
      Clinal distribution of human genomic diversity across the Netherlands despite archaeological evidence for genetic discontinuities in Dutch population history.
      ], and no substructure was detected (data not shown). Therefore, the complete dataset was interpreted as one group. The 2085 genotypes from all six kits were compared using Excel and discordant allele calls were identified. After sequencing analysis of discordant alleles, one final database was created comprising 2085 genotypes for 23 autosomal STRs and Amelogenin. With this database, the allele frequencies and descriptive statistics for this Dutch population sample were determined using the Excel Microsatellite Toolkit [
      • Park S.D.E.
      Trypanotolerance in West African Cattle and the Population Genetic Effects of Selection.
      ].

      3. Results and discussion

      3.1 Discordances between six autosomal STR kits

      The 2085 DNA samples were analysed with six autosomal STR kits (PP16, ESX-16, ESI-17, ESS, Identifiler and NGM), comprising a total of 23 different autosomal STR loci and Amelogenin (Fig. 1). The genotyping results for the 20 STRs and Amelogenin that are present in more than one kit (Penta D and E only reside in PP16, SE33 only resides in ESI-17; see Fig. 1) were compared and 44 discordances were detected in 38 DNA samples (Table 2). These discrepancies comprise 43 null alleles on 12 loci and one shifted allele with a size difference of one nucleotide (Table 2). Just when all 2085 samples had been analysed with the NGM kit, Applied Biosystems adjusted the kit (without changing the name; kit lot-numbers 1103010 M and up) by adding extra primers for Amelogenin, D2S441 and D22S1045 that cover reported primer binding site mutations on these loci [
      • Oldroyd N.
      • Green R.
      • Mulero J.
      • Hennessy L.
      • Tabak J.
      Development of the AmpFℓSTR® NGM SElect™ Kit: new sequence discoveries and implications for genotype concordance.
      ,
      • Green R.L.
      • Lagace R.E.
      • Oldroyd N.J.
      • Hennessy L.K.
      • Mulero J.J.
      Developmental validation of the AmpFlSTR(R) NGM SElect PCR Amplification Kit: a next-generation STR multiplex with the SE33 locus.
      ]. All discordant samples were reanalysed with this new version of NGM (here denoted NGM2), and the 24 missing X alleles on Amelogenin and the absent allele 14.1 on D2S441 were now detected and concordant with the results from other kits. As a result, the final number of discordant results decreased to 19, and these discordances occurred in 13 of the 2085 samples (for six samples, the same discordant result was seen in two different kits; Table 2). Seven discordant alleles were found for Identifiler and NGM/NGM2, three for ESI-17, and one for both PP16 and ESS (Table 2); ESX-16 showed no discordant allele calls (Table 2).
      Figure thumbnail gr1
      Fig. 1Fragment lengths for 23 autosomal STR markers and Amelogenin and their presence in six forensic STR kits: PP16, ESX-16 and ESI-17 (Promega, grey shades), ESS (Qiagen, red shade), Identifiler and NGM (Applied Biosystems, blue shades). Amplicon sizes are based on the shortest and longest fragment length in the allelic ladder for each marker. When a kit name is presented in grey, the marker is not present in the kit.
      Table 2Discordancy between six autosomal STR kits: PP16, ESX-16, ESI-17, ESS, Identifiler and NGM, based on 2085 Dutch male donor samples. Discordant results are shown in grey.
      MarkerDiscordancyPromegaQiagenABn observationsSequencing methodPutative cause
      PP16ESX-16ESI-17ESSIdentifilerNGMNGM2
      NGM2 is only tested on the discordant samples.
      5 kits + NGM5 kits + NGM2Mutation position before/after repeat
      Mutation position before (B) or after (A) the repeat, and in the direction as given in STRbase (accessed October 2013); see also Supplementary Data 1.
      Location in primer
      For most primers, the (start) position is given in Supplementary Data 1.
      Amel.null allele XXYXYXYXYXYYXY240454A: 44 C>T21 nt after start NGM R-primer
      Amel.null allele YXYXYXYXYXYXas NGM11454/SangerA: 51 G>A14 nt after start NGM R-primer
      D2S1338null allele 1717–2017–2017–202017–20as NGM11SangerA: 174 G>A23 nt after start Identifiler R-primer
      D2S441null allele 14.111–14.111–14.111–14.11111–14.110454B: 1 insA23 nt after start NGM F-primer
      D7S820null allele 1212–131311454A: 125 C>Astart Identifiler R-primer unknown
      D8S1179null allele 12.112.1–1312.1–131312.1–1312.1–1312.1–13as NGM11454insT final repeat [TCTTA]start ESI-17 R-primer unknown
      D12S391null allele 2118–211818–2118as NGM2
      Both observations involve the same donor and the discordancy was seen in two different kits.
      2
      Both observations involve the same donor and the discordancy was seen in two different kits.
      454/Sangerlarge del involving NGM, ESI-17, Kline
      • Kline M.C.
      • Hill C.R.
      • Decker A.E.
      • Butler J.M.
      STR sequence analysis for characterizing normal, variant, and null alleles.
      and 2 custom-made
      Up to 345 nt from repeat.
      R-primers
      D12S391null allele 2121–222221–2222as NGM2
      Both observations involve the same donor and the discordancy was seen in two different kits.
      2
      Both observations involve the same donor and the discordancy was seen in two different kits.
      454/Sangerlarge del involving NGM, ESI-17, Kline
      • Kline M.C.
      • Hill C.R.
      • Decker A.E.
      • Butler J.M.
      STR sequence analysis for characterizing normal, variant, and null alleles.
      and 2 custom-made
      Up to 345 nt from repeat.
      R-primers
      D13S317null allele 8118–1111454A: 27 C>A18 nt after start PP16 R-primer
      D16S539null allele 1312–1312–1312–1312–131212–13as NGM11454B: 140 G>A17 nt after start Identifiler F-primer
      D18S51null allele 1615–1615–1615–1615–161515as NGM2
      Both observations involve the same donor and the discordancy was seen in two different kits.
      2
      Both observations involve the same donor and the discordancy was seen in two different kits.
      454A: 172 G>A8 nt after start Identifiler + NGM R-primer
      D19S433allele shift 14 to 13.314–1514–1513.3–1514–1514–15as NGM11454B: 50 delAin between ESS and other (known) F-primers
      TH01null allele 66–96–96–96–999as NGM2
      Both observations involve the same donor and the discordancy was seen in two different kits.
      2
      Both observations involve the same donor and the discordancy was seen in two different kits.
      454B: 81 C>T14 nt after start Identifiler + NGM F-primer
      vWAnull allele 1717–1917–1917–1917–191919as NGM2
      Both observations involve the same donor and the discordancy was seen in two different kits.
      2
      Both observations involve the same donor and the discordancy was seen in two different kits.
      454B: 52 A>T19 nt after start Identifiler + NGM F-primer
      vWAnull allele 1716–1716–1716–1716–171616as NGM2
      Both observations involve the same donor and the discordancy was seen in two different kits.
      2
      Both observations involve the same donor and the discordancy was seen in two different kits.
      454B: 52 A>T19 nt after start Identifiler + NGM F-primer
      Total103173274419
      a NGM2 is only tested on the discordant samples.
      b Both observations involve the same donor and the discordancy was seen in two different kits.
      c Mutation position before (B) or after (A) the repeat, and in the direction as given in STRbase (accessed October 2013); see also Supplementary Data 1.
      d For most primers, the (start) position is given in Supplementary Data 1.
      e Up to 345 nt from repeat.

      3.2 Roche-454 NGS and/or Sanger sequencing of samples with discordant results

      In order to determine the genetic variations causing the above-mentioned 44 discordant results, the loci involved were sequenced for the affected samples and control samples. First, Roche-454 NGS was applied using primers designed by Kline et al. [
      • Kline M.C.
      • Hill C.R.
      • Decker A.E.
      • Butler J.M.
      STR sequence analysis for characterizing normal, variant, and null alleles.
      ], and when no (clear) results were obtained, Sanger sequencing was used with custom-designed primers both further away from the repeat than those by Kline et al. [
      • Kline M.C.
      • Hill C.R.
      • Decker A.E.
      • Butler J.M.
      STR sequence analysis for characterizing normal, variant, and null alleles.
      ] (Supplementary Table 1). For 42 of the 44 discordances a nucleotide change or nucleotide insertion or deletion was observed that probably caused the discordant genotyping result (Table 2 and Supplementary Data 1). For two samples, with a null allele at D12S391 for the ESI-17 and NGM/NGM2 kits, the discordant allele could not be amplified and no sequence change was revealed. There is a clear relation, however, between the occurrence of the null allele and the location of the various primers: kits that do detect the allele have reverse primers close to the repeat structure (for ESX the reverse primer starts 30 nt and for ESS 43 nt after the repeat; Supplementary Data 1), while the kits and custom-designed primers that do not reveal the allele have reverse primers further on (primer start for NGM is at position 125 (Supplementary Data 1) and for the various sequencing primers at nucleotide 213 [
      • Kline M.C.
      • Hill C.R.
      • Decker A.E.
      • Butler J.M.
      STR sequence analysis for characterizing normal, variant, and null alleles.
      ], 271 or 345). We infer that both null alleles are caused by a deletion of at least a few hundred nucleotides, residing after the position of the ESS reverse primer and covering the binding sites of the sequencing primers and the reverse primers in NGM/NGM2 and ESI-17 (for this kit, the exact primer start is unknown).
      Many studies have been conducted to determine the molecular basis of discordant results (e.g. [
      • Kline M.C.
      • Hill C.R.
      • Decker A.E.
      • Butler J.M.
      STR sequence analysis for characterizing normal, variant, and null alleles.
      ,
      • Budowle B.
      • Masibay A.
      • Anderson S.J.
      • Barna C.
      • Biega L.
      • Brenneke S.
      • Brown B.L.
      • Cramer J.
      • DeGroot G.A.
      • Douglas D.
      • Duceman B.
      • Eastman A.
      • Giles R.
      • Hamill J.
      • Haase D.J.
      • Janssen D.W.
      • Kupferschmid T.D.
      • Lawton T.
      • Lemire C.
      • Llewellyn B.
      • Moretti T.
      • Neves J.
      • Palaski C.
      • Schueler S.
      • Sgueglia J.
      • Sprecher C.
      • Tomsey C.
      • Yet D.
      STR primer concordance study.
      ,
      • Delamoye M.
      • Duverneuil C.
      • Riva K.
      • Leterreux M.
      • Taieb S.
      • De M.P.
      False homozygosities at various loci revealed by discrepancies between commercial kits: implications for genetic databases.
      ,
      • Clayton T.M.
      • Hill S.M.
      • Denton L.A.
      • Watson S.K.
      • Urquhart A.J.
      Primer binding site mutations affecting the typing of STR loci contained within the AMPFlSTR SGM Plus kit.
      ,
      • Vanderheyden N.
      • Mai A.
      • Gilissen A.
      • Cassiman J.J.
      • Decorte R.
      Identification and sequence analysis of discordant phenotypes between AmpFlSTR SGM Plus and PowerPlex 16.
      ]). When null alleles are observed relatively often (for instance in specific populations [
      • Clayton T.M.
      • Hill S.M.
      • Denton L.A.
      • Watson S.K.
      • Urquhart A.J.
      Primer binding site mutations affecting the typing of STR loci contained within the AMPFlSTR SGM Plus kit.
      ,
      • Leibelt C.
      • Budowle B.
      • Collins P.
      • Daoudi Y.
      • Moretti T.
      • Nunn G.
      • Reeder D.
      • Roby R.
      Identification of a D8S1179 primer binding site mutation and the validation of a primer designed to recover null alleles.
      ,
      • Nelson M.S.
      • Levedakou E.N.
      • Matthews J.R.
      • Early B.E.
      • Freeman D.A.
      • Kuhn C.A.
      • Sprecher C.J.
      • Amin A.S.
      • McElfresh K.C.
      • Schumm J.W.
      Detection of a primer-binding site polymorphism for the STR locus D16S539 using the Powerplex 1.1 system and validation of a degenerate primer to correct for the polymorphism.
      ]) and are due to a primer binding site mutation, manufacturers may add degenerate primers to the commercially available STR kits in order to recover the specific allele [
      • Green R.L.
      • Lagace R.E.
      • Oldroyd N.J.
      • Hennessy L.K.
      • Mulero J.J.
      Developmental validation of the AmpFlSTR(R) NGM SElect PCR Amplification Kit: a next-generation STR multiplex with the SE33 locus.
      ,
      • Leibelt C.
      • Budowle B.
      • Collins P.
      • Daoudi Y.
      • Moretti T.
      • Nunn G.
      • Reeder D.
      • Roby R.
      Identification of a D8S1179 primer binding site mutation and the validation of a primer designed to recover null alleles.
      ,
      • Nelson M.S.
      • Levedakou E.N.
      • Matthews J.R.
      • Early B.E.
      • Freeman D.A.
      • Kuhn C.A.
      • Sprecher C.J.
      • Amin A.S.
      • McElfresh K.C.
      • Schumm J.W.
      Detection of a primer-binding site polymorphism for the STR locus D16S539 using the Powerplex 1.1 system and validation of a degenerate primer to correct for the polymorphism.
      ]. The majority of discordant results, however, are rare, like most of the discordances found in our study. Despite their rarity, at least three of the autosomal discordances that we detected have been described before: 1) a G>A substitution 172 bp after the D18S51 repeat motif, described as position 75615 in GenBank sequence AC021803 by Delamoye et al. [
      • Delamoye M.
      • Duverneuil C.
      • Riva K.
      • Leterreux M.
      • Taieb S.
      • De M.P.
      False homozygosities at various loci revealed by discrepancies between commercial kits: implications for genetic databases.
      ], and Vanderheyden et al. [
      • Vanderheyden N.
      • Mai A.
      • Gilissen A.
      • Cassiman J.J.
      • Decorte R.
      Identification and sequence analysis of discordant phenotypes between AmpFlSTR SGM Plus and PowerPlex 16.
      ], 2) a C>T substitution 81 bp before the repeat structure of TH01, which was described by Li et al. [
      • Li F.
      • Xuan J.
      • Xing J.
      • Ding M.
      • Wang B.
      • Pang H.
      Identification of new primer binding site mutations at TH01 and D13S317 loci and determination of their corresponding STR alleles by allele-specific PCR.
      ], and 3) an A>T substitution 52 bp before the repeat motif of vWA, described as position 1631 in GenBank sequence M25858 by Alves et al. [
      • Alves C.
      • Amorim A.
      • Gusmao L.
      • Pereira L.
      VWA STR genotyping: further inconsistencies between Perkin-Elmer and Promega kits.
      ], Delamoye et al. [
      • Delamoye M.
      • Duverneuil C.
      • Riva K.
      • Leterreux M.
      • Taieb S.
      • De M.P.
      False homozygosities at various loci revealed by discrepancies between commercial kits: implications for genetic databases.
      ], Vanderheyden et al. [
      • Vanderheyden N.
      • Mai A.
      • Gilissen A.
      • Cassiman J.J.
      • Decorte R.
      Identification and sequence analysis of discordant phenotypes between AmpFlSTR SGM Plus and PowerPlex 16.
      ], and Kline et al. [
      • Kline M.C.
      • Hill C.R.
      • Decker A.E.
      • Butler J.M.
      STR sequence analysis for characterizing normal, variant, and null alleles.
      ].
      Except for Amelogenin, the markers that are present in both Identifiler and NGM are said to have the same primer binding positions [
      • Oldroyd N.
      • Green R.
      • Mulero J.
      • Hennessy L.
      • Tabak J.
      Development of the AmpFℓSTR® NGM SElect™ Kit: new sequence discoveries and implications for genotype concordance.
      ]. Notwithstanding, null alleles were detected for D2S1338 and D16S539 for Identifiler that were not observed for NGM (Table 2). We think the prolonged annealing time in the NGM protocol (3 min versus 1 min) made it possible to detect these alleles, albeit with relatively low peak heights compared to the other allele in the heterozygous allele set (peak height ratios were 0.06 and 0.13, respectively).
      All 24 samples that showed a null allele for X on Amelogenin when typed with NGM have the same G>A substitution (Table 2 and Supplementary Data 1). Green et al. [
      • Green R.L.
      • Lagace R.E.
      • Oldroyd N.J.
      • Hennessy L.K.
      • Mulero J.J.
      Developmental validation of the AmpFlSTR(R) NGM SElect PCR Amplification Kit: a next-generation STR multiplex with the SE33 locus.
      ] described that the discovery of rare population-specific variant alleles for AMEL prompted the adjustment of the AMEL primers, which was put into effect in the summer of 2011. The use of this updated NGM kit (here denoted NGM2), resolved all AMELX null alleles in our samples, indicating that this mutation is not very rare in the Dutch dataset as it occurs with a frequency of 1.15%. Interestingly, none of the other reported AMELX null alleles [
      • Shadrach B.
      • Commane M.
      • Hren C.
      • Warshawsky I.
      A rare mutation in the primer binding region of the amelogenin gene can interfere with gender identification.
      ,
      • Alves C.
      • Coelho M.
      • Rocha J.
      • Amorim A.
      The amelogenin locus displays a high frequency of X homologue failures in Sao Tome Island (West Africa).
      ,
      • Caratti S.
      • Voglino G.
      • Cirigliano V.
      • Ghidini A.
      • Taulli R.
      • Torre C.
      • Robino C.
      Amplification failure of the amelogenin gene (AMELX) caused by a primer binding site mutation.
      ,
      • Maciejewska A.
      • Pawlowski R.
      A rare mutation in the primer binding region of the Amelogenin X homologue gene.
      ,
      • Ou X.
      • Chen W.
      • Chen H.
      • Zhao F.
      • Zheng J.
      • Tong D.
      • Chen Y.
      • Chen A.
      • Sun H.
      Null alleles of the X and Y chromosomal amelogenin gene in a Chinese population.
      ], with estimated frequencies up to 2% in specific populations, were found in our dataset. In order to analyse whether the Dutch male donors of these samples could share a common origin, we tested X-STRs in all these Amelogenin mutation carriers. A homemade multiplex was used that contains the SRY male gender marker and 13 X-STRs of all four linkage groups [
      • Hering S.
      • Edelmann J.
      • Augustin C.
      • Kuhlisch E.
      • Szibor R.
      X chromosomal recombination—a family study analysing 39 STR markers in German three-generation pedigrees.
      ,
      • Nothnagel M.
      • Szibor R.
      • Vollrath O.
      • Augustin C.
      • Edelmann J.
      • Geppert M.
      • Alves C.
      • Gusmao L.
      • Vennemann M.
      • Hou Y.
      • Immel U.D.
      • Inturri S.
      • Luo H.
      • Lutz-Bonengel S.
      • Robino C.
      • Roewer L.
      • Rolf B.
      • Sanft J.
      • Shin K.J.
      • Sim J.E.
      • Wiegand P.
      • Winkler C.
      • Krawczak M.
      • Hering S.
      Collaborative genetic mapping of 12 forensic short tandem repeat (STR) loci on the human X chromosome.
      ]. Three of the markers in this multiplex (DXS8378, DXS9902 and DXS6807) reside in the same area of the X-chromosome as Amelogenin, but no apparent similarities at these three loci were seen among the 24 samples (results not shown). Thus, no apparent association between the X-STR results and the G>A substitution in the Amelogenin gene could be made. This suggests there is no recent common origin for this mutation in our study population.

      3.3 Number and percentage discordant and concordant results

      A consolidated database (denoted 2085-database) comprising 2085 genotypes of 23 autosomal STR loci and Amelogenin was created. For the samples that showed a null allele, the detected allele was included in the database. For the sample with the shifted allele in ESS for D19S433, we included the allele call presented by the other four kits, as the deleted nucleotide was found in the flanking regions of the repeat (Table 2 and Supplementary Data 1). Next, we determined the number and percentage of discordant genotypes and alleles for each locus (Table 3). The highest percentage of discordant results was found for locus D12S391 (genotypes: 0.048% and alleles: 0.024%), which was typed by four different kits. Fully concordant results for all kits tested were obtained for 9 loci typed with 2 to 6 kits. For three loci (Penta D, Penta E and SE33) no comparisons could be made, as they were only present in one of the tested kits. Overall, 99.991% of the genotypes and 99.995% of the alleles show fully concordant results for our Dutch reference dataset of 2085 samples.
      Table 3Number and percentage of discordant results on genotype and allele level obtained with six autosomal STR kits.
      Present in # kitsn samplesn analysedn discordances% discordances
      GenotypeAlleleGenotypeAlleleGenotypeAllele
      Amelogenin6208512,51025,020110.0080.004
      D16S5396208512,51025,020110.0080.004
      D18S516208512,51025,020220.0160.008
      D21S116208512,51025,020000.0000.000
      D3S13586208512,51025,020000.0000.000
      D8S11796208512,51025,020110.0080.004
      FGA6208512,51025,020000.0000.000
      TH016208512,51025,020220.0160.008
      VWA6208512,51025,020440.0320.016
      D19S4335208510,42520,850110.0100.005
      D2S13385208510,42520,850110.0100.005
      D10S124842085834016,680000.0000.000
      D12S39142085834016,680440.0480.024
      D1S165642085834016,680000.0000.000
      D22S104542085834016,680000.0000.000
      D2S44142085834016,680000.0000.000
      CSF1PO2208541708340000.0000.000
      D13S3172208541708340110.0240.012
      D5S8182208541708340000.0000.000
      D7S8202208541708340110.0240.012
      TPOX2208541708340000.0000.000
      PENTA D1208520854170n.a.n.a.n.a.n.a.
      PENTA E1208520854170n.a.n.a.n.a.n.a.
      SE331208520854170n.a.n.a.n.a.n.a.
      Totaln.a.
      Not applicable.
      50,04020,224540,449019190.0090.005
      a Not applicable.

      3.4 Allele frequencies and descriptive statistics

      The frequencies in the new Dutch allele frequency database are given in Table 4. Compared to the former Dutch database [
      • Kloosterman A.D.
      The development and implementation of forensic DNA typing technologies in the Netherlands.
      ], which was based on 231 samples for the SGM Plus (AB) and 201 for the Profiler (AB) loci, 52 new alleles were detected that had between one and thirteen occurrences in the 2085-dataset. One allele that was detected once in the former database was not revealed in the 2085-dataset. The largest difference between the allele frequencies of the former and the new database is 0.048. In the new allele frequency database 97 of the 361 different alleles have frequencies below 0.001 (≤ four occurrences) of which 48 alleles occur only once.
      Table 4Overview of the number and frequency of alleles per marker.
      AlleleCountFrequency
      D1S1656
      1070.00168
      113120.07482
      125170.12398
      132390.05731
      143650.08753
      14.370.00168
      155470.13118
      15.33120.07482
      164950.11871
      16.110.00024
      16.32290.05492
      172410.05779
      17.130.00072
      17.35920.14197
      18160.00384
      18.32240.05372
      1920.00048
      19.3570.01367
      20.340.00096
      D2S441
      820.00048
      910.00024
      107870.18873
      1114310.34317
      11.31770.04245
      121980.04748
      12.370.00168
      131070.02566
      13.320.00048
      1412250.29376
      14.110.00024
      152070.04964
      16250.00600
      D2S1338
      1210.00024
      1410.00024
      1520.00048
      161590.03813
      178090.19400
      183490.08369
      194860.11655
      206320.15156
      211270.03046
      221310.03141
      233910.09376
      245120.12278
      254690.11247
      26900.02158
      2770.00168
      2840.00096
      D3S1358
      1010.00024
      1180.00192
      13250.00600
      145120.12278
      1510280.24652
      1610080.24173
      179020.21631
      186440.15444
      19390.00935
      2030.00072
      D5S818
      720.00048
      8150.00360
      91270.03046
      102110.05060
      1114680.35204
      1215710.37674
      137290.17482
      14400.00959
      1560.00144
      1610.00024
      D7S820
      7800.01918
      86900.16547
      97500.17986
      1010810.25923
      10.310.00024
      118030.19257
      11.310.00024
      125330.12782
      131900.04556
      14380.00911
      1530.00072
      D8S1179
      8800.01918
      9700.01679
      103710.08897
      113230.07746
      126650.15947
      12.110.00024
      1313330.31966
      148130.19496
      153830.09185
      161100.02638
      17210.00504
      D10S1248
      710.00024
      1030.00072
      11110.00264
      121480.03549
      1313180.31607
      1412810.30719
      158190.19640
      164890.11727
      17850.02038
      18140.00336
      1910.00024
      D12S391
      1410.00024
      151730.04149
      161560.03741
      16.310.00024
      174190.10048
      17.3950.02278
      187320.17554
      18.3920.02206
      194300.10312
      19.3410.00983
      205030.12062
      20.210.00024
      20.320.00048
      215460.13094
      224230.10144
      233130.07506
      241540.03693
      25650.01559
      26150.00360
      2760.00144
      2810.00024
      2910.00024
      D13S317
      760.00144
      84580.10983
      93090.07410
      102750.06595
      1112360.29640
      1212250.29376
      134680.11223
      141900.04556
      1530.00072
      D16S539
      710.00024
      8700.01679
      95400.12950
      102720.06523
      1113410.32158
      1211020.26427
      137480.17938
      14940.02254
      1510.00024
      1610.00024
      D18S51
      940.00096
      10330.00791
      11580.01391
      126550.15707
      135750.13789
      146870.16475
      155960.14293
      165140.12326
      174480.10743
      182990.07170
      191530.03669
      20820.01966
      21380.00911
      22180.00432
      2340.00096
      2460.00144
      D19S433
      1020.00048
      11210.00504
      122790.06691
      12.140.00096
      12.210.00024
      138900.21343
      13.2760.01823
      1415270.36619
      14.2970.02326
      157630.18297
      15.21760.04221
      162220.05324
      16.2700.01679
      17220.00528
      17.2140.00336
      1810.00024
      18.240.00096
      19.210.00024
      D21S11
      24.210.00024
      25.210.00024
      2670.00168
      271790.04293
      287060.16930
      28.210.00024
      298390.20120
      29.230.00072
      29.320.00048
      3010790.25875
      30.21370.03285
      313820.09161
      31.23250.07794
      32670.01607
      32.23060.07338
      33120.00288
      33.21090.02614
      3410.00024
      34.120.00048
      34.280.00192
      3520.00048
      35.210.00024
      D22S1045
      1060.00144
      115860.14053
      12560.01343
      13250.00600
      142120.05084
      1513560.32518
      1615460.37074
      173510.08417
      18280.00671
      1940.00096
      CSF1PO
      610.00024
      750.00120
      8100.00240
      9940.02254
      1010630.25492
      10.310.00024
      1112800.30695
      1213890.33309
      132670.06403
      14520.01247
      1570.00168
      1610.00024
      FGA
      1750.00120
      18750.01799
      192530.06067
      19.120.00048
      19.240.00096
      205590.13405
      20.210.00024
      216960.16691
      21.2130.00312
      227180.17218
      22.110.00024
      22.2310.00743
      236140.14724
      23.2140.00336
      245740.13765
      24.250.00120
      254280.10264
      261560.03741
      27200.00480
      2810.00024
      VWA
      1010.00024
      1220.00048
      1380.00192
      144120.09880
      154070.09760
      168590.20600
      1711430.27410
      188330.19976
      194420.10600
      20600.01439
      2130.00072
      PENTA D
      2.210.00024
      530.00072
      620.00048
      7280.00671
      8850.02038
      98700.20863
      104130.09904
      115680.13621
      129680.23213
      138850.21223
      13.450.00120
      142830.06787
      14.130.00072
      15410.00983
      16130.00312
      1710.00024
      1810.00024
      PENTA E
      52980.07146
      6100.00240
      77120.17074
      8430.01031
      9440.01055
      103670.08801
      113980.09544
      128250.19784
      133910.09376
      142560.06139
      152100.05036
      162180.05228
      172050.04916
      181010.02422
      19440.01055
      20310.00743
      21120.00288
      2230.00072
      2320.00048
      TH01
      5300.00719
      68750.20983
      77540.18082
      84630.11103
      8.310.00024
      95640.13525
      9.314450.34652
      10380.00911
      TPOX
      720.00048
      822580.54149
      93950.09472
      102390.05731
      1110770.25827
      121930.04628
      1360.00144
      SE33
      10.210.00024
      1120.00048
      11.220.00048
      12190.00456
      12.210.00024
      13320.00767
      13.270.00168
      13.320.00048
      141120.02686
      14.260.00144
      14.310.00024
      151470.03525
      15.220.00048
      15.350.00120
      161820.04365
      16.210.00024
      16.340.00096
      172650.06355
      17.210.00024
      17.340.00096
      182800.06715
      18.230.00072
      18.340.00096
      192610.06259
      19.120.00048
      19.2220.00528
      202020.04844
      20.120.00048
      20.2650.01559
      20.310.00024
      211340.03213
      21.2740.01775
      22360.00863
      22.21190.02854
      23110.00264
      23.21130.02710
      2430.00072
      24.21340.03213
      25.21490.03573
      25.340.00096
      26.22650.06355
      2720.00048
      27.23550.08513
      27.310.00024
      28.23470.08321
      29.23010.07218
      30.22360.05659
      3110.00024
      31.21110.02662
      3280.00192
      32.2460.01103
      33400.00959
      33.2190.00456
      34100.00240
      34.230.00072
      3550.00120
      35.220.00048
      3620.00048
      3710.00024
      The summary statistics together with the heterozygosity and PIC values are shown in Supplementary Table 2. SE33 shows the highest heterozygosity and PIC values, which can be explained by the high number of microvariants resulting in 59 different alleles (Table 4) detected for this locus compared to the average of 15.65 different alleles per marker.

      4. Concluding remarks

      Nineteen discordant alleles were detected in 2085 Dutch male donor samples that were analysed for six autosomal STR kits (giving 99.995% concordant alleles overall). The numbers of null or shifted alleles vary for the different commercial kits: seven in Identifiler, seven in NGM2, three in ESI-17, one in PP16, one in ESS and none in ESX-16. Upon Roche-454 and/or Sanger sequencing nucleotide changes, insertions and deletions (single nucleotide or stretches of at least few hundred nucleotides) were found that could account for the discordant genotypes. The use of as much as 2085 samples generated a highly detailed allele frequency database, which can be of assistance in evidentiary value calculations.

      Acknowledgements

      This study was supported by a grant from The Netherlands Genomics Initiative/Netherlands Organization for Scientific Research (NWO) within the framework of the Forensic Genomics Consortium Netherlands. The authors would like to thank Applied Biosystems, Promega and Qiagen for their unconditional in kind support, and Arnoud Kal for critically reading the manuscript.

      Appendix A. Supplementary data

      References

        • Gill P.
        • Fereday L.
        • Morling N.
        • Schneider P.M.
        New multiplexes for Europe-amendments and clarification of strategic development.
        Forensic Sci. Int. 2006; 163: 155-157
        • Gill P.
        • Fereday L.
        • Morling N.
        • Schneider P.M.
        The evolution of DNA databases – recommendations for new European STR loci.
        Forensic Sci. Int. 2006; 156: 242-244
        • Westen A.A.
        • Haned H.
        • Grol L.J.
        • Harteveld J.
        • van der Gaag K.J.
        • de Knijff P.
        • Sijen T.
        Combining results of forensic STR kits: HDplex validation including allelic association and linkage testing with NGM and identifiler loci.
        Int. J. Legal Med. 2012; 126: 781-789
        • Hill C.R.
        • Duewer D.L.
        • Kline M.C.
        • Sprecher C.J.
        • McLaren R.S.
        • Rabbach D.R.
        • Krenke B.E.
        • Ensenberger M.G.
        • Fulmer P.M.
        • Storts D.R.
        • Butler J.M.
        Concordance and population studies along with stutter and peak height ratio analysis for the PowerPlex (R) ESX 17 and ESI 17 Systems.
        Forensic Sci. Int. Genet. 2011; 5: 269-275
        • Kline M.C.
        • Hill C.R.
        • Decker A.E.
        • Butler J.M.
        STR sequence analysis for characterizing normal, variant, and null alleles.
        Forensic Sci. Int. Genet. 2011; 5: 329-332
        • Welch L.A.
        • Gill P.
        • Phillips C.
        • Ansell R.
        • Morling N.
        • Parson W.
        • Palo J.U.
        • Bastisch I.
        European Network of Forensic Science Institutes (ENFSI): evaluation of new commercial STR multiplexes that include the European Standard Set (ESS) of markers.
        Forensic Sci. Int. Genet. 2012; 6: 819-826
        • Lao O.
        • Altena E.
        • Becker C.
        • Brauer S.
        • Kraaijenbrink T.
        • van O.M.
        • Nurnberg P.
        • de K.P.
        • Kayser M.
        Clinal distribution of human genomic diversity across the Netherlands despite archaeological evidence for genetic discontinuities in Dutch population history.
        Invest. Genet. 2013; 4: 9
        • Anvar S.Y.
        • van der Gaag K.J.
        • van der Heijden J.W.F.
        • Veltrop M.H.A.M.
        • Vossen R.H.A.M.
        • de Leeuw R.H.
        • Breukel C.
        • Buermans H.P.J.
        • Verbeek J.S.
        • de Knijff P.
        • den Dunnen J.T.
        • Laros J.F.J.
        TSSV: A tool for characterization of complex allelic variants in pure and mixed genomes.
        Bioinformatics. 2014; https://doi.org/10.1093/bioinformatics/btu068
        • Thompson J.D.
        • Higgins D.G.
        • Gibson T.J.
        CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.
        Nucleic Acids Res. 1994; 22: 4673-4680
        • Hall T.A.
        BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT.
        Nucleic Acids Symp. Ser. 1999; 41: 95-98
        • Park S.D.E.
        Trypanotolerance in West African Cattle and the Population Genetic Effects of Selection.
        University of Dublin, Ireland2001
        • Oldroyd N.
        • Green R.
        • Mulero J.
        • Hennessy L.
        • Tabak J.
        Development of the AmpFℓSTR® NGM SElect™ Kit: new sequence discoveries and implications for genotype concordance.
        Forensic News. 2011;
        • Green R.L.
        • Lagace R.E.
        • Oldroyd N.J.
        • Hennessy L.K.
        • Mulero J.J.
        Developmental validation of the AmpFlSTR(R) NGM SElect PCR Amplification Kit: a next-generation STR multiplex with the SE33 locus.
        Forensic Sci. Int. Genet. 2013; 7: 41-51
        • Budowle B.
        • Masibay A.
        • Anderson S.J.
        • Barna C.
        • Biega L.
        • Brenneke S.
        • Brown B.L.
        • Cramer J.
        • DeGroot G.A.
        • Douglas D.
        • Duceman B.
        • Eastman A.
        • Giles R.
        • Hamill J.
        • Haase D.J.
        • Janssen D.W.
        • Kupferschmid T.D.
        • Lawton T.
        • Lemire C.
        • Llewellyn B.
        • Moretti T.
        • Neves J.
        • Palaski C.
        • Schueler S.
        • Sgueglia J.
        • Sprecher C.
        • Tomsey C.
        • Yet D.
        STR primer concordance study.
        Forensic Sci. Int. 2001; 124: 47-54
        • Delamoye M.
        • Duverneuil C.
        • Riva K.
        • Leterreux M.
        • Taieb S.
        • De M.P.
        False homozygosities at various loci revealed by discrepancies between commercial kits: implications for genetic databases.
        Forensic Sci. Int. 2004; 143: 47-52
        • Clayton T.M.
        • Hill S.M.
        • Denton L.A.
        • Watson S.K.
        • Urquhart A.J.
        Primer binding site mutations affecting the typing of STR loci contained within the AMPFlSTR SGM Plus kit.
        Forensic Sci. Int. 2004; 139: 255-259
        • Vanderheyden N.
        • Mai A.
        • Gilissen A.
        • Cassiman J.J.
        • Decorte R.
        Identification and sequence analysis of discordant phenotypes between AmpFlSTR SGM Plus and PowerPlex 16.
        Int. J. Legal Med. 2007; 121: 297-301
        • Leibelt C.
        • Budowle B.
        • Collins P.
        • Daoudi Y.
        • Moretti T.
        • Nunn G.
        • Reeder D.
        • Roby R.
        Identification of a D8S1179 primer binding site mutation and the validation of a primer designed to recover null alleles.
        Forensic Sci. Int. 2003; 133: 220-227
        • Nelson M.S.
        • Levedakou E.N.
        • Matthews J.R.
        • Early B.E.
        • Freeman D.A.
        • Kuhn C.A.
        • Sprecher C.J.
        • Amin A.S.
        • McElfresh K.C.
        • Schumm J.W.
        Detection of a primer-binding site polymorphism for the STR locus D16S539 using the Powerplex 1.1 system and validation of a degenerate primer to correct for the polymorphism.
        J. Forensic Sci. 2002; 47: 345-349
        • Li F.
        • Xuan J.
        • Xing J.
        • Ding M.
        • Wang B.
        • Pang H.
        Identification of new primer binding site mutations at TH01 and D13S317 loci and determination of their corresponding STR alleles by allele-specific PCR.
        Forensic Sci. Int. Genet. 2014; 8: 143-146
        • Alves C.
        • Amorim A.
        • Gusmao L.
        • Pereira L.
        VWA STR genotyping: further inconsistencies between Perkin-Elmer and Promega kits.
        Int. J. Legal Med. 2001; 115: 97-99
        • Shadrach B.
        • Commane M.
        • Hren C.
        • Warshawsky I.
        A rare mutation in the primer binding region of the amelogenin gene can interfere with gender identification.
        J. Mol. Diagn. 2004; 6: 401-405
        • Alves C.
        • Coelho M.
        • Rocha J.
        • Amorim A.
        The amelogenin locus displays a high frequency of X homologue failures in Sao Tome Island (West Africa).
        Int. Congress Series. 2006; 1288: 271-273
        • Caratti S.
        • Voglino G.
        • Cirigliano V.
        • Ghidini A.
        • Taulli R.
        • Torre C.
        • Robino C.
        Amplification failure of the amelogenin gene (AMELX) caused by a primer binding site mutation.
        Prenat. Diagn. 2009; 29: 1180-1182
        • Maciejewska A.
        • Pawlowski R.
        A rare mutation in the primer binding region of the Amelogenin X homologue gene.
        Forensic Sci. Int. Genet. 2009; 3: 265-267
        • Ou X.
        • Chen W.
        • Chen H.
        • Zhao F.
        • Zheng J.
        • Tong D.
        • Chen Y.
        • Chen A.
        • Sun H.
        Null alleles of the X and Y chromosomal amelogenin gene in a Chinese population.
        Int. J. Legal Med. 2012; 126: 513-518
        • Hering S.
        • Edelmann J.
        • Augustin C.
        • Kuhlisch E.
        • Szibor R.
        X chromosomal recombination—a family study analysing 39 STR markers in German three-generation pedigrees.
        Int. J. Legal Med. 2010; 124: 483-491
        • Nothnagel M.
        • Szibor R.
        • Vollrath O.
        • Augustin C.
        • Edelmann J.
        • Geppert M.
        • Alves C.
        • Gusmao L.
        • Vennemann M.
        • Hou Y.
        • Immel U.D.
        • Inturri S.
        • Luo H.
        • Lutz-Bonengel S.
        • Robino C.
        • Roewer L.
        • Rolf B.
        • Sanft J.
        • Shin K.J.
        • Sim J.E.
        • Wiegand P.
        • Winkler C.
        • Krawczak M.
        • Hering S.
        Collaborative genetic mapping of 12 forensic short tandem repeat (STR) loci on the human X chromosome.
        Forensic Sci. Int. Genet. 2012; 6: 778-784
        • Kloosterman A.D.
        The development and implementation of forensic DNA typing technologies in the Netherlands.
        2002