Advertisement
Research paper| Volume 56, 102596, January 2022

Development and inter-laboratory validation of the VISAGE enhanced tool for age estimation from semen using quantitative DNA methylation analysis

Open AccessPublished:September 11, 2021DOI:https://doi.org/10.1016/j.fsigen.2021.102596

      Highlights

      • Development of new targeted bisulfite MPS assays for age estimation from semen.
      • Bisulfite multiplex PCR design targeting 13 or 5 age markers.
      • Inter-laboratory validation showed stable and accurate DNA methylation quantification.
      • Robust results with 50 ng DNA input at bisulfite conversion level.

      Abstract

      The analysis of DNA methylation has become an established method for chronological age estimation. This has triggered interest in the forensic community to develop new methods for age estimation from biological crime scene material. Various assays are available for age estimation from somatic tissues, the majority from blood. Age prediction from semen requires different DNA methylation markers and the only assays currently developed for forensic analysis are based on SNaPshot or pyrosequencing. Here, we describe a new assay using massively parallel sequencing to analyse 13 candidate CpG sites targeted in two multiplex PCRs. The assay has been validated by five consortium laboratories of the VISible Attributes through GEnomics (VISAGE) project within a collaborative exercise and was tested for reproducible quantification of DNA methylation levels and sensitivity with DNA methylation controls. Furthermore, DNA extracts and stains on Whatman FTA cards from two semen samples were used to evaluate concordance and mimic casework samples. Overall, the assay yielded high read depths (> 1000 reads) at all 13 marker positions. The methylation values obtained indicated robust quantification with an average standard deviation of 2.8% at the expected methylation level of 50% across the 13 markers and a good performance with 50 ng DNA input into bisulfite conversion. The absolute difference of quantifications from one participating laboratory to the mean quantifications of concordance and semen stains of remaining laboratories was approximately 1%. These results demonstrated the assay to be robust and suitable for age estimation from semen in forensic investigations. In addition to the 13-marker assay, a more streamlined protocol combining only five age markers in one multiplex PCR was developed. Preliminary results showed no substantial differences in DNA methylation quantification between the two assays, indicating its applicability with the VISAGE age model for semen developed with data from the complete 13-marker tool.

      Keywords

      1. Introduction

      The discovery of the correlation between DNA hypo- and hypermethylation at cytosine-guanine dinucleotides (CpG sites) with the human lifespan led to the wide usage of DNA methylation as a biomarker of aging [
      • Bell J.T.
      • Tsai P.-C.
      • Yang T.-P.
      • Pidsley R.
      • Nisbet J.
      • Glass D.
      • Mangino M.
      • Zhai G.
      • Zhang F.
      • Valdes A.
      • Shin S.-Y.
      • Dempster E.L.
      • Murray R.M.
      • Grundberg E.
      • Hedman A.K.
      • Nica A.
      • Small K.S.
      • Dermitzakis E.T.
      • McCarthy M.I.
      • Mill J.
      • Spector T.D.
      • Deloukas P.
      Epigenome-wide scans identify differentially methylated regions for age and age-related phenotypes in a healthy ageing population.
      ,
      • Horvath S.
      DNA methylation age of human tissues and cell types.
      ,
      • Hannum G.
      • Guinney J.
      • Zhao L.
      • Zhang L.
      • Hughes G.
      • Sadda S.
      • Klotzle B.
      • Bibikova M.
      • Fan J.-B.
      • Gao Y.
      • Deconde R.
      • Chen M.
      • Rajapakse I.
      • Friend S.
      • Ideker T.
      • Zhang K.
      Genome-wide methylation profiles reveal quantitative views of human aging rates.
      ]. The quantitative analysis of DNA methylation at such CpG sites provides accurate chronological age estimation from biological material, which makes it a valuable tool for forensic genetic investigations [
      • Vidaki A.
      • Kayser M.
      Recent progress, methods and perspectives in forensic epigenetics.
      ].
      A prerequisite for forensic assays is a maximum possible sensitivity due to the limited DNA amount that can be recovered from crime scene samples. This impedes the usage of assays targeting a high number of markers with microarray-based DNA methylation analysis, which require approximately 250 ng of DNA []. Therefore, forensic age estimation assays have focused on fewer markers capable of age prediction from a specific tissue, and showing reliable results with reported DNA inputs of 50 ng to 10 ng for bisulfite conversion (5 ng at PCR level [
      • Lee J.W.
      • Choung C.M.
      • Jung J.Y.
      • Lee H.Y.
      • Lim S.-K.
      A validation study of DNA methylation-based age prediction using semen in forensic casework samples.
      ] and 2 ng at PCR level [
      • Aliferi A.
      • Ballard D.
      • Gallidabino M.D.
      • Thurtle H.
      • Barron L.
      D. Syndercombe Court, DNA methylation-based age prediction using massively parallel sequencing data and multiple machine learning models.
      ], respectively). As DNA methylation patterns are specific to the cell type, age estimation models are only applicable to the respective tissue [
      • Naue J.
      • Sänger T.
      • Hoefsloot H.C.J.
      • Lutz-Bonengel S.
      • Kloosterman A.D.
      • Verschure P.J.
      Proof of concept study of age-dependent DNA methylation markers across different tissues by massive parallel sequencing.
      ]. The majority of published age estimation models were trained for blood samples [
      • Weidner C.
      • Lin Q.
      • Koch C.
      • Eisele L.
      • Beier F.
      • Ziegler P.
      • Bauerschlag D.
      • Jöckel K.-H.
      • Erbel R.
      • Mühleisen T.
      • Zenke M.
      • Brümmendorf T.
      • Wagner W.
      Aging of blood can be tracked by DNA methylation changes at just three CpG sites.
      ,
      • Zbieć-Piekarska R.
      • Spólnicka M.
      • Kupiec T.
      • Parys-Proszek A.
      • Makowska Ż.
      • Pałeczka A.
      • Kucharczyk K.
      • Płoski R.
      • Branicki W.
      Development of a forensically useful age prediction method based on DNA methylation analysis.
      ,
      • Xu C.
      • Qu H.
      • Wang G.
      • Xie B.
      • Shi Y.
      • Yang Y.
      • Zhao Z.
      • Hu L.
      • Fang X.
      • Yan J.
      • Feng L.
      A novel strategy for forensic age prediction by DNA methylation and support vector regression model.
      ,
      • Vidaki A.
      • Ballard D.
      • Aliferi A.
      • Miller T.H.
      • Barron L.P.
      • Syndercombe Court D.
      DNA methylation-based forensic age prediction using artificial neural networks and next generation sequencing.
      ,
      • Naue J.
      • Hoefsloot H.C.J.
      • Kloosterman A.D.
      • Verschure P.J.
      Forensic DNA methylation profiling from minimal traces: How low can we go?.
      ]. Nevertheless, some of the known blood age markers also show age-associated DNA methylation in other forensically relevant tissues like saliva and bones [
      • Naue J.
      • Sänger T.
      • Hoefsloot H.C.J.
      • Lutz-Bonengel S.
      • Kloosterman A.D.
      • Verschure P.J.
      Proof of concept study of age-dependent DNA methylation markers across different tissues by massive parallel sequencing.
      ,
      • Jung S.-E.
      • Lim S.M.
      • Hong S.R.
      • Lee E.H.
      • Shin K.-J.
      • Lee H.Y.
      DNA methylation of the ELOVL2, FHL2, KLF14, C1orf132/MIR29B2C, and TRIM59 genes for age prediction from blood, saliva, and buccal swab samples.
      ,
      • Slieker R.C.
      • Relton C.L.
      • Gaunt T.R.
      • Slagboom P.E.
      • Heijmans B.T.
      Age-related DNA methylation changes are tissue-specific with ELOVL2 promoter methylation as exception.
      ]. However, DNA methylation patterns in semen are distinct from those in somatic cells [
      • Horvath S.
      DNA methylation age of human tissues and cell types.
      ,
      • Aliferi A.
      • Ballard D.
      • Gallidabino M.D.
      • Thurtle H.
      • Barron L.
      D. Syndercombe Court, DNA methylation-based age prediction using massively parallel sequencing data and multiple machine learning models.
      ] and although semen stains are commonly analyzed in forensic laboratories, only a handful of studies established age estimation models for this cell type. Lee et al. [
      • Lee H.Y.
      • Jung S.-E.
      • Oh Y.N.
      • Choi A.
      • Yang W.I.
      • Shin K.-J.
      Epigenetic age signatures in the forensically relevant body fluid of semen: a preliminary study.
      ] demonstrated for the first time that age estimation from semen is possible by developing an age prediction model based on three CpG sites (cg06304190 in TTC7B, cg06979108 in NOX4/FOLH1B and cg12837463 in LOC401324) targeted by SNaPshot. The mean absolute error (MAE) from the chronological age of this assay was approximately 5 years [
      • Lee J.W.
      • Choung C.M.
      • Jung J.Y.
      • Lee H.Y.
      • Lim S.-K.
      A validation study of DNA methylation-based age prediction using semen in forensic casework samples.
      ,
      • Lee H.Y.
      • Jung S.-E.
      • Oh Y.N.
      • Choi A.
      • Yang W.I.
      • Shin K.-J.
      Epigenetic age signatures in the forensically relevant body fluid of semen: a preliminary study.
      ]. Two of the CpG sites (cg06304190 in TTC7B and cg06979108 in NOX4/FOLH1B) were used by Li et al. [
      • Li L.
      • Song F.
      • Lang M.
      • Hou J.
      • Wang Z.
      • Prinz M.
      • Hou Y.
      Methylation‐based age prediction using pyrosequencing platform from seminal stains in han chinese males.
      ] to develop a pyrosequencing assay, which exhibited a MAE of approximately 4.2 years. Jenkins et al. [
      • Jenkins T.G.
      • Aston K.I.
      • Cairns B.
      • Smith A.
      • Carrell D.T.
      Paternal germ line aging: DNA methylation age prediction from human sperm.
      ] recently reported a smaller MAE of 2.4 years, however, their prediction was based on 51 microarray-typed genomic regions, a number which is currently difficult to incorporate into a viable forensic assay.
      In this study, we aimed to further advance chronological age prediction from semen for routine forensic applications by developing a new assay in the course of the VISible Attributes through GEnomics (VISAGE) project. The VISAGE consortium has aimed to develop and validate tools based on targeted massively parallel sequencing (MPS) for predicting appearance, ancestry and age to provide information on an unknown sample donor for police investigations.
      The VISAGE enhanced tool for age estimation from semen (ET-13; Enhanced Tool-13) combines ten novel markers (regions) identified by Pisarek et al. [
      • Pisarek A.
      • Pośpiech E.
      • Heidegger A.
      • Xavier C.
      • Papież A.
      • Piniewska-Róg D.
      • Kalamara V.
      • Potabattula R.
      • Bochenek M.
      • Sikora-Polaczek M.
      • Macur A.
      • Woźniak A.
      • Janeczko J.
      • Phillips C.
      • Haaf T.
      • Polańska J.
      • Parson W.
      • Kayser M.
      • Branicki W.
      Epigenetic age prediction in semen – marker selection and model development.
      ] with the three markers described by Lee et al. [
      • Lee H.Y.
      • Jung S.-E.
      • Oh Y.N.
      • Choi A.
      • Yang W.I.
      • Shin K.-J.
      Epigenetic age signatures in the forensically relevant body fluid of semen: a preliminary study.
      ]. The ET-13 was validated by five VISAGE consortium laboratories in a collaborative exercise to assess the assay’s reproducibility and sensitivity. Subsequent to the completed validation phase, the assay was used for data generation to establish the VISAGE age model for semen, which is presented in a separate publication that also provides details regarding marker selection [
      • Pisarek A.
      • Pośpiech E.
      • Heidegger A.
      • Xavier C.
      • Papież A.
      • Piniewska-Róg D.
      • Kalamara V.
      • Potabattula R.
      • Bochenek M.
      • Sikora-Polaczek M.
      • Macur A.
      • Woźniak A.
      • Janeczko J.
      • Phillips C.
      • Haaf T.
      • Polańska J.
      • Parson W.
      • Kayser M.
      • Branicki W.
      Epigenetic age prediction in semen – marker selection and model development.
      ]. As MPS allows for the analysis of CpG sites adjacent to the initially selected 13 candidate CpG sites (Table 1), 23 additional CpG sites located within the 13 marker regions were considered by Pisarek et al. [
      • Pisarek A.
      • Pośpiech E.
      • Heidegger A.
      • Xavier C.
      • Papież A.
      • Piniewska-Róg D.
      • Kalamara V.
      • Potabattula R.
      • Bochenek M.
      • Sikora-Polaczek M.
      • Macur A.
      • Woźniak A.
      • Janeczko J.
      • Phillips C.
      • Haaf T.
      • Polańska J.
      • Parson W.
      • Kayser M.
      • Branicki W.
      Epigenetic age prediction in semen – marker selection and model development.
      ]. The final VISAGE model for semen consists of six CpG sites from five age markers (model CpG sites, Table 2). Aiming to provide a more streamlined option for routine forensic laboratories, we optimized an additional five-marker assay (ET-5) based on only one multiplex PCR. A first evaluation of the ET-5 was performed to compare DNA methylation quantification between the two assays.
      Table 1Age markers, genomic locations of candidate CpG sites used for initial assay development, primer sequences and final assay concentrations.
      Gene symbolCpG IDGRCh38

      position
      Amp.

      size

      [bp]
      DNA

      strand
      Primer sequences (5′–3′)ET-13

      multiplex
      Conc.

      ET-13

      [µM]
      Conc.

      ET-5

      [µM]
      SYT7cg17147820chr11:6155478375+F: GGGGATTTAGGAATAAAGTAGGG

      R: ATACATAACCCCATCCCCCTCTACCTA
      7-plex0.48
      TUBB3cg18701351chr16:89921897109+F: GGGAGTTGTTTTTTGGTAGGG

      R: CAAAACAACCAACTCCTACT
      0.24
      SH2B2cg00018181chr7:102288444125+F: TTTTGGGGGTTTTAGAGATAGT

      R: CTAAAAACATTCAACCAAACAACATC
      1.381
      ARHGEF17cg09855959chr11:73311506128+F: AGTTAGGATTAGATTGGTAGTTTGT

      R: ACAAAAAAACCAACAAAAATTAATAACTCA
      0.8
      EXOC3cg10528482chr5:525656173+F: AGGGGTTTGGTTTTAATGTTATT

      R: CCCAAAATAAAAACAAACAACTCAAAATC
      1.181
      GALR2cg07909178chr17:76077795179+F: GTTATTAGGAAAGAGGGTTGATTATATT

      R: CCTATCACACACCAAAACACAA
      0.320.4
      PPP2R2Ccg02766173chr4:6473455242+F: TTTTTTGGTAATTAGTTTGGTATATAGTGG

      R: TACACTTACCCCTCCCAAACA
      0.6
      TBX4cg19862839chr17:6146636578F: GAGGGGTTTGAAGTTAAGGAGAATATG

      R: ACTTTAAAAACAAACCATTACCTACTATAT
      6-plex1
      PALMcg17704154chr19:71860898F: AGGTATTATTTAGGGGGAGAGGAG

      R: ATCCCTTCCCACCCCAAATTA
      0.8
      IFITM2cg01886988chr11:312560137+F: GGATTTAGATATAGTTTGGTTTAAGTGG

      R: CCTTACCCTTACTTAAAATCCCTACT
      0.20.2
      NOX4/FOLH1B/ LOC729960cg06979108/no IDchr11:89589683/ chr11:49251356194+F: TAGTTATTTGAGTGAAGTGTGTTGG

      R: ACCTCCCAAAATACTAAATTACTC
      0.40.2
      TTC7Bcg06304190chr14:90817262167+F: GGGTTTTTTGTTTTGGTTATTTAGATTG

      R: AAACTCTCTCAAACCAAAAATTTTATT
      0.6
      LOC401324cg12837463chr7:35260617138+F: AGGGTTGGATTTTTTTTTTTTAATTTATGT

      R: AAACCTCTAATACAATACCTAACACAT
      0.4
      Markers included in the ET-5 are marked in bold
      For CpG dinucleotides in primer sequences a base mismatch (underlined) was introduced
      Table 2Genomic locations of model CpG sites selected by Pisarek et al.
      • Pisarek A.
      • Pośpiech E.
      • Heidegger A.
      • Xavier C.
      • Papież A.
      • Piniewska-Róg D.
      • Kalamara V.
      • Potabattula R.
      • Bochenek M.
      • Sikora-Polaczek M.
      • Macur A.
      • Woźniak A.
      • Janeczko J.
      • Phillips C.
      • Haaf T.
      • Polańska J.
      • Parson W.
      • Kayser M.
      • Branicki W.
      Epigenetic age prediction in semen – marker selection and model development.
      .
      Gene symbolCpG IDGRCh38 position
      SH2B2no IDchr7:102288454
      EXOC3no IDchr5:525617
      GALR2no IDchr17:76077680
      GALR2no IDchr17:76077748
      IFITM2no IDchr11:312518
      NOX4/FOLH1B/LOC729960cg06979108/no IDchr11:89589683/ chr11:49251356

      2. Material and methods

      2.1 Age markers

      Selection of age markers for the VISAGE enhanced tools for age estimation from semen was described previously [
      • Pisarek A.
      • Pośpiech E.
      • Heidegger A.
      • Xavier C.
      • Papież A.
      • Piniewska-Róg D.
      • Kalamara V.
      • Potabattula R.
      • Bochenek M.
      • Sikora-Polaczek M.
      • Macur A.
      • Woźniak A.
      • Janeczko J.
      • Phillips C.
      • Haaf T.
      • Polańska J.
      • Parson W.
      • Kayser M.
      • Branicki W.
      Epigenetic age prediction in semen – marker selection and model development.
      ] and includes ten newly identified candidate CpG sites and three CpG sites from Lee et al. [
      • Lee H.Y.
      • Jung S.-E.
      • Oh Y.N.
      • Choi A.
      • Yang W.I.
      • Shin K.-J.
      Epigenetic age signatures in the forensically relevant body fluid of semen: a preliminary study.
      ]. The combined marker set, termed ET-13, was designed to target the 13 candidate CpG sites listed in Table 1. Development of an age prediction model based on data generated with the ET-13 allowed for the simultaneous analysis of CpG sites adjacent to the selected candidate CpG sites (in total 36 CpG sites). Marker selection, which was conducted using multivariable stepwise linear regression, showed that only six CpG sites located in five of the 13 candidate markers were optimal for accurate age prediction (Table 2) [
      • Pisarek A.
      • Pośpiech E.
      • Heidegger A.
      • Xavier C.
      • Papież A.
      • Piniewska-Róg D.
      • Kalamara V.
      • Potabattula R.
      • Bochenek M.
      • Sikora-Polaczek M.
      • Macur A.
      • Woźniak A.
      • Janeczko J.
      • Phillips C.
      • Haaf T.
      • Polańska J.
      • Parson W.
      • Kayser M.
      • Branicki W.
      Epigenetic age prediction in semen – marker selection and model development.
      ]. These CpG sites were included in the final prediction model and used to develop a second assay (ET-5).

      2.2 Concordance samples, stains and DNA methylation standards

      Frozen semen samples of one 33-year-old (Sample 1; Cat No: 991–04-AS2) and one 42-year-old (Sample 2; Cat No: 991–04-AS2) donor were purchased from Lee Biosolutions (Maryland Heights, MO, USA). DNA was extracted with the QIAamp DNA Mini Kit (Qiagen, Hilden, Germany) following the user-developed “Isolation of genomic DNA from sperm” protocol 2 (QA04 Jul-10) available on the Qiagen website [

      Qiagen, Isolation of genomic DNA from sperm using the QIAamp® DNA Mini Kit; protocol 2 - (EN), (n.d.). https://www.qiagen.com/us/resources/resourcedetail?id=5f0e4e08-d405–42d9-acf6–73eaaa52190c&lang=en, (accessed 2 February 2021).

      ]. In brief, 100 µl semen were incubated with 100 µl Buffer X2 (20 mM Tris-Cl (pH 8.0), 20 mM EDTA, 200 mM NaCl, 80 mM DTT, 4% SDS and 250 µg/ml Proteinase K) for 1 h at 55 °C. After adding 200 µl Buffer AL and 200 µl ethanol, the protocol followed the “Tissue Protocol” of the QIAamp DNA Mini Kit. Final DNA extracts were quantified on a Qubit 2.0 Fluorometer using the Qubit dsDNA HS Assay Kit (Thermo Fisher Scientific, Waltham, MA, USA, hereafter referred to as TFS) and were used as concordance samples. Stains were prepared by applying 50 µl semen on Whatman FTA cards (Merck, Darmstadt, Germany). DNA was extracted from stains by the five participating consortium laboratories according to their in-house DNA extraction and quantification protocols (Table S1). DNA methylation standards were prepared using the WGA methylated & non-methylated DNA Set (Zymo Research, Irvine, CA, USA) diluted to 20 ng/µl in low TE (10 mM Tris, 0.1 mM EDTA, pH 8). The unmethylated DNA control sample was mixed with the fully methylated DNA control sample at different volume ratios to achieve 0%, 5%, 10%, 25%, 50%, 75% and 100% methylated DNA standards.

      2.3 Assay development and protocols

      Assay development was based on the previously published VISAGE basic tool for age estimation from blood [
      • Heidegger A.
      • Xavier C.
      • Niederstätter H.
      • de la Puente M.
      • Pośpiech E.
      • Pisarek A.
      • Kayser M.
      • Branicki W.
      • Parson W.
      Development and optimization of the VISAGE basic prototype tool for forensic age estimation.
      ] and the VISAGE enhanced tool for age estimation from blood, buccal cells and bones [
      • Woźniak A.
      • Heidegger A.
      • Piniewska-Róg D.
      • Pośpiech E.
      • Xavier C.
      • Pisarek A.
      • Kartasińska E.
      • Boroń M.
      • Freire-Aradas A.
      • Wojtas M.
      • de la Puente M.
      • Niederstätter H.
      • Płoski R.
      • Spólnicka M.
      • Kayser M.
      • Phillips C.
      • Parson W.
      • Branicki W.
      Development of the VISAGE enhanced tool and statistical models for epigenetic age estimation in blood, buccal cells and bones.
      ].

      2.3.1 Bisulfite conversion

      Bisulfite conversion was performed with the Premium Bisulfite kit (Diagenode, Liège, Belgium), using 200 ng DNA if not otherwise specified. One extra dry spin was added before elution in 10 µl elution buffer to avoid ethanol carryover into the subsequent PCR step. Eluates were used for amplification on the same day.

      2.3.2 ET-13: two multiplex PCR approach

      Primers were designed using the PyroMark Assay Design Software 2.0 (Qiagen), except for primer sequences for NOX4/FOLH1B, which were described in [
      • Lee H.Y.
      • Jung S.-E.
      • Oh Y.N.
      • Choi A.
      • Yang W.I.
      • Shin K.-J.
      Epigenetic age signatures in the forensically relevant body fluid of semen: a preliminary study.
      ]. The SYT7 reverse and the PALM forward primers each contained a single CpG dinucleotide in their binding sites. Therefore, we introduced an intentional base mismatch into these two primer sequences (Table 1) at the positions of the CpG cytosines. All primer sequences (Table 1) were tested in silico for specificity using BiSearch [
      • Tusnády G.E.
      BiSearch: primer-design and search tool for PCR on bisulfite-treated genomes.
      ]. In vitro, all primers were tested in singleplex reactions using the Multiplex PCR Kit (Qiagen) and correct amplification was verified using Sanger sequencing. The BigDye Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems) was used according to the manufacturer's instructions. Sequencing electrophoresis was performed on an Applied Biosystems 3500 Series Genetic Analyzer (TFS). Temperature gradient PCR was utilized for optimizing annealing conditions. Amplicon size and yield of multiplex reactions were evaluated on a Bioanalyzer instrument using the DNA 1000 Kit (both Agilent Technologies, Santa Clara, CA, USA) and the results were used to adjust primer concentrations. Different primer combinations were tested to obtain optimal multiplex PCRs. The final ET-13 is based on two multiplex PCRs carried out with 4 µl eluate from bisulfite conversion in 25 µl total volume using the Multiplex PCR kit (Qiagen). Cycling conditions were as follows: initial denaturation at 95 °C for 15 min, 40 cycles consisting of 94 °C for 30 s, 59 °C for 30 s, 72 °C for 1 min and a final extension step at 72 °C for 10 min. Allocation of primers to the two multiplex PCR assays of the ET-13, as well as all final primer concentrations are listed in Table 1. Clean up of PCR products was performed using 1.5X ratio Agencourt AMPure XP (Beckman Coulter Brea, CA, USA) according to the manufacturer’s protocol and eluted in 15 µl low TE buffer.

      2.3.3 ET-5: one multiplex PCR assay

      For development of the 5-plex PCR assay targeting the five age markers included in the final VISAGE age model (based on 6 CpG sites; Table 2), primer concentrations and thermocycler conditions were re-optimized with the same primer sequences used in the ET-13 set (Table 1). The final 5-plex PCR assay was performed using 8 µl eluate from bisulfite conversion in 25 µl total PCR volume and the following thermocycler protocol: Initial denaturation step at 95 °C for 15 min, 35 cycles consisting of 94 °C for 30 s, 59 °C for 30 s, 72 °C for 1 min and a final extension step at 72 °C for 10 min

      2.3.4 Library preparation

      All PCR products were quantified using the Qubit dsDNA HS Assay Kit (TFS). Library preparation was performed using the KAPA Hyper Prep (96rxn) kit with the KAPA Library Amplification Primer Mix (Roche, Basel, Switzerland) in half volumes following the manufacturer’s instructions. For each sample, 25 ng PCR products from the two multiplex reactions (ET-13) per sample were mixed and adjusted to 25 µl total volume for end repair and A tailing. Adapter ligation was performed with KAPA SI Adapter Kit Set A+B (Roche) diluted to 15 µM and library amplifications were carried out with five cycles. Agencourt AMPure XP (Beckman Coulter) were used for post-ligation and post-amplification clean-up steps, elution was performed in 12 µl and 20 µl low TE, respectively. Library quantification was carried out with the KAPA Library Quantification Kit - Complete Universal for Illumina Platforms (Roche) following the manufacturer’s instructions. Library preparation with the ET-5 was performed with 50 ng purified PCR products.

      2.3.5 Massively parallel sequencing and data analysis

      Libraries were diluted according to MiSeq System Denature and Dilute Libraries Guide, Protocol A (Document #15039740 v10; Illumina, San Diego, CA, USA). In brief, libraries were pooled equimolarly at 4 nM and further diluted to 7 pM with 2 µl of 20 pM PhiX Control v3 (Illumina) spiked into 598 µl total volume. The maximum number of samples pooled in one sequencing run was 24. Sequencing was performed with paired end sequencing and 201 cycles in both directions using the MiSeq FGx ForenSeq Reagent Kit (Verogen, San Diego, CA, USA). Generated fastq files were aligned with the Burrows-Wheeler alignment for bisulfite converted DNA sequences using the bwa-meth software [
      • Pedersen B.S.
      • Eyring K.
      • De S.
      • Yang I.V.
      • Schwartz D.A.
      Fast and accurate alignment of long bisulfite-seq reads.
      ] and a custom reference genome containing only targeted sequences (+/- 300 bp to the candidate CpG sites, Table S2). Samtools v1.4 [
      • Li H.
      • Handsaker B.
      • Wysoker A.
      • Fennell T.
      • Ruan J.
      • Homer N.
      • Marth G.
      • Abecasis G.
      • Durbin R.
      The sequence alignment/map format and SAMtools.
      ] was used for bam file creation, sorting, filtering and indexing. All sample alignments were analysed using the Integrative Genomics Viewer (IGV) v2.4.10 [
      • Robinson J.T.
      • Thorvaldsdóttir H.
      • Winckler W.
      • Guttman M.
      • Lander E.S.
      • Getz G.
      • Mesirov J.P.
      Integrative genomics viewer.
      ]. Read counts at candidate CpG sites, model CpG sites (Table 2) and all non-CpG-Cs were extracted using bam-readcount (https://github.com/genome/bam-readcount) with a minimum mapping quality and minimum base quality set to 30. Methylation values were calculated as percentage beta-values (C reads were divided by the sum of C reads and T reads). Bisulfite conversion efficiency was estimated as the mean T read percentage at all non-CpG positions per sample. Sample coverage was calculated as the sum of reads at candidate CpG sites and normalized read depth represents the number of reads at a candidate CpG position divided by sample coverage. Base misincorporation rates at CpG sites were calculated by dividing read depth by the sum of unexpected bases at the respective positions, which would refer to the sum of A and G reads for CpG sites targeted on the sense strand and the sum of C and T reads for CpG sites targeted on the antisense strand. For methylation analysis, only CpG sites with a minimum number of 1000 reads were considered. Age prediction analysis was performed using the VISAGE model for semen, a multivariate linear regression model using percentage beta values obtained from model CpG sites (Table 2; [
      • Pisarek A.
      • Pośpiech E.
      • Heidegger A.
      • Xavier C.
      • Papież A.
      • Piniewska-Róg D.
      • Kalamara V.
      • Potabattula R.
      • Bochenek M.
      • Sikora-Polaczek M.
      • Macur A.
      • Woźniak A.
      • Janeczko J.
      • Phillips C.
      • Haaf T.
      • Polańska J.
      • Parson W.
      • Kayser M.
      • Branicki W.
      Epigenetic age prediction in semen – marker selection and model development.
      ]). Statistical analysis was performed with Microsoft Excel and R (https://www.r-project.org/) [

      (R Foundation for Statistical Computing) R Core Team, R: A Language and Environment for Statistical Computing, (2019). https://www.r-project.org.

      ]. The standard deviation was used to measure variation around mean values.

      2.4 Validation framework

      Five VISAGE consortium laboratories participated in a collaborative exercise and performed one MiSeq FGx run per laboratory for the developmental validation of the ET-13. Each laboratory analysed seven differentially methylated DNA standards at optimum DNA input (200 ng) to evaluate the assay’s reproducibility. One DNA methylation standard was assigned to each participant to test the assay’s sensitivity with duplicates of 50 ng, 20 ng, 10 ng and 1 ng DNA input into bisulfite conversion. One concordance sample (200 ng DNA input) and one stain were processed in duplicate by each laboratory, except for Laboratory 1, which analysed both samples (1 and 2). Duplicates were prepared after DNA extraction and quantification. Sample assignment to laboratories and sample numbers per MiSeq FGx run are indicated in Table S3. Overall, 99 samples were analysed for assay validation.

      2.5 Species specificity testing

      DNA extracts from the ten animal samples used for species specificity testing (Table S4) derived from an in-house sample collection [
      • Esteve Codina A.
      • Niederstätter H.
      • Parson W.
      “GenderPlex” a PCR multiplex for reliable gender determination of degraded human DNA samples and complex gender constellations.
      ]. All samples were quantified by real-time PCR [
      • Niederstätter H.
      • Köchl S.
      • Grubwieser P.
      • Pavlic M.
      • Steinlechner M.
      • Parson W.
      A modular real-time PCR concept for determining the quantity and quality of human nuclear and mitochondrial DNA.
      ], bisulfite converted (100 ng DNA input) and amplified following the ET-13 protocol. All samples were processed with human gDNA as an amplification control and PCR-grade water as a negative template control. Purified PCR products were loaded on a DNA 1000 chip for electrophoretic separation using the Bioanalyzer instrument. BiSearch's Primer search ePCR function [
      • Tusnády G.E.
      BiSearch: primer-design and search tool for PCR on bisulfite-treated genomes.
      ] was used to scan animal databases for primers that may have created a PCR product in the ten animal samples. Primer pairs were tested in silico against bisulfite converted animal genomes using default parameters except for the “PCR product to display” option, which was increased to 500. The sequence of PCR products retrieved from BiSearch were further analysed using the BLAST [
      • Altschul S.
      Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
      ] program “Needleman-Wunsch alignment of two sequences” to compare sequence homology. BiSearch results that hits on the antisense strand were reverse complemented prior to sequence comparison.

      3. Results

      3.1 Development and inter-laboratory validation of the 13-marker enhanced tool for age estimation from semen (ET-13)

      The ET-13 was designed to target 13 candidate CpG sites in two multiplex PCRs (Table 1) that can be run in parallel on one thermocycler and are combined for library preparation. The minimum and maximum amplicon sizes were 75 bp and 242 bp (mean = 141.8 bp), respectively. In silico testing showed alternative PCR products for TUBB3 (size 733, antisense strand) and NOX4/FOLH1B (size 194, antisense strand). Due to three mismatches within the primer binding sites in each of the two TUBB3 primers, the amplification of the alternative larger amplicon was considered as negligible. In contrast, primer sequences of NOX4/FOLH1B match the binding sites of the alternative PCR product with the same amplicon size. Further investigation of the NOX4 gene showed that a pseudogene (LOC729960) is located on the other arm of chromosome 11, which impedes the design of primer specific only for the candidate CpG site. In fact, a previous study has identified a potential cross hybridization for probes targeting the NOX4/FOLH1B CpG site (cg06979108) on the Illumina Infinium MethylationEPIC BeadChip [
      • McCartney D.L.
      • Walker R.M.
      • Morris S.W.
      • McIntosh A.M.
      • Porteous D.J.
      • Evans K.L.
      Identification of polymorphic and off-target probe binding sites on the Illumina Infinium MethylationEPIC BeadChip.
      ]. However, the very same primer pair has been used in a previous study on age prediction [
      • Lee H.Y.
      • Jung S.-E.
      • Oh Y.N.
      • Choi A.
      • Yang W.I.
      • Shin K.-J.
      Epigenetic age signatures in the forensically relevant body fluid of semen: a preliminary study.
      ] and measurement of the combined DNA methylation level from the NOX4/FOLH1B candidate CpG site and the CpG site of the co-amplified pseudogene (chr11:49,251,356) showed statistically significant age correlation in the study by Pisarek et al. [
      • Pisarek A.
      • Pośpiech E.
      • Heidegger A.
      • Xavier C.
      • Papież A.
      • Piniewska-Róg D.
      • Kalamara V.
      • Potabattula R.
      • Bochenek M.
      • Sikora-Polaczek M.
      • Macur A.
      • Woźniak A.
      • Janeczko J.
      • Phillips C.
      • Haaf T.
      • Polańska J.
      • Parson W.
      • Kayser M.
      • Branicki W.
      Epigenetic age prediction in semen – marker selection and model development.
      ]. Therefore, the candidate CpG site was retained in the marker set and is hereafter referred to as NOX4/FOLH1B.
      A first test run was performed using differentially methylated DNA standards, which showed a promising assay performance and robust methylation quantification at all target CpG sites (data not shown). The five participating VISAGE consortium laboratories received detailed instructions for validation experiments. The data presented here is a combined analysis of the 99 samples to validate the technical performance of the assay prior to data generation for the age model development.

      3.1.1 Performance evaluation

      General assay performance was evaluated using methylated DNA standards at optimum DNA input for bisulfite conversion (N = 35). In a first step, sample coverage was calculated and compared among laboratories (Fig. 1A, Table S5). Whereas Laboratory 1 (median = 388,165 paired reads), 2 (median = 199,483 paired reads) and 5 (median = 574,037 paired reads) showed uniform coverage values, Laboratories 3 (median = 801,010 paired reads) and 4 (median = 1,816,451 paired reads) yielded wider coverage variability and higher sample coverages compared to the overall results (median = 540,564 paired reads). The higher read depth obtained by Laboratory 4 could be due to a higher cluster density during sequencing (1568 K/mm2) compared to the overall average cluster density (674.4 K/mm2, Table S6). Furthermore, variable library inputs during preparations of the library pool might explain high read depth outliers within one run. Consequently, median values were considered for evaluation of the sequencing depths. The overall median read depth at the 13 candidate CpG sites was 29,815 reads. Dropouts (target position < 1000 paired reads) were only observed for samples from Laboratory 4 at candidate CpG positions on SYT7 and TBX4. These CpG sites were excluded from downstream analysis in all samples of Laboratory 4. The read distribution between the 13 amplicons was evaluated by calculating the normalized read depth at candidate CpG sites for each sample (Fig. 1B). Four markers showed higher than expected read depth values, with NOX4/FOLH1B outperforming the remaining markers. In contrast, SH2B2, TBX4 and ARHGEF17 showed lower read depths in all laboratories. However, median read depth at underperforming markers can still be considered sufficient with 7874; 9555; 13,923 paired reads at the three candidate CpG sites, respectively.
      Fig. 1
      Fig. 1(A) Sample coverage of the seven DNA methylation standards used per laboratory. The median is shown for each MiSeq FGx run. (B) Normalized read depth per candidate CpG sites of DNA methylation standards (N = 35) across laboratories. The dashed line indicates the expected normalized read depth per marker (= 0.08; 1 divided by 13 markers).
      Sequence quality was evaluated by calculating base misincorporation rates, bisulfite conversion efficiency and sequencing baseline noise. Base misincorporations rates refer to the percentage of wrongly incorporated bases and help to estimate the intrinsic precision of the assay. Additionally, higher than expected frequencies could indicate misaligned reads. The obtained mean misincorporation rate at the 13 candidate CpG sites was low with a mean of 0.1% ( ± 0.3) wrongly incorporated bases. Base misincorporations above 1% were only observed at the SYT7 candidate CpG site (six DNA methylation standards, Laboratory 2) and the ARGHEF17 candidate CpG (four DNA methylation standards, Laboratory 1) and carefully inspected using the IGV software. Misalignments were found at a low percentage, not reaching the full amplicon. This may result from the reduced sequence complexity of bisulfite converted DNA. Examples can be found in Supplementary File 1A–B. The minimum obtained bisulfite conversion efficiency at non-CpG C bases per sample was 99.4%, indicating that non-converted cytosines were unlikely to compromise the final calculated methylation levels. Sequencing baseline noise was assessed for each run by calculating the percentage of total reads from NTCs to the respective median sample coverage. Only one NTC showed a percentage of aligned reads above 1% (1.2%). Visual inspection of alignments using the IGV software showed incorrect read alignment indicated by high base misincorporation rates (mean = 8.9% ± 19.9) and incomplete reads (Supplementary File 1C – D).
      For the six model CpG sites, additional quality control was performed by calculating read depth and base misincorporation rates. Since model CpG sites of each marker are located on the same amplicons as their initial candidate CpG sites, only minor differences (<1.5%) in read depth were observed between them. Lower read depths were only observed at the IFITM2 model CpG site, with a difference of 12,665.8 (40.2%) paired reads between overall means. Furthermore, the average base misincorporation rates were comparable to rates at candidate CpG sites with an elevated frequency of 1.8% ( ± 1.5) only at the model CpG site of IFITM2. Visual inspection of alignments showed misaligned reads in the reverse strand after a poly-G tract causing the incorporation of Gs at the IFITM2 model CpG site (Supplementary File 1E). Notwithstanding, methylation quantification at the model CpG site was robust. When comparing standard deviations of the mean methylation quantifications (N = 5) of the seven DNA methylation standards, no statistically significant differences were observed (paired t-test, P-value > 0.05) between the IFITM2 model and the candidate CpG site.

      3.1.2 Reproducibility

      Seven premixed DNA methylation standards were sent to the five participating laboratories to assess methylation quantification (Fig. 2). A tendency to underestimate the expected methylation levels was observed for most markers, appearing as a uniform bias from all laboratories. Variability between laboratories was assessed by calculating the standard deviation for each DNA methylation standard. Notably, the highest average standard deviation across markers was found at 75% expected DNA methylation level (mean SD = 3.1%), followed by 50% (mean SD = 2.8%) and 25% (mean SD = 2.2%). Whereas almost no variability was observed at 0% methylation (mean SD = 0.4%), the fully methylated control DNA showed an average standard deviation of 1.6%, which was higher than at 10% (mean SD = 1.3%) and 5% (mean SD = 1.0%) expected methylation level. When comparing markers, the highest standard deviations were observed at SH2B2 (SD = 7.4% at 50% methylation level), which also showed lower read depths compared to other markers (Fig. 1B). Overall, the methylation levels obtained by the five laboratories demonstrate that the assay generates reproducible results, with small standard deviations for the majority of markers.
      Fig. 2
      Fig. 2Mean methylation quantifications versus the expected methylation of seven differentially methylated DNA standards analysed by five laboratories using the ET-13 and quantifications obtained for the two replicates with the ET-5. Error bars of the ET-13 represent the standard deviation, the dashed line represents the line of identity (intercept = 0, slope = 1).

      3.1.3 Sensitivity

      Sensitivity was tested by lowering the DNA input into bisulfite conversion of five methylated DNA standards to 50 ng, 20 ng, 10 ng and 1 ng. The 1000 reads threshold was met for almost all markers and samples down to 10 ng DNA input. Dropouts only occurred at SYT7 and TBX4 candidate CpG sites of samples prepared by Laboratory 4 and one additional dropout was registered at the candidate CpG site of EXOC3 at 10 ng DNA input. In contrast, all 1 ng samples yielded read depths below the threshold at two or more candidate CpG sites (in total 36 dropouts). As read depths between MiSeq FGx runs showed large differences (Fig. 1-A), normalized read depth was considered for comparing the assay performance between 200 ng and lower DNA inputs (Fig. S1-A). Overall, the 200 ng samples showed no statistically significant differences (Pairwise Wilcoxon test, Bonferroni corrected P-value > 0.05) to samples with 50 ng DNA input. At lower DNA inputs, two markers from 20 ng SD samples (NOX4/FOLH1B and ARHGEF17), three markers of 10 ng samples (NOX4/FOLH1B, ARHGEF17 and IFITIM2) and six markers of 1 ng samples (NOX4/FOLH1B, ARHGEF17, IFITIM2, PALM, GALR2 and EXOC3) showed significant differences in normalized read depths to the optimum DNA input samples. Assessment of sequence quality showed base misincorporation rates similar to the optimum DNA input samples. The average bisulfite conversion efficiency decreased at 1 ng DNA input (mean = 97.2%), however, comparison between amplicons showed that almost all incomplete conversions of non-CpG C bases derive from the EXOC3-amplicon (Supplementary File 1F). Misaligned reads at non-CpG C bases of this amplicon reached the 1000 read threshold, which decreased the average bisulfite conversion rates, whereas the candidate CpG sites of EXOC3 were excluded due to insufficient read depth.
      Methylation quantification at low DNA inputs was evaluated at two levels. First, we assessed differences between duplicates. At low DNA inputs, large differences in DNA methylation rate estimates are expected due to stochastic effects. The mean difference in DNA methylation percentage (DNAm%) between duplicates was 4.0% (median = 1.9%) for 50 ng samples and increased to 8.8% (median = 4.3%) for 20 ng, 11.7% (median = 4.9%) for 10 ng and 19.9% (median = 8.4%) at 1 ng DNA input. Second, we calculated the mean of duplicates and compared it to the mean value obtained with 200 ng DNA input (Fig. 3) for all five methylation ratios. At 50 ng DNA input, the assay could retain accurate methylation quantification. Although some markers showed stable methylation quantification down to 10 ng DNA input (e.g. CpG sites at NOX4/FOLH1B, TBX4, SYT7, LOC401324), the number of outliers with high differences started to increase below 50 ng DNA input.
      Fig. 3
      Fig. 3Absolute difference between mean methylation of sensitivity samples to optimum DNA input samples (200 ng) per candidate CpG site.

      3.1.4 Concordance and “mock” casework samples

      The ET-13 set was tested with two concordance samples (DNA extracts from semen) and two “mock” casework samples (semen stains). The assay demonstrated a good overall performance with semen samples, showing low base misincorporation rates (mean = 0.07% ± 0.3) and bisulfite conversion efficiencies per sample exceeding 99.5%. High read depth values at all candidate CpG sites were observed for all laboratories (median = 38,615 reads, excluding SYT7 and TBX4 candidate CpG sites of Laboratory 4). Moreover, normalized read depth of semen samples exhibited a different read distribution between the 13 markers than observed for DNA methylation standards (Fig. S1-B).
      Methylation quantifications were evaluated based on DNAm% differences between duplicates and the comparison of the mean methylation quantifications from duplicates among laboratories (Fig. 4). One outlier (Laboratory 3, stain 2) was detected, resulting in a large difference in observed methylation for these duplicates (mean = 26.7%). When inspecting the respective quantification values from library preparation, the laboratory reported a library concentration of only 0.3 pM (mean = 281.3 nM), which indicates sample loss during the laboratory workflow probably leading to distorted methylation quantification results although normal read depth values were obtained. The respective sample was removed from further analysis. Methylation quantifications of the concordance and “mock" casework samples showed only small differences between technical duplicates with a mean difference of 0.9% ( ± 0.8) and 0.7% ( ± 0.8) across the 13 markers for concordance and ”mock” casework samples, respectively.
      Fig. 4
      Fig. 4Observed DNA methylation values at candidate CpG sites for concordance and “mock” casework samples 1 and 2. Each sample was analyzed by three participating laboratories in duplicates (one outlier removed) using the ET-13: Sample 1 by Laboratory 1, 2 and 5; Sample 2 by Laboratory 1, 3 and 4. DNA methylation values obtained by Laboratory 1 using the ET-5 are indicated for the respective markers at SH2B2, EXOC3, GALR2, IFITM2 and NOX4/FOLH1B CpG sites only.
      Laboratory comparisons revealed an absolute difference in DNAm% from one laboratory (mean of duplicates) to the mean quantifications obtained by the other laboratories of 0.8% and 0.9% for concordance sample 1 and 2 and 0.8% and 1.4% for stain 1 and 2 across candidate CpG sites (Fig. S2). The highest differences between laboratories were seen for SYT7 with a mean absolute difference of 3.8%. Additionally, the absolute differences in the overall mean methylation quantification results between concordance and “mock” casework samples was low for all candidate CpG sites with a mean difference of 0.7% ( ± 0.5). This suggests robust methylation quantification throughout all laboratories with low differences between the analyses of pre-extracted DNA samples and semen stains. Furthermore, we compared the final age prediction outcome among laboratories (Table 3). DNA methylation percentages at the six model CpG sites were extracted for each laboratory and used for age prediction with the VISAGE age model for semen. The mean predicted age varied from 35.7 to 41.2 years (5.5 years difference) for sample 1 and 38.6–42.7 years (4.1 years difference) for sample 2. The difference from the chronological age was on average 5.4 years for sample 1 and 2.0 years for sample 2 across concordance and “mock” casework samples.
      Table 3Mean predicted age (N = 2) based on DNAm levels obtained with the ET-13 or the ET-5 by laboratory.
      SampleRunToolConcordance [years]Stains [years]
      Sample 1Lab 1ET-1337.537.7
      (33 years)Lab 2ET-1341.239.4
      Lab 5ET-1338.735.7
      Lab 1ET-538.940.1
      Sample 2Lab 1ET-1339.140.3
      (42 years)Lab 3ET-1342.742*
      Lab 4ET-1338.638.6
      Lab 1ET-539.441
      *one replicate removed

      3.1.5 Species specificity

      Comparison between electropherogram profiles (Agilent Bioanalyzer) produced by animal DNA samples and the human control showed clear differences in both multiplex PCR assays, except for the 7-plex PCR products of the Orangutan sample (Supplementary File 2). This can be explained by the genetic similarities amongst primates. However, some animal samples yielded PCR products, potentially deriving from conserved genetic regions that could interfere with human methylation quantification. Even though most of the PCR products were detected at much lower concentration than in the human control sample, all primers were tested in an in-silico PCR using BiSearch with the respective animal genomes. No bisulfite converted genomes were available for Goat and Orangutan. BiSearch results showed that only the primer pairs targeting TUBB3, ARHGEF17 and LOC401324 were able to produce in silico amplicons in samples from Cattle, Rat, Sheep, Dog and Horse. These sequences were further compared to the human amplicons using BLAST (Needleman-Wunsch alignment of two sequences). Amplicon sequences retrieved with primers targeting TUBB3 and LOC401324 obtained only a low percentage of identity (16–54%) when compared to the human sequence, which would most likely lead to unmapped reads which would not interfere with the final methylation quantification results. For the primer pair targeting ARHGEF17, one amplicon with a high sequence identity (91%) was obtained for the converted Dog genome. Although this may lead to mapped sequence reads, ARHGEF17 primers show five base mismatches within the primer binding sequence, which likely reduces PCR efficiency. A careful inspection of alignments should be performed, in particular if animal-human mixtures may be present.

      3.2 Development and performance evaluation of the five-marker enhanced tool for age estimation from semen (ET-5)

      The ET-5 set combines only the final five VISAGE age model markers in one multiplex PCR and was evaluated with duplicates of the seven DNA methylation standards and the two concordance and “mock” casework samples by Laboratory 1. The assay achieved a median read depth of 113,235 paired reads across the five targeted regions (Fig. S3-A). When evaluating normalized read depth, a more balanced distribution was observed for concordance and casework type samples (N = 8) compared to DNA methylation standards (N = 14, Fig. S3-B). Sequence quality was assessed with the same parameters as with ET-13: by calculating base misincorporation rates at candidate (mean = 0.05% ± 0.02%, median = 0.04%) and model CpG sites (mean = 0.81% ± 1.99%, median = 0.04%), bisulfite conversion efficiency (mean > 99.7%) and analysis of the total reads percentage of two NTCs from the mean sample coverage (1.8% and 5.2%). Visual inspection of alignments from the NTCs showed that no uniform read depth across the amplicons was achieved. IGV captures of NTC amplicons with a read depth above 1000 reads at candidate CpG sites can be found in Supplementary File 1G-M. As with the ET-13, a higher frequency of base misincorporations was only detected at the model CpG site of IFITM2 (mean = 4.5 ± 2.8), caused by misalignments in the reverse strand. Likewise, these misalignments had no influence on the obtained DNA methylation level, which showed low differences between duplicates of DNA methylation standards (2.1% ± 1.3) and concordance and “mock” casework samples (0.9% ± 0.5) analysed with the ET-5.
      Obtained methylation values at candidate CpG positions of DNA methylation standards were in accordance with the ET-13 findings (Fig. 2), indicating that no substantial differences in methylation quantification were introduced by changing the multiplex PCR conditions. Furthermore, the discrepancy between the ET-13 and the ET-5 methylation quantifications of concordance and “mock” casework samples were small (Fig. 4) with a mean absolute difference in DNAm% of 0.7% ( ± 0.6) across the five shared markers. To further evaluate the effect of differences in obtained DNA methylation values, we compared age prediction outcomes between ET-13 and ET-5. Age estimations obtained using the ET-5 are in accordance with ET-13 (Table 3).

      4. Discussion

      The presented protocols for the ET-13 and ET-5 marker sets are the first targeted bisulfite MPS assays intended for age estimation from forensic semen samples. Even though quantitative DNA methylation methods were compared among laboratories in previous studies [
      • Bock C.
      • Halbritter F.
      • Carmona F.J.
      • Tierling S.
      • Datlinger P.
      • Assenov Y.
      • Berdasco M.
      • Bergmann A.K.
      • Booher K.
      • Busato F.
      • Campan M.
      • Dahl C.
      • Dahmcke C.M.
      • Diep D.
      • Fernández A.F.
      • Gerhauser C.
      • Haake A.
      • Heilmann K.
      • Holcomb T.
      • Hussmann D.
      • Ito M.
      • Kläver R.
      • Kreutz M.
      • Kulis M.
      • Lopez V.
      • Nair S.S.
      • Paul D.S.
      • Plongthongkum N.
      • Qu W.
      • Queirós A.C.
      • Reinicke F.
      • Sauter G.
      • Schlomm T.
      • Statham A.
      • Stirzaker C.
      • Strogantsev R.
      • Urdinguio R.G.
      • Walter K.
      • Weichenhan D.
      • Weisenberger D.J.
      • Beck S.
      • Clark S.J.
      • Esteller M.
      • Ferguson-Smith A.C.
      • Fraga M.F.
      • Guldberg P.
      • Hansen L.L.
      • Laird P.W.
      • Martín-Subero J.I.
      • Nygren A.O.H.
      • Peist R.
      • Plass C.
      • Shames D.S.
      • Siebert R.
      • Sun X.
      • Tost J.
      • Walter J.
      • Zhang K.
      Quantitative comparison of DNA methylation assays for biomarker development and clinical applications.
      ,
      • Jung S.E.
      • Cho S.
      • Antunes J.
      • Gomes I.
      • Uchimoto M.L.
      • Oh Y.N.
      • Di Giacomo L.
      • Schneider P.M.
      • Park M.S.
      • van der Meer D.
      • Williams G.
      • McCord B.
      • Ahn H.J.
      • Choi D.H.
      • Lee Y.H.
      • Lee S.D.
      • Lee H.Y.
      A collaborative exercise on DNA methylation based body fluid typing.
      ], to the best of our knowledge, this current study presents the first inter-laboratory validation of a single targeted bisulfite MPS assay for forensic applications. Such validation studies are essential to fully understand technical assay variabilities and enable the implementation of the MPS-based age tools into routine forensic laboratories.
      Our results show that the ET-13 set has an overall good performance and produced high quality sequences for all 13 amplicons. Read distributions between amplicons showed some variability between laboratories, however, the same tendencies for over- and underperforming markers were obtained. For optimum input samples (200 ng/bisulfite conversion), drops below the 1000 read threshold were observed in Laboratory 4 only, with candidate CpG sites on SYT7 and TBX4 being affected in all samples. The associated sequencing run featured a high level of imbalance. Here, the markers with very high read depths (Fig. 1A) may have outcompeted SYT7 and TBX4 target region amplification.
      Reproducibility analysis showed that the quantification levels obtained were stable between laboratories, with an average standard deviation across markers of 2.8% for the 50% methylated DNA standard. As already documented for the previously established VISAGE tools for age estimation [
      • Heidegger A.
      • Xavier C.
      • Niederstätter H.
      • de la Puente M.
      • Pośpiech E.
      • Pisarek A.
      • Kayser M.
      • Branicki W.
      • Parson W.
      Development and optimization of the VISAGE basic prototype tool for forensic age estimation.
      ,
      • Woźniak A.
      • Heidegger A.
      • Piniewska-Róg D.
      • Pośpiech E.
      • Xavier C.
      • Pisarek A.
      • Kartasińska E.
      • Boroń M.
      • Freire-Aradas A.
      • Wojtas M.
      • de la Puente M.
      • Niederstätter H.
      • Płoski R.
      • Spólnicka M.
      • Kayser M.
      • Phillips C.
      • Parson W.
      • Branicki W.
      Development of the VISAGE enhanced tool and statistical models for epigenetic age estimation in blood, buccal cells and bones.
      ] and a study by Daunay et al. [
      • Daunay A.
      • Baudrin L.G.
      • Deleuze J.-F.
      • How-Kit A.
      Evaluation of six blood-based age prediction models using DNA methylation analysis by pyrosequencing.
      ], higher variation was observed for the highly methylated DNA standards compared to lower methylated standards, which may be a result of the bisulfite conversion process or could be inherent to the in vitro methylated control DNA used.
      A thorough characterization of the assay sensitivity is of major importance for forensic applications. We observed only minor differences in read distribution and methylation quantification results between the 200 ng (optimum as indicated by the manufacturer) and 50 ng DNA input into bisulfite conversion. Below 50 ng, differences in normalized read depth increased with decreasing DNA input, possibly resulting from imbalances at the multiplex PCR step, where certain markers seem to have outcompeted other markers. At 1 ng DNA for bisulfite conversion, the generally high sequencing depth could no longer overcome differences in marker performance. Variability in methylation quantification results increased below 50 ng DNA, affecting the difference between technical duplicates and the difference to the optimum input values (Fig. 3). It should be noted that the DNA amount at PCR level may be considerably lower due to DNA loss and degradation during bisulfite conversion. When assuming 55% [
      • Leontiou C.A.
      • Hadjidaniel M.D.
      • Mina P.
      • Antoniou P.
      • Ioannides M.
      • Patsalis P.C.
      Bisulfite conversion of DNA: performance comparison of different kits and methylation quantitation of epigenetic biomarkers that have the potential to be used in non-invasive prenatal testing.
      ] to 76% [
      • Kint S.
      • De Spiegelaere W.
      • De Kesel J.
      • Vandekerckhove L.
      • Van Criekinge W.
      Evaluation of bisulfite kits for DNA methylation profiling in terms of DNA fragmentation and DNA recovery using digital PCR.
      ] DNA recovery and 4 µl input into each multiplex PCR, a DNA input of 50 ng, 20 ng, 10 ng and 1 ng into bisulfite conversion would correspond to 11–14.8 ng, 4.4–6.1 ng, 2.2–3 ng and 0.2–0.3 ng at the PCR level, respectively. Further studies evaluating semen samples will be needed to pinpoint the assay’s sensitivity with regard to the final age prediction outcome.
      Two semen samples were used to assess the assay performance and inter-laboratory variability with DNA extracts and “mock” casework samples (Fig. 4). For both sample types, the ET-13 showed robust methylation quantifications with small differences between duplicates performed by each laboratory. The results were consistent with previously recorded technical variabilities of a targeted bisulfite MPS assay reporting a maximum of 3% average difference of DNAm% for technical duplicates [
      • Aliferi A.
      • Ballard D.
      • Gallidabino M.D.
      • Thurtle H.
      • Barron L.
      D. Syndercombe Court, DNA methylation-based age prediction using massively parallel sequencing data and multiple machine learning models.
      ]. Additionally, inter-laboratory differences between mean quantifications remained small for DNA extracts and stains resulting in robust age prediction results. Differences between estimated and chronological age were within the expected range for the VISAGE model for semen (MAE = 5.1 years [
      • Pisarek A.
      • Pośpiech E.
      • Heidegger A.
      • Xavier C.
      • Papież A.
      • Piniewska-Róg D.
      • Kalamara V.
      • Potabattula R.
      • Bochenek M.
      • Sikora-Polaczek M.
      • Macur A.
      • Woźniak A.
      • Janeczko J.
      • Phillips C.
      • Haaf T.
      • Polańska J.
      • Parson W.
      • Kayser M.
      • Branicki W.
      Epigenetic age prediction in semen – marker selection and model development.
      ]).
      After assay validation was completed, the ET-13 was used for data generation and age model development by Pisarek et al. [
      • Pisarek A.
      • Pośpiech E.
      • Heidegger A.
      • Xavier C.
      • Papież A.
      • Piniewska-Róg D.
      • Kalamara V.
      • Potabattula R.
      • Bochenek M.
      • Sikora-Polaczek M.
      • Macur A.
      • Woźniak A.
      • Janeczko J.
      • Phillips C.
      • Haaf T.
      • Polańska J.
      • Parson W.
      • Kayser M.
      • Branicki W.
      Epigenetic age prediction in semen – marker selection and model development.
      ]. Therefore, not only were the 13 candidate CpG sites considered for age estimation, but also neighboring CpG sites. The final VISAGE age model for semen comprises six CpG sites on SH2B2, EXOC3, GALR2, IFITM2 and NOX4/FOLH1B (Table 2). Consequently, we developed the ET-5, targeting only these five model markers in one 5-plex PCR assay. The DNA methylation quantifications obtained from the ET-5 assay were in accordance with the ET-13, which is crucial for the age prediction model based on data from the whole ET-13 assay. The simultaneous amplification of all relevant age model markers is less labor-intensive and likely improves the assay sensitivity, since the eluate does not have to be split between two amplification reactions. The ET-5 therefore, represents a promising tool for forensic routine laboratories, which will be completed by a statistical software automating MPS data analysis and age prediction, currently developed by the VISAGE consortium.
      Apart from validation studies that prepare the ground for forensic applications of age estimation assays, further investigations determining possible confounding factors like lifestyle-related influences (e.g., alcohol consumption [
      • Rosen A.D.
      • Robertson K.D.
      • Hlady R.A.
      • Muench C.
      • Lee J.
      • Philibert R.
      • Horvath S.
      • Kaminsky Z.A.
      • Lohoff F.W.
      DNA methylation age is accelerated in alcohol dependence.
      ] or smoking [
      • Gao X.
      • Jia M.
      • Zhang Y.
      • Breitling L.P.
      • Brenner H.
      DNA methylation changes of whole blood cells in response to active smoking exposure in adults: a systematic review of DNA methylation studies.
      ]), diseases [
      • Spólnicka M.
      • Zbieć-Piekarska R.
      • Karp M.
      • Machnicki M.M.
      • Własiuk P.
      • Makowska Ż.
      • Pięta A.
      • Gambin T.
      • Gasperowicz P.
      • Branicki W.
      • Giannopoulos K.
      • Stokłosa T.
      • Płoski R.
      DNA methylation signature in blood does not predict calendar age in patients with chronic lymphocytic leukemia but may alert to the presence of disease.
      ,
      • Spólnicka M.
      • Piekarska R.Z.
      • Jaskuła E.
      • Basak G.W.
      • Jacewicz R.
      • Pięta A.
      • Makowska Ż.
      • Jedrzejczyk M.
      • Wierzbowska A.
      • Pluta A.
      • Robak T.
      • Berent J.
      • Branicki W.
      • Jędrzejczak W.
      • Lange A.
      • Płoski R.
      Donor age and C1orf132/MIR29B2C determine age-related methylation signature of blood after allogeneic hematopoietic stem cell transplantation.
      ] and biogeographic ancestry [
      • Horvath S.
      • Gurven M.
      • Levine M.E.
      • Trumble B.C.
      • Kaplan H.
      • Allayee H.
      • Ritz B.R.
      • Chen B.
      • Lu A.T.
      • Rickabaugh T.M.
      • Jamieson B.D.
      • Sun D.
      • Li S.
      • Chen W.
      • Quintana-Murci L.
      • Fagny M.
      • Kobor M.S.
      • Tsao P.S.
      • Reiner A.P.
      • Edlefsen K.L.
      • Absher D.
      • Assimes T.L.
      An epigenetic clock analysis of race/ethnicity, sex, and coronary heart disease.
      ] need to be addressed. A recent review by Koop et al. [
      • Koop B.E.
      • Reckert A.
      • Becker J.
      • Han Y.
      • Wagner W.
      • Ritz-Timme S.
      Epigenetic clocks may come out of rhythm—implications for the estimation of chronological age in forensic casework.
      ] provides an overview about the current literature on factors affecting epigenetic age, and thus will require systematic studies to evaluate their implications on forensic age estimation models. Furthermore, Jenkins et al. [
      • Jenkins T.G.
      • Aston K.I.
      • Cairns B.
      • Smith A.
      • Carrell D.T.
      Paternal germ line aging: DNA methylation age prediction from human sperm.
      ] have indicated the effect of environmental influences on sperm DNA methylation. Whereas the ET-5 protocol was re-optimized for routine applications, additional studies with the ET-13 may be useful to evaluate the impact of confounding factors on the DNA methylation levels of individual candidate CpG sites. This would enable further optimizations of the VISAGE model for semen. Of particular relevance for the VISAGE tools may be the effect of biogeographic ancestry on DNA methylation at age-associated CpG sites. By replicating an age estimation assay established for a Polish sample set [
      • Zbieć-Piekarska R.
      • Spólnicka M.
      • Kupiec T.
      • Parys-Proszek A.
      • Makowska Ż.
      • Pałeczka A.
      • Kucharczyk K.
      • Płoski R.
      • Branicki W.
      Development of a forensically useful age prediction method based on DNA methylation analysis.
      ] with samples from Korea, Cho et al. [
      • Cho S.
      • Jung S.-E.
      • Hong S.R.
      • Lee E.H.
      • Lee J.H.
      • Lee S.D.
      • Lee H.Y.
      Independent validation of DNA-based approaches for age prediction in blood.
      ] found slight differences in the predictive power of analyzed CpG sites. By retraining the model, they could improve the final age prediction outcome. Such differences in measured epigenetic ages of groups of different ancestry were reported by several studies [
      • Horvath S.
      • Gurven M.
      • Levine M.E.
      • Trumble B.C.
      • Kaplan H.
      • Allayee H.
      • Ritz B.R.
      • Chen B.
      • Lu A.T.
      • Rickabaugh T.M.
      • Jamieson B.D.
      • Sun D.
      • Li S.
      • Chen W.
      • Quintana-Murci L.
      • Fagny M.
      • Kobor M.S.
      • Tsao P.S.
      • Reiner A.P.
      • Edlefsen K.L.
      • Absher D.
      • Assimes T.L.
      An epigenetic clock analysis of race/ethnicity, sex, and coronary heart disease.
      ,
      • Javed R.
      • Chen W.
      • Lin F.
      • Liang H.
      Infant’s DNA methylation age at birth and epigenetic aging accelerators.
      ,
      • Tajuddin S.M.
      • Hernandez D.G.
      • Chen B.H.
      • Noren Hooten N.
      • Mode N.A.
      • Nalls M.A.
      • Singleton A.B.
      • Ejiogu N.
      • Chitrala K.N.
      • Zonderman A.B.
      • Evans M.K.
      Novel age-associated DNA methylation changes and epigenetic age acceleration in middle-aged African Americans and whites.
      ,
      • Thurston R.C.
      • Joffe H.
      Vasomotor symptoms and menopause: findings from the study of women’s health across the nation.
      ]. An evaluation of the ET-13 with sample sets deriving from different biogeographic ancestries would allow for the development of fully optimized age estimation models for the respective populations. In combination with the VISAGE tools for appearance and ancestry [
      • Xavier C.
      • de la Puente M.
      • Mosquera-Miguel A.
      • Freire-Aradas A.
      • Kalamara V.
      • Vidaki A.
      • Gross T.E.
      • Revoir A.
      • Pośpiech E.
      • Kartasińska E.
      • Spólnicka M.
      • Branicki W.
      • Ames C.E.
      • Schneider P.M.
      • Hohoff C.
      • Kayser M.
      • Phillips C.
      • Parson W.
      Development and validation of the VISAGE AmpliSeq basic tool to predict appearance and ancestry from DNA.
      ,
      • Palencia-Madrid L.
      • Xavier C.
      • de la Puente M.
      • Hohoff C.
      • Phillips C.
      • Kayser M.
      • Parson W.
      Evaluation of the VISAGE basic tool for appearance and ancestry prediction using powerSeq chemistry on the MiSeq FGx system.
      ], it would even become possible to choose the most accurate age model for an unknown sample donor.

      Funding

      The study received financial support from the European Union’s Horizon 2020 Research and Innovation Programme under grant agreement No. 740580 within the framework of the Visible Attributes through Genomics (VISAGE) Project and Consortium. MdlP is supported by a postdoctoral fellowship awarded by the Consellería de Cultura, Educación e Ordenación Universitaria and the Consellería de Economía, Emprego e Industria from Xunta de Galicia (Modalidade A, ED481B 2017/088).

      Author contributions

      AH developed assay design, performed laboratory work and data analysis. AP, AW, EP, WB, CX and HN contributed to assay development. MdP and CX contributed to laboratory work and data analysis. TS, FXL, JH, IB contributed to assay validation. NS, MU, MS, KJ, MVG, AU, JV performed laboratory work for assay validation. AH drafted the first version of the manuscript and WP and CX shaped the final version of the manuscript. WP conceived the study. All authors contributed to the article and approved the final version.

      Data availability

      The datasets generated for this study can be found in the European Nucleotide Archive (ENA) [https://www.ebi.ac.uk/ena/browser/home] under accession number PRJEB43859.

      Acknowledgments

      The authors would like to thank the Forensic Genetics and Casework section of the Institute of Legal Medicine in Innsbruck for DNA extraction of “mock” casework type samples and Leire Palencia Madrid (BIOMICs Research Group, Lascaray Research Center, University of the Basque Country, Vitoria-Gasteiz, Spain) for her laboratory support.

      Appendix A. Supplementary material

      References

        • Bell J.T.
        • Tsai P.-C.
        • Yang T.-P.
        • Pidsley R.
        • Nisbet J.
        • Glass D.
        • Mangino M.
        • Zhai G.
        • Zhang F.
        • Valdes A.
        • Shin S.-Y.
        • Dempster E.L.
        • Murray R.M.
        • Grundberg E.
        • Hedman A.K.
        • Nica A.
        • Small K.S.
        • Dermitzakis E.T.
        • McCarthy M.I.
        • Mill J.
        • Spector T.D.
        • Deloukas P.
        Epigenome-wide scans identify differentially methylated regions for age and age-related phenotypes in a healthy ageing population.
        PLoS Genet. 2012; 81002629https://doi.org/10.1371/journal.pgen.1002629
        • Horvath S.
        DNA methylation age of human tissues and cell types.
        Genome Biol. 2013; 14: 115https://doi.org/10.1186/gb-2013-14-10-r115
        • Hannum G.
        • Guinney J.
        • Zhao L.
        • Zhang L.
        • Hughes G.
        • Sadda S.
        • Klotzle B.
        • Bibikova M.
        • Fan J.-B.
        • Gao Y.
        • Deconde R.
        • Chen M.
        • Rajapakse I.
        • Friend S.
        • Ideker T.
        • Zhang K.
        Genome-wide methylation profiles reveal quantitative views of human aging rates.
        Mol. Cell. 2013; 49: 359-367https://doi.org/10.1016/j.molcel.2012.10.016
        • Vidaki A.
        • Kayser M.
        Recent progress, methods and perspectives in forensic epigenetics.
        Forensic Sci. Int. Genet. 2018; 37: 180-195https://doi.org/10.1016/j.fsigen.2018.08.008
      1. Infinium MethylationEPIC Kit, (n.d.). https://emea.illumina.com/products/by-type/microarray-kits/infinium-methylation-epic.html?langsel=/at/ (accessed 23 October 2020).

        • Lee J.W.
        • Choung C.M.
        • Jung J.Y.
        • Lee H.Y.
        • Lim S.-K.
        A validation study of DNA methylation-based age prediction using semen in forensic casework samples.
        Leg. Med. 2018; 31: 74-77https://doi.org/10.1016/j.legalmed.2018.01.005
        • Aliferi A.
        • Ballard D.
        • Gallidabino M.D.
        • Thurtle H.
        • Barron L.
        D. Syndercombe Court, DNA methylation-based age prediction using massively parallel sequencing data and multiple machine learning models.
        Forensic Sci. Int. Genet. 2018; 37: 215-226https://doi.org/10.1016/j.fsigen.2018.09.003
        • Naue J.
        • Sänger T.
        • Hoefsloot H.C.J.
        • Lutz-Bonengel S.
        • Kloosterman A.D.
        • Verschure P.J.
        Proof of concept study of age-dependent DNA methylation markers across different tissues by massive parallel sequencing.
        Forensic Sci. Int. Genet. 2018; 36: 152-159https://doi.org/10.1016/j.fsigen.2018.07.007
        • Weidner C.
        • Lin Q.
        • Koch C.
        • Eisele L.
        • Beier F.
        • Ziegler P.
        • Bauerschlag D.
        • Jöckel K.-H.
        • Erbel R.
        • Mühleisen T.
        • Zenke M.
        • Brümmendorf T.
        • Wagner W.
        Aging of blood can be tracked by DNA methylation changes at just three CpG sites.
        Genome Biol. 2014; 15: 24https://doi.org/10.1186/gb-2014-15-2-r24
        • Zbieć-Piekarska R.
        • Spólnicka M.
        • Kupiec T.
        • Parys-Proszek A.
        • Makowska Ż.
        • Pałeczka A.
        • Kucharczyk K.
        • Płoski R.
        • Branicki W.
        Development of a forensically useful age prediction method based on DNA methylation analysis.
        Forensic Sci. Int. Genet. 2015; 17: 173-179https://doi.org/10.1016/j.fsigen.2015.05.001
        • Xu C.
        • Qu H.
        • Wang G.
        • Xie B.
        • Shi Y.
        • Yang Y.
        • Zhao Z.
        • Hu L.
        • Fang X.
        • Yan J.
        • Feng L.
        A novel strategy for forensic age prediction by DNA methylation and support vector regression model.
        Sci. Rep. 2015; 5: 17788https://doi.org/10.1038/srep17788
        • Vidaki A.
        • Ballard D.
        • Aliferi A.
        • Miller T.H.
        • Barron L.P.
        • Syndercombe Court D.
        DNA methylation-based forensic age prediction using artificial neural networks and next generation sequencing.
        Forensic Sci. Int. Genet. 2017; 28: 225-236https://doi.org/10.1016/j.fsigen.2017.02.009
        • Naue J.
        • Hoefsloot H.C.J.
        • Kloosterman A.D.
        • Verschure P.J.
        Forensic DNA methylation profiling from minimal traces: How low can we go?.
        Forensic Sci. Int. Genet. 2018; 33: 17-23https://doi.org/10.1016/j.fsigen.2017.11.004
        • Jung S.-E.
        • Lim S.M.
        • Hong S.R.
        • Lee E.H.
        • Shin K.-J.
        • Lee H.Y.
        DNA methylation of the ELOVL2, FHL2, KLF14, C1orf132/MIR29B2C, and TRIM59 genes for age prediction from blood, saliva, and buccal swab samples.
        Forensic Sci. Int. Genet. 2019; 38: 1-8https://doi.org/10.1016/j.fsigen.2018.09.010
        • Slieker R.C.
        • Relton C.L.
        • Gaunt T.R.
        • Slagboom P.E.
        • Heijmans B.T.
        Age-related DNA methylation changes are tissue-specific with ELOVL2 promoter methylation as exception.
        Epigenetics Chromatin. 2018; 11: 25https://doi.org/10.1186/s13072-018-0191-3
        • Lee H.Y.
        • Jung S.-E.
        • Oh Y.N.
        • Choi A.
        • Yang W.I.
        • Shin K.-J.
        Epigenetic age signatures in the forensically relevant body fluid of semen: a preliminary study.
        Forensic Sci. Int. Genet. 2015; 19: 28-34https://doi.org/10.1016/j.fsigen.2015.05.014
        • Li L.
        • Song F.
        • Lang M.
        • Hou J.
        • Wang Z.
        • Prinz M.
        • Hou Y.
        Methylation‐based age prediction using pyrosequencing platform from seminal stains in han chinese males.
        J. Forensic Sci. 2020; 65: 610-619https://doi.org/10.1111/1556-4029.14186
        • Jenkins T.G.
        • Aston K.I.
        • Cairns B.
        • Smith A.
        • Carrell D.T.
        Paternal germ line aging: DNA methylation age prediction from human sperm.
        BMC Genom. 2018; 19: 763https://doi.org/10.1186/s12864-018-5153-4
        • Pisarek A.
        • Pośpiech E.
        • Heidegger A.
        • Xavier C.
        • Papież A.
        • Piniewska-Róg D.
        • Kalamara V.
        • Potabattula R.
        • Bochenek M.
        • Sikora-Polaczek M.
        • Macur A.
        • Woźniak A.
        • Janeczko J.
        • Phillips C.
        • Haaf T.
        • Polańska J.
        • Parson W.
        • Kayser M.
        • Branicki W.
        Epigenetic age prediction in semen – marker selection and model development.
        Aging. 2021; 13: 19145-19164https://doi.org/10.18632/aging.203399
      2. Qiagen, Isolation of genomic DNA from sperm using the QIAamp® DNA Mini Kit; protocol 2 - (EN), (n.d.). https://www.qiagen.com/us/resources/resourcedetail?id=5f0e4e08-d405–42d9-acf6–73eaaa52190c&lang=en, (accessed 2 February 2021).

        • Heidegger A.
        • Xavier C.
        • Niederstätter H.
        • de la Puente M.
        • Pośpiech E.
        • Pisarek A.
        • Kayser M.
        • Branicki W.
        • Parson W.
        Development and optimization of the VISAGE basic prototype tool for forensic age estimation.
        Forensic Sci. Int. Genet. 2020; 48102322https://doi.org/10.1016/j.fsigen.2020.102322
        • Woźniak A.
        • Heidegger A.
        • Piniewska-Róg D.
        • Pośpiech E.
        • Xavier C.
        • Pisarek A.
        • Kartasińska E.
        • Boroń M.
        • Freire-Aradas A.
        • Wojtas M.
        • de la Puente M.
        • Niederstätter H.
        • Płoski R.
        • Spólnicka M.
        • Kayser M.
        • Phillips C.
        • Parson W.
        • Branicki W.
        Development of the VISAGE enhanced tool and statistical models for epigenetic age estimation in blood, buccal cells and bones.
        Aging. 2021; 13: 6459-6484https://doi.org/10.18632/aging.202783
        • Tusnády G.E.
        BiSearch: primer-design and search tool for PCR on bisulfite-treated genomes.
        Nucleic Acids Res. 2005; 33 (e9–e9): 9https://doi.org/10.1093/nar/gni012
        • Pedersen B.S.
        • Eyring K.
        • De S.
        • Yang I.V.
        • Schwartz D.A.
        Fast and accurate alignment of long bisulfite-seq reads.
        Bioinformatics. 2014; 00: 1-2
        • Li H.
        • Handsaker B.
        • Wysoker A.
        • Fennell T.
        • Ruan J.
        • Homer N.
        • Marth G.
        • Abecasis G.
        • Durbin R.
        The sequence alignment/map format and SAMtools.
        Bioinformatics. 2009; 25: 2078-2079https://doi.org/10.1093/bioinformatics/btp352
        • Robinson J.T.
        • Thorvaldsdóttir H.
        • Winckler W.
        • Guttman M.
        • Lander E.S.
        • Getz G.
        • Mesirov J.P.
        Integrative genomics viewer.
        Nat. Biotechnol. 2011; 29: 24-26https://doi.org/10.1038/nbt.1754
      3. (R Foundation for Statistical Computing) R Core Team, R: A Language and Environment for Statistical Computing, (2019). https://www.r-project.org.

        • Esteve Codina A.
        • Niederstätter H.
        • Parson W.
        “GenderPlex” a PCR multiplex for reliable gender determination of degraded human DNA samples and complex gender constellations.
        Int. J. Leg. Med. 2009; 123: 459-464https://doi.org/10.1007/s00414-008-0301-z
        • Niederstätter H.
        • Köchl S.
        • Grubwieser P.
        • Pavlic M.
        • Steinlechner M.
        • Parson W.
        A modular real-time PCR concept for determining the quantity and quality of human nuclear and mitochondrial DNA.
        Forensic Sci. Int. Genet. 2007; 1: 29-34https://doi.org/10.1016/j.fsigen.2006.10.007
        • Altschul S.
        Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
        Nucleic Acids Res. 1997; 25: 3389-3402https://doi.org/10.1093/nar/25.17.3389
        • McCartney D.L.
        • Walker R.M.
        • Morris S.W.
        • McIntosh A.M.
        • Porteous D.J.
        • Evans K.L.
        Identification of polymorphic and off-target probe binding sites on the Illumina Infinium MethylationEPIC BeadChip.
        Genom. Data. 2016; 9: 22-24https://doi.org/10.1016/j.gdata.2016.05.012
        • Bock C.
        • Halbritter F.
        • Carmona F.J.
        • Tierling S.
        • Datlinger P.
        • Assenov Y.
        • Berdasco M.
        • Bergmann A.K.
        • Booher K.
        • Busato F.
        • Campan M.
        • Dahl C.
        • Dahmcke C.M.
        • Diep D.
        • Fernández A.F.
        • Gerhauser C.
        • Haake A.
        • Heilmann K.
        • Holcomb T.
        • Hussmann D.
        • Ito M.
        • Kläver R.
        • Kreutz M.
        • Kulis M.
        • Lopez V.
        • Nair S.S.
        • Paul D.S.
        • Plongthongkum N.
        • Qu W.
        • Queirós A.C.
        • Reinicke F.
        • Sauter G.
        • Schlomm T.
        • Statham A.
        • Stirzaker C.
        • Strogantsev R.
        • Urdinguio R.G.
        • Walter K.
        • Weichenhan D.
        • Weisenberger D.J.
        • Beck S.
        • Clark S.J.
        • Esteller M.
        • Ferguson-Smith A.C.
        • Fraga M.F.
        • Guldberg P.
        • Hansen L.L.
        • Laird P.W.
        • Martín-Subero J.I.
        • Nygren A.O.H.
        • Peist R.
        • Plass C.
        • Shames D.S.
        • Siebert R.
        • Sun X.
        • Tost J.
        • Walter J.
        • Zhang K.
        Quantitative comparison of DNA methylation assays for biomarker development and clinical applications.
        Nat. Biotechnol. 2016; 34: 726-737https://doi.org/10.1038/nbt.3605
        • Jung S.E.
        • Cho S.
        • Antunes J.
        • Gomes I.
        • Uchimoto M.L.
        • Oh Y.N.
        • Di Giacomo L.
        • Schneider P.M.
        • Park M.S.
        • van der Meer D.
        • Williams G.
        • McCord B.
        • Ahn H.J.
        • Choi D.H.
        • Lee Y.H.
        • Lee S.D.
        • Lee H.Y.
        A collaborative exercise on DNA methylation based body fluid typing.
        Electrophoresis. 2016; 37: 2759-2766https://doi.org/10.1002/elps.201600256
        • Daunay A.
        • Baudrin L.G.
        • Deleuze J.-F.
        • How-Kit A.
        Evaluation of six blood-based age prediction models using DNA methylation analysis by pyrosequencing.
        Sci. Rep. 2019; 9: 8862https://doi.org/10.1038/s41598-019-45197-w
        • Leontiou C.A.
        • Hadjidaniel M.D.
        • Mina P.
        • Antoniou P.
        • Ioannides M.
        • Patsalis P.C.
        Bisulfite conversion of DNA: performance comparison of different kits and methylation quantitation of epigenetic biomarkers that have the potential to be used in non-invasive prenatal testing.
        PLoS One. 2015; 100135058https://doi.org/10.1371/journal.pone.0135058
        • Kint S.
        • De Spiegelaere W.
        • De Kesel J.
        • Vandekerckhove L.
        • Van Criekinge W.
        Evaluation of bisulfite kits for DNA methylation profiling in terms of DNA fragmentation and DNA recovery using digital PCR.
        PLoS One. 2018; 130199091https://doi.org/10.1371/journal.pone.0199091
        • Rosen A.D.
        • Robertson K.D.
        • Hlady R.A.
        • Muench C.
        • Lee J.
        • Philibert R.
        • Horvath S.
        • Kaminsky Z.A.
        • Lohoff F.W.
        DNA methylation age is accelerated in alcohol dependence.
        Transl. Psychiatry. 2018; 8: 182https://doi.org/10.1038/s41398-018-0233-4
        • Gao X.
        • Jia M.
        • Zhang Y.
        • Breitling L.P.
        • Brenner H.
        DNA methylation changes of whole blood cells in response to active smoking exposure in adults: a systematic review of DNA methylation studies.
        Clin. Epigenetics. 2015; 7: 113https://doi.org/10.1186/s13148-015-0148-3
        • Spólnicka M.
        • Zbieć-Piekarska R.
        • Karp M.
        • Machnicki M.M.
        • Własiuk P.
        • Makowska Ż.
        • Pięta A.
        • Gambin T.
        • Gasperowicz P.
        • Branicki W.
        • Giannopoulos K.
        • Stokłosa T.
        • Płoski R.
        DNA methylation signature in blood does not predict calendar age in patients with chronic lymphocytic leukemia but may alert to the presence of disease.
        Forensic Sci. Int. Genet. 2018; 34: e15-e17https://doi.org/10.1016/j.fsigen.2018.02.004
        • Spólnicka M.
        • Piekarska R.Z.
        • Jaskuła E.
        • Basak G.W.
        • Jacewicz R.
        • Pięta A.
        • Makowska Ż.
        • Jedrzejczyk M.
        • Wierzbowska A.
        • Pluta A.
        • Robak T.
        • Berent J.
        • Branicki W.
        • Jędrzejczak W.
        • Lange A.
        • Płoski R.
        Donor age and C1orf132/MIR29B2C determine age-related methylation signature of blood after allogeneic hematopoietic stem cell transplantation.
        Clin. Epigenetics. 2016; 8: 93https://doi.org/10.1186/s13148-016-0257-7
        • Horvath S.
        • Gurven M.
        • Levine M.E.
        • Trumble B.C.
        • Kaplan H.
        • Allayee H.
        • Ritz B.R.
        • Chen B.
        • Lu A.T.
        • Rickabaugh T.M.
        • Jamieson B.D.
        • Sun D.
        • Li S.
        • Chen W.
        • Quintana-Murci L.
        • Fagny M.
        • Kobor M.S.
        • Tsao P.S.
        • Reiner A.P.
        • Edlefsen K.L.
        • Absher D.
        • Assimes T.L.
        An epigenetic clock analysis of race/ethnicity, sex, and coronary heart disease.
        Genome Biol. 2016; 17: 171https://doi.org/10.1186/s13059-016-1030-0
        • Koop B.E.
        • Reckert A.
        • Becker J.
        • Han Y.
        • Wagner W.
        • Ritz-Timme S.
        Epigenetic clocks may come out of rhythm—implications for the estimation of chronological age in forensic casework.
        Int. J. Leg. Med. 2020; 134: 2215-2228https://doi.org/10.1007/s00414-020-02375-0
        • Cho S.
        • Jung S.-E.
        • Hong S.R.
        • Lee E.H.
        • Lee J.H.
        • Lee S.D.
        • Lee H.Y.
        Independent validation of DNA-based approaches for age prediction in blood.
        Forensic Sci. Int. Genet. 2017; 29: 250-256https://doi.org/10.1016/j.fsigen.2017.04.020
        • Javed R.
        • Chen W.
        • Lin F.
        • Liang H.
        Infant’s DNA methylation age at birth and epigenetic aging accelerators.
        Biomed. Res. Int. 2016; 2016: 1-10https://doi.org/10.1155/2016/4515928
        • Tajuddin S.M.
        • Hernandez D.G.
        • Chen B.H.
        • Noren Hooten N.
        • Mode N.A.
        • Nalls M.A.
        • Singleton A.B.
        • Ejiogu N.
        • Chitrala K.N.
        • Zonderman A.B.
        • Evans M.K.
        Novel age-associated DNA methylation changes and epigenetic age acceleration in middle-aged African Americans and whites.
        Clin. Epigenetics. 2019; 11: 119https://doi.org/10.1186/s13148-019-0722-1
        • Thurston R.C.
        • Joffe H.
        Vasomotor symptoms and menopause: findings from the study of women’s health across the nation.
        Obstet. Gynecol. Clin. North Am. 2011; 38: 489-501https://doi.org/10.1016/j.ogc.2011.05.006
        • Xavier C.
        • de la Puente M.
        • Mosquera-Miguel A.
        • Freire-Aradas A.
        • Kalamara V.
        • Vidaki A.
        • Gross T.E.
        • Revoir A.
        • Pośpiech E.
        • Kartasińska E.
        • Spólnicka M.
        • Branicki W.
        • Ames C.E.
        • Schneider P.M.
        • Hohoff C.
        • Kayser M.
        • Phillips C.
        • Parson W.
        Development and validation of the VISAGE AmpliSeq basic tool to predict appearance and ancestry from DNA.
        Forensic Sci. Int. Genet. 2020; 48102336https://doi.org/10.1016/j.fsigen.2020.102336
        • Palencia-Madrid L.
        • Xavier C.
        • de la Puente M.
        • Hohoff C.
        • Phillips C.
        • Kayser M.
        • Parson W.
        Evaluation of the VISAGE basic tool for appearance and ancestry prediction using powerSeq chemistry on the MiSeq FGx system.
        Genes. 2020; : 708https://doi.org/10.3390/genes11060708