1. Introduction
The use of microbial signatures for human identification constitutes a new form of information that can be used in forensic applications. The wide variety of data types, sample locations, environmental factors, analysis methods, and the diversity of microorganisms make this emerging area of forensic biology increasingly relevant [
[1]- Clarke T.H.
- Gomez A.
- Singh H.
- Nelson K.E.
- Brinkac L.M.
Integrating the microbiome as a resource in the forensics toolkit.
]. Although the experimental study of microbiome analysis in relation to casework is relatively new, the use of bacteria in forensically relevant cases dates back to the late 1800s [
[2]- Perego U.A.
- Achilli A.
- Ekins J.E.
- Milani L.
- Lari M.
- Pilli E.
- Brown A.
- Price E.P.
- Wolken S.R.
- Matthews M.
- Allen C.A.
- Pearson T.R.
- Angerhofer N.
- Caramelli D.
- Kupferschmid T.
- Keim P.S.
- Woodward S.R.
The Mountain Meadows Massacre and “poisoned springs”: scientific testing of the more recent, anthrax theory.
]. In one modern example, researchers have examined commercial honey products for the presence of
Clostridium botulinum as a vector for cases of induced poisoning from botulism [
[3]- Olivieri C.
- Marota I.
- Rollo F.
- Luciani S.
Tracking plant, fungal, and bacterial DNA in honey specimens.
]. More recently,
Neisseria gonorrhoeae infections in children have been considered as evidence in sexual assault cases [
[4]- Sathirareuangchai S.
- Phuangphung P.
- Leelaporn A.
- Boon-yasidhi V.
The usefulness of Neisseria gonorrhoeae strain typing by pulse-field gel electrophoresis (PFGE) and DNA detection as the forensic evidence in child sexual abuse cases: a case series.
]. As such, the application of microbes in forensic science is not a new concept.
The potential of utilizing the human microbiome in forensic science has previously been described by Hampton-Marcell et al. [
[5]- Hampton-Marcell J.T.
- Larsen P.
- Anton T.
- Cralle L.
- Sangwan N.
- Lax S.
- Gottel N.
- Salas-Garcia M.
- Young C.
- Duncan G.
- Lopez J.V.
- Gilbert J.A.
Detecting personal microbiota signatures at artificial crime scenes.
] who performed experiments to identify associations between human skin bacterial composition and the physical environment. That study [
[5]- Hampton-Marcell J.T.
- Larsen P.
- Anton T.
- Cralle L.
- Sangwan N.
- Lax S.
- Gottel N.
- Salas-Garcia M.
- Young C.
- Duncan G.
- Lopez J.V.
- Gilbert J.A.
Detecting personal microbiota signatures at artificial crime scenes.
] concluded that using the bacterial 16S ribosomal RNA (rRNA) marker region provided weak predictions of human identity compared to more traditional methods of forensic human DNA analysis. However, they also recognized that there could be better methods of analysis and that the field of study is currently in its infancy [
[5]- Hampton-Marcell J.T.
- Larsen P.
- Anton T.
- Cralle L.
- Sangwan N.
- Lax S.
- Gottel N.
- Salas-Garcia M.
- Young C.
- Duncan G.
- Lopez J.V.
- Gilbert J.A.
Detecting personal microbiota signatures at artificial crime scenes.
]. In a recent study addressing the human skin bacterial microbiome for the purpose of human identification, researchers identified a set of bacterial taxa that were able to classify individuals beyond a one-year period with up to 85% accuracy [
[6]- Watanabe H.
- Nakamura I.
- Mizutani S.
- Kurokawa Y.
- Mori H.
- Kurokawa K.
- Yamada T.
Minor taxa in human skin microbiome contribute to the personal identification.
]. Additional work conducted by Schmedes et al. [
[7]- Schmedes S.E.
- Woerner A.E.
- Budowle B.
Forensic human identification using skin microbiomes.
] used publicly available shotgun metagenomic data collected from 12 human individuals spanning multiple skin site locations. Their findings identified multiple bacterial clade specific and single nucleotide variant markers from the bacterium
Propionibacterium acnes that identified the 12 study participants with up to 100% accuracy [
[7]- Schmedes S.E.
- Woerner A.E.
- Budowle B.
Forensic human identification using skin microbiomes.
].
While there has been ample work using bacteria as targets for forensic identification, there have been very few studies addressing the application of viruses and the human virome for forensic purposes. A potential method for tracing unidentified human cadavers using the JC virus was published by Ikegaya et al. [
[8]- Ikegaya H.
- Iwase H.
- Sugimoto C.
- Yogo Y.
JC virus genotyping offers a new means of tracing the origins of unidentified cadavers.
] where the authors suggested that the presence of this virus in humans may aid in determining the geographical location of origin of the decedent. Wilson et al. [
[9]- Wilson M.R.
- Weaver S.C.
- Winegar R.A.
Legal, technical, and interpretational considerations in the forensic analysis of viruses.
] described specific considerations and presented seven discrete steps for analysis of viral samples in a forensic case. They also described a broad range of topics relevant to working with viral biomarkers, including considering viruses as weapons and the intentional/unintentional transmission of virus-mediated diseases in criminal cases. Additionally, the hidSkinPlex, a panel of microbial clade-specific markers which were developed for the assessment of the skin microbiome for human identification, utilizes the
Propionibacterium phage P101A as a marker [
[10]- Schmedes S.E.
- Woerner A.E.
- Novroski N.M.M.
- Wendt F.R.
- King J.L.
- Stephens K.M.
- Budowle B.
Targeted sequencing of clade-specific markers from skin microbiomes for forensic human identification.
]. The development and validation study of this panel evaluated a small set of viral markers specific to
Propionibacterium phage isolates; however, this study only utilized a single bacteriophage virus and did not address other viral types. Authors in this study acknowledged there is potential for viral markers to be used for human identification, however, they point out the need for further studies and investigation into phage and viral marker development utilizing viral targeted extraction and improved amplification techniques [
[10]- Schmedes S.E.
- Woerner A.E.
- Novroski N.M.M.
- Wendt F.R.
- King J.L.
- Stephens K.M.
- Budowle B.
Targeted sequencing of clade-specific markers from skin microbiomes for forensic human identification.
].
The human skin virome offers a unique advantage over just using bacterial based microbiome markers. The community composition of human skin bacteria is affected by a number of confounding factors, such as antibiotic intervention or the use of antibacterial soap [
[11]Acne, the skin microbiome, and antibiotic treatment.
]. For instances like these, alternative microbial markers must be used as they provide an alternative form of data to that of pre-established bacterial markers. Targeting viral populations that are part of the core skin virome, such as eukaryotic infecting viral populations not affected by antibacterial agents, not only offers additional biomarkers to those already established but potentially offers increased stability and detection even in the presence of outside environmental contributory factors.
To date, no study has attempted to exclusively investigate the human virome over a time series across various physical locations of the body to identify its potential as a tool for human identification separate from that of the human bacterial microbiome. One reason for the limited number of virome studies has been the lack of bioinformatic and molecular tools for human virome investigation. Despite this, the development of high-throughput sequencing methods and the resulting decrease in sequencing costs have led to studies investigating the viromes of humans, primarily focusing on describing viral diversity in the human gut [
12- Reyes A.
- Haynes M.
- Hanson N.
- Angly F.E.
- Heath A.C.
- Rohwer F.
- Gordon J.I.
Viruses in the faecal microbiota of monozygotic twins and their mothers.
,
13- Minot S.
- Sinha R.
- Chen J.
- Li H.
- Keilbaugh S.A.
- Wu G.D.
- Lewis J.D.
- Bushman F.D.
The human gut virome: Inter-individual variation and dynamic response to diet.
,
14- Minot S.
- Bryson A.
- Chehoud C.
- Wu G.D.
- Lewis J.D.
- Bushman F.D.
Rapid evolution of the human gut virome.
,
15- Hannigan G.D.
- Meisel J.S.
- Tyldsley A.S.
- Zheng Q.
- Hodkinson B.P.
- Sanmiguel A.J.
- Minot S.
- Bushman F.D.
- Grice E.A.
The human skin double-stranded DNA virome: topographical and temporal diversity, genetic enrichment, and dynamic associations with the host microbiome.
,
16The human gut virome: a multifaceted majority.
,
17- Rascovan N.
- Duraisamy R.
- Desnues C.
Metagenomics and the human virome in asymptomatic individuals.
]. These studies have demonstrated that human gut viromes tend to be “highly individual and temporally stable” [
[16]The human gut virome: a multifaceted majority.
], two key features that are highly desirable for a forensic biomarker.
In order to address the human skin virome in a forensic context, we investigated the temporal human skin virome stability on three body locations (left hand, right hand, and scalp) in 42 study participants using five longitudinal samples taken across a 6-month time period. The goal of this study was to address two hypotheses. First, we expect that the human skin virome consists of both stable and variable viral taxa but are unsure which viral taxa may be in each category. Second, we hypothesize that there will be viral taxa that may be diagnostic for human individualization and that we may attribute sampling location or lifestyle traits as a proxy to explain diagnostic differences in the human skin virome. Our overall goal in this exploratory study was to identify unique but stable viral sub-populations within the human skin virome and to assess this virome diversity for potential forensic human identification.
4. Discussion
The utilization of alternative sources of biological data for forensic human identification should incorporate aspects of both target marker stability over time and the ability to utilize those markers to have probative discriminatory power across individuals within a population. This study set out to address the viability of the human skin viral microbiome as a proof-of-concept for its stability and utilization as a source for human identification. Previous studies, such as those by [
6- Watanabe H.
- Nakamura I.
- Mizutani S.
- Kurokawa Y.
- Mori H.
- Kurokawa K.
- Yamada T.
Minor taxa in human skin microbiome contribute to the personal identification.
,
7- Schmedes S.E.
- Woerner A.E.
- Budowle B.
Forensic human identification using skin microbiomes.
,
10- Schmedes S.E.
- Woerner A.E.
- Novroski N.M.M.
- Wendt F.R.
- King J.L.
- Stephens K.M.
- Budowle B.
Targeted sequencing of clade-specific markers from skin microbiomes for forensic human identification.
], have identified a suite of bacterial biomarkers (along with one bacteriophage) for human identification. Additionally, studies have shown certain viral taxa associated with human skin disease have yearlong stability and thus could be an additional microbial target for human identification [
[42]- Tirosh O.
- Conlan S.
- Deming C.
- Lee-Lin S.Q.
- Huang X.
- Barnabas B.B.
- Bouffard G.G.
- Brooks S.Y.
- Marfani H.
- Dekhtyar L.
- Guan X.
- Han J.
- ling Ho S.
- Legaspi R.
- Maduro Q.L.
- Masiello C.A.
- McDowell J.C.
- Montemayor C.
- Mullikin J.C.
- Park M.
- Riebow N.L.
- Schandler K.
- Scharer C.
- Schmidt B.
- Sison C.
- Stantripop S.
- Thomas J.W.
- Thomas P.J.
- Vemulapalli M.
- Young A.C.
- Su H.C.
- Freeman A.F.
- Segre J.A.
- Kong H.H.
Expanded skin virome in DOCK8-deficient patients.
]. However, no previously published studies have exclusively investigated the potential utilization of the skin virome for forensics or have identified a panel of potential taxonomic and sequence variant viral markers for human identification. The goal of the study presented here was to test the feasibility and proof-of-concept of targeting the viral component of the human skin microbiome, as has been done for the bacterial microbiome, for forensic purposes.
In this study, we sequenced a total of 652 human skin viral metagenomes and analyzed that data to identify viral markers for human identification. Samples were pre- and post-sequence processed to reduce eukaryotic and prokaryotic contamination. Bacterial contamination was assessed by quantifying the presence of 16S rRNA genes in the viral assemblies. Samples contained a low amount of bacterial 16S rRNA sequences which suggested our viromes to be highly enriched for human-associated viruses with minimal contamination. Enrichment for virus-like particles (VLPs) as opposed to an overall bulk shotgun microbiome approach allows for deeper sequencing of viral genomes which aids in increased viral sequence data recovery and viral annotation [
[43]- Trubl G.
- Bin Jang H.
- Roux S.
- Emerson J.B.
- Solonenko N.
- Vik D.R.
- Solden L.
- Ellenbogen J.
- Runyon A.T.
- Bolduc B.
- Woodcroft B.J.
- Saleska S.R.
- Tyson G.W.
- Wrighton K.C.
- Sullivan M.B.
- Rich V.I.
Soil viruses are underexplored players in ecosystem carbon processing.
]. A study conducted by Trubl et al. [
[43]- Trubl G.
- Bin Jang H.
- Roux S.
- Emerson J.B.
- Solonenko N.
- Vik D.R.
- Solden L.
- Ellenbogen J.
- Runyon A.T.
- Bolduc B.
- Woodcroft B.J.
- Saleska S.R.
- Tyson G.W.
- Wrighton K.C.
- Sullivan M.B.
- Rich V.I.
Soil viruses are underexplored players in ecosystem carbon processing.
] found that enrichment for VLP metagenomes outperformed bulk metagenomes 2-fold in overall viral recovery. An additional study conducted by Gregory et al. [
[44]- Gregory A.C.
- Zablocki O.
- Zayed A.A.
- Howell A.
- Bolduc B.
- Sullivan M.B.
The gut virome database reveals age-dependent patterns of virome diversity in the human gut.
] found that the use of a bulk metagenomic methodology resulted in biased recovery of viruses and prophage that were actively infecting bacteria where enrichment for VLPs resulted in greater amounts of free viral particles. For the proof-of-concept of this study, we provided alternative biomarkers from previous studies (i.e. human or bacterial communities) just using a bulk metagenomic methodology with laboratory filtering and bioinformatic data filtering methods. In the context of forensics, intuitively one would want to target free viral particles seeing as how these particles have the greater potential to be transferred from a subject to an evidentiary object whereas targeting prophages implies the necessity of host-specific bacteria within the collected microbiome sample. By utilizing viral enrichment methods, we were able to identify and recover eukaryotic and human infecting viruses which will not be affected by as many environmental factors such as that of antibiotic drugs or hand washing using antibacterial soap. Additionally, by targeting human-associated viruses this will allow for the generation of additional biomarkers for instances when bacterial biomarkers are affected or cannot be used such as that of the environmental effects previously mentioned.
The sequence data isolated from 42 study participants across five time points for 3 sampling locations on the human body was assembled. Of the 62,101 assembled contigs with > 1000 bp, 1298 contigs were identified as viral based on current database dependent identification tools. This is a small percentage of the overall metagenome assembly. Due to the fact that annotated putatively viral genes within the assembled contigs were taxonomically identified by comparing to current viral databases, our data indicated that these reference databases are lacking in viral diversity and this lack of representation makes it difficult to study viromes in any environment. Thus, similar studies coupled with genome annotations are greatly needed to improve viral annotation. As such, this study should be viewed as a first step towards uncovering viral diversity and stability in the human virome in addition to its application to forensics.
Of the identified viral contigs, both double and single stranded DNA viruses were identified. The most abundant double stranded DNA viruses detected were those of bacteriophage belonging to the viral taxonomic order Caudovirales. Previous studies have found the skin virome diversity to be mainly composed of Caudovirales bacteriophage [
15- Hannigan G.D.
- Meisel J.S.
- Tyldsley A.S.
- Zheng Q.
- Hodkinson B.P.
- Sanmiguel A.J.
- Minot S.
- Bushman F.D.
- Grice E.A.
The human skin double-stranded DNA virome: topographical and temporal diversity, genetic enrichment, and dynamic associations with the host microbiome.
,
42- Tirosh O.
- Conlan S.
- Deming C.
- Lee-Lin S.Q.
- Huang X.
- Barnabas B.B.
- Bouffard G.G.
- Brooks S.Y.
- Marfani H.
- Dekhtyar L.
- Guan X.
- Han J.
- ling Ho S.
- Legaspi R.
- Maduro Q.L.
- Masiello C.A.
- McDowell J.C.
- Montemayor C.
- Mullikin J.C.
- Park M.
- Riebow N.L.
- Schandler K.
- Scharer C.
- Schmidt B.
- Sison C.
- Stantripop S.
- Thomas J.W.
- Thomas P.J.
- Vemulapalli M.
- Young A.C.
- Su H.C.
- Freeman A.F.
- Segre J.A.
- Kong H.H.
Expanded skin virome in DOCK8-deficient patients.
] and our findings here were in agreement with this observation. The viral families identified in this study that fall under the order Caudovirales included Siphoviridae, Podoviridae, and Herelleviridae. These viral families include bacteriophage that are obligately associated with bacteria commonly found in the skin microbiome (i.e.,
Staphylococcus and
Streptococcus) [
15- Hannigan G.D.
- Meisel J.S.
- Tyldsley A.S.
- Zheng Q.
- Hodkinson B.P.
- Sanmiguel A.J.
- Minot S.
- Bushman F.D.
- Grice E.A.
The human skin double-stranded DNA virome: topographical and temporal diversity, genetic enrichment, and dynamic associations with the host microbiome.
,
45- Foulongne V.
- Sauvage V.
- Hebert C.
- Dereure O.
- Cheval J.
- Gouilh M.A.
- Pariente K.
- Segondy M.
- Burguière A.
- Manuguerra J.C.
- Caro V.
- Eloit M.
Human skin microbiota: high diversity of DNA viruses identified on the human skin by high throughput sequencing.
,
46- van Zyl L.J.
- Abrahams Y.
- Stander E.A.
- Kirby-McCollough B.
- Jourdain R.
- Clavaud C.
- Breton L.
- Trindade M.
Novel phages of healthy skin metaviromes from South Africa.
]. With bacteria being the dominant component in the skin microbiome, it is logical that the abundance of bacteriophages would show a similar level of abundance in the human skin virome. These bacteriophages may help control bacterial populations on the skin and may help structure the bacterial communities of the human skin microbiome with relation to environmental inputs (seasonality, humidity, etc.) and lifestyle of the person (travel, activities, etc.). In addition to double stranded DNA viruses, a large amount of single stranded DNA viruses were identified in the skin virome samples. Small circular DNA viruses, those that are typically associated with eukaryotes, such as those found in the viral families of the Adenoviridae, Anelloviridae, Circoviridae, Herpesviridae, Papillomaviridae, and Polyomaviridae, have all previously been reported to be associated with the human skin virome. A large proportion of the single stranded viruses identified are Papillomaviruses, which are common skin associated viruses that may act as opportunistic pathogens [
[47]- Pastrana D.V.
- Peretti A.
- Welch N.L.
- Borgogna C.
- Olivero C.
- Badolato R.
- Notarangelo L.D.
- Gariglio M.
- FitzGerald P.C.
- McIntosh C.E.
- Reeves J.
- Starrett G.J.
- Bliskovsky V.
- Velez D.
- Brownell I.
- Yarchoan R.
- Wyvill K.M.
- Uldrick T.S.
- Maldarelli F.
- Lisco A.
- Sereti I.
- Gonzalez C.M.
- Androphy E.J.
- McBride A.A.
- Van Doorslaer K.
- Garcia F.
- Dvoretzky I.
- Liu J.S.
- Han J.
- Murphy P.M.
- McDermott D.H.
- Buck C.B.
Metagenomic discovery of 83 new human papillomavirus types in patients with immunodeficiency.
]. We also observed a high abundance of the small circular single-stranded DNA viruses belonging to the phylum of Cressdnaviricota across all subjects in this study (
Fig. 1,
Supplementary Fig. 3). These viruses have previously not been reported at high abundance in the human skin virome. The lack of Cressdnaviricota virus reporting in previous human skin virome studies may be attributed to the recent discovery of novel Cress-DNA viruses and their recent addition to NCBI databases [
15- Hannigan G.D.
- Meisel J.S.
- Tyldsley A.S.
- Zheng Q.
- Hodkinson B.P.
- Sanmiguel A.J.
- Minot S.
- Bushman F.D.
- Grice E.A.
The human skin double-stranded DNA virome: topographical and temporal diversity, genetic enrichment, and dynamic associations with the host microbiome.
,
41The human virome: assembly, composition and host interactions.
,
45- Foulongne V.
- Sauvage V.
- Hebert C.
- Dereure O.
- Cheval J.
- Gouilh M.A.
- Pariente K.
- Segondy M.
- Burguière A.
- Manuguerra J.C.
- Caro V.
- Eloit M.
Human skin microbiota: high diversity of DNA viruses identified on the human skin by high throughput sequencing.
,
48- Tisza M.J.
- Pastrana D.V.
- Welch N.L.
- Stewart B.
- Peretti A.
- Starrett G.J.
- Pang Y.Y.S.
- Krishnamurthy S.R.
- Pesavento P.A.
- McDermott D.H.
- Murphy P.M.
- Whited J.L.
- Miller B.
- Brenchley J.
- Rosshart S.P.
- Rehermann B.
- Doorbar J.
- Ta’ala B.A.
- Pletnikova O.
- Troncoso J.C.
- Resnick S.M.
- Bolduc B.
- Sullivan M.B.
- Varsani A.
- Segall A.M.
- Buck C.B.
Discovery of several thousand highly diverse circular DNA viruses.
]. Of the contigs that were annotated under
Cressdnaviricota many of them only displayed minimal similarity to reference Cressdnaviricota viruses suggesting we have identified a group that is more diverse than previously reported. These viruses did contain a
Rep gene which is specific to the phylum Cressdnaviricota, however they had low sequence similarity to the sequences in the current database suggesting the viruses identified in this study could be novel viruses related to Cressdnaviricota and other small DNA viruses.
Many contigs that were identified as being of viral origin were small and circular in nature. These viruses have similarity to other small circular DNA viruses in current viral reference databases. However, due to their low percent identity to any known virus or organism they remained unclassified in our dataset at lower taxonomic levels (e.g., at the level of viral genus or species). This was especially evident in the unclassified viruses having similarity to viruses belonging to family Microviridae. As previously mentioned, the skin virome contains many novel viruses, particularly noted here in the family Microviridae. In order to better characterize the viral diversity in the human skin virome, more research into viral discovery is needed. Since there was a large proportion of viral contigs having no similarity to characterized viruses, we implemented a database independent approach paired with database dependent methods to alleviate annotation inaccuracies and missing reference material.
To evaluate the applicability of the human virome for subject identification, viral contig diversity and abundance across locations and subjects was further analyzed to identify the top ten most abundant viral families for each subject by sampling location (
Fig. 2). Distinct differences in the relative abundance of these contigs were observed across individuals. Relative abundance across locations within an individual was also visualized in
Fig. 2 and similarity across all three locations was observed within most individuals. However, differences were noted for some subjects between locations (right hand, left hand, and scalp). It is possible that this may have to do with hand dominance coming into contact with that individual's hair and scalp thus sharing similar viral taxa across locations. The main difference observed across these locations is either the addition of a viral family or the loss of one.
The relative abundance of the most abundant annotated contigs visually displayed greater variation between subjects compared to that of within subject variation across sampling locations (
Fig. 2). This suggests viral marker presence can be discriminatory for differentiation of individuals within the study population. It is important to note, however, that
Fig. 2 is a comparison of only subject and location within subject, with time of sampling and viral stability not taken into account. In
Fig. 2, we evaluated the top ten most abundant taxa for each sampling anatomical location and subject summed across the five collection time points. Therefore, this comparison should not be used as an ultimate viral marker set for human identification purposes and thus further evaluation into viral stability was conducted.
Not only do viral markers need to be diagnostic for individuals, but they also need to be stable over time. Therefore, for forensic purposes, we recognize that the core stable virome needs to be targeted for biomarker development. The virome, especially those viruses found on body locations that are consistently exposed and interact with objects and the environment, will potentially have a large temporal component in addition to the stable component [
15- Hannigan G.D.
- Meisel J.S.
- Tyldsley A.S.
- Zheng Q.
- Hodkinson B.P.
- Sanmiguel A.J.
- Minot S.
- Bushman F.D.
- Grice E.A.
The human skin double-stranded DNA virome: topographical and temporal diversity, genetic enrichment, and dynamic associations with the host microbiome.
,
39- Oh J.
- Byrd A.L.
- Park M.
- Kong H.H.
- Segre J.A.
Temporal stability of the human skin microbiome.
]. Temporal viruses do have the potential to provide informative forensic information that would be unique to certain lifestyle characteristics like occupation, contact with animals, recent travel, or hobbies (such as gardening, etc.). Temporal viruses, while not stable for direct forensic applications, could provide important information towards identifying circumstantial characteristics specific to an individual. However, for the purpose of acting as an alternative genetic source to traditional STR methods, target viral markers should be stable over time, and therefore, we required that viral markers be consistently present within an individual's skin virome during our sample collecting regime, which we repeated for each individual at all three locations over a time course of six months.
For the purposes of this study, we considered a virus to be stable if a particular consistently annotated virus was present in at least four out of the five time points collected within a single anatomical location for an individual. Of the viral families identified in the overall assembly, 15 families were considered to have stability in at least one location within an individual. Of the 15 families, nine viral families presented stability in at least one individual for all three body location sites. Of note, many of the nine families fell under the order of Caudovirales. As previously mentioned, several studies have evaluated and classified the core bacterial microbiome on the skin and thus it is not surprising that bacteriophage would be both present and temporally stable seeing as how their bacterial hosts are also present and stable on human skin [
39- Oh J.
- Byrd A.L.
- Park M.
- Kong H.H.
- Segre J.A.
Temporal stability of the human skin microbiome.
,
45- Foulongne V.
- Sauvage V.
- Hebert C.
- Dereure O.
- Cheval J.
- Gouilh M.A.
- Pariente K.
- Segondy M.
- Burguière A.
- Manuguerra J.C.
- Caro V.
- Eloit M.
Human skin microbiota: high diversity of DNA viruses identified on the human skin by high throughput sequencing.
,
49- Byrd A.L.
- Belkaid Y.
- Segre J.A.
The human skin microbiome.
]. In a few older studies, Papillomaviruses were found to not be stable or not less stable as the communities of certain phage viruses as well as both bacterial and fungal microbiome communities [
15- Hannigan G.D.
- Meisel J.S.
- Tyldsley A.S.
- Zheng Q.
- Hodkinson B.P.
- Sanmiguel A.J.
- Minot S.
- Bushman F.D.
- Grice E.A.
The human skin double-stranded DNA virome: topographical and temporal diversity, genetic enrichment, and dynamic associations with the host microbiome.
,
39- Oh J.
- Byrd A.L.
- Park M.
- Kong H.H.
- Segre J.A.
Temporal stability of the human skin microbiome.
]. However, in this study, in addition to families of Caudovirales, the family of Papillomaviridae was found to be stable which is consistent with findings observed in another study [
[42]- Tirosh O.
- Conlan S.
- Deming C.
- Lee-Lin S.Q.
- Huang X.
- Barnabas B.B.
- Bouffard G.G.
- Brooks S.Y.
- Marfani H.
- Dekhtyar L.
- Guan X.
- Han J.
- ling Ho S.
- Legaspi R.
- Maduro Q.L.
- Masiello C.A.
- McDowell J.C.
- Montemayor C.
- Mullikin J.C.
- Park M.
- Riebow N.L.
- Schandler K.
- Scharer C.
- Schmidt B.
- Sison C.
- Stantripop S.
- Thomas J.W.
- Thomas P.J.
- Vemulapalli M.
- Young A.C.
- Su H.C.
- Freeman A.F.
- Segre J.A.
- Kong H.H.
Expanded skin virome in DOCK8-deficient patients.
]. Interestingly both Baculoviridae and Genomoviridae were also found to be stable across multiple individuals in all three physical locations sampled in this study. Notably,
Baculoviridae are sometimes used for bioengineering purposes in sequencing laboratories so there is a chance that their presence in our study is due to contamination from a sequencing provider, however, we think this is highly unlikely as certain Baculoviridae have the ability to infect mammalian cells, such as
Autographa californica multi-nuclear polyhedrosis virus. Therefore, our identification of similar viruses may potentially be a true biological observation that has not previously been accounted for due to other studies assuming it was contamination [
[50]- van Loo N.-D.
- Fortunati E.
- Ehlert E.
- Rabelink M.
- Grosveld F.
- Scholte B.J.
Baculovirus infection of nondividing mammalian cells: mechanisms of entry and nuclear transport of capsids.
]. Although Genomoviridae have previously been identified from human vaginal samples, very few studies have explored these small single stranded DNA viruses and their diversity and stable presence as a part of the human skin core virome [
[51]- Siqueira J.D.
- Curty G.
- Xutao D.
- Hofer C.B.
- Machado E.S.
- Seuánez H.N.
- Soares M.A.
- Delwart E.
- Soares E.A.
Composite analysis of the virome and bacteriome of HIV/HPV co-infected women reveals proxies for immunodeficiency.
]. Interestingly, previous studies have identified Genomoviridae as being a stable and persistent virus in other mammals such as that of bats [
[52]- Bolatti E.M.
- Zorec T.M.
- Montani M.E.
- Hošnjak L.
- Chouhy D.
- Viarengo G.
- Casal P.E.
- Barquez R.M.
- Poljak M.
- Giri A.A.
A preliminary study of the virome of the south american free-tailed bats (Tadarida brasiliensis) and identification of two novel mammalian viruses.
].
Of the nine viral families that we identified as being temporally stable, their continuously observed persistence was found across many, though not all, of the subjects in our study (
Fig. 3). This is potentially due to certain viral genera or species within a particular family being temporally stable as opposed to similar genera or species within that same family that may not have been temporally stable. Additionally, there is a possibility that our viral metagenome assemblies could be biased due to the high amount of repetitive sequences seen in some particular viruses. Therefore, in order to reduce assembly-based bias and increase annotation, we trimmed sample reads and mapped that data to all genome sequences available in NCBI’s nucleotide database pertaining to the order Caudovirales and the viral families Papillomaviridae, Genomoviridae, and Baculoviridae
, to improve viral detection and sample annotation. Species level mapped NCBI reference genomes were evaluated during this process. In addition to stability of reference mapped counts, species level stability was also assessed for viral contigs. Species that were identified as being persistent across all three locations in at least four out of the five time points for an individual were considered for putative viral markers for human identification.
In order to address viral diversity missing from databases, we first filtered sequencing reads on the basis of base-call quality and then mapped the sequencing reads to the total contigs in our viral metagenome assembly. Contig sequences that were stable within an individual for each sampling location were analyzed with Blast using the Blastn alignment algorithm. Contigs that did not share a high percent similarity (> 70%) to known non-viral genomes were considered to be of potential viral origin and retained in the analysis. Of the contigs that were deemed to be of potential viral origin, those that were stable across all three body locations were considered to be additional putative viral markers for human identification, though their exact taxonomic classification is not known because they did not show sequence similarity or homology to any reference sequences.
In this study we identified a total of 188 viral markers, which included all stable NCBI reference mapped species, stable viral contig annotated species, and stable potential viral contigs that showed limited sequence similarity to databases or included viral-like gene regions. Of those 188 viral markers, 59 markers were found to be stable across all three body site locations sampled (
Fig. 4D). These 59 target viral markers are proposed as having potential for the purpose of human identification, as shown across 42 individual test subjects. Of the 59 viral markers, seven were persistent across > 90% of the sample population (
Fig. 5). These seven markers may not be usable for subject-to-subject individualization, in regard to their presence or absence in the virome of an individual due to their low power of discrimination. However, they are prime candidates for genetic polymorphism analysis across a sampled population which we suspect will add an additional level of marker exploitation at the scale of single nucleotide variation, as well as presence/absence of insertions and deletions at the nucleotide level. As for the other 52 markers, they may be used as a presence/absence basis for human identification.
Distinct profile patterns were observed across the subjects (
Fig. 5) for specific viral taxa. In particular, for subject participants P15-P21, they shared similar presence/absence profiles for certain viruses such as that of
Gammapapillomaviruses which were not observed in the rest of the population. The presence of these distinct profile subpopulations is not due to sample processing or sequencing contamination due to the randomization of processing of samples and is believed to be a true biological finding. However, it is noted that for the subpopulation of P15-P21 these subjects frequently shared physical space and objects or were a cohabiting partner of one of the members of the aforementioned group. Not only does this subpopulation stand out from the rest of the population, but they can also be distinguished from one another based on their profiles. This suggests that not only can the virome be used as an individualizing characteristic but also bears circumstantial lifestyle or environmental characteristics. This data indicates that additional work is merited to examine subpopulations of people that cohabitate or share working environments.
The identified markers were found to be more significantly similar within subjects across time points compared to between subjects using presence/absence of the three identified marker sets. The more markers that were used (i.e., the combination of all sets) resulted in a greater significance in the similarity differences (P = 5.3 × 10−15). Thus, the addition of sets B and C were necessary in development of a more stable set for human identification profile production and evaluation. Diversity across subjects was also evaluated. We found that subject to subject variation was a significant variable associated with both ɑ-diversity and β-diversity. Therefore, showing that not only are these viral markers stable but there is significant difference in subject-to-subject diversity which highlights the ability to separate the skin virome of one individual from another.