Highlights
- •A Cloud-based system was developed to analyze next-generation sequencing data.
- •Data from a variety of bench top and portable DNA sequencers were analyzed.
- •Results are reported following ISFG guidelines and stored in a relational database.
- •Benefits included on-demand scalability, ease-of-use, access controls, and security.
Abstract
Next-generation Sequencing (NGS) is a rapidly evolving technology with demonstrated
benefits for forensic genetic applications, and the strategies to analyze and manage
the massive NGS datasets are currently in development. Here, the computing, data storage,
connectivity, and security resources of the Cloud were evaluated as a model for forensic
laboratory systems that produce NGS data. A complete front-to-end Cloud system was
developed to upload, process, and interpret raw NGS data using a web browser dashboard.
The system was extensible, demonstrating analysis capabilities of autosomal and Y-STRs
from a variety of NGS instrumentation (Illumina MiniSeq and MiSeq, and Oxford Nanopore
MinION). NGS data for STRs were concordant with standard reference materials previously
characterized with capillary electrophoresis and Sanger sequencing. The computing
power of the Cloud was implemented with on-demand auto-scaling to allow multiple file
analysis in tandem. The system was designed to store resulting data in a relational
database, amenable to downstream sample interpretations and databasing applications
following the most recent guidelines in nomenclature for sequenced alleles. Lastly,
a multi-layered Cloud security architecture was tested and showed that industry standards
for securing data and computing resources were readily applied to the NGS system without
disadvantageous effects for bioinformatic analysis, connectivity or data storage/retrieval.
The results of this study demonstrate the feasibility of using Cloud-based systems
for secured NGS data analysis, storage, databasing, and multi-user distributed connectivity.
Keywords
To read this article in full you will need to make a payment
Purchase one-time access:
Academic & Personal: 24 hour online accessCorporate R&D Professionals: 24 hour online accessOne-time access price info
- For academic or personal research use, select 'Academic and Personal'
- For corporate R&D use, select 'Corporate R&D Professionals'
Subscribe:
Subscribe to Forensic Science International: GeneticsAlready a print subscriber? Claim online access
Already an online subscriber? Sign in
Register: Create an account
Institutional Access: Sign in to ScienceDirect
References
- Molecular analysis of the human mitochondrial DNA control region for forensic identity testing.Curr. Protoc. Hum. Genet. 2012; (Chapter 14 Unit14.7)
- Massively parallel sequencing of short tandem repeats-Population data and mixture analysis results for the PowerSeq system.Forensic Sci. Int. Genet. 2016; 24: 86-96
- Development and assessment of an optimized next-generation DNA sequencing approach for the mtgenome using the Illumina MiSeq.Forensic Sci. Int. Genet. 2014; 13: 20-29
- An evaluation of the PowerSeq Auto System: a multiplex short tandem repeat marker kit compatible with massively parallel sequencing.Forensic Sci. Int. Genet. 2015; 19: 172-179
- Evaluation of the Illumina ForenSeq DNA Signature Prep Kit − MPS forensic application for the MiSeq FGx benchtop sequencer.Forensic Sci. Int. Genet. 2017; 28: 188-194
- Performance and concordance of the ForenSeq system for autosomal and Y chromosome short tandem repeat sequencing of reference-type specimens.Forensic Sci. Int. Genet. 2017; 28: 1-9
- Evaluation of the Illumina((R)) beta version ForenSeq DNA signature prep kit for use in genetic profiling.Forensic Sci. Int. Genet. 2016; 20: 20-29
- Developmental validation of the MiSeq FGx forensic genomics system for targeted next generation sequencing in forensic DNA casework and database laboratories.Forensic Sci. Int. Genet. 2017; 28: 52-70
- Evaluation of the Precision ID Ancestry Panel for crime case work: a SNP typing assay developed for typing of 165 ancestral informative markers.Forensic Sci. Int. Genet. 2017; 28: 138-145
- Statistical modeling of ion PGM HID STR 10-plex MPS data.Forensic Sci. Int. Genet. 2017; 28: 82-89
- Compare Illumina Sequencers.2017 (https://www.illumina.com/systems/sequencing-platforms.html (2017). Accessed 10 Mar 2017)
- Ten years of next-generation sequencing technology.Trends Genet. 2014; 30: 418-426
- Forensic SNP genotyping using nanopore MinION sequencing.Sci. Rep. 2017; 7: 41759
- Big Data.1 st ed. John Wiley & Sons Inc., GB2015
- Cloud Adoption Practices and Priorities Survey Report.Computer Security Alliance, 2015: 13
- AWS Case Studies.2017 (https://aws.amazon.com/solutions/case-studies/(2017). Accessed 10 Mar 2017)
- The Business Impact of the Cloud.2012: 19
- Short-read, high-throughput sequencing technology for STR genotyping.Biotech. Rapid Dispatches. 2012; 2012: 1-6
- STRait razor: a length-based forensic STR allele-calling tool for use with second generation sequencing data.Forensic Sci. Int. Genet. 2013; 7: 409-417
- STRait Razor v2.0: the improved STR Allele Identification Tool.Forensic Sci. Int. Genet. 2015; 14: 182-186
- My-Forensic-Loci-queries (MyFLq) framework for analysis of forensic STR data generated by massive parallel sequencing.Forensic Sci. Int. Genet. 2014; 9: 1-8
- FDSTools: A software package for analysis of massively parallel sequencing data with the ability to recognise and correct STR stutter and other PCR or sequencing noise.Forensic Sci. Int. Genet. 2017; 27: 27-40
- lobSTR: A short tandem repeat profiler for personal genomes.Genome Res. 2012; 22: 1154-1162
- Profiling short tandem repeats from short reads.Methods Mol. Biol. 2013; 1038: 113-135
- Evaluation of GeneMarker(R) HTS for improved alignment of mtDNA MPS data haplotype determination, and heteroplasmy assessment.Forensic Sci. Int. Genet. 2017; 28: 90-98
- Massively parallel sequencing of forensic STRs: considerations of the DNA commission of the International Society for Forensic Genetics (ISFG) on minimal nomenclature requirements.Forensic Sci. Int. Genet. 2016; 22: 54-63
- Poretools: a toolkit for analyzing nanopore sequence data.Bioinformatics. 2014; 30: 3399-3401
- STRait Razor v2s: advancing sequence-based STR allele reporting and beyond to other marker systems.Forensic Sci. Int. Genet. 2017; 29: 21-28
- Introduction to AWS Security.2015
- AWS White Paper: Introduction to AWS Security.2016: 79
- Forensic science, genetics and wildlife biology: getting the right mix for a wildlife DNA forensics lab.Forensic Sci. Med. Pathol. 2010; 6: 172-179
Article info
Publication history
Published online: August 08, 2017
Accepted:
August 6,
2017
Received in revised form:
July 24,
2017
Received:
March 16,
2017
Identification
Copyright
© 2017 Elsevier B.V. All rights reserved.