The need for standardized procedures is of paramount importance in the highly regulated
and often controversial field of genetically modified organisms (GMOs) and food forensics
[
1
,
2
,
3
]. A genetically modified (GM) or transgenic plant contains a gene or genes that have
been introduced artificially into their genome in order to improve agronomic traits
[
4
,
5
]. Despite the benefits, public concern regarding the potential impacts of GMO on human
health and environment has increased over the years [
6
,
7
]. These concerns have fuelled the implementation of strict legislation on the planting,
marketing, labelling and trade of GMOs. The collaboration between research and enforcement
laboratories (e.g., the European Network of GMO Laboratories) has resulted in the
harmonisation and standardisation of means and methods for GMOs analysis. The use
of certified reference materials (CRMs) is mandatory for the certification and accreditation
of laboratories [
[8]
]. The CRMs are available in the form of powder, extracted DNA or plasmids with transgenic
DNA of a known GMO and are used for the validation of analytical procedures regarding
the detection and quantification of GMOs [
[9]
]. In contrast with the accessibility of CRMs, professionals working with GMOs face
great difficulties when looking for reference DNA sequences of transgenic elements.
Several countries have established a legal framework for risk assessment and authorization
of GMOs that includes freely accessible databases [
10
,
11
,
12
,
13
], however, there is no indication of which reference sequences should be used for
indexing purposes. In fact, the transgenic sequences used in many GMOs are usually
kept confidential by biotech companies in order to protect their technology against
competitors. It is also difficult to find the most common transgenic sequences used
in many commercialized GMOs by searching public databases (e.g., NCBI Entrez Nucleotide
database). The problem often resides in the different names used for the same element
(e.g., P35S, p35S, P-35s, CMV 35S, CaMV 35S), making it difficult to find the correct
sequence. Moreover, the sequences used in GMOs (e.g., inserts or plasmids) are retrieved
simultaneously with hundreds of sequences from population studies of the donor species,
which makes it very difficult to choose an adequate sequence for reference purposes.
In many cases, a search for a transgenic element only retrieves complete chromosome
sequences or large sequence contigs, forcing the researcher to do additional sequence
analyses with specialized software. Furthermore, there is no sequence information
available for some elements in public databases, particularly for junction regions
between elements, which are often the target of DNA-based detection methods. In this
case, the researcher has to reconstruct the junction regions by the tedious process
of downloading, aligning and editing sequences from different elements. The absence
of reference sequences also hinders the description of methods and communication of
results among laboratories. The simple process of describing polymorphisms on a transgenic
element or sharing the location of PCR primers is problematic and highly prone to
mistakes without a proper reference. By all these reasons, it is becoming evident
that the existence of an organized list of reference sequences for the most common
transgenic elements would facilitate the development, study and detection of GMOs.
The use of reference sequences is a common practice on forensic investigations [
14
,
15
]. Here we propose the standardization of transgenic reference sequences and provide
a catalogue of curated sequences for the most common transgenic elements used in GM
plants. The list is freely accessible via the web at http://portugene.com/GMOrefseq.html and a subset of the database is described in Table 1 as an example. The reference sequences are organized in a dynamic table with hyperlinks
to the NCBI Sequence Viewer, Taxonomy and PubMed. The sequences can be visualized
and downloaded with full annotations. Our reference dataset was also compared by blast
with the GMO-related sequences available in the JRC GMO-Amplicons database [
[13]
].
Table 1List of reference sequences for some common transgenic elements used in genetically
modified organisms (GMOs). A full list of references can be found at http://portugene.com/GMOrefseq.html.
Genetic element | Reference sequence | ||||
---|---|---|---|---|---|
Abbreviation | Genomic region | Donor organism | Accession number | Source | Reference |
P35S (P-35s) | Cauliflower Mosaic Virus (CaMV) 35S promoter | Cauliflower mosaic virus | NC_001497.1 | Cauliflower mosaic virus |
[21]
|
FMV35S (P-FMV) | Figwort mosaic virus 35S promoter | Figwort mosaic virus | NC_003554.1 | Figwort mosaic virus |
[22]
|
T-nos | Nopaline Synthase Gene Terminator | Agrobacterium tumefaciens | EU880444.1 | Oryza sativa Indica Group |
17
,
23
|
bar | Glufosinate ammonium tolerance gene (codes for phosphinothricin acetyltransferase − PAT) | Streptomyces hygroscopicus | X05822.1 | Streptomyces hygroscopicus |
[24]
|
Cry1Ab | Cry1Ab delta-endotoxin gene | Bacillus thuringiensis | AX392802.1 | Synthetic construct |
[25]
|
cp4epsps | 5-enolpyruvulshikimate-3-phosphate synthase gene (epsps) | Agrobacterium tumefaciens strain CP4 | AB209952.1 (298–1881) | Synthetic construct (Glycine max) | GenBank direct submission |
ctp2-cp4epsps | Chloroplast transit peptide (ctp2) + 5-enolpyruvulshikimate-3-phosphate synthase gene (epsps) |
Arabidopsis thaliana (ctp2) + Agrobacterium tumefaciens strain CP4 (epsps) |
FN550387.1 + JN400385.1 + AB209952.1 | Concatenated sequence |
17
,
26
|
P35S-bar | CaMV P-35S promoter + Glufosinate ammonium tolerance gene | Cauliflower Mosaic Virus + Streptomyces hygroscopicus | NC_001497.1 (6101–7445) + X05822.1 | Concatenated sequence (Cauliflower Mosaic Virus + Streptomyces hygroscopicus) |
21
,
24
|
To read this article in full you will need to make a payment
Purchase one-time access:
Academic & Personal: 24 hour online accessCorporate R&D Professionals: 24 hour online accessOne-time access price info
- For academic or personal research use, select 'Academic and Personal'
- For corporate R&D use, select 'Corporate R&D Professionals'
Subscribe:
Subscribe to Forensic Science International: GeneticsAlready a print subscriber? Claim online access
Already an online subscriber? Sign in
Register: Create an account
Institutional Access: Sign in to ScienceDirect
References
- Food analysis using organelle DNA and the effects of processing on assays.Annu. Rev. Food Sci Technol. 2017; 8: 57-74
- Food forensics: using DNA technology to combat misdescription and fraud.Trends Biotechnol. 2004; 22: 222-226
- Applications of DNA-based methods in food forensics.Handbook of Forensic Genetics: Biodiversity and Heredity in Civil and Criminal Investigation. World Scientific, 2017 (p. 493–517)
- Biotechnology: transgenic crops in natural habitats.Nature. 2001; 409: 682-683
- The risks and benefits of genetically modified crops: a multidisciplinary perspective.Conserv. Ecol. 2000; 4
- Genetically modified foods: safety, risks and public concerns—a review.J. Food Sci. Technol. 2013; 50: 1035-1046
- Genetically modified plants-the debate continues.Trends Ecol. Evol. 2000; 15: 14-18
- How to Reliably Test for GMOs.Springer, 2012
- Production of certified reference materials for the detection of genetically modified organisms.J. AOAC Int. 2002; 85: 775-779
- The GMOseek matrix: a decision support tool for optimizing the detection of genetically modified plants.BMC Bioinf. 2013; 14: 256
- GMOMETHODS: the European union database of reference methods for GMO analysis.J. AOAC Int. 2012; 95: 1713-1719
- GMDD: a database of GMO detection methods.BMC Bioinf. 2008; 9: 260
- JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms.Database. 2015; (bav101)
- ISFG: recommendations regarding the use of non-human (animal) DNA in forensic genetic investigations.Forensic Sci. Int.: Genet. 2011; 5: 501-505
- A cautionary note on switching mitochondrial DNA reference sequences in forensic genetics.Forensic Sci. Int.: Genet. 2012; 6: e182-e184
- Genetically Modified (GM) Crops: Molecular and Regulatory Details.Centre for Biosafety and Sustainability, Basel, Švica, BATS2003
- A practical approach to screen for authorised and unauthorised genetically modified plants.Anal. Bioanal. Chem. 2010; 396: 2065-2072
- Next generation sequencing and its applications in forensic genetics.Forensic Sci. Int.: Genet. 2015; 18: 78-89
- Next-generation sequencing as a tool for the molecular characterisation and risk assessment of genetically modified plants: added value or not?.Trends Food Sci. Technol. 2015; 45: 319-326
- Statistical framework for detection of genetically modified organisms based on next generation sequencing.Food Chem. 2016; 192: 788-798
- Nucleotide sequence of cauliflower mosaic virus DNA.Cell. 1980; 21: 285-294
- Sequence of figwort mosaic virus DNA (caulimovirus group).Nucleic Acids Res. 1987; 15: 8451-8466
- Real-time PCR method for detection of the transgenic rice event TT51-1.Food Chem. 2010; 119: 417-422
- Characterization of the herbicide-resistance gene bar from Streptomyces hygroscopicus.EMBO J. 1987; 6: 2519
N. Carozzi, S. Rabe, P. Miles, G. Warren, P. De Haan, Novel insecticidal toxins derived from Bacillus thuringiensis crystal proteins, Patent WO0215701. 2002.
- Expression of the Arabidopsis thaliana BBX32 gene in soybean increases grain yield.PLoS One. 2012; 7 (e30717-e)
Article info
Publication history
Published online: May 02, 2017
Received:
March 28,
2017
Identification
Copyright
© 2017 Elsevier B.V. All rights reserved.