High Throughput Sequencing for Detection of Foodborne Pathogens

Sekse, Camilla; Holst-Jensen, Arne; Dobrindt, Ulrich; Johannessen, Gro S.; Li, Weihua; Spilsberg, Bjørn; Shi, Jianxin

doi:10.3389/fmicb.2017.02029

REVIEW article

Front. Microbiol., 20 October 2017

Sec. Food Microbiology

Volume 8 - 2017 | https://doi.org/10.3389/fmicb.2017.02029

High Throughput Sequencing for Detection of Foodborne Pathogens

$\r\nCamilla Sekse&#x;$ Camilla Sekse¹^†

Arne Holst-Jensen¹^†^*

¹Department of Animal Health and Food Safety, Norwegian Veterinary Institute, Oslo, Norway
²Institute of Hygiene, University of Münster, Münster, Germany
³Joint International Research Laboratory of Metabolic and Developmental Sciences, Shanghai Jiao Tong University–University of Adelaide Joint Centre for Agriculture and Health, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
⁴Department of Analysis and Diagnostics, Norwegian Veterinary Institute, Oslo, Norway

High-throughput sequencing (HTS) is becoming the state-of-the-art technology for typing of microbial isolates, especially in clinical samples. Yet, its application is still in its infancy for monitoring and outbreak investigations of foods. Here we review the published literature, covering not only bacterial but also viral and Eukaryote food pathogens, to assess the status and potential of HTS implementation to inform stakeholders, improve food safety and reduce outbreak impacts. The developments in sequencing technology and bioinformatics have outpaced the capacity to analyze and interpret the sequence data. The influence of sample processing, nucleic acid extraction and purification, harmonized protocols for generation and interpretation of data, and properly annotated and curated reference databases including non-pathogenic “natural” strains are other major obstacles to the realization of the full potential of HTS in analytical food surveillance, epidemiological and outbreak investigations, and in complementing preventive approaches for the control and management of foodborne pathogens. Despite significant obstacles, the achieved progress in capacity and broadening of the application range over the last decade is impressive and unprecedented, as illustrated with the chosen examples from the literature. Large consortia, often with broad international participation, are making coordinated efforts to cope with many of the mentioned obstacles. Further rapid progress can therefore be prospected for the next decade.

Introduction

Foodborne Pathogens and Their Impact

Foodborne pathogens (FBPs) cause foodborne diseases (FBDs) either directly (by infectious agents) or indirectly (by toxic metabolites, i.e., bacterial toxins and mycotoxins; EMAN, 2015; Martinovic et al., 2016) and can have devastating health and economic consequences in both developed and developing countries (Pires et al., 2012; EFSA, 2014; ECDC, 2015a; Henao et al., 2015). A major fraction of FBDs are diarrheal diseases, with particularly high impact on children (Pires et al., 2015). Typical FBPs are bacteria and several viruses, but also parasites and some fungi can cause FBDs.

Conventional Microbiological Food Analyses

Microbiological analyses of foods are carried out for verification and control, surveillance, investigation of disease outbreaks or sporadic cases, or for research (Figure 1). Time and labor consuming culture dependent methods, including enrichment and/or selective steps, are often used. Selective enrichment may be crucial to capture the species of interest. Isolation of the pathogen of interest is an optimal starting point for further characterization and research, and contributes to ensure that public health agencies can perform their basic mandate of FBD surveillance and response (Forbes et al., 2017). Over the last decades traditional culture dependent methods have gradually been complemented with molecular analytical methods. Speed and costs are the main drivers of this development. Concerns about the possible lack of necessary sensitivity, specificity or correspondence between molecular findings and presence or absence of viable, pathogenic microorganisms are among factors restraining this development. Polymerase chain reaction (PCR) analysis of enriched samples from food is established for a range of FBPs improving the efficiency of screening of samples, but viable microorganisms are, in most cases, still required for definitive confirmation of positive samples. High throughput sequencing (HTS) based workflows now gradually emerge as options also for routine applications to FBP detection and characterization (Figure 1; Table 1).

FIGURE 1

Figure 1. Four sectors are considered here as potential users of high throughput sequencing (HTS) technologies for detection and characterization of foodborne pathogens (FBPs). Research (upper left) is a knowledge driver providing exploitable reference data and detection methods among others to the other three sectors (green arrows), and receives valuable data and material back from the other sectors (yellow arrows). The food industry (upper right) is legally obliged to take preventive measures and to monitor its products and production systems to prevent contamination with FBPs, with economy as a main priority driver. Documentation of the systematic efforts to maintain low risk (goal = pathogen free) products must be available for inspection. The health sector (lower left) treats patients and is usually the first to isolate and characterize outbreak-associated strains, thereby providing key information necessary for the other sectors to investigate and minimize the impact of outbreaks. The competent authorities (lower right) enforce the food law and surveil the food industry and products, but also coordinate the outbreak investigations based on data provided by the other sectors. Epidemiological data, legal acts and quality control documents are the main information sources used and shared by the competent authorities (blue arrows). Outbreak investigations have a strong focus on specific source tracking (red arrow).

TABLE 1

Table 1. Users, criteria and limitations on use of analytical methods for detection of foodborne pathogens^a.

High Throughput Sequencing

HTS can generate thousands to millions of sequence reads, and up to several hundred billion base pairs (bp) of sequence information per sample. The read length, error rate and number of reads and sequenced bases vary substantially. Selective amplification (targeted) and non-selective, random (shotgun) approaches exist. The number of high quality genomes for the most important food pathogens is already high and rapidly growing, in part benefiting from the relatively small genome sizes of most microorganisms (≤100 Mbp). In cases where a sequenced reference genome is unavailable it is necessary to perform de novo sequencing and genome assembly (Figures 2, 3). De novo assembly to obtain a draft quality genome based on high quality, short-read (< 250 bp) sequence data from single, cultured isolates is complex, but can be (semi-)automated (Emond-Rheault et al., 2017). Closing a genome often requires highly skilled bioinformaticians and long sequence reads (up to several kbp) or PCR and Sanger sequencing of the unknown gaps in scaffolds (Goodwin et al., 2015; Loman et al., 2015; Rhoads and Au, 2015). Even this can sometimes be (semi-)automated (Emond-Rheault et al., 2017). Analysis of re-sequencing data employing a mapping strategy (Figures 2, 3) is less complex, and can be (semi-)automated. It is, however, time and computer intensive and interpretation of the mapped data can be quite challenging (Goodwin et al., 2015; Loman et al., 2015; Rhoads and Au, 2015). Alignment-independent comparisons using statistic (probabilistic) approaches (k-mers; Figures 2, 3) can be much faster and automated and emerge as an attractive option at the cost of detailed resolution (Ondov et al., 2016). The optimal HTS strategy is therefore purpose dependent. Large sequence databases including thousands of completely sequenced genomes are now available, facilitating identification of common as well as rare but potentially important genes by mapping of sequence reads to the database sequences. Low error rate, long reads and high coverage facilitate sequence assembly (Figures 2, 3). Altogether, this has expanded our ability to gain a more comprehensive insight into the genetics of individual strains or species, and the microbiota (microbial species composition) and microbiome (functional gene pool of microbiota) of a very broad spectrum of sample types, including environmental, food and clinical samples.

FIGURE 2

Figure 2. Metagenomics data analysis. At least three different approaches for analysis of HTS sequence reads can be selected, but combinations are often preferred. (A) Assembly of sequences (e.g., reads) into contigs (consensus sequences) requires mapping. Sets of contigs are often further assembled into scaffolds (not shown), where the relative position of contigs is known but gaps of ± known size between the contigs remain to be closed. The example shows that four of the six reads can be assembled into a consensus contig while the two remaining reads cannot be assembled with any of the others. (B) Mapping of sequences (e.g., reads) to other sequences (e.g., in database) also requires mapping. The example shows one perfect and one partial match between two query sequences (e.g., reads) and a reference sequence. The mismatch in the partial match is shown in red. (C) Any sequence larger than one nucleotide can be divided into subsequences of length k ≥ 1. The size of k will affect the likelihood of any random k-mer being unique to a data set. A small k will reduce the number of unique k-mers. This is demonstrated in the example, as for the given reference sequence two k-mers will not be unique with k = 3, while with k = 5 all k-mers are unique. Rare k-mers or k-mer frequencies can be used to estimate relationships between two sets of sequences (e.g., two shotgun metagenomes or a sequence isolate and a reference genome).

FIGURE 3

Figure 3. Approaches to HTS sequence read analysis and their dependence on alignment, time and coverage. At least three different approaches for analysis of HTS sequence reads can be selected, but combinations are often preferred. Top right: Assembly of sequences into contigs (see also Figure 2), scaffolds and complete genome assemblies is alignment dependent, time consuming and the success probability is usually correlated with the coverage. This approach is typically taken when time is not the limiting factor and a complete assembly is desired for successive analysis and reference applications. Bottom: Mapping (see also Figure 2) of reads to existing assembly/assemblies is also alignment dependent and time consuming but can also be performed successfully at low coverage (a single read can be mapped to a reference assembly). The size of the reference (e.g., database or genome) and the degree to which mismatches are accepted will have a significant impact on the time required for data analysis (olive arrows). This approach is typically taken to determine functional aspects of metagenome and transcriptome sequences and in metataxonomics. Top left: K-mer analysis (see also Figure 2) is a fast, alignment independent, statistical (probabilistic) approach to investigate properties of a sequenced genome such as its similarity and relationship to other (reference) genomes. It is typically used to screen sequenced genomes to identify genomes of particular interest for more comprehensive analysis.

Tremendous progress in HTS technology developments has been made during the recent decade and this review will not discuss per se the sequencing technologies, because several excellent reviews are available (Loman and Pallen, 2015; Goodwin et al., 2016). Understanding advantages and disadvantages associated with the different HTS platforms, (in particular the read length, read number, sequencing error rate and costs) may, however, help readers to better understand the choices made in the following cited examples.

Illumina sequencing is currently the prevailing HTS technology and also offers the highest fidelity. It provides very large data sets of relatively short reads (100–300 bp) at an error rate per sequenced base of approximately 1%. Roche 454 (phased out in 2016) was the previously dominating technology and provided smaller data sets of longer reads (≤700 bp) with higher error rates. Much of the initial HTS literature reports on Roche 454 data (Liu et al., 2012; Mayo et al., 2014; van Dijk et al., 2014). Long read sequences, up to several kbp, can be obtained using other platforms (see below) but the error rates are high (5–40%). All HTS technologies allow for assembly of good quality draft bacterial genomes with up to 100 contigs. A fully closed genome sequence is usually obtained through application of different technologies combining the accuracy of short reads with the ability of long reads to span gaps in assembly scaffolds (Tallon et al., 2014). Due to the capacity of the Single Molecule Real Time sequencing technology of Pacific Biosciences to provide reads of 1–10 kbp, this technology has recently been popular for de novo assembly of completely closed genomes, long repetitive sequences, plasmids and bacteriophages (Rhoads and Au, 2015). The Oxford Nanopore MinION has competitive potential, as it is claimed to be able to provide even longer reads in real-time at low costs (Goodwin et al., 2015; Loman et al., 2015; Quick et al., 2015). The short read technologies generally provide significantly higher coverage than the long read technologies, at much lower costs per sequenced base.

HTS Based Microbiological Analyses

Foods, feeds, clinical and environmental samples harbor complex and diverse microbial communities. We use the expression “metagenomics” for analysis of such samples by HTS. For further distinction we use the term “shotgun metagenomics” for whole genome sequencing (WGS) or whole-sample-DNA based metagenomics (Figure 4). Targeted amplicon analysis of various ribosomal RNA coding DNA (rDNA) and other conserved markers is, as suggested by Marchesi and Ravel (2015), denoted “metataxonomics” throughout this review. “Metabarcoding” is a related term sometimes used in the literature (e.g., Jones et al., 2013; Staats et al., 2016). Shotgun metagenomics, and “metatranscriptomics” (i.e., shotgun sequencing of RNA transcripts) enables full genome or transcriptome sequencing, respectively, in a complex sample. RNA based shotgun metagenomics (i.e., sequencing of all RNA in the sample) and metatranscriptomics are often combined into a single approach “RNAseq”. We therefore only refer to metatranscriptomics or RNA based shotgun metagenomics when we need to distinguish from RNAseq.

FIGURE 4

Figure 4. The difference between shotgun metagenomics and amplicon based metataxonomic sequencing. (A) Six different genomes (1–6) shown in different colors with five of the six (1 and 3–6) containing a shared genomic region. The shared genomic region in all five has a conserved motif on the left side (red circle = forward primer binding site), but one of them has a significant change in the conserved motif on the right side (yellow circle = reverse primer binding site) resulting in primer mismatch (black circle). (B) The sequenced fragments with shotgun metagenomics are random motifs from the six different genomes, and only one of the conserved (primer binding) motifs will exceptionally be included. (C) The sequenced fragments with amplicon sequencing are only those delimited both by a conserved left and right motif (red and yellow circles). The difference in mean coverage per nucleotide is significant. Assuming that each genome has the same length (e.g., 10⁸ bp) and is present in equal concentration in the original sample, a read length of 200 bp, and an invariant length of the shared genomic region delimited by the primer sites of 250 bp, the mean coverage per nucleotide of the targets will be: (B) R × L/N × G = 10⁶ × 200 bp/6 × 10⁸ bp = 1/3 where R = number of reads, L = read length, N = number of genomes and G = genome size. (C) R × L/A × D = 10⁶ × 200 bp/4 × 250 bp = 2 × 10⁵ where R and L are the same as for B, while A = number of genomes flanked both by conserved forward and reverse primer sites, and D = the length of the shared genomic region delimited by the primer sites. In this example C is 6 × 10⁵ times more sensitive than B. Shotgun reads may be analyzed applying all bioinformatics approaches (assembly, mapping and k-mer analysis, alignment dependent and alignment free; cf. Figures 2, 3). Amplicon reads are usually analyzed by mapping, clustering and phylogenetic approaches, while assembly is only exceptionally applied.

Metataxonomics is generally more sensitive than shotgun metagenomics, due to enrichment of targets by amplification. Metataxonomics is, due to the targeted enrichment, prone to bias and may fail to detect novel variants of relevant targets, e.g., 16S rDNA with mismatches in the PCR primer binding motifs or genes involved in previously unknown but relevant biosynthetic pathways. Shotgun metagenomics on the other hand presumably has a low bias, is independent of a priori knowledge of target sequences, and can be used to monitor alterations in the microbiome that may not be evident from the composition of the microbiota. RNAseq is biased by the reverse transcription used to synthesize cDNA prior to sequencing, possibly affecting detectability of RNA viruses. Metatranscriptomics is further biased by RNA transcription rates. RNAseq is otherwise comparable to shotgun metagenomics. The main drawbacks of shotgun metagenomics are: several logs higher (inferior) limit of detection (LOD; Figure 4) and complex data analyses (bioinformatics) that, at present, are difficult to automate/standardize. The composition of the sample's microbiota and/or the causative agent is typically not well known in advance (limiting the possibility to apply targeted approaches such as metataxonomics), and relevant reference sequences may be lacking from public databases (limiting the possibility to perform mapping and assign function to sequences).

Selected, Illustrative Examples of Approaches for Specific Pathogen Taxa and Applications

FBDs are caused by bacterial, viral, fungal or parasitic pathogens entering the body via contaminated foods and have entered the food chain at some point from farm to fork. Bacteria and viruses are the most commonly reported sources of disease. Bacteria are more easily identified than viruses because the former can often be cultured. The vast majority of published HTS based FBP studies have focused on particular taxa, mainly bacterial, and specific applications, e.g., outbreak investigation starting with a clinical isolate. The organization of the following sub-chapters reflects this. Future advancements, with some included pioneer examples, are expected to allow for simultaneous detection of multiple higher and lower level taxa.

Bacterial Foodborne Pathogens

Many FBPs are well-studied and the use of genomic data and large scale WGS have become important for studies in epidemiology, evolution, surveillance and outbreak investigations. On August 16th 2017, there were 104,667 (84,726) genome assemblies and 8,119 (6,286) complete bacterial genomes (number in brackets was 7 months earlier) available from National Center for Biotechnology Information (NCBI; http://www.ncbi.nlm.nih.gov/genome/browse/). The contribution of bacteriophages to bacterial genome size, evolution and virulence is very significant (Brüssow et al., 2004; Salmond and Fineran, 2015). Shotgun metagenomics can provide new insights into these aspects and the possible relevance of phages to FBDs (Nieuwenhuijse and Koopmans, 2017).

Bacterial FBPs are often present in foods in low numbers heterogeneously spread in the product. Consequently, ability to detect very low levels of FBPs in various food sources is important. Current detection methods normally involve one or more enrichment steps, screening (e.g., by PCR), followed by an isolation step. Isolation of a FBP from a food matrix may be challenging due to low recovery of isolates;—a sample can be positive with PCR screening, yet isolation of a corresponding bacterial strain may not be achieved. Some of the most important bacterial FBPs are Salmonella, Listeria monocytogenes and Shiga toxin-producing E. coli (STEC) causing many outbreaks and sporadic cases with severe or fatal outcome (Crim et al., 2014; Astridge et al., 2015; EFSA, 2015b). These three FBPs are used in the following as illustrative examples. For Salmonella infections contaminated food sources typically include poultry, eggs, swine and ready-to-eat foods, and affect people at all ages (EFSA, 2015b). L. monocytogenes is commonly detected in ready-to-eat foods such as smoked fish and soft cheeses and often affect elderly, immunocompromised patients, pregnant women and have high mortality rate (EFSA, 2015b). The main food vehicles of STEC infections are bovine meat followed by vegetables and juice (EFSA, 2013, 2015b). STEC can cause severe complications like acute kidney failure (hemolytic uremic syndrome) and often affects children under the age of five, elderly and immunocompromised people (Davis et al., 2014).

For outbreak investigation the pathogen must be linked to the correct food product (source of infection). Food producers on the other hand, need to determine if their products or production line is contaminated, how an unwanted pathogen entered their production facilities, and/or if it is a persistent household strain (Figure 1; Table 1). Several strategies can be applied for comparison of isolates, e.g., pulsed-field gel electrophoresis (PFGE), multi-locus variable number of tandem repeats analysis (MLVA) and multi-locus sequence typing (MLST). For many FBPs the traditional typing or subtyping offers too low (phylogenetic) resolution to distinguish closely related but distinct strains. High resolution is required to discriminate parallel outbreaks or to separate sporadic cases from an outbreak, but also to assess if a reemerging contamination problem is caused by a persistent strain or reintroduction of similar strains.

WGS will provide highly discriminatory data for subtyping of strains by single nucleotide polymorphism (SNP) analysis or extended (core genome or whole genome) MLST (cgMLST, wgMLST) for strain comparison for outbreak investigations and surveillance purposes. Among possibilities beyond traditional molecular fingerprinting is the reanalysis of complete genome sequences when subsets (e.g., MLST) provide insufficient information/resolution. Polymorphisms can be investigated with or without mapping of the HTS data to a reference genome (Figures 2, 3). WGS-based analyses can also aid in the identification of other relevant factors such as virulence and antibiotic resistance genes (Joensen et al., 2014; Holmes et al., 2015; Octavia et al., 2015; Forbes et al., 2017). By standardizing the workflow of the actual HTS and bioinformatics analysis, this can take only a few days. However, comparable data and standardized protocols and pipelines are required, a topic discussed in further detail below.

Bacterial Genomic (Isolate and Strain Typing) Approaches

Outbreak Investigations

The starting points for outbreak investigations with strain typing are access to clinical isolates. WGS has been used many times in recent years for comparison of isolates in outbreak investigations. Most published studies were retrospective, but a few were performed in real-time. A selection of examples is summarized in Table 2. In an early prospective study (2009) isolates from human patients and animals associated with an STEC O157:H7 outbreak were selected for WGS for comparison of isolates and source identification (Underwood et al., 2013; Table 2). A combination of Roche 454 and Illumina data were used to generate a reference assembly from the strain with best quality data. The hybrid assembly resulted in 463 contigs with average size of 12,028 bp and served as a reference genome for the successive analysis. Shotgun metagenome reads from 16 isolates associated with the outbreak were mapped and examined for SNPs over the entire genome. Based on the SNP results five subtypes of the outbreak strain were identified, providing for design of assays for detection of six specific SNPs. These assays were used to follow the outbreak, including analysis of 106 additional isolates obtained from the outbreak, demonstrating that the five subtypes were widely distributed on the involved farm prior to the first human clinical case (Underwood et al., 2013). Variable number of tandem repeats and PFGE typing indicated that there were two different strains in the sample collection, but the data from each typing method did not overlap and were therefore inconclusive. The HTS data on the other hand documented that the outbreak was caused by a strain differing in four SNPs from the hypervirulent O157:H7 ST11 clade 8. This early study demonstrated that HTS can provide better resolution (five subtypes vs. two subtypes) and therefore can be superior to more traditional characterization methods. HTS in addition provided data suitable for design of specific diagnostic assays that improved the monitoring of the outbreak.

TABLE 2

Table 2. Examples^a of published high throughput sequencing based investigations of foodborne pathogen (FBP) outbreaks.

Specific diagnostic sequence motifs are not always available or known, and a specific pathogenic agent may exhibit new and unexpected combinations of involved virulence genes for which current tests are not optimally designed. This was for example the case in the large STEC O104:H4 outbreak in Germany and other European countries in 2011 (Scheutz et al., 2011). In mid-May the public health authority in Germany was informed about a cluster of three cases with hemolytic uremic syndrome and raw WGS data from a patient derived isolate were already on June the 2nd published by Beijing Genome Institute (NCBI accession no. SRX067313; Kupferschmidt, 2011). Public release of these data incited a huge joint effort from bioinformaticians and researchers around the world, very quickly resulting in in-depth knowledge of the strain. This also facilitated design of specific diagnostic tools for further investigation of the outbreak (Struelens et al., 2011).

A complex outbreak investigation in the UK identified watercress as the source of STEC O157 in two simultaneous outbreaks with different sources of contamination (Jenkins et al., 2015; Table 2). SNP positions of high quality in all genomes of the Public Health England STEC O157 database were then extracted. Pseudo sequences of polymorphic positions were used to create maximum-likelihood trees and compared to the WGS data of additional strains held in the database. Phylogenetic analysis supported a foreign source for the outbreak, but no microbiological link to a specific country of origin was identified. Only one isolate was identified from the irrigation water from the implicated watercress, indicating a low level of contamination. This isolate was compared to the human isolates, and a maximum of 3 SNP differences were reported for the second outbreak, confirming the source of this outbreak (Jenkins et al., 2015).

One of the first published studies using WGS in outbreak investigations concerned a large L. monocytogenes outbreak in Canada in 2008 (Gilmour et al., 2010; Table 2). Two clinical isolates with similar but distinct PFGE patterns were subjected to WGS to assess the genetic diversity of these isolates. Altogether 28 SNPs and three indels, including a 33-kbp motif corresponding to presence/absence of a prophage were observed. The additional information obtained with WGS compared to PFGE indicated that not one, but three distinct, yet closely related strains were possibly involved in the outbreak.

Investigations of an Australian hospital outbreak of L. monocytogenes in 2013 identified a chocolate profiterole from a specific food manufacturer as the common food consumed by the patients. A follow-up WGS study identified more SNP differences in the environmental isolates from the food manufacturing facility than the patients' isolates from the outbreak (Wang et al., 2015b; Table 2). However, the five outbreak isolates shared multiple distinctive genetic features including five prophage insertions. Wang et al. (2015b) suggested that the human isolates were less divergent because of successful adaptation to the relatively stable human environment while the environmental strains (19–20 SNP differences from human isolates) were under increased survival pressure due to less favorable conditions.

Schmid et al. (2014; Table 2) investigated a cluster of listeriosis in Austria and Germany by WGS where the human isolates shared PFGE and fluorescent amplified fragment length polymorphism profiles. Gene-by-gene comparison or cgMLST based on 2,298 genes revealed that four of the human isolates belonged to a single cluster differing by ≤6 alleles (genes). This cluster was distinct from but related to food isolates from two Austrian producers (differing by ≤8 and ≤19 alleles, respectively). The study did not explain if the allelic differences corresponded to SNPs or were more substantial. The other three human isolates were more distinct and unrelated to the outbreak cluster.

Octavia et al. (2015) used SNP analysis in an attempt to define whether an isolate was part of an outbreak or not and identify whether one or more strains were implicated in an outbreak (Figure 5). They modeled the mutation rate in S. typhimurium using 250 bp paired-end reads and estimated a cutoff value for the intra-strain number of SNPs the bacteria could have. When using a high or low substitution rate, and including a time limit of an outbreak from less than a month to up to 3 months the number of SNPs was estimated to differ from 2 to 9. Other studies have identified variable numbers of SNPs in Salmonella outbreaks. Several studies have reported 0–3 SNP differences in one outbreak (Ashton et al., 2015; Taylor et al., 2015; Wuyts et al., 2015). However, some outbreaks have reported to have larger SNP variation based on the core genome (Leekitcharoenphon et al., 2014). In concordance with many of the Salmonella reports, a low intra-strain number of SNPs (0–7) have been reported from epidemiologically linked cases of STEC O157:H7 (Turabelidze et al., 2013; Underwood et al., 2013; Joensen et al., 2014; Holmes et al., 2015; Jenkins et al., 2015; Figure 5).

FIGURE 5

Figure 5. Isogenic or non-isogenic isolates? The distance or number of observed differences between isolates, usually measured as single nucleotide polymorphisms (SNPs) in HTS studies, can provide clues to determine if isolates belong to the same strain, i.e., whether they are isogenic or not. This is important for outbreak investigations, epidemiology and to assess if a persistent strain is present in a food production system. Fewer than ten SNPs is often interpreted as evidence of an isogenic origin of bacterial isolates (see examples and discussion in the main text of this paper). Practice is currently not harmonized and also depends on the taxon in question, how SNPs or other differences are calculated, and which part of the genome the study covers (e.g., core or whole genome). An inferred phylogenetic relationship between nine isolates (A–I; terminal nodes) is shown. For each isolate, a blue letter (a–i) indicates the number of unique SNPs associated with each individual isolate. Internal nodes labeled X–Z connect three clusters of isolates, while internode N connects all isolates. Brown letters (x–z) indicate the number of shared SNPs separating each individual cluster of isolates from the others. The distance (Δ) between any pair of isolates is the sum of SNPs (i.e., blue and brown letters) separating them, e.g., if a = 3, d = 2, x = 2 and y = 4 then Δ_AD = 3 + 2 + 2 + 4 = 11. The following two examples serve to illustrate the difference between putatively isogenic and non-isogenic clusters of isolates (with a threshold of 9 for isogenics): If a = b = c = d = e = f = 2, g = h = i = 3, x = 2, and y = z = 3 then all the isolates A–F might be considered isogenic (internal distance between any pair of isolates Δ_max ≤ 9), as might G-I (Δ_max = 6), whereas A–F might not be considered isogenic with G–I (internal distance between any members from two different clusters Δ_min ≥10). Similarly, if a = b = c = d = e = f = 6, g = h = i = 1, x = 1, and y = z = 3 then only isolates G–I (Δ_max = 2) might be considered isogenic (any other pair of isolates would yield Δ_min ≥11).

Applications of Strain and Isolate Typing to Surveillance and Control

Surveillance of specific FBPs has been ongoing in public health laboratories for a long time, and can benefit from access to clinical and/or food derived isolates. A few countries and laboratories have implemented WGS as a routine typing tool for public health surveillance (i.e., on clinical isolates) for selected FBPs (Joensen et al., 2014; Ashton et al., 2016; Chattaway et al., 2016; Lindsey et al., 2016). Implementation of WGS as a standard typing tool for isolates from foods as well, is still in a start-up phase and routinely done in very few countries.

Denmark has implemented WGS typing of L. monocytogenes isolates from patients and the food surveillance program. Two unexpected genetic clusters, as classified by MLST type, were identified through the WGS analysis during 2013–2015 and further analyzed for SNP differences by mapping to a reference genome of the same MLST sequence type (Lassen et al., 2016). Another study on L. monocytogenes has developed a gene-by-gene (cgMLST) method based on 1,748 loci among 957 genomes (Moura et al., 2016). High robustness was shown as different DNA extraction methods, library preparations and sequencing instruments were used as well as assembly-free and de novo assembly-based methods to ensure that the allelic profiles generated were the same despite differences in the WGS methodology.

WGS and alignment-free SNP analysis were used to differentiate between persistent and repeatedly reintroduced strains of L. monocytogenes in a longitudinal study of food-associated environments (Stasiewicz et al., 2015). The PFGE patterns suggested reintroduction due to observed differences. Patterns unique to single retailers or single states supported persistence or clonal spread. However, the WGS analysis revealed that the observed PFGE differences were caused by a single mobile element, suggesting persistent contamination. Identifying clonal isolates from different food-associated environments emphasize the importance of strong epidemiological data in traceback of foodborne outbreaks.

Both SNP-based approaches and cgMLST yield a high discriminatory power and are reproducible for comparison of isolates. A prerequisite for detailed typing methods is a clear understanding of what makes two bacteria isogenic (belong to the same strain or clonal lineage; Figure 5). Lack of harmonization complicates the conclusive linking of clinical and food isolates, epidemiology and tracing and tracking of contamination in food processing facilities. Expert opinions will depend on the bacterial species and how SNPs are calculated (whole, core or extended genome). A prerequisite when performing reference-based SNP analysis is the availability of good quality reference genomes from strains closely related to the target strain(s). Unfortunately, in case of outbreaks due to rare variants of the causative agent, such reference genomes are not always available. This is true even for some variants of pathogenic E. coli, Salmonella spp. and L. monocytogenes.

Metagenomics for Typing of Bacterial Communities and FBPs

Isolates are often unavailable and may be difficult to obtain. Most foods harbor complex and composite microbial communities. Contaminating pathogens are often heterogeneously dispersed and represent a minority of the microorganisms present in the sometimes complex food sample. Metagenomics approaches offer the opportunity to investigate the composition of microbes in food matrices in toto without selective isolation, including the detection of non-viable and “viable but not cultivable” microbes (Bergholz et al., 2014), and will capture a broader range of the microbial community than classical microbiology. Most of the numerous HTS metagenomics studies report on 16S rDNA analysis, i.e., what we refer to as metataxonomics. This approach has proven useful for identification of bacteria, phylogenetic studies and characterization of bacterial communities in different foods, water and other environments (Mayo et al., 2014; Kergourlay et al., 2015; Tan et al., 2015). However, the 16S rDNA has limited resolution power and cannot be used to detect non-bacterial taxa. Reliable 16S rDNA based classification of bacteria rarely extends beyond phylum, group or genus level (Livezey et al., 2013). A few FBPs can be identified by 16S rDNA sequencing, but only exceptionally at the species level and never to pathotype. All Salmonella species are considered as pathogens (Jarvis et al., 2015; Zhang et al., 2015), but only two of the Listeria species known so far are pathogenic, i.e., L. monocytogenes and L. ivanovii. These species are partly distinguishable based on 16S rDNA sequence. In contrast, 16S rDNA sequences cannot distinguish STEC from non-diarrheagenic or commensal E. coli. For identification of STEC, detection of virulence-associated genes including the Shiga toxin- and intimin-encoding stx and eae genes will be essential for meaningful strain typing. Database limitations (not all relevant taxa and haplotypes represented + possible erroneous sequences and taxonomic annotations) and the common use of only a subsection of the 16S rDNA (missing or covering only some of the inter-strain variation) further reduces the fitness of metataxonomics for FBP detection (Adeolu et al., 2016; Singer et al., 2016).

Shotgun metagenomics of the entire DNA present in a sample offers a more comprehensive insight into the microbial diversity of a sample with regard to the richness of microbial taxa at all levels, or with regard to the presence of gene families or biomarkers in general (Ferri et al., 2015; Blagden et al., 2016; Ranjan et al., 2016). Shotgun metagenomic approaches used for improved FBP detection have been tested in a few studies. Bioinformatics methods to distinguish two genomes of the same species in a complex sample are needed, but it seems possible to determine whether one or more FBP strains is involved (Leonard et al., 2016). In investigations where it is essential to detect a specific organism assumed present in low numbers in a complex matrix, culture-based bacterial enrichment is still necessary. Then, inevitably, an enrichment bias of the composition of the bacterial community relative to the original sample will emerge.

Shotgun Metagenomics in Outbreak Investigations

When screening of food products and even in outbreak investigations, the genome sequence of the specific FBP strain/strains is usually not available. HTS technology may be used to identify the genome sequence of the causative agent of the outbreak by de novo assembly of sequence reads from complex samples with high prevalence of the agent, e.g., clinical specimens or food samples (Figures 6, 7). Theoretically, the approach shown in Figure 6 can also be applied to control of food products by the manufacturers or enforcement authorities. However, the current costs and other resource requirements (skills, lack of standardized data analyses, time) prevent justification of the approach for routine controls.

FIGURE 6

Figure 6. Reference guided metagenome sequencing based approach for identification and characterization of pathogenic and outbreak associated strain(s). In case of an outbreak, fecal samples from patients are subjected to culturing, in order to isolate the outbreak strain. Patients are also interviewed in order to try to identify food products that may be the source(s) of infection. The metagenomes of stool samples, food products and cultured strains can then be amplicon sequenced (metataxonomics) or shotgun sequenced (metagenomics) and the data mapped to reference databases for identification of virulence markers. Shotgun reads can also be assembled into larger contigs or genomes for identification of pathogenic strains. The latter is facilitated if the sequence data are derived from single isolates. Black arrows indicate forward flow direction of the analysis, while gray arrows indicate feedback changing the premises for earlier steps. Feedback from the sequencing analysis can be used to refine and narrow the search for a specific FBP. If successful, the outbreak will be terminated. This review includes multiple examples of the application of the described approach to outbreak investigations.

FIGURE 7

Figure 7. Reference independent shotgun metagenome sequencing based approach for identification and characterization of outbreak strain(s). In case of smaller outbreaks the possibility to compare metagenomes from affected people (patients) and healthy controls is limited. In these cases the availability of clinical isolates may be required to avoid exhaustive open ended bioinformatics (in silico) analyses, as exemplified by Brzuszkiewicz et al. (2011) and Rasko et al. (2011). Environmental gene tags (EGTs) from metagenomes of people affected by the outbreak and controls (people not affected) can be compared in case of a larger outbreak, as exemplified by Loman et al. (2013). In that study, EGTs present only in affected patients were characteristic of the outbreak strain and provided sufficient information to near complete characterization of its genome. Scaffolds and in particular assembled genomes may and should be uploaded to reference database(s), for successive use in analytical approaches like those described in Figure 6.

In a retrospective study of the German/European STEC O104:H4 outbreak in 2011 it was demonstrated that such an approach could be used to identify the infectious agent in human fecal samples (Loman et al., 2013; Table 2). Forty-five fecal samples from patients were sequenced by shotgun metagenomics. Human DNA was subtracted in silico and assembly was performed to create environmental gene tags (EGTs). EGTs found in more than 20 of the fecal samples were selected for further analysis. The total outbreak metagenome was screened for sequence reads from healthy humans and matching EGTs were subtracted. A set of 450 outbreak-specific EGTs were then subjected to taxonomic analysis and almost 65% were assigned to the Enterobacteriales. The original metagenomics data sets were then used in an attempt to reconstruct the E. coli outbreak strain genome. Functional annotation confirmed the presence of important strain-specific and virulence-associated genes, and ten samples had more than 10 × coverage of reads mapped to the reference genome of the specific outbreak strain. The coverage was > 1 in 26 samples and Shiga toxin genes were detected in 27 of 40 STEC-positive samples. In some of the individual samples sequences from other human pathogens such as Campylobacter, Salmonella, and Clostridium difficile were also identified. This study indicates the potential of shotgun metagenomic analyses for the culture-independent identification of bacterial pathogens in samples with complex microbial composition.

Shotgun Metagenomics for Food Surveillance and Control

A selection of studies is described below and details are presented in Table 3. Tomatoes have been implicated in Salmonella outbreaks several times, but isolation of Salmonella from tomatoes has only been successful a few times. Ottesen et al. (2013) used shotgun metagenomics to describe taxa associated with pre-enrichment and throughout the enrichment steps of a protocol for Salmonella detection in environmental tomato samples (Table 3). DNA was extracted prior to enrichment and the remaining tomato samples were enriched overnight in a universal pre-enrichment broth and aliquots successively added to two different growth media. The sequencing depth was insufficient to capture the majority of the diversity within the samples. To achieve about 1 × coverage of all genomes Ottesen et al. (2013) estimated that they would have needed approximately 250 × more sequence data. Variation among samples suggested differences in the microbial community in the starting material of the samples. An important biological finding was the significant enrichment of Paenibacillus sp. from uncultured to cultured samples. This taxon is known to inhibit and kill Salmonella. The study also identified a number of sequences as Salmonella-specific despite negative PCR and culture results when those samples were tested for Salmonella. A comparison of results from the two different applied assembly approaches showed that increased read length, contrary to what might be expected, reduced the ability to assign taxonomy. Others have made similar observations (Luo et al., 2012). It is not clear if this phenomenon is associated with database limitations.

TABLE 3

Table 3. Examples^a of published high throughput sequencing based approaches to detection of foodborne pathogens for industrial and control purposes.

Jarvis et al. (2015) aimed to characterize the microbiota in cilantro (coriander leaves), and simultaneously identify Salmonella from the samples (Table 3). Metataxonomics based on sequencing of the 16S rDNA from 91 samples was complemented with shotgun metagenomics. Gram-negative Proteobacteria dominated in the cilantro samples before enrichment. After 24 h of enrichment the microbial composition had shifted to mainly Gram-positive Firmicutes, as described above for tomato (Ottesen et al., 2013). These findings suggest that the culture-based method should be optimized for the detection of the organisms of interest. Low detection of Salmonella by metataxonomics was thought to be due to low sequencing depth and/or reduced amplification efficiency caused by imperfect match in one of the primers. Shotgun metagenomics was performed on six cilantro samples culture-positive for Salmonella (enriched samples), and variable levels of Salmonella were identified. The genomes of the Salmonella isolates from these samples were already fully sequenced and were therefore included in the reference database used in the similarity analysis. The variable levels of Salmonella detected after 24 h enrichment illustrate the challenge of detecting Salmonella in matrices with a complex microbial background. Again, the sequencing depth of the analysis and levels of contamination were reported to influence the ability to detect the suspected agent.

Similarly, predominance of other species than L. monocytogenes was observed until the end of the enrichment procedure (after 40 h) in a study on L. monocytogenes and associated microbiota in naturally contaminated ice cream (Ottesen et al., 2016; Table 3).

Leonard et al. (2015) applied shotgun metagenomics to detection of STEC in bagged spinach (Table 3). Spiked samples with known concentrations of a STEC O157:H7 were sequenced and sufficient coverage of the genome required spiking with at least 10,000 colony forming units (CFU) of STEC per 100 g spinach followed by enrichment for 5 h to enable full pathogen characterization. However, enrichment for 23 h allowed the full pathogen characterization by shotgun metagenomics from as little as 10 CFU of STEC spiked into 100 g of spinach. Then, the sequencing coverage of the STEC strain was 184 × and the consensus sequence after reference-based assembly covered the whole reference genome with only six gaps. However, reliable detection to low levels like 10 CFU of STEC per 100 g of spinach, required enrichment for at least 8 h. Then, approximately 2.9% of the reads could be mapped to the reference genome with coverage of approximately 10×. Leonard et al. (2015) concluded that this should be sufficient to enable DNA sequence-based determination of the serotype and essential virulence genes of the contaminating pathogen.

The same team demonstrated the possibility to detect and identify STEC down to strain-level in spinach samples spiked at 10 CFU/100 g of spinach using a variety of STEC strains (Leonard et al., 2016; Table 3). A shotgun metagenomics approach as described in the abovementioned study (Leonard et al., 2015) was applied. For microbial community analysis a database of unique 25-mers for species identification was used. This k-mer approach could also differentiate between E. coli phylogroups and demonstrated presence of more than one E. coli phylogroup in some of the samples. Conserved chromosomal E. coli genes (2,542) were extracted from WGS data from the STEC strains used in the spiking experiment as well as the metagenomic assemblies for whole genome phylogeny and SNP analysis. When the metagenomic assemblies only included the spiked STEC strain or the abundance of other E. coli was much lower than the spiked strain, the number of mismatches from the SNP analysis was less than 20, i.e., at or close to the intra-strain SNP-variability level (Figure 5).

The above-mentioned studies indicate that at present, it is difficult to achieve the large number of reads required for ≥1 × coverage. These studies also showed how enrichment will bias the microbial composition, potentially favoring other taxa or strains than those intended for enrichment. Since multiple samples often need to be analyzed, it would be cost-efficient if < 1 × coverage would be sufficient for most samples. K-mer approaches to screen samples after a minimum of enrichment (see e.g., Ondov et al., 2016) to classify samples according to risk of presence of FBP, could be used to reduce the number of samples for which more in-depth sequencing and analysis is needed. Significant correspondence between a mapped read and FBP specific reference sequences may also provide enough evidence even at < 1 × coverage (Spilsberg et al., 2017). Better sample preparation and enrichment protocols could also contribute. Results from metataxonomics and shotgun metagenomics could aid in optimization of such protocols.

Viral Food Pathogens

Complete genome assemblies for 7,409 viruses were available from NCBI on August 16th 2017 (an increase of nearly 500 in 7 months). Viruses lack the genes necessary to transcribe protein-coding genes and replicate and reproduce, and therefore depend completely on the biological machinery of their host cells (Moreira and Lopez-Garcia, 2009). They are highly diverse and lack a common genetic constitution such as genes coding for ribosomal RNAs (Moreira and Lopez-Garcia, 2009), limiting the metataxonomic options. Some viruses can be cultured, but cultivation require advanced protocols, suitable host cells for propagation, and is time-consuming (Rodríguez-Lazaro et al., 2012). Viruses play a dual role in food pathogenesis. Some viruses like bacteriophages can affect the virulence and population structure of microorganisms (Hayes et al., 2017). Other viruses can be FBPs themselves (Newell et al., 2010; EFSA, 2011). For a virus to be transmissible through foods, it must have some environmental stability and remain infectious for some time on or in a food matrix. As viruses are unable to replicate in the food matrix they must have a low infectious dose. Foodborne viruses are commonly shed in large amounts by a fecal route and viral contamination of foods is primarily via human fecal material (Rodríguez-Lazaro et al., 2012). Normal cooking or frying inactivates viruses. Food sources of infections are typically raw foods like fresh produce, soft berry fruits, herbs, shellfish, ready-to-eat products in general and undercooked meat or foods served cold that are contaminated by an infected food-handler post cooking (Halliday et al., 1991; Hedberg and Osterholm, 1993; de Wit et al., 2003; Fiore, 2004; EFSA, 2011, 2015a). Most viruses that can infect through a foodborne route can also utilize a person-to-person infectious route. Many outbreaks, potentially the majority, are simultaneously propagated via combined person-to-person and foodborne infectious routes. Food and water associated transmission is also suspected to enhance the spread of zoonotic viruses and facilitates the occurrence of zoonotic events, e.g., through the handling of bushmeats (Nieuwenhuijse and Koopmans, 2017 and refs. therein). The viral FBP load is often low and heterogeneously dispersed in the food matrix while it is high in clinical patients. It is not trivial to distinguish between foodborne and person-to-person infections, unless initial cases are identified and analyzed. The reporting and surveillance of foodborne viruses is limited, and disease symptoms can be very similar for some viruses and for other viruses differ substantially between infected individuals. All this results in low documentation of the real impact and diversity of foodborne viruses (Nieuwenhuijse and Koopmans, 2017).

The most notable FBP viruses are norovirus (NoV), hepatitis A virus (HAV) and hepatitis E virus (HEV) which are all positive-sense, single-stranded, non-enveloped RNA viruses, and the double-stranded RNA rotavirus (RV) (Newell et al., 2010; EFSA, 2011). Severe acute and Middle East respiratory syndrome (SARS and MERS) and Ebola viruses are examples of other (zoonotic) RNA viruses suspected to be transmissible via food (Newell et al., 2010; Nieuwenhuijse and Koopmans, 2017 and refs. therein). Most FBP viruses are RNA viruses and require special sample processing and nucleic acid extraction methods in contrast to the vast majority of bacteriophages, which are double-stranded DNA viruses.

Viruses, and in particular RNA viruses, evolve rapidly, both via genetic drift and in response to active selection pressures (Holland et al., 1982). Detection and genotyping with PCR approaches, including metataxonomic HTS approaches, can therefore fail. Particular sample processing steps can significantly improve the probability of detection, e.g., by concentration of the viruses and/or removal of interfering substances [reviewed in Hartmann and Halden (2012); see also (EFSA, 2011)]. Detailed understanding of virus stability, inactivation times and temperatures is lacking due to limitations in model systems (Cook, 2013).

Outbreak Investigations

Isolates of viruses are rare in clinical settings, but clinical samples from viral outbreaks can contain high titers of the causative virus strain(s). The starting points for outbreak investigations are therefore availability of clinically derived samples. Due to the relative heterogeneity of samples compared to isolates, the analytical approaches derive from metagenomics approaches. The small genome size and high mean coverage per nucleotide, however, usually allows for characterization at strain level.

The severity of HAV infection varies strongly with age. HAV is endemic in regions with inadequate sanitation and limited access to clean water, creating population wide immunity. Contrastingly, regions with high quality sanitation and water supply can experience outbreaks of hepatitis A unless broad scale vaccination has been performed. Nearly 300,000 persons were infected in a clam-related epidemic of HAV in Shanghai in 1988 (Halliday et al., 1991) while 1,589 were reported infected, including 2 casualties, in a recent outbreak in Europe associated with frozen berries (Severi et al., 2015). In a recent epidemiological case study, a single food product was associated with a HAV outbreak (Collier et al., 2014; Table 2). HAV was extracted from serum and fecal samples from 120 patients. Metataxonomics targeting a 315 bp HAV fragment yielded 117 (98%) positive for HAV genotype IB. Of these, 99 (85%) were identical in the 315 bp sequenced segment. Attempts to isolate HAV from the food product (frozen pomegranate arils imported from Turkey) were unsuccessful. This demonstrates the challenge of establishing etiologies. Chiapponi et al. (2014) analyzed two samples of frozen berries from two apparently unrelated HAV outbreaks in Italy, and successfully detected HAV by reverse transcription, quantitative PCR (Table 2). RNA was then extracted from the two samples and complete sub-genotype IA HAV genomes of 7,398 and 7,393 nucleotides, respectively, were obtained by a combined RNAseq and amplicon HTS strategy. Chiapponi et al. were able to link the two outbreaks and also link the food derived sequences to an existing patient derived sequence by ≥99.9% nucleotide identities.

RV primarily infects children, is the most common source of gastroenteritis among infants (Desselberger, 2014) and is estimated to cause approximately 5% of total child deaths worldwide. An outbreak of foodborne gastroenteritis in Japan in 2012 caused by RV was probably associated with consumption of raw sliced cabbage (Mizukoshi et al., 2014; Table 2). Samples from patients and food handlers were positive for RV. The study combined a broad spectrum of tests. One clinical, fecal sample was subjected to RNAseq. Sequences of all 11 segments of the viral genome, sufficient to determine the specific viral strain, were identified. No food, however, was assessed for presence of pathogens.

HTS has accelerated the study of viral genetics, the assembly of novel viral genomes, and molecular epidemiology of viral outbreaks (Finkbeiner et al., 2009; Kundu et al., 2013; Wong et al., 2013; Smits et al., 2014; Ganova-Raeva et al., 2015). Studies of virus variants and quasispecies, analyses of vaccine escapes and drug resistance and viral evolution are now almost exclusively performed with HTS (Barzon et al., 2011).

The etiology of virus-associated outbreaks will often remain unknown, for various reasons. Finding and characterizing the causative virus may not be included among the analytical objectives, or the applied method is not always sufficiently sensitive. Other reasons for failure can be genetic drift in the virus, or an outbreak caused by a novel virus. Discovery of pathogens almost exclusively starts with samples from a diseased hosts and not a food matrix. There are many examples of investigations unraveling the etiology of viral outbreaks, but very few where the source of infection is identified or a specific food suspected and tested. It is difficult to establish if an outbreak started as a food contamination event, and it is reasonable to assume that these events are underreported. The small genome sizes of viruses may partly explain why the massive capacity of HTS is rarely used in viral FBP outbreak investigations, as may the limited availability of effective enrichment methods. The potency of new HTS platforms to provide valuable epidemiological information to large viral outbreaks was recently demonstrated by Quick et al. (2016), although on clinical isolates. A similar approach could be applied to epidemiological monitoring and source identification in case of a foodborne outbreak.

Surveillance and Control Purposes

The viral titers in food products are commonly much lower than in clinical samples, and isolates are not available. As for outbreak investigations the approaches applied for surveillance and control are metagenomics derived while the end-point is strain characterization.

Aw et al. (2016) used assembly of RNAseq and shotgun metagenomics reads (mean contig size 680 bp) and mapping to a database of viral reference genomes on samples from field grown and retail lettuce (Table 3). A small fraction of the reads corresponded to RV and other viruses that infect humans. In the study, 16S rDNA metataxonomic screening was used to verify absence of contaminating bacterial DNA.

Fernandez-Cassi et al. (2017) used RNAseq and mapping to examine the viral contamination of fresh parsley plants irrigated with fecally tainted river water (Table 3). A small fraction (< 1%) of the reads was related to FBPs, including among others HEV and NoV.

NoV is extremely contagious, and can cause large outbreaks of gastroenteritis (de Wit et al., 2003). NoV is the no. 1 cause of diarrheal disease and mortalities in the world (Pires et al., 2015) and the leading source of foodborne illness in the USA (Scallan et al., 2011). The etiology is reviewed elsewhere (Moore et al., 2015). Imamura et al. (2016b) used HTS to characterize NoV diversity in shellfish from two commercial producers in Japan, using a combination of RNAseq on virus suspensions and PCR enriched targeted sequencing (genotyping approach similar to metataxonomics; Table 3). NoV genotypes GI.3 and GI.4 were most prevalent and identified in a surprisingly high proportion of 20–25% of the samples. This proof of concept study could not actually address the diversity of NoV in single shellfish as 3 individuals were pooled to one sample prior to analysis to obtain sufficient starting material. The same genotyping approach was applied to a study of the efficiency of removing NoV from shellfish by depuration (Imamura et al., 2016a; Table 3). Depuration is used to decontaminate commercial shellfish. The study demonstrated that depuration is insufficient with respect to NoV.

Fungal Food Pathogens

On August 16th 2017, there were 2,515 genome assemblies and 29 complete fungal genomes available from NCBI. So far, there are very few examples of application of HTS to studies of epidemiology and virulence of fungal FBPs (Billmyre et al., 2014; Lee et al., 2014; Litvintseva et al., 2014, 2015; Vaux et al., 2014). Only two published studies relate to specific cases of fungal food pathogenesis (Lee et al., 2014; Vaux et al., 2014). Molecular typing of fungal isolates and mycobiotas in relation to FBD is almost exclusively metataxonomic, but usually limited to PCR amplification of one or a few genetic loci followed by Sanger sequencing with exceptional examples of MLST analysis in the published literature (e.g., Byrnes et al., 2010; Desnos-Ollivier et al., 2015; Wang et al., 2015c). The relevance of fungi as FBPs is debated, significantly lower than that of bacteria and viruses, but possibly also understudied.

Plants are commonly infected in the field by “field fungi.” Some of these produce toxic metabolites and may consequently cause disease when plant derived products are consumed (Lee et al., 2015; Stoev, 2015). Typical examples are Fusarium spp. on cereal grains, producing zearalenone, fumonisins and trichothecenes (e.g., deoxynivalenol, T-2 and HT-2 toxin), and Penicillium spp. on fruits, producing patulin. Other fungi infect a broad range of food and feed products during storage (“post-harvest fungi”). Typical examples are Aspergillus spp. and Penicillium spp., producing acute or chronically toxic compounds such as aflatoxins, ochratoxins and citrinin, and multiple antimicrobials with indirect health effects via modulation of the gut microbiota (Gillings et al., 2015; Stoev, 2015). Some mycotoxins are persistent to food processing (EMAN, 2015). They can exacerbate the infections with a wide range of non-fungal and fungal pathogens (Antonissen et al., 2014; Stoev, 2015), and it is hypothesized that mycotoxins may contribute to fungal infections (mycoses; Withlow and Hagler, 2016).

Several fungi are opportunistic, infective FBPs (Clemons et al., 2010; Iriart et al., 2010; Gurgui et al., 2011; Kazan et al., 2011; Benedict et al., 2016). Invasive fungal infections are primarily a problem in immunocompromised people (Brown et al., 2012; Bitar et al., 2014; Benedict et al., 2016). Reported examples of verified foodborne fungal infections are sparse (Benedict et al., 2016), and the problem is perhaps under-investigated. Hitherto, there is only one example of the use of HTS to investigate a fungal FBP outbreak. We suspect that the investigation of several other outbreaks or sporadic cases of fungus associated FBD reported in the literature (Benedict et al., 2016) could have benefited from application of HTS approaches similar to those described for other FBP taxa in this review.

Outbreak Investigation

As with bacteria the starting points for fungal foodborne outbreak investigations are usually clinically derived isolates. Both strain characterization and metagenomics approaches can be used and are described in the literature, but only one published example applied HTS.

Mucoralean fungi cause mucormycosis (zygomycosis), fatal fungal infections in humans whose incidence has been increasing lately (Roden et al., 2005; Spellberg, 2012). Gastrointestinal mucormycosis is rare and thought to be secondary to ingestion of fungi (Roden et al., 2005). In 2013 a strain of Mucor circinelloides f. circinelloides, the most virulent subspecies of M. circinelloides, was found as a contaminant in a batch of yogurt in the USA (FDA, 2013). More than 200 consumers became ill, although no fatalities were recorded (Lee et al., 2014). The affected consumers were immunocompetent. An isolate obtained from a yogurt container was subjected to WGS in order to characterize its genetic potential to cause significant infections, and to establish its genetic relationship to other strains of M. circinelloides (Lee et al., 2014; Table 2). A reference genome assembly was obtained from a strain of M. circinelloides isolated from human skin (Findley et al., 2013). Reads from the yogurt isolate were mapped to the reference genome for SNPs analysis. Comparison with a third isolate of a Mucor sp. using whole-genome alignments, and pathogenicity studies in murine models contributed to verify the pathogenicity of the yogurt strain. No clinical isolate from the outbreak was included in this study.

Parasitic Food Pathogens

Parasites are a diverse (polyphyletic) group of small animals or animal-like Eukaryotes, and their pathogenic potential is less frequently linked with metabolites and more frequently with their energy consumption and predation on host tissues than bacteria and fungi. World-wide, more than 100 species of foodborne parasites cause disease in humans (Orlandi et al., 2002). Complete or draft genome assemblies were available from NCBI for at least 40 of these species on August 16th 2017. Globalization, i.e., the movement of people, animals and food and feed increases the risk of moving and spreading parasites that are originally endemic, to new countries and hosts (Robertson et al., 2014). The lack of suitable enrichment methods for parasites, as opposed to most of the known bacterial and fungal FBPs, means that recovery/isolation steps are particularly important (Robertson et al., 2014). This also suggests that molecular detection can be useful to monitor presence, distribution and epidemiology, as well as the efficiency of clinical treatments after parasite infections.

Published studies applying HTS technologies to parasites are with few exceptions limited to characterization of genomes and transcriptomes, a topic not covered here. These studies, however, provide for detailed insight into the genetics of adaptations to specialized parasitism, and provide urgently needed reference sequence data for detection and identification purposes and clues to possible target/drug combinations (e.g., Tsai et al., 2013; Foth et al., 2014; Young et al., 2014; Barratt et al., 2015).

The presence of a multitude of other taxa and the frequently low abundance of parasites or derived DNA in clinical fecal samples challenge the detectability and may prevent effective sampling and purification of DNA for detection of parasites. Specialized protocols may be required to purify and enrich the parasite relative to a background matrix such as food or feces. Improved sample preparation and enrichment methods are discussed later.

Parasites are Eukaryotes with substantially more genetic similarity to their human and animal hosts than Bacteria. The size of parasite genomes (10–1,000 Mbp) ranges from 10 × the size of bacterial genomes and up to nearly the size of the human genome. New databases are in development, collecting genomic information for various parasites (Martin et al., 2015). The lack of annotated genomes and transcriptomes has been a major obstacle to the effective use of HTS for detection of parasites. Molecular discrimination is dependent on detailed knowledge of genomes and genetic variation. However, many of the parasites are detectable by visual, macro- or microscopic inspection, and this is likely to be a more cost-efficient approach in many instances.

Outbreak Investigation

The intraspecific variation in virulence among foodborne parasites is, to our knowledge, not reported to be high or significant. Combined with the large genome size of Eukaryotes, this suggests that metataxonomic approaches can be sufficient for detection and outbreak investigations.

More than 200 seafood poisoning cases of unknown etiology were reported in Japan from 2008 to 2010. Victims commonly reported to have ingested raw Paralichthys olivaceus (a flounder). Kawai et al. (2012) therefore extracted total DNA or RNA from frozen P. olivaceus filets for shotgun metagenomics and RNAseq (Table 2). In parallel, muscle tissue was sieved to recover spores from suspected parasite infections. The presence of spores and 18S rDNA from Kudoa septempunctata in the fish samples was observed. The pathogenicity of this myxosporean was confirmed in suckling mice and house musk shrews.

Vectors of Foodborne Pathogens

In addition to pathogens naturally associated with the food producing organisms such as gut microbes, dermal yeasts and bacteria, plant pathogenic fungi, etc. many of the most severe food pathogeneses are caused by incidental transfer from animal vectors (pests; Olsen et al., 2001; Jones et al., 2013). The Food and Drug Administration has identified the 22 most common pests contributing to the spread of FBD in the USA (Olsen et al., 2001). Four of these are rodents (mouse and rat species), while the remaining 18 are insects (cockroaches, ants and flies). The traditional approach to detect and identify these is microscopy. Recently it was proposed to use metataxonomics by sequencing of the mitochondrial cytochrome c oxidase subunit I (COI) as a faster, more reliable, sensitive and cost-efficient approach (Jones et al., 2013). COI metataxonomics can be performed using HTS (see Figure 4) and may therefore suit as an attractive approach to screen routinely for presence of these vectors in raw materials for food production, to reduce the risk of introducing pathogens in the production. Vectors other than, or in addition to the 22 identified by Olsen et al. (2001) may be identified as particularly relevant, e.g., in other parts of the world. The introduction of additional sequence targets in an HTS based metataxonomic screening should be feasible (Lammers et al., 2014; Leray and Knowlton, 2015; Arulandhu et al., 2017) despite the limitations of COI and metataxonomic approaches in general (Deagle et al., 2014; Staats et al., 2016; Arulandhu et al., 2017).

Other Applications of HTS with Potential Relevance to FBPs

The discovery and characterization of biosynthetic gene clusters in microorganisms is radically facilitated with the availability of HTS (Cacho et al., 2015). Secondary metabolites are often toxic, and many of the microorganisms producing them are FBPs. Characterization of the biosynthetic pathways provides a basis for faster detection of agents producing the metabolites, as well as for interception of undesirable biological effects and exploitation of the biosynthetic pathways for production of new bioactive compounds.

Direct shotgun metagenomics on DNA purified from microorganisms isolated without enrichment culturing is an attractive approach. Such an approach applied to clinical, polymicrobial urine samples was found to have comparable identifiability of bacteria as more conventional and much more time consuming approaches (Hasman et al., 2014). The complexity of such urine samples may be comparable to that of some food matrixes. Several approaches to data analysis were taken. First: identification of microorganisms by presence of specific k-mer motifs identified in a database of complete bacterial genomes. Secondly: alignment based subtraction of host-DNA derived reads and estimation of relative bacterial species distribution. Third: mapping of reads against a larger database of complete and draft bacterial, archaeal, fungal, protozoan and viral genomes. Finally, MLST and resistance gene identification performed against relevant databases. In a follow up study, direct shotgun metagenomics on toilet waste samples from long-distance flights was used to identify bacteria and antimicrobial resistance genes (Petersen et al., 2015) by mapping of reads to the abovementioned databases.

Host-specific genetic markers may be present in strains of pathogenic and non-pathogenic taxa, and then hold potential for source tracking (Gomi et al., 2014). See also Franz et al. (2016) for a more detailed discussion on source attribution.

Discussion

Acknowledging the Importance of Sample Preparation

A major challenge for detection of FBPs in complex matrixes is the recovery of the organism or its genes. Foods are physically and chemically complex matrices hosting complex microbial communities. The extraction and purification of DNA from the samples is of major importance as the presence of inhibitors may hamper the analysis further down the line (Ceuppens et al., 2014; Moore et al., 2015). The sample storage conditions and preparation steps can introduce taxonomic biases linked to recovery (Ceuppens et al., 2015; Menke et al., 2017). Similar or even stronger effects on the microbiome and RNA population of samples are predictable, as also the intraspecific diversity and gene expression can be affected. The size of isolated/purified nucleic acid fragments is also relevant. Longer fragments are usually superior and complementary to shorter fragments for genome assembly. The purity and relative concentration of target isolated nucleic acids affect their detectability and certainly their quantifiability (Holst-Jensen et al., 2003). Different taxa and life stages of potential pathogens require different treatments for detachment from substrates and complex structures, recovery and lysis. Protocols required for extraction and purification of DNA from strongly attaching, biofilm-forming taxa with tough cell walls can cause shearing of nucleic acids of other relevant taxa, yielding undesirably short fragments or totally degraded RNA/DNA. The stability of nucleic acids is a critical parameter, and this topic is reviewed by Ceuppens et al. (2014). Enrichment processes required to achieve the necessary target concentration can also bias the post-enrichment microbial composition in undesirable ways, as illustrated with Salmonella spp. in examples discussed earlier (Ottesen et al., 2013; Jarvis et al., 2015).

Examples of non-culturing based enrichment of particular organisms include size based filtering, buoyancy, affinity based columns or immuno-magnetic separation (Hadfield et al., 2015). Molecular enrichment can be achieved by subtraction hybridization (Galbraith et al., 2004) and various combinations of digestion with restriction enzymes, adapter ligation and sequence specific amplification (Leichty and Brisson, 2014; Arulandhu et al., 2016).

Specific Challenges Related to Molecular Analyses

Ability to discriminate viable/infective agents from dead/non-infective agents and to obtain isolates of the agent(s) from food samples for comparison purposes is often important (Ceuppens et al., 2014; Forbes et al., 2017). Molecular analytical methods can potentially circumvent need for enrichment culturing and further selective steps for the detection of many of the most important FBPs, but the issues of viability, recovery and LOD remain critical. Transcriptomics or use of propidium monoazide prior to sequencing are options to discriminate living from dead cells (Weinmaier et al., 2015). There is no harmonized requirement for detection and/or identification of an FBP by HTS, neither with respect to the number of reads, coverage of a specific diagnostic target motif, minimum length of (assembled) contig or maximum number of mismatches. Depending on the choice of actual minimum performance parameters and associated acceptance values it is possible to calculate the probability of detection (POD) of relevant target(s) and perform a statistical comparison of the POD to the actual observations (Holst-Jensen et al., 2016; Spilsberg et al., 2017). This would provide clues to the probability and reliability of findings. POD calculations performed prior to analyses can be useful to assess the cost-efficiency of alternative approaches. HTS is generally not quantitative, and complementary tools are needed to verify the LOD and recovery. Direct sequencing of nucleic acids from a clinical or food derived sample can be faster than culture-dependent analytical approaches. However, successive analysis of HTS data, necessary for interpretation, may be too time-consuming to render HTS really competitive in many cases. For bacteria in particular, the combination of limited enrichment culturing and fast HTS with (semi-)automated bioinformatics is currently the most optimal, realistic option that can provide very detailed information while simultaneously offering sufficient sensitivity and speed. For those FBPs that cannot be enriched by culturing, future pipeline developments may improve the situation, at least for certain types of matrixes and scenarios.

Harmonization and Validation of HTS Approaches

The use of different technologies and methods negatively affect the comparability of results (Junemann et al., 2013), as does lack of harmonized terminology and data interpretation (Lambert et al., 2017; Taboada et al., 2017). Recent studies have tried to overcome this variability in end-point results by establishing standard WGS data sets from outbreaks with Salmonella, STEC and L. monocytogenes (Timme et al., 2015) and a benchmarking dataset consisting of 101 whole genome sequences from one E. coli hypermutator strain (Ahrenfeldt et al., 2017). Use of these data sets is proposed to facilitate standardization and harmonization of bioinformatics pipelines (Timme et al., 2015; Ahrenfeldt et al., 2017).

Traditional approaches to standardization and validation cannot be directly applied to HTS approaches. There is for example no common cognition as to what is required for “identification” of a FBP, such as the number of organism-specific reads (positive calls), sequence depth (confidence), LOD, etc. for each platform and protocol, or number of differences distinguishing isogenic from non-isogenic isolates (Figure 5). There are, however, several efforts to improve the situation. Guidelines for harmonization of clinical testing by application of HTS have been published and may provide useful guidance also to other sectors including FBP detection (Gargis et al., 2012, 2015; Weiss et al., 2013; Aziz et al., 2015). The Global Microbial Identifier (GMI) was initiated in 2011 (GMI, 2011; Aarestrup et al., 2012). The background for this initiative was the high number of microbiological isolates that are characterized annually with very diverse and expensive typing systems, the increasing number of infectious diseases with global epidemiology requiring rapid detection and identification of microbial agents, the likelihood that microbiological laboratories (primarily clinical) will have DNA sequencers available in the foreseeable future, and that the likely future limiting factor is not the HTS cost but the assembly, processing and data handling in a standardized way to make the information useful (GMI, 2011). In 2015 the GMI launched the first HTS proficiency testing scheme in this respect (GMI, 2015; Moran-Gilad et al., 2015). Recent progress by GMI is reported by Taboada et al. (2017). The World Organization for Animal Health also recently launched its first standards for HTS, bioinformatics and computational genomics (OIE, 2017). Two other examples are the International Organization for Standardization's initiative to standardize WGS for typing and genomic characterization (ISO/TC 34/SC 9/WG 25) and the initiative to standardize the format of HTS derived SNP data (in a cancer context; Pipan and Kunaj, 2015), respectively.

Results Interpretation

HTS can generate a vast amount of data describing the microbial community of foods. When technical issues such as speed and standardization of laboratory and bioinformatics methods are improved, HTS and especially shotgun metagenomics may find applications in routine analysis of food, also for control purposes, as it has already done in clinical settings. Interpretation of the results is an important issue, for both the agri-food industry and the competent authorities, but also how to act on the results (Lambert et al., 2017; Taboada et al., 2017).

In the quality control of products and production lines, the detection of a pathogen, a genus that includes pathogenic species, and/or specific virulence genes/factors may, but will not necessarily lead to withdrawal of the product or lead to decisions to decontaminate the entire production line. The decision will depend on the specific context, i.e., the specific combination of pathogen marker, how well characterized the pathogen is, the type of product and the intended use. Quantitative data may also play a role, but sometimes are not or cannot be made available. Legal requirements can direct the decision processes. In the European Union (EU), both food safety criteria and process hygiene criteria are listed with details on sampling plans, limits, analytical reference methods and stage where the criterion applies (European Commission, 2005). Similarly, the safety of the food supply chain in the USA is regulated (FDA, 2011).

The application of WGS in microbiological risk assessments of foods is largely unexplored and faces important challenges. The number of hazards increase exponentially when zooming from serovars or serotypes to genotypes. The translation of multidimensional genotypic data into reduced information on phenotypes to ultimately generate a measure of risk that matches the requirements of food safety authorities and policy makers is therefore necessary (Franz et al., 2016). The presence or absence of specific genetic markers in a bacterial isolate or in a sample can direct the acceptance or rejection of a particular product. Furthermore, specific detection of FBP-markers may only be required within a narrow concentration range. For example, the observed presence of the STEC-associated virulence genes stx1 or stx2 in ground beef can lead to rejection or at least to particular caution and further examination to preclude the presence of STEC. We suggest that the observed background level of E. coli can provide a basis for decision in the case of STEC. The presence of E. coli itself is undesirable and un-tolerable background levels of E. coli, e.g., ≥ 10⁴ CFU g⁻¹ should lead to rejection. Thus, we believe it is most critical to ensure that the POD for STEC (markers) is acceptable within the entire range of concentrations at tolerable background levels, e.g., POD ≥ 99% at < 10⁴ CFU E. coli g⁻¹. We suggest that such an approximation can be used to guide both method developments and the enrichment efforts needed to obtain conclusive data. The infectious dose of FBPs vary, and for some FBPs it can be < 100 cells (Tilden et al., 1996; Tuttle et al., 1999; Hall, 2012) and must be considered for this type of calculation.

Recently, it has been questioned if all protists and helminths, historically thought of as “parasites” are indeed parasites (Lukes et al., 2015). Growing evidence of the diverse functional roles that specific bacteria can take on in humans, has forced us to acknowledge that beneficial and pathogenic may be two sides of the same organism, and that it must be understood in the broader contexts of ecology, symbiosis and diversity, among others. Collecting data on the entire microbial communities associated with their hosts may provide for better understanding of the functional interplay between hosts and hosted microorganisms, though so far no clear trend is emerging (Martin et al., 2015). In the future, perhaps, by learning more aided by HTS studies, it will be possible to stimulate beneficial and prevent pathogenic behavior by microorganisms with broad phenotypic potentials.

Capacity Challenges

Low Sequencing Costs, but High Costs Overall

HTS has profound impacts on both academic research and practical diagnostic and clinical surveillance. Despite this, there are several obstacles to its full adaptation in the routine detection of FBPs. The ideal HTS solution for the detection of FBPs should be rapid, accurate, operable, and economic. Obstacles include the purchase and implementation of the HTS platforms, but perhaps more importantly the subsequent data analysis and results interpretation (Loman et al., 2013; ECDC, 2015b). Sequencing costs have declined exponentially in the past decade, a development expected to continue. However, also costs associated with sample collection and preparation, nucleic acid extraction and purification, as well as any other steps in the preparation of sequencing libraries, sequencing, data management and downstream analyses must be considered (Sboner et al., 2011). As long as the rapid decline in the cost of data generation is not matched by a corresponding reduction in data storage, maintenance and processing costs, there will remain a substantial economic barrier preventing the full implementation of HTS in research and routine for FBP identification.

Computational Capacity is a Major Hurdle

HTS requires large computational capacities, i.e., various tools for data storage and data analysis, including data management, quality control, mapping and alignment, de novo assembly, scaffolding, gene annotation, metagenomics and biological significance interpretation (Cuccuru et al., 2014; Mayo et al., 2014). These bioinformatics tools, particularly their proper usage and output interpretation appear as an impenetrable barrier facing almost all scientists in either fields of food science and microbiology lacking or having weak bioinformatics background. A substantial effort will therefore be required to train relevant staff in use of these tools (Taboada et al., 2017). The application of HTS in FBP research and surveillance is also impeded or slowed down by the complexity of different formats of diverse software tools and limitations of computational capacities (Li et al., 2012; Xia et al., 2012; Cuccuru et al., 2014; Mayo et al., 2014). Several integrative frameworks consisting of publicly available research software and specifically designed pipelines are available for visualizing, querying and downloading the data released, and for HTS data processing and analysis for diverse microorganisms (Li et al., 2012; Wang et al., 2012; Cuccuru et al., 2014; Taboada et al., 2017). We believe it is essential that a corresponding user friendly web-based framework for FBPs detection using HTS is created. To fulfill this purpose, a close and effective collaboration between different scientists from related fields is urgently needed (Stapleton, 2014). These should jointly develop more automatic and reliable bioinformatics tools for sequencing reads' quality control (Edgar and Flyvbjerg, 2015), data compression and extraction (Wang and Zhang, 2011; Lassmann, 2015), data analysis on cloud computation (Kwon et al., 2015), and downstream analysis (Davis et al., 2015; Gweon et al., 2015; Müenz et al., 2015; Ratan et al., 2015; Wang et al., 2015a).

Database Limitations on Access and Contents

The establishment of a reference database for each major FBP species or subspecies poses yet another challenge. An accurate and complete reference database for all major FBPs is fundamental to epidemiological investigations and design of pathogen detection assays. The genome sequence is the prerequisite for understanding the molecular basis of a given phenotype of a FBP. Thus, genome sequencing is invaluable in advancing our understanding of virulence mechanisms and epidemiology of FBPs. Many complete or draft genome assemblies of the most common FBPs have been released (Mellmann et al., 2011; Timme et al., 2012; Schmitz-Esser and Wagner, 2014; Quick et al., 2015; Wu et al., 2015). This development is rapidly progressing and open data sharing including uploading of sequence data to public high-quality databases should be encouraged by all stakeholders. The availability of reference genomes is expected to speed up diagnosis and shorten FBD outbreaks, thus promoting public health. Additional complementary efforts are needed for more endemic FBPs.

Cost-Efficiency of Data Generation

The massive capacity of HTS can be utilized to find a causative agent or other target of relevance by unbiased ultra-deep sequencing, but this “brute force” approach will generate massive non-target data and, with current sequence pricing, is only applicable to research applications. Late availability of interpretable data can be a drawback of HTS, depending on the protocols used (Loman et al., 2012; Reuter et al., 2013; Quick et al., 2015). Quick et al. (2015) demonstrated (on Salmonella) that the time to answer in a hospital outbreak can be reduced significantly without impact on the results using draft shotgun metagenomics by rapid MiSeq sequencing, compared to standard protocols for MiSeq and HiSeq. The same study also demonstrated that samples could be assigned to species level within 20 min, serotype within 40 min and whether the isolate was part of the outbreak in less than an hour using the MinION sequencing technology and a mapping approach. With alignment-free bioinformatics the speed of analysis is further improved (Ondov et al., 2016). Third generation sequencing platforms can report sequences in real-time and this opens the possibility to selectively sequence only the nucleic acid of interest (Loose et al., 2016). That could permit more cost-efficient use of the sequencing capacity. These examples illustrate some recent improvements to the potential of HTS in management of outbreaks.

Global Imbalance in Capacity vs. Needs

The populations in developing countries and rural areas are more commonly struggling with FBPs than the inhabitants of major cities in developed, Western countries. The access to advanced analytical technologies is inversely distributed. HTS technologies are only exceptionally developed for point-of-care and in-field applications, but recent examples of outbreak investigations of Ebola and Zika viruses using the MinION (Quick et al., 2016, 2017) indicate that more robust and portable platforms may be available in the foreseeable future. However, even with access to the sequencers, the access to necessary consumables, computer capacities including databases, and competent manpower will likely remain obstacles to the implementation of HTS based analytical methods for many years. This is a serious challenge that must be solved in order to ensure that initiatives to build up global, collaborative networks and systems to cope with emerging FBPs are successful.

Conclusion

Despite the obvious benefits associated with potential independence from culturing, and the broad spectrum and level of detailed characterization of targets that can be detected simultaneously, we believe that in a short perspective HTS is unlikely to find application outside the most advanced laboratories. The reductionist tradition prevails in many laboratories, i.e., of testing analytically for single agents, in particular in the food sector. Detailed sequencing based characterization of reference strains and representative clinical, food and environmental samples is still in its relative infancy, with many FBP species still practically missing in the databases, and few species where the range of genetic diversity is close to completely characterized. However, as HTS studies accumulate so does the data pool and the appreciation of the potentials of the technology. Third generation HTS technology, as exemplified with the Oxford Nanopore MinION sequencer promise to permit on-site, long-read, real-time sequencing (Mikheyev and Tin, 2014; Greninger et al., 2015; Quick et al., 2015), but still struggles with laborious sample preparation requirements and high error rates. As these obstacles are mitigated, this type of technology will inevitably lead to a paradigm shift in FBP detection.

Author Contributions

CS drafted most of the introduction and section on bacterial food pathogens and contributed to the discussion; AH co-ordinated the manuscript drafting, wrote the sections on fungal and parasitic food pathogens, and vectors and other HTS applications, and contributed to the introduction, discussion and sections on bacterial and viral pathogens; UD, GJ, and WL contributed to the introduction, the section on bacterial pathogens and the discussion; BS drafted the section on viral pathogens and contributed to the introduction and discussion; and WL and JS initiated the review. All authors (CS, AH, UD, GJ, WL, BS, and JS) critically revised the manuscript.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

This review was undertaken in connection with research activities in the Decathlon project (FP7-KBBE-2013-7-613908-Decathlon) funded by the European Commission in the 7th Framework Programme. This publication and all its contents reflect the views of the authors only, and the Commission cannot be held responsible for any use which may be made of the information contained herein. Complementary financial support was obtained from the Research Council of Norway. The authors are grateful to a reviewer for critical and useful comments and recommendations.

References

Aarestrup, F. M., Brown, E. W., Detter, C., Gerner-Smidt, P., Gilmour, M. W., Harmsen, D., et al. (2012). Integrating genome-based informatics to modernize global disease monitoring, information sharing, and response. Emerg. Infect. Dis. 18:120453. doi: 10.3201/eid1811.120453

PubMed Abstract | CrossRef Full Text | Google Scholar

Adeolu, M., Alnajar, S., Naushad, S., and Gupta, R. S. (2016). Genome-based phylogeny and taxonomy of the ‘Enterobacteriales’: proposal for Enterobacterales ord. nov divided into the families Enterobacteriaceae, Erwiniaceae fam. nov., Pectobacteriaceae fam. nov., Yersiniaceae fam. nov., Hafniaceae fam. nov., Morganellaceae fam. nov., and Budviciaceae fam. nov. Int. J. Syst. Evol. Microbiol. 66, 5575–5599. doi: 10.1099/ijsem.0.001485

PubMed Abstract | CrossRef Full Text | Google Scholar

Ahrenfeldt, J., Skaarup, C., Hasman, H., Pedersen, A. G., Aarestrup, F. M., and Lund, O. (2017). Bacterial whole genome-based phylogeny: construction of a new benchmarking dataset and assessment of some existing methods. BMC Genomics 18:19. doi: 10.1186/s12864-016-3407-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Antonissen, G., Martel, A., Pasmans, F., Ducatelle, R., Verbrugghe, E., Vandenbroucke, V., et al. (2014). The impact of Fusarium mycotoxins on human and animal host susceptibility to infectious diseases. Toxins 6, 430–452. doi: 10.3390/toxins6020430

PubMed Abstract | CrossRef Full Text | Google Scholar

Arulandhu, A. J., Staats, M., Hagelaar, R., Voorhuijzen, M. M., Prins, T. W., Scholtens, I., et al. (2017). Development and validation of a multi-locus DNA metabarcoding method to identify endangered species in complex samples. GigaScience 6:gix080. doi: 10.1093/gigascience/gix080

PubMed Abstract | CrossRef Full Text | Google Scholar

Arulandhu, A. J., van Dijk, J. P., Dobnik, D., Holst-Jensen, A., Shi, J., Zel, J., et al. (2016). DNA enrichment approaches to identify unauthorized genetically modified organisms (GMOs). Anal. Bioanal. Chem. 408, 4575–4593. doi: 10.1007/s00216-016-9513-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Ashton, P. M., Nair, S., Peters, T. M., Bale, J. A., Powell, D. G., Painset, A., et al. (2016). Identification of Salmonella for public health surveillance using whole genome sequencing. Peerj 4:e1752. doi: 10.7717/peerj.1752

PubMed Abstract | CrossRef Full Text | Google Scholar

Ashton, P. M., Peters, T., Ameh, L., McAleer, R., Petrie, S., Nair, S., et al. (2015). Whole genome sequencing for the retrospective investigation of an outbreak of Salmonella Typhimurium DT 8. PLoS Curr. Outbreaks 2015:1. doi: 10.1371/currents.outbreaks.2c05a47d292f376afc5a6fcdd8a7a3b6

CrossRef Full Text | Google Scholar

Astridge, K., Barker, M., Bell, R., Combs, B., Boyle, C., Fearnley, E., et al. (2015). Monitoring the incidence and causes of diseases potentially transmitted by food in Australia: Annual report of the OzFoodNet network, (2011). Commun. Dis. Intell. 39, E236–E264. Available online at: http://www.health.gov.au/internet/main/publishing.nsf/Content/cda-cdi3902-pdf-cnt.htm/$FILE/cdi3902g.pdf

Aw, T. G., Wengert, S., and Rose, J. B. (2016). Metagenomic analysis of viruses associated with field-grown and retail lettuce identifies human and animal viruses. Int. J. Food Microbiol. 223, 50–56. doi: 10.1016/j.ijfoodmicro.2016.02.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Aziz, N., Zhao, Q., Bry, L., Driscoll, D. K., Funke, B., Gibson, J. S., et al. (2015). College of American Pathologists' laboratory standards for next-generation sequencing clinical tests. Arch. Pathol. Lab. Med. 139, 481–493. doi: 10.5858/arpa.2014-0250-CP

PubMed Abstract | CrossRef Full Text | Google Scholar

Barratt, J. L., Cao, M., Stark, D. J., and Ellis, J. T. (2015). The transcriptome sequence of Dientamoeba fragilis offers new biological insights on its metabolism, kinome, degradome and potential mechanisms of pathogenicity. Protist 166, 389–408. doi: 10.1016/j.protis.2015.06.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Barzon, L., Lavezzo, E., Militello, V., Toppo, S., and Palù, G. (2011). Applications of next-generation sequencing technologies to diagnostic virology. Int. J. Mol. Sci. 12, 7861–7884. doi: 10.3390/ijms12117861

PubMed Abstract | CrossRef Full Text | Google Scholar

Benedict, K., Chiller, T. M., and Mody, R. K. (2016). Invasive fungal infections acquired from contaminated food or nutritional supplements: a review of the literature. Foodborne Pathog. Dis. 13, 343–349. doi: 10.1089/fpd.2015.2108

CrossRef Full Text | Google Scholar

Bergholz, T. M., Switt, A. I., and Wiedmann, M. (2014). Omics approaches in food safety: fulfilling the promise? Trends Microbiol. 22, 275–281. doi: 10.1016/j.tim.2014.01.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Billmyre, R. B., Croll, D., Li, W., Mieczkowski, P., Carter, D. A., Cuomo, C. A., et al. (2014). Highly recombinant VGII Cryptococcus gattii population develops clonal outbreak clusters through both sexual macroevolution and asexual microevolution. mBio 5:e01494–14. doi: 10.1128/mBio.01494-14

PubMed Abstract | CrossRef Full Text | Google Scholar

Bitar, D., Lortholary, O., Le Strat, Y., Nicolau, J., Coignard, B., Tattevin, P., et al. (2014). Population-based analysis of invasive fungal infections, France, 2001-2010. Emerging Infect. Dis. 20, 1149–1155. doi: 10.3201/eid2007.140087

PubMed Abstract | CrossRef Full Text | Google Scholar

Blagden, T., Schneider, W., Melcher, U., Daniels, J., and Fletcher, J. (2016). Adaptation and validation of e-probe diagnostic nucleic acid analysis for detection of Escherichia coli O157:H7 in metagenomic data from complex food matrices. J. Food Prot. 79, 574–581. doi: 10.4315/0362-028X.JFP-15-440

PubMed Abstract | CrossRef Full Text | Google Scholar

Brown, G. D., Denning, D. W., and Levitz, S. M. (2012). Tackling human fungal infections. Science 336, 647–647. doi: 10.1126/science.1222236

PubMed Abstract | CrossRef Full Text | Google Scholar

Brüssow, H., Canchaya, C., and Hardt, W. D. (2004). Phages and the evolution of bacterial pathogens: from genomic rearrangements to lysogenic conversion. Microbiol. Mol. Biol. Rev. 68, 560–602. doi: 10.1128/MMBR.68.3.560-602.2004

PubMed Abstract | CrossRef Full Text | Google Scholar

Brzuszkiewicz, E., Thürmer, A., Schuldes, J., Leimbach, A., Liesegang, H., Meyer, F. D., et al. (2011). Genome sequence analyses of two isolates from the recent Escherichia coli outbreak in Germany reveal the emergence of a new pathotype: Entero-Aggregative-Haemorrhagic Escherichia coli (EAHEC). Arch. Microbiol. 193, 883–891. doi: 10.1007/s00203-011-0725-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Byrnes, E. J. III, Li, W., Lewit, Y., Ma, H., Voelz, K., Ren, P., et al. (2010). Emergence and pathogenicity of highly virulent Cryptococcus gattii genotypes in the northwest United States. PLoS Pathog. 6:e1000850. doi: 10.1371/journal.ppat.1000850

PubMed Abstract | CrossRef Full Text | Google Scholar

Cacho, R. A., Tang, Y., and Chooi, Y.-H. (2015). Next-generation sequencing approach for connecting secondary metabolites to biosynthetic gene clusters in fungi. Front. Microbiol. 5:774. doi: 10.3389/fmicb.2014.00774

PubMed Abstract | CrossRef Full Text | Google Scholar

Ceuppens, S., Delbeke, S., De Coninck, D., Boussemaere, J., Boon, N., and Uyttendaele, M. (2015). Characterization of the bacterial community naturally present on commercially grown basil leaves: evaluation of sample preparation prior to culture-independent techniques. Int. J. Environ. Res. Public Health 12, 10171–10197. doi: 10.3390/ijerph120810171

PubMed Abstract | CrossRef Full Text | Google Scholar

Ceuppens, S., Li, D., Uyttendaele, M., Renault, P., Ross, P., Van Ranst, M., et al. (2014). Molecular methods in food safety microbiology: interpretation and implications of nucleic acid detection. Comprehen. Rev. Food Science Food Safety 13, 551–577. doi: 10.1111/1541-4337.12072

CrossRef Full Text | Google Scholar

Chattaway, M. A., Dallman, T. J., Gentle, A., Wright, M. J., Long, S. E., Ashton, P. M., et al. (2016). Whole genome sequencing for public health surveillance of Shiga toxin-producing Escherichia coli other than serogroup O157. Front. Microbiol. 7:258. doi: 10.3389/fmicb.2016.00258

PubMed Abstract | CrossRef Full Text | Google Scholar

Chiapponi, C., Pavoni, E., Bertasi, B., Baioni, L., Scaltriti, E., Chiesa, E., et al. (2014). Isolation and genomic sequence of hepatitis A virus from mixed frozen berries in Italy. Food Environ. Virol. 6, 202–206. doi: 10.1007/s12560-014-9149-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Clemons, K. V., Salonen, J. H., Issakainen, J., Nikoskelainen, J., McCullough, M. J., Jorge, J. J., et al. (2010). Molecular epidemiology of Saccharomyces cerevisiae in an immunocompromised host unit. Diagn. Microbiol. Infect. Dis. 68, 220–227. doi: 10.1016/j.diagmicrobio.2010.06.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Collier, M. G., Khudyakov, Y. E., Selvage, D., Adams-Cameron, M., Epson, E., Cronquist, A., et al. (2014). Outbreak of hepatitis A in the USA associated with frozen pomegranate arils imported from Turkey: an epidemiological case study. Lancet Infect. Dis. 14, 976–981. doi: 10.1016/S1473-3099(14)70883-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Cook, N. (2013). Viruses in Food and Water: Risks, Surveillance and Control. Cambridge, UK: Woodhead Publishing.

Google Scholar

Crim, S. M., Iwamoto, M., Huang, J. Y., Griffin, P. M., Gilliss, D., Cronquist, A. B., et al. (2014). Incidence and trends of infection with pathogens transmitted commonly through food - Foodborne Diseases Active Surveillance Network: 10 US Sites, 2006-2013. MMWR 63, 328–332. Available online at: https://www.cdc.gov/mmwr/pdf/wk/mm6315.pdf

Google Scholar

Cuccuru, G., Orsini, M., Pinna, A., Sbardellati, A., Soranzo, N., Travaglione, A., et al. (2014). Orione, a web-based framework for NGS analysis in microbiology. Bioinformatics 30, 1928–1929. doi: 10.1093/bioinformatics/btu135

PubMed Abstract | CrossRef Full Text | Google Scholar

Davis, S., Pettengill, J., Luo, Y., Payne, J., Shpuntoff, A., Rand, H., et al. (2015). CFSAN SNP Pipeline: an automated method for constructing SNP matrices from next-generation sequence data. PeerJ Comput. Sci. 1:11. doi: 10.7717/peerj-cs.20

CrossRef Full Text | Google Scholar

Davis, T. K., Van De Kar, N. C. A. J., and Tarr, P. I. (2014). Shiga toxin/Verocytotoxin-producing Escherichia coli infections: practical clinical perspectives. Microbiol. Spect. 2:EHEC-0025-2014. doi: 10.1128/microbiolspec.EHEC-0025-2014

PubMed Abstract | CrossRef Full Text | Google Scholar

de Wit, M. A., Koopmans, M. P. G., and van Duynhoven, Y. (2003). Risk factors for norovirus, Sapporo-like virus, and group A rotavirus gastroenteritis. Emerging Infect. Dis. 9, 1563–1570. doi: 10.3201/eid0912.020076

PubMed Abstract | CrossRef Full Text | Google Scholar

Deagle, B. E., Jarman, S. N., Coissac, E., Pompanon, F., and Taberlet, P. (2014). DNA metabarcoding and the cytochrome c oxidase subunit I marker: not a perfect match. Biol. Lett. 10:20140562. doi: 10.1098/rsbl.2014.0562

PubMed Abstract | CrossRef Full Text | Google Scholar

Desnos-Ollivier, M., Patel, S., Raoux-Barbot, D., Heitman, J., Dromer, F., and Group, F. C. S. (2015). Cryptococcosis serotypes impact outcome and provide evidence of Cryptococcus neoformans speciation. mBio 6:e00311–15. doi: 10.1128/mBio.00311-15

PubMed Abstract | CrossRef Full Text | Google Scholar

Desselberger, U. (2014). Rotaviruses. Virus Res. 190, 75–96. doi: 10.1016/j.virusres.2014.06.016

PubMed Abstract | CrossRef Full Text | Google Scholar

ECDC (2015a). Annual Epidemiological Reports [Online]. European Centre for Disease Prevention and Control,. Available online at: http://ecdc.europa.eu/EN/PUBLICATIONS/SURVEILLANCE_REPORTS/annual_epidemiological_report/Pages/epi_index.aspx [Accessed 4th November 2015].

ECDC (2015b). Expert Opinion on the Introduction of Next-Generation Typing Methods for Food- and Waterborne Diseases in the EU and EEA. Stockholm: Sweden: European Centre for Disease Prevention and Control.

Edgar, R. C., and Flyvbjerg, H. (2015). Error filtering, pair assembly and error correction for next-generation sequencing reads. Bioinformatics 31, 3476–3482. doi: 10.1093/bioinformatics/btv401

PubMed Abstract | CrossRef Full Text | Google Scholar

EFSA (2011). Scientific Opinion on an update on the present knowledge on the occurrence and control of foodborne viruses. EFSA J. 9, 2190–2285. doi: 10.2903/j.efsa.2011.2190

CrossRef Full Text

EFSA (2013). Scientific opinion on VTEC-seropathotype and scientific criteria regarding pathogenicity assessment. EFSA J. 11, 3138–3243. doi: 10.2903/j.efsa.2013.3138

CrossRef Full Text

EFSA (2014). Use of Whole Genome Sequencing (WGS) of Food-Borne Pathogens for Public Health Protection. EFSA.

EFSA (2015a). The European Union summary report on trends and sources of zoonoses, zoonotic agents and food-borne outbreaks in (2013). EFSA J. 13, 165. doi: 10.2903/j.efsa.2015.3991

CrossRef Full Text

EFSA (2015b). The European Union summary report on trends and sources of zoonoses, zoonotic agents and food-borne outbreaks in (2014). EFSA J. 13, 191. doi: 10.2903/j.efsa.2015.4329

CrossRef Full Text

EMAN (2015). European Mycotoxin Awareness Network: Basic Factsheet Trichothecenes. European Mycotoxin Awareness Network. Available online at: http://eman.leatherheadfood.com/node/45

Emond-Rheault, J.-G., Jeukens, J., Freschi, L., Kukavica-Ibrulj, I., Boyle, B., Dupont, M.-J., et al. (2017). A syst-OMICS approach to ensuring food safety and reducing the economic burden of salmonellosis. Front. Microbiol. 8:996. doi: 10.3389/fmicb.2017.00996

PubMed Abstract | CrossRef Full Text | Google Scholar

European Commission (2005). Commission Regulation (EC) No 2073/2005 of 15 November 2005 on microbiological criteria for foodstuffs. Official J. Eur. Union 2005, 1–26. Available online at: http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2005:338:0001:0026:EN:PDF

FDA (2011). FDA Food Safety Modernization Act, Public Law 111-353-January 4th 2011, 124 Stat. (3885). Washington, DC: United States Government Printing Office; Government of the United States of America.

FDA (2013). Establishment Inspection Report, Chobani Idaho, FEEI: 3009726115. U.S. Food and Drug Administration. Available online at: http://www.fda.gov/ucm/groups/fdagov-public/@fdagov-afda-orgs/documents/document/ucm376634.pdf.

Fernandez-Cassi, X., Timoneda, N., Gonzales-Gustavson, E., Abril, J. F., Bofill-Mas, S., and Girones, R. (2017). A metagenomic assessment of viral contamination on fresh parsley plants irrigated with fecally tainted river water. Int. J. Food Microbiol. 257, 80–90. doi: 10.1016/j.ijfoodmicro.2017.06.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Ferri, E., Galimberti, A., Casiraghi, M., Airoldi, C., Ciaramelli, C., Palmioli, A., et al. (2015). Towards a universal approach based on omics technologies for the quality control of food. BioMed Res. Int. 2015, 14. doi: 10.1155/2015/365794

PubMed Abstract | CrossRef Full Text | Google Scholar

Findley, K., Oh, J., Yang, J., Conlan, S., Deming, C., Meyer, J. A., et al. (2013). Topographic diversity of fungal and bacterial communities in human skin. Nature 498, 367–370. doi: 10.1038/nature12171

PubMed Abstract | CrossRef Full Text | Google Scholar

Finkbeiner, S. R., Li, Y., Ruone, S., Conrardy, C., Gregoricus, N., Toney, D., et al. (2009). Identification of a novel astrovirus (Astrovirus VA1) associated with an outbreak of acute gastroenteritis. J. Virol. 83, 10836–10839. doi: 10.1128/JVI.00998-09

PubMed Abstract | CrossRef Full Text | Google Scholar

Fiore, A. E. (2004). Hepatitis A transmitted by food. Clin. Infect. Dis. 38, 705–715. doi: 10.1086/381671

PubMed Abstract | CrossRef Full Text | Google Scholar

Forbes, J. D., Knox, N. C., Ronholm, J., Pagotto, F., and Reimer, A. (2017). Metagenomics: the next culture-independent game changer. Front. Microbiol. 8:1069. doi: 10.3389/fmicb.2017.01069

PubMed Abstract | CrossRef Full Text | Google Scholar

Foth, B. J., Tsai, I. J., Reid, A. J., Bancroft, A. J., Nichol, S., Tracey, A., et al. (2014). Whipworm genome and dual-species transcriptome analyses provide molecular insights into an intimate host-parasite interaction. Nat. Genet. 46, 693–700. doi: 10.1038/ng.3010

PubMed Abstract | CrossRef Full Text | Google Scholar

Franz, E., Gras, L., and Dallman, T. (2016). Significance of whole genome sequencing for surveillance, source attribution and microbial risk assessment of foodborne pathogens. Curr. Opin. Food Sci. 8, 74–79. doi: 10.1016/j.cofs.2016.04.004

CrossRef Full Text | Google Scholar

Galbraith, E. A., Antonopoulos, D. A., and White, B. A. (2004). Suppressive subtractive hybridization as a tool for identifying genetic diversity in an environmental metagenome: the rumen as a model. Environ. Microbiol., 6, 928–937. doi: 10.1111/j.1462-2920.2004.00575.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Ganova-Raeva, L., Punkova, L., Campo, D. S., Dimitrova, Z., Skums, P., Vu, N. H., et al. (2015). Cryptic hepatitis B and E in patients with acute hepatitis of unknown etiology. J. Infect. Dis. 212, 1962–1969. doi: 10.1093/infdis/jiv315

PubMed Abstract | CrossRef Full Text | Google Scholar

Gargis, A. S., Kalman, L., Berry, M. W., Bick, D. P., Dimmock, D. P., Hambuch, T., et al. (2012). Assuring the quality of next-generation sequencing in clinical laboratory practice. Nat. Biotechnol. 30, 1033–1036. doi: 10.1038/nbt.2403

PubMed Abstract | CrossRef Full Text | Google Scholar

Gargis, A. S., Kalman, L., Bick, D. P., da Silva, C., Dimmock, D. P., Funke, B. H., et al. (2015). Good laboratory practice for clinical next-generation sequencing informatics pipelines. Nat. Biotechnol. 33, 689–693. doi: 10.1038/nbt.3237

PubMed Abstract | CrossRef Full Text | Google Scholar

Gillings, M. R., Paulsen, I. T., and Tetu, S. G. (2015). Ecology and evolution of the human microbiota: fire, farming and antibiotics. Genes 6, 841–857. doi: 10.3390/genes6030841

PubMed Abstract | CrossRef Full Text | Google Scholar

Gilmour, M. W., Graham, M., Van Domselaar, G., Tyler, S., Kent, H., Trout-Yakel, K. M., et al. (2010). High-throughput genome sequencing of two Listeria monocytogenes clinical isolates during a large foodborne outbreak. BMC Genomics 11:120. doi: 10.1186/1471-2164-11-120

PubMed Abstract | CrossRef Full Text | Google Scholar

GMI (2011). Perspectives of a Global, Real-Time Microbiological Genomic Identification System - Implications for National and Global Detection and Control of Infectious Diseases - Consensus Report of an Expert Meeting 1-2 September 2011, Bruxelles, Belgium: Global Microbial Identifier website.

GMI (2015). Protocol for GMI Proficiency Test, 2015 (Global Microbial Identifier). Available online at: http://www.globalmicrobialidentifier.org/-/media/Sites/gmi/Work-groups/GMI_PT_Protocol_v2_Incl_Appendices_2015_final_24082015.ashx?la=da.

Gomi, R., Matsuda, T., Matsui, Y., and Yoneda, M. (2014). Fecal source tracking in water by next-generation sequencing technologies using host-specific Escherichia coli genetic markers. Environ. Sci. Technol. 48, 9616–9623. doi: 10.1021/es501944c

PubMed Abstract | CrossRef Full Text | Google Scholar

Goodwin, S., Gurtowski, J., Ethe-Sayers, S., Deshpande, P., Schatz, M. C., and McCombie, W. R. (2015). Oxford Nanopore sequencing, hybrid error correction, and de novo assembly of a eukaryotic genome. Genome Res. 25, 1750–1756. doi: 10.1101/gr.191395.115

PubMed Abstract | CrossRef Full Text | Google Scholar

Goodwin, S., McPherson, J. D., and McCombie, W. R. (2016). Coming of age: ten years of next-generation sequencing technologies. Nat. Rev. Genet. 17, 333–351. doi: 10.1038/nrg.2016.49

PubMed Abstract | CrossRef Full Text | Google Scholar

Greninger, A. L., Naccache, S. N., Federman, S., Yu, G., Mbala, P., Bres, V., et al. (2015). Rapid metagenomic identification of viral pathogens in clinical samples by real-time nanopore sequencing analysis. Genome Med. 7:99. doi: 10.1186/s13073-015-0220-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Gurgui, M., Sanchez, F., March, F., Lopez-Contreras, J., Martino, R., Cotura, A., et al. (2011). Nosocomial outbreak of Blastoschizomyces capitatus associated with contaminated milk in a haematological unit. J. Hosp. Infect. 78, 274–278. doi: 10.1016/j.jhin.2011.01.027

PubMed Abstract | CrossRef Full Text | Google Scholar

Gweon, H. S., Oliver, A., Taylor, J., Booth, T., Gibbs, M., Read, D. S., et al. (2015). PIPITS: an automated pipeline for analyses of fungal internal transcribed spacer sequences from the Illumina sequencing platform. Methods Ecol. Evol. 6, 973–980. doi: 10.1111/2041-210X.12399

PubMed Abstract | CrossRef Full Text | Google Scholar

Hadfield, S. J., Pachebat, J. A., Swain, M. T., Robinson, G., Cameron, S. J. S., Alexander, J., et al. (2015). Generation of whole genome sequences of new Cryptosporidium hominis and Cryptosporidium parvum isolates directly from stool samples. BMC Genomics 16:650. doi: 10.1186/s12864-015-1805-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Hall, A. J. (2012). Noroviruses: the Perfect Human Pathogens? J. Infect. Dis. 205, 1622–1624. doi: 10.1093/infdis/jis251

PubMed Abstract | CrossRef Full Text | Google Scholar

Halliday, M. L., Kang, L. Y., Zhou, T. K., Hu, M. D., Pan, Q. C., Fu, T. Y., et al. (1991). An epidemic of hepatitis-A attributable to the ingestion of raw clams in Shanghai, China. J. Infect. Dis. 164, 852–859. doi: 10.1093/infdis/164.5.852

PubMed Abstract | CrossRef Full Text | Google Scholar

Hartmann, E. M., and Halden, R. U. (2012). Analytical methods for the detection of viruses in food by example of CCL-3 bioagents. Anal. Bioanal. Chem. 404, 2527–2537. doi: 10.1007/s00216-012-5974-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Hasman, H., Saputra, D., Sicheritz-Ponten, T., Lund, O., Svendsen, C. A., Frimodt-Möller, N., et al. (2014). Rapid whole-genome sequencing for detection and characterization of microorganisms directly from clinical samples. J. Clin. Microbiol. 52, 139–146. doi: 10.1128/JCM.02452-13

PubMed Abstract | CrossRef Full Text | Google Scholar

Hayes, S., Mahony, J., Nauta, A., and van Sinderen, D. (2017). Metagenomic approaches to assess bacteriophages in various environmental niches. Viruses 9:127. doi: 10.3390/v9060127

PubMed Abstract | CrossRef Full Text | Google Scholar

Hedberg, C. W., and Osterholm, M. T. (1993). Outbreaks of food-borne and waterborne viral gastroenteritis. Clin. Microbiol. Rev. 6, 199–210. doi: 10.1128/CMR.6.3.199

PubMed Abstract | CrossRef Full Text | Google Scholar

Henao, O. L., Jones, T. F., Vugia, D. J., Griffin, P. M., and Network, F. D. A. S. (2015). Foodborne diseases active surveillance network - 2 decades of achievements, 1996-2015. Emerging Infect. Dis. 21, 1529–1536. doi: 10.3201/eid2109.150581

PubMed Abstract | CrossRef Full Text | Google Scholar

Holland, J., Spindler, K., Horodyski, F., Grabau, E., Nichol, S., and VandePol, S. (1982). Rapid evolution of RNA genomes. Science 215, 1577–1585. doi: 10.1126/science.7041255

PubMed Abstract | CrossRef Full Text | Google Scholar

Holmes, A., Allison, L., Ward, M., Dallman, T. J., Clark, R., Fawkes, A., et al. (2015). Utility of whole-genome sequencing of Escherichia coli O157 for outbreak detection and epidemiological surveillance. J. Clin. Microbiol. 53, 3565–3573. doi: 10.1128/JCM.01066-15

PubMed Abstract | CrossRef Full Text | Google Scholar

Holst-Jensen, A., Johannessen, G., Sekse, C., Spilsberg, B., Dobnik, D., Dreo, T., et al. (2016). Minimum Performance Parameters for Molecular Analytical Methods - Deliverable 6.6 of the Decathlon Project. Available online at: http://www.decathlon-project.eu/reports-and-deliverables

Holst-Jensen, A., Rønning, S. B., Løvseth, A., and Berdal, K. G. (2003). PCR technology for screening and quantification of genetically modified organisms (GMOs). Anal. Bioanal. Chem. 375, 985–993. doi: 10.1007/s00216-003-1767-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Imamura, S., Haruna, M., Goshima, T., Kanezashi, H., Okada, T., and Akimoto, K. (2016a). Application of next-generation sequencing to evaluate the profile of noroviruses in pre- and post-depurated oysters. Foodborne Pathog. Dis. 13, 559–565. doi: 10.1089/fpd.2016.2150

PubMed Abstract | CrossRef Full Text | Google Scholar

Imamura, S., Haruna, M., Goshima, T., Kanezashi, H., Okada, T., and Akimoto, K. (2016b). Application of next-generation sequencing to investigation of norovirus diversity in shellfish collected from two coastal sites in Japan from 2013 to (2014). Japan. J. Veterin. Res. 64, 113–122. doi: 10.14943/jjvr.64.2.113

PubMed Abstract | CrossRef Full Text | Google Scholar

Iriart, X., Fior, A., Blanchet, D., Berry, A., Neron, P., and Aznar, C. (2010). Monascus ruber: invasive gastric infection caused by dried and salted fish consumption. J. Clin. Microbiol. 48, 3800–3802. doi: 10.1128/JCM.01000-10

PubMed Abstract | CrossRef Full Text | Google Scholar

Jarvis, K. G., White, J. R., Grim, C. J., Ewing, L., Ottesen, A. R., Beaubrun, J. J.-G., et al. (2015). Cilantro microbiome before and after nonselective pre-enrichment for Salmonella using 16S rRNA and metagenomic sequencing. BMC Microbiol. 15:160. doi: 10.1186/s12866-015-0497-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Jenkins, C., Dallman, T. J., Launders, N., Willis, C., Byrne, L., Jorgensen, F., et al. (2015). Public health investigation of two outbreaks of Shiga toxin-producing Escherichia coli O157 associated with consumption of watercress. Appl. Environ. Microbiol. 81, 3946–3952. doi: 10.1128/AEM.04188-14

PubMed Abstract | CrossRef Full Text | Google Scholar

Joensen, K. G., Scheutz, F., Lund, O., Hasman, H., Kaas, R. S., Nielsen, E. M., et al. (2014). Real-time whole-genome sequencing for routine typing, surveillance, and outbreak detection of verotoxigenic Escherichia coli. J. Clin. Microbiol. 52, 1501–1510. doi: 10.1128/JCM.03617-13

PubMed Abstract | CrossRef Full Text | Google Scholar

Jones, Y. L., Peters, S. M., Weland, C., Ivanova, N. V., and Yancy, H. F. (2013). Potential use of DNA barcodes in regulatory science: identification of the US food and drug administration's “Dirty 22,” contributors to the spread of foodborne pathogens. J. Food Prot. 76, 144–149. doi: 10.4315/0362-028X.JFP-12-168

CrossRef Full Text | Google Scholar

Junemann, S., Sedlazeck, F. J., Prior, K., Albersmeier, A., John, U., Kalinowski, J., et al. (2013). Updating benchtop sequencing performance comparison. Nat. Biotechnol. 31, 294–296. doi: 10.1038/nbt.2522

PubMed Abstract | CrossRef Full Text | Google Scholar

Kawai, T., Sekizuka, T., Yahata, Y., Kuroda, M., Kumeda, Y., Iijima, Y., et al. (2012). Identification of Kudoa septempunctata as the causative agent of novel food poisoning outbreaks in Japan by consumption of Paralichthys olivaceus in raw fish. Clin. Infect. Dis. 54, 1046–1052. doi: 10.1093/cid/cir1040

PubMed Abstract | CrossRef Full Text | Google Scholar

Kazan, E., Maertens, J., Herbrecht, R., Weisser, M., Gachot, B., Vekhoff, A., et al. (2011). A retrospective series of gut aspergillosis in haematology patients. Clin. Microbiol. Infect. 17, 588–594. doi: 10.1111/j.1469-0691.2010.03310.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Kergourlay, G., Taminiau, B., Daube, G., and Champomier Verges, M.-C. (2015). Metagenomic insights into the dynamics of microbial communities in food. Int. J. Food Microbiol. 213, 31–39. doi: 10.1016/j.ijfoodmicro.2015.09.010

PubMed Abstract | CrossRef Full Text | Google Scholar

Kundu, S., Lockwood, J., Depledge, D. P., Chaudhry, Y., Aston, A., Rao, K., et al. (2013). Next-generation whole genome sequencing identifies the direction of norovirus transmission in linked patients. Clin. Infect. Dis. 57, 407–414. doi: 10.1093/cid/cit287

PubMed Abstract | CrossRef Full Text | Google Scholar

Kupferschmidt, K. (2011). Scientists rush to study genome of lethal E. coli. Science 332, 1249–1250. doi: 10.1126/science.332.6035.1249

PubMed Abstract | CrossRef Full Text | Google Scholar

Kwon, T., Yoo, W. G., Lee, W.-J., Kim, W., and Kim, D.-W. (2015). Next-generation sequencing data analysis on cloud computing. Genes Genomics 37, 489–501. doi: 10.1007/s13258-015-0280-7

CrossRef Full Text | Google Scholar

Lambert, D., Pightling, A., Griffiths, E., Van Domselaar, G., Evans, P., Berthelet, S., et al. (2017). Baseline practices for the application of genomic data supporting regulatory food safety. J. AOAC Int. 100, 721–731. doi: 10.5740/jaoacint.16-0269

PubMed Abstract | CrossRef Full Text | Google Scholar

Lammers, Y., Peelen, T., Vos, R. A., and Gravendeel, B. (2014). The HTS barcode checker pipeline, a tool for automated detection of illegally traded species from high-throughput sequencing data. BMC Bioinformatics 15:44. doi: 10.1186/1471-2105-15-44

PubMed Abstract | CrossRef Full Text | Google Scholar

Lassen, S. G., Ethelberg, S., Bjorkman, J. T., Jensen, T., Sorensen, G., Jensen, A. K., et al. (2016). Two listeria outbreaks caused by smoked fish consumption-using whole-genome sequencing for outbreak investigations. Clin. Microbiol. Infect. 21, 620–624. doi: 10.1016/j.cmi.2016.04.017

CrossRef Full Text | Google Scholar

Lassmann, T. (2015). TagDust2: a generic method to extract reads from sequencing data. BMC Bioinformatics 16:24. doi: 10.1186/s12859-015-0454-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Lee, H. B., Patriarca, A., and Magan, N. (2015). Alternaria in food: ecophysiology, mycotoxin production and toxicology. Mycobiology 43, 93–106. doi: 10.5941/MYCO.2015.43.2.93

CrossRef Full Text | Google Scholar

Lee, S. C., Billmyre, R. B., Li, A., Carson, S., Sykes, S. M., Huh, E. Y., et al. (2014). Analysis of a food-borne fungal pathogen outbreak: virulence and genome of a Mucor circinelloides isolate from yogurt. mBio 5:e01390–14. doi: 10.1128/mBio.01390-14

PubMed Abstract | CrossRef Full Text | Google Scholar

Leekitcharoenphon, P., Nielsen, E. M., Kaas, R. S., Lund, O., and Aarestrup, F. M. (2014). Evaluation of whole genome sequencing for outbreak detection of Salmonella enterica. PLoS ONE 9:e87991. doi: 10.1371/journal.pone.0087991

PubMed Abstract | CrossRef Full Text | Google Scholar

Leichty, A. R., and Brisson, D. (2014). Selective whole genome amplification for resequencing target microbial species from complex natural samples. Genetics 198, 473–481. doi: 10.1534/genetics.114.165498

PubMed Abstract | CrossRef Full Text | Google Scholar

Leonard, S. R., Mammel, M. K., Lacher, D. W., and Elkins, C. A. (2015). Application of metagenomic sequencing to food safety: detection of Shiga toxin-producing Escherichia coli on fresh bagged spinach. Appl. Environ. Microbiol. 81, 8183–8191. doi: 10.1128/AEM.02601-15

PubMed Abstract | CrossRef Full Text | Google Scholar

Leonard, S. R., Mammel, M. K., Lacher, D. W., and Elkins, C. A. (2016). Strain-level discrimination of Shiga toxin-producing Escherichia coli in spinach using metagenomic sequencing. PLoS ONE 11:e0167870. doi: 10.1371/journal.pone.0167870

PubMed Abstract | CrossRef Full Text | Google Scholar

Leray, M., and Knowlton, N. (2015). DNA barcoding and metabarcoding of standardized samples reveal patterns of marine benthic diversity. Proc. Natl. Acad. Sci. U.S.A. 112, 2076–2081. doi: 10.1073/pnas.1424997112

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, J.-W., Schmieder, R., Ward, R. M., Delenick, J., Olivares, E. C., and Mittelman, D. (2012). SEQanswers: an open access community for collaboratively decoding genomes. Bioinformatics 28, 1272–1273. doi: 10.1093/bioinformatics/bts128

PubMed Abstract | CrossRef Full Text | Google Scholar

Lindsey, R. L., Pouseele, H., Chen, J. C., Strockbine, N. A., and Carleton, H. A. (2016). Implementation of Whole Genome Sequencing (WGS) for Identification and Characterization of Shiga Toxin-Producing Escherichia coil (STEC) in the United States. Front. Microbiol. 7:766. doi: 10.3389/fmicb.2016.00766

CrossRef Full Text | Google Scholar

Litvintseva, A. P., Hurst, S., Gade, L., Frace, M. A., Hilsabeck, R., Schupp, J. M., et al. (2014). Whole-genome analysis of Exserohilum rostratum from an outbreak of fungal meningitis and other infections. J. Clin. Microbiol. 52, 3216–3222. doi: 10.1128/JCM.00936-14

PubMed Abstract | CrossRef Full Text | Google Scholar

Litvintseva, A. P., Marsden-Haug, N., Hurst, S., Hill, H., Gade, L., Driebe, E. M., et al. (2015). Valley fever: finding new places for an old disease: Coccidioides immitis found in Washington State soil associated with recent human infection. Clin. Infect. Dis. 60, E1–E3. doi: 10.1093/cid/ciu681

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, L., Li, Y., Li, S., Hu, N., He, Y., Pong, R., et al. (2012). Comparison of next-generation sequencing systems. J. Biomed. Biotechnol. 2012:251364. doi: 10.1155/2012/251364

PubMed Abstract | CrossRef Full Text | Google Scholar

Livezey, K., Kaplan, S., Wisniewski, M., and Becker, M. M. (2013). A new generation of food-borne pathogen detection based on ribosomal RNA. Annu. Rev. Food. Sci. Technol. 4, 313–325. doi: 10.1146/annurev-food-050412-104448

PubMed Abstract | CrossRef Full Text | Google Scholar

Loman, N. J., Constantinidou, C., Christner, M., Rohde, H., Chan, J. Z. M., Quick, J., et al. (2013). A culture-independent sequence-based metagenomics approach to the investigation of an outbreak of Shiga-toxigenic Escherichia coli O104:H4. JAMA 309, 1502–1510. doi: 10.1001/jama.2013.3231

PubMed Abstract | CrossRef Full Text | Google Scholar

Loman, N. J., Misra, R. V., Dallman, T. J., Constantinidou, C., Gharbia, S. E., Wain, J., et al. (2012). Performance comparison of benchtop high-throughput sequencing platforms. Nat. Biotechnol. 30:434. doi: 10.1038/nbt0612-562f

PubMed Abstract | CrossRef Full Text | Google Scholar

Loman, N. J., and Pallen, M. J. (2015). Twenty years of bacterial genome sequencing. Nat. Rev. Microbiol. 13, 787–794. doi: 10.1038/nrmicro3565

PubMed Abstract | CrossRef Full Text | Google Scholar

Loman, N. J., Quick, J., and Simpson, J. T. (2015). A complete bacterial genome assembled de novo using only nanopore sequencing data. Nat. Methods 12, 733–U51. doi: 10.1038/nmeth.3444

PubMed Abstract | CrossRef Full Text | Google Scholar

Loose, M., Malla, S., and Stout, M. (2016). Real-time selective sequencing using nanopore technology. Nat. Methods 13, 751–754. doi: 10.1038/nmeth.3930

PubMed Abstract | CrossRef Full Text | Google Scholar

Lukes, J., Stensvold, C. R., Jirku-Pomajbikova, K., and Parfrey, L. W. (2015). Are human intestinal eukaryotes beneficial or commensals? PLoS Pathog. 11:e1005039. doi: 10.1371/journal.ppat.1005039

CrossRef Full Text | Google Scholar

Luo, C. W., Tsementzi, D., Kyrpides, N., Read, T., and Konstantinidis, K. T. (2012). Direct comparisons of Illumina vs. Roche 454 sequencing technologies on the same microbial community DNA sample. PLoS ONE 7:e30087. doi: 10.1371/journal.pone.0030087

PubMed Abstract | CrossRef Full Text | Google Scholar

Marchesi, J. R., and Ravel, J. (2015). The vocabulary of microbiome research: a proposal. Microbiome 3:3. doi: 10.1186/s40168-015-0094-5

PubMed Abstract | CrossRef Full Text

Martin, J., Rosa, B. A., Ozersky, P., Hallsworth-Pepin, K., Zhang, X., Bhonagiri-Palsikar, V., et al. (2015). Helminth.net: expansions to Nematode.net and an introduction to Trematode.net. Nucleic Acids Res. 43, D698–D706. doi: 10.1093/nar/gku1128

PubMed Abstract | CrossRef Full Text | Google Scholar

Martinovic, T., Andjelkovic, U., Gajdosik, M. S., Resetar, D., and Josic, D. (2016). Foodborne pathogens and their toxins. J. Proteomics 147, 226–235. doi: 10.1016/j.jprot.2016.04.029

PubMed Abstract | CrossRef Full Text | Google Scholar

Mayo, B., Rachid, C. T. C. C., Alegria, A., Leite, A. M. O., Peixoto, R. S., and Delgado, S. (2014). Impact of next generation sequencing techniques in food microbiology. Curr. Genomics 15, 293–309. doi: 10.2174/1389202915666140616233211

PubMed Abstract | CrossRef Full Text | Google Scholar

Mellmann, A., Harmsen, D., Cummings, C. A., Zentz, E. B., Leopold, S. R., Rico, A., et al. (2011). Prospective genomic characterization of the German enterohemorrhagic Escherichia coli O104:H4 outbreak by rapid next generation sequencing technology. PLoS ONE 6:e22751. doi: 10.1371/journal.pone.0022751

PubMed Abstract | CrossRef Full Text | Google Scholar

Menke, S., Gillingham, M. A. F., Wilhelm, K., and Sommer, S. (2017). Home-made cost effective preservation buffer is a better alternative to commercial preservation methods for microbiome research. Front. Microbiol. 8:102. doi: 10.3389/fmicb.2017.00102

PubMed Abstract | CrossRef Full Text | Google Scholar

Mikheyev, A. S., and Tin, M. M. (2014). A first look at the Oxford Nanopore MinION sequencer. Mol. Ecol. Resour. 14, 1097–1102. doi: 10.1111/1755-0998.12324

PubMed Abstract | CrossRef Full Text | Google Scholar

Mizukoshi, F., Kuroda, M., Tsukagoshi, H., Sekizuka, T., Funatogawa, K., Morita, Y., et al. (2014). A food-borne outbreak of gastroenteritis due to genotype G1P[8] rotavirus among adolescents in Japan. Microbiol. Immunol. 58, 536–539. doi: 10.1111/1348-0421.12176

PubMed Abstract | CrossRef Full Text | Google Scholar

Moore, M. D., Goulter, R. M., and Jaykus, L.-A. (2015). Human norovirus as a foodborne pathogen: challenges and developments. Annu. Rev. Food. Sci. Technol. 6, 411–433. doi: 10.1146/annurev-food-022814-015643

PubMed Abstract | CrossRef Full Text | Google Scholar

Moran-Gilad, J., Sintchenko, V., Pedersen, S. K., Wolfgang, W. J., Pettengill, J., Strain, E., et al. (2015). Proficiency testing for bacterial whole genome sequencing: an end-user survey of current capabilities, requirements and priorities. BMC Infect. Dis. 15:174. doi: 10.1186/s12879-015-0902-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Moreira, D., and López-García, P. (2009). Ten reasons to exclude viruses from the tree of life. Nat. Rev. Microbiol. 7, 306–311. doi: 10.1038/nrmicro2108

PubMed Abstract | CrossRef Full Text | Google Scholar

Moura, A., Criscuolo, A., Pouseele, H., Maury, M., Leclercq, A., Tarr, C., et al. (2016). Whole genome-based population biology and epidemiological surveillance of Listeria monocytogenes. Nat. Microbiol. 10:16185. doi: 10.1038/nmicrobiol.2016.185

CrossRef Full Text

Müenz, M., Ruark, E., Renwick, A., Ramsay, E., Clarke, M., Mahamdallie, S., et al. (2015). CSN and CAVA: variant annotation tools for rapid, robust next-generation sequencing analysis in the clinical setting. Genome Med. 7:76. doi: 10.1186/s13073-015-0195-6

CrossRef Full Text | Google Scholar

Newell, D. G., Koopmans, M., Verhoef, L., Duizer, E., Aidara-Kane, A., Sprong, H., et al. (2010). Food-borne diseases - The challenges of 20 years ago still persist while new ones continue to emerge. Int. J. Food Microbiol. 139, S3–S15. doi: 10.1016/j.ijfoodmicro.2010.01.021

PubMed Abstract | CrossRef Full Text | Google Scholar

Nieuwenhuijse, D. F., and Koopmans, M. P. G. (2017). Metagenomic sequencing for surveillance of food- and waterborne viral diseases. Front. Microbiol. 8:230. doi: 10.3389/fmicb.2017.00230

PubMed Abstract | CrossRef Full Text | Google Scholar

Octavia, S., Wang, Q., Tanaka, M. M., Kaur, S., Sintchenko, V., and Lan, R. (2015). Delineating community outbreaks of Salmonella enterica serovar Typhimurium by use of whole-genome sequencing: insights into genomic variability within an outbreak. J. Clin. Microbiol. 53, 1063–1071. doi: 10.1128/JCM.03235-14

PubMed Abstract | CrossRef Full Text | Google Scholar

OIE (2017). Manual of Diagnostic Tests and Vaccines for Terrestrial Animals. Paris: World Organisation for Animal Health.

Olsen, A. R., Gecan, J. S., Ziobro, G. C., and Bryce, J. R. (2001). Regulatory action criteria for filth and other extraneous materials V. Strategy for evaluating hazardous and nonhazardous filth. Regul. Toxicol. Pharmacol. 33, 363–392. doi: 10.1006/rtph.2001.1472

PubMed Abstract | CrossRef Full Text | Google Scholar

Ondov, B. D., Treangen, T. J., Melsted, P., Mallonee, A. B., Bergman, N. H., Koren, S., et al. (2016). Mash: fast genome and metagenome distance estimation using MinHash. Genome Biol. 17:132. doi: 10.1186/s13059-016-0997-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Orlandi, P. A., Chu, D. M. T., Bier, J. W., and Jackson, G. J. (2002). Parasites and the food supply. Food Technol. 56, 72–81. Available online at: http://www.ift.org/~/media/Knowledge%20Center/Science%20Reports/Scientific%20Status%20Summaries/parasitesfoodsupply_0402.pdf

Google Scholar

Ottesen, A., Ramachandran, P., Reed, E., White, J. R., Hasan, N., Subramanian, P., et al. (2016). Enrichment dynamics of Listeria monocytogenes and the associated microbiome from naturally contaminated ice cream linked to a listeriosis outbreak. BMC Microbiol. 16:275. doi: 10.1186/s12866-016-0894-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Ottesen, A. R., Gonzalez, A., Bell, R., Arce, C., Rideout, S., Allard, M., et al. (2013). Co-enriching microflora associated with culture based methods to detect Salmonella from tomato phyllosphere. PLoS ONE 8:e73079. doi: 10.1371/journal.pone.0073079

PubMed Abstract | CrossRef Full Text | Google Scholar

Petersen, T. N., Rasmussen, S., Hasman, H., Caroe, C., Baelum, J., Schultz, A. C., et al. (2015). Meta-genomic analysis of toilet waste from long distance flights; a step towards global surveillance of infectious diseases and antimicrobial resistance. Sci. Rep. 5:11444. doi: 10.1038/srep11444

CrossRef Full Text | Google Scholar

Pipan, V., and Kunaj, T. (2015). Initiative for standardization of the format of the next generation sequencing (NGS) results. Discoveries 3:4. doi: 10.15190/d.2015.36

CrossRef Full Text | Google Scholar

Pires, S. M., Fischer-Walker, C. L., Lanata, C. F., Devleesschauwer, B., Hall, A. J., Kirk, M. D., et al. (2015). Aetiology-specific estimates of the global and regional incidence and mortality of diarrhoeal diseases commonly transmitted through food. PLoS ONE 10:e0142927. doi: 10.1371/journal.pone.0142927

PubMed Abstract | CrossRef Full Text | Google Scholar

Pires, S. M., Vieira, A. R., Perez, E., Wong, D. L. F., and Hald, T. (2012). Attributing human foodborne illness to food sources and water in Latin America and the Caribbean using data from outbreak investigations. Int. J. Food Microbiol. 152, 129–138. doi: 10.1016/j.ijfoodmicro.2011.04.018

PubMed Abstract | CrossRef Full Text | Google Scholar

Quick, J., Ashton, P., Calus, S., Chatt, C., Gossain, S., Hawker, J., et al. (2015). Rapid draft sequencing and real-time nanopore sequencing in a hospital outbreak of Salmonella. Genome Biol. 16:114. doi: 10.1186/s13059-015-0677-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Quick, J., Grubaugh, N. D., Pullan, S. T., Claro, I. M., Smith, A. D., Gangavarapu, K., et al. (2017). Multiplex PCR method for MinION and Illumina sequencing of Zika and other virus genomes directly from clinical samples. Nat. Protoc. 12, 1261–1276. doi: 10.1038/nprot.2017.066

PubMed Abstract | CrossRef Full Text

Quick, J., Loman, N. J., Duraffour, S., Simpson, J. T., Ettore, S., Cowley, L., et al. (2016). Real-time, portable genome sequencing for Ebola surveillance. Nature 530, 228-+. doi: 10.1038/nature16996

PubMed Abstract | CrossRef Full Text | Google Scholar

Ranjan, R., Rani, A., Metwally, A., McGee, H. S., and Perkins, D. L. (2016). Analysis of the microbiome: advantages of whole genome shotgun versus 16S amplicon sequencing. Biochem. Biophys. Res. Commun. 469, 967–977. doi: 10.1016/j.bbrc.2015.12.083

PubMed Abstract | CrossRef Full Text | Google Scholar

Rasko, D. A., Webster, D. R., Sahl, J. W., Bashir, A., Boisen, N., Scheutz, F., et al. (2011). Origins of the E. coli strain causing an outbreak of hemolytic-uremic syndrome in Germany. New Engl. J. Med. 365, 709–717. doi: 10.1056/NEJMoa1106920

PubMed Abstract | CrossRef Full Text | Google Scholar

Ratan, A., Olson, T. L., Loughran, T. P. Jr., and Miller, W. (2015). Identification of indels in next-generation sequencing data. BMC Bioinformatics 16:42. doi: 10.1186/s12859-015-0483-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Reuter, S., Ellington, M. J., Cartwright, E. J. P., Köser, C. U., Török, M. E., Gouliouris, T., et al. (2013). Rapid bacterial whole-genome sequencing to enhance diagnostic and public health microbiology. JAMA Intern. Med. 173, 1397–1404. doi: 10.1001/jamainternmed.2013.7734

PubMed Abstract | CrossRef Full Text | Google Scholar

Rhoads, A., and Au, K. F. (2015). PacBio sequencing and its applications. Genom. Proteom. Bioinform. 13, 278–289. doi: 10.1016/j.gpb.2015.08.002

PubMed Abstract | CrossRef Full Text | Google Scholar

Robertson, L. J., Sprong, H., Ortega, Y. R., van der Giessen, J. W. B., and Fayer, R. (2014). Impacts of globalisation on foodborne parasites. Trends Parasitol. 30, 37–52. doi: 10.1016/j.pt.2013.09.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Roden, M. M., Zaoutis, T. E., Buchanan, W. L., Knudsen, T. A., Sarkisova, T. A., Schaufele, R. L., et al. (2005). Epidemiology and outcome of zygomycosis: a review of 929 reported cases. Clin. Infect. Dis. 41, 634–653. doi: 10.1086/432579

PubMed Abstract | CrossRef Full Text | Google Scholar

Rodríguez-Lazaro, D., Cook, N., Ruggeri, F. M., Sellwood, J., Nasser, A., Nascimento, M. S. J., et al. (2012). Virus hazards from food, water and other contaminated environments. FEMS Microbiol. Rev. 36, 786–814. doi: 10.1111/j.1574-6976.2011.00306.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Salmond, G. P. C., and Fineran, P. C. (2015). A century of the phage: past, present and future. Nat. Rev. Microbiol. 13, 777–786. doi: 10.1038/nrmicro3564

PubMed Abstract | CrossRef Full Text | Google Scholar

Sboner, A., Mu, X. J., Greenbaum, D., Auerbach, R. K., and Gerstein, M. B. (2011). The real cost of sequencing: higher than you think! Genome Biol. 12:125. doi: 10.1186/gb-2011-12-8-125

PubMed Abstract | CrossRef Full Text | Google Scholar

Scallan, E., Hoekstra, R. M., Angulo, F. J., Tauxe, R. V., Widdowson, M.-A., Roy, S. L., et al. (2011). Foodborne illness acquired in the United States - major pathogens. Emerging Infect. Dis. 17, 7–15. doi: 10.3201/eid1701.P11101

PubMed Abstract | CrossRef Full Text | Google Scholar

Scheutz, F., Nielsen, E. M., Frimodt-Möller, J., Boisen, N., Morabito, S., Tozzoli, R., et al. (2011). Characteristics of the enteroaggregative Shiga toxin/verotoxin-producing Escherichia coli O104:H4 strain causing the outbreak of haemolytic uraemic syndrome in Germany, May to June (2011). Eurosurveillance 16, 5–10. doi: 10.2807/ese.16.24.19889-en

PubMed Abstract | CrossRef Full Text | Google Scholar

Schmid, D., Allerberger, F., Huhulescu, S., Pietzka, A., Amar, C., Kleta, S., et al. (2014). Whole genome sequencing as a tool to investigate a cluster of seven cases of listeriosis in Austria and Germany, 2011-2013. Clin. Microbiol. Infect. 20, 431–436. doi: 10.1111/1469-0691.12638

PubMed Abstract | CrossRef Full Text | Google Scholar

Schmitz-Esser, S., and Wagner, M. (2014). Genome sequencing of Listeria monocytogenes. Methods Mol. Biol. 1157, 223–232. doi: 10.1007/978-1-4939-0703-8_19

PubMed Abstract | CrossRef Full Text | Google Scholar

Severi, E., Verhoef, L., Thornton, L., Guzman-Herrador, B. R., Faber, M., Sundqvist, L., et al. (2015). Large and prolonged food-borne multistate hepatitis A outbreak in Europe associated with consumption of frozen berries, 2013 to (2014). Eurosurveillance 20, 11–19. doi: 10.2807/1560-7917.ES2015.20.29.21192

PubMed Abstract | CrossRef Full Text | Google Scholar

Singer, E., Bushnell, B., Coleman-Derr, D., Bowman, B., Bowers, R. M., Levy, A., et al. (2016). High-resolution phylogenetic microbial community profiling. ISME J. 10, 2020–2032. doi: 10.1038/ismej.2015.249

PubMed Abstract | CrossRef Full Text | Google Scholar

Smits, S. L., Schapendonk, C. M. E., van Beek, J., Vennema, H., Schürch, A. C., Schipper, D., et al. (2014). New viruses in idiopathic human diarrhea cases, the Netherlands. Emerging Infect. Dis. 20, 1218–1222. doi: 10.3201/eid2007.140190

PubMed Abstract | CrossRef Full Text | Google Scholar

Spellberg, B. (2012). Gastrointestinal mucormycosis: an evolving disease. Gastroenterol. Hepatol. 8, 140–142.

PubMed Abstract | Google Scholar

Spilsberg, B., Lagesen, K., Kristoffersen, A. B., and Holst-Jensen, A. (2017). “Identification and quantification of genetically modified organisms (GMO) from high throughput sequencing data,” in qPCR dPCR & NGS (2017). eds S. Bustin and M. W. Pfaffl (Freising: Biomolecular Detection and Quantification), 11, S33.

Staats, M., Arulandhu, A. J., Gravendeel, B., Holst-Jensen, A., Scholtens, I., Peelen, T., et al. (2016). Advances in DNA metabarcoding for food and wildlife forensic species identification. Anal. Bioanal. Chem. 408, 4615–4630. doi: 10.1007/s00216-016-9595-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Stapleton, A. E. (2014). A biologist, a statistician, and a bioinformatician walk into a conference room…and walk out with a great metagenonnics project plan. Front. Plant Sci. 5:250. doi: 10.3389/fpls.2014.00250

CrossRef Full Text | Google Scholar

Stasiewicz, M. J., Oliver, H. F., Wiedmann, M., and den Bakker, H. C. (2015). Whole-genome sequencing allows for improved identification of persistent Listeria monocytogenes in food-associated environments. Appl. Environ. Microbiol. 81, 6024–6037. doi: 10.1128/AEM.01049-15

PubMed Abstract | CrossRef Full Text | Google Scholar

Stoev, S. D. (2015). Foodborne mycotoxicoses, risk assessment and underestimated hazard of masked mycotoxins and joint mycotoxin effects or interaction. Environ. Toxicol. Pharmacol. 39, 794–809. doi: 10.1016/j.etap.2015.01.022

CrossRef Full Text | Google Scholar

Struelens, M. J., Palm, D., and Takkinen, J. (2011). Enteroaggregative, Shiga toxin-producing Escherichia coli O104:H4 outbreak: new microbiological findings boost coordinated investigations by European public health laboratories. Eurosurveillance 16, 2–4. doi: 10.2807/ese.16.24.19890-en

PubMed Abstract | CrossRef Full Text | Google Scholar

Taboada, E. N., Graham, M. R., Carriço, J. A., and Van Domselaar, G. (2017). Food safety in the age of next generation sequencing, bioinformatics, and open data access. Front. Microbiol. 8:909. doi: 10.3389/fmicb.2017.00909

PubMed Abstract | CrossRef Full Text | Google Scholar

Tallon, L. J., Liu, X., Bennuru, S., Chibucos, M. C., Godinez, A., Ott, S., et al. (2014). Single molecule sequencing and genome assembly of a clinical specimen of Loa loa, the causative agent of loiasis. BMC Genomics 15:788. doi: 10.1186/1471-2164-15-788

PubMed Abstract | CrossRef Full Text | Google Scholar

Tan, B., Ng, C., Nshimyimana, J. P., Loh, L. L., Gin, K. Y. H., and Thompson, J. R. (2015). Next-generation sequencing (NGS) for assessment of microbial water quality: current progress, challenges, and future opportunities. Front. Microbiol. 6:1027. doi: 10.3389/fmicb.2015.01027

PubMed Abstract | CrossRef Full Text | Google Scholar

Taylor, A. J., Lappi, V., Wolfgang, W. J., Lapierre, P., Palumbo, M. J., Medus, C., et al. (2015). Characterization of foodborne outbreaks of Salmonella enterica serovar Enteritidis with whole-genome sequencing single nucleotide polymorphism-based analysis for surveillance and outbreak detection. J. Clin. Microbiol. 53, 3334–3340. doi: 10.1128/JCM.01280-15

PubMed Abstract | CrossRef Full Text | Google Scholar

Tilden, J., Young, W., McNamara, A. M., Custer, C., Boesel, B., LambertFair, M., et al. (1996). A new route of transmission for Escherichia coli: infection from dry fermented salami. Am. J. Public Health 86, 1142–1145. doi: 10.2105/AJPH.86.8_Pt_1.1142

PubMed Abstract | CrossRef Full Text | Google Scholar

Timme, R., Rand, H., Trees, E., Agarwala, R., David, S., Shumway, M., et al. (2015). “Benchmark datasets for validating foodborne outbreak investigations: integrating WGS and phylogenomic analyses,” in 1st ASM Conference on Rapid Next-Generation Sequencing and Bioinformatic Pipelines for Enhanced Molecular Epidemiologic Investigation of Pathogens (Washington, DC).

Timme, R. E., Allard, M. W., Luo, Y., Strain, E., Pettengill, J., Wang, C., et al. (2012). Draft genome sequences of 21 Salmonella enterica serovar Enteritidis strains. J. Bacteriol. 194, 5994–5995. doi: 10.1128/JB.01289-12

PubMed Abstract | CrossRef Full Text | Google Scholar

Tsai, I. J., Zarowiecki, M., Holroyd, N., Garciarrubio, A., Sanchez-Flores, A., Brooks, K. L., et al. (2013). The genomes of four tapeworm species reveal adaptations to parasitism. Nature 496, 57–63. doi: 10.1038/nature12031

PubMed Abstract | CrossRef Full Text | Google Scholar

Turabelidze, G., Lawrence, S. J., Gao, H., Sodergren, E., Weinstock, G. M., Abubucker, S., et al. (2013). Precise dissection of an Escherichia coli O157:H7 outbreak by single nucleotide polymorphism analysis. J. Clin. Microbiol. 51, 3950–3954. doi: 10.1128/JCM.01930-13

PubMed Abstract | CrossRef Full Text | Google Scholar

Tuttle, J., Gomez, T., Doyle, M. P., Wells, J. G., Zhao, T., Tauxe, R. V., et al. (1999). Lessons from a large outbreak of Escherichia coli O157: H7 infections: insights into the infectious dose and method of widespread contamination of hamburger patties. Epidemiol. Infect. 122, 185–192. doi: 10.1017/S0950268898001976

PubMed Abstract | CrossRef Full Text | Google Scholar

Underwood, A. P., Dallman, T., Thomson, N. R., Williams, M., Harker, K., Perry, N., et al. (2013). Public health value of next-generation DNA sequencing of enterohemorrhagic Escherichia coli isolates from an outbreak. J. Clin. Microbiol. 51, 232–237. doi: 10.1128/JCM.01696-12

PubMed Abstract | CrossRef Full Text | Google Scholar

van Dijk, E. L., Auger, H., Jaszczyszyn, Y., and Thermes, C. (2014). Ten years of next-generation sequencing technology. Trends Genet. 30, 418–426. doi: 10.1016/j.tig.2014.07.001

PubMed Abstract | CrossRef Full Text | Google Scholar

Vaux, S., Criscuolo, A., Desnos-Ollivier, M., Diancourt, L., Tarnaud, C., Vandenbogaert, M., et al. (2014). Multicenter outbreak of infections by Saprochaete clavata, an unrecognized opportunistic fungal pathogen. mBio 5:e02309–14. doi: 10.1128/mBio.02309-14

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, B., Cunningham, J. M., and Yang, X. (2015a). Seq2pathway: an R/Bioconductor package for pathway analysis of next-generation sequencing data. Bioinformatics 31, 3043–3045. doi: 10.1093/bioinformatics/btv289

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, C., and Zhang, D. (2011). A novel compression tool for efficient storage of genome resequencing data. Nucleic Acids Res. 39, E45–U74. doi: 10.1093/nar/gkr009

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Q., Holmes, N., Martinez, E., Howard, P., Hill-Cawthorne, G., and Sintchenko, V. (2015b). It is not all about single nucleotide polymorphisms: comparison of mobile genetic elements and deletions in Listeria monocytogenes genomes links cases of hospital-acquired listeriosis to the environmental source. J. Clin. Microbiol. 53, 3492–3500. doi: 10.1128/JCM.00202-15

CrossRef Full Text | Google Scholar

Wang, S.-H., Shen, M., Lin, H.-C., Sun, P.-L., Lo, H.-J., and Lu, J.-J. (2015c). Molecular epidemiology of invasive Candida albicans at a tertiary hospital in northern Taiwan from 2003 to (2011). Med. Mycol. 53, 828–836. doi: 10.1093/mmy/myv065

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, Y., Ye, Z., and Ying, Y. (2012). New trends in impedimetric biosensors for the detection of foodborne pathogenic bacteria. Sensors 12, 3449–3471. doi: 10.3390/s120303449

PubMed Abstract | CrossRef Full Text | Google Scholar

Weinmaier, T., Probst, A. J., La Duc, M. T., Ciobanu, D., Cheng, J. F., Ivanova, N., et al. (2015). A viability-linked metagenomic analysis of cleanroom environments: eukarya, prokaryotes, and viruses. Microbiome 3:62. doi: 10.1186/s40168-015-0129-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Weiss, M. M., Van der Zwaag, B., Jongbloed, J. D. H., Vogel, M. J., Brüggenwirth, H. T., Deprez, R. H. L., et al. (2013). Best practice guidelines for the use of next-generation sequencing applications in genome diagnostics: a national collaborative study of Dutch genome diagnostic laboratories. Hum. Mutat. 34, 1313–1321. doi: 10.1002/humu.22368

PubMed Abstract | CrossRef Full Text | Google Scholar

Withlow, L., and Hagler, W. Jr. (2016). Mold and Mycotoxin Issues in Dairy Cattle: Effects, Prevention and Treatment. EXTENSION [Online]. Available online at: http://articles.extension.org/pages/11768/mold-and-mycotoxin-issues-in-dairy-cattle:-effects-prevention-and-treatment

Wong, T. H. N., Dearlove, B. L., Hedge, J., Giess, A. P., Piazza, P., Trebes, A., et al. (2013). Whole genome sequencing and de novo assembly identifies Sydney-like variant noroviruses and recombinants during the winter 2012/2013 outbreak in England. Virol. J. 10:335. doi: 10.1186/1743-422X-10-335

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, Y., Zheng, J., Wang, Y., Li, S., Jin, H., Li, Z., et al. (2015). Draft genome sequence of Listeria monocytogenes LM201, isolated from foodstuff. Genome Announce. 3:e01417–14. doi: 10.1128/genomeA.01417-14

PubMed Abstract | CrossRef Full Text | Google Scholar

Wuyts, V., Denayer, S., Roosens, N. H. C., Mattheus, W., Bertrand, S., Marchal, K., et al. (2015). Whole genome sequence analysis of Salmonella Enteritidis PT4 outbreaks from a national reference laboratory's viewpoint. PLoS Curr. Outbreaks 2015:1. doi: 10.1371/currents.outbreaks.aa5372d90826e6cb0136ff66bb7a62fc

CrossRef Full Text

Xia, J., Wang, Q., Jia, P., Wang, B., Pao, W., and Zhao, Z. (2012). NGS Catalog: a database of next generation sequencing studies in humans. Hum. Mutat. 33, E2341–E2355. doi: 10.1002/humu.22096

PubMed Abstract | CrossRef Full Text | Google Scholar

Young, N. D., Nagarajan, N., Lin, S. J., Korhonen, P. K., Jex, A. R., Hall, R. S., et al. (2014). The Opisthorchis viverrini genome provides insights into life in the bile duct. Nat. Commun. 5:4378. doi: 10.1038/ncomms5378

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, S. K., Yin, Y. L., Jones, M. B., Zhang, Z. Z., Kaiser, B. L. D., Dinsmore, B. A., et al. (2015). Salmonella serotype determination utilizing high-throughput genome sequencing data. J. Clin. Microbiol. 53, 1685–1692. doi: 10.1128/JCM.00323-15

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: bacteria and viruses, fungi and parasites, metagenomics, microbial profiling, outbreak investigation, surveillance, metataxonomics, whole genome sequencing

Citation: Sekse C, Holst-Jensen A, Dobrindt U, Johannessen GS, Li W, Spilsberg B and Shi J (2017) High Throughput Sequencing for Detection of Foodborne Pathogens. Front. Microbiol. 8:2029. doi: 10.3389/fmicb.2017.02029

Received: 29 March 2017; Accepted: 04 October 2017;
Published: 20 October 2017.

Edited by:

David Rodriguez-Lazaro, University of Burgos, Spain

Reviewed by:

Eelco Franz, Centre for Infectious Disease Control, Netherlands
Mieke Uyttendaele, Ghent University, Belgium

Copyright © 2017 Sekse, Holst-Jensen, Dobrindt, Johannessen, Li, Spilsberg and Shi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Arne Holst-Jensen, arne.holst-jensen@vetinst.no

^†These authors have contributed equally to this work.

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.