
Whole genome sequencing data requirements
Users are requested to submit only high-quality assemblies, generated from pure cultures sequenced at a minimum coverage of 40X. Assembly files consisting of high numbers of contigs (> 500), or presenting a cumulative contigs length outside the typical range of Bordetella species (~ 3.2 – 6 Mbp), will not be accepted. Submissions containing low quality assemblies may be entirely rejected.
NB. For quality purposes, we only accept assemblies either generated from high quality short-reads or combining both short and long reads (hybrid assemblies). Please note that genomes obtained using long-read sequence technology, or by Ion Torrent/Roche/454 will not be uploaded to the database, nor used to define new alleles. However, if using one of these assemblies you discover a new MLST profile(s) composed solely of existing alleles, you may make a 'profile' submission type to define a new ST.
Please refer to the assembly metrics below:
| Species | Size of genome | Number of contigs | C+G% | Coverage |
|---|---|---|---|---|
| B. bronchiseptica | 5 000 000 - 5 600 000 | ≤ 500 | 67.8 - 68.7 | >= 40 |
| B. holmesii | 3 400 000 - 3 800 000 | ≤ 500 | 62.6 - 62.9 | >= 40 |
| B. parapertussis | 4 400 000 - 4 900 000 | ≤ 500 | 67.7 - 68.2 | >= 40 |
| B. pertussis | 3 300 000 - 4 100 000 | ≤ 500 | 67.1 - 68.1 | >= 40 |
| other | 3 700 000 - 5 900 000 | ≤ 500 | 59.4 - 69 | >= 40 |