Table 1 The data sets and reference genomes

From: Jabba: hybrid error correction for long sequencing reads

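| Data set | ID | Number of reads | Number of bases (Mbp) | Maximal read length (bp) | N50 (bp) | Estimated coverage |
|---|---|---|---|---|---|---|
| Escherichia coli | | | | | | |
| Reference | NC_000913 (a) | | | | | |
| Short reads | ART | 28.4 M | 2840 | 100 | 100 | 600× |
| Long reads | SRR1284073 (b) | 163 K | 649 | 49,424 | 13,578 | 135× |
| Aeromonas hydrophila | | | | | | |
| Reference | NC_008570 (a) | | | | | |
| Short reads | ART | 4.74 M | 474 | 100 | 100 | 100× |
| Long reads | pbsim | 515 | 4.74 | 24,430 | 10,421 | |
| Saccharomyces cerevisiae | | | | | | |
| Reference | NC_001133 (a) | | | | | |
| Short reads | ART | 9.72 M | 2430 | 250 | 250 | 200× |
| Long reads | SRR1284074 (b), SRR1284662 (b) | 1.96 M | 5580 | 37,008 | 3973 | 453× |
| Ostreococcus tauri | | | | | | |
| Reference | NC_014426 (a) | | | | | |
| Short reads | [30] | 9.72 M | 1778 | 76 | 76 | 135× |
| Long reads | [30] | 225 K | 1135 | 22,892 | 7322 | 86× |
| Arabidopsis thaliana | | | | | | |
| Reference | NC_003070 (a) | | | | | |
| Short reads | ART | 23.9 M | 5975 | 250 | 250 | 49× |
| Long reads | SRR1284093 (b), SRR1284094 (b) | 327 K | 1439 | 86,350 | 14,256 | 12× |
| Drosophila melanogaster | | | | | | |
| Reference | Release 5 (c) | | | | | |
| Short reads | ART | 24.1 M | 6025 | 250 | 250 | 49× |
| Long reads | SRR1204085 (b), SRR1204086 (b) | 327 K | 686 | 55,988 | 12,478 | |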
(a) Reference genome available at http://www.ncbi.nlm.nih.gov/nuccore
(b) Reads available at http://www.ncbi.nlm.nih.gov/sra
(c) Reference genome available at http://www.fruitfly.org/sequence/release5genomic.shtml
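
The N50 and estimated coverage columns can be illustrated with a minimal sketch. It uses the standard definitions, not necessarily the exact procedure the authors followed: N50 is taken as the largest length L such that reads of length at least L contain at least half of all bases, and coverage as total bases divided by genome size. The function names `n50` and `coverage` are hypothetical, and the 4.74 Mbp genome size in the example is an assumption inferred from the Aeromonas hydrophila short-read row (474 Mbp at 100×), not a value stated in the table.

```python
def n50(lengths):
    """Return the N50 of a collection of read lengths (standard definition)."""
    if not lengths:
        raise ValueError("at least one read length is required")
    total = sum(lengths)
    running = 0
    # Walk from the longest read down until half of all bases are covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


def coverage(total_bases, genome_size):
    """Estimated coverage: sequenced bases divided by genome size."""
    return total_bases / genome_size


# Fixed-length short reads: N50 equals the read length, as in the table.
uniform_n50 = n50([100] * 5)

# Assumed genome size of 4.74 Mbp (inferred from the table, see lead-in).
hydrophila_cov = coverage(474_000_000, 4_740_000)
```

For reads of a single fixed length, the maximal read length and the N50 coincide, which is why the two columns are identical in every short-read row.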