22 March 2018, Thursday, 08:23:01
All statistics are based on contigs of size >= 500 bp, unless otherwise noted (e.g., "# contigs (>= 0 bp)" and "Total length (>= 0 bp)" include all contigs).
Statistics without reference | Contigs.txt |
# contigs | 256466 |
# contigs (>= 0 bp) | 122167789 |
# contigs (>= 1000 bp) | 15788 |
# contigs (>= 5000 bp) | 1 |
# contigs (>= 10000 bp) | 1 |
# contigs (>= 25000 bp) | 0 |
# contigs (>= 50000 bp) | 0 |
Largest contig | 13067 |
Total length | 172180760 |
Total length (>= 0 bp) | 9864702575 |
Total length (>= 1000 bp) | 19606851 |
Total length (>= 5000 bp) | 13067 |
Total length (>= 10000 bp) | 13067 |
Total length (>= 25000 bp) | 0 |
Total length (>= 50000 bp) | 0 |
N50 | 645 |
N75 | 559 |
L50 | 102813 |
L75 | 174795 |
GC (%) | 32.57 |
Misassemblies | |
Unaligned | |
Mismatches | |
# N's | 0 |
# N's per 100 kbp | 0 |
Genome statistics | |
Predicted genes | |
Similarity statistics |
Plots: Cumulative length, Nx, GC content
[Cumulative length plot: x-axis, contig index (0 to 238000th contig); y-axis, cumulative length (0 to 200 Mbp).]
Contigs are ordered from largest (contig #1) to smallest.
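The N50, N75, L50 and L75 values reported above follow from the largest-to-smallest contig ordering used in the cumulative length plot. The following is a minimal sketch of that calculation as described in this report's metric glossary, not QUAST's actual implementation; the function name `nx_lx` and the toy contig lengths are hypothetical and chosen only for illustration.

```python
def nx_lx(lengths, fraction):
    """Return (Nx, Lx) for a list of contig lengths.

    Nx is the length of the contig at which the running total, taken
    from the largest contig downwards, first reaches `fraction` of the
    total assembly length. Lx is how many contigs were needed to get there.
    """
    ordered = sorted(lengths, reverse=True)  # largest contig first
    target = fraction * sum(ordered)
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= target:
            return length, count
    raise ValueError("no contigs given")


# Hypothetical toy assembly with a total length of 300 bp.
toy = [80, 70, 50, 40, 30, 20, 10]
n50, l50 = nx_lx(toy, 0.50)
n75, l75 = nx_lx(toy, 0.75)
print(n50, l50, n75, l75)
```

For the toy input the two largest contigs (80 + 70) already cover half of the 300 bp, so N50 is 70 and L50 is 2. In this report the same calculation would apply only to contigs of at least 500 bp, since that is the stated minimum contig size.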
{"subreports":[],"report":[["Genome statistics",[]],["Misassemblies",[]],["Unaligned",[]],["Mismatches",[{"values":[0],"quality":"Less is better","isMain":false,"metricName":"# N's"},{"values":["0.00"],"quality":"Less is better","isMain":true,"metricName":"# N's per 100 kbp"}]],["Statistics without reference",[{"values":[256466],"quality":"Equal","isMain":true,"metricName":"# contigs"},{"values":[122167789],"quality":"Equal","isMain":false,"metricName":"# contigs (>= 0 bp)"},{"values":[15788],"quality":"Equal","isMain":false,"metricName":"# contigs (>= 1000 bp)"},{"values":[1],"quality":"Equal","isMain":false,"metricName":"# contigs (>= 5000 bp)"},{"values":[1],"quality":"Equal","isMain":false,"metricName":"# contigs (>= 10000 bp)"},{"values":[0],"quality":"Equal","isMain":false,"metricName":"# contigs (>= 25000 bp)"},{"values":[0],"quality":"Equal","isMain":false,"metricName":"# contigs (>= 50000 bp)"},{"values":[13067],"quality":"More is better","isMain":true,"metricName":"Largest contig"},{"values":[172180760],"quality":"More is better","isMain":true,"metricName":"Total length"},{"values":[9864702575],"quality":"More is better","isMain":false,"metricName":"Total length (>= 0 bp)"},{"values":[19606851],"quality":"More is better","isMain":true,"metricName":"Total length (>= 1000 bp)"},{"values":[13067],"quality":"More is better","isMain":false,"metricName":"Total length (>= 5000 bp)"},{"values":[13067],"quality":"More is better","isMain":true,"metricName":"Total length (>= 10000 bp)"},{"values":[0],"quality":"More is better","isMain":false,"metricName":"Total length (>= 25000 bp)"},{"values":[0],"quality":"More is better","isMain":true,"metricName":"Total length (>= 50000 bp)"},{"values":[645],"quality":"More is better","isMain":false,"metricName":"N50"},{"values":[559],"quality":"More is better","isMain":false,"metricName":"N75"},{"values":[102813],"quality":"Less is better","isMain":false,"metricName":"L50"},{"values":[174795],"quality":"Less is better","isMain":false,"metricName":"L75"},{"values":["32.57"],"quality":"Equal","isMain":false,"metricName":"GC (%)"}]],["Predicted genes",[]],["Similarity statistics",[]],["Reference statistics",[]]],"assembliesWithNs":null,"referenceName":"","date":"22 March 2018, Thursday, 08:23:01","subreferences":[],"minContig":500,"order":[0],"assembliesNames":["Contigs.txt"]}
{"links_names":["View in Icarus contig browser"],"links":["icarus_viewers/contig_size_viewer.html"]}
"# contigs" : "is the total number of contigs in the assembly.",
"Largest contig" : "is the length of the longest contig in the assembly.",
"Total length" : "is the total number of bases in the assembly.",
"Reference length" : "is the total number of bases in the reference.",
"# contigs (>= 0 bp)" : "is the total number of contigs in the assembly that have size greater than or equal to 0 bp.",
"Total length (>= 0 bp)" : "is the total number of bases in the contigs having size greater than or equal to 0 bp.",
"N50" : "is the contig length such that using longer or equal length contigs produces half (50%) of the bases of the assembly. Usually there is no value that produces exactly 50%, so the technical definition is the maximum length x such that using contigs of length at least x accounts for at least 50% of the total assembly length.",
"NG50" : "is the contig length such that using longer or equal length contigs produces half (50%) of the bases of the reference genome. This metric is computed only if a reference genome is provided.",
"N75" : "is the contig length such that using longer or equal length contigs produces 75% of the bases of the assembly. Usually there is no value that produces exactly 75%, so the technical definition is the maximum length x such that using contigs of length at least x accounts for at least 75% of the total assembly length.",
"NG75" : "is the contig length such that using longer or equal length contigs produces 75% of the bases of the reference genome. This metric is computed only if a reference genome is provided.",
"L50" : "is the minimum number of contigs that produce half (50%) of the bases of the assembly. In other words, it's the number of contigs of length at least N50.",
"LG50" : "is the minimum number of contigs that produce half (50%) of the bases of the reference genome. In other words, it's the number of contigs of length at least NG50. This metric is computed only if a reference genome is provided.",
"L75" : "is the minimum number of contigs that produce 75% of the bases of the assembly. In other words, it's the number of contigs of length at least N75.",
"LG75" : "is the minimum number of contigs that produce 75% of the bases of the reference genome. In other words, it's the number of contigs of length at least NG75. This metric is computed only if a reference genome is provided.",
"NA50" : "is N50 where the lengths of aligned blocks are counted instead of contig lengths. I.e., if a contig has a misassembly with respect to the reference, the contig is broken into smaller pieces. This metric is computed only if a reference genome is provided.",
"NGA50" : "is NG50 where the lengths of aligned blocks are counted instead of contig lengths. I.e., if a contig has a misassembly with respect to the reference, the contig is broken into smaller pieces. This metric is computed only if a reference genome is provided.",
"NA75" : "is N75 where the lengths of aligned blocks are counted instead of contig lengths. I.e., if a contig has a misassembly with respect to the reference, the contig is broken into smaller pieces. This metric is computed only if a reference genome is provided.",
"NGA75" : "is NG75 where the lengths of aligned blocks are counted instead of contig lengths. I.e., if a contig has a misassembly with respect to the reference, the contig is broken into smaller pieces. This metric is computed only if a reference genome is provided.",
"LA50" : "is L50 where aligned blocks are counted instead of contigs. I.e., if a contig has a misassembly with respect to the reference, the contig is broken into smaller pieces.",
"LGA50" : "is LG50 where aligned blocks are counted instead of contigs. I.e., if a contig has a misassembly with respect to the reference, the contig is broken into smaller pieces.",
"LA75" : "is L75 where aligned blocks are counted instead of contigs. I.e., if a contig has a misassembly with respect to the reference, the contig is broken into smaller pieces.",
"LGA75" : "is LG75 where aligned blocks are counted instead of contigs. I.e., if a contig has a misassembly with respect to the reference, the contig is broken into smaller pieces.",
"Average %IDY" : "is the average of alignment identity percent (Nucmer measure of alignment accuracy) among all contigs.",
"# misassemblies" : "is the number of positions in the assembled contigs where the left flanking sequence aligns over 1 kbp away from the right flanking sequence on the reference (relocation) or they overlap on more than 1 kbp (relocation) or flanking sequences align on different strands (inversion) or different chromosomes (translocation).",
"# misassembled contigs" : "is the number of contigs that contain misassembly events.",
"Misassembled contigs length" : "is the number of total bases contained in all contigs that have one or more misassemblies.",
"# relocations" : "is the number of relocation events among all misassembly events. Relocation is a misassembly where the left flanking sequence aligns over 1 kbp away from the right flanking sequence on the reference, or they overlap by more than 1 kbp and both flanking sequences align on the same chromosome.",
"# translocations" : "is the number of translocation events among all misassembly events. Translocation is a misassembly where the flanking sequences align on different chromosomes.",
"# interspecies translocations" : "is the number of interspecies translocation events among all misassembly events. Interspecies translocation is a misassembly where the flanking sequences align on different references (based on alignments to the combined reference).",
"# inversions" : "is the number of inversion events among all misassembly events. Inversion is a misassembly where it is not a relocation and the flanking sequences align on opposite strands of the same chromosome.",
"# local misassemblies" : "is the number of local misassemblies. We define a local misassembly breakpoint as a breakpoint that satisfies these conditions:
- Two or more distinct alignments cover the breakpoint.
- The gap between left and right flanking sequences is less than 1 kbp.
- The left and right flanking sequences are both on the same strand of the same chromosome of the reference genome.