Basic Statistics
Measure | Value |
---|---|
Filename | Geoduck-NMP-gDNA-5to7kb-8_S31_L008_R1_001.fastq.gz |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 341141 |
Sequences flagged as poor quality | 0 |
Sequence length | 35-151 |
%GC | 35 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN | 25504 | 7.476087600141877 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTTCCGTATCTCGTAT | 549 | 0.16093052432865002 | TruSeq Adapter, Index 14 (97% over 44bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGGCGGT | 35 | 0.009623709 | 130.85034 | 145 |
GATCGGA | 145 | 0.0 | 59.087997 | 1 |
GAAGAGC | 160 | 0.0 | 53.37799 | 6 |
TCGGAAG | 170 | 0.0 | 50.23811 | 3 |
CGGAAGA | 185 | 0.0 | 49.71588 | 4 |
AGAGCAC | 190 | 0.0 | 44.949886 | 8 |
GGAAGAG | 205 | 0.0 | 44.865555 | 5 |
ATCGGAA | 215 | 0.0 | 39.742138 | 2 |
AAGAGCA | 285 | 0.0 | 32.271713 | 7 |
GAGCACA | 295 | 0.0 | 31.177757 | 9 |
GGGGGGG | 180 | 0.0025349567 | 25.443121 | 145 |
AGTTCCG | 115 | 1.1734301E-8 | 14.954911 | 30-34 |
CGTATGC | 150 | 2.5616282E-7 | 11.816233 | 45-49 |
TGCCGTC | 175 | 1.375156E-7 | 11.022747 | 50-54 |
CCGTCTT | 210 | 1.657737E-5 | 10.904194 | 145 |
CAGTTCC | 175 | 2.4098154E-7 | 10.583477 | 30-34 |
GTTCCGT | 185 | 4.447811E-7 | 10.121729 | 35-39 |
CGTCTGA | 170 | 2.071707E-6 | 10.047622 | 15-19 |
CCGTATC | 175 | 2.3904468E-6 | 9.93582 | 35-39 |
ACGTCTG | 175 | 3.0008741E-6 | 9.760547 | 15-19 |