Basic Statistics
| Measure | Value |
|---|---|
| Filename | Geoduck-NMP-gDNA-5to7kb-8_S31_L008_R1_001.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 341141 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 35-151 |
| %GC | 35 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN | 25504 | 7.476087600141877 | No Hit |
| GATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTTCCGTATCTCGTAT | 549 | 0.16093052432865002 | TruSeq Adapter, Index 14 (97% over 44bp) |
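
The Percentage column above appears to be each Count divided by the Total Sequences value (341141) from Basic Statistics. The following is a minimal sketch of that arithmetic, assuming this is how the column is derived; the function name `percent_of_total` and the row labels are illustrative and not part of the FastQC output.

```python
# Assumption: Percentage = 100 * Count / Total Sequences.
TOTAL_SEQUENCES = 341141  # "Total Sequences" row in Basic Statistics


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of reads."""
    return 100.0 * count / total


# Counts copied from the Overrepresented sequences table; labels are illustrative.
counts = {
    "all-N read (35 bp)": 25504,
    "TruSeq Adapter, Index 14": 549,
}

for label, count in counts.items():
    print(f"{label}: {percent_of_total(count):.4f}%")
```

Under that assumption the computed values agree with the table: roughly 7.48% of reads are entirely N, and about 0.16% begin with the TruSeq adapter sequence.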
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GGGCGGT | 35 | 0.009623709 | 130.85034 | 145 |
| GATCGGA | 145 | 0.0 | 59.087997 | 1 |
| GAAGAGC | 160 | 0.0 | 53.37799 | 6 |
| TCGGAAG | 170 | 0.0 | 50.23811 | 3 |
| CGGAAGA | 185 | 0.0 | 49.71588 | 4 |
| AGAGCAC | 190 | 0.0 | 44.949886 | 8 |
| GGAAGAG | 205 | 0.0 | 44.865555 | 5 |
| ATCGGAA | 215 | 0.0 | 39.742138 | 2 |
| AAGAGCA | 285 | 0.0 | 32.271713 | 7 |
| GAGCACA | 295 | 0.0 | 31.177757 | 9 |
| GGGGGGG | 180 | 0.0025349567 | 25.443121 | 145 |
| AGTTCCG | 115 | 1.1734301E-8 | 14.954911 | 30-34 |
| CGTATGC | 150 | 2.5616282E-7 | 11.816233 | 45-49 |
| TGCCGTC | 175 | 1.375156E-7 | 11.022747 | 50-54 |
| CCGTCTT | 210 | 1.657737E-5 | 10.904194 | 145 |
| CAGTTCC | 175 | 2.4098154E-7 | 10.583477 | 30-34 |
| GTTCCGT | 185 | 4.447811E-7 | 10.121729 | 35-39 |
| CGTCTGA | 170 | 2.071707E-6 | 10.047622 | 15-19 |
| CCGTATC | 175 | 2.3904468E-6 | 9.93582 | 35-39 |
| ACGTCTG | 175 | 3.0008741E-6 | 9.760547 | 15-19 |
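
The nine 7-mers whose Max Obs/Exp Position is 1 through 9 overlap one another by six bases. The sketch below chains them in position order, assuming the reported position is where each k-mer starts; the helper `assemble` is hypothetical and not a FastQC function. It then checks the result against the overrepresented TruSeq adapter sequence listed earlier in the report.

```python
# 7-mers from the Kmer Content table with Max Obs/Exp Position 1..9.
# Assumption: the reported position is the start of the k-mer in the read.
kmers_by_position = {
    1: "GATCGGA", 2: "ATCGGAA", 3: "TCGGAAG",
    4: "CGGAAGA", 5: "GGAAGAG", 6: "GAAGAGC",
    7: "AAGAGCA", 8: "AGAGCAC", 9: "GAGCACA",
}

# Overrepresented sequence flagged as TruSeq Adapter, Index 14 in the report.
ADAPTER = "GATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTTCCGTATCTCGTAT"


def assemble(kmers):
    """Chain k-mers by start position, requiring consistent overlaps."""
    positions = sorted(kmers)
    contig = kmers[positions[0]]
    for prev, pos in zip(positions, positions[1:]):
        kmer = kmers[pos]
        overlap = len(kmer) - (pos - prev)  # bases shared with the previous k-mer
        if overlap <= 0 or not contig.endswith(kmer[:overlap]):
            raise ValueError(f"k-mer at position {pos} does not overlap the contig")
        contig += kmer[overlap:]
    return contig


contig = assemble(kmers_by_position)
print(contig)
print(ADAPTER.startswith(contig))
```

The chained k-mers give `GATCGGAAGAGCACA`, which is the first 15 bases of the adapter sequence, so the strongest enrichment at the start of the reads is consistent with the same adapter reported under Overrepresented sequences.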