Basic Statistics
| Measure | Value |
|---|---|
| Filename | Geoduck-NMP-gDNA-5to7kb-6_S23_L006_R1_001.fastq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 337457885 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 35-151 |
| %GC | 36 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN | 50321000 | 14.911786695990227 | No Hit |
| GATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTTCCGTATCTCGTAT | 704275 | 0.20870011675679173 | TruSeq Adapter, Index 14 (97% over 44bp) |
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GATCGGA | 314235 | 0.0 | 30.424515 | 1 |
| TCGGAAG | 322860 | 0.0 | 29.69602 | 3 |
| CGGAAGA | 323965 | 0.0 | 29.565157 | 4 |
| GAAGAGC | 335750 | 0.0 | 28.55025 | 6 |
| ATCGGAA | 357905 | 0.0 | 26.934465 | 2 |
| AGAGCAC | 370145 | 0.0 | 26.06464 | 8 |
| GGAAGAG | 395715 | 0.0 | 24.44751 | 5 |
| AAGAGCA | 412835 | 0.0 | 23.519438 | 7 |
| GAGCACA | 419030 | 0.0 | 23.122732 | 9 |
| AGATCGG | 225675 | 0.0 | 16.334044 | 1 |
| CGTATGC | 290600 | 0.0 | 10.273528 | 45-49 |
| CGTCTTC | 298530 | 0.0 | 10.106112 | 50-54 |
| TATGCCG | 293350 | 0.0 | 10.087253 | 45-49 |
| GGCGCGG | 18365 | 0.0 | 9.977952 | 145 |
| CCGTATC | 270840 | 0.0 | 9.792214 | 35-39 |
| TCTCGTA | 285885 | 0.0 | 9.562967 | 40-44 |
| CGGGGGG | 22845 | 0.0 | 9.479641 | 145 |
| TATCTCG | 292755 | 0.0 | 9.447189 | 40-44 |
| CTTCTGC | 337245 | 0.0 | 9.383907 | 55-59 |
| ATCTCGT | 299335 | 0.0 | 9.253742 | 40-44 |