adaptercheck.py

adaptercheck.py searches fastq files for potential read-through contaminants in sequencing reads.

Where:

Run adaptercheck in the same folder as your read files.

Input:

Before running adaptercheck.py, select a single read file in blreads to process eg. DL300_R1.fastq.
The file must be an uncompressed fastq file.


Parameters:

Adapters to find - Choose an adapter file, or "custom adapter file"
adapter file - If custom adapter, choose a fasta file containing custom adapters
Size of 5' oligo - (default 12)  It is often instructive to re-run adaptercheck.py with a few different oligo sizes. The frequency with which an oligonucleotide of k nt  is expected to occur within a long sequence  is given in the table below. For example, if a fastq file had reads totaling to 4 million nucleotides, an 11-mer would be expected to be seen once by random chance. k-mers from the adapter used in sequencing will usually occur several orders of magnitude more frequently than background k-mers.

k 4k
10 1,048,576
11 4,194,304
12 16,777,216
13 67,108,864
14 268,435,456
15 1,073,741,824
16 4,294,967,296
17 17,179,869,184
18 68,719,476,736

If you using adaptercheck.py to determine which adapter was used for your reads, you may get hits above background for more than one adapter in the list. Use k=20, which will distinguish between adapters that differ within the first 20 nt.


WHERE TO SEND OUTPUT
- By default, the output will pop up in a spreadsheet. If Text file is chosen send to an output file.
Output file name - file to send report to a spreadsheet, or to a .tsv file.
 

Output:

Output is written to the input directory.