bl_seqkit_sample.py

bl_seqkit_sample.py runs seqkit sample to create files with reads sampled from a larger file.

Where:

Run bl_seqkit_sample.py in the same folder as your read files.

Input:

RNAseq reads: Before opening this menu, select RNA readfiles. For paired-end reads,  first use File --> guesspairs.py to group left and right read pair files, which will pop up in a new blreads window. Then run SeqKit sample from the new blreads window.

file extension of input files (eg. .fastq.gz): Of the files selected, only those with a specific file extension will be processed. Seqkit sample
works on both Fasta and Fastq files.
prefix to prepend to file extension - Each output file will have the same name as the input file, with a prefix added before the file extension. For example, with the default value of _S5, if the input filename was D104.fq.gz, the output filename would be D104_S5.fq.gz. This prefex can be almost anything, but a few guidelines are suggested:
Percentage of sequences written to output - Because there may be many different types of files in a directory which have similar names, setting this to Yes narrows down the file names that bl_seqkit_sample.py has to consider. If you choose yes, you must also put in at least one file extension below.

Output:

The current blreads window will be refreshed to show the output files in the current directory.