Sequora dna
Part 1-2 - FASTA Splitting
Parts 1 and 2 have been combined into one step. For this step you take one FASTA file containing several small genomes (such as prokaryotes or mitochondria). You then upload this file as the Query FASTA and then you upload it again as the Reference FASTA.
This part of the program will divide each genome into "chunks" that can then be analyzed later on in the pipeline. By default, the Query file is divided into 1,000 base pair chunks, and the Reference file is divided into 10,000 base pair chunks.
Reference chunks are then concatenated accordingly, with the default option to combine the first, second to last, and last chunk for circularity purposes. Reverse compliment chunks are handled in the same way so that the complimentary sequences are not overlooked.
When I first designed this tool, I used one FASTA file with 6 bacterial genomes as both the Query and Reference file. This meant that there were 6 FASTA headers in each file. I suggest using the same file for both the Query and Reference, however this is by no means mandatory.