SortFastaByCount.pl - Sort a preprocessed fasta file by their counts, showing the most abundant sequeces at the top
# call of the script, specifying all required + possible parameters. SortFastaByCount.pl --input fastafile.fa --output trimmedsequences.fa
The input file. Has to be a multiple fasta file. Fasta entries must have the 'count' property. Mandatory parameter
The output file. Will be a fasta file. Mandatory parameter
Display the help pages
This script sorts the fasta entries of a preprocessed file according to the count property. The most abundant sequences will be shown on top. Remember that preprocessing aggregates identical reads and tallies the counts.
Multiple fasta files. For example:
>43||Count=12 GAAATTTAAGAAACAATTATAATCCAC >44||Count=1 ATTCGCGTTCAGCTGAGGCAGAGTGATGGT >45||Count=2 TCCCTGTGGTCTATTGTTTATGATTCGGCT >46||Count=23 TCCCGGGGCGTCTAGTGGTTAGGGTTTGGCG >47||Count=6 TTCCTGTTGTCTAGTGGTTAGG
A sorted multiple fasta file. For example:
>46||Count=23 TCCCGGGGCGTCTAGTGGTTAGGGTTTGGCG >43||Count=12 GAAATTTAAGAAACAATTATAATCCAC >47||Count=6 TTCCTGTTGTCTAGTGGTTAGG >45||Count=2 TCCCTGTGGTCTATTGTTTATGATTCGGCT >44||Count=1 ATTCGCGTTCAGCTGAGGCAGAGTGATGGT
Perl 5.8 or higher
Robert Kofler
Heinz Himmelbauer
robert.kofler at crg.es