Check FASTA databases

Check FASTA databases

Description

Check FASTA databases for redundant id lines and protein/nucleotide sequences. If you find that there are redundancies and choose to receive output files which will help you in removing redundancies please process one database at a time.

Input files

  • at least 1 FASTA database file (.fa | .faa | .fas | .fasta)

Output files

  • Fixed database (fixed-database.fasta)

Context



Parameters

Max id line length

Default: 400 characters

Source code

check-fasta.rb, check-fasta.yaml (GitHub)