maskfeat

Function

Description

maskfeat reads a sequence with associated features and writes the same information to file but with features of the specified type omitted (masked). Sequence regions corresponding to the masked-out features are masked such that characters in those region are (optionally) converted to lower case and / or (optionally) replaced with the specified mask character.

Usage

Command line arguments


Input file format

maskfeat reads one or more nucleotide or protein sequences with features.

Output file format

The sequence file has features masked with the specified character.

Data files

None.

Notes

Formats such as EMBL or SWISSPROT include features in the sequence annotation. Otherwise, features they may be supplied in a separate file and specified with the qualifier -ufo or -fopenfile / -fformat.

The feature type to mask is by default repeat*, i.e. any type whose name starts with repeat will be omitted from the output file. Any other type of feature(s) may be specified with the -type qualifier. To specify more than one type of feature, separate their names with spaces or commas. The names may be wild-carded with asterisks * to find gruops of feature types sharing a common part of their names.

If you are unsure of the names of feature types in use, please consult http://www.ebi.ac.uk/Services/WebFeat/ (EMBL feature types) and see Appendix A of the Swissprot user manual in http://www.uniprot.org/manual/sequence_annotation (Swissprot feature types).

References

None.

Warnings

None.

Diagnostic Error Messages

None.

Exit status

It always exits with a status of 0.

Known bugs

None.

Author(s)

History

Target users

Comments