Search for patterns in nucleotide sequences.
This tool runs EMBOSS command fuzznuc that searches for a specified PROSITE-style pattern in nucleotide sequences. Such patterns are specifications of a (typically short) length of sequence to be found. They can specify a search for an exact sequence or they can allow various ambiguities, matches to variable lengths of sequence and repeated subsections of the sequence. One or more nucleotide sequences are read from file.
The standard IUPAC one-letter codes for the nucleotides are used in the search patterns. The symbol 'n' is used for a position where any nucleotide is accepted. In standard PROSITE pattern notation square parentheses '[ ]', and curly brackets '{ }' are used to define ambiguities, but at the moment these characters can't be used with this tool.
The input nucleotide sequences can be in any of the EMBOSS compatible sequence formats.
For more details, please check the manual page of fuzznuc command.
Output is a table or a text formatted report file.