Alingment statistics

Description

Display basic information about a multiple sequence alignment.

This tool runs EMBOSS command infoalign that lists basic information about sequences in a multiple sequence alignment. This includes the sequences' name, two measures of length, counts of gaps, and numbers of identical, similar and different residues or bases in this sequence when compared to a reference sequence, together with a simple statistic of the % change between the reference sequence and this sequence.

The reference sequence is the one against which all the other sequences are compared using a specified substitution matrix. It is either the calculated consensus sequence of the alignment (the default) or it can be one of the set of aligned sequences, specified by either the ordinal number of that sequence in the input file, or by name. There are various options to control how the consensus is calculated.

The input sequences can be in any of the EMBOSS compatible sequence alignment formats. Note that this tool reads just one input file, that should contain all the sequences to be analyzed.

For more details, please check the manual pages of infoalign command.

Output

Output is a fasta formatted sequence file.