hmmalign man page

hmmalign — align sequences to a profile HMM

Synopsis

hmmalign [options] <hmmfile> <seqfile>

Description

Perform a multiple sequence alignment of all the sequences in <seqfile> by aligning them individually to the profile HMM in <hmmfile>. The new alignment is output to stdout in Stockholm format.

The <hmmfile> should contain only a single profile. If it contains more, only the first profile in the file will be used.

Either <hmmfile> or <seqfile> (but not both) may be '-' (dash), which means reading this input from stdin rather than a file.

The sequences in <seqfile> are aligned in unihit local alignment mode. Therefore they should already be known to contain only a single domain (or a fragment of one). The optimal alignment may assign some residues as nonhomologous (N and C states), in which case these residues are still included in the resulting alignment, but shoved to the outer edges. To trim these unaligned nonhomologous residues from the result, see the --trim option.

Options

-h
Help; print a brief reminder of command line usage and all available options.
-o <f>
Direct the output alignment to file <f>, rather than to stdout.
--mapali <f>
Merge the existing alignment in file <f> into the result, where <f> is exactly the same alignment that was used to build the model in <hmmfile>. This is done using a map of alignment columns to consensus profile positions that is stored in the <hmmfile>. The multiple alignment in <f> will be exactly reproduced in its consensus columns (as defined by the profile), but the displayed alignment in insert columns may be altered, because insertions relative to a profile are considered by convention to be unaligned data.
--trim
Trim nonhomologous residues (assigned to N and C states in the optimal alignments) from the resulting multiple alignment output.
--amino
Specify that all sequences in <seqfile> are proteins. By default, alphabet type is autodetected from looking at the residue composition.
--dna
Specify that all sequences in <seqfile> are DNAs.
--rna
Specify that all sequences in <seqfile> are RNAs.
--informat <s>
Declare that the input <seqfile> is in format <s>. Accepted sequence file formats include FASTA, EMBL, GenBank, DDBJ, UniProt, Stockholm, and SELEX. Default is to autodetect the format of the file.
--outformat <s>
Specify that the output multiple alignment is in format <s>. Currently the accepted multiple alignment sequence file formats only include Stockholm and SELEX.

See Also

See hmmer(1) for a master man page with a list of all the individual man pages for programs in the HMMER package.

For complete documentation, see the user guide that came with your HMMER distribution (Userguide.pdf); or see the HMMER web page ().

Author

The Eddy/Rivas Laboratory
Janelia Farm Research Campus
19700 Helix Drive
Ashburn VA 20147 USA
http://eddylab.org

Info

February 2015 HMMER 3.1b2 HMMER Manual