esl-seqrange - Man Page

determine a range of sequences for one of many parallel processes

Synopsis

esl-sfetch [options] seqfile procidx nproc

Description

esl-seqrange reads an SSI-indexed seqfile and determines the range of sequence indices in that file that process number procidx out of nproc total processes should operate on during a parallel processing of seqfile.

The seqfile must be indexed first using esl-sfetch --index seqfile. This creates an SSI index file seqfile.ssi. An SSI file is required in order for esl-seqrange to work.

Sequence index ranges are calculated using a simple rule: the number of sequences for each process should be identical, or as close as possible to identical, across all processes. The lengths of the sequences are not considered (even though they probably should be).

Options

-h

Print brief help; includes version number and summary of all options, including expert options.

--informat <s>

Assert that input seqfile is in format <s>, bypassing format autodetection. Common choices for <s> include: fasta, embl, genbank. Alignment formats also work; common choices include: stockholm, a2m, afa, psiblast, clustal, phylip. For more information, and for codes for some less common formats, see main documentation. The string <s> is case-insensitive (fasta or FASTA both work).

See Also

http://bioeasel.org/

Author

http://eddylab.org

Info

Nov 2020 Easel 0.48 Easel Manual