[ssml] Finding Matches using N-term & C-term sequences
Kevin Karplus
karplus at soe.ucsc.edu
Wed Dec 10 16:46:27 EST 2003
It sounds like you want to use a tool like phi-blast to do pattern searches.
(see http://www.ncbi.nlm.nih.gov/BLAST/producttable.shtml for a
selection guide to the NCBI programs). There is a pointer from that
table to documentation about the program:
Pattern-Hit Initiated (PHI) BLAST is designed to search for
proteins that contain a pattern specified by the user, AND are
similar to the query sequence in the vicinity of the pattern. This
dual requirement is intended to reduce the number of database hits
that contain the pattern, but are likely to have no true homology
to the query.
To run PHI-BLAST, enter your query (which contains one or more
instances of the pattern) into the "Search" box, and enter your
pattern into the "PHI pattern" box in the "Options"
section. Patterns must follow the syntax conventions of
PROSITE. The documentation on Pattern Syntax is at:
http://www.ncbi.nlm.nih.gov/blast/html/PHIsyntax.html.
More information about the ssml-general
mailing list