[BiO BB] Re: BiO_Bulletin_Board Digest, Vol 20, Issue 5
Yannick Wurm
Yannick.Wurm at unil.ch
Mon Jun 5 19:28:32 EDT 2006
Hi Daniel,
have a look at iprscan, which is a very complete tool:
http://www.ebi.ac.uk/InterProScan/
For command-line access, see:
http://www.ebi.ac.uk/Tools/webservices/WSInterProScan.html
You can also set it up locally.
For each protein, the output is one line per domain found on that
protein.
I concatenate the output files into one big file, and then count the
different domains I have.
Best,
yannick
___________________________________
yannick.wurm at unil.ch - Doctoral student
Department of Ecology and Evolution
http://www.unil.ch/dee/page28685.html
#3106, Biophore, Universite de Lausanne
1015 Lausanne, Switzerland
land: +41.21.692.4182 fax: +41.21.692.4165
cell: +41.78.87.87.001
On 5 juin 06, at 12:00, bio_bulletin_board-request at bioinformatics.org
wrote:
>
>
> I am after a way in which I can analyze large data sets of protein
> sequences, where the readout is a quantification of different protein
> domains that are found within a given list of sequences (e.g. a
> list of
> 500 protein sequences in FASTA format). Preferably the output
> would be
> at the systems level (e.g. 230 Tyrosine Kinase domains) rather than
> that
> describing domains only at a protein-by protein level.
>
> Thanks,
>
> Daniel
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.bioinformatics.org/pipermail/bbb/attachments/20060605/1f00bfa8/attachment.html>
More information about the BBB
mailing list