[BiO BB] using protein databases
l x yi
lxyiwc at yahoo.com
Fri Feb 6 20:13:56 EST 2004
Hi, I'm a new user of protein databases. Could someone help me with a few questions?
1. Can anyone tell me how to download 200 random sequences of length >150 from a reliable protein database?
2. For the profiles in Pfam, if there are gaps between two continuous patterns, does it make sense to search for only one of the patterns? Is it correct that the whole thing is a domain, and that the patterns separated by the gaps are motifs?
3. Also, in Pfam, at the UK site
http://www.sanger.ac.uk/Software/Pfam/data/jtml/seed/PF00047.shtml
I got results like
PIGR_HUMAN/33-112 GNSVSITCYYP.........PTSVNRHTRKYWCR....QGARGGCITLISSEGYVSSK............YAGRANLTNF
O60667/30-106 GGSVTIKCPLP.............EMHVRIYLC.......REMAGSGTCGTVVSTTNFIK........AEYKGRVTLKQY
TVA1_HUMAN/35-113 KEDVTLDCVYE...........TRDTTYYLFWY.......KQPPSGELVFLIRRNSFDEQ........NEISGRYSWNFQ
TVA1_MOUSE/35-112 GASLQLRCKYS............YSATPYLFWY.......VQYPRQGLQLLLKYYSGDPV........VQGVNGFEAEFS
TVA1_MOUSE/35-112-SS TSCEEECCCEC............CSSCCEEEEE.......EECTTCCCEEEEEECSSCSE........EECTTTCEEEEE
TVA1_MOUSE/35-112-SA 43603040406............2735020000.......01275521531041246633........172675040511
TVA2_MOUSE/36-111 GARTSLNCTFS............DSASQYFWWY.......RQHSGKAPKALMSIFSNGE..........KEEGRFTIHLN
TVB1_MOUSE/35-113 GQKAKMRCIPE.............KGHPVVFWY.......QQNKNNEFKFLINFQNQEVLQ......QIDMTEKRFSAEC
TVB7_MOUSE/35-113 GQEATLWCEPI.............SGHSAVFWY.......RQTIVQGLEFLTYFRNQAPID......DSGMPKERFSAQM
TCB_FLV/44-121 GQQVTLSCFPI.............SGHLSLYWY.......QQAVGQGPQLLIQYYNREER.......GKGNFPERFSAQQ
What do the numbers on line 6 (the TVA1_MOUSE/35-112-SA row) mean?
Thanks very much.
Lily
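
For question 1, here is a minimal sketch of one possible approach. It assumes a FASTA file from a curated protein database has already been downloaded to disk. The function names, the file name in the usage comment, and the choice of seed are illustrative assumptions, not part of any database's own tooling. The sketch reads the records once and uses reservoir sampling, so every sequence longer than 150 residues has the same chance of being picked without the whole file being held in memory.

```python
import random
from typing import Iterable, Iterator, List, Optional, Tuple

Record = Tuple[str, str]  # (header without ">", sequence)


def read_fasta(lines: Iterable[str]) -> Iterator[Record]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header: Optional[str] = None
    chunks: List[str] = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def sample_long_sequences(
    records: Iterable[Record],
    k: int = 200,
    min_length: int = 151,
    seed: Optional[int] = None,
) -> List[Record]:
    """Reservoir-sample up to k records with len(sequence) >= min_length."""
    rng = random.Random(seed)
    reservoir: List[Record] = []
    seen = 0  # number of qualifying records seen so far
    for header, seq in records:
        if len(seq) < min_length:
            continue
        seen += 1
        if len(reservoir) < k:
            reservoir.append((header, seq))
        else:
            # Keep the new record with probability k / seen.
            j = rng.randrange(seen)
            if j < k:
                reservoir[j] = (header, seq)
    return reservoir


# Hypothetical usage; "proteins.fasta" is a placeholder file name:
#
#     with open("proteins.fasta") as handle:
#         picked = sample_long_sequences(read_fasta(handle), k=200)
#     for header, seq in picked:
#         print(">" + header)
#         print(seq)
```

If the file holds fewer than 200 qualifying sequences, the function simply returns all of them, so it is worth checking the length of the result.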