>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 29.2 bits (65), Expect = 0.022 Identities = 32/120 (26%), Positives = 51/120 (42%), Gaps = 18/120 (15%) Query: 157 IAEELGISRAQFD-----QFLRMMQGGAQFGGGYQQQSGGGNWQQAQRGPTLEDACNVLG 211 EEL R FD F+ + QQQ G G QQAQ T ++A Sbjct: 310 TLEEL---RDSFDGYINNAFVNQIHLNFVMPPQAQQQQGQGQQQQAQ--ATAQEAVAAAA 364 Query: 212 VKPTDDATTIKRAYRKLMS-EHHPDKLVAKGLPPEMMEMAKQKAQEIQ-QAYELIKQQKG 269 V+ + + I + Y+ L+ + H G+ M ++A Q+ ++ + Q KQQ+G Sbjct: 365 VRLLNGSDQIAQLYKDLVKLQRH------AGIRKAMEKLAAQQEEDAKNQGKGDCKQQQG 418
>SECYTRNLCASE#Preprotein translocase SecY subunit signature. Length = 437 Score = 26.3 bits (58), Expect = 0.047 Identities = 13/29 (44%), Positives = 17/29 (58%), Gaps = 1/29 (3%) Query: 2 SKYIYILLSF-LVLFFIFFYAYISLMSKE 29 IYI+ F L++FF FFY IS +E Sbjct: 314 DHPIYIVTYFLLIVFFAFFYVAISFNPEE 342
>PF06580#Sensor histidine kinase Length = 349 Score = 30.6 bits (69), Expect = 0.014 Identities = 17/80 (21%), Positives = 27/80 (33%), Gaps = 5/80 (6%) Query: 4 RRQPLIPGWLIPGVSAATLVVAVALAAFLALWWNAPQGDWVAVWQDS-YLWHVVRFSFWQ 62 R GWL + L V A +W+ A ++W+ ++ Sbjct: 60 RSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVAN----TSIWRLLAFINTKPVAFTLP 115 Query: 63 AFLSAQLSVVPAIFLARALY 82 LS +VV F+ LY Sbjct: 116 LALSIIFNVVVVTFMWSLLY 135
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 181 bits (459), Expect = 1e-51 Identities = 92/253 (36%), Positives = 125/253 (49%), Gaps = 21/253 (8%) Query: 343 GEGTPYENVRVANMQWNEQTQRYEFT---PAHDVDGPLITWTPENPEHGNVPGHTGN--D 397 G P + V V +N T YE T + ++TWTP +P P T Sbjct: 377 GVSVP-KAVPVRMAAYNATTGLYEVTVPSTTAEAPPLILTWTPASPPGNQNPSSTTPVVP 435 Query: 398 RPPLDQPTILVTPIPDGTDTYTTPPFPVPDPKEFNDYILVFPAGSGIKPIYVYLKEDPRK 457 +P +TP+ T +P D I+ FPA SGIKPIYV + DPR Sbjct: 436 KPVPVYEGATLTPV-----KATPETYPGVITLP-EDLIIGFPADSGIKPIYVMFR-DPRD 488 Query: 458 LPGVVTGHGVPLSPGTRWLDMSVSNNGNGAPIPAHIADKLRGREFKTFDEFREALWLEVS 517 +PG TG G P+ WL ++ G GAPIP+ IADKLRG+ FK + +FRE W+ V+ Sbjct: 489 VPGAATGKGQPV--SGNWLG--AASQGEGAPIPSQIADKLRGKTFKNWRDFREQFWIAVA 544 Query: 518 QDPELIAQFSSGNQTRIKQGLTAKAPIDGWHYGPKDIVKKFQIHHRVAIEYGGSVYDIDN 577 DPEL QF+ G+ ++ G G + K +IHH+V + GG VY++ N Sbjct: 545 NDPELSKQFNPGSLAVMRDGGAPYVRESE-QAGGR---IKIEIHHKVRVADGGGVYNMGN 600 Query: 578 LRIVTPRLHDEIH 590 L VTP+ H EIH Sbjct: 601 LVAVTPKRHIEIH 613
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 53.3 bits (127), Expect = 4e-12 Identities = 17/64 (26%), Positives = 31/64 (48%), Gaps = 4/64 (6%) Query: 1 MSQYPELIAQFSSGNQTRIKQGLIAKAPLEGWHYGTKEIVKKFHMYHRVAIEYSGGIYDI 60 ++ PEL QF+ G+ ++ G E G + K ++H+V + GG+Y++ Sbjct: 543 VANDPELSKQFNPGSLAVMRDGGAPYVR-ESEQAGGRI---KIEIHHKVRVADGGGVYNM 598 Query: 61 DNLR 64 NL Sbjct: 599 GNLV 602
>PF04605#Virulence-associated protein D (VapD) Length = 125 Score = 26.0 bits (57), Expect = 0.019 Identities = 13/34 (38%), Positives = 20/34 (58%) Query: 1 MYNFKDKIEDYTEREFIELLGEFTNPTGDNAQLK 34 Y+ K+ I+D ++F + L EFT T N +LK Sbjct: 88 QYSLKETIQDLCAKDFHQKLKEFTEKTPKNQKLK 121
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 44.0 bits (103), Expect = 3e-09 Identities = 13/53 (24%), Positives = 24/53 (45%), Gaps = 4/53 (7%) Query: 4 QFSTGNQTRIKQGLIAKAPLEGWHYGSKEIVKEFHIYHSVAIECGGEIYDIDN 56 QF+ G+ ++ G E G + + I+H V + GG +Y++ N Sbjct: 552 QFNPGSLAVMRDGGAPYVR-ESEQAGGRI---KIEIHHKVRVADGGGVYNMGN 600
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 29.2 bits (65), Expect = 0.017 Identities = 27/100 (27%), Positives = 40/100 (40%), Gaps = 22/100 (22%) Query: 110 MVKIEGGEWL----VETVQMLTERAVPVCGHLGLTPQSVNIFGGYKVQGRGDEAGDRLL- 164 V +E G L + V L AV GL P +V + D++G LL Sbjct: 176 TVTLEPGRALDEGQISAVVHLVSSAVA-----GLPPGNVTLV---------DQSG-HLLT 220 Query: 165 -SDALALEAAGAQLLVLECVPVELAKRITEALAIPVIGIG 203 S+ + AQL V + +RI L+ P++G G Sbjct: 221 QSNTSGRDLNDAQLKFANDVESRIQRRIEAILS-PIVGNG 259
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 805 bits (2081), Expect = 0.0 Identities = 260/869 (29%), Positives = 423/869 (48%), Gaps = 40/869 (4%) Query: 12 IATFCALLYSNSALCAELVEYDHTFLMGKDASNIDLSRYTEGNPTLPGIYDVSVYVNDQP 71 + CA AE + ++ FL + DLSR+ G PG Y V +Y+N+ Sbjct: 30 LFVACAFAAQAPLSSAE-LYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGY 88 Query: 72 IMSQSIAFAVIEGKKNAQACITQKNLLQFHISSPDKNSEKAILLKRDEDLGDCLNLAEMI 131 + ++ + F + ++ C+T+ L +++ + C+ L MI Sbjct: 89 MATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMN------LLADDACVPLTSMI 142 Query: 132 PQSSIRYDVNDQRLDIDVPQAWIMKNYQNYVDPSLWENGINAAMLSYNLNGYHSESP-GR 190 ++ + DV QRL++ +PQA++ + Y+ P LW+ GINA +L+YN +G ++ G Sbjct: 143 HDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGG 202 Query: 191 TNDSIYAAFNGGINLGAWRLRASGNYNWMTNVHS-----DYDFQNRYLQRDLASLRSQLV 245 + Y G+N+GAWRLR + +++ ++ S + N +L+RD+ LRS+L Sbjct: 203 NSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLT 262 Query: 246 IGESYTTGETFDSVSIRGIRLYSDSRMLPPVLASFAPIIHGVANTNAKVTVMQNGYKIYE 305 +G+ YT G+ FD ++ RG +L SD MLP FAP+IHG+A A+VT+ QNGY IY Sbjct: 263 LGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYN 322 Query: 306 TTVPPGAFAIDDLSPSGYGSDLIVTIEEADGTKRTFSQPFSSVVQMLRPGVGRWDISAGQ 365 +TVPPG F I+D+ +G DL VTI+EADG+ + F+ P+SSV + R G R+ I+AG+ Sbjct: 323 STVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGE 382 Query: 366 VLKD-SIQDEPNLFQASYYYGLNNYLTGYTGIQLTDNNYTAGLLGLGMNT-PVGAFSVDV 423 + Q++P FQ++ +GL T Y G QL D Y A G+G N +GA SVD+ Sbjct: 383 YRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLAD-RYRAFNFGIGKNMGALGALSVDM 441 Query: 424 THSNVSIPDDKTYQGQSYRISWNKLFENTSTSLNIAAYRYSTQHYLGLNDALTLIDEVEH 483 T +N ++PDD + GQS R +NK + T++ + YRYST Y D + Sbjct: 442 TQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYN 501 Query: 484 PEQELE--------PKSMRNYSRMKNQVTVSINQPLKFEKKDYGSFYLSGSWSDYWASGQ 535 E + + ++ +++ Q L + YLSGS YW + Sbjct: 502 IETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQL----GRTSTLYLSGSHQTYWGTSN 557 Query: 536 NSTNYSIGYSNSASWGSYSISAQRSLNE-DGQTDDSIYLSFTIPIENLLGTEHRSS-GFQ 
593 + G + + ++++S + N D + L+ IP + L ++ +S Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHA 617 Query: 594 SIDTQLNSDFKGNNQLNISSSGYSDT-NRISYSVNTGYMMNKSSDDLSYIGGYASYESPW 652 S ++ D G G N +SYSV TGY + S +Y + Sbjct: 618 SASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGY 677 Query: 653 GTLSGSASASSDNSRQFSLNTDGGFVLHSGGLTFSNDSFSDSDTLAVIQAPGAKGARINY 712 G + S S D +Q GG + H+ G+T +DT+ +++APGAK A++ Sbjct: 678 GNANIGYSHSDDI-KQLYYGVSGGVLAHANGVTLGQPL---NDTVVLVKAPGAKDAKVEN 733 Query: 713 GNST-VDRWGYGVTSALSPYHENRIALDINDLENDVELKSTSTVAVPRQGAVVFADFETV 771 D GY V + Y ENR+ALD N L ++V+L + VP +GA+V A+F+ Sbjct: 734 QTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKAR 793 Query: 772 QGQSAIMNIVRSDGKNIPFAADIYDEQNNIIGNVGQGGQAFVRGIGQEGNIRITWIEEGK 831 G +M + + K +PF A + E + G V GQ ++ G+ G +++ W EE Sbjct: 794 VGIKLLMT-LTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852 Query: 832 PVSCFAHYQQNTTSEKIAQSIILNGLRCQ 860 C A+YQ S++ Q + C+ Sbjct: 853 A-HCVANYQLPPESQQ--QLLTQLSAECR 878
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 506 bits (1304), Expect = 0.0 Identities = 290/296 (97%), Positives = 292/296 (98%) Query: 1 MSGLPLISRRRLLTAMALSPFLWQMNTAHAAVIDPNRIVALEWLPVELLLALGIVPYGVA 60 MSGLPLISRRRLLTAMALSP LWQMNTAHAA IDPNRIVALEWLPVELLLALGIVPYGVA Sbjct: 1 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60 Query: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR 120 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR Sbjct: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR 120 Query: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLTHYEDFIRSMKPRFVKRGARPLLLT 180 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHL YEDFIRSMKPRFVKRGARPLLLT Sbjct: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT 180 Query: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH Sbjct: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240 Query: 241 DNSKDMNALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRILDNAIGGKA 296 DNSKDM+ALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVR+LDNAIGGKA Sbjct: 241 DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 34.3 bits (78), Expect = 4e-04 Identities = 12/55 (21%), Positives = 28/55 (50%), Gaps = 4/55 (7%) Query: 186 NDYYRKVKELRAKNQITLPVILKNERQINVFLRT----EDIDLINVINEETLLQQ 236 + ++ EL A N T+ +K ++N+ +R D + I V +E+++++ Sbjct: 589 QNIKNQLAELNATNIYTVLDKIKLNAKMNILIRDKRFHYDRNNIAVGADESVVKE 643
>ICENUCLEATIN#Ice nucleation protein signature. Length = 1258 Score = 31.6 bits (71), Expect = 0.012 Identities = 30/107 (28%), Positives = 42/107 (39%), Gaps = 10/107 (9%) Query: 519 TETIGNDQKITVGLG--QTVNVGSKKEGGHDQKVTVANDQHLTIKNDRHKVVNNNQTSKV 576 T+T G D +T G G QT GS G+ T D L + QT+ Sbjct: 359 TQTAGEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIAG------YGSTQTAGE 412 Query: 577 TGTDTEEVVKKQSIKIGDNYELKVEHGTNIISGDSIELICGQGESGT 623 T T Q+ + G + L +G+ +GD LI G G + T Sbjct: 413 ESTQTAGYGSTQTAQKGSD--LTAGYGSTGTAGDDSSLIAGYGSTQT 457
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.1 bits (62), Expect = 0.044 Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%) Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70 S IA+ G++R + + + KS+ + + + I + + Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81 Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115 P + ++ + +L I + V+ Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 27.1 bits (60), Expect = 0.029 Identities = 15/42 (35%), Positives = 20/42 (47%), Gaps = 4/42 (9%) Query: 110 GQCRVERCF--RVTWPDTSEQYVALKTAVQSL--IPLVIATI 147 G R E + R P EQ+ A K VQ + P+VI T+ Sbjct: 294 GLYRTEFLYMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTL 335
>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature. Length = 104 Score = 146 bits (370), Expect = 2e-49 Identities = 84/102 (82%), Positives = 88/102 (86%) Query: 1 MAQHEVITRGGDAFLLKLRESALSSGSMSEEQFFLLIGISSIHSDRVILAMKDYLVSGHS 60 MA HEVI+R G+AFLL +RES L GSMSE FFLLIGISSIHSDRVILAMKDYLV GHS Sbjct: 1 MAHHEVISRSGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHS 60 Query: 61 RKDVCEKYQMNNGYFSTTLGRLTRLNVLVARLAPYYTDSVSA 102 RK+VCEKYQMNNGYFSTTLGRL RLN L ARLAPYYTD SA Sbjct: 61 RKEVCEKYQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSA 102
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 28.8 bits (64), Expect = 0.009 Identities = 39/160 (24%), Positives = 64/160 (40%), Gaps = 23/160 (14%) Query: 16 AALAGNHWHVMLPGGNMRFQGKIIAEACSLALSDRQMTVDMGQLSSNRFHAAGEYGDPVG 75 A L H H N+ F+GK+I AC++ + V+ G + +G G+ Sbjct: 15 AVLMSQHVHA---ADNLTFKGKLIIPACTV----QNAEVNWGDIEIQNLVQSG--GNQKD 65 Query: 76 FDIHLQDCSTVVSQRVGISFYGVSDIHEPELLSVEEENDASDGIAIALFNES----GELV 131 F + + ++ + +V I+ G + +L + DG+ I L+N + G V Sbjct: 66 FTVDMNCPYSLGTMKVTITSNGQTG---NSILVPNTSTASGDGLLIYLYNSNNSGIGNAV 122 Query: 132 KLNQPPENWVHLTRGDMKLHMQARYKATHYPVAGGKANGQ 171 L +T G + AR K T Y G K N Q Sbjct: 123 TLGSQ------VTPGKITGTAPAR-KITLYAKLGYKGNMQ 155
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 960 bits (2483), Expect = 0.0 Identities = 545/861 (63%), Positives = 690/861 (80%), Gaps = 9/861 (1%) Query: 25 RMRFNILPLAFFIGIIVSPAR------AELYFNPRFLSDDPDAVADLSAFTQGQELPPGV 78 + + + + + A AELYFNPRFL+DDP AVADLS F GQELPPG Sbjct: 18 IRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGT 77 Query: 79 YRVDIYLNDTYISTRDVQFQMSQDGKQLAPCLSPEHMSAMGVNRYAVPGMERLPADTCTS 138 YRVDIYLN+ Y++TRDV F + + PCL+ +++MG+N +V GM L D C Sbjct: 78 YRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVP 137 Query: 139 LNSMIQGATFRFDVGQQRLYLTVPQIYMSNQARGYIAPEYWDNGITAALLNYDFSGNRVR 198 L SMI AT + DVGQQRL LT+PQ +MSN+ARGYI PE WD GI A LLNY+FSGN V+ Sbjct: 138 LTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQ 197 Query: 199 DSYGGTSDYAYLNLKTGLNIGSWRLRDNTSWSYSAGKGYS--QNNWQHINTWLERDIVPL 256 + GG S YAYLNL++GLNIG+WRLRDNT+WSY++ S +N WQHINTWLERDI+PL Sbjct: 198 NRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPL 257 Query: 257 RSRLTMGDSYTRGDIFDGVNFRGIQLASDDNMVPDSQRGYAPTIHGISRGTSRISIRQNG 316 RSRLT+GD YT+GDIFDG+NFRG QLASDDNM+PDSQRG+AP IHGI+RGT++++I+QNG Sbjct: 258 RSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNG 317 Query: 317 YEIYQSTLPPGPFEINDIYPAGSGGDLQVTLQEADGSVQRFNVPWSSVPVLQREGHLKYA 376 Y+IY ST+PPGPF INDIY AG+ GDLQVT++EADGS Q F VP+SSVP+LQREGH +Y+ Sbjct: 318 YDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYS 377 Query: 377 LSAGEFRSGGHQQDNPRFAEGTLKYGLPAGWTVYGGAWIAERYRAFNLGVGKNMGWLGAV 436 ++AGE+RSG QQ+ PRF + TL +GLPAGWT+YGG +A+RYRAFN G+GKNMG LGA+ Sbjct: 378 ITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGAL 437 Query: 437 SLDATRANARLPDESRYDGQSYRFLYNKSLTETGTNIQLIGYRYSTRGYFSFADTAWKKM 496 S+D T+AN+ LPD+S++DGQS RFLYNKSL E+GTNIQL+GYRYST GYF+FADT + +M Sbjct: 438 SVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRM 497 Query: 497 SGYSVLTQDGVIQIQPKYTDYYNLAYNKRGRVQVSISQQTGESSTLYLSGSHQSYWGTDR 556 +GY++ TQDGVIQ++PK+TDYYNLAYNKRG++Q++++QQ G +STLYLSGSHQ+YWGT Sbjct: 498 
NGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSN 557 Query: 557 TDRQLNAGFNSSVNDISWSLNYSLSRNAWQHETDRILSFDVSIPFSHWMRSDSTSAWRNA 616 D Q AG N++ DI+W+L+YSL++NAWQ D++L+ +V+IPFSHW+RSDS S WR+A Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHA 617 Query: 617 SARYSQTLEAHGQAASTAGLYGTLLEDNNLGYSIQSGYTRGGYEGSSKTGYASLNYRGGY 676 SA YS + + +G+ + AG+YGTLLEDNNL YS+Q+GY GG S TGYA+LNYRGGY Sbjct: 618 SASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGY 677 Query: 677 GNASAGYSHSGGYRQLYYGLSGGILAHANGLTLSQPLGDTLILVRAPGASDTRIENQTGV 736 GNA+ GYSHS +QLYYG+SGG+LAHANG+TL QPL DT++LV+APGA D ++ENQTGV Sbjct: 678 GNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGV 737 Query: 737 STDWRGYAVLPYATDYRENRVALDTNTLADNVDIENTVVSVVPTHGAVVRADYKTRVGVK 796 TDWRGYAVLPYAT+YRENRVALDTNTLADNVD++N V +VVPT GA+VRA++K RVG+K Sbjct: 738 RTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIK 797 Query: 797 VLMTLMRNGKAVPFGSVVTARNGGS-SIAGENGQVYLSGMPLSGQVSVKWGSQTTDQCTA 855 +LMTL N K +PFG++VT+ + S I +NGQVYLSGMPL+G+V VKWG + C A Sbjct: 798 LLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVA 857 Query: 856 DYKLPKESAGQILSHVTVSCR 876 +Y+LP ES Q+L+ ++ CR Sbjct: 858 NYQLPPESQQQLLTQLSAECR 878
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 30.4 bits (68), Expect = 0.004 Identities = 25/91 (27%), Positives = 42/91 (46%), Gaps = 5/91 (5%) Query: 4 SLAHENARLRALLQTQQDTIRQMAEYNRLLSQRVATYASEINRLKALVAKLQRMQFGKSS 63 + A A AL Q +D + + +N + A N A+ A+ +R++ K+ Sbjct: 79 AQAKAKANRDALTQRLKDIVNEALRHNASRTPSATELAHANN--AAMQAEDERLRLAKAE 136 Query: 64 EKLR---AKTERQIQEAQERISALQEEMAET 91 EK R E+ QEA++R ++ E AET Sbjct: 137 EKARKEAEAAEKAFQEAEQRRKEIEREKAET 167
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.023 Identities = 7/20 (35%), Positives = 17/20 (85%) Query: 26 RAARILGISQSAISQKIKKL 45 +AA +LG++++ + +KI++L Sbjct: 454 KAADLLGLNRNTLRKKIREL 473
>cdtoxina#Cytolethal distending toxin A signature. Length = 258 Score = 28.1 bits (62), Expect = 0.008 Identities = 14/60 (23%), Positives = 24/60 (40%), Gaps = 5/60 (8%) Query: 51 SGVELLPVEITPDEQKVPMTAIAPSLSTSTQTTVCASSCKVEFRHGKMTLENPSPELLTV 110 VE P +PDE +P+ P+L T+ + ++L N +LT+ Sbjct: 38 PQVEGGPTVPSPDEPGLPLPGPGPALPTNGAIPIPEPGTAPA-----VSLMNMDGSVLTM 92
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 42.7 bits (100), Expect = 8e-06 Identities = 91/438 (20%), Positives = 140/438 (31%), Gaps = 41/438 (9%) Query: 286 TAGNTTINQNGELKVHAGGEASDVTQNTGGALVTSTAATVTGTNRLGAFSVVEGKADNVV 345 T + I G +H G S ++ + V VT GA + V + + Sbjct: 172 TVQRSAIVDGG---LHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASEL 228 Query: 346 LENGGRLDVLSGHTATNTRVDDGGTLDVRNGGTATTVSMGNGGVLLADSGAAVSGTRSDG 405 +GG + G A + R G A G AV G G Sbjct: 229 TLDGGH--ITGGRAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPG 286 Query: 406 TAFRIGGGQA----DALMLEKGSSFTLNAGDTATDTTVNGGLFTARGGSLAGTTTLNNGA 461 + G +E S A G T GGSL+ G Sbjct: 287 GFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIRVGRGARVTVSGGSLSAPH----GN 342 Query: 462 ILTLSGKTV---NNDTLTIR-EGDALLQGGSLTGNGSVEKSGSGTLTVSNTTLTQKAVNL 517 ++ G L+I + A QG +L E LT++ Q + Sbjct: 343 VIETGGARRFAPQAAPLSITLQAGAHAQGKALLYRVLPEPV---KLTLTGGADAQGDIVA 399 Query: 518 NEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDPTNVTLASGATWNIPDNATVQSV 577 E S DV GA + ATW + DN+ V ++ Sbjct: 400 TELPSIPGTSIGPLDVALASQARWT--------GATRAVDSLSIDNATWVMTDNSNVGAL 451 Query: 578 VDDLSHAGQIHF-TSTRTGKFVPATLKVKNLNGQNGTISLRVRPDMAQNNADRLVIDGGR 636 L+ G + F G+F L V L G +G + V D+ + D+LV+ Sbjct: 452 --RLASDGSVDFQQPAEAGRF--KVLTVNTLAG-SGLFRMNVFADLGLS--DKLVVMQD- 503 Query: 637 ATGKTILNLVNAGNSASGLATSGKGIQVVEAINGATTEEGAFVQGNRLQAGAFNYSLNRD 696 A+G+ L + N+G+ + T + V + A T A ++ G + Y L + Sbjct: 504 ASGQHRLWVRNSGSEPASANTL---LLVQTPLGSAATFTLANK-DGKVDIGTYRYRLAAN 559 Query: 697 SDESWYLRSENAYRAEVP 714 + W L A A P Sbjct: 560 GNGQWSLVGAKAPPAPKP 577
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 616 bits (1590), Expect = 0.0 Identities = 270/887 (30%), Positives = 410/887 (46%), Gaps = 123/887 (13%) Query: 42 SLALSALLPTVAGASTVGGNNPYQTYRDFAENKGQFQAGATNIPIFNNKGELVGHL--DK 99 +L ++ L A+ V + YQ +RDFAENKG+F GATN+ + + + +G + Sbjct: 12 ALTVAYALTPYTEAALVRDDVDYQIFRDFAENKGKFSVGATNVLVKDKNNKDLGTALPNG 71 Query: 100 APMVDFSSVNVSSNPGVATLINPQYIASVKH-NKGYQSVSFG------------------ 140 PM+DFS V + +ATLINPQY+ VKH + G + FG Sbjct: 72 IPMIDFSVV--DVDKRIATLINPQYVVGVKHVSNGVSELHFGNLNGNMNNGNAKAHRDVS 129 Query: 141 DGQNSYHIVDRNEHSSS-----------------DLHTPRLDKLVTEVAPATVTSSST-- 181 +N Y V++NE+ + D + PRLDK VTEVAP +++S+ Sbjct: 130 SEENRYFSVEKNEYPTKLNGKTVTTEDQTQKRREDYYMPRLDKFVTEVAPIEASTASSDA 189 Query: 182 ADILTPSKYSAFYRAGSGSQYIQDSQGKRHWVTGGYGYLTGGILPTSFFYH--------- 232 +KY AF R GSGSQ+I + + + Y Sbjct: 190 GTYNDQNKYPAFVRLGSGSQFIYKKGDNYSLILNNHEVGGNNLKLVGDAYTYGIAGTPYK 249 Query: 233 --GSDGIQLYMGGNIHDHSI---------LPSFGEAGDSGSPLFGWNTAKGQWELVGVYS 281 + + G + +HS L ++ GDSGSPLF ++ KG+W +G Y Sbjct: 250 VNHENNGLIGFGNSKEEHSDPKGILSQDPLTNYAVLGDSGSPLFVYDREKGKWLFLGSYD 309 Query: 282 ---GVGGGTNLIYSLIPQSFLSQIYSEDNDAPVFFNASSGAPLQWKFDSSTGTGSLKQGS 338 G + +++ F + ++D+ + + + + S+ T ++ G Sbjct: 310 FWAGYNKKSWQEWNIYKSQFTKDVLNKDSAGSLIGSK-----TDYSWSSNGKTSTITGGE 364 Query: 339 DEYAMHGQKGSDL-NAGKNLTFLGHNGQIDLENSVTQGAGSLTFTDDYTVT-TSNGSTWT 396 + G D N GK++TF G +G + L N++ QGAG L F DY V TS+ +TW Sbjct: 365 KSLNVDLADGKDKPNHGKSVTFEG-SGTLTLNNNIDQGAGGLFFEGDYEVKGTSDNTTWK 423 Query: 397 GAGIIVDKDASVNWQVNGVKGDNLHKIGEGTLVVQGTGVNEGGLKVGDGTVVLNQQADSS 456 GAG+ V + +V W+V+ + D L KIG+GTL+V+GTG N+G LKVGDGTV+L QQ + S Sbjct: 424 GAGVSVAEGKTVTWKVHNPQYDRLAKIGKGTLIVEGTGDNKGSLKVGDGTVILKQQTNGS 483 Query: 457 GHVQAFSSVNIASGRPTVVLADNQQVNPDNISWGYRGGVLDVNGNDLTFHKLNAADYGAT 516 G AF+SV I SGR T+VL D++QV+P++I +G+RGG LD+NGN LTF + D GA Sbjct: 484 GQ-HAFASVGIVSGRSTLVLNDDKQVDPNSIYFGFRGGRLDLNGNSLTFDHIRNIDDGAR 542 Query: 517 
LGNS-SDKTANITLD---YQTHPADVKV---------NEWSSSNRGTVGSLYIYNNPYTH 563 L N +NIT+ T P + N ++ G LY+ YT Sbjct: 543 LVNHNMTNASNITITGESLITDPNTITPYNIDAPDEDNPYAFRRIKDGGQLYLNLENYT- 601 Query: 564 TVDYFILK--TSSYGWFP-TGQVSNEHWEYVGHDQNSAQALLANRINNK------GYLY- 613 Y+ L+ S+ P SNE+W Y+G + A+ + N INN+ GY Sbjct: 602 ---YYALRKGASTRSELPKNSGESNENWLYMGKTSDEAKRNVMNHINNERMNGFNGYFGE 658 Query: 614 -HGKLLGNINFSNKATPGTTGALVMDGSANMSGTFTQENGRLTIQGHPVIHASTSQSIAN 672 GK GN+N + K ++ G N++G T E G L + G P HA + IA Sbjct: 659 EEGKNNGNLNVTFKGKSE-QNRFLLTGGTNLNGDLTVEKGTLFLSGRPTPHA---RDIAG 714 Query: 673 TVSSLGDNSVLTQPTSFTQDDWENRTFSFGSLVLK-DTDFGLGRN-ATLNTTIQADNSS- 729 S+ D +DDW NR F ++ + + GRN A + + I A N + Sbjct: 715 ISSTKKDPHFAENNEVVVEDDWINRNFKATTMNVTGNASLYSGRNVANITSNITASNKAQ 774 Query: 730 ----VTLGDSRVFIDKKDGQGTAFTLEEGTSVATKDADKSVFNGTVNLDNQS--VLNIND 783 GD+ G T T ++ + A + + G VNL + VL + Sbjct: 775 VHIGYKTGDTVCVRSDYTGYVTC-TTDKLSDKALNSFNPTNLRGNVNLTESANFVLGKAN 833 Query: 784 IFNGGIQANNSTVNISSDS--AILGNS-----TLTSTALNLNKGANA 823 +F NS V ++ +S + GNS L + ++LN N+ Sbjct: 834 LFGTIQSRGNSQVRLTENSHWHLTGNSDVHQLDLANGHIHLNSADNS 880 Score = 52.0 bits (124), Expect = 2e-08 Identities = 64/329 (19%), Positives = 119/329 (36%), Gaps = 54/329 (16%) Query: 760 KDADKSVFNGTVNLDNQSVLNINDIFNGGIQANNSTVNISSDSAILGNSTLTSTALNLNK 819 K +D++ N +++N+ + N F NN +N++ N L + NLN Sbjct: 631 KTSDEAKRNVMNHINNERMNGFNGYFGEEEGKNNGNLNVTFKGKSEQNRFLLTGGTNLNG 690 Query: 820 GANALASQSFVSDGPVNISDATLSLNSRPDEVSHTLLPVYDYAGSW----------NLKG 869 F+S P + ++S + W N+ G Sbjct: 691 DLTVEKGTLFLSGRPTPHARDIAGISSTKKDPHFAENNEVVVEDDWINRNFKATTMNVTG 750 Query: 870 DDARLNVGPYSMLSGNINVQDKGTVTLG--------------GEGELSPDLTLQNQMLYS 915 + + + + ++ NI +K V +G G + D L ++ L S Sbjct: 751 NASLYSGRNVANITSNITASNKAQVHIGYKTGDTVCVRSDYTGYVTCTTD-KLSDKALNS 809 Query: 916 LFN-----------------GYRNTWSGSLNAPDATVSMT-DTQWSMNGNSTAGNMKLNR 957 G N + + ++ V +T ++ W + GNS + L Sbjct: 810 FNPTNLRGNVNLTESANFVLGKANLFGTIQSRGNSQVRLTENSHWHLTGNSDVHQLDLAN 869 Query: 958 
TIVGFNGGTSS-----FTTLTTDNLDAVQSAFVMRTDL--NKADKLVINKSATGHDNSIW 1010 + N +S + TLT ++L +F TDL + DK+V+ KSATG+ Sbjct: 870 GHIHLNSADNSNNVTKYNTLTVNSLSG-NGSFYYLTDLSNKQGDKVVVTKSATGNFTLQV 928 Query: 1011 VNFLKKPSDKDTLDIPLVSAPEATADNLF 1039 + +P+ ++ L A +A D+L Sbjct: 929 ADKTGEPNHN---ELTLFDASKAQRDHLN 954
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 62.6 bits (152), Expect = 5e-12 Identities = 29/247 (11%), Positives = 73/247 (29%), Gaps = 23/247 (9%) Query: 487 TLNLNSLWSKLGTFSISYNDDRRYNSHYYTADYYQSVYSGTFGSLGLRAGIQRYNNGDSS 546 L + + T +S + Y + +Q+ + F + N Sbjct: 530 QLTVTQQLGRTSTLYLSG-SHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQK 588 Query: 547 ANTGKYIALDLSLPLGNWFSAGMTHQNGYTMANLSARKQFDEGT------------IRTV 594 + +AL++++P +W + Q + A+ S + + Sbjct: 589 -GRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNL 647 Query: 595 GANLSRAISGDTGDDKTLSGGAYAQFDARYASGTLNVNSAADGYINTNLTANGSVGWQGK 654 ++ +G + +G A + Y + + S +D +G V Sbjct: 648 SYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGY-SHSDDIKQLYYGVSGGVLAHAN 706 Query: 655 NIAASGRTDGNAGVIFDTGLEN---DGQISAKINGRIFPLNGKRNYLPLSPYGRYEVELQ 711 + + ++ G ++ + Q + + R G + Y V L Sbjct: 707 GVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWR-----GYAVLPYATEYRENRVALD 761 Query: 712 NSKNSLD 718 + + + Sbjct: 762 TNTLADN 768
>INTIMIN#Intimin signature. Length = 939 Score = 547 bits (1410), Expect = e-177 Identities = 232/828 (28%), Positives = 350/828 (42%), Gaps = 70/828 (8%) Query: 41 PVMAARAQHAVQPRLSMENTTVTADNNVEKNVASLAANAGTFLSSQPDS-----DATRNF 95 P++AA +L+ + VT N + + AA L SQ S D ++ Sbjct: 131 PLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYAAQQAASLGSQLQSRSLNGDYAKDT 190 Query: 96 ITGMATAKANQEIQEWLGKYGTARVKLNVDKNFSLKDSSLEMLYPIYDTPTNMLFTQGAI 155 G+A +A+ ++Q WL YGTA V L NF SSL+ L P YD+ + F Q Sbjct: 191 ALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD--GSSLDFLLPFYDSEKMLAFGQVGA 248 Query: 156 HRTDDRTQSNIGFGWRHFSENDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGY 215 D R +N+G G R F + M G N FID D S +TR+G+G EYWRDY K S NGY Sbjct: 249 RYIDSRFTANLGAGQRFF-LPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGY 307 Query: 216 IRASGWKTSPDVEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQ 275 R SGW S + +DY ERPANG+DIR GYLP++P LGA LMYEQYYGD V LF DK Q Sbjct: 308 FRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQ 367 Query: 276 KDPHAITAEVNYTPVPLLTLSAGHKQGKSGENDTRFGLEVNYRIGEPLEKQLDTDSIRER 335 +P A T VNYTP+PL+T+ ++ G END + ++ Y+ +P +Q++ + E Sbjct: 368 SNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQQIEPQYVNEL 427 Query: 336 RMLAGSRYDLVERNNNIVLEYRKSEVIRIALPERIEGKGGQTVSLGLVVSKATHGLKNVQ 395 R L+GSRYDLV+RNNNI+LEY+K +++ + +P I G T + L+V K+ +GL + Sbjct: 428 RTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTERSTQKIQLIV-KSKYGLDRIV 486 Query: 396 WEAPSLLAAGGKITGQG----NQWQVTLPAYQAGKDNYYAISAIAYDNKGNASKRVQTEV 451 W+ +L + GG+I G +Q LPAY G N Y ++A AYD GN+S V + Sbjct: 487 WDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTI 546 Query: 452 VISGAGMSADRTALTLDGQSRIQMLANGNEQKPLVLSLRDAEGQPVTGMKDQIKTELTFK 511 + G D+ +T + A+G E +++ G Sbjct: 547 TVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKK-NGVA--------------- 590 Query: 512 PAGNIVTRSLKVTKSQAKPTLGEFTETEAGVYQSVFTTGTQSGEATITVSVDGMSKTVTA 571 A V S + A + +G + G+ ++ M+ + A Sbjct: 591 QANVPV--SFNIVSGTAVLSANSANTNGSGKATVTLKSDK-PGQVVVSAKTAEMTSALNA 647 Query: 572 ELRATMMDVANSTLSANEPSGDVVADGQQAYTLTLTAVDSEGNPVTGEASRLRLVPQDTN 631 + S 
VA+GQ A T T+ V PV+ + T Sbjct: 648 NAVIFVDQTKASITEIKADKTTAVANGQDAITYTVK-VMKGDKPVSNQEVTF-----TTT 701 Query: 632 GVTVGAIS--EIKPGVYSATVSSTRAGNVVVRAFSEQYQLGTLQQTLKFVAGP------- 682 + + G T++ST G +V A + ++F Sbjct: 702 LGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNI 761 Query: 683 ------LDAAHSSITLNPDK---PVVGGTVTAIWTAKDANDNPVTGLNPDAPSLSGAAAA 733 + ++ L + GG W + + V + +L Sbjct: 762 EIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDA-SSGQVTLKEKGTT 820 Query: 734 GSTASGWTDNGDGTWTAQISLGTTAGELDVMPKLNGQDAAANAAKVTVVADALSSNQSKV 793 + +DN T+T T L V L S+Q+++ Sbjct: 821 TISVIS-SDNQTATYTIA-----TPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNEL 874 Query: 794 -------SVAEDHVKAGESTTVTLVAKDAHGNAISGLSLSASLTGTAS 834 A + S T+ + +A SG++ + L Sbjct: 875 ENVFKAWGAANKYEYYKSSQTIISWVQQTAQDAKSGVASTYDLVKQNP 922 Score = 75.5 bits (185), Expect = 1e-15 Identities = 74/347 (21%), Positives = 115/347 (33%), Gaps = 46/347 (13%) Query: 905 KTTTELTFTVK----DAYGNPVTGLKPDAPVFSGAASTGSERPSAGNWTEKGNGVYVSTL 960 T TVK PV+ + SG A SA + G+G TL Sbjct: 575 TEAITYTATVKKNGVAQANVPVSFN-----IVSGTAV-----LSANSANTNGSGKATVTL 624 Query: 961 TLGSAAGQLSVMPRVNGQNAVAQPLVLNVAGDASKAEIRDMTVKVNNQLANGQSANQITL 1020 + +A+ V+ V D +KA I ++ +ANGQ A IT Sbjct: 625 KSDKPGQVVVSAKTAEMTSALNANAVIFV--DQTKASITEIKADKTTAVANGQDA--ITY 680 Query: 1021 TV-VDSYGNPLQGQEVTLTLPQGVTSKTGNTVTTNAAGKVDIELMSTVAGEHNISASVNG 1079 TV V P+ QEVT T G S + T T+ G + L ST G+ +SA V+ Sbjct: 681 TVKVMKGDKPVSNQEVTFTTTLGKLSNS--TEKTDTNGYAKVTLTSTTPGKSLVSARVSD 738 Query: 1080 AQ---KTVTVKFNADASTGQANLQVDTAVQKVANGKDAFTLTATVK-DQYGNLLPGAVVV 1135 K V+F + N+++ V G T ++ Q G Sbjct: 739 VAVDVKAPEVEFFTTLTIDDGNIEI------VGTGVKGKLPTVWLQYGQVNLKASGGNGK 792 Query: 1136 FNLPRGVKPLADGNIMVNADKEGKAELKVVSVTAGTYEITASAGNDQPSNAQSVTFVADK 1195 + A+ I G+ LK GT I+ + ++Q T+ Sbjct: 793 YTW-----RSANPAIASVDASSGQVTLK----EKGTTTISVISSDNQT-----ATYTIAT 838 Query: 1196 TTATISSIEVIGNRAVADGKTKQTYKVTVTDANNNLLKDSEVTLTAS 1242 + I + D ++ N L++ A+ Sbjct: 839 PNSLI-VPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAA 884 Score = 52.0 bits (124), 
Expect = 1e-08 Identities = 59/368 (16%), Positives = 106/368 (28%), Gaps = 56/368 (15%) Query: 779 VTVVADALSSNQSKV---SVAEDHVKAGESTTVTLVA------KDAHGNAISGLSLSASL 829 +TV+++ +Q V + + KA + +T A +S +S Sbjct: 546 ITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVSG-- 603 Query: 830 TGTASEGATVSSWTEKGDGSYVAT--LTTGGKTGELRVMPLFNGQPAATEAAQLTVIAGE 887 A +S+ + +GS AT L + + A A + + Sbjct: 604 ------TAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALN--ANAVIFVDQ 655 Query: 888 MSSANSTLVADNKTPTVKTTTELTFTVKDAY-GNPVTGLKPDAPVFSGAASTGSERPSAG 946 ++ + + AD T +T+TVK PV+ + +T + S Sbjct: 656 TKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEV-------TFTTTLGKLSNS 708 Query: 947 NWTEKGNGVYVSTLTLGSAAGQLSVMPRVNGQN-AVAQPLVLNVAG---DASKAEIRDMT 1002 NG TLT + G+ V RV+ V P V D EI Sbjct: 709 TEKTDTNGYAKVTLT-STTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEI---- 763 Query: 1003 VKVNNQLANGQSANQITLTV-VDSYGNPLQGQEVTLTLPQGVTSKTGNTVTTNAAGKVDI 1061 + G T+ + G T ++ ++G+V + Sbjct: 764 ------VGTGVKGKLPTVWLQYGQVNLKASGGNGKYTW---RSANPAIASVDASSGQVTL 814 Query: 1062 ELMSTVAGEHNISASVNGAQKTVTVKFNADASTGQANLQVDTAVQKVANGKDAFTLTATV 1121 + G IS + Q T + + V + + Sbjct: 815 K----EKGTTTISVISSDNQ---TATYTIATPNSLIVPNMSKRVT-YNDAVNTCKNFGGK 866 Query: 1122 KDQYGNLL 1129 N L Sbjct: 867 LPSSQNEL 874 Score = 50.5 bits (120), Expect = 5e-08 Identities = 45/248 (18%), Positives = 79/248 (31%), Gaps = 23/248 (9%) Query: 1168 TAGTYEITASA----GNDQPSNAQSVTFVADKTTAT---ISSIEVIGNRAVADGKTKQTY 1220 + Y++TA A GN + ++T +++ ++ A ADG TY Sbjct: 521 GSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITY 580 Query: 1221 KVTVTDANNNLLKDSEVTLTASPENLVLTPNGTATTNEQGQAIFTATTTVAATYTLTAKV 1280 TV S + +A TN G+A T + ++AK Sbjct: 581 TATVKKNGVAQANVPVSFNIVS--GTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAK- 637 Query: 1281 EQADGQESTKTAESKFVADDKNAELAATSDVHSLVADGVTTATLTVTLFSANNPVGGTMW 1340 A+ + FV K + +D + VA+G T TV + + PV Sbjct: 638 -TAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEV 696 Query: 1341 VDIEA--PEGVTEADYQFLPSKNDHFASGKITRTFSTNKPGTYTFTFNSLTYGGYEMKPV 1398 + K D +G T ++ PG + ++ 
++K Sbjct: 697 TFTTTLGKLSNSTE-------KTD--TNGYAKVTLTSTTPGKSLVS-ARVSDVAVDVKAP 746 Query: 1399 TVTINAVP 1406 V Sbjct: 747 EVEFFTTL 754
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.4 bits (63), Expect = 0.025 Identities = 12/42 (28%), Positives = 19/42 (45%) Query: 3 RQKILQQLLEWIECNLEHPISIEDIAQKSGYSRRNIQLLFRN 44 RQ IL L S+ +IA+ +G +R I F++ Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKD 54
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 120 bits (302), Expect = 4e-30 Identities = 149/738 (20%), Positives = 245/738 (33%), Gaps = 109/738 (14%) Query: 104 INATGSTITAQGEGTYVRTAMVIDSTGDVVVNGGNFVTKNEKGSATGISLEGARGNNVTL 163 N T + V A + G + G +G+ + R + Sbjct: 206 TNVTAVPASGAPAAVSVLGASELTLDGGHITGGRAAGVAAMQGAVVHLQRATIRRGDAPA 265 Query: 164 NGTT--INAQGNKSSSNASTAIFAQKGSLLQGFDGDATDNITLA---------GSNIING 212 G G F G D + + LA G+ I G Sbjct: 266 GGAVPGGAVPGGAVPGGFGPGGFGPVLDGWYGVDVSGSS-VELAQSIVEAPELGAAIRVG 324 Query: 213 RIEAIVIAGNNTGTHTVNLNIKDGSVI---GAANNKQTIYASASAQGAGSATQNLNLSVA 269 R + ++G + N+ G+ AA T+ A A AQG + L V Sbjct: 325 RGARVTVSGGSLSAPHGNVIETGGARRFAPQAAPLSITLQAGAHAQGKALLYRVLPEPVK 384 Query: 270 DSTIYSDVLALSSSNSSVGTTTNVNMNVARSYWEGNAYTFNSGDKAGSDLDINLSDSSVW 329 L L+ + G + G G LD+ L+ + W Sbjct: 385 --------LTLTGGADAQGDIVATELPSI------------PGTSIGP-LDVALASQARW 423 Query: 330 KGKVSGAGDASVSLQNGSVWNVTGSSTVDALAVKDSTVNITKATVNTGTFA-------SQ 382 G S+S+ N W +T +S V AL + + G F + Sbjct: 424 TGATRAVD--SLSIDNA-TWVMTDNSNVGALRLASDGSVDFQQPAEAGRFKVLTVNTLAG 480 Query: 383 NGTLI----VDASSENTLDISGKASGDLRVY---------SAGSLDLINEQ----TAFIS 425 +G D + L + ASG R++ SA +L L+ F Sbjct: 481 SGLFRMNVFADLGLSDKLVVMQDASGQHRLWVRNSGSEPASANTLLLVQTPLGSAATFTL 540 Query: 426 TGKDSTLKATGTTEGGLYQYDLTQGADGNFYFVKNTHK---------------------- 463 KD G + G Y+Y L +G + V Sbjct: 541 ANKD------GKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPE 594 Query: 464 ------------ASNASSVIQAMA-AAPANVANLQADTLSARQDAVRLSENDKGGVWIQY 510 ++ A++ + + + +++ LS R +RL+ D GG W + Sbjct: 595 APAPQPPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRLNP-DAGGAWGRG 653 Query: 511 FGGKQKHTTAGNASYDLDVNGVMLGGDTRFMTEDGSWLAGVAMSSAKGDMT-TMQSKGDT 569 F +Q+ +D V G LG D G W G +GD T G T Sbjct: 654 FAQRQQLDNRAGRRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHT 713 Query: 570 EGYSFHAYLSRQYNNGIFIDTAAQFGHYSNTADVRLMNGGGTIKADFNTNGFGAMVKGGY 629 + Y + ++G ++D + N V +G +K + T+G GA ++ G Sbjct: 714 
DSVHVGGYATYIADSGFYLDATLRASRLENDFKVAGSDGY-AVKGKYRTHGVGASLEAGR 772 Query: 630 TWKDGNGLFIQPYAKLSALTLEGVDYQL-NGVDVHSDSYNSVLGEAGTRVGYDFAVGNA- 687 + +G F++P A+L+ G Y+ NG+ V + +SVLG G VG + Sbjct: 773 RFTHADGWFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGR 832 Query: 688 TVKPYLNLAALNEFSDGNKVRLGDESVNASIDGAAFRVGAGVQADITKNMGAYASLDYTK 747 V+PY+ + L EF V + + G +G G+ A + + YAS +Y+K Sbjct: 833 QVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSK 892 Query: 748 GDDIENPLQGVVGINVTW 765 G + P G +W Sbjct: 893 GPKLAMPWTFHAGYRYSW 910
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 62.7 bits (152), Expect = 2e-14 Identities = 31/172 (18%), Positives = 58/172 (33%), Gaps = 15/172 (8%) Query: 16 RRRQLIDATLEAINEVGMHDATIAQIARRAGVSTGIISHYFRDKNGLLEATMRDITSQLR 75 R+ ++D L ++ G+ ++ +IA+ AGV+ G I +F+DK+ L S + Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71 Query: 76 DAVLNRLHALPQGSAELRLQAIVGGNFDETQVSSAAMKAWLAFWASSMHQP-------ML 128 + L P L+ I+ + T V+ + + Sbjct: 72 ELELEYQAKFPGDPLS-VLREILIHVLEST-VTEERRRLLMEIIFHKCEFVGEMAVVQQA 129 Query: 129 YRLQQVSSRRLLSNLVSEFRRE---LPRQQAQEAGYGLAALIDGL---WLRA 174 R + S + + + A + I GL WL A Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 30.6 bits (69), Expect = 0.009 Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 2/50 (4%) Query: 6 TEENLLAFTTAARFGSFSKAAEELGLTTSAISYTIKRMETGLDVVLFTRS 55 E L+ A G+ KAA+ LGL + + I+ + G+ V +RS Sbjct: 436 MEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL--GVSVYRSSRS 483
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 431 bits (1110), Expect = e-155 Identities = 140/315 (44%), Positives = 201/315 (63%), Gaps = 3/315 (0%) Query: 1 MKELVVVAIGGNSIIKDNASQSIEHQAEAVKAVADTVLEMLASDYDIVLTHGNGPQVGLD 60 M + VV+A+GGN++ + S E + V+ A + E++A Y++V+THGNGPQVG Sbjct: 1 MGKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSL 60 Query: 61 LRRAEIAHEREGLPLTPLANCVADTQGGIGYLIQQALNNRLARHG-EKKAVTVVTQVEVD 119 L + G+P P+ A +QG IGY+IQQAL N L + G EKK VT++TQ VD Sbjct: 61 LLHMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVD 120 Query: 120 KNDPGFAHPTKPIGEFFSESQRDELQKANPDWRFVEDAGRGYRRVVASPEPKRIVEAPAI 179 KNDP F +PTKP+G F+ E L + W ED+GRG+RRVV SP+PK VEA I Sbjct: 121 KNDPAFQNPTKPVGPFYDEETAKRLAR-EKGWIVKEDSGRGWRRVVPSPDPKGHVEAETI 179 Query: 180 KALIQQGFVVIGAGGGGIPVVRTEAGDYQSVDAVIDKDLSTALLAREIHADILVITTGVE 239 K L+++G +VI +GGGG+PV+ E G+ + V+AVIDKDL+ LA E++ADI +I T V Sbjct: 180 KKLVERGVIVIASGGGGVPVIL-EDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVN 238 Query: 240 KVCIHFGKPQQQALDRVDIATMTRYMQEGHFPPGSMLPKIIASLTFLEQGGKEVIITTPE 299 +++G ++Q L V + + +Y +EGHF GSM PK++A++ F+E GG+ II E Sbjct: 239 GAALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLE 298 Query: 300 CLPAALRGETGTHII 314 AL G+TGT ++ Sbjct: 299 KAVEALEGKTGTQVL 313
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 338 bits (869), Expect = e-113 Identities = 121/401 (30%), Positives = 200/401 (49%), Gaps = 54/401 (13%) Query: 164 DLAEEAGMTGIFIYSAATVRQAFSDALDMTRMSLRHNTHDATRNALRTRYVLGDMLGQSP 223 A +A G + Y ++ + + +L ++ ++G+S Sbjct: 88 MTAIKASEKGAYDYLPKPFDL--TELIGIIGRALAEPKRRPSK-LEDDSQDGMPLVGRSA 144 Query: 224 QMEQVRQTILLYARSSAAVLIEGETGTGKELAAQAIHREYFARHDARQGKKSHPFVAVNC 283 M+++ + + ++ ++I GE+GTGKEL A+A+H + R+ PFVA+N Sbjct: 145 AMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD-----YGKRRNG---PFVAINM 196 Query: 284 GAIAESLLEAELFGYEEGAFTGSRRGGRAGLFEIAHGGTLFLDEIGEMPLPLQTRLLRVL 343 AI L+E+ELFG+E+GAFTG++ G FE A GGTLFLDEIG+MP+ QTRLLRVL Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTR-STGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVL 255 Query: 344 EEKEVTRVGGHQPVPVDVRVISATHCNLEEDMQQGQFRRDLFYRLSILRLQLPPLRERVA 403 ++ E T VGG P+ DVR+++AT+ +L++ + QG FR DL+YRL+++ L+LPPLR+R Sbjct: 256 QQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAE 315 Query: 404 DILPLAESFLKVSLAALSAPFAAALRQGLQASETMLVHYDWPGNIRELRNMMERLALFLS 463 DI L F++ ++ L+ + + WPGN+REL N++ RL Sbjct: 316 DIPDLVRHFVQ-QAEKEGLDVKRFDQEALEL----MKAHPWPGNVRELENLVRRLTALYP 370 Query: 464 VES-TPDLTPQFLQ-----------------LLLPELARESAKTPIPGLLTA-------- 497 + T ++ L+ L + + E+ + A Sbjct: 371 QDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYD 430 Query: 498 -----------QQALEKFNGDKTAAANYLGISRTTFWRRLK 527 AL G++ AA+ LG++R T ++++ Sbjct: 431 RVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIR 471
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 29.8 bits (67), Expect = 0.023 Identities = 11/33 (33%), Positives = 19/33 (57%), Gaps = 1/33 (3%) Query: 65 LIHGKLPTRDE-LAAYKTKLKALRGLPANVRTV 96 + +LPT +E AYK ++ + G P +RT+ Sbjct: 303 MDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTL 335
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 33.3 bits (76), Expect = 0.003 Identities = 21/96 (21%), Positives = 42/96 (43%), Gaps = 4/96 (4%) Query: 92 AAVGVVQQLRTDVMDAA--LRQPLSEFDTQ-PVGQVISRVTNDTEVIRDLYVTVVATVLR 148 A +G+ + +D A ++ L+E P G + + T ++ VV T+ Sbjct: 287 AGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFE 346 Query: 149 SAALVGAMLVAMFSLDWRMALVAIMIFPVVLVVMVI 184 + LV +++ +F + R L+ + PVVL+ Sbjct: 347 AIMLV-FLVMYLFLQNMRATLIPTIAVPVVLLGTFA 381
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 31.0 bits (70), Expect = 0.013 Identities = 31/137 (22%), Positives = 54/137 (39%), Gaps = 24/137 (17%) Query: 245 IWLPLGLVIGLLAAMFVLRILRRIQSPHHRLQDAIENRDICVHYQPIVSLANGKIVGAEA 304 W+ L L+ G +A +LR R+ + + P++ G+I Sbjct: 228 PWMLLALLAGFMAFRVMLR------QEKRRVS-----FHRRLLHLPLI----GRIARGLN 272 Query: 305 LARWPQTDGSWLSPDSFIPLAQQTGLS-EPLTLLIIRSVFEDMGDWLRQHSQQHISINLE 363 AR+ +T + S +PL Q +S + ++ R D +R+ H + LE Sbjct: 273 TARYARTLSILNA--SAVPLLQAMRISGDVMSNDYARHRLSLATDAVREGVSLHKA--LE 328 Query: 364 STVLTSEKIPQLLREMI 380 T L P ++R MI Sbjct: 329 QTAL----FPPMMRHMI 341
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1367 bits (3539), Expect = 0.0 Identities = 802/1033 (77%), Positives = 916/1033 (88%), Gaps = 1/1033 (0%) Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDT 60 M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA+YPGADA+TVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120 VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDAISRTSGVGDVQLFGS 180 EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240 QYAMRIW++ + LNK++LTPVDVI +K QN Q+AAGQLGGTP + GQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANAL 300 + EEFGK+ L+VN DGS V L+DVA++ELGGENY++IA NG+PA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTAAAIRAELAKMEPFFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360 DTA AI+A+LA+++PFFP G+K++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL 480 E+ LPPKEAT KSM QIQGALVGIAMVLSAVF+PMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWFNRMFEKSTHHYTDSVGGILRSTGR 540 SVLVALILTPALCAT+LKP++ H E K GFFGWFN F+ S 
+HYT+SVG IL STGR Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 541 YLVLYLIIVVGMAYLFVRLPSSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTNYYLT 600 YL++Y +IV GM LF+RLPSSFLP+EDQGVF+TM+QLPAGATQERTQKVL++VT+YYL Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 601 KEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKNWADRPGEENKVEAITMRATRAFSQIKD 660 EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHPDMLTSVRPNG 720 V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQLL AA+HP L SVRPNG Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719 Query: 721 LEDTPQFKIDIDQEKAQALGVSINDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780 LEDT QFK+++DQEKAQALGVS++DIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779 Query: 781 MLPDDIGDWYVRAADGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840 MLP+D+ YVR+A+G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839 Query: 841 MELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAISLIVVFLCLAALYESWSIPFS 900 M LME LASKLP G+GYDWTGMSYQERLSGNQAP+L AIS +VVFLCLAALYESWSIP S Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960 VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+ Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 961 IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIF 1020 +EATL AVRMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGMV+AT+LAIF Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1021 FVPVFFVVVRRRF 1033 FVPVFFVV+RR F Sbjct: 1020 FVPVFFVVIRRCF 1032
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 37.9 bits (88), Expect = 4e-05 Identities = 29/173 (16%), Positives = 58/173 (33%), Gaps = 22/173 (12%) Query: 17 KQEYDQ-ALADAQQANAAVTAAKAAVETARINLAYTKVTSPISGRIGKSNV-TEGALVQN 74 Q + L +Q + + + + +P+S ++ + V TEG +V Sbjct: 293 TQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTT 352 Query: 75 GQATALATVQQLDPIYVDVTQSSNDFLRLKQELA----------NGTLKQENGKAKVSLI 124 + T + V + D + V + D + KV I Sbjct: 353 AE-TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLV---GKVKNI 408 Query: 125 TSDGIKFPQDGTLEFSDVTVDQTTGSITLRAIFPNPDHTLLPGMFVRARLEEG 177 D I+ + G + +++++ S + I L GM V A ++ G Sbjct: 409 NLDAIEDQRLGLVFNVIISIEENCLSTGNKNIP------LSSGMAVTAEIKTG 455
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 202 bits (514), Expect = 2e-68 Identities = 196/196 (100%), Positives = 196/196 (100%) Query: 2 ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQA 61 ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQA Sbjct: 20 ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQA 79 Query: 62 KFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYD 121 KFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYD Sbjct: 80 KFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYD 139 Query: 122 RIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL 181 RIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL Sbjct: 140 RIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL 199 Query: 182 EMYLLCPTLRNPATNE 197 EMYLLCPTLRNPATNE Sbjct: 200 EMYLLCPTLRNPATNE 215
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.7 bits (72), Expect = 0.017 Identities = 19/125 (15%), Positives = 40/125 (32%), Gaps = 6/125 (4%) Query: 28 QNTAFARASSNGDLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDKIDRVKEE 87 N RA L + + L L+ + A L++ ++ E Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSR--LDDFSSLLHKQAIAKHAVLEQENKYVEA 264 Query: 88 TVQLRQKVAEAPEKMRQATAALTALSDVDND--EETRKIL--STLSLRQLETRVAQALDD 143 +LR ++ + + +A V E L +T ++ L +A+ + Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324 Query: 144 LQNAQ 148 Q + Sbjct: 325 QQASV 329
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 38.5 bits (89), Expect = 9e-05 Identities = 40/251 (15%), Positives = 78/251 (31%), Gaps = 31/251 (12%) Query: 404 PLPETTSQVLAARQQLLRVQGATKAKKSEPAA----ATRARPVNNAALERLASVTDRVQA 459 P E +Q + + + P+ AR + A + A T Sbjct: 983 PEVEKRNQTVDTTN----ITTPNNIQADVPSVPSNNEEIARV-DEAPVPPPAPATPSETT 1037 Query: 460 RPVPSALEKAPAKKEAYRWKATTPVMQQKE--------VVATPKALKKA---LEHEKTPE 508 V ++ E AT Q +E V A + + A E ++T Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097 Query: 509 LAAKLAA---------EAIERDPWAAQVSQLSLPKLVEQVALNAWKE-ESDNAVCLHLRS 558 K A E+ +V+ PK + + E +N ++++ Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157 Query: 559 SQRHLNNRGAQQKLAEALST-LKGSTVELTIVEDDNPAVRTPLEWRQAIYEEKLAQARES 617 Q N ++ A+ S+ ++ E T V N V P A + + + Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217 Query: 618 IIADNNIQTLR 628 + + +++R Sbjct: 1218 KPKNRHRRSVR 1228
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 77.8 bits (191), Expect = 4e-19 Identities = 49/212 (23%), Positives = 81/212 (38%), Gaps = 7/212 (3%) Query: 3 KSVLITGCSSGIGLESALELKRQGFHVLAGCRKPDDVERMNS----MGFT--GVLIDLDS 56 K ITG + GIG A L QG H+ A P+ +E++ S D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 57 PESVDRAADEVIALTDNCLYGIFNNAGFGMYGPLSTISRAQMEQQFSANFFGAHQLTMRL 116 ++D + + + N AG G + ++S + E FS N G + + Sbjct: 69 SAAIDEITARIEREMGP-IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 117 LPAMLPHGEGRIVMTSSVMGLISTPGRGAYAASKYALEAWSDALRMELRHSGIKVSLIEP 176 M+ G IV S + AYA+SK A ++ L +EL I+ +++ P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 177 GPIRTRFTDNVNQTQSDKPVENPGIAARFTLG 208 G T ++ ++ G F G Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTG 219
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 5e-04 Identities = 14/30 (46%), Positives = 16/30 (53%) Query: 41 LVGESGSGKSTLLAILAGLDDGSSGEVRLG 70 L G G GKSTL+ L GLD S +G Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630
>PF09025#YopR Core Length = 143 Score = 28.1 bits (62), Expect = 0.020 Identities = 17/61 (27%), Positives = 25/61 (40%), Gaps = 8/61 (13%) Query: 126 EAVLIGQLECKSMVRMCAPLGSR--------LPLHASGAGKALLYPLAEEELMSIILQTG 177 + + +LE K+M+R PLG + L G L LA EL +I G Sbjct: 68 QGLEADRLELKAMLRAELPLGRQQQTFLLQLLGAVEHAPGGEYLAQLARRELQVLIPLNG 127 Query: 178 L 178 + Sbjct: 128 M 128
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 56.3 bits (136), Expect = 1e-10 Identities = 39/163 (23%), Positives = 60/163 (36%), Gaps = 32/163 (19%) Query: 4 DLIIKNGTVILENEARVVDIAVKDGKIAAIG-------QD-----LGDAKDVMDASGLVV 51 D +I N ++ DI +KDG+IAAIG Q +G +V+ G +V Sbjct: 69 DTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIV 128 Query: 52 SPGMVDAHTHISEPGRSHWEGYETGTRAAAKGGITTMIEMPLNQLPATVDRAS------- 104 + G +D+H H P + A G+T M+ PA A+ Sbjct: 129 TAGGMDSHIHFICPQQIE---------EALMSGLTCMLGGGTG--PAHGTLATTCTPGPW 177 Query: 105 -IELKFDAAKGKLTIDAAQLGGLVSYNIDRLHELDEVGVVGFK 146 I +AA ++ A G + L E+ G K Sbjct: 178 HIARMIEAADA-FPMNLAFAGKGNASLPGALVEMVLGGATSLK 219
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 383 bits (984), Expect = e-136 Identities = 124/310 (40%), Positives = 176/310 (56%), Gaps = 16/310 (5%) Query: 2 KTLVVALGGNALLQRGEALTAKNQYRNIASAVPALARL-ARSYRLAIVHGNGPQVGLLAL 60 K +V+ALGGNAL QRG+ + + N+ +A + AR Y + I HGNGPQVG L L Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62 Query: 61 QNLAWKE---VEPYPLDILVAESQGMIGYMLAQSLSAQPQM----PPVTTVLTRIEVSPD 113 A + + P+D+ A SQG IGYM+ Q+L + + V T++T+ V + Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKN 122 Query: 114 DPAFLQPEKFIGPVYQPEEQEALEAAYGWQMKRD-GKYLRRVVASPQPRKILDSEAIELL 172 DPAF P K +GP Y E + L GW +K D G+ RRVV SP P+ +++E I+ L Sbjct: 123 DPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKL 182 Query: 173 LKEGHVVICSGGGGVPVAEDG---AGSEAVIDKDLAAALLAEQINADGLVILTDADAVYE 229 ++ G +VI SGGGGVPV + G EAVIDKDLA LAE++NAD +ILTD + Sbjct: 183 VERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAAL 242 Query: 230 NWGTPQQRAIRHATPDELAPFAKAD----GSMGPKVTAVSGYVRSRGKPAWIGALSRIEE 285 +GT +++ +R +EL + + GSMGPKV A ++ G+ A I L + E Sbjct: 243 YYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVE 302 Query: 286 TLAGEAGTCI 295 L G+ GT + Sbjct: 303 ALEGKTGTQV 312
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.4 bits (66), Expect = 0.029 Identities = 16/150 (10%), Positives = 44/150 (29%), Gaps = 8/150 (5%) Query: 299 RSQLNYSEENLKQARAALERLYTALRGTDKTVAPAGGEAFEARFIEAMDDDFNTP----- 353 + ++ +L QAR R R + P E F +++ Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192 Query: 354 EAYSVLFDMAREVNRLKAEDMAAANAMASHLRKLSAVLGLLEQEPEAFLQSGAQADDSEV 413 E +S + + + A + + + + + + + + F + Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS---LLHKQAI 249 Query: 414 AEIEALIQQRLDARKAKDWAAADAARDRLN 443 A+ L Q+ + + +++ Sbjct: 250 AKHAVLEQENKYVEAVNELRVYKSQLEQIE 279
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 27.0 bits (59), Expect = 0.015 Identities = 16/65 (24%), Positives = 28/65 (43%), Gaps = 8/65 (12%) Query: 41 IPSGVPLSVLQEMGG---WESREMVRRYAHLAPNHLTEHARKIDDIFGDNVPLWN-YRRN 96 IP V L + E+GG + ++V H L+E + + G+ VP + + Sbjct: 89 IPKDV-LEIYSELGGEIYFTDIDLVE---HKELQDLSEEEKNSMNSRGEKVPFASRFVFE 144 Query: 97 KEGVT 101 K+ T Sbjct: 145 KKRET 149
>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature. Length = 52 Score = 63.7 bits (155), Expect = 4e-18 Identities = 18/52 (34%), Positives = 28/52 (53%) Query: 11 INMLTKYALVAVIVLCLTVLGFTLLVGDSLCEFTVKERNIEFKAVLAYEPKK 62 + + + V+++CLT+L FT L SLCE ++ E A +AYE K Sbjct: 1 MKLPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYESGK 52
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 266 bits (681), Expect = 3e-93 Identities = 109/184 (59%), Positives = 132/184 (71%), Gaps = 1/184 (0%) Query: 4 MKTTHTSLPFAGHTLHFVEFDPASFREQDLLWLPHYAQLQHAGRKRKTEHLAGRIAAIYA 63 M T+H LPFAGH LH V+FD +SFRE DLLWLPH+ +L+ AGRKRK EHLAGRIAA++A Sbjct: 1 MLTSHFPLPFAGHRLHIVDFDASSFREHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA 60 Query: 64 LREYGYKCVPAIGELRQPVWPAGVYGSISHCGTTALAVVSRQPIGIDIEEIFSAQTAREL 123 LRE G + VP +G+ RQP+WP G++GSISHC TTALAV+SRQ IGIDIE+I S TA EL Sbjct: 61 LREVGVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATEL 120 Query: 124 TDNIITPAEHKRLADCGLAFPLALTLAFSAKESAFKA-SEIQAAQGFLDYQIISWNKQQI 182 +II E + L L FPLALTLAFSAKES +KA S+ GF ++ S I Sbjct: 121 APSIIDSDERQILQASLLPFPLALTLAFSAKESVYKAFSDRVTLPGFNSAKVTSLTATHI 180 Query: 183 IIRL 186 + L Sbjct: 181 SLHL 184
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.0 bits (83), Expect = 2e-04 Identities = 82/394 (20%), Positives = 146/394 (37%), Gaps = 38/394 (9%) Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQV---GLSVTLTGGAMFVGLMVGGVLADRYERKK 83 + V +GL+ +P ++ + HS G+ + L F V G L+DR+ R+ Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74 Query: 84 VILLARGTCGIGFIGLCLNALL--PEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGR 141 V+L + G ++ + P L +Y+ + G + G A A + Sbjct: 75 VLL-------VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126 Query: 142 ENLMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPP 201 + + G V P++GGL+ GG + + AA L L LP Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLM---GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183 Query: 202 PPQPREHPLK----SLLAGFRFLLASPLVGGIALLGGLLTMAS----AVRVLYPALADNW 253 + PL+ + LA FR+ +V + + ++ + A+ V++ D + Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG--EDRF 241 Query: 254 QMSAAQIGFLYAAIP-LGAAIGALTSGKLAHSVRPGLLMLLSTLG---AFLAIGLFGLMP 309 A IG AA L + A+ +G +A + ++L + ++ + Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 310 MWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGG 369 M +V LA G ML Q E G++ G A +G L Sbjct: 302 MAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357 Query: 370 LGAMMTPVASASASGFGLLIIGVLLLLVLVELRR 403 + A + + +G+ + L LL L LRR Sbjct: 358 IYA----ASITTWNGWAWIAGAALYLLCLPALRR 387
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 64.2 bits (156), Expect = 7e-14 Identities = 60/280 (21%), Positives = 100/280 (35%), Gaps = 35/280 (12%) Query: 40 HTLESQPQRIVSTSVTLTGSLLAIDAPVIASGATTPNNRVADDQGFLRQWSKVAKERKLQ 99 H P RIV+ LLA+ VAD + R W E L Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINY-RLW---VSEPPLP 75 Query: 100 RLYIG-----EPSAEAVATQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKS--- 151 I EP+ E + P ++ SA G S + L+ IAP N+ D Sbjct: 76 DSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPL 131 Query: 152 --WQSLLTQLGEITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLW 209 + LT++ ++ + A +AQ++ + + K + + ++ Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191 Query: 210 TPESAQGQMLEQLGFTLAKLPAGLNASQSQGKRHDIIQLGGENLAAGLNGESLFLFAGDQ 269 P S ++L++ G NA Q + + + LAA + + L + Sbjct: 192 GPNSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNS 243 Query: 270 KDADAIYANPLLAHLPAVQNKQVYALGTETFRLDYYSAMQ 309 KD DA+ A PL +P V+ + + F SAM Sbjct: 244 KDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMH 283
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 439 bits (1130), Expect = e-159 Identities = 145/299 (48%), Positives = 194/299 (64%), Gaps = 18/299 (6%) Query: 1 MAIPRLQAYALPESHDIPHNKVDWAFEPQRAALLIHDMQDYFVSFWGENCPMMEQVIANI 60 MAIP +Q Y +P + D+P NKV W +P RA LLIHDMQ+YFV + + ++ ANI Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60 Query: 61 AALRDYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTV 120 L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P+ DD V Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120 Query: 121 LVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180 L KWRYSAF R+ L +M+++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180 Query: 181 FSRDEHLMSLKYVAGRSGRVVMTEELL------PAPIPASKA-----------ALREVIL 223 FS ++H M+L+Y AGR VMT+ LL PA + + A +R+ I Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240 Query: 224 PLLDESDEPFDDD-NLIDYGLDSVRMMALAARWRKVYGDIDFVMLAKNPTIDAWWKLLS 281 LL E+ E D +L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W KLL+ Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 364 bits (936), Expect = e-131 Identities = 110/258 (42%), Positives = 150/258 (58%), Gaps = 20/258 (7%) Query: 5 GKNVWVTGAGKGIGYATALAFVEAGAKVTGFD---------------QAFTQEQYPFATE 49 GK ++TGA +GIG A A GA + D +A E +P Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63 Query: 50 VMDVADAAQVAQVCQRLLAETERLDVLVNAAGILRMGATDQLSKEDWQQTFAVNVGGAFN 109 DV D+A + ++ R+ E +D+LVN AG+LR G LS E+W+ TF+VN G FN Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122 Query: 110 LFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGLELAGSGVRC 169 + +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 170 NVVSPGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDL 229 N+VSPGST+TDMQ +LW ++ EQ I+G E FK GIPL K+A+P +IA+ +LFL S Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 230 ASHITLQDIVVDGGSTLG 247 A HIT+ ++ VDGG+TLG Sbjct: 243 AGHITMHNLCVDGGATLG 260
>BCTLIPOCALIN#Bacterial lipocalin signature. Length = 171 Score = 28.8 bits (64), Expect = 0.015 Identities = 18/98 (18%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 30 QGITIIKTFDAPGGMKGYLGKYQDMGVTIYLTPDGKHAISG--YMYNEKGENLSNTLIEK 87 + + + F+ YLGK+ ++ + G ++ + N+ G ++ N Sbjct: 21 ESVKPVSDFEL----NNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLN----- 71 Query: 88 EIYAPAGREMWQRMEQSHWLLDGKKDAPVIVYVFADPF 125 Y+ + W+ E + ++G D + V F PF Sbjct: 72 RGYSEE-KGEWKEAEGKAYFVNGSTDGYLKVSFFG-PF 107
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 58.2 bits (140), Expect = 4e-11 Identities = 31/194 (15%), Positives = 72/194 (37%), Gaps = 14/194 (7%) Query: 114 QEQKNQAEEAAKQAELKQKQAEEAAAKAAADAKAKAE----------ADAKAAEEAAK-- 161 E++NQ + QA+ + + + A+ + ++ E A+ Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENS 1044 Query: 162 KAAADAKKKAEAEAAKAAVEAQKKAEAAAAALKKKAEAAEAA--AAEARKKAATEAAEKA 219 K + +K E +A + + ++ A+ A + +K + E A +E ++ TE E A Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETA 1104 Query: 220 KAEAEKKAAAEKAAADKKAAAEKAAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAK 279 E E+KA E + + K+ + +A ++ + ++ Sbjct: 1105 TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164 Query: 280 AAAEKAAAAKAAAE 293 A + A + ++ Sbjct: 1165 TADTEQPAKETSSN 1178 Score = 57.4 bits (138), Expect = 8e-11 Identities = 31/236 (13%), Positives = 86/236 (36%), Gaps = 11/236 (4%) Query: 68 QSQESSAKRSDEQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKNQAEEAAKQA 127 Q+ S ++E+ ++ +E ++ + +KN E+ A + Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN--EQDATET 1061 Query: 128 ELKQKQ-AEEAAAKAAAD------AKAKAEADAKAAEEAAKKAAADAKKKAEAEAAKAAV 180 + ++ A+EA + A+ A++ +E E + A + ++KA+ E K Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQE 1121 Query: 181 EAQKKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAA 240 + ++ + ++++E + A AR+ T ++ +++ A E+ A + + Sbjct: 1122 VPKVTSQVSPK--QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179 Query: 241 EKAAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKAAAEADD 296 E+ + + + A ++ K + ++ + Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVE 1235 Score = 55.1 bits (132), Expect = 4e-10 Identities = 33/265 (12%), Positives = 83/265 (31%), Gaps = 14/265 (5%) Query: 51 DAVMVDSGAVVEQYKRMQSQESSAKRSDEQRKMKEQQAAE-ELREKQAAEQER------L 103 D V A + ++ ++K+ + + EQ A E + ++ A++ + Sbjct: 1021 DEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANT 1080 Query: 104 
KQLEKERLAAQEQKNQAEEAAKQAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAKKA 163 + E + E K K+ +K+ KA + + E ++ + K+ Sbjct: 1081 QTNEVAQSG-SETKETQTTETKETATVEKE-----EKAKVETEKTQEVPKVTSQVSPKQE 1134 Query: 164 AADA-KKKAEAEAAKAAVEAQKKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAE 222 ++ + +AE K+ ++ + A+ ++ + Sbjct: 1135 QSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNS 1194 Query: 223 AEKKAAAEKAAADKKAAAEKAAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAA 282 + A + +++ K + + + + A + A + Sbjct: 1195 VVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTS 1254 Query: 283 EKAAAAKAAAEADDIFGELSSGKNA 307 A + A A F L+ GK Sbjct: 1255 TNTNAVLSDARAKAQFVALNVGKAV 1279 Score = 54.7 bits (131), Expect = 4e-10 Identities = 29/229 (12%), Positives = 71/229 (31%), Gaps = 4/229 (1%) Query: 66 RMQSQESSAKRSDEQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKNQAEEAAK 125 R ++E+ + + + Q+ E +E Q E + +EKE A E + E Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125 Query: 126 QAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAKKAAADAKKKAEAEAAKAAVEAQKK 185 +++ KQ + + A+ + + +E + A + A+ + VE Sbjct: 1126 TSQVSPKQEQSETVQPQAEPARENDP-TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVT 1184 Query: 186 AEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAA---EK 242 E E + + +++ + A ++ Sbjct: 1185 ESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR 1244 Query: 243 AAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKAA 291 + +D +A A+ A + A ++ ++ + Sbjct: 1245 STVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 26.7 bits (59), Expect = 0.009 Identities = 6/18 (33%), Positives = 12/18 (66%) Query: 44 AAKLLNITQPALTRRIKK 61 AA LL + + L ++I++ Sbjct: 455 AADLLGLNRNTLRKKIRE 472
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 47.2 bits (112), Expect = 3e-08 Identities = 36/146 (24%), Positives = 63/146 (43%), Gaps = 5/146 (3%) Query: 197 AREREQGTLDQLLVSPLTTWQIFIGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256 R Q T + +L + L I +G+ A A IG+ A + + L+L Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148 Query: 257 YFTMVI--YGLSLVGFGLLISSLCSTQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314 Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 315 LTWINPIRHFTDITKQIYLKDASLDI 340 P+ H D+ + I L +D+ Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDV 234
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.2 bits (70), Expect = 0.012 Identities = 20/90 (22%), Positives = 28/90 (31%), Gaps = 21/90 (23%) Query: 293 TPRFEDAFIDLLGGAGTSESPLGAILHTVEGTPGETVIEAKELTKKFGDFAATDHVNFAV 352 PR E + +LG P + + + K HV + Sbjct: 547 VPRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYI----LMGHVARVM 589 Query: 353 KRGEIFG----LLGPNGAGKSTTFKMMCGL 378 + G F L G G GKST + GL Sbjct: 590 EPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619 Score = 29.3 bits (65), Expect = 0.047 Identities = 11/23 (47%), Positives = 13/23 (56%) Query: 34 YVTGLVGPDGAGKTTLMRMLAGL 56 Y L G G GK+TL+ L GL Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 62.2 bits (151), Expect = 7e-13 Identities = 42/259 (16%), Positives = 92/259 (35%), Gaps = 25/259 (9%) Query: 82 ALMQAKAGVSVAQAQYDLMLAGYRDEEIAQAAAAVKQAQAAYDYAQNFYNRQQGLWKSRT 141 Q + + +A+ +LA E + + + + L + Sbjct: 201 QKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENK 260 Query: 142 ISA--NDLENARSSRDQAQATLKSAQDKLRQYRSGNREQ---DIAQAKASLEQAQAQLAQ 196 N+L +S +Q ++ + SA+++ + + + + Q ++ +LA+ Sbjct: 261 YVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAK 320 Query: 197 AELNLQDSTLVAPSDGTLLTRAV-EPGTVLNEGGTVFTVSLT-RPVWVRAYVDERNLDQA 254 E Q S + AP + V G V+ T+ + + V A V +++ Sbjct: 321 NEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFI 380 Query: 255 QPGRKVLLYTDGRPNKPYH---GQIGFVSPTAEFTPKTVETPDLRTDLVYRLRIVVT--- 308 G+ ++ + P Y G++ ++ A D R LV+ + I + Sbjct: 381 NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISIEENC 432 Query: 309 ----DADDALRQGMPVTVQ 323 + + L GM VT + Sbjct: 433 LSTGNKNIPLSSGMAVTAE 451
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 72.4 bits (177), Expect = 9e-18 Identities = 32/220 (14%), Positives = 74/220 (33%), Gaps = 29/220 (13%) Query: 13 KGEQAKKQLIAAALAQFGEYGMNATT-REIAAQAGQNIAAITYYFGSKEDLYLACAQWIA 71 + ++ ++ ++ AL F + G+++T+ EIA AG AI ++F K DL+ + Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67 Query: 72 DFIGEQFRPHAEEAERLFAQPQPDRAAIRELILRACRNMIKLLTQDDTVNLSK---FISR 128 IGE E + P + +RE+++ + + + + + Sbjct: 68 SNIGELEL---EYQAKFPGDP---LSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121 Query: 129 EQLSPTAAYHLVHEQVISPLHSHLTRLIAAW---TGCDASDTRMILHTHALIGEILAFRL 185 E A + + + L I A +I+ I ++ Sbjct: 122 EMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM--RGYISGLM---- 175 Query: 186 GKETILLRTGWTAFDEEKTELINQTVTCHIDLILQGLSQR 225 W + + + ++ ++L+ Sbjct: 176 --------ENWLFAPQSFD--LKKEARDYVAILLEMYLLC 205
>SECA#SecA protein signature. Length = 901 Score = 29.8 bits (67), Expect = 0.026 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%) Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGDIRVLVATDI 304 Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++ Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506 Query: 305 AARGLDI 311 A RG DI Sbjct: 507 AGRGTDI 513
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 75.4 bits (185), Expect = 3e-18 Identities = 44/176 (25%), Positives = 71/176 (40%), Gaps = 23/176 (13%) Query: 12 TFDPQQSALIVVDMQNAYATPGGYLDLAGFDVSTTRPVIANIQTAVTAARAAGMLIIWFQ 71 DP ++ L++ DMQN + +D S + ANI+ G+ +++ Sbjct: 25 VPDPNRAVLLIHDMQNYF------VDAFTAGASPVTELSANIRKLKNQCVQLGIPVVY-- 76 Query: 72 NGWDEQYVEAGGPGSPNFHKSNALKTMRNQPQLQGKLLAKGSWDYQLVDELMPQPGDIVL 131 PGS N L G L G ++ +++ EL P+ D+VL Sbjct: 77 ---------TAQPGSQNPDDRALLTDF------WGPGLNSGPYEEKIITELAPEDDDLVL 121 Query: 132 PKPRYSGFFNTPLDSILRSRGIRHLVFTGIATNVCVESTLRDGFFLEYFGVVLEDA 187 K RYS F T L ++R G L+ TGI ++ T + F + + DA Sbjct: 122 TKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDA 177
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 65.8 bits (160), Expect = 2e-15 Identities = 30/165 (18%), Positives = 62/165 (37%), Gaps = 8/165 (4%) Query: 10 GKRSRAVSAKKKAILSAALDTFSQFGFHGTRLEQIAELAGVSKTNLLYYFPSKEALYIAV 69 K + ++ IL AL FSQ G T L +IA+ AGV++ + ++F K L+ + Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62 Query: 70 LRQILDIWLAPLKAFREDF--APLAAIKEYIRLKLEVSRDYPQASRLFCM-----EMLAG 122 ++ F PL+ ++E + LE + + L + E + Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122 Query: 123 APLLMDELTGDLKALIDEKSALIAGWVKSGKL-APIDPQHLIFMI 166 ++ D + +++ L A + + ++ Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167
>ARGDEIMINASE#Bacterial arginine deiminase signature. Length = 409 Score = 29.8 bits (67), Expect = 0.047 Identities = 27/183 (14%), Positives = 61/183 (33%), Gaps = 23/183 (12%) Query: 450 WPRAAENELKK-AEVIEPRNINLEVEQAWTALTLQEWQQA--AVLTHDVVEREPQDPGVV 506 + A E + A +++ + +E + + L ++ ++E E + + Sbjct: 47 YLEVARQEHEVFASILKNNLVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTI 106 Query: 507 -RLK---RAVDVHNLAELRIAGSTGIDAEGPDSGKHDVDLTTIVYS---PPLKDNWRGFA 559 LK ++ + N+ I+G E + DL P+ + F Sbjct: 107 NLLKDYFSSLTIDNMISKMISGVVT--EELKNYTSSLDDLVNGANLFIIDPMPNVL--FT 162 Query: 560 GFGYADGQFSEGKGIVRDWLAGVEWRSRNIWLEAEYAERVFNHEHKPGARLSGWYDFNDN 619 D S G G+ + + + R E +AE +F + + W + + Sbjct: 163 ----RDPFASIGNGVT---INKMFTKVRQ--RETIFAEYIFKYHPVYKENVPIWLNRWEE 213 Query: 620 WRI 622 + Sbjct: 214 ASL 216
>BINARYTOXINA#Clostridial binary toxin A signature. Length = 454 Score = 29.6 bits (66), Expect = 0.027 Identities = 22/77 (28%), Positives = 36/77 (46%), Gaps = 6/77 (7%) Query: 335 DQVIKTVVNIIGKSIRPDDLLA--RVGGEEFGVLLTEIDTECAKALAERIRENVERLTGD 392 D + + N + + P +L+ R G +EFG+ LT + + K E I E+ G Sbjct: 313 DSKVNNIENALKLTPIPSNLIVYRRSGPQEFGLTLTSPEYDFNK--IENIDAFKEKWEGK 370 Query: 393 NPEYAIPQKVTISIGAV 409 Y P ++ SIG+V Sbjct: 371 VITY--PNFISTSIGSV 385
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 41.1 bits (96), Expect = 6e-06 Identities = 17/49 (34%), Positives = 29/49 (59%) Query: 353 TLTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 401 L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546 Score = 34.9 bits (80), Expect = 5e-04 Identities = 21/56 (37%), Positives = 29/56 (51%), Gaps = 4/56 (7%) Query: 6 AVSGLNVAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57 A+SGLN A L+ NNI++ G+ T A + AG VG GV V+G+ Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 43.8 bits (103), Expect = 4e-07 Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%) Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62 S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47 Query: 63 PSGLQIGTGVRPVATERLHSQ 83 +G +G GV +R + Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68 Score = 41.1 bits (96), Expect = 3e-06 Identities = 11/41 (26%), Positives = 21/41 (51%) Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMLQKLTQL 260 S VN+ EE N+ + Q+ Y N++ + T + + L + Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 350 bits (898), Expect = e-126 Identities = 232/232 (100%), Positives = 232/232 (100%) Query: 4 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 63 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60 Query: 64 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 123 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120 Query: 124 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 183 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180 Query: 184 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 235 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 426 bits (1097), Expect = e-151 Identities = 157/363 (43%), Positives = 213/363 (58%), Gaps = 9/363 (2%) Query: 4 FLSALILLLVITAAQAERIRDLTSVQGVRQNSLIGYGLVVGLDGTGDQTTQTPFTTQTLN 63 F + L A RI+D+ S+Q R N LIGYGLVVGL GTGD +PFT Q++ Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72 Query: 64 NMLSQLGITVPTGTNMQLKNVAAVMVTASLPPFGRQGQTIDVVVSSMGNAKSLRGGTLLM 123 ML LGIT G + KN+AAVMVTA+LPPF G +DV VSS+G+A SLRGG L+M Sbjct: 73 AMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIM 131 Query: 124 TPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAVIERELPSQFGVGN 183 T L G D Q+YA+AQG ++V G A +++ R+ NGA+IERELPS+F Sbjct: 132 TSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSV 191 Query: 184 TLNLQLNDEDFSMAQQIADTINRVR----GYGSATALDARTIQVRVPSGNSSQVRFLADI 239 L LQL + DFS A ++AD +N G A D++ I V+ P + R +A+I Sbjct: 192 NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEI 250 Query: 240 QNMQVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQANVSQPDTPFGG 299 +N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF Sbjct: 251 ENLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSR 308 Query: 300 GQTVVTPQTQIDLRQSGGSLQSVRSSASLNNVVRALNALGATPMDLMSILQSMQSAGCLR 359 GQT V PQT I Q G + ++ L +V LN++G +++ILQ ++SAG L+ Sbjct: 309 GQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQ 367 Query: 360 AKL 362 A+L Sbjct: 368 AEL 370
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 509 bits (1311), Expect = 0.0 Identities = 309/313 (98%), Positives = 310/313 (99%) Query: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKGMRDALPKDG 60 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLK MRDALPKDG Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60 Query: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEEPTPAAPMKFPLET 120 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEE TPAAPMKFPLET Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120 Query: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGNSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180 VVRYQNQALSQLVQKAVPRNYDDSLPG+SKAFLAQLSLPAQLASQQSGVPHHLILAQAAL Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180 Query: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL Sbjct: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240 Query: 241 EALSDYVGLLTRNPRYAAVTTAVSAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300 EALSDYVGLLTRNPRYAAVTTA SAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK Sbjct: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300 Query: 301 VSKTYSMNIDNLF 313 VSKTYSMNIDNLF Sbjct: 301 VSKTYSMNIDNLF 313
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 677 bits (1747), Expect = 0.0 Identities = 540/546 (98%), Positives = 544/546 (99%) Query: 2 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 61 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60 Query: 62 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 121 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120 Query: 122 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 181 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180 Query: 182 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 241 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240 Query: 242 RQLAAVPSTADPSRTTVAYIDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 301 RQLAAVPS+ADPSRTTVAY+DGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300 Query: 302 ALAFAGAFNTQHKAGFDANGDAGKDFFAIGKPAVLQNTKNNGDVAIGATVTDASAVLATD 361 ALAFA AFNTQHKAGFDANGDAG+DFFAIGKPAVLQNTKN GDVAIGATVTDASAVLATD Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360 Query: 362 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 421 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 420 Query: 422 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNNKTVGGAKSFNDAYASLVSDIGN 481 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSN+KTVGGAKSFNDAYASLVSDIGN Sbjct: 421 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 480 Query: 482 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 541 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 
Sbjct: 481 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540 Query: 542 ALINIR 547 ALINIR Sbjct: 541 ALINIR 546
>FLAGELLIN#Flagellin signature. Length = 507 Score = 46.2 bits (109), Expect = 8e-08 Identities = 42/226 (18%), Positives = 80/226 (35%), Gaps = 9/226 (3%) Query: 7 MMYQQNMRGITNSQAEWMKYGEQMSTGKRVVNPSDDPIAASQAVVLSQAQAQNSQYTLAR 66 ++ Q N+ +S + + E++S+G R+ + DD + A + +Q + Sbjct: 11 LLTQNNLNKSQSSLSSAI---ERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67 Query: 67 TFATQKVSLEESVLSQVTTAIQNAQEKIVYASNGTLSDDDRASLATDIQGLRDQLLNLAN 126 E L+++ +Q +E V A+NGT SD D S+ +IQ +++ ++N Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127 Query: 127 TTDGNGRYIFAGYKTETAPFSEADGDYVGGTESIKQQVDASRSMVIGHTGDKIFDSITSN 186 T NG + + DG E+I + +G G + + Sbjct: 128 QTQFNGVKVLSQDNQMKIQVGANDG------ETITIDLQKIDVKSLGLDGFNVNGPKEAT 181 Query: 187 AVAEPDGSASETNLFAMLDSAIAALKTPVADSEADKETAAAALDKT 232 + T A + + TA DK Sbjct: 182 VGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKV 227
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 65.1 bits (158), Expect = 2e-12 Identities = 42/226 (18%), Positives = 79/226 (34%), Gaps = 12/226 (5%) Query: 590 PAEQSAPKAEAKPERQQDRR-----KPRQNNRRDRNERRDTRSERTEGSDNREENRRNRR 644 P+ S + A+ + N ++++++ D E +NR Sbjct: 1008 PSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNRE 1067 Query: 645 QAQQQTAETRESRQQAEV------TEKARTTDEQQAPRRERSRRRNDDKRQAQQEVKALN 698 A++ + + + Q EV T++ +TT+ ++ E+ + + + QEV + Sbjct: 1068 VAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEK-TQEVPKVT 1126 Query: 699 VEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSVAEEAVVAPVVEETVAAEPIVQEA 758 + QE + + + R +N K Q+ P E + E V E+ Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTES 1186 Query: 759 AAPRTELVKVPLPVVAQTAPEQQEENNADNRDNGGMPRRSRRSPRH 804 T V P A Q N+ + RRS RS H Sbjct: 1187 TTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH 1232 Score = 62.8 bits (152), Expect = 6e-12 Identities = 48/289 (16%), Positives = 92/289 (31%), Gaps = 38/289 (13%) Query: 513 PSEEEFAERKRPEQPALATFAMPDVPPAPT-PAEPAAPVVAPAPKAATATPASPAQPGLL 571 P E+ + DVP P+ E A AP P A ATP+ + Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE---- 1038 Query: 572 SRFFGALKALFSGGEETKPAEQSAPKAEAKPERQQDRRKP-RQNNRRDRNERRDTRSER- 629 AE S +++ + +QD + QN + + + ++ Sbjct: 1039 -----------------TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ 1081 Query: 630 -TEGSDNREENRRNRRQAQQQTAETRESRQQAEVTEKARTTDEQQAPRRERSRRRNDDKR 688 E + + E + + ++TA + + TEK + + + + + + Sbjct: 1082 TNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141 Query: 689 QAQ---QEVKALNVEEQSVQETEQEERVRPVQPRRKQRQLNQKVRYEQSV--AEEAVVAP 743 QA+ + +N++E Q + +P + + Q V +V V P Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPA--KETSSNVEQPVTESTTVNTGNSVVENP 1199 Query: 744 VVEETVAAEPIVQEAAA------PRTELVKVPLPVVAQTAPEQQEENNA 786 +P V ++ R + VP V T A Sbjct: 1200 ENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVA 1248
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 507 bits (1307), Expect = 0.0 Identities = 241/388 (62%), Positives = 280/388 (72%), Gaps = 33/388 (8%) Query: 1 MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYAR 60 MK+ +A+ V ++L A +A AAEIYNKD NKLDLYGKV+ HYFS + + DGD TY R Sbjct: 1 MKRKVLAL--VIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMR 58 Query: 61 LGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYG 120 +GFKGETQINDQLTG+GQWEY + N E +G++ TRLAFAGLKFGDYGS DYGRNYG Sbjct: 59 VGFKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNYG 117 Query: 121 VAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNNDFFGLVDGLNFAAQYQGKND 180 V YD+ WTD+LPEFGGD++T D +MTGR GVATYRN DFFGLVDGLNFA QYQGKN+ Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177 Query: 181 R----------------TDVTEANGDGFGFSTTYEY-EGFGVGATYAKSDRTDGQVAYGK 223 D+ NGDGFG STTY+ GF GA Y SDRT+ QV G Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237 Query: 224 SKFNASGKNAEVWAAGLKYDANNIYLATTYSETQNMTVFG------NNHIANKAQNFEAV 277 + A G A+ W AGLKYDANNIYLAT YSET+NMT +G + +ANK QNFE Sbjct: 238 T--IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295 Query: 278 AQYQFDFGLRPSVAYLQSKGKDLGVH----GDRDLVKYVDVGATYYFNKNMSTFVDYKIN 333 AQYQFDFGLRP+V++L SKGKDL + D+DLVKY DVGATYYFNKN ST+VDYKIN Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355 Query: 334 LID-DSKFTKTAGIDTDDIVAVGLVYQF 360 L+D D F K AGI TDDIVA+G+VYQF Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383
>PF06291#Lambda prophage Bor protein Length = 102 Score = 175 bits (444), Expect = 5e-61 Identities = 97/97 (100%), Positives = 97/97 (100%) Query: 1 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAAKICGG 60 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAAKICGG Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAAKICGG 65 Query: 61 AENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ 97 AENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ Sbjct: 66 AENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ 102
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 68.5 bits (167), Expect = 2e-17 Identities = 33/82 (40%), Positives = 46/82 (56%) Query: 23 ADEPRQLVTVYPRYPEYAAANYIKGLVEVKFDIGADGTVTRIVFLRSEPHNLFRDEVVKA 82 A PR L P+YP A A I+G V+VKFD+ DG V + L ++P N+F EV A Sbjct: 150 ASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNA 209 Query: 83 MAKWRFEKNRPCQGVKRQFIFT 104 M +WR+E +P G+ +F Sbjct: 210 MRRWRYEPGKPGSGIVVNILFK 231
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 36.2 bits (83), Expect = 6e-04 Identities = 43/240 (17%), Positives = 81/240 (33%), Gaps = 24/240 (10%) Query: 377 TLQADLEKAREMAAKDWAESEASRLKYTEEAQKAYERLQTPLEKYTARQEELNKALKDGK 436 L A + S A + + L+ + E Sbjct: 222 ALAARKADLEKALEGAMNFSTADS-AKIKTLEAEKAALEARQAELEKALEGAMNFSTADS 280 Query: 437 ILQTDYNTLMAAAKKDYEATLKKPKQSGVKVSAGERQEDSAHAALLTLQAELRTLEKHAG 496 AA + + + + + R D++ A L+AE + LE+ Sbjct: 281 AKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNK 340 Query: 497 ANEKISQQ-RRDL-------WKAESQFAVLKEAAQRRQLSAQEKS--LLAHKDETLEYKR 546 +E Q RRDL + E++ L+E + + S Q L A ++ + ++ Sbjct: 341 ISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEK 400 Query: 547 QLAALGDKVTYQEHLNALAQQADKFAQQQRAKRAAIDAKNRGLTDRQAAREATEQRLKEQ 606 L K+ E LN +++ K ++++A + QA EA + LKE+ Sbjct: 401 ALEEANSKLAALEKLNKELEESKKLTEKEKA-------------ELQAKLEAEAKALKEK 447
>PF06291#Lambda prophage Bor protein Length = 102 Score = 27.7 bits (61), Expect = 0.014 Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 5/40 (12%) Query: 135 MTGILFSLGASMVLGGVAQML-----APKARTPRTQTTDN 169 M +LFS +M++ G AQ P A TP+ T + Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHH 45
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 40.8 bits (95), Expect = 2e-05 Identities = 27/132 (20%), Positives = 56/132 (42%), Gaps = 15/132 (11%) Query: 121 SQSAAAAKKSETAAASSRNA--AKTSETNAGNSAKAAASSKTAAQNAATAAERSETNARA 178 S + A+ E A ++T+ET A NS + + + + Q+A + N Sbjct: 1012 SNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQ---NREV 1068 Query: 179 SEEASADSEEASRRN--AESAAENAGVATTKAREAAADATKAGQKKDEALSAATRAEKAA 236 ++EA ++ + ++ N A+S +E +E TK ++ A EK Sbjct: 1069 AKEAKSNVKANTQTNEVAQSGSE--------TKETQTTETKETATVEKEEKAKVETEKTQ 1120 Query: 237 DRAEVAAEVTAE 248 + +V ++V+ + Sbjct: 1121 EVPKVTSQVSPK 1132
>adhesinb#Adhesin B signature. Length = 310 Score = 329 bits (846), Expect = e-115 Identities = 90/296 (30%), Positives = 163/296 (55%), Gaps = 7/296 (2%) Query: 9 MLLGGLALTCSIAFQASATEKFKVITTFTIIADMAKNVAGDAAEVSSITKPGAEIHEYQP 68 +G A + + + + K V+ T +IIAD+ KN+AGD + SI G + HEY+P Sbjct: 13 AFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSIVPVGQDPHEYEP 72 Query: 69 TPGDIKRAQGAQLILANGMNLEL----WFQRFYQHLNGVPE---VIVSSGVTPVGITEGP 121 P D+K+ A LI NG+NLE WF + ++ VS GV + + Sbjct: 73 LPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYLEGQS 132 Query: 122 YEGKPNPHAWMSPDNALIYVDNIRDALIKYDPANAQTYQRNADTYKAKITQTLAPLRKQI 181 +GK +PHAW++ +N +IY NI L + DPAN +TY++N Y K++ +++ Sbjct: 133 EKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKF 192 Query: 182 TELPENQRWMVTSEGAFSYLARDLGLKELYLWPINADQQGTPQQVRKVVDIVKKNHIPAV 241 +P ++ +VTSEG F Y ++ + Y+W IN +++GTP Q++ +V+ ++K +P++ Sbjct: 193 NNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLRKTKVPSL 252 Query: 242 FSESTISDKPARQVARETGAHYGGVLYVDSLSTENGPVPTYIDLLKVTTSTLVQGI 297 F ES++ D+P + V+++T ++ DS++ + +Y ++K + +G+ Sbjct: 253 FVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYNLEKIAEGL 308
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 49.2 bits (117), Expect = 9e-10 Identities = 17/61 (27%), Positives = 25/61 (40%) Query: 126 NDYWWIKSFYIAPEHRGMGLADELIKHLIKEAKSEKALELRLYVHGDNGRAIRAYERCGF 185 N Y I+ +A ++R G+ L+ I+ AK L L N A Y + F Sbjct: 87 NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146 Query: 186 I 186 I Sbjct: 147 I 147
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.1 bits (62), Expect = 0.021 Identities = 9/41 (21%), Positives = 21/41 (51%), Gaps = 2/41 (4%) Query: 3 KRAKNQIVDSDIARLLLKLRKSRNLTVTELAQRSGVSQAMI 43 + + I+D A L + + ++ E+A+ +GV++ I Sbjct: 10 QETRQHILDV--ALRLFSQQGVSSTSLGEIAKAAGVTRGAI 48
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 30.3 bits (68), Expect = 0.002 Identities = 15/59 (25%), Positives = 23/59 (38%), Gaps = 5/59 (8%) Query: 72 LEALFVDASARGLGVGKHLISHAL--ALHPD---LSVDVNEQNHQAVGFYQHMGFKLSG 125 +E + V R GVG L+ A+ A L ++ + N A FY F + Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 42.7 bits (100), Expect = 5e-08 Identities = 27/101 (26%), Positives = 46/101 (45%), Gaps = 1/101 (0%) Query: 8 TRSIYRELGATLSYNMRLGNGMEIEPWLKAAVRKEFVDDNRVKVNSDGNFVNDLSGRRGI 67 S+ LG + + L G +++P++KA+V +EF V N + +L G R Sbjct: 811 GSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNGIAH-RTELRGTRAE 869 Query: 68 YQAGIKASFSSTLSGHLGVGYSRGAGVESPWNAVAGVNWSF 108 G+ A+ S + YS+G + PW AG +S+ Sbjct: 870 LGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW 910
>adhesinmafb#Neisseria meningitidis: adhesin MafB signature. Length = 467 Score = 30.4 bits (68), Expect = 8e-04 Identities = 16/57 (28%), Positives = 20/57 (35%), Gaps = 2/57 (3%) Query: 41 GPMPAVDSNDPGTAGFTGSTVIAEFESLEAAQAWADADPYVAAGVYEHVSVKPFKKV 97 P+PA G GS E + EA W +P A V +V KV Sbjct: 268 APLPA--EGKFAVIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 253 bits (648), Expect = 1e-87 Identities = 234/239 (97%), Positives = 236/239 (98%), Gaps = 1/239 (0%) Query: 6 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQA 65 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMV PADLEPPQA Sbjct: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA 60 Query: 66 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQ-PKRDVKPVESR 124 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKV++ PKRDVKPVESR Sbjct: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120 Query: 125 PASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 184 PASPFENTAPAR TSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF Sbjct: 121 PASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 180 Query: 185 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 243 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ Sbjct: 181 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 239
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 114 bits (288), Expect = 9e-31 Identities = 65/410 (15%), Positives = 136/410 (33%), Gaps = 97/410 (23%) Query: 11 VVAIGILLAGVVFFIW-WVSK--------GRFIQTTDDAYIGGNITTVASKVSGYISAIE 61 +VA I+ V+ FI + + G+ G + + + I Sbjct: 59 LVAYFIMGFLVIAFILSVLGQVEIVATANGKLT-------HSGRSKEIKPIENSIVKEII 111 Query: 62 VRDNQSVKKGDIILRLDDRDYRANVARLEAKIKSSKANLESIQATI-------------- 107 V++ +SV+KGD++L+L A+ + ++ + ++ Q Sbjct: 112 VKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLP 171 Query: 108 -------------AMQQSIIQSASETWQAVKHEEQKRLRD--------TERYEKLAQSAA 146 S+I+ TWQ K++++ L R + + Sbjct: 172 DEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSR 231 Query: 147 ISQQIIDNAR-------FDYQQVAAKERK---AANDFLVEKQRLAVLSAQEENVRASI-- 194 + + +D+ V +E K A N+ V K +L + ++ + + Sbjct: 232 VEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL 291 Query: 195 ------EEVLAALTQALL--------------DLEYTLVRAPIDGIVANRSAHT-GSWVE 233 E+L L Q + +++RAP+ V HT G V Sbjct: 292 VTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVT 351 Query: 234 GGTSLVSLVPVSE-LWVDANYKENQIAGMKPGMKAEIRADILKGEVFH---GHIESLSPA 289 +L+ +VP + L V A + I + G A I+ + + G +++++ Sbjct: 352 TAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLD 411 Query: 290 TGASFSLIPIENATGNFTKIVQRVPVRIAFDDAKELKQLLRPGLSVTVSV 339 + G ++ + K + L G++VT + Sbjct: 412 A-------IEDQRLGLVFNVIISIEENCLSTGNKNIP--LSSGMAVTAEI 452
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 106 bits (267), Expect = 5e-27 Identities = 81/418 (19%), Positives = 170/418 (40%), Gaps = 21/418 (5%) Query: 3 SMRKHIAFASMCIGLFIAQLDIQIVSSSLNEIGGGLSAGKDEMAWLQTSYLIAEIIVIPL 62 ++R + +CI F + L+ +++ SL +I + W+ T++++ I + Sbjct: 9 NLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAV 68 Query: 63 SGWLSRVFSTRWLFTLSAGIFTLMSIACGLAWN-IQIMIFFRALQGVAGASMIPLVFTTA 121 G LS + L I S+ + + ++I R +QG A+ LV Sbjct: 69 YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVV 128 Query: 122 FIYYQGKELGLAAAVVSALASLSPTLGPTLGGWITDNLDWRWLFYINILPGIYLVLSIPF 181 Y + G A ++ ++ ++ +GP +GG I + W +L I ++ ++++PF Sbjct: 129 ARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI----TIITVPF 184 Query: 182 LVNFDKPDLSLLKVADYPSIILLAMTLGCLEYTLEEGARWGWLDDNTILLTSVLALVSFI 241 L+ K ++ + D IIL+++ + +L +++++SF+ Sbjct: 185 LMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTS-YSISFL---------IVSVLSFL 234 Query: 242 LFAARTLKISNPIMDLHAFKDKNFTLGCFFSFSGGVGIFSTVYLIPVFLGQVRGLNAEEI 301 +F K+++P +D K+ F +G + V ++P + V L+ EI Sbjct: 235 IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEI 294 Query: 302 GFAVCTTG-IFQLFSVPFYFWLSKKINLRWLLMAGMGGFVFSMYL--FTPITHEWGWQEL 358 G + G + + L + ++L G+ S F T W + + Sbjct: 295 GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSW-FMTI 353 Query: 359 LFPQAIRGISQQFAMAHIVTLTLGGIPKERLKLASGVFNLTRNLGGAIGIALCGSILN 416 + + G+S F I T+ + ++ + N T L GIA+ G +L+ Sbjct: 354 IIVFVLGGLS--FTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 30.8 bits (69), Expect = 1e-04 Identities = 12/31 (38%), Positives = 12/31 (38%) Query: 10 PVPEPIPGDPVPVPDPIPRPQPMPDPPPDEE 40 P P P PG P P P P PP E Sbjct: 575 PKPAPQPGPQPPQPPQPQPEAPAPQPPAGRE 605 Score = 26.6 bits (58), Expect = 0.005 Identities = 11/23 (47%), Positives = 11/23 (47%) Query: 19 PVPVPDPIPRPQPMPDPPPDEEP 41 P P P P P PQP P P E Sbjct: 573 PAPKPAPQPGPQPPQPPQPQPEA 595
>ICENUCLEATIN#Ice nucleation protein signature. Length = 1258 Score = 33.2 bits (75), Expect = 0.006 Identities = 25/133 (18%), Positives = 53/133 (39%), Gaps = 8/133 (6%) Query: 545 GHDQSITVANDRCITVRNDQTLQVTNDRTVSVSNDDGLYVRNDRKVTVEGKQEHKTTGNH 604 G +S + +R + + + Q R+ +S D + + +R + G +T G+ Sbjct: 1091 GP-ESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAGADSTQTAGDR 1149 Query: 605 VSLVEGKHSLVVKGDLARKVSGALGIKVDGDIVLESSSRISLKVGGSFVVIHSGGVDIVG 664 L+ G +S + GD ++ +G D +L + R L G + ++ ++G Sbjct: 1150 SKLLAGNNSYLTAGDRSKLTAG-------NDCILMAGDRSKLTAGINSILTAGCRSKLIG 1202 Query: 665 PKISLNSGGSPGT 677 S + G Sbjct: 1203 SNGSTLTAGENSV 1215 Score = 30.5 bits (68), Expect = 0.030 Identities = 15/69 (21%), Positives = 35/69 (50%) Query: 567 QVTNDRTVSVSNDDGLYVRNDRKVTVEGKQEHKTTGNHVSLVEGKHSLVVKGDLARKVSG 626 Q+ + R+ ++ + + +R + + GK +T G +L+ G S+ + G+ + ++G Sbjct: 1080 QIASHRSSLIAGPESTQITGNRSMLIAGKGSSQTAGYRSTLISGADSVQMAGERGKLIAG 1139 Query: 627 ALGIKVDGD 635 A + GD Sbjct: 1140 ADSTQTAGD 1148
>PF07299#Fibronectin-binding protein (FBP) Length = 219 Score = 28.3 bits (63), Expect = 0.010 Identities = 16/51 (31%), Positives = 23/51 (45%), Gaps = 13/51 (25%) Query: 61 LNDMYAFIPGDNYYFIKS------SGYKFVND-------KWFTLKSINNIF 98 + M AFI D Y FIKS +G+ ND K ++ I ++F Sbjct: 4 VIKMEAFIRSDQYNFIKSQAYILANGHATANDRGVIQALKSLAIEKIIHVF 54
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 54.5 bits (131), Expect = 3e-10 Identities = 41/192 (21%), Positives = 84/192 (43%), Gaps = 8/192 (4%) Query: 36 LSDIAHSFHMQTAQVGIMLTIYAWVVALMSLPFMLMTSQVERRKLLICLFVVFIASHVLS 95 L DIA+ F+ A + T + ++ + + ++ Q+ ++LL+ ++ V+ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 96 FLSWS-FTVLVISRIGVAFAHAIFWSITASLAIRMAPAGKRAQALSLIATGTALAMVLGL 154 F+ S F++L+++R A F ++ + R P R +A LI + A+ +G Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 155 PLGRIVGQYFGWRMTFFAIGIGALITLLCLIKLLPLLPSEHSGSLKSLPLLFRRPALMSI 214 +G ++ Y W I + +IT+ L+KLL + LMS+ Sbjct: 157 AIGGMIAHYIHWSY-LLLIPMITIITVPFLMKLLK------KEVRIKGHFDIKGIILMSV 209 Query: 215 YLLTVVVVTAHY 226 ++ ++ T Y Sbjct: 210 GIVFFMLFTTSY 221
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.7 bits (98), Expect = 3e-06 Identities = 42/239 (17%), Positives = 82/239 (34%), Gaps = 18/239 (7%) Query: 7 RSTSALLASSLLLTIGRGATLPFMTIYLSRQYSLSVDLI---GYAMTIALTIGVVFSLGF 63 R +L++ L +G G +P + L R S D+ G + + + + Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLL-RDLVHSNDVTAHYGILLALYALMQFACAPVL 63 Query: 64 GILADKFDKKRYMLMAITAFASGFIAIPLVNNVTLVVLFFALINCAYSVFATVLKAWFAD 123 G L+D+F ++ +L+++ A + + + ++ + + + A A+ AD Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIAD 122 Query: 124 NLSSTSKTKIFSINYTMLNIGWTIGPPLGTLLVMQSINLPFWLAAICSAFPMLFIQIWVK 183 + + F G GP LG L+ S + PF+ AA + L + Sbjct: 123 ITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182 Query: 184 RSEK---------IIATETGSVWSPKVLLQDKALLWFTCSGFLASFVSGAFASCISQYV 233 S K + W + A L F+ V A+ + Sbjct: 183 ESHKGERRPLRREALNPLASFRW--ARGMTVVAALMAV--FFIMQLVGQVPAALWVIFG 237 Score = 32.5 bits (74), Expect = 0.002 Identities = 21/155 (13%), Positives = 60/155 (38%), Gaps = 2/155 (1%) Query: 7 RSTSALLASSLLLTIGRGATLPFMTIYLSRQYSLSVDLIGYAMTIALTIGVVF-SLGFGI 65 +AL+A ++ + I+ ++ IG ++ + + ++ G Sbjct: 210 TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGP 269 Query: 66 LADKFDKKRYMLMAITAFASGFIAIPLVNNVTLVVLFFALINCAYSVFATVLKAWFADNL 125 +A + ++R +++ + A +G+I + + L+ + L+A + + Sbjct: 270 VAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQV 328 Query: 126 SSTSKTKIFSINYTMLNIGWTIGPPLGTLLVMQSI 160 + ++ + ++ +GP L T + SI Sbjct: 329 DEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASI 363
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 100 bits (249), Expect = 2e-27 Identities = 70/244 (28%), Positives = 114/244 (46%), Gaps = 16/244 (6%) Query: 2 IVLVTGATAGFGECITRRFIQQGHKVIATGRRQERLQELKDELGDNLYIAQ---LDVRNR 58 I +TGA G GE + R QG + A E+L+++ L A+ DVR+ Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69 Query: 59 AAIEEMLASLPAEWCNIDILVNNAGLALGMEPAHKASVEDWETMIDTNNKGLVYMTRAVL 118 AAI+E+ A + E IDILVN AG+ L H S E+WE N+ G+ +R+V Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 119 PGMVERNHGHIINIGSTAGSWPYAGGNVYGATKAFVRQFSLNLRTDLHGTAVRVTDIEPG 178 M++R G I+ +GS P Y ++KA F+ L +L +R + PG Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188 Query: 179 LVGGTEFSNVRFKGDDGKAE------KTYQNTVALT----PEDVSEAV-WWVSTLPAHVN 227 T+ + ++G + +T++ + L P D+++AV + VS H+ Sbjct: 189 ST-ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 228 INTL 231 ++ L Sbjct: 248 MHNL 251
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 48.7 bits (116), Expect = 3e-08 Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 16/118 (13%) Query: 72 VGAFIFGKMGDRIGRKKVLFITITMMGICTTLIGVLPTYAQIGVFAPILLVTLRIIQGLG 131 +G ++GK+ D++G K++L I + + + V ++ + + A R IQG G Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMA-------RFIQGAG 116 Query: 132 AGAEISGAGTMLAEYAPKGKR----GIISSFVAMGTNCGTLSATAI-----WAFMFFI 180 A A + ++A Y PK R G+I S VAMG G I W+++ I Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI 174
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 119 bits (301), Expect = 3e-39 Identities = 34/89 (38%), Positives = 55/89 (61%) Query: 4 TKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFGNFDLRDKNQRPGR 63 K ++ + + L+K+D+ V+ F + L GE+V+L GFGNF++R++ R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 64 NPKTGEDIPITARRVVTFRPGQKLKSRVE 92 NP+TGE+I I A +V F+ G+ LK V+ Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 31.6 bits (72), Expect = 0.002 Identities = 14/61 (22%), Positives = 28/61 (45%), Gaps = 5/61 (8%) Query: 74 SNKAELTAIIARETGKPRWEAATEVTAMINKIAISIKAYHVRTGEQRSEMPDGAASLRHR 133 +NK +L A +A T + ++A V A+ + ++ + GE+ + G +R R Sbjct: 2 ANKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAK-----GEKVQLIGFGNFEVRER 56 Query: 134 P 134 Sbjct: 57 A 57
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 104 bits (261), Expect = 1e-26 Identities = 81/373 (21%), Positives = 138/373 (36%), Gaps = 89/373 (23%) Query: 3 IGIDLGTTNSLAAVWRNGQSELIPNALGKFLTPSVVCVDEDG------MVLTGEAARDLQ 56 + IDLGT N+L V G ++ N PSVV + +D + G A Sbjct: 13 LSIDLGTANTLIYVKGQG---IVLN------EPSVVAIRQDRAGSPKSVAAVGHDA---- 59 Query: 57 LIKPQNCASNFKRMMGTS-------KTLKLG--GREFRAEELSSLILRQLKEDAENYLGE 107 K+M+G + + +K G F E++ ++Q+ ++ Sbjct: 60 -----------KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNS---FMR 105 Query: 108 EVTEAVISVPAYFGDMQRKATKAAATMAGLNVERLINEPTAAALAYGLHNKDDEHQFLVF 167 ++ VP ++R+A + +A AG LI EP AAA+ GL + +V Sbjct: 106 PSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGS-MVV 164 Query: 168 DLGGGTFDVSILELFDNIMEVRAS-AGDNFLGGEDIVDILIDAYCSRRDLPENIEWREPT 226 D+GGGT +V+++ L + GD F E I++ + Y E T Sbjct: 165 DIGGGTTEVAVISLNGVVYSSSVRIGGDRF--DEAIINYVRRNY--------GSLIGEAT 214 Query: 227 FQRHLRIEAERVKRVLS--VRDEATFSVEIEGRRYYWHL-------TTEKFEFL---LQT 274 AER+K + + +E+ GR + + E E L L Sbjct: 215 --------AERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTG 266 Query: 275 FFERVHMPLER-------AIRDAKINISQLDQVVLVGGTTRMPLIRKLVTRLFGRIPAMH 327 V + LE+ I + + VL GG + + +L+ G + Sbjct: 267 IVSAVMVALEQCPPELASDISERGM--------VLTGGGALLRNLDRLLMEETGIPVVVA 318 Query: 328 LNPDEVIAQGAAI 340 +P +A+G Sbjct: 319 EDPLTCVARGGGK 331
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 36.9 bits (85), Expect = 3e-05 Identities = 35/192 (18%), Positives = 55/192 (28%), Gaps = 58/192 (30%) Query: 2 PPRALLLV-DLQNDFCAGGALAVPEGDSTVDVANRLIDWCQSRGEAVI-----ASQD--- 52 P RA+LL+ D+QN F +L + C G V+ SQ+ Sbjct: 28 PNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNPDD 87 Query: 53 -------WHPANHGSFASQHGVEPYTPGQLDGLPQTFWPDHCVQNSEGAQLHPLLKQKAI 105 W P + + + P D + T W Sbjct: 88 RALLTDFWGPGLNSGPYEEKIITELAPEDDDLV-LTKW---------------------- 124 Query: 106 AAVFHKGENPLVDSYSAFFDNGRRQKTALDDWLRAHVINELIVMGLATDYCVKFTVLDAL 165 YSAF +T L + +R ++LI+ G+ T +A Sbjct: 125 -------------RYSAFK------RTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAF 165 Query: 166 QLGYKVNVITDG 177 K + D Sbjct: 166 MEDIKAFFVGDA 177
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 39.5 bits (92), Expect = 2e-05 Identities = 30/129 (23%), Positives = 50/129 (38%), Gaps = 1/129 (0%) Query: 65 ALMFGYFIGSLTGGFIGDYFGRRRAFRINLLIVGIAATGAAFVPDMY-WLIFFRFLMGTG 123 A M + IG+ G + D G +R ++I + + LI RF+ G G Sbjct: 57 AFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG 116 Query: 124 MGALIMVGYASFTEFIPATVRGKWSARLSFVGNWSPMLSAAIGVVVIAFFSWRIMFLLGG 183 A + +IP RGK + + + AIG ++ + W + L+ Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM 176 Query: 184 IGILLAWFL 192 I I+ FL Sbjct: 177 ITIITVPFL 185
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.0 bits (70), Expect = 0.011 Identities = 33/142 (23%), Positives = 48/142 (33%), Gaps = 23/142 (16%) Query: 71 MFLGALVGGIIGDKTGRRNAFILYEAIHIASMVVGAFSPNMDF-LIACRFVMGVGLGALL 129 +G V G + D+ G + + I+ V+G + LI RF+ G G A Sbjct: 62 FSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFP 121 Query: 130 VTLFAGFTEYMPGRNR----GTWSSRVSFIGNWSYPLCSLIAMGLTPLISA----EWNWR 181 + Y+P NR G S V+ + G+ P I +W Sbjct: 122 ALVMVVVARYIPKENRGKAFGLIGSIVA------------MGEGVGPAIGGMIAHYIHWS 169 Query: 182 VQLLIPAILSLIATALAWRYFP 203 LLIP I I T Sbjct: 170 YLLLIPMI--TIITVPFLMKLL 189
>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature. Length = 372 Score = 28.9 bits (64), Expect = 0.021 Identities = 18/81 (22%), Positives = 34/81 (41%), Gaps = 13/81 (16%) Query: 158 ETTSALHTYFNVGDIAKVSVSGLGDRFIDKVNDAKED-----------VLTDGIQTFPDR 206 E ++AL + N D K S S L + F ++V + + V ++ F + Sbjct: 57 EMSAALAQFRNRRDYEKKS-SNLSNSF-ERVLEDEALPKAKQILKLISVHGGALEDFLRQ 114 Query: 207 TDRVYLNPQDCSVINDEALNR 227 ++ +P D ++ E L R Sbjct: 115 ARSLFPDPSDLVLVLRELLRR 135
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 27.7 bits (61), Expect = 0.022 Identities = 18/61 (29%), Positives = 26/61 (42%) Query: 49 QGLSIGIIILTIGVMAPIASGTLPPSTLIHSFLNWKSLVAIAVGVIVSWLGGRGVTLMGS 108 Q +I L IG + + LPPS ++ N ++ A VS LG +TL G Sbjct: 174 QRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDGG 233 Query: 109 Q 109 Sbjct: 234 H 234
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.2 bits (68), Expect = 0.017 Identities = 10/57 (17%), Positives = 17/57 (29%), Gaps = 2/57 (3%) Query: 164 RFTLLPIFRIPVKMQKVSAASPLTQKPDQARRRF--RLGMLVFFGMLGWALLTAMNQ 218 R L R + + + A L + P R R M ++L + Sbjct: 26 RKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEI 82
>PF01206#SirA family protein Length = 76 Score = 92.5 bits (230), Expect = 6e-29 Identities = 16/71 (22%), Positives = 37/71 (52%) Query: 7 DYRLDMVGEPCPYPAVATLEAMPQLKKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQ 66 D LD G CP P + + + + GE+L V++ P S+ + ++ G+ +L+ ++ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 67 DGPTIRYLIQK 77 + T + +++ Sbjct: 65 EDGTYHFRLKR 75
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 510 bits (1314), Expect = 0.0 Identities = 240/388 (61%), Positives = 282/388 (72%), Gaps = 33/388 (8%) Query: 1 MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYVR 60 MK+ +A+ V ++L A +A AAEIYNKD NKLDLYGKV+ HYFS + + DGD TY+R Sbjct: 1 MKRKVLAL--VIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMR 58 Query: 61 LGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYG 120 +GFKGETQINDQLTG+GQWEY + N E +G++ TRLAFAGLKFGDYGS DYGRNYG Sbjct: 59 VGFKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNYG 117 Query: 121 VAYDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNNDFFGLVDGLNFAAQYQGKND 180 V YD+ WTD+LPEFGGD++T D +MTGR GVATYRN DFFGLVDGLNFA QYQGKN+ Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177 Query: 181 R----------------TDVTEANGDGFGFSTTYEY-EGFGVGATYAKSDRTNDQVIYGN 223 D+ NGDGFG STTY+ GF GA Y SDRTN+QV G Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237 Query: 224 NSLNASGQNAEVWAAGLKYDANNIYLATTYSETQNMTVFG------NNHIANKAQNFEVV 277 A G A+ W AGLKYDANNIYLAT YSET+NMT +G + +ANK QNFEV Sbjct: 238 T--IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295 Query: 278 AQYQFDFGLRPSVAYLQSKGKDLG----AWGDQDLVEYIDVGATYYFNKNMSTFVDYKIN 333 AQYQFDFGLRP+V++L SKGKDL D+DLV+Y DVGATYYFNKN ST+VDYKIN Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355 Query: 334 LIDKSD-FTKASGVATDDIVAVGLVYQF 360 L+D D F K +G++TDDIVA+G+VYQF Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 117 bits (293), Expect = 8e-38 Identities = 102/103 (99%), Positives = 102/103 (99%) Query: 2 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTVARTQAEKFTL 61 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQT ARTQAEKFTL Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60 Query: 62 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 751 bits (1941), Expect = 0.0 Identities = 476/555 (85%), Positives = 513/555 (92%), Gaps = 5/555 (0%) Query: 3 ATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGA 62 +TA Q K LEWLNRLRANP+IPLIVAGSAAVA++VA++LWAK PDYRTLFSNLSDQDGGA Sbjct: 5 STATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGA 64 Query: 63 IVSQLTQMNIPYRFSEASGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 122 IV+QLTQMNIPYRF+ SGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ Sbjct: 65 IVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 124 Query: 123 FSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPSLFVREQKSPSASVTVNLLPGRA 182 FSEQVNYQRALEGEL+RTIET+GPVK ARVHLAMPKPSLFVREQKSPSASVTV L PGRA Sbjct: 125 FSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRA 184 Query: 183 LDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEGRI 242 LDEGQISA+VHLVSSAVAGLPPGNVTLVDQ GHLLTQSNTSGRDLNDAQLK+A+DVE RI Sbjct: 185 LDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRI 244 Query: 243 QRRIEAILSPIVGNGNIHAQVSAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSG 302 QRRIEAILSPIVGNGN+HAQV+AQLDFA+KEQTEE Y PNGD S A LRSRQLN SEQ G Sbjct: 245 QRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVG 304 Query: 303 SGYPGGVPGALSNQPAPANNAPISTPPTNQNNRQQ--QASTTSNS---GPRSTQRNETSN 357 +GYPGGVPGALSNQPAP N API+TPPTNQ N Q Q ST++NS GPRSTQRNETSN Sbjct: 305 AGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSN 364 Query: 358 YEVDRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQMKQIEALTREAMGFSEK 417 YEVDRTIRHTKMNVGD++RLSVAVVVNYKTL DGKPLPL+ +QMKQIE LTREAMGFS+K Sbjct: 365 YEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDK 424 Query: 418 RGDSLNVVNSPFNSSDESGGALPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLT 477 RGD+LNVVNSPF++ D +GG LPFWQQQ+FIDQLLAAGRWLLVL+VAW+LWRKAVRPQLT Sbjct: 425 RGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLT 484 Query: 478 RRAEAVKTVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 537 RR E K Q+QAQ R+E E+AVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 
Sbjct: 485 RRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 544 Query: 538 VVALVIRQWINNDHE 552 VVALVIRQW++NDHE Sbjct: 545 VVALVIRQWMSNDHE 559
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 341 bits (876), Expect = e-119 Identities = 117/329 (35%), Positives = 197/329 (59%), Gaps = 2/329 (0%) Query: 1 MSNLTGTDKSVILLMTIGEDRAAEVFKHLSQREVQTLSAAMANVTQISNKQLTDVLAEFE 60 +S LTG K+ ILL++IG + +++VFK+LSQ E+++L+ +A + I+++ +VL EF+ Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71 Query: 61 QEAEQFAALNINANDYLRSVLVKALGEERAASLLEDILETRDTASGIETLNFMEPQSAAD 120 + + DY R +L K+LG ++A ++ + L + + E + +P + + Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILN 130 Query: 121 LIRDEHPQIIATILVHLKRAQAADILALFDERLRHDVMLRIATFGGVQPAALAELTEVLN 180 I+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190 Query: 181 GLLDGQ-NLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENL 239 L + + GGV EIIN+ + E+ +I ++ E D ELA++I +MF+FE++ Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250 Query: 240 VDVDDRSIQRLLQEVDSESLLIALKGAEQPLREKFLRNMSQRAADILRDDLANRGPVRLS 299 V +DDRSIQR+L+E+D + L ALK + P++EK +NMS+RAA +L++D+ GP R Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310 Query: 300 QVENEQKAILLIVRRLAETGEMVIGSGED 328 VE Q+ I+ ++R+L E GE+VI G + Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGE 339
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 370 bits (951), Expect = e-134 Identities = 224/228 (98%), Positives = 227/228 (99%) Query: 1 MSDNLPWKTWMPDDLAPPQAEFVPMVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60 MSDNLPWKTW PDDLAPPQAEFVP+VEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60 Query: 61 AEGRQQGHEQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120 AEGRQQGH+QGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL Sbjct: 61 AEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120 Query: 121 MQMALEAARQVIGQTPTMDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180 MQMALEAARQVIGQTPT+DNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT Sbjct: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180 Query: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV Sbjct: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228
>FLGFLIJ#Flagellar FliJ protein signature. Length = 147 Score = 202 bits (515), Expect = 2e-70 Identities = 146/147 (99%), Positives = 147/147 (100%) Query: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 Query: 61 MTSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120 +TSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120 Query: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147 AALLAENRLDQKKMDEFAQRAAMRKPE Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147
>FLGHOOKFLIK#Flagellar hook-length control protein signature. Length = 375 Score = 469 bits (1208), Expect = e-168 Identities = 367/375 (97%), Positives = 370/375 (98%) Query: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK Sbjct: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60 Query: 61 GEPLVSDIVSDAQQADLLIPVDETLPVINDEQSTSTPLTTAQTMTLAAVADKNTTKDEKA 120 GEPL+SDIVSDAQQA+LLIPVDET PVINDEQSTSTPLTTAQTM LAAVADKNTTKDEKA Sbjct: 61 GEPLISDIVSDAQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAAVADKNTTKDEKA 120 Query: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPAEKPTLFTKLTSAQLTTAQPDDAP 180 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLP EKPTLFTKLTS QLTTAQPDDAP Sbjct: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAP 180 Query: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTADASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240 GTPAQPLTPLVAEAQSKAEVISTPSPVTA ASPLITPHQTQPLPTVAAPVLSAPLGSHEW Sbjct: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240 Query: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMISPHQHVRAALEAA 300 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQM+SPHQHVRAALEAA Sbjct: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA 300 Query: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS Sbjct: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360 Query: 361 LQGRVTGNSGVDIFA 375 LQGRVTGNSGVDIFA Sbjct: 361 LQGRVTGNSGVDIFA 375
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 385 bits (989), Expect = e-136 Identities = 85/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%) Query: 20 ILSQAEIDALLNGDS--EVKDEPTASVSGESDIRPYDPNTQRRVVRERLQALEIINERFA 77 +LSQ EID LL S + E +S I YD + +E+++ L +++E FA Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63 Query: 78 RHFRMGLFNLLRRSPDITVGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 137 R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+ Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123 Query: 138 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 197 F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L + Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181 Query: 198 EMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 255 E +F I P+++VV ++G G N C+P+ IEP+ L + +S R Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240 Query: 256 NEDQNWRDNLVRQVQHSQLELVANFADISLRLSQILKLKPGDVLPIEKP---DRIIAHVD 312 + + L ++ +++VA + L + IL L+ GD++ + D + + Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300 Query: 313 GVPVLTSQYGTLNGQYALRIEHLI 336 Q G + + A +I I Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 211 bits (538), Expect = 4e-74 Identities = 125/137 (91%), Positives = 133/137 (97%) Query: 1 MSDMNNPADDNNGAMDDLWAEALSEQKSTSGKSAADAVFQQFGGGDVSGTLQDIDLIMDI 60 MSDMNNP+D+N GA+DDLWA+AL+EQK+T+ KSAADAVFQQ GGGDVSG +QDIDLIMDI Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60 Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 Query: 121 RITDIITPSERMRRLSR 137 RITDIITPSERMRRLSR Sbjct: 121 RITDIITPSERMRRLSR 137
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 333 bits (856), Expect = e-119 Identities = 244/245 (99%), Positives = 244/245 (99%) Query: 1 MRRLFSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60 MRRL SVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60 Query: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120 Query: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180 Query: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 Query: 241 QSFYS 245 QSFYS Sbjct: 241 QSFYS 245
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 67.1 bits (164), Expect = 1e-18 Identities = 22/78 (28%), Positives = 42/78 (53%) Query: 4 ESVMMMGTEAMKVALALAAPLLLVALVTGLIISILQAATQINEMTLSFIPKIIAVFIAII 63 + ++ G +A+ + L L+ +VA + GL++ + Q TQ+ E TL F K++ V + + Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 IAGPWMLNLLLDYVRTLF 81 + W +LL Y R + Sbjct: 62 LLSGWYGEVLLSYGRQVI 79
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 202 bits (516), Expect = 6e-67 Identities = 256/261 (98%), Positives = 259/261 (99%) Query: 1 MMQVTSDQWLSWLSLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 M+QVTS+QWLSWL+LYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 Query: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPGSHL 120 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDP SHL Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120 Query: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGSEPLNSNAFLALTKAGSLIF 180 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIG EPLNSNAFLALTKAGSLIF Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180 Query: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 Query: 241 EHLFSEIFNLLADIISELPLI 261 EHLFSEIFNLLADIISELPLI Sbjct: 241 EHLFSEIFNLLADIISELPLI 261
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.045 Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 15/62 (24%) Query: 320 AKYILTPVLWKYLYRYAKKHQARGNGFGYGMVYPNNPQSVTRTLSARYYKDGAEILIDRG 379 A+Y + PVLW Y+ R+ K + G+ VY +R +DG+E RG Sbjct: 166 ARYQVGPVLWGYVVRFIK---SDGDKLTLPYVY------------SRSQRDGSEAWKWRG 210 Query: 380 WD 381 WD Sbjct: 211 WD 212
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 34.4 bits (79), Expect = 2e-04 Identities = 22/92 (23%), Positives = 36/92 (39%), Gaps = 9/92 (9%) Query: 37 AQKLAADDDVDMLVILTACYFHDIVSLAKNHPQRQRSSILAAEETRRLLREEFVQFPA-- 94 +KLA + + D+ +ILT + +L + Q + EE R+ E F A Sbjct: 219 GEKLAEEVNADIFMILTDV---NGAALYYGTEKEQWLREVKVEELRKYYEEG--HFKAGS 273 Query: 95 --EKIEAVCHAIAAHSFSAQIAPLTTEAKIVQ 124 K+ A I A IA L + ++ Sbjct: 274 MGPKVLAAIRFIEWGGERAIIAHLEKAVEALE 305
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 410 bits (1054), Expect = e-145 Identities = 199/388 (51%), Positives = 246/388 (63%), Gaps = 41/388 (10%) Query: 1 MKRKVLAMLVPALLVAGAANAAEIYNKNGNKVELYGKMVGERILTDRESGEKGDNSQDTS 60 MKRKVLA+++PALL AGAA+AAEIYNK+GNK++LYGK+ G +D S D + Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSS-----KDGDQT 55 Query: 61 YARVGVKGETQINPELTGYGQFELDLEASNRHNPDQ---TRLAYAGLSYKDFGSFDYGRN 117 Y RVG KGETQIN +LTGYGQ+E +++A+ TRLA+AGL + D+GSFDYGRN Sbjct: 56 YMRVGFKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRN 115 Query: 118 VGVAYDAEAFTDMFVEWGGDSWAGTDLFMTNRTNGVATYRNTDFFGMVEGLNFALQYQGK 177 GV YD E +TDM E+GGDS+ D +MT R NGVATYRNTDFFG+V+GLNFALQYQGK Sbjct: 116 YGVLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGK 175 Query: 178 NEGTGNY----------------KANGDGHGLSATYTID-GFSFAGAYANSDRTDWQSGD 220 NE NGDG G+S TY I GFS AY SDRT+ Q Sbjct: 176 NESQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNA 235 Query: 221 GK----GERAEVWALSTKYDANNVYAAVMYGESHNM-------NSDDGDVVNKTQNFEAV 269 G G++A+ W KYDANN+Y A MY E+ NM DG V NKTQNFE Sbjct: 236 GGTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVT 295 Query: 270 LQYQFDFGLRPSIGYSYSKALDVA----GYKDSDRLNYIEIGTWYYFNKNMNVYTAYQIN 325 QYQFDFGLRP++ + SK D+ D D + Y ++G YYFNKN + Y Y+IN Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKIN 355 Query: 326 LLDKSD-YVLAHGLNTDDQLAVGIVYQF 352 LLD D + G++TDD +A+G+VYQF Sbjct: 356 LLDDDDPFYKDAGISTDDIVALGMVYQF 383
>PF06580#Sensor histidine kinase Length = 349 Score = 37.2 bits (86), Expect = 1e-04 Identities = 35/181 (19%), Positives = 64/181 (35%), Gaps = 37/181 (20%) Query: 290 ENILFLARADKNNVLVKLDALS----------------LNKEVENLLDYL--EYLSDEKE 331 NI L D L +LS L E+ + YL + E Sbjct: 180 NNIRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDR 239 Query: 332 IRFKVECNQQIFADKI---LLQRMLSNLIVNAIRYSPEKSRIHITSFLDANGSLNIDIAS 388 ++F+ + N I ++ L+Q ++ N I + I P+ +I + D NG++ +++ + Sbjct: 240 LQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKD-NGTVTLEVEN 298 Query: 389 PGTKINEPEKLFRRFWRGDNSRHSVGQGLGLSLVKA-IAELHGGSATYHYLSKHNVFRIT 447 G+ + K G GL V+ + L+G A K Sbjct: 299 TGSLALKNTKE--------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM 344 Query: 448 L 448 + Sbjct: 345 V 345
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 84.5 bits (209), Expect = 9e-21 Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 1/117 (0%) Query: 39 KILLIEDNQRTQEWVTQGLSEAGYVIDAVSDGRDGLYLALKDDYALIILDIMLPGMDGWQ 98 IL+ +D+ + + Q LS AGY + S+ D L++ D+++P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 99 ILQTLRTA-KQTPVICLTARDSVDDRVRGLDSGANDYLVKPFSFSELLARVRAQLRQ 154 +L ++ A PV+ ++A+++ ++ + GA DYL KPF +EL+ + L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 32.6 bits (74), Expect = 0.002 Identities = 19/86 (22%), Positives = 38/86 (44%), Gaps = 8/86 (9%) Query: 2 IKKKGFTLLEVTIVL---GIGTLIAFMKFQDMRNDQEAVLADNVGTQIKQLGE--AVNRY 56 ++++GFTLLE+ ++L G+ + + F R+D A Q++ + + Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60 Query: 57 ---ISIRYDKISTLSSSNNQSSDPGP 79 +S+ D+ L +DP P Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPAP 86
>PilS_PF08805#PilS N terminal Length = 185 Score = 73.4 bits (180), Expect = 8e-19 Identities = 47/179 (26%), Positives = 84/179 (46%), Gaps = 17/179 (9%) Query: 7 KRKSKKGFSLLELLLVLGIIAALVVAAFIVYPKVQASQRAQAESNNIATIQAGVKALYTS 66 K++ KG +L+E+LLV+G+I L +A+ +Y VQ++ ++ E NN+ T+ A +K+L Sbjct: 21 KKEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANMKSLKFQ 80 Query: 67 AS-SFTGLTNTVAVQAKIFPDNMLSGSGTAAKPINAFKGNVTLAATATGPSSATGSSFTI 125 + + T+ + P +M+ T A N + G+VT+ S+ SF + Sbjct: 81 GRYTDSNYIKTL-YAQGLLPSDMI-ADTTGASAKNPWGGSVTITT------SSDKYSFNV 132 Query: 126 TYDNVPAAECVKIATAAAGNFYITTVGTKVVKAAGGTLDVAATAAACTNATSNTLVFTS 184 NVP C+ + A + + T +AA + SNTL F++ Sbjct: 133 VEANVPQKNCMAMVNA--------LRSSSAISKINNTSTSTVSAATVCASDSNTLTFST 183
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 30.0 bits (67), Expect = 0.022 Identities = 23/85 (27%), Positives = 41/85 (48%), Gaps = 1/85 (1%) Query: 184 ERIDHRSLRTQCADALAQAE-EAFSAEEKAFWLAKATETNRPAMQRVHRAKWNDTESQEQ 242 E + H + RT A LA A A AE++ LAKA E R + +A + +++ Sbjct: 100 EALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKE 159 Query: 243 RAAEQAQRDQQIEEAKKVYTTFSEL 267 E+A+ ++Q++ A+ + L Sbjct: 160 IEREKAETERQLKLAEAEEKRLAAL 184
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 27.5 bits (61), Expect = 0.006 Identities = 9/48 (18%), Positives = 17/48 (35%), Gaps = 1/48 (2%) Query: 29 VLYGTYPGWYAAVVLLLTFGLSTLIGMSTGMAGATISLPIIAVVGFIA 76 L Y W V ++L L ++G+ + +VG + Sbjct: 886 CLAALYESWSIPVSVMLVVPL-GIVGVLLAATLFNQKNDVYFMVGLLT 932
>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature. Length = 574 Score = 30.3 bits (68), Expect = 0.001 Identities = 13/53 (24%), Positives = 22/53 (41%), Gaps = 12/53 (22%) Query: 67 WFFTWKD------TGIQ-PGTAFVSSVVAGICFGVLMAAYHWWRKVVN--NLP 110 W W T I + ++A C G+ A+ WWRKV++ ++ Sbjct: 502 WDNNWYSKTSPFSTVIPLGANSRNIRIMARECTGL---AWEWWRKVIDERDVK 551
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 25.1 bits (55), Expect = 0.034 Identities = 8/25 (32%), Positives = 14/25 (56%) Query: 35 SVVNERREEYYQEIGEKKAHKLKMK 59 +++N R Y IGE A ++K + Sbjct: 197 AIINYVRRNYGSLIGEATAERIKHE 221
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 51.2 bits (122), Expect = 2e-08 Identities = 22/70 (31%), Positives = 44/70 (62%) Query: 22 QQLRERLIQELNLTPQQLHEESNLIQAGLDSIRLMRWLHWFRKNGYRLTLRELYAAPTLA 81 + +R+++ + L TP+ + ++ +L+ GLDS+R+M + +R+ G +T EL PT+ Sbjct: 233 ENIRKQIAELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIE 292 Query: 82 AWNQLMLSRS 91 W +L+ +RS Sbjct: 293 EWQKLLTTRS 302
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 45.8 bits (108), Expect = 1e-06 Identities = 32/156 (20%), Positives = 55/156 (35%), Gaps = 19/156 (12%) Query: 1561 LVTGAFGGLGRLAVNWLREKGARRIALLAPRVDESWLRDVEGGQTRVCR------CDVGD 1614 +TGA G+G L +GA + A + L V R DV D Sbjct: 12 FITGAAQGIGEAVARTLASQGAH---IAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 1615 AGQLATVLDDLAAN-GGIAGAIHAAGVLADAPLQELDDHQLAAVFAVKAQAASQLLQTLR 1673 + + + + G I ++ AGVL + L D + A F+V + +++ Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 1674 NH-----DGRYLILYSSAAAT----LGAPGQSAHAL 1700 + G + + S+ A + A S A Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAA 164
>INTIMIN#Intimin signature. Length = 939 Score = 74.7 bits (183), Expect = 2e-17 Identities = 20/60 (33%), Positives = 29/60 (48%), Gaps = 3/60 (5%) Query: 162 QQIASTSQLIGSLLAEDMNSEQAANIARGWASSQASGVMTDWLSRFGTARITLGVDEDFS 221 QQ AS + S +N + A + A G A +QAS + WL +GTA + L +F Sbjct: 168 QQAASLGSQLQS---RSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD 224
>INTIMIN#Intimin signature. Length = 939 Score = 55.8 bits (134), Expect = 3e-10 Identities = 62/263 (23%), Positives = 91/263 (34%), Gaps = 20/263 (7%) Query: 175 IAVKAHVNDQFGNPVTHQPATFSAAPSSQMIISQNTVSTNTQGVAEVTMTPERNGSYTVK 234 I A V G + P +F+ S ++S N+ +TN G A VT+ ++ G V Sbjct: 578 ITYTATVKKN-GVAQANVPVSFNIV-SGTAVLSANSANTNGSGKATVTLKSDKPGQVVVS 635 Query: 235 ASLANGASLEKQLEAI---DEKLTLTSSPLIGVNAPKGATLTATLT---SANGTPVEGQV 288 A A S I K ++T A T T PV Q Sbjct: 636 AKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQE 695 Query: 289 INFSVTLEGATLSGGKVRTNSSGQAPVVLTSNKVGTYTVTASFHNGVTIQTQTTVKVTGN 348 + F+ TL LS +T+++G A V LTS G V+A + V+ Sbjct: 696 VTFTTTL--GKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTT 753 Query: 349 PSTAHVASFIADPSTIAATNSDLSTLKATVEDGSGNL-IEGLTVYFALKSGSTTLTSLTA 407 + I T ++ G NL G + +S + + S Sbjct: 754 LTID------DGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIAS--- 804 Query: 408 VTDQNGIATTSVKGEITGSVTVS 430 V +G T KG T SV S Sbjct: 805 VDASSGQVTLKEKGTTTISVISS 827 Score = 52.4 bits (125), Expect = 3e-09 Identities = 46/170 (27%), Positives = 65/170 (38%), Gaps = 7/170 (4%) Query: 271 TLTATLTSANGTPVEGQVINFSVTLEGATLSGGKVRTNSSGQAPVVLTSNKVGTYTVTAS 330 T TAT+ NG ++F++ A LS TN SG+A V L S+K G V+A Sbjct: 579 TYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAK 637 Query: 331 FHNGV-TIQTQTTVKVTGNPSTAHVASFIADPSTIAATNSDLSTLKATVEDGSGNLIEGL 389 + + V + A + AD +T A D T V G + Sbjct: 638 TAEMTSALNANAVIFVDQ--TKASITEIKADKTTAVANGQDAITYTVKVMKG-DKPVSNQ 694 Query: 390 TVYFALKSGSTTLTSLTAVTDQNGIATTSVKGEITGSVTVSAVTSAGGMQ 439 V F G + + T TD NG A ++ G VSA S + Sbjct: 695 EVTFTTTLGKLSNS--TEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVD 742 Score = 51.2 bits (122), Expect = 7e-09 Identities = 51/233 (21%), Positives = 89/233 (38%), Gaps = 16/233 (6%) Query: 13 AVTDADGKAKVTLKGTKAGAHTVTASMVGGKS--EQLVVNFTADTLTAQVNLNVTEDNFI 70 A T+ GKA VTLK K G V+A S V F T + + + + Sbjct: 612 ANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAV 671 Query: 71 
ANNIGMTRLQATVTDGNGNPVEGIKVNFRGTSVTLSSTSVETDDQVFAEILVTSTEVGLK 130 AN V PV +V F T LS+++ +TD +A++ +TST G Sbjct: 672 ANGQDAITYTVKVMK-GDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKS 730 Query: 131 TVSASLADKPTEVISRLLN----AKVDVNSATI----TSQEIPEGQVMVAQDIAVKAHVN 182 VSA ++D +V + + +D + I ++P + Q + N Sbjct: 731 LVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGN 790 Query: 183 DQFGNPVTHQPATFSAAPSSQMIISQNTVSTNTQGVAEVTMTPERNGSYTVKA 235 ++ + A S Q+ + + +T + V + + +YT+ Sbjct: 791 GKYTWRSANPAIASVDASSGQVTLKEKGTTTIS-----VISSDNQTATYTIAT 838 Score = 40.1 bits (93), Expect = 2e-05 Identities = 35/213 (16%), Positives = 63/213 (29%), Gaps = 18/213 (8%) Query: 4 NFTLSDGDKAVTDADGKAKVTLKGTKAGAHTVTASMVGGKSE--QLVVNFTADTLTAQVN 61 TD +G AKVTL T G V+A + + V F N Sbjct: 701 TLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGN 760 Query: 62 LNVTEDNFIANNIGMTRLQATVTDGNGN-PVEGIKVNFRGTSVTLSSTSVETDDQVFAEI 120 + + + + + G N G + S + SV+ Sbjct: 761 IEI-----VGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQ---- 811 Query: 121 LVTSTEVGLKTVSASLADKPTEVISRLLNAKVDVNSATITSQEIPEGQVMVAQDIAVKAH 180 VT E G T+S +D T + + + ++ + V ++ Sbjct: 812 -VTLKEKGTTTISVISSDNQT--ATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGG--- 865 Query: 181 VNDQFGNPVTHQPATFSAAPSSQMIISQNTVST 213 N + + + AA + S T+ + Sbjct: 866 KLPSSQNELENVFKAWGAANKYEYYKSSQTIIS 898
>INTIMIN#Intimin signature. Length = 939 Score = 27.7 bits (61), Expect = 0.022 Identities = 22/129 (17%), Positives = 46/129 (35%), Gaps = 6/129 (4%) Query: 11 KISAIDYSQNINGDYKATVTGGGEGIATLIPVLNGVHQAGLSTTIEFISAETRPMTGTVS 70 K+S + NG K T+T G + + ++ V + +EF G + Sbjct: 704 KLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFF-TTLTIDDGNIE 762 Query: 71 VNSANLPTASFPSQGFTGAYYQLNNDNFAPGKTAADYSFSSSASWVGVDATGKVTFKNDG 130 + + P+ L + G + ++ A ++G+VT K G Sbjct: 763 IVGTGV-KGKLPTVWLQYGQVNL---KASGGNGKYTWRSANPAIASVDASSGQVTLKEKG 818 Query: 131 DSNTVIITA 139 + T+ + + Sbjct: 819 -TTTISVIS 826
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.3 bits (76), Expect = 0.002 Identities = 39/259 (15%), Positives = 96/259 (37%), Gaps = 18/259 (6%) Query: 79 LGGVIFGHFGDRLGRKRMLMLTVWMMGIATALIGILPSFSTIGWWAPILLVTLRAIQGFA 138 +G ++G D+LG KR+L+ + + + + + SF ++ I+ ++ A Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSL----LIMARFIQGAGAAA 119 Query: 139 VGGEWGGAALLSVESAPKNKK-AFYSSGVQVGYGVGLLLSTGLVSLISMMTTDEQFLSWG 197 + + K S V +G GVG + + I Sbjct: 120 FPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI------------H 167 Query: 198 WRIPFLFSIVLVLGALWVRNGMEESAEFEQQQHNQAAAKKRIPVIEALLRHPGAFLKIIA 257 W L ++ ++ ++ +++ + + + ++ +L + + Sbjct: 168 WSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLI 227 Query: 258 LRLCELLTMYIVTAFALNYSTQNMGLPRELFLNIGLLVGGLSCLTIPCFAWLADRFGRRR 317 + + L +++ + + GL + + IG+L GG+ T+ F + + Sbjct: 228 VSVLSFL-IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDV 286 Query: 318 VYITGALIGTLSAFPFFMA 336 ++ A IG++ FP M+ Sbjct: 287 HQLSTAEIGSVIIFPGTMS 305
>BICOMPNTOXIN#Staphylococcal bi-component toxin signature. Length = 315 Score = 33.3 bits (76), Expect = 0.002 Identities = 6/41 (14%), Positives = 16/41 (39%) Query: 303 LAADNRILYASGWFIDQNQGPYISHGGQNPNFSSCIALRPD 343 + +F+ ++ P + G NP+F + ++ Sbjct: 210 VGYKPHSKDPRDYFVPDSELPPLVQSGFNPSFIATVSHEKG 250
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 42.3 bits (99), Expect = 9e-06 Identities = 20/87 (22%), Positives = 40/87 (45%), Gaps = 3/87 (3%) Query: 927 DVRQMVATVRNTAPASGSER-LGDAAIRHSVRVCVEGALEQTEFDDNENLYVLGLDSIKS 985 ++ A V+ T+ +G + IR + ++ E + D E+L GLDS++ Sbjct: 209 QLQNAPADVQKTSANTGKKNVFTCENIRKQIAELLQETPE--DITDQEDLLDRGLDSVRI 266 Query: 986 IQIAAQLRHHGWTMSAVQVMECGTVNA 1012 + + Q R G ++ V++ E T+ Sbjct: 267 MTLVEQWRREGAEVTFVELAERPTIEE 293
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 51.2 bits (122), Expect = 2e-08 Identities = 32/167 (19%), Positives = 58/167 (34%), Gaps = 7/167 (4%) Query: 2188 IPGNVLWIIGGEKGIGRMIGEALAQREGVRVVLSSRTGYHHEAVQQDAL------DVIHC 2241 I G + +I G +GIG + LA +G + E V + Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLAS-QGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64 Query: 2242 DVTQAEAVRACLATLLERYGRLDGVIFAADATTTLTLHQLSESALRDTLTVKERGTANVL 2301 DV + A+ A + G +D ++ A +H LS+ T +V G N Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124 Query: 2302 HALAQRNLLDERLLLLFCNSLAAVNAEIGQTGYATASAYLDALAQQL 2348 ++++ + ++ S A YA++ A + L Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCL 171
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 29.5 bits (66), Expect = 0.010 Identities = 13/39 (33%), Positives = 20/39 (51%) Query: 48 EKNVQIADQVIIDESAGEVVIGANTRICHGAVIQGPVVI 86 +V+I+E G +VIGA+ RI AV G + + Sbjct: 254 TVETDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTV 292
>PF05272#Virulence-associated E family protein Length = 892 Score = 32.7 bits (74), Expect = 0.002 Identities = 12/22 (54%), Positives = 13/22 (59%) Query: 32 VTVLLGPNGCGKSTLLRALAGL 53 VL G G GKSTL+ L GL Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 174 bits (444), Expect = 3e-54 Identities = 74/351 (21%), Positives = 143/351 (40%), Gaps = 40/351 (11%) Query: 1 MNILVTGGAGYIGSHTAIELLNAGHEIIVLDNFSNASYKCIEK---IKEITRRDFITITG 57 M LVTG AG+IG H + LL AGH+++ +DN N Y K ++ + + F Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNL-NDYYDVSLKQARLELLAQPGFQFHKI 59 Query: 58 DAGCRKTLSAIFEKHAIDIVIHFAGFKSVSESKSEPLKYYQNNVGVTITLLQVMEEYRIK 117 D R+ ++ +F + V +V S P Y +N+ + +L+ +I+ Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119 Query: 118 KFIFSSSATVYGEPEIIPIPETAKIGGTTNPYGTSKYFVEKILEDVSSTGKLDIICLRYF 177 +++SS++VYG +P + + Y +K E + S L LR+F Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179 Query: 178 NPVGAHSSGKIGEAPSGIPNNLVPYLL--DVASGKRDKLFIYGNDYPTNDGTGVRDFIHV 235 G P G P ++ + + GK ++ N G RDF ++ Sbjct: 180 TVYG----------PWGRP-DMALFKFTKAMLEGKSIDVY--------NYGKMKRDFTYI 220 Query: 236 VDLAKGHLAAMNYL---------------SINSGYNIFNLGTGKGYSVLELITTFEKLTN 280 D+A+ + + + + + Y ++N+G +++ I E Sbjct: 221 DDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG 280 Query: 281 IKVNKSFIERRAGDVASCWADADKANSLLDWQAEQTLEQMLLDSWRWKKNY 331 I+ K+ + + GDV AD ++ + E T++ + + W +++ Sbjct: 281 IEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 48.7 bits (116), Expect = 2e-08 Identities = 48/369 (13%), Positives = 106/369 (28%), Gaps = 87/369 (23%) Query: 4 SYKSRWVIVIVVVIAAIAAFWFWQGRNDSQSAAPG-----ATKQAQQSPAGGR------- 51 S + R V ++ IA G+ + + A G + + Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113 Query: 52 --RGMRAG-PLA---PVQAATAVEQAVPRYLTGLGTITAANTVTVRSRVDG--QLMALHF 103 +R G L + A + L T ++ ++ +L Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173 Query: 104 QEGQQVKAGDLLAEI------------DPSQFKVALAQAQGQLA-------KDKATLANA 144 Q V ++L Q ++ L + + + + + Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 145 RRDLARYQQLAKTNLVSRQELDAQQALVSETEGTIKADEASVA----------------- 187 + L + L +++ + Q+ E ++ ++ + Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 188 --------------------------SAQLQLDWSRITAPVDGRV-GLKQVDVGNQISSG 220 + + S I APV +V LK G +++ Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 221 DTTGIVVITQTHPIDLVFTLPESDIATVVQAQKAGKPLVVEAWDRTNSKKL-SEGTLLSL 279 +T +V++ + +++ + DI + Q A + VEA+ T L + ++L Sbjct: 354 ETL-MVIVPEDDTLEVTALVQNKDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINL 410 Query: 280 DNQIDATTG 288 D D G Sbjct: 411 DAIEDQRLG 419
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 919 bits (2376), Expect = 0.0 Identities = 300/1036 (28%), Positives = 513/1036 (49%), Gaps = 29/1036 (2%) Query: 13 SRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAV 72 + FI RP+ +L + +++AG + LPV+ P + P + V YPGA + V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131 T +E+ + L MSS S S G+ ITL FQ D+A+ +VQ + AT LLP + Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 132 LPNPPVYSKVNPADPPIMTLAVTSTAMPMTQVE--DMVETRVAQKISQISGVGLVTLSGG 189 + + S + +M S TQ + D V + V +S+++GVG V L G Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 190 QRPAVRVKLNAQAIAALGLTSESVRTAITGANVNSAKGSLDGP------SRAVTLSANDQ 243 Q A+R+ L+A + LT V + N A G L G ++ A + Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 244 MQSAEEYRQLII-AYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANI 302 ++ EE+ ++ + +G+ +RL DVA VE G EN + A N + A + ++ GAN Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 303 ISTADSIRQMLPQLTESLPKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFL 362 + TA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 363 RNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422 +N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAIL 481 + E P A K +I ++ + L AV IP+ F G G ++R+F+IT+ A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 482 ISAVVSLTLTPMMCARML---SQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWL 538 +S +V+L LTP +CA +L S E + F FD + Y + K+L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 539 
TLSVALSTLLLSVLLWVFIPKGFFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQ 598 L + + V+L++ +P F P +D G+ +Q P ++ + QV D L+ Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 599 DPA--VQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDR---VQKVIARLQTAVDKVPG 653 + V+S+ + G + + N+ ++LKP +ER+ + VI R + + K+ Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR- 658 Query: 654 VDLFLQPTQDLTIDTQVSRTQYQFTLQ---ATSLDALSTWVPQLMEKLQQLP-QLSDVSS 709 D F+ P I + T + F L DAL+ QL+ Q P L V Sbjct: 659 -DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 710 DWQDKGLVAYVNVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTE 769 + + + VD++ A LG+S++D++ + A G ++ + ++ ++ + + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 770 NTPGLAALDTIRLTSSDGGVVPLSSIAKIEQRFAPLSINHLDQFPVTTISFNVPDNYSLG 829 +D + + S++G +VP S+ + + + P I S G Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 830 DAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIVLGILYESFI 889 DA A+M+ + LP I + G + + + L+ + V +++ L LYES+ Sbjct: 838 DA-MALMENLAS-KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 890 HPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949 P++++ +P VG LLA + + DV ++G++ IG+ KNAI++++FA ++ Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 950 GMSPREAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQV 1009 G EA A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + + Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015 Query: 1010 LTLFTTPVIYLLFDRL 1025 L +F PV +++ R Sbjct: 1016 LAIFFVPVFFVVIRRC 1031
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 916 bits (2368), Expect = 0.0 Identities = 288/1035 (27%), Positives = 503/1035 (48%), Gaps = 36/1035 (3%) Query: 6 LFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65 FI RP+ +L++ + + G L LPVA P + P + VSA+ PGA +T+ +V Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLERSLGRIAGVSEMTSSS-SLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMP 124 +E+++ I + M+S+S S GS I L F D + A VQ + A LLP + Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGELYDFASTQLAPTISQIDGVGDVDVGGSSL 182 + S + +M+ SD +Q ++ D+ ++ + T+S+++GVGDV + G+ Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182 Query: 183 PAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQG------ALEDDTHRWQIQTNDELK 236 A+R+ L+ L ++ DV + N + G AL I K Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 237 TAAEYQPLIIHYN-NGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295 E+ + + N +G VRL DVA V ++ N KPA L I+ AN + Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 296 TVDSIRARLPELQSTIPAAIDLQIAQDRSPTIRASLEEVEQTLIISVALVILVVFLFLRS 355 T +I+A+L ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++ Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 356 GRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414 RAT+IP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R + Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 415 EAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474 E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 475 LLVSLTLTPMMCGWMLKASKPREQKRLRGFG----RMLVALQQGYGKSLKWVLNHTRLVG 530 +LV+L LTP +C +LK + GF Y S+ +L T Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 531 
AVLLGTIALNIWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586 + +A + L++ +P +F PE+D GV + IQ + + + ++K Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 587 RD-DPAVDNVTGFT-GGSRVNSGMMFITLKPRGERS---ETAQQIIDRLRKKLAKEPGAN 641 + +V V GF+ G N+GM F++LKP ER+ +A+ +I R + +L K Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 642 LFLMAVQDIRVGGRQANASYQYTLLSDDLAALREWEPKIRKKLATL-----PELADVNSD 696 + + I G ++ L D + + R +L + L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 697 QEDNGAEMNLIYDRDTMARLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRY 756 ++ A+ L D++ LG+ + N ++ A G ++ K+ ++ D ++ Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 757 TQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSAASTISFNLPTGKSLSD 816 ++K++V + G+ +P S F + + I G S D Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838 Query: 817 ASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVH 876 A A ++ ++L P+ + + G + + + N L+ + V++ L LYES+ Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 877 PLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGN 936 P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V+FA + Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 937 LTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996 EA A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 997 TLYTTPVVYLFFDRL 1011 ++ PV ++ R Sbjct: 1017 AIFFVPVFFVVIRRC 1031 Score = 80.7 bits (199), Expect = 2e-17 Identities = 76/448 (16%), Positives = 161/448 (35%), Gaps = 26/448 (5%) Query: 592 VDNVTGFTGGS-RVNSGMMFITLKPRGERSETAQQIIDRLRKKLAKEPGANLFLMAVQDI 650 +DN+ + S S + +T + + Q+ ++L+ P + Q I Sbjct: 72 IDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE----VQQQGI 127 Query: 651 RVGGRQANASYQYTLLSDDLAALREW-----EPKIRKKLATLPELADVNSDQEDNGAE-- 703 V ++ +SD+ 
++ ++ L+ L + DV GA+ Sbjct: 128 SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL----FGAQYA 183 Query: 704 MNLIYDRDTMARLGID----VQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQD 759 M + D D + + + + + + T P Q + R+ Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243 Query: 760 ISALEKMFVINNEGKAIPLSYFAK--WQPANAPLSVNHQGLSAASTISFNLPTGKSLSDA 817 + +N++G + L A+ N + G AA +L D Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL-DT 302 Query: 818 SAAIDRAMTQL--GVPSTVRGSFA-GTAQVFQETMNSQVILIIAAIATVYIVLGILYESY 874 + AI + +L P ++ + T Q +++ V + AI V++V+ + ++ Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362 Query: 875 VHPLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRH 934 L +P +G L F + + + G++L IG++ +AI++V+ Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422 Query: 935 GNLTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQ 994 L P+EA ++ ++ + +P+ GG + + ITIV + +S Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482 Query: 995 LLTLYTTPVVYLFFDRLRLRFSRKPKQA 1022 L+ L TP + + + K Sbjct: 483 LVALILTPALCATLLKPVSAEHHENKGG 510
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 123 bits (310), Expect = 7e-33 Identities = 97/435 (22%), Positives = 190/435 (43%), Gaps = 25/435 (5%) Query: 20 FMQSLDTTIVNTALPSMAQSLGESPLHMHMVIVSYVLTVAVMLPASGWLADKVGVRNIFF 79 F L+ ++N +LP +A + P + V +++LT ++ G L+D++G++ + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 80 TAIVLFTLGSLFCALSGTLNELL-LARALQGVGGAMMVPVGRLTVMKIVPREQYMAAMTF 138 I++ GS+ + + LL +AR +QG G A + + V + +P+E A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 139 VTLPGQVGPLLGPALGGLLVEYASWHWIFLINIPVGIIGAIATLM-LMPNYTMQTRRFDL 197 + +G +GPA+GG++ Y HW +L+ IP+ I + LM L+ FD+ Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201 Query: 198 SGFLLLAVGMAVLTLALDGSKGTGLSPLAIAGLVAVGVVALVLYLLHAQNNNRALFSLKL 257 G +L++VG+ L + + V V++ ++++ H + L Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252 Query: 258 FRTRTFSLGLAGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316 + F +G+ M P ++ S G +++ P + + I Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312 Query: 317 VVQVVNRFGYRRVLVATTLGLSLVTLLFMTTALL----GWYYVLPFVLFLQGMVNSTRFS 372 +V+R G VL +G++ +++ F+T + L W+ + V L G+ S + Sbjct: 313 GGILVDRRGPLYVL---NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKT 367 Query: 373 SMNTLTLKDLPDNLASSGNSLLSMIMQLSMSIGVTIAGLLLGLFGSQHVSVDSGTTQTVF 432 ++T+ L A +G SLL+ LS G+ I G LL + + Q+ + Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTY 427 Query: 433 MYT--WLSMASIIAL 445 +Y+ L + II + Sbjct: 428 LYSNLLLLFSGIIVI 442
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 34.0 bits (78), Expect = 0.001 Identities = 28/95 (29%), Positives = 36/95 (37%), Gaps = 20/95 (21%) Query: 164 RQTSWLIVALSTLLAALATF------PLARGLLAPVKRLVDGTHKLAAGDFTTRVAPTSE 217 RQ + L+ A L AL P L+A V+ V H LA + P S Sbjct: 75 RQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD---AMKCFPGSF 131 Query: 218 DEL-----------GRLAEDFNQLASTLEKNQQMR 241 + L G L N+LA E+ QQMR Sbjct: 132 ERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMR 166
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.6 bits (186), Expect = 6e-18 Identities = 28/136 (20%), Positives = 65/136 (47%), Gaps = 1/136 (0%) Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLISHGDQVLPYVRQTPPDLILLDLMLPGTDGL 70 IL+ +D+ + +L L A Y + S+ + ++ DL++ D+++P + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 71 TLCREIR-RFSDIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTILRRCK 129 L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L K Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 130 PQRELQQQDAESPLII 145 + + D++ + + Sbjct: 124 RRPSKLEDDSQDGMPL 139
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 31.9 bits (72), Expect = 0.002 Identities = 21/92 (22%), Positives = 35/92 (38%), Gaps = 2/92 (2%) Query: 125 AQGCENKNVIIIGAGT-IGLLAIQCAVALGAKSVTAIDISSEKLALAKSFGAMQTFNSRE 183 A+G E K I GA IG + + GA + A+D + EKL S + ++ Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAH-IAAVDYNPEKLEKVVSSLKAEARHAEA 61 Query: 184 MSAPQIQGVLRELRFNQLILETAGVPQTVELA 215 A + ++ E + V +A Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVA 93
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 27.9 bits (62), Expect = 0.007 Identities = 13/42 (30%), Positives = 21/42 (50%), Gaps = 1/42 (2%) Query: 6 KMLLGALLLVTSAAWAAPATAGSTNTSGISKYE-LSSFIADF 46 ++L G LLL++S +WA ++K E L + DF Sbjct: 11 RVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDF 52
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 28.5 bits (63), Expect = 0.045 Identities = 18/79 (22%), Positives = 34/79 (43%), Gaps = 8/79 (10%) Query: 93 NITLSNNQ---TSFTSGYSVTVTPAASNAKVNVSAGGGGSVMINGVATLSSA-----SSS 144 NI LS N+ T T + T++ S ++ + S G + + + + S+S Sbjct: 297 NIILSKNEDQSTQNTDSQTRTISKNTSTSRTHTSEVHGNAEVHASFFDIGGSVSAGFSNS 356 Query: 145 TRGSAAVQFLLCLLGGKSW 163 + A+ L L G ++W Sbjct: 357 NSSTVAIDHSLSLAGERTW 375
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 725 bits (1873), Expect = 0.0 Identities = 243/843 (28%), Positives = 391/843 (46%), Gaps = 35/843 (4%) Query: 2 LRMTPLASAI---VALLIGIEAYAAEETFDTHFMIGGMKDQQVSNIRL--EDNQPLPGQY 56 R+ + A +AE F+ F+ Q V+++ + PG Y Sbjct: 21 HRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADD--PQAVADLSRFENGQELPPGTY 78 Query: 57 DIDIYVNKQWRGKYEIIVKDNPQET----CLSREMIKRLGINTD-----NFASGKQCLTF 107 +DIY+N + ++ E CL+R + +G+NT N + C+ Sbjct: 79 RVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPL 138 Query: 108 KQLIQGGSYTWDIGVFRLDFSVPQAWVEELESGYVPPENWERGINAFYTSYYVSQYYSDY 167 +I + D+G RL+ ++PQA++ GY+PPE W+ GINA +Y S Sbjct: 139 TSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQN 198 Query: 168 KASGNSKSTYVRFNSGLNLLGWQLHSDASFSKTNNNPGG-----WKSNTLYLERGFAQLL 222 + GNS Y+ SGLN+ W+L + ++S +++ W+ +LER L Sbjct: 199 RIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLR 258 Query: 223 GTLRVGDMYTSSDIFDSVRFSGVRLFRDMQMLPNSKQNFTPRVQGIAQSNALVTIEQNGF 282 L +GD YT DIFD + F G +L D MLP+S++ F P + GIA+ A VTI+QNG+ Sbjct: 259 SRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGY 318 Query: 283 VVYQKEVPPGPFAITDLQLAGGGADLDVSVKEADGSVTTYLVPYAAVPNMLQPGVSKYDF 342 +Y VPPGPF I D+ AG DL V++KEADGS + VPY++VP + + G ++Y Sbjct: 319 DIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSI 378 Query: 343 AAGRSHIEGASKQSD-FVQAGHQYGFNNLLTLYGGSMVANNYYAFTLGTGWNT-RIGAIS 400 AG A ++ F Q+ +G T+YGG+ +A+ Y AF G G N +GA+S Sbjct: 379 TAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALS 438 Query: 401 VDATKSHSKQDNGDVFDGQSYQIAYNKFVSQTSTRFGLAAWRYSSRDYRTFNDHVWANNK 460 VD T+++S + DGQS + YNK ++++ T L +RYS+ Y F D ++ Sbjct: 439 VDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMN 498 Query: 461 DNYRRDENDIYDI----ADYYQNDFGRKNSFSANMSQSLPEGWGSVSLSTLWRDYWGRSG 516 ++ + + DYY + ++ ++Q L ++ LS + YWG S Sbjct: 499 GYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSN 557 Query: 517 SSKDYQLSYSNNLRRISYTLAASHAYDENHHE-EKRFNIFISIPFD--WGDDVTTPRRQI 573 + +Q + I++TL+ S + 
++ + ++IPF D + R Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHA 617 Query: 574 YMSKSTTFDDQGVASNNTGLSGTVGSRDQFNYGVNLSYQYQGN---ETTAGANLTWNAPV 630 S S + D G +N G+ GT+ + +Y V Y G+ +T A L + Sbjct: 618 SASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGY 677 Query: 631 ATVNGSYSQSSAYRQAGASVSGGIVAWSGGVNLANRLSETFAVMNAPGIKDAYVNGQKYR 690 N YS S +Q VSGG++A + GV L L++T ++ APG KDA V Q Sbjct: 678 GNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGV 737 Query: 691 TTNRNGVVVYDGMTPYRENYLMLDVSQSDSEAELRGNRKIAAPYRGAVVLVNFDTDQRKP 750 T+ G V T YREN + LD + +L P RGA+V F + Sbjct: 738 RTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKA-RVGI 796 Query: 751 WFIKALRADGQPLTFGYEVNDIHGHNIGVVGQGSQLFIRTNEVPPSVNVAIDKQQGLSCT 810 + L + +PL FG V + G+V Q+++ + V V +++ C Sbjct: 797 KLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCV 856 Query: 811 ITF 813 + Sbjct: 857 ANY 859
>PERTACTIN#Pertactin signature. Length = 922 Score = 27.0 bits (59), Expect = 0.025 Identities = 15/43 (34%), Positives = 26/43 (60%), Gaps = 2/43 (4%) Query: 40 VFAVIEKGGLLEV--KATGDFKIFVTDTGASPAAGDNLTLVTT 80 VFA + L V A+G +++V ++G+ PA+G+ + LV T Sbjct: 484 VFADLGLSDKLVVMRDASGQHRLWVRNSGSEPASGNTMLLVQT 526
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 64.5 bits (157), Expect = 2e-14 Identities = 22/113 (19%), Positives = 48/113 (42%), Gaps = 2/113 (1%) Query: 9 VMIVDDHPLMRRGVRQLLELDSGFEVVAEAGDGASAIDLANRLDIDVILLDLNMKGMSGL 68 +++ DD +R + Q L +G++V + A+ D D+++ D+ M + Sbjct: 6 ILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 69 DTLNALRRDGVTAQIIILTVSDASSDVFALIDAGADGYLLKDSDPEVLLEAIR 121 D L +++ +++++ + + GA YL K D L+ I Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 28.0 bits (62), Expect = 0.033 Identities = 7/47 (14%), Positives = 19/47 (40%), Gaps = 2/47 (4%) Query: 3 RKVLLIPLIIFLAIAAALLWQLARN--AEGDDPTNLESALIGKPVPK 47 ++ LL+ ++F++ W+ +N + T + G + Sbjct: 4 QRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQ 50
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 28.0 bits (62), Expect = 0.007 Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 1/31 (3%) Query: 34 KHIVLWLGLALACLGLAMVLWLLVL-QNVPV 63 + I+ +L + L C LAM+ W + L N PV Sbjct: 15 RRILFYLLMLLFCQQLAMIFWRIGLPDNAPV 45
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 29.7 bits (67), Expect = 0.023 Identities = 32/192 (16%), Positives = 58/192 (30%), Gaps = 37/192 (19%) Query: 268 GYGLTEFASTVCAKEADGLADVGSPL----PGREVKIVNDEVWLRAASMAEGYWRNGQRV 323 G+G+ S + A + L ++ + G + I+ E + A + + R+ Sbjct: 40 GHGIERIWSAIGATDGFALLNLEEAITLRERGWKGPILMLEGFFHAQDLEIY---DQHRL 96 Query: 324 PLVNDEGWYATRDRGEMHNGKLTI-------VGRLDNLFFSGGEGIQPEEVERVIAAHPA 376 W + L I + RL G QP+ V V A Sbjct: 97 TTCVHSNWQLKALQNARLKAPLDIYLKVNSGMNRL---------GFQPDRVLTVWQQLRA 147 Query: 377 VLQVFIVPVADKEFGHRPVAVVEYDQQTVDLDEWVKDKLARFQQPVRWLTLPPELKNGGI 436 + V + + H A + + + +AR +Q L L N Sbjct: 148 MANVGEMTL----MSHFAEA---------EHPDGISGAMARIEQAAEGLECRRSLSNSAA 194 Query: 437 KISRQALK-EWV 447 + +WV Sbjct: 195 TLWHPEAHFDWV 206
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 121 bits (306), Expect = 4e-32 Identities = 92/404 (22%), Positives = 167/404 (41%), Gaps = 17/404 (4%) Query: 19 VTIALSLATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAIAIPVTGRLAQ 78 + I L + +F +L+ + NV++P I+ WV T+F + +I V G+L+ Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 79 RIGELRLFLLSVTFFSLSSLMCSLS-TNLDVLIFFRVVQGLMAGPLIPLSQSLLLRNYPP 137 ++G RL L + S++ + + +LI R +QG A L ++ R P Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 138 EKRTFALALWSMTVIIAPICGPILGGYICDNFSWGWIFLINVPMGIIVLTLCLTLLKGRE 197 E R A L V + GP +GG I W +L+ +PM I+ L L +E Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKE 192 Query: 198 TETSPVKMNLPGLTLLVLGVGGLQIMLDKGRDLDWFNSSTIIILTVVSVISLISLVIWES 257 ++ G+ L+ +G+ + ML F +S I +VSV+S + V Sbjct: 193 VRIKG-HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241 Query: 258 TSENPILDLSLFKSRNFTIGIVSITCAYLFYSGAIVLMPQLLQETMGYNAIWAGLAYAPI 317 +P +D L K+ F IG++ + +G + ++P ++++ + G Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301 Query: 318 GIMPLLIS-PLIGRYGNKIDMRLLVTFSFLMYAVCYYWRSVTFMPTIDFTGIILPQFFQG 376 G M ++I + G ++ ++ +V + S T F II+ G Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361 Query: 377 FAVACFFLPLTTISFSGLPDNKFANASSMSNFFRTLSGSVGTSL 420 + ++TI S L + S+ NF LS G ++ Sbjct: 362 LSFTK--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 74.1 bits (182), Expect = 1e-16 Identities = 47/277 (16%), Positives = 94/277 (33%), Gaps = 46/277 (16%) Query: 56 AKNNLANIVRQTNKLYLQDKQYSAEVASARIQ---YQQSLEDYNRRV----PLAKQGVIS 108 K + Q + L + AE + + Y+ R+ L + I+ Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA 250 Query: 109 KEALEHTKDTLI----------SSKAALNAAIQAYKANKALVMNTPLNRQPQVIEAADAT 158 K A+ ++ + S + + I + K + T L + + + T Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEE--YQLVTQLFKNEILDKLRQTT 308 Query: 159 KE----------AWLALKRTDIKSPVTGYIAQRSVQ-VGETVSPGQSLMAVVPARQ-MWV 206 + + I++PV+ + Q V G V+ ++LM +VP + V Sbjct: 309 DNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEV 368 Query: 207 NANFKETQLTDVRIGQSVNIISDLYGENVVFHGRVTGINMGTGNAFSLLPAQNATGNWIK 266 A + + + +GQ+ I + F G +G + + Sbjct: 369 TALVQNKDIGFINVGQNAIIKVE------AFPYTRYGYLVGK---VKNINLDAIEDQRLG 419 Query: 267 IVQRVPVEVSLDPKELMEH----PLRIGLSMTATIDT 299 +V V +S++ L PL G+++TA I T Sbjct: 420 LVFNVI--ISIEENCLSTGNKNIPLSSGMAVTAEIKT 454
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 49.1 bits (117), Expect = 3e-09 Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 31/148 (20%) Query: 4 IIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGIQV 63 ++ DD + L + ++ + + + + D+V+ DV +P N + Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 64 LETLRKRQYSGIIIIVSAKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYF 123 L ++K + ++++SA+N + AI+A++ G + Sbjct: 66 LPRIKKARPDLPVLVMSAQNT------------------------FMTAIKASEKGAYDY 101 Query: 124 ---PFSLNRFVGSLTSDQQKLDSLSKQE 148 PF L + + L ++ Sbjct: 102 LPKPFDLTE---LIGIIGRALAEPKRRP 126
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.1 bits (195), Expect = 4e-17 Identities = 30/105 (28%), Positives = 51/105 (48%) Query: 960 SILIADDHPTNRLLLKRQLNLLGYDVDEATDGVQALHKVSMQHYDLLITDVNMPNMDGFE 1019 +IL+ADD R +L + L+ GYDV ++ ++ DL++TDV MP+ + F+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 1020 LARKLREQNSSLPIWGLTANAQANEREKGLNCGMNLCLFKPLTLD 1064 L ++++ LP+ ++A K G L KP L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.3 bits (78), Expect = 7e-04 Identities = 11/33 (33%), Positives = 16/33 (48%) Query: 30 MVALLGPSGSGKTTLLRIIAGLEHQTSGHIRFH 62 V L G G GK+TL+ + GL+ + H Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 50.9 bits (122), Expect = 1e-09 Identities = 34/116 (29%), Positives = 51/116 (43%), Gaps = 9/116 (7%) Query: 63 VRDGIVWDFFGAVTIVRRHLD-TLEQQFGRRFSHVATSFPPGTDP---RISINVLESAGL 118 ++DG++ DFF +++ + F R V P G R + AG Sbjct: 76 MKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGA 135 Query: 119 EVSHVLDEPTAVA---DLLQLDNAG--VVDIGGGTTGIAIVKKGKVTYSADEATGG 169 +++EP A A L + G VVDIGGGTT +A++ V YS+ GG Sbjct: 136 REVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGG 191
>STREPKINASE#Streptococcus streptokinase protein signature. Length = 440 Score = 29.3 bits (65), Expect = 0.015 Identities = 27/120 (22%), Positives = 52/120 (43%), Gaps = 21/120 (17%) Query: 130 GNPLSSQEVLEGGESLILSE-----VAEPPAQMIDSLTTLFKTIKPVKRAFICSIKENEE 184 G+ ++SQE+L +S++ + E + ++ +F+TI P+ + F +K E+ Sbjct: 217 GDTITSQELLAQAQSILNKNHPGYTIYERDSSIVTHDNDIFRTILPMDQEFTYRVKNREQ 276 Query: 185 A-QPNLLIGIEADGDIEEIIQAAGSVATDTLPGDEPIDICQVKKGEKGISHFITEHIAPF 243 A + N G+ + + ++I V +KKGEK F H+ F Sbjct: 277 AYRINKKSGLNEEINNTDLISEKYYV---------------LKKGEKPYDPFDRSHLKLF 321
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 114 bits (288), Expect = 5e-30 Identities = 81/371 (21%), Positives = 144/371 (38%), Gaps = 74/371 (19%) Query: 23 GIDLGTTNSLVATVRSGQAETLADHEGRHLLPSVVHYQQQGHS-------VGYDARTNAA 75 IDLGT N+L+ G + +E PSVV +Q VG+DA+ Sbjct: 14 SIDLGTANTLIYVKGQG----IVLNE-----PSVVAIRQDRAGSPKSVAAVGHDAK-QML 63 Query: 76 LDTANTISSVKRLMGRSLADIQQRYPHLPYQFQASENGLPMIETAAGLLNPVRVSADILK 135 T I++++ + +AD V+ +L+ Sbjct: 64 GRTPGNIAAIRPMKDGVIADF-------------------------------FVTEKMLQ 92 Query: 136 ALAARATEALAGE-LDGVVITVPAYFDDAQRQGTKDAARLAGLHVLRLLNEPTAAAIAYG 194 + V++ VP +R+ +++A+ AG + L+ EP AAAI G Sbjct: 93 HFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAG 152 Query: 195 LDSGQEGVIAVYDLGGGTFDISILRLSRGVFEVLATGGDSALGGDDFDHLLADYIREQAG 254 L + V D+GGGT +++++ L+ V +GGD FD + +Y+R G Sbjct: 153 LPVSEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYG 207 Query: 255 --IPDRSDNRVQRELLDAAIAAKIALSDADSVTVNVAG---WQG-----EISREQFNELI 304 I + + R++ E+ A + + V G +G ++ + E + Sbjct: 208 SLIGEATAERIKHEI-------GSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEAL 260 Query: 305 APLVKRTLLACRRALKDAGVE-ADEVLE--VVMVGGSTRVPLVRERVGEFFGRPPLTSID 361 + + A AL+ E A ++ E +V+ GG + + + E G P + + D Sbjct: 261 QEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAED 320 Query: 362 PDKVVAIGAAI 372 P VA G Sbjct: 321 PLTCVARGGGK 331
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 26.3 bits (58), Expect = 0.032 Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 11/87 (12%) Query: 4 KTLTAAAAVLLMLTAGCSTLERVVYRPDINQGNYLTANDVSKIRV--GMTQQQVAYALGT 61 K + AVL + AG LER ++ Q + + + VS+ + GMT ++ A Sbjct: 69 KVV-LCGAVLARVDAGDEQLERKIH---YRQQDLVDYSPVSEKHLADGMTVGELCAA--A 122 Query: 62 PLMSDPFGTNTWFYVFRQQPGHEGVTQ 88 MSD N + G G+T Sbjct: 123 ITMSDNSAANL---LLATVGGPAGLTA 146
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 82.4 bits (203), Expect = 9e-21 Identities = 67/257 (26%), Positives = 120/257 (46%), Gaps = 7/257 (2%) Query: 3 QVAVVIGGGQTLGAFLCHGLAAEGYRVAVVDIQSDKAANVAQEINAEYGEGTAYGFGADA 62 ++A + G Q +G + LA++G +A VD +K V + AE A F AD Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE--ARHAEAFPADV 66 Query: 63 TSEQSVLALSRGVDEIFGRVDLLVYSAGIAKAAFISDFQLGDFDRSLQVNLVGYFLCARE 122 ++ ++ ++ G +D+LV AG+ + I +++ + VN G F +R Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 123 FSRLMIRDGIQGRIIQINSKSGKVGSKHNSGYSAAKFGGVGLTQSLALDLAEYGITVHSL 182 S+ M D G I+ + S V + Y+++K V T+ L L+LAEY I + + Sbjct: 127 VSKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 183 MLGNLLKSPMFQSL-LPQYATKLGIKPEQVEQYYIDKVPLKRGCDYQDVLNMLLFYASPK 241 G+ ++ M SL + + IK +E + +PLK+ D+ + +LF S + Sbjct: 186 SPGS-TETDMQWSLWADENGAEQVIKGS-LETFKTG-IPLKKLAKPSDIADAVLFLVSGQ 242 Query: 242 ASYCTGQSINVTGGQVM 258 A + T ++ V GG + Sbjct: 243 AGHITMHNLCVDGGATL 259
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 27.9 bits (62), Expect = 0.024 Identities = 10/45 (22%), Positives = 18/45 (40%), Gaps = 5/45 (11%) Query: 1 MKPRQRQAAILEYLQKQGKCSVEEL-----AQYFDTTGTTIRKDL 40 M QR I E + + +EL ++ T T+ +D+ Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDI 45
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 372 bits (956), Expect = e-127 Identities = 125/388 (32%), Positives = 193/388 (49%), Gaps = 33/388 (8%) Query: 149 IAALAAGALS----------NALLIEQLESQNMLPGDAAPFEAVKQTQMIGLSPGMTQLK 198 I A GA +I + ++ ++ ++G S M ++ Sbjct: 91 IKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIY 150 Query: 199 KEIEIVAASDLNVLISGETGTGKELVAKAIHEASPRAVNPLVYLNCAALPESVAESELFG 258 + + + +DL ++I+GE+GTGKELVA+A+H+ R P V +N AA+P + ESELFG Sbjct: 151 RVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFG 210 Query: 259 HVKGAFTGAISNRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYGDIQRVGDDRSLR 318 H KGAFTGA + +G+FE A+ GTLFLDEIG++ + Q +LLRVLQ G+ VG +R Sbjct: 211 HEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIR 270 Query: 319 VDVRVLAATNRDLREEVLAGRFRADLFHRLSVFPLSVPPLRERGDDVILLAGYFCEQCRL 378 DVR++AATN+DL++ + G FR DL++RL+V PL +PPLR+R +D+ L +F +Q Sbjct: 271 SDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE- 329 Query: 379 RLGLSRVVLSAGARNLLQHYNFPGNVRELEHAIHRAVVLARATRSGDEVIL-----EAQH 433 + GL A L++ + +PGNVRELE+ + R L E+I E Sbjct: 330 KEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPD 389 Query: 434 FAFPEVTLPPPEAAAVPVVKQNLR-----------------EATEAFQRETIRQALAQNH 476 + + V++N+R + I AL Sbjct: 390 SPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATR 449 Query: 477 HNWAACARMLETDVANLHRLAKRLGLKD 504 N A +L + L + + LG+ Sbjct: 450 GNQIKAADLLGLNRNTLRKKIRELGVSV 477
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.4 bits (63), Expect = 0.036 Identities = 17/93 (18%), Positives = 29/93 (31%), Gaps = 7/93 (7%) Query: 3 TTMLEVAKRAGVSKATVSRVLSG-----NGYVSQETKDRVFQAVEESGYRPNLLARNLSA 57 T++ E+AK AGV++ + + + +E P L Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLRE 91 Query: 58 KSTQTLGLVVTNTLYHGIYFSELLFHAARMAEE 90 L VT + E++FH E Sbjct: 92 ILIHVLESTVTEERRRLLM--EIIFHKCEFVGE 122
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 27.0 bits (59), Expect = 0.012 Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 8/75 (10%) Query: 12 IDGNQAKVD--VCGIQRDVDLTLVGSCDENGQPRVGQWVLVHVGFAMSVINEAEARDTLD 69 I GNQ + D G+ D L ++NG+P G W+ + + F + ++ ++ D + Sbjct: 171 IIGNQIRTDQKFMGV-FDESLKERQEAEKNGEPTGGDWLDIFLSF---IFDKKQSSDVKE 226 Query: 70 ALQN--MFDVEPDVG 82 A+ + V+PD+ Sbjct: 227 AINQEPVPHVQPDIA 241
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 389 bits (1001), Expect = e-131 Identities = 140/373 (37%), Positives = 204/373 (54%), Gaps = 39/373 (10%) Query: 350 YQEIHRLKERLVDENLALTEQLNNVDSEFGEIIGRSEAMYSVLKQVEMVAQSDSTVLILG 409 E+ + R + E +L + + ++GRS AM + + + + Q+D T++I G Sbjct: 108 LTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITG 167 Query: 410 ETGTGKELIARAIHNLSGRNNRRMVKMNCAAMPAGLLESDLFGHERGAFTGASAQRIGRF 469 E+GTGKEL+ARA+H+ R N V +N AA+P L+ES+LFGHE+GAFTGA + GRF Sbjct: 168 ESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRF 227 Query: 470 ELADKSSLFLDEVGDMPLELQPKLLRVLQEQEFERLGSNKIIQTDVRLIAATNRDLKKMV 529 E A+ +LFLDE+GDMP++ Q +LLRVLQ+ E+ +G I++DVR++AATN+DLK+ + Sbjct: 228 EQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSI 287 Query: 530 ADREFRSDLYYRLNVFPIHLPPLRERPEDIPLLAKAFTFKIARRLGRNIDSIPAETLRIL 589 FR DLYYRLNV P+ LPPLR+R EDIP L + F + A + G ++ E L ++ Sbjct: 288 NQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFV-QQAEKEGLDVKRFDQEALELM 346 Query: 590 SNMEWPGNVRELENVIERAVLLTRGNVLQLSL---------------------PDIALPE 628 WPGNVRELEN++ R L +V+ + +++ + Sbjct: 347 KAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQ 406 Query: 629 PETPPAATVVAQEG--------------EDEYQLIVRVLKETNGVVAGPKGAAQRLGLKR 674 A G E EY LI+ L T G AA LGL R Sbjct: 407 AVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQI---KAADLLGLNR 463 Query: 675 TTLLSRMKRLGID 687 TL +++ LG+ Sbjct: 464 NTLRKKIRELGVS 476
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 27.0 bits (60), Expect = 0.027 Identities = 16/138 (11%), Positives = 39/138 (28%), Gaps = 28/138 (20%) Query: 22 NDEVELTLAGGAKLVAIV--------------THSSQQALGLAKGKEAIAL----IKAPW 63 N + A A++ ++V + L +EAI L K P Sbjct: 17 NLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLEEAITLRERGWKGPI 76 Query: 64 VTL--ATEDCGLKFSARNQFAGSVSTI--------TEGAVNATVHIKTDAGFEIVAVVTN 113 + L L+ +++ V + +++K ++G + + Sbjct: 77 LMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYLKVNSGMNRLGFQPD 136 Query: 114 ESQDEMKLTTGSRVIALI 131 + + + Sbjct: 137 RVLTVWQQLRAMANVGEM 154
>PF07675#Cleaved Adhesin Length = 1358 Score = 30.4 bits (68), Expect = 0.021 Identities = 20/92 (21%), Positives = 39/92 (42%), Gaps = 12/92 (13%) Query: 206 ILGQTYLPRKFKTTVVIP---PQND--IDLHANDMNFVAIAENGKLVGFNLLVGGGLSIE 260 ++ +P+ T +P PQN + A+ ++VAI+++G L G + G++ Sbjct: 240 VMPYRAMPKT--NTYTLPASLPQNQASYSIQASAGSYVAISKDGVLYGTGVANASGVATV 297 Query: 261 HGNK-----KTYARTASEFGYLPLEHTLAVAE 287 + K Y + YLP+ + E Sbjct: 298 NMTKQITENGNYDVVITRSNYLPVIKQIQAGE 329
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 36.4 bits (84), Expect = 2e-04 Identities = 45/314 (14%), Positives = 112/314 (35%), Gaps = 36/314 (11%) Query: 69 LGSLVLGWISDHIGRQKIFTFSFMLITLASFLQFFATTP-EHLIGLRILIGIGLGGDYSV 127 +G+ V G +SD +G +++ F ++ S + F + LI R + G G ++ Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL 123 Query: 128 GHTLLAEFSPRRHRGILLGAFSVVWT----VGYVLASIAGHHFISESPEAWRWLLASAAL 183 ++A + P+ +RG G + VG + + H+ W +LL + Sbjct: 124 VMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI------HWSYLLLIPMI 177 Query: 184 PALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVATATHKHIKTLF-- 241 + + L + R +G F I+ +L + + + L Sbjct: 178 TIITVPFLMKLLKKEVR---IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL 234 Query: 242 -SSRYWRRTA--------FNSVFFVCLVIPWFVIYT----WLPTIAQTIGLEDALTASLM 288 ++ R+ ++ F+ V+ +I+ ++ + + L+ + + Sbjct: 235 IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEI 294 Query: 289 LNALLIVGALLGLV-------LTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLF 341 + ++ G + ++ L L L+ + + + L +S + + Sbjct: 295 GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTII 354 Query: 342 VLFSTTISAVSNLV 355 ++F + + V Sbjct: 355 IVFVLGGLSFTKTV 368
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 105 bits (264), Expect = 2e-29 Identities = 73/257 (28%), Positives = 117/257 (45%), Gaps = 11/257 (4%) Query: 11 MDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANVFIPSFVKDNGETKEMIEK-QGVEVD 69 M+ ++GK A +TG G+G+A A LA GA++ + + E K + + Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60 Query: 70 FMQVDITAEGAPQKIIAACCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAA 129 D+ A +I A G +DILVN AG+ + + +W+ VN T Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120 Query: 130 FELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNI 189 F S +K M+ ++SG I+ + S + + AY+++K A FTK EL +YNI Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180 Query: 190 QVNGIAPGYYATDI--TLATRSNPETNQRVLDH-------IPANRWGDTQDLMGAAVFLA 240 + N ++PG TD+ +L N Q + IP + D+ A +FL Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAE-QVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239 Query: 241 SPASNYVNGHLLVVDGG 257 S + ++ H L VDGG Sbjct: 240 SGQAGHITMHNLCVDGG 256
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.9 bits (75), Expect = 0.002 Identities = 21/76 (27%), Positives = 34/76 (44%), Gaps = 1/76 (1%) Query: 41 GFSNTEIGLIMSTFGIAAIIFYA-PSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLW 99 + T IG+ ++ FGI + A +G +A + R+ + MI G +L+A W Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 100 VMLCIQVAFAITTILM 115 + I V A I M Sbjct: 302 MAFPIMVLLASGGIGM 317 Score = 30.9 bits (70), Expect = 0.008 Identities = 42/268 (15%), Positives = 94/268 (35%), Gaps = 30/268 (11%) Query: 48 GLIMSTFGIAAIIFYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQVA 107 G++++ + + G ++D+F R ++ ++ + +MAT P LWV+ ++ Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 108 FAITTILMLWSVSIKAASLLGD---HSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAP 164 IT + A + + D E+ + G+M G G+++ V Sbjct: 106 AGITG-----ATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGP-----VLGGLMG 155 Query: 165 DDSASLKTVIIIYSVVYILLGILCWFFV-----SDNNNLRNTNNEEKQSFQLSDILAVLR 219 S + + L + F + + LR SF+ + + V+ Sbjct: 156 GFSPHAP--FFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVA 213 Query: 220 ISTTWYCSMVIFGVF--TIYAILSYST-NYLTEMYGMSLVAASYMGIVINKIFRALCGPL 276 + M + G ++ I ++ G+SL A + + +A+ + Sbjct: 214 ALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHS----LAQAM---I 266 Query: 277 GGIITTYSKVKSPTRVVQILSIIGLLAL 304 G + + + I G + L Sbjct: 267 TGPVAARLGERRALMLGMIADGTGYILL 294
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 80.7 bits (199), Expect = 1e-18 Identities = 44/142 (30%), Positives = 63/142 (44%), Gaps = 14/142 (9%) Query: 415 PEQKMEVTASLQVQTVRLDSMSLFDVGQARLKDGSTKVL---VDALVNIRAKPGWLILVA 471 +Q + L S LF+ +A LK L L N+ K G ++V Sbjct: 200 VAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDG-SVVVL 258 Query: 472 GYTDATGDEKSNQQLSLRRAEAVRNWMLQTSDIPATCFAVQGLGESQPAATNDTPQGR-- 529 GYTD G + NQ LS RRA++V ++ L + IPA + +G+GES P N + Sbjct: 259 GYTDRIGSDAYNQGLSERRAQSVVDY-LISKGIPADKISARGMGESNPVTGNTCDNVKQR 317 Query: 530 -------AVNRRVEISLVPRSD 544 A +RRVEI + D Sbjct: 318 AALIDCLAPDRRVEIEVKGIKD 339
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 35.2 bits (81), Expect = 0.001 Identities = 36/189 (19%), Positives = 66/189 (34%), Gaps = 34/189 (17%) Query: 512 IMTLRQEGTDSTELQQQLRTHQGFAPLLALDVDARAVATVVADWTGI--------PLSSL 563 + + ++ +L +++ + P+L + + + A G L+ L Sbjct: 52 VTDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111 Query: 564 LKDEQSDLLSMEKSLENR---------VVGQSPALCAIAQRL-RAAKTGLTPENGPQGVF 613 + L ++ +VG+S A+ I + L R +T LT Sbjct: 112 IGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLT--------L 163 Query: 614 LLTGPSGTGKTETALTLADTLFGGEKSLITINLSEYQEPHTVSQLKGSPPGYVGYGQGGV 673 ++TG SGTGK A L D + IN++ S+L G + G Sbjct: 164 MITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGA 215 Query: 674 LTEAVRKRP 682 T A + Sbjct: 216 FTGAQTRST 224
>PF04183#IucA / IucC family Length = 580 Score = 31.4 bits (71), Expect = 0.004 Identities = 9/36 (25%), Positives = 19/36 (52%), Gaps = 3/36 (8%) Query: 114 AEIITALEEYKKQYPHLAKRVEKISGYVDDIDKEVL 149 A ++ +Y K++P +++R S + I + VL Sbjct: 511 AAVL---SDYMKKHPQMSERFALFSLFRPQIIRVVL 543
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 31.0 bits (70), Expect = 0.034 Identities = 16/71 (22%), Positives = 24/71 (33%), Gaps = 6/71 (8%) Query: 302 LRLAHTLAERGIAHWQSVL---KPLLAGGAFSSLRLRGLMFSPPLAAVPEAAPHAWLPSP 358 + +T ER I +S L G F + RG + +P +P Sbjct: 243 WQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLP---DSQRGFAP 299 Query: 359 VWAGITGDNAR 369 V GI A+ Sbjct: 300 VIHGIARGTAQ 310
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 29.3 bits (65), Expect = 0.010 Identities = 13/83 (15%), Positives = 35/83 (42%), Gaps = 9/83 (10%) Query: 33 ESKSVASAVFYKQIKILHLDFFSR---------SALNTDAEDTPLSTMVHVWQLKTREDF 83 + + V+Y+ K + LD S+ + + + ++D+ S ++ + K + + Sbjct: 161 INSEQSKEVYYEIGKGISLDIISKDKSLDPEFLNLIKSLSDDSDSSDLLFSQKFKEKLEL 220 Query: 84 DKADYDTLFMQEEKTLEKDVLAK 106 + D F++E T + + Sbjct: 221 NNKSIDINFIKENLTEFQHAFSL 243
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 107 bits (268), Expect = 3e-32 Identities = 67/172 (38%), Positives = 101/172 (58%), Gaps = 8/172 (4%) Query: 1 MRITVFLLTFLSFLSDLWAVDIPINITGTIIIPPCQINNSNPVDVDFGNIRVSELDTKEH 60 +R+++F+ L+ ++ L D+ INI G + IPPC INN + VDFGNI +D Sbjct: 2 IRLSLFISLLLTSVAVL--ADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRG 59 Query: 61 IKVVSFPVYCPYHQGEAYVKMTGQSM-TGKDNVLATNIDGLGIELYQGGEGTGNHLILGS 119 + + CPY G ++K+TG +M G++NVLATNI GI LYQ G+G L LG+ Sbjct: 60 EVTKNISISCPYKSGSLWIKVTGNTMGVGQNNVLATNITHFGIALYQ-GKGMSTPLTLGN 118 Query: 120 GSSGYGYEVINALSEKNVERTTFTFTAKIYKAEGVTINSGEFSASALINIVY 171 G SG GY V L + R+TFTFT+ ++ +N G+F +A ++++Y Sbjct: 119 G-SGNGYRVTAGL---DTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIY 166
>cloacin#Cloacin signature. Length = 551 Score = 28.5 bits (63), Expect = 0.040 Identities = 24/75 (32%), Positives = 32/75 (42%), Gaps = 5/75 (6%) Query: 3 GGHPGTSGPGTTVAAALSSGEVTLYTPAI----VCISRQKNVKKQRAENMQKMKPALKKT 58 GG GT G + VAA ++ G L TP V IS + A+ M +K K Sbjct: 72 GGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGA-LSAAIADIMAALKGPFKFG 130 Query: 59 LMAVACLSAVPAAQA 73 L VA +P+ A Sbjct: 131 LWGVALYGVLPSQIA 145
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 759 bits (1961), Expect = 0.0 Identities = 248/885 (28%), Positives = 386/885 (43%), Gaps = 65/885 (7%) Query: 15 LNRLHIMKKNKSTFTINFITYSLMLSLAGVPVYAVDFNTDVLDAADRQNIDFSRFSRAGY 74 LHI K + F + + A + + FN L + D SRF Sbjct: 13 TQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQE 72 Query: 75 IMPGQYQMEIRVNGQDISPSAFQIAFLEPPFSDSDNEKPLPEPCLTPEIVSRMGLTEASQ 134 + PG Y+++I +N +A + F+ D+E+ + PCLT ++ MGL AS Sbjct: 73 LPPGTYRVDIYLNNG-------YMATRDVTFNTGDSEQGI-VPCLTRAQLASMGLNTASV 124 Query: 135 EKVTYWNNGQCADFRQL-SGVEIRPNPAEGMLYINMPQAWLEYSDASWLPPSRWDNGIPG 193 + + C + + + + L + +PQA++ ++PP WD GI Sbjct: 125 SGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINA 184 Query: 194 LLFDYNINGTVNKPHQGKQSQSLNYNGTAGANFGAWRLRADYQGNLNHTTGSAQGTDSQF 253 L +YN +G + G S N +G N GAWRLR + + N + S+ G+ +++ Sbjct: 185 GLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSS-GSKNKW 243 Query: 254 TWSRFYMYRAIPRWRANLTLGENYINSEIFSSWRYTGASLESDDRMLPPKLRGYAPQVSG 313 ++ R I R+ LTLG+ Y +IF + GA L SDD MLP RG+AP + G Sbjct: 244 QHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHG 303 Query: 314 IADTNARVVISQQGRILYDSTVPAGPFTIQDLD-SSVRGRLDVEVIEQDGRKKTFQVDTA 372 IA A+V I Q G +Y+STVP GPFTI D+ + G L V + E DG + F V + Sbjct: 304 IARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYS 363 Query: 373 YVPYLTRPGQVRYKLVSGRSRTYEHTMEGPVFAAGEASWGISNTWSLYGGSIVAGDYNAL 432 VP L R G RY + +G R+ E P F G+ W++YGG+ +A Y A Sbjct: 364 SVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAF 423 Query: 433 AVGLGRDLSKFGTVSADVTQSVARIPGYDTKQGKSWRLSYSKRFDEVNTDITFAGYRFSE 492 G+G+++ G +S D+TQ+ + +P G+S R Y+K +E T+I GYR+S Sbjct: 424 NFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYST 483 Query: 493 RNYMTMDQYLNARYR--------------------NDFTGREKELYTVTLNKNFEDWKAS 532 Y +R + ++ +T+ + ++ Sbjct: 484 SGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT-ST 542 Query: 533 VNLQYSHQTYWDRRTSD-YYTLSVNRYFDAFSFKNIALGISASRSKYLNRD--NDSAFVR 589 + L SHQTYW D + +N +F++I 
+S S +K + + + Sbjct: 543 LYLSGSHQTYWGTSNVDEQFQAGLNT-----AFEDINWTLSYSLTKNAWQKGRDQMLALN 597 Query: 590 LSVPWGT------------GTASYSGSMSND-RYTNTVGYSDTL-NNGLSSYSLNAGVNS 635 +++P+ +ASYS S + R TN G TL + SYS+ G Sbjct: 598 VNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAG 657 Query: 636 GGGQPSQRQMSAYYNHNGSLTNLSASFSAVENGYSSFGMSASGGATVTMKGAALHAGGMN 695 GG S A N+ G N + +S + SGG G L G Sbjct: 658 GGDGNSGSTGYATLNYRGGYGNANIGYSH-SDDIKQLYYGVSGGVLAHANGVTL--GQPL 714 Query: 696 GGTRLLVDTDGVGGVPVDGGR-VYTNRWGIGVVTDVSSYYRNTTSVDLNKLPEDMEATRS 754 T +LV G V+ V T+ G V+ + Y N ++D N L ++++ + Sbjct: 715 NDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNA 774 Query: 755 VVESVLTEGAIGYREFEVLKGSRLFAVLRMSDNSYPPFGASVTNAKGRELGMVADSGLAW 814 V V T GAI EF+ G +L L +N PFGA VT+ + G+VAD+G + Sbjct: 775 VANVVPTRGAIVRAEFKARVGIKLLMTLTH-NNKPLPFGAMVTSESSQSSGIVADNGQVY 833 Query: 815 LSGVNPGETLNVGW--DGRTQCVVDIPAHPDPAQQLL----LPCR 853 LSG+ + V W + CV + P+ QQLL CR Sbjct: 834 LSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 33.5 bits (76), Expect = 3e-04 Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 9/86 (10%) Query: 29 GMTLPEYWG----EEHVWWDGRASFKGQVIAPACTLSMEDAWQEIDMGTTPLRDLQNSPA 84 G+ LP G +HV +FKG++I PACT+ E++ G +++L S Sbjct: 6 GLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTVQN----AEVNWGDIEIQNLVQS-G 60 Query: 85 GPEKKFRLRLRNCELTGAGKQVYTAT 110 G +K F + + G K T+ Sbjct: 61 GNQKDFTVDMNCPYSLGTMKVTITSN 86
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 240 bits (614), Expect = 1e-76 Identities = 112/479 (23%), Positives = 188/479 (39%), Gaps = 83/479 (17%) Query: 10 SILLIDDDADVLDAYTQLLEQSGYRVFACNNPFEAQAWIQPDWPGIVLSDVCMPGCSGID 69 +IL+ DDDA + Q L ++GY V +N WI +V++DV MP + D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 70 LMMLFHQDDQQLPILLITGHGDVPMAVDAVKKGAWDFLQKPVDPGKLLSLVEEALRQRQS 129 L+ + LP+L+++ A+ A +KGA+D+L KP D +L+ ++ AL + + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 130 IIARRQYCQQTLQVELIGRSEWINQYRRRLQQLSETDIAVWLYGAPGTGRMTGARYLHQF 189 ++ + Q L+GRS + + R L +L +TD+ + + G GTG+ AR LH + Sbjct: 125 RPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183 Query: 190 GRNAQGEFVYRELTPDNAPQLND------------------------FIALAQGGTLVLS 225 G+ G FV N + A+GGTL L Sbjct: 184 GKRRNGPFV-----AINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238 Query: 226 HPEHLTREQQYHLVQ-LQSQEHRP----------FRLIGIGDTSLVELAASNHIIAELYY 274 + + Q L++ LQ E+ R++ + L + +LYY Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298 Query: 275 CFAMTQIACLPLTQRPDDIEPLFRHYLCKACQRLNHPVPEVGKEMLKEMMRRMWPNNVRE 334 + + PL R +DI L RH++ +A + V +E L+ M WP NVRE Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRE 357 Query: 335 LANAAE-----------------LFTVGILPLAETANPLMHVGT---------------- 361 L N +P + G+ Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417 Query: 362 --------PAPLDRRVEDAERQIITEALNIHQGRINEVAEYLQIPRKKLYLRMKKYGLS 412 DR + + E +I AL +G + A+ L + R L ++++ G+S Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 39.4 bits (92), Expect = 2e-05 Identities = 66/387 (17%), Positives = 131/387 (33%), Gaps = 39/387 (10%) Query: 52 TPYLKEQLDLSATQI---GVLSSCMLIAYGISKGVMSSLADKASPKVFMACGLVLCAIVN 108 P L L S G+L + + V+ +L+D+ + + L A+ Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87 Query: 109 VGLGFSTAFWIFAVLVILNGLFQGMGVGPSFITIANWFPRRERGRVGAFWNISHNVGGGI 168 + + W+ + I+ G+ G IA+ ER R F +S G G+ Sbjct: 88 AIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDERAR--HFGFMSACFGFGM 144 Query: 169 VA-PIVGAAFALLGSEHWQSASYIVPACVAIVFAVIVLILGKGSPRQEGLPSLEEMMPEE 227 VA P++G A + A + + + L +PE Sbjct: 145 VAGPVLGGLMGGFSPH----APFFAAAALNGLNFLTGCFL----------------LPE- 183 Query: 228 KVVLNTRQTVKAPENMSAFQIFCTYVLRNKNAWYVSLVDVFVYMVRFGMISWLPIYLLTV 287 + + + P A ++ +L+ VF M G + + Sbjct: 184 -----SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238 Query: 288 KHFSKEQMSVAFLFFEWA---AIPSTLLAGWLSDKLFKGRRMPLAMICMALIFICLIGYW 344 F + ++ + ++ ++ G ++ +L + R + L MI +I L Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298 Query: 345 KSESLFMVTIFAAIVGCLIYVPQFLASVQTMEIVPSFAVGSAVGLRGFMSYIFGASLGTS 404 + F + + A G + Q + S Q E GS L ++ I G L T+ Sbjct: 299 RGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTS-LTSIVGPLLFTA 357 Query: 405 LFGIMVDHIGWHGGFYLLGCGIICCII 431 ++ + W+G ++ G + + Sbjct: 358 IYAASITT--WNGWAWIAGAALYLLCL 382
>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature. Length = 372 Score = 28.5 bits (63), Expect = 0.018 Identities = 28/99 (28%), Positives = 42/99 (42%), Gaps = 7/99 (7%) Query: 57 ITQSDLEQLEATSLESITKTISEL---KSLKTNKNSTQEEILDLEKKRKEMELLVKKASM 113 + + DLE++ LES+ K + E K+LK N + L + + LL +AS Sbjct: 133 LRRKDLEEIVRKKLESLLKHVEEQTDPKTLKAGINCALKARLFGKTLSLKPGLL--RASY 190 Query: 114 ALFFREQLNYHERRILSEIKGSETLNHSLSEIKEIKGKL 152 F Q HE I S+ S L + I+G L Sbjct: 191 RQFI--QSESHEVEIYSDWIASYGYQRRLVVLDFIEGSL 227
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 27.5 bits (61), Expect = 0.026 Identities = 16/48 (33%), Positives = 22/48 (45%), Gaps = 3/48 (6%) Query: 1 MRRAS--AGFTLLEMLVAIAIFASLA-LMAQQVTNGVTRVNSAVAGHD 45 MR GFTLLE++V I I LA L+ + + + A D Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSD 48
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 33.8 bits (77), Expect = 8e-05 Identities = 13/24 (54%), Positives = 18/24 (75%) Query: 2 KRGFTLLEVMLALAIFALAATAVL 25 +RGFTLLE+ML L + ++A VL Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL 26
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 76.5 bits (188), Expect = 5e-20 Identities = 42/196 (21%), Positives = 70/196 (35%), Gaps = 41/196 (20%) Query: 1 MPERGFTLLEIMLVIFLIGLASAGVVQTFATDSESPAKKAAQDFLTRFAQFKDRAVIEGQ 60 M +RGFTLLE+ML++ L+G+++ V+ F + A + F + + R + GQ Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60 Query: 61 TLGVLIDPPGYQFMQRRQGQWLPVSATRLSAQVTVPKQVQMLLQPGSDIWQKEYALELQR 120 GV + P +QF+ + P D W L L+ Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPA-------------------PADDGWSGYRWLPLRA 101 Query: 121 RRL----TLHDIELEL-----QKEAKKKTPQIRFSPFEPATPFTLRFYSAAQNACWAVKL 171 R+ ++ +L L + P + P TPF L L Sbjct: 102 GRVATSGSIAGGKLNLAFAQGEAWTPGDNPDVLIFPGGEMTPFRLT-------------L 148 Query: 172 AHDGALSLNQCDERMP 187 ++ N E +P Sbjct: 149 GEAPGIAFNARGESLP 164
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 217 bits (554), Expect = 3e-76 Identities = 90/146 (61%), Positives = 109/146 (74%), Gaps = 3/146 (2%) Query: 6 RTQKPRAGFTLLEVMVVIVILGVLASLVVPNLLGNKEKADRQKAISDIVALENALDMYRL 65 R + GFTLLE+MVVIVI+GVLASLVVPNL+GNKEKAD+QKA+SDIVALENALDMY+L Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61 Query: 66 DNGRYPTTEQGLEALIQQPANMADSRNYRTGGYIKRLPKDPWGNDYQYLSPGEKGLFDVY 125 DN YPTT QGLE+L++ P + NY GYIKRLP DPWGNDY ++PGE G +D+ Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121 Query: 126 TLGADGQENGEGAGADIGNWNLQEFQ 151 + G DG+ E DI NW L + + Sbjct: 122 SAGPDGEMGTED---DITNWGLSKKK 144
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.0 bits (96), Expect = 6e-06 Identities = 59/329 (17%), Positives = 107/329 (32%), Gaps = 37/329 (11%) Query: 34 PTLMEELNISTQQ---YSYIIAAYSAAYTVMQPVAGYVLDVLGTK----IGYAMFAVLWA 86 P L+ +L S Y ++A Y+ PV G + D G + + A AV +A Sbjct: 29 PGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYA 88 Query: 87 VFCGATALAGSWGGLAVA--RGAVGAAEAAMIPAGLKASSEWFPAKERSIAVGYFNVGSS 144 + A L + G VA GA GA A I ++ ER+ G+ + Sbjct: 89 IMATAPFLWVLYIGRIVAGITGATGAVAGAYI-------ADITDGDERARHFGFMSACFG 141 Query: 145 IGAMIAPPLVVWAIVMHSWQMAFIISGALSFIWAMAWLIFYKHPRDQKHLTDEERDYIIN 204 G M+A P++ + S F + AL+ + + K R +N Sbjct: 142 FG-MVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES--HKGERRPLRREALN 198 Query: 205 GQEAQHQVDTAKKMSVGQILRNRQFWGIALPRFLAEPAWGTFNAWIPLFMFKVYGFNLKE 264 + ++ + F+ + L + W +F + ++ Sbjct: 199 PLASFRWARGMTVVAALMAV----FFIMQLVGQVPAALWV-------IFGEDRFHWDATT 247 Query: 265 IAMFAWMPMLFADLGCILGGYLPPLFQRWFGVNLIVSRKMVV-TLGAVLMIGPGMIGLFT 323 I + F L + + G + M+ G +L+ + Sbjct: 248 IGI---SLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMA- 303 Query: 324 NPYVAIMLLCIGGFAHQALSGALITLSSD 352 + ++LL GG AL L + Sbjct: 304 --FPIMVLLASGGIGMPALQAMLSRQVDE 330
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 74.1 bits (182), Expect = 1e-15 Identities = 72/433 (16%), Positives = 139/433 (32%), Gaps = 30/433 (6%) Query: 185 NSRVDAYRNEQLLGSFYLNSGSQFIDTSSFPPGSYSVALKVYENNQLTRTELVPFTKTGG 244 ++V +N + + + G I+ S + + + E + T+ VP++ Sbjct: 308 TAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPL 367 Query: 245 LT-DGNAQWFLQAGKTTSQVS-DDESSAYQLGVRLPLHPQYELYAGLANADDVSAFELGN 302 L +G+ ++ + AG+ S + ++ +Q + L + +Y G AD AF G Sbjct: 368 LQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGI 427 Query: 303 NWTADLGGAGNLAISASVFRNDDGGKGDMQQANWSH-PGWPTLGF------YRTNSDGDA 355 GA ++ ++ + D + D Q + + G YR ++ G Sbjct: 428 GKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYF 487 Query: 356 CTTDNRESYNALSCYES--ISATVSQNFVGWNMMLGYTRTQNNTDDSLRWDKQQSFENNY 413 D S E+ V F + + R + + + + + + Sbjct: 488 NFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSG 547 Query: 414 LRQT--SAQSISETVQLSASRAFVMRDWILSTSLGVFHRNDNGGDNDDNGLYLSFS--LS 469 QT ++ E Q + AF ++ +L + D L L+ + S Sbjct: 548 SHQTYWGTSNVDEQFQAGLNTAFED----INWTLSYSLTKNAWQKGRDQMLALNVNIPFS 603 Query: 470 DTPTMDSNNNSHSTNVSTDYRYSDQDGDQTSWQLSHTFYNDSFSHKEL--GVTVGGLNTD 527 DS + + S + + T D+ + G GG Sbjct: 604 HWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNS 663 Query: 528 TINSAVNGRWDGQYGNVYATVSDSYDRQNHDHLSAFTGTYSSTLAVSRYGINVGASGSDD 587 + G YGN S S D + S + G+ +G +D Sbjct: 664 GSTGYATLNYRGGYGNANIGYSHSDDIKQ------LYYGVSGGVLAHANGVTLGQPLND- 716 Query: 588 LLGAVLVDVKGFS 600 VLV G Sbjct: 717 --TVVLVKAPGAK 727 Score = 31.0 bits (70), Expect = 0.025 Identities = 40/222 (18%), Positives = 68/222 (30%), Gaps = 35/222 (15%) Query: 199 SFYLNSGSQFIDTSSF------PPGSYSVALKVYENNQLTRTELVPFTKTGGLTDGNAQW 252 F + D S F PPG+Y V + + NN T V F Sbjct: 52 RFLADDPQAVADLSRFENGQELPPGTYRVDIYL--NNGYMATRDVTFNTGDSEQG----- 104 Query: 253 FLQAGKTTSQVSDDESSAYQLGVRLPLHPQYELYAGLANADDVSAFELGNNWTADLG-GA 311 + T +Q++ +G+ L A A S D+G Sbjct: 105 -IVPCLTRAQLA-------SMGLNTASVSGMNLLADDACVPLTSMIH-DATAQLDVGQQR 155 
Query: 312 GNLAISASVFRNDDGGKGDMQQANWSHPGWPTLGFYRTNSDGDACTTDNRESYNALSCYE 371 NL I + N +G + W L Y + + NR N+ Y Sbjct: 156 LNLTIPQAFMSNRA--RGYIPPELWDPGINAGLLNYNFS----GNSVQNRIGGNSHYAYL 209 Query: 372 SISATVSQNFVGW----NMMLGYTRTQNNTDDSLRWDKQQSF 409 ++ + + N W N Y + +++ +W ++ Sbjct: 210 NLQSGL--NIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTW 249
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 28.0 bits (62), Expect = 0.031 Identities = 26/111 (23%), Positives = 44/111 (39%), Gaps = 22/111 (19%) Query: 42 NKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIA-------NDRL 94 K+L GN + A T + IA + V AI+ D+ Sbjct: 277 TKVL--GNVGKGISQYIIAQRAAQGLSTSAAAAGLIA----SAVTLAISPLSFLSIADKF 330 Query: 95 HD----EVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVA 141 E Y+++ + LG+ GD LLA + A++A++T T++A Sbjct: 331 KRANKIEEYSQRFKKLGYDGDSLLAAFHKETG-----AIDASLTTISTVLA 376
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.0 bits (65), Expect = 0.014 Identities = 8/22 (36%), Positives = 13/22 (59%) Query: 4 VLITGATGLVGGHLLRMLINEP 25 L+TGA G +G H+ + L+ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG 24
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 73.4 bits (180), Expect = 2e-15 Identities = 70/313 (22%), Positives = 110/313 (35%), Gaps = 77/313 (24%) Query: 396 IMGHVDHGKTSLLDYI-----RSTKVASGEAG-------------GITQHIGAYHVETEN 437 ++ HVD GKT+L + + T++ S + G GIT G + EN Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67 Query: 438 GMITFLDTPGHAAFTSMRARGAQATDIVVLVVAADDGVMPQTIEAIQHAKAAGVPVVVAV 497 + +DTPGH F + R D +L+++A DGV QT + G+P + + Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127 Query: 498 NKIDKPEADPDRV----KNELSQYGI-----------------LPEEWG----------- 525 NKID+ D V K +LS + E+W Sbjct: 128 NKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLE 187 Query: 526 ---------------GESQFV---------HVSAKAGTGIDELLDAILLQAEVLELKAVR 561 ES H SAK GID L++ I + Sbjct: 188 KYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVIT--NKFYSSTHRG 245 Query: 562 KGMASGAVIESFLDKGRGPVATVLVREGTLHKGDIVL-CGFEYGRVRAMRNELGQEVLEA 620 + G V + + R +A + + G LH D V E ++ M + E+ + Sbjct: 246 QSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYTSINGELCKI 305 Query: 621 GPSIPVEILGLSG 633 + EI+ L Sbjct: 306 DKAYSGEIVILQN 318
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 59.9 bits (145), Expect = 8e-12 Identities = 64/406 (15%), Positives = 135/406 (33%), Gaps = 32/406 (7%) Query: 30 LLDGFDFVLIALVLTEVQGEFGLTTVQAASLISAAFISRWFGGLMLGAMGDRYGRRLAMV 89 + +++ + L ++ +F + +A ++ G + G + D+ G + ++ Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 90 TSIVLFSAGTLACGFAPGYITMFI-ARLVIGMGMAGEYGSSATYVIESWPKHLRNKASGF 148 I++ G++ + ++ I AR + G G A V PK R KA G Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 149 LISGFSVGAVVAAQVYSLVVPVWGWRALFFIGILPIIFALWLRKNIPEAEDWKEKHGGKA 208 + S ++G V + ++ W L I ++ II +L K + + Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVR--------- 194 Query: 209 PVRTMVDILYRGEHRIANIVMTLAAATALWFCFAGNLQNAAIVAVLGLLCAAIFISFMVQ 268 +G I I++ + IV+VL L IF+ + + Sbjct: 195 ---------IKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL---IFVKHIRK 242 Query: 269 STGK----RWPTGVMLMVVVLFAFLYSWPIQA---LLPTYLKTDLAYDPHTVANVLFFSG 321 T + M+ VL + + ++P +K + +V+ F G Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302 Query: 322 -FGAAVGCCVGGFLGDWLGTRK-AYVCSLLASQLLIIPVFAIGGANVWVLGLLLFFQQML 379 + +GG L D G + S + F + W + +++ F Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLET-TSWFMTIIIVFVLGG 361 Query: 380 GQGIAGILPKLIGGYFDTDQRAAGLGFTYNVGALGGALAPIIGALI 425 ++ ++ + AG+ L I + Sbjct: 362 LSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 27.5 bits (61), Expect = 0.026 Identities = 8/27 (29%), Positives = 16/27 (59%) Query: 127 IEADKSGTVKAILVESGQPVEFDEPLV 153 I+ ++ VK I+V+ G+ V + L+ Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLL 125
>DNABINDNGFIS#DNA-binding protein FIS signature. Length = 98 Score = 157 bits (399), Expect = 3e-54 Identities = 98/98 (100%), Positives = 98/98 (100%) Query: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 Query: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 220 bits (563), Expect = 7e-68 Identities = 91/294 (30%), Positives = 145/294 (49%), Gaps = 15/294 (5%) Query: 148 PDSPGAVTIIDLDLSRYNAIASRFAEERQQALDFLKSGIATRNSHFNRMIEQIEKVAIKS 207 D + II L+ S+ ++ Q + + R++ + + ++ ++ Sbjct: 106 FDLTELIGIIGRALAEPKRRPSKLEDDSQDGM-----PLVGRSAAMQEIYRVLARLM-QT 159 Query: 208 RAPILLNGPTGAGKSFLARRIFELKQARHQFSGAFVEVNCATLRGDTAMSTLFGHVKGAF 267 +++ G +G GK +AR + + + R+ G FV +N A + D S LFGH KGAF Sbjct: 160 DLTLMITGESGTGKELVARALHDYGKRRN---GPFVAINMAAIPRDLIESELFGHEKGAF 216 Query: 268 TGARESREGLLRSANGGMLFLDEIGELGADEQAMLLKAIEEKTFYPFGSDRQVSSDFQLI 327 TGA+ G A GG LFLDEIG++ D Q LL+ +++ + G + SD +++ Sbjct: 217 TGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIV 276 Query: 328 AGTVRDLRQLVAEGKFREDLYARINLWTFTLPGLRQRQEDIEPNLDYEVERHASLTGDSV 387 A T +DL+Q + +G FREDLY R+N+ LP LR R EDI + + V++ D Sbjct: 277 AATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVK 336 Query: 388 RFNTEARRAWLAFATSPQATWRGNFRELSASVTRMATFATSGRITLDTVEDEIN 441 RF+ EA A W GN REL V R+ IT + +E+E+ Sbjct: 337 RFDQEALELMKAHP------WPGNVRELENLVRRLTALYPQDVITREIIENELR 384
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 33.7 bits (77), Expect = 2e-04 Identities = 15/48 (31%), Positives = 26/48 (54%), Gaps = 5/48 (10%) Query: 1 MKQTQRHNGIIELVKQQGYVSTEELV-----EHFSVSPQTIRRDLNEL 43 M + QRH I E++ + +ELV + ++V+ T+ RD+ EL Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL 48
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 884 bits (2287), Expect = 0.0 Identities = 398/866 (45%), Positives = 568/866 (65%), Gaps = 28/866 (3%) Query: 19 KRVVPLLLVIMPACSIA--------GMRFNPAFLSGDTEAVADLSRFEKGMTYLPGSYEV 70 R+ + + AC+ A + FNP FL+ D +AVADLSRFE G PG+Y V Sbjct: 21 HRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRV 80 Query: 71 EVWVNDSPLLSRTVTFKADD-ENQLIPCLSLADLLSLGINKNALPEQALASSENSCLDLR 129 ++++N+ + +R VTF D E ++PCL+ A L S+G+N ++ L + +++C+ L Sbjct: 81 DIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLA-DDACVPLT 139 Query: 130 IWFPDVHYMPELDAQRLKLTFPQAIIKRDARGYIPPEQWDNGITAFLLNYDFSGN--NDR 187 D ++ QRL LT PQA + ARGYIPPE WD GI A LLNY+FSGN +R Sbjct: 140 SMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNR 199 Query: 188 GDYSSNNYYLNLRAGINIGAWRFRDYSTWSR-----GSNSAGKLEHISSTLQRVIIPFRS 242 +S+ YLNL++G+NIGAWR RD +TWS S S K +HI++ L+R IIP RS Sbjct: 200 IGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRS 259 Query: 243 ELTLGDTWSSSDVFDSVSIRGIKLESDENMLPDSQSGFAPTVRGIAKSRAQVTIKQNGYV 302 LTLGD ++ D+FD ++ RG +L SD+NMLPDSQ GFAP + GIA+ AQVTIKQNGY Sbjct: 260 RLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYD 319 Query: 303 IYQTYMPPGPFEISDLNPTSSAGDLEVTIKESDNSETVYTVPYAAVPILQREGHLKYSTT 362 IY + +PPGPF I+D+ ++GDL+VTIKE+D S ++TVPY++VP+LQREGH +YS T Sbjct: 320 IYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSIT 379 Query: 363 VGQYRSNSYNQKSPYVFQGELIWGLPWDITAYGGAQFSEDYRALALGLGLNLGVFGATSF 422 G+YRS + Q+ P FQ L+ GLP T YGG Q ++ YRA G+G N+G GA S Sbjct: 380 AGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSV 439 Query: 423 DVTQANSSLVDGSKHQGQSYRFLYSKSLVQTGTAFHIIGYRYSTQGFYTLSDTTYQQMSG 482 D+TQANS+L D S+H GQS RFLY+KSL ++GT ++GYRYST G++ +DTTY +M+G Sbjct: 440 DMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNG 499 Query: 483 TVVDPKTLDDKDYVYNWNDFYNLRYSKRGKFQASVSQPFGNYGSMYLSASQQTYWNTDKK 542 ++ + + + D+YNL Y+KRGK Q +V+Q G ++YLS S QTYW T Sbjct: 500 YNIETQDGVIQVKPK-FTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNV 558 
Query: 543 DSLYQVGYNTSIKGIYLNVAWNYSKSPGTN-ADKIVSLNVSLPISNWLSSTNDGRSSSNA 601 D +Q G NT+ + I ++++ +K+ D++++LNV++P S+WL S D +S Sbjct: 559 DEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRS--DSKSQWRH 616 Query: 602 MTATYGYSQDNHGQVNQYTGVSGSLLEQHNLSYNIQHGFANQDNSSSGSVG---VNYRGA 658 +A+Y S D +G++ GV G+LLE +NLSY++Q G+A + +SGS G +NYRG Sbjct: 617 ASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGG 676 Query: 659 YGSLNSAYSYDNEGNQQINYGISGALVVHENGLTLSQPLGETNVLIKAPGANNVDVQRGT 718 YG+ N YS+ + +Q+ YG+SG ++ H NG+TL QPL +T VL+KAPGA + V+ T Sbjct: 677 YGNANIGYSHSD-DIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQT 735 Query: 719 GISTDWRGYAVVPYATEYRRNNISLDPMSMNMHTELDITSTEVIPGKGALVRAEFAAHIG 778 G+ TDWRGYAV+PYATEYR N ++LD ++ + +LD V+P +GA+VRAEF A +G Sbjct: 736 GVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVG 795 Query: 779 IRGLFTVRYRNKSVPFGATASAQIKNSSQITGIVGDNGQLYLSGLPLEGVINIQWGDGVQ 838 I+ L T+ + NK +PFGA +++ SSQ +GIV DNGQ+YLSG+PL G + ++WG+ Sbjct: 796 IKLLMTLTHNNKPLPFGAMVTSE---SSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852 Query: 839 QKCQANYKLPETELDNPVSYATLECR 864 C ANY+LP ++ + ECR Sbjct: 853 AHCVANYQLPPESQQQLLTQLSAECR 878
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 35.7 bits (82), Expect = 3e-05 Identities = 21/92 (22%), Positives = 32/92 (34%), Gaps = 16/92 (17%) Query: 55 VACIDGIVVGHLTIDVQQRPRRSHVADFGICVDSRWKNRGVASALMREMIE------MCD 108 + ++ +G + I + + D + D R K GV +AL+ + IE C Sbjct: 69 LYYLENNCIGRIKIR-SNWNGYALIEDIAVAKDYRKK--GVGTALLHKAIEWAKENHFCG 125 Query: 109 NWLRVDRIELTVFVDNAPAIKVYKKFGFEIEG 140 L I N A Y K F I Sbjct: 126 LMLETQDI-------NISACHFYAKHHFIIGA 150
>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature. Length = 293 Score = 32.0 bits (72), Expect = 0.005 Identities = 27/80 (33%), Positives = 36/80 (45%), Gaps = 13/80 (16%) Query: 272 RTPISGDYRGYQVYSMPPPSSGGIHIVQILNILENFDMQKYGF-GSADAMQIMAEAEKYA 330 R P+ G+ R + SMPPP G H +I N+ F Q G+ G A I++E EK Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNL--KFFKQFDGYVGGQTAWGILSELEKGR 133 Query: 331 YADRSEYLGDPDFVKVPWQA 350 Y P F WQ+ Sbjct: 134 Y---------PTFSYQDWQS 144
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.041 Identities = 10/29 (34%), Positives = 16/29 (55%) Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTTGDI 61 +V+ G G GKSTL+ + GL+ + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHF 627
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 39.3 bits (91), Expect = 2e-05 Identities = 39/160 (24%), Positives = 66/160 (41%), Gaps = 14/160 (8%) Query: 134 GHLLSQPFNSSTPVLYYNKDAFKKAGLDPEQPPKTWQDLADYAAKLKASGMKCGYASGWQ 193 G L++ P L YNKD PPKTW+++ +LKA G + + Sbjct: 127 GKLIAYPIAVEALSLIYNKDLLP-------NPPKTWEEIPALDKELKAKGKSALMFNLQE 179 Query: 194 GWIQLENFSAWNGLPFASKNNGFDGTDAVLEF--NKPEQVKHIAMLEEMNKKGDFSYVGR 251 + +A G F +N +D D ++ K + +++ + D Y Sbjct: 180 PYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY--- 236 Query: 252 KDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMP 291 + F G+ AMT + +NI + +K NYGV ++P Sbjct: 237 -SIAEAAFNKGETAMTINGPWAWSNI-DTSKVNYGVTVLP 274
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 49.7 bits (118), Expect = 2e-08 Identities = 42/208 (20%), Positives = 72/208 (34%), Gaps = 21/208 (10%) Query: 20 QTPEK-ETEVQNEQPVVEEIVQAQEPVKASEHAVEEQPQAHTEAEAETFAADVVEVTEQV 78 TP + +V + EEI + E T AE + VE EQ Sbjct: 998 TTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQD 1057 Query: 79 AESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEAVSP----EEWQAEAET 134 A AQ EV + + V+ + + + E E +E +A+ ET Sbjct: 1058 ATETTAQ-NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVET 1116 Query: 135 VEIVEAAE---EEAAKDEITDE---------EPEAQALAAEAAEEAVMVVSPAEEEQPVE 182 + E + + + K E ++ E + E + + A+ EQP + Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQT---NTTADTEQPAK 1173 Query: 183 EIAQEQEKPTKEGFFARLKRSLLKTKEN 210 E + E+P E S+++ EN Sbjct: 1174 ETSSNVEQPVTESTTVNTGNSVVENPEN 1201 Score = 49.3 bits (117), Expect = 3e-08 Identities = 35/178 (19%), Positives = 56/178 (31%), Gaps = 7/178 (3%) Query: 19 EQTPEKETEVQNEQPVVEEIVQAQEPVKASEHAVEEQPQAHTEAEAETFAADVVEVTEQV 78 TP + TE E E + A+E + + A EV + Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG 1089 Query: 79 AESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEAVSPEEWQAEAETVEIV 138 +E+++ Q E E E +E E+ V ++ VSP++ Q+E + Sbjct: 1090 SETKETQTT----ETKETATVEKEEKAKVETEKTQEVPKVTSQ-VSPKQEQSETVQPQAE 1144 Query: 139 EAAEEEAAK--DEITDEEPEAQALAAEAAEEAVMVVSPAEEEQPVEEIAQEQEKPTKE 194 A E + E + A E + V P E V E P Sbjct: 1145 PARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENT 1202 Score = 43.1 bits (101), Expect = 3e-06 Identities = 28/156 (17%), Positives = 45/156 (28%), Gaps = 14/156 (8%) Query: 17 QKEQTPEKETEVQNEQPVVEEIVQAQEPVKASE------HAVEEQPQAHTEAEAETFAAD 70 Q +T E T + E+ VE + P S+ + QPQA E + Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155 Query: 71 VVEVTEQVAESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEAVSPEEWQA 130 ++ ++ QP E + E V E+ V + PE+ P Sbjct: 1156 KEPQSQTNTTADTEQPAKETSSNVEQPVTES-TTVNTGNSVVENPENTTPATTQPTVNSE 1214 
Query: 131 EAETVEI-------VEAAEEEAAKDEITDEEPEAQA 159 + + E A D A Sbjct: 1215 SSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALC 1250 Score = 37.4 bits (86), Expect = 1e-04 Identities = 25/182 (13%), Positives = 52/182 (28%), Gaps = 11/182 (6%) Query: 17 QKEQTPEKETEVQNEQPVVEEIVQAQEPVKASEHAVEEQPQAHTEAE-AETFAADVVEVT 75 +E E ++ V+ E+ Q+ K ++ ++ + E A+ EV Sbjct: 1065 NREVAKEAKSNVKAN-TQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVP 1123 Query: 76 EQVAE-------SEKAQPEAEVVAQPEPVV--EETPEPVAIEREELPLPEDVNAEAVSPE 126 + ++ SE QP+AE + +P V +E + ++ ++ P Sbjct: 1124 KVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPV 1183 Query: 127 EWQAEAETVEIVEAAEEEAAKDEITDEEPEAQALAAEAAEEAVMVVSPAEEEQPVEEIAQ 186 T V E + + + P E Sbjct: 1184 TESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSND 1243 Query: 187 EQ 188 Sbjct: 1244 RS 1245
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 53.3 bits (128), Expect = 8e-10 Identities = 80/398 (20%), Positives = 147/398 (36%), Gaps = 32/398 (8%) Query: 13 LRLNLRIVSIVMFNFASYLTIGLPLAVLPGYVHDVM--GFSAFWAGLVISLQYFATLLSR 70 ++ N ++ I+ + IGL + VLPG + D++ G++++L Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60 Query: 71 PHAGRYADLLGPKKIVVFGLCGCFLSGLGYLTAGLTASLPVISLLLLCLGRVILGI-GQS 129 P G +D G + +++ L G + + Y L V L +GR++ GI G + Sbjct: 61 PVLGALSDRFGRRPVLLVSLAG---AAVDYAIMATAPFLWV-----LYIGRIVAGITGAT 112 Query: 130 FAGTGSTLWGVGVVGSL--HIGRVISWNGIVTYGAMAMGAPLGVVFYHWGGLQALALIIM 187 A G+ + + H G + + G +G +G H A AL + Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGL 172 Query: 188 GVALVAILLAIPRPTVK--ASKGKPLPFRAVLGRVWLYGMALALA-----SAGFGVIATF 240 LL + + P + + +A +A V A Sbjct: 173 NFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAAL 232 Query: 241 ITLFYDAK-GWDGAAFALTLFSCAFVGT---RLLFPNGINRIGGLNVAMICFSVEIIGLL 296 +F + + WD ++L + + + ++ R+G M+ + G + Sbjct: 233 WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI 292 Query: 297 LVGVATMPWMAKIG-VLLAGAGFSLVFPALGVVAVKAVPQQNQGAALATYTVFMDLSLGV 355 L+ AT WMA VLLA G + PAL + + V ++ QG + L+ + Sbjct: 293 LLAFATRGWMAFPIMVLLASGGIGM--PALQAMLSRQVDEERQGQLQGSLAALTSLT-SI 349 Query: 356 TGPLAGLVMSWAGVPV----IYLAAAGLVAIALLLTWR 389 GPL + A + ++A A L + L R Sbjct: 350 VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387
>BORPETOXINB#Bordetella pertussis toxin B subunit signature. Length = 226 Score = 28.1 bits (62), Expect = 0.046 Identities = 21/77 (27%), Positives = 32/77 (41%), Gaps = 10/77 (12%) Query: 204 GQRHVTWARLRGLSDKQTERRHILRNASLPMITAVGMHIGELIGGTMIIENIFAWPGVG- 262 R +T A LRG D Q RH+ R S+ + G ++G GG +I++ PG Sbjct: 53 KTRALTVAELRGSGDLQEYLRHVTRGWSIFALYD-GTYLGGEYGG--VIKD--GTPGGAF 107 Query: 263 ----RYAVSAIFNRDYP 275 + + N P Sbjct: 108 DLKTTFCIMTTRNTGQP 124
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 30.2 bits (68), Expect = 0.008 Identities = 10/34 (29%), Positives = 19/34 (55%) Query: 25 QAVLNNVSLALKSGETVALLGRSGCGKSTLARLL 58 Q + ++ +++ T+ + G SG GK +AR L Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 59.1 bits (143), Expect = 3e-11 Identities = 44/147 (29%), Positives = 69/147 (46%), Gaps = 18/147 (12%) Query: 3 IATAGHVDHGKTTLLQAI---TGV------------NADRLPEEKKRGMTIDLGYAYWPQ 47 I HVD GKTTL +++ +G D E++RG+TI G + Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65 Query: 48 PDGRVPGFIDVPGHEKFLSNMLAGVGGIDHALLVVACDDGVMAQTREHLAILQLTGNPML 107 + +V ID PGH FL+ + + +D A+L+++ DGV AQTR L+ G P + Sbjct: 66 ENTKV-NIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTI 124 Query: 108 TVALTKADRVDEARVDEVERQVKEMLR 134 + K D+ + V + +KE L Sbjct: 125 -FFINKIDQNG-IDLSTVYQDIKEKLS 149
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 64.1 bits (156), Expect = 2e-13 Identities = 56/314 (17%), Positives = 103/314 (32%), Gaps = 82/314 (26%) Query: 66 ITPQVTGIVTEVTDKNNQLIQKGEVLFKLDPVR------------YQARVD--RLQA--- 108 I P IV E+ K + ++KG+VL KL + QAR++ R Q Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158 Query: 109 ------------------------DLMTATHNIK----TLRAQLTEAQANTTQVSAERDR 140 +++ T IK T + Q + + N + AER Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT 218 Query: 141 LFKNYQRY----------LKGSQAAVNPFS---------ERDIDDARQNF---LAQDALV 178 + RY L + ++ + E +A +Q + Sbjct: 219 VLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQI 278 Query: 179 KGSVAE----QAQIQSQLDSMVNGE----QSQIVSLRAQLTEAKYNLEQTVIRAPSNGYV 230 + + + + + + I L +L + + + +VIRAP + V Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKV 338 Query: 231 TQVLIR-PGTYAAALPLRPVMVFIPEQKRQIV-AQFRQNSLLRLKPGDDAEVVFNALPGQ 288 Q+ + G +MV +PE V A + + + G +A + A P Sbjct: 339 QQLKVHTEGGVVT--TAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYT 396 Query: 289 VFH---GKLTSILP 299 + GK+ +I Sbjct: 397 RYGYLVGKVKNINL 410
>PF03895#Serum resistance protein DsrA. Length = 79 Score = 65.2 bits (159), Expect = 5e-15 Identities = 19/79 (24%), Positives = 36/79 (45%), Gaps = 2/79 (2%) Query: 1547 ESKLSGGIASAMAMTGLPQAYTPGASMASIGGGTYNGESAVALGV-SMVSANGRWVYKLQ 1605 +L G+A+ A++ L Q G + S G Y ++A+A+GV S ++ + Sbjct: 2 SKELQTGLANQSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITDRFTAKAGVA 61 Query: 1606 GSTNSQGEYSAALGAGIQW 1624 +T + G S G ++ Sbjct: 62 FNTYN-GGMSYGASVGYEF 79
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 104 bits (260), Expect = 7e-28 Identities = 77/348 (22%), Positives = 127/348 (36%), Gaps = 67/348 (19%) Query: 2 IIVTGGAGFIGSNIVKALNDKGITDILVVDNLKD--------------GTKFVNLVDLDI 47 +VTG AGFIG ++ K L + G ++ +DNL D +D+ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 48 ADYMDKEDFLIQIMAGEEFGDVEAIFHEGACSSTTEWDGKYMMDNNYQYSK-------EL 100 AD + + + A F E +F + +Y ++N + Y+ + Sbjct: 62 ADR----EGMTDLFASGHF---ERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNI 109 Query: 101 LHYCLEREIP-FLYASSAATYGGRTSD-FIESREYEKPLNVYGYSKFLFDEYVRQILPEA 158 L C +I LYASS++ YG F + P+++Y +K + Sbjct: 110 LEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169 Query: 159 NSQIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGESPKLFEGSENFKRDFVYVGDVA 218 G R+F VYGP + MA F + G+S ++ KRDF Y+ D+A Sbjct: 170 GLPATGLRFFTVYGPWG--RPDMA--LFKFTKAMLEGKSIDVY-NYGKMKRDFTYIDDIA 224 Query: 219 DVNL------------WFLENGVSG-------IFNLGTGRAESFQAVADATLAY-HKKGQ 258 + + W +E G ++N+G A + + Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284 Query: 259 IEYIPFPDKLKGRYQAFTQADLTNLRAA-GYDKPFKTVAEGVTEYMAW 305 +P G T AD L G+ P TV +GV ++ W Sbjct: 285 KNMLPLQ---PGDVL-ETSADTKALYEVIGF-TPETTVKDGVKNFVNW 327
>SECA#SecA protein signature. Length = 901 Score = 38.3 bits (89), Expect = 1e-04 Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 7/67 (10%) Query: 291 MRLVQGDV-----GSGKTLVAALAA-LRAIAHGKQVALMAPTELLAEQHANNFRNWFAPL 344 M L + + G GKTL A L A L A+ GK V ++ + LA++ A N R F L Sbjct: 92 MVLNERCIAEMRTGEGKTLTATLPAYLNALT-GKGVHVVTVNDYLAQRDAENNRPLFEFL 150 Query: 345 GIEVGWL 351 G+ VG Sbjct: 151 GLTVGIN 157
>PF08280#M protein trans-acting positive regulator Length = 530 Score = 34.1 bits (78), Expect = 0.001 Identities = 79/491 (16%), Positives = 168/491 (34%), Gaps = 73/491 (14%) Query: 7 RQNRLLRFLLPRREYTTIVTIAGYLNVSEKTIQRDLRLLEQWL-GQWRINVEKRAGAGVM 65 + +L+ + I +A ++ + L + + ++KR M Sbjct: 45 SKCQLVVLFF-KTSSLPITEVAEKTGLTFLQLNHYCEELNAFFPDSLSMTIQKR-----M 98 Query: 66 LSAENIADLLHLDHLLVAECEEIDGVMNNARRVKIASQLLSETPNETSISKLSERYFISG 125 + H ++ + + ++ +++ + L+ + ++ + +F+S Sbjct: 99 I-------SCQFTHP--SKETYLYQLYASSNVLQLLAFLIKNGSHSRPLTDFARSHFLSN 149 Query: 126 ASIVNDLRVIESWLAPLGLSLIRSPSGTHIEGSEGQVRQAMALLINGIINHNEPQGVVYS 185 +S + L L L S I G E ++R +ALL G+ Sbjct: 150 SSAYRMREALIPLLRNFELKL----SKNKIVGEEYRIRYLIALL-------YSKFGIKVY 198 Query: 186 RLDPGSYKALVHYFGEEEVLFVQSLLLDMENELSWSLGEPYYVNIFTHILIMMYRNTHGN 245 L K ++H F L S L LS E + F IL+ + H Sbjct: 199 DLTQQD-KNIIHSF-----LSHSSTHLKTSPWLS----ESFS---FYDILLALSWKRHQF 245 Query: 246 ALSREEDQTRQYDENIF---NVASQMIHKIEQRIAHTLPDDEVWFIYQ-YIISSGVAIDG 301 +++ + + Q + +F ++ IE ++ ++Y YI ++ Sbjct: 246 SVTIPQTRIFQQLKKLFVYDSLKKSSRDIIETYCQLNFSAGDLDYLYLIYITANNSFASL 305 Query: 302 Q---KDVSIISHMQASNEA-RLITWRLITVFSDIVD---------CDFSEDSALYDGLLV 348 Q + + + N+ RL+ +IT+ ++ + FS+ S L++ L Sbjct: 306 QWTPEHIRQCCQLFEENDTFRLLLNPIITLLPNLKEQKASLVKALMFFSK-SFLFN--LQ 362 Query: 349 HIKPLINRLNYRIHIRNPLLEDIKAELADVWRLTQYVVNQVFKTWGENAVSEDEVGYLTV 408 H P N + N L + + W + K G+ ++ Sbjct: 363 HFIPETNLFVSPYYKGNQKLYTSLKLIVEEW---------MAKLPGKRYLNHKHFHLFCH 413 Query: 409 HFQAAMERQIARKRVLLVCSTGIGTSHLLKSRILRAFPEWTI---VDVISAANLSQVLPD 465 + + + V+ V S I +HLL R F + +I + N+ Q+ Sbjct: 414 YVEQILRNIQPPLVVVFVASNFI-NAHLLTDSFPRYFSDKSIDFHSYYLLQDNVYQIPDL 472 Query: 466 NIELIISTINL 476 +L+I+ L Sbjct: 473 KPDLVITHSQL 483
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.1 bits (62), Expect = 0.042 Identities = 12/106 (11%), Positives = 31/106 (29%), Gaps = 14/106 (13%) Query: 16 QGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIAD----- 70 S IA+ G++R + + + KS+ + + + I + + Sbjct: 30 SSTSLGEIAKAAGVTRGAIYWHFKDKSD--------LFSEIWELSESNIGELELEYQAKF 81 Query: 71 -AHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVR 115 P + ++ + +L I + V+ Sbjct: 82 PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127
>OMADHESIN#Yersinia outer membrane adhesin signature. Length = 455 Score = 27.9 bits (61), Expect = 0.011 Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 2/52 (3%) Query: 21 GMALSSWSASDATGAVTVGVVAK--GTHQNSMAQGEFSCTTRENEVYIGYDS 70 G+A+ S +DA +V +G + H S+A G+ S T REN V IG++S Sbjct: 140 GVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHES 191
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 740 bits (1912), Expect = 0.0 Identities = 259/872 (29%), Positives = 420/872 (48%), Gaps = 47/872 (5%) Query: 5 NLSCLIYCRCSLLLFAALGLTVTNHSF----AAEEAEFDSEFLHLDKGINAIDIRRFSHG 60 N CL + L F + ++ E F+ FL D D+ RF +G Sbjct: 12 NTQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADD-PQAVADLSRFENG 70 Query: 61 NPVPEGRYYSDIYVNNVWKGKADLQYLRTANTGAPTLCLTPELLS-----LIDLVKDTMS 115 +P G Y DIY+NN + D+ + + CLT L+ + + Sbjct: 71 QELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLL 130 Query: 116 GNTSCFPASTGLSSASINFDLSTLRLNIEIPQALLNTRPRGYISPSQWQSGVPAAFINYD 175 + +C P ++ + A+ D+ RLN+ IPQA ++ R RGYI P W G+ A +NY+ Sbjct: 131 ADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYN 190 Query: 176 ANYYQY-SSSGTSNEQTYLGLKAGFNLWGWALRHRGSESWNNSYPAG-----YQNIETSI 229 + + G ++ YL L++G N+ W LR + S+N+S + +Q+I T + Sbjct: 191 FSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWL 250 Query: 230 MHDLAPLRAQFTLGDFYTNGELMDSLSLRGVRLASDERMLPGSLRGYAPAVRGIANSNAK 289 D+ PLR++ TLGD YT G++ D ++ RG +LASD+ MLP S RG+AP + GIA A+ Sbjct: 251 ERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQ 310 Query: 290 VTIYQNAHILYETTVPAGPFVINDLYPSGYAGDLIVKITESNGQTRMFTVPFAAVAQLIR 349 VTI QN + +Y +TVP GPF IND+Y +G +GDL V I E++G T++FTVP+++V L R Sbjct: 311 VTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQR 370 Query: 350 PGFSRWQMSVGKYR-YANKTYNDLIAQGTYQYGLTNDITLNSGLTTASGYTAGLAGLAFN 408 G +R+ ++ G+YR + Q T +GL T+ G A Y A G+ N Sbjct: 371 EGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKN 430 Query: 409 T-PLGAIASDITLSRTAFRYSGVTRKGYSLHSSYSINIPASNTNITLAAYRYSSKDFYHL 467 LGA++ D+T + + G S+ Y+ ++ S TNI L YRYS+ +++ Sbjct: 431 MGALGALSVDMTQANSTLP-DDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNF 489 Query: 468 KDALSANHNAF-------IDDVSVKSTAFY----RPRNQFQISINQELGEKWGGMYLTGT 516 D + N + + V K T +Y R + Q+++ Q+LG +YL+G+ Sbjct: 490 ADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGS 548 Query: 517 
TYNYWGHKGSRNEYQMGYSNFWKQLGYQIGLSQSRDNEQQRRDDRFYINFTLPLGE---- 572 YWG ++Q G + ++ + + + S +++ Q+ RD +N +P Sbjct: 549 HQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRS 608 Query: 573 ----SVQSPVFSTVLNYSKEEKNSIQTSISGTGGEDNQFSYGLS-----GNSQENGPSGY 623 + S +++ + + + GT EDN SY + G +G +GY Sbjct: 609 DSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGY 668 Query: 624 AMNGGYRSPYVNITTTVGHDTQNNNQRSFGASGAVVAHPYGVTLSNDLSDTFAIIHAEGA 683 A YR Y N H + Q +G SG V+AH GVTL L+DT ++ A GA Sbjct: 669 A-TLNYRGGYGNANIGYSHS-DDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGA 726 Query: 684 QGAAINNASGSRLDFWGNGIVPYVTPYEKNQISIDPSNLDLNVELSATEQEIIPRANSAT 743 + A + N +G R D+ G ++PY T Y +N++++D + L NV+L ++P + Sbjct: 727 KDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIV 786 Query: 744 LVKFDTKTGRSLLFDIRMSTGNPPPMASEVLDEHGQLAGYVAQAGKVFTRGLPEKGHLSV 803 +F + G LL + + P P + V E Q +G VA G+V+ G+P G + V Sbjct: 787 RAEFKARVGIKLLMTLTHN-NKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQV 845 Query: 804 VWGPDNKDRCSFVYHVAHNKDDMQSQLVPVLC 835 WG + C Y + + C Sbjct: 846 KWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877
>PHAGEIV#Gene IV protein signature. Length = 426 Score = 29.9 bits (67), Expect = 0.021 Identities = 18/88 (20%), Positives = 35/88 (39%), Gaps = 10/88 (11%) Query: 253 FNQKVELTPADI-EFVK---KITGLPVIVKGILRGEDAVVAIDAGADAI------QVSNH 302 F Q +E+ + + +FV K TG VIV ++G V + D + + + + Sbjct: 20 FAQVIEMNNSSLRDFVTWYSKQTGESVIVSPDVKGTVTVYSSDVKPENLRDFFISVLRAN 79 Query: 303 GGRQIDGVPSAISQLQEVAARVGHKVPV 330 + +PS I + ++P Sbjct: 80 NFDMVGSIPSIIQKYNPNNQDYIDELPS 107
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 78.9 bits (194), Expect = 1e-19 Identities = 38/162 (23%), Positives = 59/162 (36%), Gaps = 11/162 (6%) Query: 77 ARKTRSCSPEKTARTRQQIARAALEEFSAQGFARASISNISKRAGVAKGTVYNYFPTKEL 136 ARKT+ ++ TRQ I AL FS QG + S+ I+K AGV +G +Y +F K Sbjct: 2 ARKTK----QEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSD 57 Query: 137 LFEAVLKE----FIATVRTELESSPRRNGETVKAYLLRVMLPAVRKIDDASTGRARIAHL 192 LF + + P ++ L+ +L + + I H Sbjct: 58 LFSEIWELSESNIGELELEYQAKFPGDPLSVLREILIH-VLESTVTEERRRLLMEIIFHK 116 Query: 193 VMTEGSRFPVIAQAYLREIHQPLQQAMTQLIQEAASAGELKA 234 G V R + + Q ++ A L A Sbjct: 117 CEFVGEMAVVQQAQ--RNLCLESYDRIEQTLKHCIEAKMLPA 156
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 658 bits (1698), Expect = 0.0 Identities = 242/854 (28%), Positives = 419/854 (49%), Gaps = 59/854 (6%) Query: 20 AEDYFDPSLLATDIIGEGNIDLSAFSRPGGGMEGEQEVAIYVNDEFY-SRNTLFFKNTLD 78 AE YF+P LA D + DLS F G V IY+N+ + +R+ F + Sbjct: 45 AELYFNPRFLADD--PQAVADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSE 102 Query: 79 KGLLPEFTP------GFFDELLSGDFLVSEEDKTISSSDFLKKVPYSDINFNQGMSRVNV 132 +G++P T G +SG L++++ + + + G R+N+ Sbjct: 103 QGIVPCLTRAQLASMGLNTASVSGMNLLADDACV----PLTSMIHDATAQLDVGQQRLNL 158 Query: 133 SIPQAYLGDGAKLISSPDTWEYGGPAFLLDYNISGNRNDS-GNYDSRSLYISSQMGVNLM 191 +IPQA++ + A+ P+ W+ G A LL+YN SGN + +S Y++ Q G+N+ Sbjct: 159 TIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIG 218 Query: 192 KWRLRTSSSYSNYKTNSVWGGARSEQNSFYNTYAERDISSLRAILRLGEVSTAGLILDSV 251 WRLR ++++S ++S G Q+ NT+ ERDI LR+ L LG+ T G I D + Sbjct: 219 AWRLRDNTTWSYNSSDSSSGSKNKWQHI--NTWLERDIIPLRSRLTLGDGYTQGDIFDGI 276 Query: 252 PFRGMKLSSSDDMLGMRLRNYTPTVRGMASSQAVVTITQNGRQVYQTNVPAGPFELNDFY 311 FRG +L+S D+ML R + P + G+A A VTI QNG +Y + VP GPF +ND Y Sbjct: 277 NFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIY 336 Query: 312 LSGYSGDMLVTVREADGSEHSFLQPYSTLPEMKREGVSGFEVSVGRYDNNGAEHYYDAES 371 +G SGD+ VT++EADGS F PYS++P ++REG + + ++ G Y + + Sbjct: 337 AAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGN--AQQEKPR 394 Query: 372 FVYGNWSRGFARGVTFFAETLQAEKYQSLGGGSTLSLGRLGAASADISLSRADKYGDIR- 430 F G G T + T A++Y++ G ++G LGA S D++ + + D + Sbjct: 395 FFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQH 454 Query: 431 IGQSYGFKYSKSQIETGTTVTLATYRYSTENFYTFRDFV------------------SKT 472 GQS F Y+KS E+GT + L YRYST ++ F D Sbjct: 455 DGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPK 514 Query: 473 DTARYIWENKLKSRMTFSLSQSLGEYGYLSANASQQDYWNSREVSRNYSLTHSFSWNDIY 532 T Y + ++ +++Q LG L + S Q YW + V + + ++ DI Sbjct: 515 FTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDIN 574 Query: 533 FSTTLSMDDQRGRETGHLSNKQAGIYASVPLSKLLPRTDPTS---SSLTWSTSHADH-KV 588 
++ + S+ ++ ++ + ++P S L + +S ++S SH + ++ Sbjct: 575 WTLSYSLTKNAWQKG---RDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRM 631 Query: 589 RNSVTLDGKVPESD-VRYRVGGSW---GNGTTEGSRMASVSWTGDHASTSLGYTRVGKYR 644 N + G + E + + Y V + G+G + + A++++ G + + ++GY+ + Sbjct: 632 TNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIK 691 Query: 645 TLDYSMSGAAVMYPWGIAVGNSSVTGDGAIVVETPGAKGVR--TSTGYKTSWLGTALISS 702 L Y +SG + + G+ +G D ++V+ PGAK + TG +T W G A++ Sbjct: 692 QLYYGVSGGVLAHANGVTLGQP--LNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPY 749 Query: 703 PQKYTENRINLYPDGLPSDTVLGETSKTAVPAKGAVVVLDYTVFRGSQVVFTLRQTDGNP 762 +Y ENR+ L + L + L VP +GA+V ++ G +++ TL + P Sbjct: 750 ATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLTH-NNKP 808 Query: 763 LPFGTVITLDGVSRGKENSGIVGEEGRVYMAGIPEKGTLTASWGL--NKTCSIPFRINQH 820 LPFG ++T + ++SGIV + G+VY++G+P G + WG N C +++ Sbjct: 809 LPFGAMVTSE----SSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQLPPE 864 Query: 821 KAEAVIREVQGVCR 834 + ++ ++ CR Sbjct: 865 SQQQLLTQLSAECR 878
>adhesinb#Adhesin B signature. Length = 310 Score = 237 bits (605), Expect = 3e-79 Identities = 87/294 (29%), Positives = 157/294 (53%), Gaps = 7/294 (2%) Query: 5 ILVVALSSLLVSPLVIAKELNVVASFSVLGDMVSQIGGPYVHVTDLVQPDGDPHEFEPSP 64 + + A SS S + +LNVVA+ S++ D+ I G +++ +V DPHE+EP P Sbjct: 15 VGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSIVPVGQDPHEYEPLP 74 Query: 65 KDSKTLAQADVVFVNGLGLE----GWLDRLMKASGYRGE--VITASNGIDTLKMKEDGTT 118 +D K +QAD++F NG+ LE W +L++ + + S G+D + ++ Sbjct: 75 EDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVSEGVDVIYLEGQSEK 134 Query: 119 IT-DPHAWNSMKNGIVYAHNIVNGLSKADPEHASDYRKQGDSYIQQLQQLDNYATQTFAA 177 DPHAW +++NGI+YA NI LS+ DP + Y K +Y+++L LD A + F Sbjct: 135 GKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEKLSALDKEAKEKFNN 194 Query: 178 IPREKRKVLTSHDAFGYFAAAYGVRFLSPVGYSTESEASSKNVAKLINQIKREHVKLYFI 237 IP EK+ ++TS F YF+ AY V +TE E + + L+ ++++ V F+ Sbjct: 195 IPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTLVEKLRKTKVPSLFV 254 Query: 238 ENQTDPRLVKQIANASGAQAGGELYPEALTDSSGLAATYTAAFKHNVDTLAAGM 291 E+ D R +K ++ + +++ +++ + +Y + K+N++ +A G+ Sbjct: 255 ESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYNLEKIAEGL 308
>RTXTOXINC#Gram-negative bacterial RTX toxin-activating protein C signature. Length = 170 Score = 316 bits (811), Expect = e-114 Identities = 163/170 (95%), Positives = 166/170 (97%) Query: 1 MNRNNPLEVLGHVSWLWASSPLHRNWPVSLFAINVLPAIRANQYALLTRDNYPVAYCSWA 60 MN N PLE+LGHVSWLWASSPLHRNWPVSLFAINVLPAI+ANQY LLTRD+YPVAYCSWA Sbjct: 1 MNINKPLEILGHVSWLWASSPLHRNWPVSLFAINVLPAIQANQYVLLTRDDYPVAYCSWA 60 Query: 61 NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR 120 NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR Sbjct: 61 NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR 120 Query: 121 VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVKNKSDFNFSLTG 170 VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVK KSDFNFSLTG Sbjct: 121 VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVKRKSDFNFSLTG 170
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 1477 bits (3824), Expect = 0.0 Identities = 978/1024 (95%), Positives = 992/1024 (96%) Query: 1 MPTITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ 60 M TITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ Sbjct: 1 MTTITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ 60 Query: 61 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK 120 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK Sbjct: 61 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK 120 Query: 121 YQKAGNKLGGSAENIGDNLGKAGSVLSTFQNFLGTALSSMKIDELIKKQKSGSNVSSSEL 180 YQKAGN LGG AENIGDNLGKAG +LSTFQNFLGTALSSMKIDELIKKQKSG NVSSSEL Sbjct: 121 YQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSEL 180 Query: 181 AKASIELINQLVDTAASINNNVNSFSQQLNKLGSVLSNTKHLNGVGNKLQNLPNLDNIGA 240 AKASIELINQLVDT AS+NNNVNSFSQQLN LGSVLSNTKHLNGVGNKLQNLPNLDNIGA Sbjct: 181 AKASIELINQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLNGVGNKLQNLPNLDNIGA 240 Query: 241 GLDTVSGILSVISASFILSNADADTGTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL 300 GLDTVSGILS ISASFILSNADADT TKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL Sbjct: 241 GLDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL 300 Query: 301 STSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE 360 STSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE Sbjct: 301 STSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE 360 Query: 361 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH 420 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH Sbjct: 361 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH 420 Query: 421 VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHW 480 VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHW Sbjct: 421 VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHW 480 Query: 481 DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKKPDEFQKQVFDPLKGNIDLSDSKSS 540 DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKK 
DEFQKQVFDPLKGNIDLSDSKSS Sbjct: 481 DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKKXDEFQKQVFDPLKGNIDLSDSKSS 540 Query: 541 TLLKFVTPLLTPGEEIRERRQSGKYEYITELLVKGVDKWTVKGVQDKGSVYDYSNLIQHA 600 TLLKFVTPLLTPGEEIRERRQSGKYEYITELLVKGVDKWTVKGVQDKG+VYDYSNLIQHA Sbjct: 541 TLLKFVTPLLTPGEEIRERRQSGKYEYITELLVKGVDKWTVKGVQDKGAVYDYSNLIQHA 600 Query: 601 SVGNNQYREIRIESHLGDGDDKVFLAAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE 660 SVGNNQYREIRIESHLGDGDDKVFL+AGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE Sbjct: 601 SVGNNQYREIRIESHLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE 660 Query: 661 AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGTDLTETDNLYSVE 720 AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHING +LTETDNLYSVE Sbjct: 661 AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVE 720 Query: 721 ELIGTNRADKFFGSKFTDIFHGADGDDHIEGNDGNDRLYGDKGNDTLRGGNGDDQLYGGD 780 ELIGT RADKFFGSKFTDIFHGADGDD IEGNDGNDRLYGDKGNDTL GGNGDDQLYGGD Sbjct: 721 ELIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGD 780 Query: 781 GNDKLTGGVGNNYLNGGDGDDELQVQGNSLAKNVLSGGKGNDKLYGSEGADLLDGGEGND 840 GNDKL G GNNYLNGGDGDDE QVQGNSLAKNVL GGKGNDKLYGSEGADLLDGGEG+D Sbjct: 781 GNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDD 840 Query: 841 LLKGGYGNDIYRYLSGYGHHIIDDDGGKDDKLSLADIDFRDVAFKREGNDLIMYKAEGNV 900 LLKGGYGNDIYRYLSGYGHHIIDDDGGK+DKLSLADIDFRDVAFKREGNDLIMYK EGNV Sbjct: 841 LLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSLADIDFRDVAFKREGNDLIMYKGEGNV 900 Query: 901 LSIGHKNGITFRNWFEKESGDISNHQIEQIFDKDGRVITPDSLKKAFEYQQSNNQANYVY 960 LSIGHKNGITFRNWFEKESGDISNH+IEQIFDK GR+ITPDSLKKA EYQQ NN+A+YVY Sbjct: 901 LSIGHKNGITFRNWFEKESGDISNHEIEQIFDKSGRIITPDSLKKALEYQQRNNKASYVY 960 Query: 961 GEYASTYADLDNLNPLINEISKIISAAGNFDVKEERSAASLLQLSGNASDFSYGRNSITL 1020 G A Y +LNPLINEISKIISAAG+FDVKEER+AASLLQLSGNASDFSYGRNSITL Sbjct: 961 GNDALAYGSQGDLNPLINEISKIISAAGSFDVKEERTAASLLQLSGNASDFSYGRNSITL 1020 Query: 1021 TASA 1024 T SA Sbjct: 1021 TTSA 1024
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 601 bits (1551), Expect = 0.0 Identities = 462/478 (96%), Positives = 468/478 (97%) Query: 1 MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV 60 MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV Sbjct: 1 MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV 60 Query: 61 AYFIMGFLVIAFILSVLGQVEIVATANGKLTLSGRSKEIKPIENSIVKEIIVKEGESVRK 120 AYFIMGFLVIAFILSVLGQVEIVATANGKLT SGRSKEIKPIENSIVKEIIVKEGESVRK Sbjct: 61 AYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRK 120 Query: 121 GDVLLKLTALGAEADTLKTQSSLLQTRLEQIRYQILSRSIELNKLPELKLPDEPYFQNVS 180 GDVLLKLTALGAEADTLKTQSSLLQ RLEQ RYQILSRSIELNKLPELKLPDEPYFQNVS Sbjct: 121 GDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVS 180 Query: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTILARINRYENLSRVEKSRLDDF 240 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT+LARINRYENLSRVEKSRLDDF Sbjct: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDF 240 Query: 241 RSLLHKQAIAKHAVLEQENKYVEAANELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300 SLLHKQAIAKHAVLEQENKYVEA NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300 Query: 301 LDKLRQTTDSIELLTLELEKNEERQQASVIRAPVSGKVQQLKVHTEGGVVTTAETLMVIV 360 LDKLRQTTD+I LLTLEL KNEERQQASVIRAPVS KVQQLKVHTEGGVVTTAETLMVIV Sbjct: 301 LDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIV 360 Query: 361 PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQKLGL 420 PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQ+LGL Sbjct: 361 PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGL 420 Query: 421 VFNVIVSVEENDLSTGNKHIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLHER 478 VFNVI+S+EEN LSTGNK+IPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESL ER Sbjct: 421 VFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLRER 478
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 29.2 bits (65), Expect = 0.048 Identities = 23/80 (28%), Positives = 31/80 (38%), Gaps = 10/80 (12%) Query: 367 LGDAEIGDNVNIGAGTITCNYDGANKFKTIIGDDVFVGSDTQLVAPVTVGKGATIAAGTT 426 LGD + D V + AG+ N G DV T G AT A T Sbjct: 616 LGDGD--DKVFLSAGSA--NIYAGK------GHDVVYYDKTDTGYLTIDGTKATEAGNYT 665 Query: 427 VTRNVGENALAISRVPQTQK 446 VTR +G + + V + Q+ Sbjct: 666 VTRVLGGDVKVLQEVVKEQE 685
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 34.8 bits (80), Expect = 9e-04 Identities = 40/196 (20%), Positives = 62/196 (31%), Gaps = 51/196 (26%) Query: 170 KHALEHPKPTNAVSRALQHDLSDVVGQEQG----KRGLEITAAGGHNLLLIGPPGTGKTM 225 AL PK + D +VG+ R L L++ G GTGK + Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKEL 175 Query: 226 LASRINGLLPDLSNEEALESAAILSLVNAESVQKQWRQRPFRSPHHSA--------SLTA 277 +A ++ R PF + + +A L Sbjct: 176 VARALHDYGK-------------------------RRNGPFVAINMAAIPRDLIESELFG 210 Query: 278 MVGG---GAIP-GPGEISLAHNGVLFLDEL----PEFERRTLDALREPIESGQIHLSRTR 329 G GA G A G LFLDE+ + + R L L++ G+ Sbjct: 211 HEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQ----GEYT--TVG 264 Query: 330 AKITYPARFQLVAAMN 345 + + ++VAA N Sbjct: 265 GRTPIRSDVRIVAATN 280
>SECA#SecA protein signature. Length = 901 Score = 29.5 bits (66), Expect = 0.007 Identities = 11/71 (15%), Positives = 29/71 (40%) Query: 14 AKARRKTREELDQEARDRKRLKKRRGHAPGSRAAGGNTTSGSKGQNAPKDPRIGSKTPIP 73 +K + + EE+++ + R+ +R ++ + + + ++G P P Sbjct: 827 SKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCP 886 Query: 74 LGVAEKVTKQH 84 G +K + H Sbjct: 887 CGSGKKYKQCH 897
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 601 bits (1552), Expect = 0.0 Identities = 205/478 (42%), Positives = 299/478 (62%), Gaps = 11/478 (2%) Query: 1 MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGAEVLEALASKTPDVLLSDIRMPGM 60 M + V DDD++IR VL +AL+ AG N A + +A+ D++++D+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAIS 120 + LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 HYQEQQQPRNVQLNGPTTDIIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180 + + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A Sbjct: 121 EPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179 Query: 181 LHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE 240 LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239 Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHR 300 IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L+Q + +G FREDL++R Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299 Query: 301 LNVIRVHLPPLRERREDIPRLARHFLQVAARDLGVEAKLLHPETEAALTRLAWPGNVRQL 360 LNV+ + LPPLR+R EDIP L RHF+Q A ++ G++ K E + WPGNVR+L Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358 Query: 361 ENTCRWLTVMAAGQEVLIQDLPGELFESNVPESTSHMQPDSWATLLAQWADRALRS---- 416 EN R LT + + + + EL S + ++Q + +R Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418 Query: 417 -----GHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469 L E+E L+ AL T+G++ +AA LLG RNTL +K++ELG+ Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 180 bits (458), Expect = 4e-51 Identities = 97/445 (21%), Positives = 170/445 (38%), Gaps = 81/445 (18%) Query: 4 KLRNIAIIAHVDHGKTTLVDKLLQQSGTFDSRAETQE--RVMDSNDLEKERGITILAKNT 61 K+ NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAYGL 121 + +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + G+ Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121 Query: 122 KPIVVINKVDRPGARPDWVVDQVFD-------------LFVNLDATDEQLD--------- 159 I INK+D+ G V + + L+ N+ T+ Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181 Query: 160 --------------------------------FPIVYASALNGIAGLDHEDMAEDMTPLY 187 FP+ + SA N I G+D+ L Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNI-GIDN---------LI 231 Query: 188 QAIVDHVPAPDVDLDGPFQMQISQLDYNSYVGVIGIGRIKRGKVKPNQQVTIIDSEGKTR 247 + I + + ++ +++Y+ + R+ G + V I + E Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-- 289 Query: 248 NAKVGKVLGHLGLERIETDLAEAGDIVAITGLGELNISDTVCDTQNVEALPALSVDEPTV 307 K+ ++ + E + D A +G+IV + L ++ + DT+ + + P + Sbjct: 290 --KITEMYTSINGELCKIDKAYSGEIVILQNEF-LKLNSVLGDTKLLPQRERIENPLPLL 346 Query: 308 SMFFCVNTSPFCGKEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGELHLS 367 + + D L LR +S G++ + Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKVQME 397 Query: 368 VLIENMRRE-GFELAVSRPKVIFRE 391 V ++ + E+ + P VI+ E Sbjct: 398 VTCALLQEKYHVEIEIKEPTVIYME 422 Score = 32.5 bits (74), Expect = 0.005 Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 1/75 (1%) Query: 398 EPYENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 457 EPY + + +++ + ++ + V L IP+R + +RS+ Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595 Query: 458 MTSGTGLLYSTFSHY 472 T+G + + Y Sbjct: 596 FTNGRSVCLTELKGY 610
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.5 bits (66), Expect = 0.025 Identities = 31/161 (19%), Positives = 64/161 (39%), Gaps = 15/161 (9%) Query: 227 NVFFVYAVYCGLTFFIPFLKNIYLLP----------VALVGAYGIINQYCLKMIGGPIGG 276 N+ F+ V CG F + ++P A +G+ I +I G IGG Sbjct: 255 NIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGG 314 Query: 277 MISDKILKSPSKYLCYTFIISTAALVLLIMLPHESMPVYLGMACTLGFGAIVFTQRAVFF 336 ++ D+ + P L + + + L E+ ++ + G + FT+ Sbjct: 315 ILVDR--RGPLYVLNIGVTFLSVSFLTASFLL-ETTSWFMTIIIVFVLGGLSFTK--TVI 369 Query: 337 APIGEAKIAENKTGAAMALGSFIGYAPAMFCFSLYGYILDL 377 + I + + + + GA M+L +F + ++ G +L + Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 30.2 bits (68), Expect = 0.018 Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%) Query: 49 TPKNILMIGPTGVGKTEIAR---RLAKLANAPFIKV 81 T +++ G +G GK +AR K N PF+ + Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 41.6 bits (97), Expect = 4e-06 Identities = 32/155 (20%), Positives = 63/155 (40%), Gaps = 5/155 (3%) Query: 114 LTPEQRQLLEQMQADMRQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLVQQSR 173 + +QAD+ P+ E+ ++ P + +AE + Q+S+ Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK--QESK 1049 Query: 174 TTEQSWQQQT-RTSQAAPVQAQPRQSKPAYTQQPYQDLLQTPAHTTAQSKPQQAAPVARA 232 T E++ Q T T+Q V + + + A TQ + T ++ ++ A V + Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE 1109 Query: 233 ADAPKPTAEKKDERRWMVQCGSFRGAEQAETVRAQ 267 A T + ++ + Q + EQ+ETV+ Q Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQ--EQSETVQPQ 1142
>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature. Length = 294 Score = 341 bits (875), Expect = e-121 Identities = 167/257 (64%), Positives = 200/257 (77%), Gaps = 6/257 (2%) Query: 1 MNVIGRTDSRFGPRLTNDLYPEYTVAGRKDWFDFYGYVDLPKFFGVGSHYDVGIWDEGSP 60 +NV+G +RFGP++ ND Y EY +KDWFDFYGY+D P FFG G+ GIW++GSP Sbjct: 39 VNVVGSYHTRFGPQIRNDTYLEYEAFAKKDWFDFYGYIDAPVFFG-GNSTAKGIWNKGSP 97 Query: 61 LFTEIEPRFSIDKLTGLNLAFGPFKEWFIANNYVYDMGDNQSSRQSTWYMGLGTDIDTGL 120 LF EIEPRFSIDKLT +L+FGPFKEW+ ANNY+YDMG N S QSTWYMGLGTDIDTGL Sbjct: 98 LFMEIEPRFSIDKLTNTDLSFGPFKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGL 157 Query: 121 PIKLSANIYAKYQWQNYGAANENEWDGYRFKIKYSIPLTNLFGGRLVYNSFTNFDFGSDL 180 P+ LS N+YAKYQWQNYGA+NENEWDGYRFK+KY +PLT+L+GG L Y FTNFD+GSDL Sbjct: 158 PMSLSLNVYAKYQWQNYGASNENEWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDL 217 Query: 181 ADKSHNN-----KRTSNAIASSHILSLLYEHWKFAFTLRYFHNGGQWNAGEKVNFGDGPF 235 D + + RTSN+IASSHIL+L Y HW ++ RYFHNGGQW K+NFGDGPF Sbjct: 218 GDDNFYDLNGKHARTSNSIASSHILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPF 277 Query: 236 ELKNTGWGTYTTIGYQF 252 +++TGWG Y +GY F Sbjct: 278 SVRSTGWGGYFVVGYNF 294
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 26.7 bits (59), Expect = 0.019 Identities = 5/33 (15%), Positives = 13/33 (39%), Gaps = 1/33 (3%) Query: 17 ELVEKR-QRFATILSIIMLAVYIGFILLIAFAP 48 EL+E R +++ ++ + +L Sbjct: 47 ELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQ 79
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 29.9 bits (67), Expect = 0.007 Identities = 6/21 (28%), Positives = 12/21 (57%) Query: 179 FGNLDDPSSEISQLLRQKPTY 199 GNL++P+ ++ L+ P Sbjct: 75 TGNLEEPAVMVNYFLQGDPYQ 95
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 32.6 bits (74), Expect = 3e-04 Identities = 20/83 (24%), Positives = 32/83 (38%), Gaps = 5/83 (6%) Query: 51 LALLDGEVVGMIGLHLQFHLHHVNWIGEIQELVVMPQARGLNVGSKLLAWAEEEARQAGA 110 L L+ +G I + + N I+++ V R VG+ LL A E A++ Sbjct: 69 LYYLENNCIGRIKIRSNW-----NGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHF 123 Query: 111 EMTELSTNVKRHDAHRFYLREGY 133 L T A FY + + Sbjct: 124 CGLMLETQDINISACHFYAKHHF 146
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.021 Identities = 17/70 (24%), Positives = 26/70 (37%), Gaps = 8/70 (11%) Query: 36 CVVLHGHSGSGKSTLLRSLYANYLPDEGQIQIKHGEEWVDLVTAPARKVVEI------RK 89 VVL G G GKSTL+ +L + I G++ + + E+ R+ Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIV--AYELSEMTAFRR 655 Query: 90 TTIGWVSQFL 99 V F Sbjct: 656 ADAEAVKAFF 665
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 70.2 bits (172), Expect = 4e-16 Identities = 31/109 (28%), Positives = 50/109 (45%), Gaps = 4/109 (3%) Query: 4 VLIIDDDAMVAELNRRYVAQIPGFQCCGTASTLEKAKEIIFNSDTPIDLILLDIYMQKEN 63 +L+ DDDA + + + +++ G+ S I + DL++ D+ M EN Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVR-ITSNAATLWRWI--AAGDGDLVVTDVVMPDEN 61 Query: 64 GLDLLPVLHNARCKSDVIVISSAADAATIKDSLHYGVVDYLIKPFQASR 112 DLLP + AR V+V+S+ T + G DYL KPF + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE 110
>PF06580#Sensor histidine kinase Length = 349 Score = 41.0 bits (96), Expect = 8e-06 Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 18/99 (18%) Query: 442 LIENALE-ALGP-EPGGEISVTLHYRHGWLHCEVNDDGPGIAPDKIDHIFDKGVSTKGSE 499 L+EN ++ + GG+I + +G + EV + G + Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------TKES 310 Query: 500 RGVGLALVKQQVENLGG---SIAVESEPGIFTQFFVQIP 535 G GL V+++++ L G I + + G V IP Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 26.4 bits (58), Expect = 0.012 Identities = 9/28 (32%), Positives = 16/28 (57%) Query: 32 LAIIEHTDVDESLKGQGIGKQLVAKVVE 59 A+IE V + + +G+G L+ K +E Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIE 116
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.8 bits (67), Expect = 0.028 Identities = 36/190 (18%), Positives = 66/190 (34%), Gaps = 14/190 (7%) Query: 44 NHAISLFSAYA-SLVYVTPILGGWLADRLLGNRTAVIAGALLMTLGHVVLGIDTNSTFSL 102 H L + YA P+LG +DR G R ++ + + ++ + L Sbjct: 43 AHYGILLALYALMQFACAPVLGAL-SDRF-GRRPVLLVSLAGAAVDYAIMAT-APFLWVL 99 Query: 103 YLALAIIICGYGLFKSNISCLLGELYDEND-HRRDGGFSLLYAAGNIGSIAAPIACGLAA 161 Y+ + G+ + + + D D R F + A G +A P+ GL Sbjct: 100 YIGRIV----AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMG 155 Query: 162 QWYGWHVGFALAGGGMFIGLLIFLSGHRHFQSTRSMDKKALTSVKF-ALPVWSWLVVMLC 220 + H F A + L FL+G + +++ L L + W M Sbjct: 156 G-FSPHAPFFAAA---ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTV 211 Query: 221 LAPVFFTLLL 230 +A + + Sbjct: 212 VAALMAVFFI 221
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 36.8 bits (85), Expect = 8e-05 Identities = 16/97 (16%), Positives = 36/97 (37%), Gaps = 7/97 (7%) Query: 391 PLDEKQLAALNTEIDNIVTLPELNNLS-----IIYQIKAVSALVKGKTDESYQAINTGID 445 ++ A+ + + T+ LN +S +Y + A + GK +++++ Sbjct: 6 TDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSL-AFNQYQSGKYEDAHKVFQALCV 64 Query: 446 LEMSWLNYVL-LGKVYEMKGMNREAADAYLTAFNLRP 481 L+ + L LG + G A +Y + Sbjct: 65 LDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI 101
>SECA#SecA protein signature. Length = 901 Score = 31.8 bits (72), Expect = 0.005 Identities = 26/144 (18%), Positives = 54/144 (37%), Gaps = 6/144 (4%) Query: 282 HVIDAADVRVQENIEAVNTVLEEIDAHEIPTLLVMNKIDMLDDFEPRIDRDEENK-PIRV 340 ++D +DV N + IDA+ P L ++ + + R+ D + PI Sbjct: 665 ELLDVSDVSETINSIREDVFKATIDAYIPPQSL--EEMWDIPGLQERLKNDFDLDLPIAE 722 Query: 341 WLSAQTGAGIPQLFQALTERLSGEVAQHTLRLPPQEGRLRSRFYQLQAIEKEWMEEDGSV 400 WL + L + + + + + + R + LQ ++ W E ++ Sbjct: 723 WLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAM 782 Query: 401 SLQVRMPIVDWRRLCKQEPALIDY 424 +R I R +++P +Y Sbjct: 783 D-YLRQGIH-LRGYAQKDP-KQEY 803
>cloacin#Cloacin signature. Length = 551 Score = 31.6 bits (71), Expect = 0.006 Identities = 25/81 (30%), Positives = 30/81 (37%), Gaps = 10/81 (12%) Query: 17 GSSKPGGNSEGNGNKGGRDQGPPDLDDIFRKLSKKLGGLGGGKGTGSGGGSSSQGP---- 72 S G +SE N GG G G GGG GTG G S+ P Sbjct: 33 ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTG-GNLSAVAAPVAFG 91 Query: 73 -----RPQLGGRVVTIAAAAI 88 P GG V+I+A A+ Sbjct: 92 FPALSTPGAGGLAVSISAGAL 112
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.6 bits (69), Expect = 0.028 Identities = 12/55 (21%), Positives = 24/55 (43%), Gaps = 1/55 (1%) Query: 165 VVPDDSRLSFDILIPPDQIMGARMGFVVVVELTQRPTRRTKAV-GKIVEVLGDNM 218 +VP+D L L+ I +G ++++ P R + GK+ + D + Sbjct: 359 IVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 27.5 bits (61), Expect = 0.037 Identities = 6/19 (31%), Positives = 7/19 (36%), Gaps = 2/19 (10%) Query: 105 FNGDVQI--ELTGYWTWEQ 121 F G + L W EQ Sbjct: 62 FKGQEDLGNGLKAIWQVEQ 80
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 144 bits (365), Expect = 1e-44 Identities = 86/256 (33%), Positives = 133/256 (51%), Gaps = 8/256 (3%) Query: 7 LAGKNILITGSAQGIGFLLATGLGKYGAQIIINDITAERAELAVKKLHQEGIQAVAAPFN 66 + GK ITG+AQGIG +A L GA I D E+ E V L E A A P + Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 67 VTHKHEIDAAVEHIEKDIGPIDVLVNNAGIQRRHPFTEFPEQEWNDVIAVNQTAVFLVSQ 126 V ID IE+++GPID+LVN AG+ R ++EW +VN T VF S+ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 127 AVTRHMVERKAGKVINICSMQSELGRDTITPYAASKGAVKMLTRGMCVELARHNIQVNGI 186 +V+++M++R++G ++ + S + + R ++ YA+SK A M T+ + +ELA +NI+ N + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 187 APGYFKTEMTKALVEDE--------AFTAWLCKRTPAARWGDPQELIGAAVFLSSKASDF 238 +PG +T+M +L DE P + P ++ A +FL S + Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 239 VNGHLLFVDGGMLVAV 254 + H L VDGG + V Sbjct: 246 ITMHNLCVDGGATLGV 261
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 41.1 bits (96), Expect = 2e-06 Identities = 50/246 (20%), Positives = 79/246 (32%), Gaps = 55/246 (22%) Query: 1 MNKVFVVSVVAAACVFAANAGAKEGKSGFYLTGKAGASVVSLSDQRFLSGDEEETSKYKG 60 M K + VA A FA A A + +Y K G S D F++ Sbjct: 1 MKKTAIAIAVALA-GFATVAQAAPKDNTWYTGAKLGWS--QYHDTGFIN---------NN 48 Query: 61 GDDHDTVFSGGIAAGYDFYPQFSIPVRTELEFYARGKADSKYNVDKDSWSGGYWRDDLKN 120 G H+ G GY P E+ + G+ K +V+ + Sbjct: 49 GPTHENQLGAGAFGGYQVNPYVGF----EMGYDWLGRMPYKGSVE-----------NGAY 93 Query: 121 EVSVNTLMLNAYYDFRNDSAFTPWVSAGIGYARIHQKTTGISTWDYEYGSSGRESLSRSG 180 + L Y +D Y R+ G W + S+ +G Sbjct: 94 KAQGVQLTAKLGYPITDDLDI---------YTRL-----GGMVWRADTKSNVYGKNHDTG 139 Query: 181 SADNFAWSLGAGVRYDVTPDIALDLSYRYLDAGDSSVSYKDEWGDKYKSEVDVKSHDIML 240 + F GV Y +TP+IA L Y++ + GD + + + L Sbjct: 140 VSPVF----AGGVEYAITPEIATRLEYQWT----------NNIGDAHTIGTRPDNGMLSL 185 Query: 241 GMTYNF 246 G++Y F Sbjct: 186 GVSYRF 191
>PF03627#PapG Length = 336 Score = 549 bits (1416), Expect = 0.0 Identities = 192/339 (56%), Positives = 232/339 (68%), Gaps = 7/339 (2%) Query: 1 MKKWFPAFLF-LSLSGCNDALAANQSTIFYSFNDNIYHPQLSVKVTDIVQFIVDINSASS 59 MKKWFPA LF L +SG + A + +FYS + + +V +T QFI + Sbjct: 1 MKKWFPALLFSLCVSGESSAW---NNIVFYSLGNVNSYQGGNVVITQRPQFITSWRPGIA 57 Query: 60 TATLSYVACNGFTWTHGLYWSEYFAWLVVPKHV-SYNGYNIYLELQSKGGFSLD-AEDND 117 T T + GF Y+ EY AW+V PK V + NGY +++E+ +KG +S + DND Sbjct: 58 TVTWNQCNGPGFADGSWAYYREYIAWVVFPKKVMTKNGYPLFIEVHNKGSWSEENTGDND 117 Query: 118 NYYLTKGFAWDE-VNSSGRVCFDIGEKRSLAWSFGGVTLNARLPVDLPKGDYTFPVKFLR 176 +Y+ KG+ WDE +G +C GE L F + LP DLP GDY+ + + Sbjct: 118 SYFFLKGYKWDERAFDAGNLCQKPGETTRLTEKFDDIIFKVALPADLPLGDYSVTIPYTS 177 Query: 177 GIQRNNYDYIGGRYKIPSSLMKTFPFNGTLNFSIKNTGGCRPSAQSLEINHGDLSINSAN 236 G+QR+ Y+G R+KIP ++ KT P + F KN GGCRPSAQSLEI HGDLSINSAN Sbjct: 178 GMQRHFASYLGARFKIPYNVAKTLPRENEMLFLFKNIGGCRPSAQSLEIKHGDLSINSAN 237 Query: 237 NHYAAQTLSVSCDVPTNIRFFLLSNTTPAYSHGQQFSVGLGHGWDSIVSINGVDTGETTM 296 NHYAAQTLSVSCDVP NIRF LL NTTP YSHG++FSVGLGHGWDSIVS+NGVDTGETTM Sbjct: 238 NHYAAQTLSVSCDVPANIRFMLLRNTTPTYSHGKKFSVGLGHGWDSIVSVNGVDTGETTM 297 Query: 297 RWYRAGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP 335 RWY+AGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP Sbjct: 298 RWYKAGTQNLTIGSRLYGESSKIQPGVLSGSATLLMILP 336
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 292 bits (749), Expect = e-105 Identities = 165/167 (98%), Positives = 165/167 (98%) Query: 1 MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE 60 MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE Sbjct: 1 MIRLSLFISLLLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNINPEHVDNSRGE 60 Query: 61 VTKTISISCPYKSGSLWIKVTGNTMGGGQNNVLATNITHFGIALYQGKGMSTPLTLGNGS 120 VTK ISISCPYKSGSLWIKVTGNTMG GQNNVLATNITHFGIALYQGKGMSTPLTLGNGS Sbjct: 61 VTKNISISCPYKSGSLWIKVTGNTMGVGQNNVLATNITHFGIALYQGKGMSTPLTLGNGS 120 Query: 121 GNGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIYN 167 GNGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIYN Sbjct: 121 GNGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIYN 167
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 276 bits (706), Expect = 9e-99 Identities = 153/158 (96%), Positives = 156/158 (98%) Query: 1 MLMSQHAHAADNLTFKGKLIIPACTVQNAEVDWGDIEIQNLVQNGGNQKDFTVDMNCPYS 60 +LMSQH HAADNLTFKGKLIIPACTVQNAEV+WGDIEIQNLVQ+GGNQKDFTVDMNCPYS Sbjct: 16 VLMSQHVHAADNLTFKGKLIIPACTVQNAEVNWGDIEIQNLVQSGGNQKDFTVDMNCPYS 75 Query: 61 LGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNAVTLGSQFTPGKITG 120 LGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNAVTLGSQ TPGKITG Sbjct: 76 LGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNAVTLGSQVTPGKITG 135 Query: 121 TAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASYS 158 TAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASYS Sbjct: 136 TAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASYS 173
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 744 bits (1922), Expect = 0.0 Identities = 244/882 (27%), Positives = 364/882 (41%), Gaps = 67/882 (7%) Query: 2 MRVMKDRI-PFAVNNITCVILLSLFCNAASAVEFNTDVLDAADKKNIDFTRFSEAGYVLP 60 + + K R+ F V + +++ + FN L + D +RF + P Sbjct: 16 LHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPP 75 Query: 61 GQYLLDVIVNGQSISPASLQISFVEPQSSGDKAEKKLPQACLTSDMVRLMGLTAESLDKV 120 G Y +D+ +N + A+ ++F S CLT + MGL S+ + Sbjct: 76 GTYRVDIYLNNGYM--ATRDVTFNTGDSEQGI------VPCLTRAQLASMGLNTASVSGM 127 Query: 121 VYWHDGQCADF-HGLPGVDIRPDTGAGVLRINMPQAWLEYSDATWLPPSRWDDGIPGLML 179 D C + + D G L + +PQA++ ++PP WD GI +L Sbjct: 128 NLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLL 187 Query: 180 DYNLNGTVSRNYQGGDSHQFSYNGTVGGNLGPWRLRADYQGSQEQSRYNGEKTTNRNFTW 239 +YN +G +N GG+SH N G N+G WRLR + S S + + + Sbjct: 188 NYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSS--DSSSGSKNKWQH 245 Query: 240 SRFYLFRAIPRWRANLTLGENNINSDIFRSWSYTGASLESDDRMLPPRLRGYAPQITGIA 299 +L R I R+ LTLG+ DIF ++ GA L SDD MLP RG+AP I GIA Sbjct: 246 INTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIA 305 Query: 300 ETNARVVVSQQGRVLYDSMVPAGPFSIQDLD-SSVRGRLDVEVIEQNGRKKTFQVDTASV 358 A+V + Q G +Y+S VP GPF+I D+ + G L V + E +G + F V +SV Sbjct: 306 RGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSV 365 Query: 359 PYLTRPGQVRYKLVSGRSRGYGHETEGPVFATGEASWGLSNQWSLYGGAVLAGDYNALAA 418 P L R G RY + +G R + E P F GL W++YGG LA Y A Sbjct: 366 PLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNF 425 Query: 419 GAGWDLGVPGTLSADITQSVARIEGERTFQGKSWRLSYSKRFDNADADITFAGYRFSERN 478 G G ++G G LS D+TQ+ + + + G+S R Y+K + + +I GYR+S Sbjct: 426 GIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSG 485 Query: 479 YMTMEQYLNARYR--------------------NDYSSREKEMYTVTLNKNVADWNTSFN 518 Y +R + + ++ +T+ + + + + Sbjct: 486 YFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTS-TLY 544 Query: 519 LQYSRQTYWDIRKTD-YYTVSVNRYFNVFGLQGVAVGLSASRSKYLGRD--NDSAYLRIS 575 L S QTYW D + +N F + LS S 
+K + + L ++ Sbjct: 545 LSGSHQTYWGTSNVDEQFQAGLNTAFE-----DINWTLSYSLTKNAWQKGRDQMLALNVN 599 Query: 576 VPLGT------------GTASYSGSMSND-RYVNMAGYTDT-FNDGLDSYSLNAGLNSGG 621 +P +ASYS S + R N+AG T D SYS+ G GG Sbjct: 600 IPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGG 659 Query: 622 GLTSQRQINAYYSHRSPLANLSANIASLQKGYTSFGVSASGGATITGKGAALHAGGMSGG 681 S A ++R N + S SGG G L G Sbjct: 660 DGNSGSTGYATLNYRGGYGNANIG-YSHSDDIKQLYYGVSGGVLAHANGVTL--GQPLND 716 Query: 682 TRLLVDTDGVGGVPVDGGQVV-TNRWGTGVVTDISSYYRNTTSVDLKRLPDDVEATRSVV 740 T +LV G V+ V T+ G V+ + Y N ++D L D+V+ +V Sbjct: 717 TVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVA 776 Query: 741 ESALTEGAIGYRKFSVLKGKRLFAILRLADGSQPPFGASVTSEKGRELGMVADEGLAWLS 800 T GAI +F G +L L + PFGA VTSE + G+VAD G +LS Sbjct: 777 NVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLS 835 Query: 801 GVTPGETLSVNW--DGKIQCQVNVPETAISDQQLL----LPC 836 G+ + V W + C N S QQLL C Sbjct: 836 GMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 31.5 bits (71), Expect = 0.001 Identities = 41/173 (23%), Positives = 75/173 (43%), Gaps = 29/173 (16%) Query: 29 GMSLPEYWG----EEHVWWDGRAAFHGEVVRPACTLAMEDAWQIIDMGETPVRDL-QNGF 83 G+ LP G +HV F G+++ PACT+ + ++ G+ +++L Q+G Sbjct: 6 GLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPACTVQNAE----VNWGDIEIQNLVQSG- 60 Query: 84 SGPERKFSLRLRNCEFNSQGGNLFSDSRIRVTFDGVRGET---PDKFNLSGQAKGINLQI 140 G ++ F++ + NC ++ ++ +T +G G + P+ SG I L Sbjct: 61 -GNQKDFTVDM-NCPYS------LGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYN 112 Query: 141 ADARGNIARAGKV-MPAIP--LTGNEEALDYTLRIVR----NGKKLEAGNYFA 186 ++ I A + P +TG A TL N + L+AG + A Sbjct: 113 SNN-SGIGNAVTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSA 164
>FIMREGULATRY#Escherichia coli: P pili regulatory PapB protein signature. Length = 104 Score = 168 bits (428), Expect = 3e-58 Identities = 100/104 (96%), Positives = 104/104 (100%) Query: 1 MAHHEIISRAGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHT 60 MAHHE+ISR+GNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGH+ Sbjct: 1 MAHHEVISRSGNAFLLNIRESVLLPGSMSEMHFFLLIGISSIHSDRVILAMKDYLVGGHS 60 Query: 61 RKEVCEKHQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD 104 RKEVCEK+QMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD Sbjct: 61 RKEVCEKYQMNNGYFSTTLGRLIRLNALAARLAPYYTDESSAFD 104
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 599 bits (1546), Expect = 0.0 Identities = 462/478 (96%), Positives = 468/478 (97%) Query: 1 MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV 60 MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV Sbjct: 1 MKTWLMGFSEFLLRYKLVWSETWKIRKQLDTPVREKDENEFLPAHLELIETPVSRRPRLV 60 Query: 61 AYFIMGFLVIAFILSVLGQVEIVATANGKLTLSGRSKEIKPIENSIVKEIIVKEGESVRK 120 AYFIMGFLVIAFILSVLGQVEIVATANGKLT SGRSKEIKPIENSIVKEIIVKEGESVRK Sbjct: 61 AYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRK 120 Query: 121 GDVLLKLTALGAEADTLKTQSSLLQARLEQIRYQILSRSIELNKLPELKLPDESYFQNVS 180 GDVLLKLTALGAEADTLKTQSSLLQARLEQ RYQILSRSIELNKLPELKLPDE YFQNVS Sbjct: 121 GDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVS 180 Query: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTILARINRYENLSRVEKSRLDDF 240 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT+LARINRYENLSRVEKSRLDDF Sbjct: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDF 240 Query: 241 RSLLHKQAIAKHAVLEQENKYVEAANELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300 SLLHKQAIAKHAVLEQENKYVEA NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI 300 Query: 301 LDKLRQTTDSIELLTLELEKNEERQQASVIRAPVSGKVQQLKVHTEGGVVTTAETLMVIV 360 LDKLRQTTD+I LLTLEL KNEERQQASVIRAPVS KVQQLKVHTEGGVVTTAETLMVIV Sbjct: 301 LDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIV 360 Query: 361 PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQKLGL 420 PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQ+LGL Sbjct: 361 PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGL 420 Query: 421 VFNVIVSVEENDLSTGNKHIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLHER 478 VFNVI+S+EEN LSTGNK+IPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESL ER Sbjct: 421 VFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVTESLRER 478
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 1474 bits (3816), Expect = 0.0 Identities = 975/1024 (95%), Positives = 991/1024 (96%) Query: 1 MPTITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ 60 M TITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ Sbjct: 1 MTTITTAQIKSTLQSAKQSAANKLHSAGQSTKDALKKAAEQTRNAGNRLILLIPKDYKGQ 60 Query: 61 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK 120 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK Sbjct: 61 GSSLNDLVRTADELGIEVQYDEKNGTAITKQVFGTAEKLIGLTERGVTIFAPQLDKLLQK 120 Query: 121 YQKAGNKLGGSAENIGDNLGKAGSVLSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSEL 180 YQKAGN LGG AENIGDNLGKAG +LSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSEL Sbjct: 121 YQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSEL 180 Query: 181 AKASIELINQLVDTAASLNNNVNSFSQQLNKLGSVLSNTKHLNGVGNKLQNLPNLDNIGA 240 AKASIELINQLVDT ASLNNNVNSFSQQLN LGSVLSNTKHLNGVGNKLQNLPNLDNIGA Sbjct: 181 AKASIELINQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLNGVGNKLQNLPNLDNIGA 240 Query: 241 GLDTVSGILSAISASFILSNADADTGTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL 300 GLDTVSGILSAISASFILSNADADT TKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL Sbjct: 241 GLDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGL 300 Query: 301 STSAAAAGLIASVVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE 360 STSAAAAGLIAS VTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE Sbjct: 301 STSAAAAGLIASAVTLAISPLSFLSIADKFKRANKIEEYSQRFKKLGYDGDSLLAAFHKE 360 Query: 361 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH 420 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH Sbjct: 361 TGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGIISGILEASKQAMFEH 420 Query: 421 VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFEILSQYNKEYSVERSVLITQQHW 480 VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNF+ILSQYNKEYSVERSVLITQQHW Sbjct: 421 VASKMADVIAEWEKKHGKNYFENGYDARHAAFLEDNFKILSQYNKEYSVERSVLITQQHW 480 Query: 481 DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKEPDEFQKQVFDPLKGNIDLSVIKSS 540 DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEK+ 
DEFQKQVFDPLKGNIDLS KSS Sbjct: 481 DTLIGELAGVTRNGDKTLSGKSYIDYYEEGKRLEKKXDEFQKQVFDPLKGNIDLSDSKSS 540 Query: 541 TLLKFITPLLTPGKEIRERRQSGKYEYITELLVKGVDKWTVKGVQDKGSVYDYSNLIQHA 600 TLLKF+TPLLTPG+EIRERRQSGKYEYITELLVKGVDKWTVKGVQDKG+VYDYSNLIQHA Sbjct: 541 TLLKFVTPLLTPGEEIRERRQSGKYEYITELLVKGVDKWTVKGVQDKGAVYDYSNLIQHA 600 Query: 601 SVGNNQYREIRIESHLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE 660 SVGNNQYREIRIESHLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE Sbjct: 601 SVGNNQYREIRIESHLGDGDDKVFLSAGSANIYAGKGHDVVYYDKTDTGYLTIDGTKATE 660 Query: 661 AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGTDLTETDNLYSVE 720 AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHING +LTETDNLYSVE Sbjct: 661 AGNYTVTRVLGGDVKVLQEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVE 720 Query: 721 ELIGTNRADKFFGSKFTDIFHGADGDDHIEGNDGNDRLYGDKGNDTLRGGNGDDQLYGGD 780 ELIGT RADKFFGSKFTDIFHGADGDD IEGNDGNDRLYGDKGNDTL GGNGDDQLYGGD Sbjct: 721 ELIGTTRADKFFGSKFTDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGD 780 Query: 781 GNDKLTGGVGNNYLNGGDGDDELQVQGNSLAKNVLSGGKGNDKLYGSEGADLLDGGEGND 840 GNDKL G GNNYLNGGDGDDE QVQGNSLAKNVL GGKGNDKLYGSEGADLLDGGEG+D Sbjct: 781 GNDKLIGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDD 840 Query: 841 LLKGGYGNDIYRYLSGYGHHIIDDDGGKDDKLSLADIDFRDVAFKREGNDLIMYKAEGNV 900 LLKGGYGNDIYRYLSGYGHHIIDDDGGK+DKLSLADIDFRDVAFKREGNDLIMYK EGNV Sbjct: 841 LLKGGYGNDIYRYLSGYGHHIIDDDGGKEDKLSLADIDFRDVAFKREGNDLIMYKGEGNV 900 Query: 901 LSIGHKNGITFRNWFEKESGDISNHQIEQIFDKDGRVITPDSLKKAFEYQQSNNQANYVY 960 LSIGHKNGITFRNWFEKESGDISNH+IEQIFDK GR+ITPDSLKKA EYQQ NN+A+YVY Sbjct: 901 LSIGHKNGITFRNWFEKESGDISNHEIEQIFDKSGRIITPDSLKKALEYQQRNNKASYVY 960 Query: 961 GEYASTYADLDNLNPLINEISKIISAAGNFDVKEERSAASLLQLSGNASDFSYGRNSITL 1020 G A Y +LNPLINEISKIISAAG+FDVKEER+AASLLQLSGNASDFSYGRNSITL Sbjct: 961 GNDALAYGSQGDLNPLINEISKIISAAGSFDVKEERTAASLLQLSGNASDFSYGRNSITL 1020 Query: 1021 TASA 1024 T SA Sbjct: 1021 TTSA 1024
>RTXTOXINC#Gram-negative bacterial RTX toxin-activating protein C signature. Length = 170 Score = 316 bits (810), Expect = e-114 Identities = 163/170 (95%), Positives = 167/170 (98%) Query: 1 MNMNNPLEVLGHVSWLWASSPLHRNWPVSLFAINVLPAIRANQYALLTRDNYPVAYCSWA 60 MN+N PLE+LGHVSWLWASSPLHRNWPVSLFAINVLPAI+ANQY LLTRD+YPVAYCSWA Sbjct: 1 MNINKPLEILGHVSWLWASSPLHRNWPVSLFAINVLPAIQANQYVLLTRDDYPVAYCSWA 60 Query: 61 NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR 120 NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR Sbjct: 61 NLSLENEIKYLNDVTSLVAEDWTSGDRKWFIDWIAPFGDNGALYKYMRKKFPDELFRAIR 120 Query: 121 VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVKNKSDFNFSLTG 170 VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVK KSDFNFSLTG Sbjct: 121 VDPKTHVGKVSEFHGGKIDKQLANKIFKQYHHELITEVKRKSDFNFSLTG 170
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 90.7 bits (225), Expect = 4e-23 Identities = 35/128 (27%), Positives = 60/128 (46%) Query: 6 KILLMEDDYDIAALLRLNLQDEGYQIVHEADGARARLLLDKQTWDAVILDLMLPNVNGLE 65 IL+ +DD I +L L GY + ++ A + D V+ D+++P+ N + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 66 ICRYIRQMTSYLPVIIISARTSETHRVLGLEMGADDYLPKPFSIPELIARIKALFRRQEA 125 + I++ LPV+++SA+ + + E GA DYLPKPF + ELI I + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 126 MGQNILLA 133 + Sbjct: 125 RPSKLEDD 132
>PF06580#Sensor histidine kinase Length = 349 Score = 43.7 bits (103), Expect = 1e-07 Identities = 24/137 (17%), Positives = 54/137 (39%), Gaps = 24/137 (17%) Query: 65 LSIETRRLQLRIMMSHSLPLIRADISMIERVITNLLDNAVRH----TPPEGSIRLKVWQE 120 L + + + + R+ + + D+ + ++ L++N ++H P G I LK ++ Sbjct: 229 LQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKD 288 Query: 121 DNRLHVEVADSGPGLTEDMRTHLFRRASVLCHEPSEEPRGGLGLLIVRRMLVLHGGD--- 177 + + +EV ++G + + + G GL VR L + G Sbjct: 289 NGTVTLEVENTGSLALK-----------------NTKESTGTGLQNVRERLQMLYGTEAQ 331 Query: 178 IRLTDSTTGACFRFFLP 194 I+L++ +P Sbjct: 332 IKLSEKQGKVNAMVLIP 348
>PF05860#haemagglutination activity domain. Length = 117 Score = 75.6 bits (186), Expect = 5e-18 Identities = 25/139 (17%), Positives = 45/139 (32%), Gaps = 26/139 (18%) Query: 32 AVITPQNGA---GMDKAANGVPVVNIATPNGAGISHNRFTDYNVGKEGLILNNATGKLNP 88 A ITP ++ T G+ + H+ F +++V G N Sbjct: 1 AQITPDTTLPINSNITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTAFFNNPT---- 55 Query: 89 TQLGGLIQNNPNLKAGGEAKGIINEVTGGNRSLLQGYTEVAGKAANVMVANPYGITCDGC 148 + II+ VTGG+ S + G A N+ + NP GI Sbjct: 56 -----------------NIQNIISRVTGGSVSNIDGLIRANATA-NLFLINPNGIIFGQN 97 Query: 149 GFINTPHATLTTGRPVMNA 167 ++ + + + + Sbjct: 98 ARLDIGGSFVGSTANRLKF 116
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 55.4 bits (133), Expect = 7e-12 Identities = 25/108 (23%), Positives = 50/108 (46%) Query: 20 YQQLLESAAMIAGRDGIAALSLNAVAREAGVSKGGLLHHFPNKQALIYALFARLLAIMEE 79 Q +L+ A + + G+++ SL +A+ AGV++G + HF +K L ++ + + E Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72 Query: 80 AIAALMQKDNISYGRFTRAYVNYLSALTDTQESRQLMVLSLAMPDEPV 127 K R + ++ T T+E R+L++ + E V Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 1097 bits (2838), Expect = 0.0 Identities = 877/878 (99%), Positives = 878/878 (100%) Query: 1 MSYLNLRLYQRNTQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQA 60 MSYLNLRLYQRNTQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQA Sbjct: 1 MSYLNLRLYQRNTQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQA 60 Query: 61 VADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLN 120 VADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLN Sbjct: 61 VADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLN 120 Query: 121 TASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDP 180 TASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDP Sbjct: 121 TASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDP 180 Query: 181 GINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTSWSYNSSDSSSGSK 240 GINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNT+WSYNSSDSSSGSK Sbjct: 181 GINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSK 240 Query: 241 NKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPV 300 NKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPV Sbjct: 241 NKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPV 300 Query: 301 IHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTV 360 IHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTV Sbjct: 301 IHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTV 360 Query: 361 PYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRY 420 PYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRY Sbjct: 361 PYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRY 420 Query: 421 RAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYR 480 RAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYR Sbjct: 421 RAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYR 480 Query: 481 YSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT 540 YSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT Sbjct: 481 
YSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT 540 Query: 541 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNI 600 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNI Sbjct: 541 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNI 600 Query: 601 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD 660 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD Sbjct: 601 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD 660 Query: 661 GNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVL 720 GNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVL Sbjct: 661 GNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVL 720 Query: 721 VKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP 780 VKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP Sbjct: 721 VKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP 780 Query: 781 TRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLA 840 TRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLA Sbjct: 781 TRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLA 840 Query: 841 GKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878 GKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR Sbjct: 841 GKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 29.1 bits (65), Expect = 0.026 Identities = 10/45 (22%), Positives = 17/45 (37%) Query: 53 LFVIVAVCTFFVQSCARKSNHAASFQNYHATIDGKEIAGITNNIS 97 +++ + + +CA S Q IA IT NI+ Sbjct: 6 TLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIA 50
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.7 bits (72), Expect = 0.008 Identities = 16/63 (25%), Positives = 27/63 (42%), Gaps = 2/63 (3%) Query: 26 DKVEAEQSLITVEGDKASMEVPSPQAGIVKEIKVSVGDKTQTGALIMIFDSADGAADAAP 85 + V +T G S E+ + IVKEI V G+ + G +++ + AD Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138 Query: 86 AQA 88 Q+ Sbjct: 139 TQS 141 Score = 31.7 bits (72), Expect = 0.009 Identities = 14/60 (23%), Positives = 29/60 (48%), Gaps = 2/60 (3%) Query: 119 EVTEILVKVGDKV-EAEQSLITVEGDKASMEVPAPFAGTVKEIKVN-VGDKVSTGSLIMI 176 E+ + L + D + L E + + + AP + V+++KV+ G V+T +M+ Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358 Score = 29.8 bits (67), Expect = 0.035 Identities = 20/95 (21%), Positives = 35/95 (36%), Gaps = 3/95 (3%) Query: 230 DKVAAEQSLITVEGDKASMEVPAPFAGVVKELKVNVGDKVKTGSLIMIFEVEGAAPAAAP 289 + VA +T G S E+ +VKE+ V G+ V+ G +++ GA A Sbjct: 81 EIVATANGKLTHSGR--SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAE-ADTL 137 Query: 290 AKQEAAAPAPAAKAEAPAAAPAAKAEGKSEFAEND 324 Q + A + + + + E D Sbjct: 138 KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPD 172
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 29.8 bits (67), Expect = 0.015 Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 10/69 (14%) Query: 187 FISGTGFATDYRRLSGHALKGSEIIRLVEESDPVAELALRRYELRLAKSLAHVVNILDP- 245 +G ++D+R L A + D A+LAL + R+ K++ + Sbjct: 273 VYGISGISSDFRDLEDAAF---------KNGDKRAQLALNVFAYRVKKTIGSYAAAMGGV 323 Query: 246 DVIVLGGGM 254 DVIV G+ Sbjct: 324 DVIVFTAGI 332
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 51.0 bits (122), Expect = 4e-09 Identities = 74/356 (20%), Positives = 126/356 (35%), Gaps = 35/356 (9%) Query: 5 ILSLALGTFGLGMAEFGIMGVLTELAHNVGISIPAAGH---MISYYALGVVVGAPIIALF 61 + ++AL G+G+ IM VL L ++ S H +++ YAL AP++ Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 62 SSRYSLKHILLFLVALCVIGNAMFTLSSSYLMLAIGRLVSGFPHGAFFGVGAIVLSKIIK 121 S R+ + +LL +A + A+ + +L IGR+V+G GA + I Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD--IT 124 Query: 122 PGKVTAAVAGMVSGMTVANLLGIPLGTYLSQEFSWRYTFLLIAVFNIAVMASVYFWVPDI 181 G A G +S ++ P+ L FS F A N + F +P+ Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 182 RDEAKGKLREQ----------FHFLRSPAPWLI--FAATMFGNAGVFAWFSYVKPYMMFI 229 + LR + + A + F + G W + Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG------E 238 Query: 230 SGFSETAMTFIMMLVGLGM---VLGNMLSGRISGRYSPLRIAAVTDFIIVLALLMLFFFG 286 F A T + L G+ + M++G ++ R R + ++L F Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298 Query: 287 GMKTTSLIFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAF--NLGSAVG 340 I + G+ LQ +L + E G G +A +L S VG Sbjct: 299 RGWMAFPIMVLLASGGIG--MPALQAMLSRQV-DEERQGQLQGSLAALTSLTSIVG 351
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 39.7 bits (92), Expect = 7e-05 Identities = 40/264 (15%), Positives = 81/264 (30%), Gaps = 11/264 (4%) Query: 162 LNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTELEKLQAQASGVALLTPEQVQSL 221 A P E E + E + Q S V + + A + + A Q+ Sbjct: 1029 APATPSETTETVAENSK-----QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTN 1083 Query: 222 TASLQVLTDEEKQLITAQQQEQQSLNWLTRLD-ELQQEASRRQQALQQALAEEEKAQPQL 280 + +E Q ++ +++ E QE + + + E QPQ Sbjct: 1084 EVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQA 1143 Query: 281 AALSLAQPARNLRPHWE---RIAEHSTALAHTRQQIEEVNTRLQSTMALRASIRHHAAKQ 337 P N++ A+ T +E+ T + + + + Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203 Query: 338 SAELQQQQQSLNAWLQEHDRFRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLN 397 A Q S ++ ++ R + + ++DR + T+ L+ Sbjct: 1204 PATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPA-TTSSNDRSTVALCDLTSTNTNAVLS 1262 Query: 398 ALAAITLTLTADEVATALAQHAEQ 421 A + + V A++QH Q Sbjct: 1263 DARAKAQFVALN-VGKAVSQHISQ 1285 Score = 33.9 bits (77), Expect = 0.005 Identities = 27/139 (19%), Positives = 54/139 (38%), Gaps = 13/139 (9%) Query: 738 QQDVLAAQSLQKAQAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQ 797 Q DV + S + A+ D A A E T T E KQ + + Q Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPP-------APATPSETTETVAENSKQESKTVEKNEQ 1056 Query: 798 TLVTQTAETLTQHQQHRPGGLSLTVTVEQIQQELAQTHQKLRENTTSQGEIRQQLKQDAD 857 TA+ ++ + V E+AQ+ + +E T++ + ++++ Sbjct: 1057 DATETTAQNREVAKEAKS-----NVKANTQTNEVAQSGSETKETQTTETKETATVEKEEK 1111 Query: 858 NRQQQQTLMQQIAQMTQQV 876 + + + Q++ ++T QV Sbjct: 1112 AKVETEK-TQEVPKVTSQV 1129
>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature. Length = 405 Score = 29.7 bits (66), Expect = 0.021 Identities = 13/70 (18%), Positives = 23/70 (32%), Gaps = 4/70 (5%) Query: 157 KQQHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIYIGTLDAFP 216 K+ ++ I ++Y + + + I T D + + I A Sbjct: 135 KEAQMMNEIAEFYAAPFKKTRAINEKEAFECI-YDSRTRSA--GKD-IVSVKINIDKAKK 190 Query: 217 AQNFPPADYI 226 N P DYI Sbjct: 191 ILNLPECDYI 200
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 97.2 bits (242), Expect = 2e-25 Identities = 33/149 (22%), Positives = 62/149 (41%), Gaps = 9/149 (6%) Query: 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGIQ 63 ILV +D+A IR ++ L + G+ + + + DL++ D ++P + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 FIKHLKRESMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRI 123 + +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I + Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120 Query: 124 SPMAVEEVIEMQGLSLDPTSHRVMAGEEP 152 E L D + G Sbjct: 121 -----EPKRRPSKLEDDSQDGMPLVGRSA 144
>PF06580#Sensor histidine kinase Length = 349 Score = 34.1 bits (78), Expect = 0.001 Identities = 19/105 (18%), Positives = 33/105 (31%), Gaps = 26/105 (24%) Query: 325 LVYNAVNH----TPEGTHITVRWQRVPHGAEFSVEDNGPGIAPEHIPHLTERFYRVDKAR 380 LV N + H P+G I ++ + VE+ G Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306 Query: 381 SRQTGGSGLGLAIVKHAVNH---HESRLNIESTVGKGTRFSFVIP 422 +G GL V+ + E+++ + GK +IP Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 37.9 bits (88), Expect = 7e-05 Identities = 70/347 (20%), Positives = 134/347 (38%), Gaps = 20/347 (5%) Query: 62 KFLWSPLMDRYTPPFFGRRRGWLLATQILLLVAIAAMGFLEPGTQLRWMAALAVVIAFCS 121 +F +P++ + F RR LL + V A M W+ + ++A + Sbjct: 56 QFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAIMAT----APFLWVLYIGRIVAGIT 109 Query: 122 ASQDIVFDAWKTDVLPAEERGAGAAISVLGYRLGMLVSGGLALWLADKWLGWQGMYWLMA 181 + V A+ D+ +ER + GM+ L + ++ A Sbjct: 110 GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG--FSPHAPFFAAA 167 Query: 182 VL-LIPCIIATLLAPEP--TDTIPVPKTLEQAVVAPLRDFFGRNNAWLILLLIVLYKLGD 238 L + + L PE + P+ + + + A L+ + ++ +G Sbjct: 168 ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQ 227 Query: 239 AFAMSLTTTFLIRGVGFDAGEVGVVNKTLGLLATIVGALYGGILMQRLSLFRALLIFGIL 298 A +L F +DA +G+ G+L ++ A+ G + RL RAL+ G++ Sbjct: 228 VPA-ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM-LGMI 285 Query: 299 QGASNAGYWLLSITDKHLYSMGAAVFFENLCGGMGTSAFVALLMTLCNKSFSATQFALLS 358 A GY LL+ + + V GG+G A A+L ++ L+ Sbjct: 286 --ADGTGYILLAFATRGWMAFPIMVLL--ASGGIGMPALQAMLSRQVDEERQGQLQGSLA 341 Query: 359 ALSAVGRVYVGPVAGWFVEAHGWSTF--YLFSVAAAVPGLILLLVCR 403 AL+++ + VGP+ + A +T+ + + AA+ L L + R Sbjct: 342 ALTSLTSI-VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387
>PF06291#Lambda prophage Bor protein Length = 102 Score = 28.9 bits (64), Expect = 0.006 Identities = 12/37 (32%), Positives = 19/37 (51%) Query: 34 NMFKKILFPLVALFMLAGCAKPPTTIEVSPTITLPQQ 70 N KK+LF ++ GCA+ T+ PT P++ Sbjct: 4 NKMKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKE 40
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.043 Identities = 16/73 (21%), Positives = 29/73 (39%), Gaps = 13/73 (17%) Query: 60 ERSALPTPHEIRNHLDDYVIGQEQAKKVLAVAVYNHYKRLRNGDTSNGVELGKSNILLIG 119 E P+ E + ++G+ A + +Y RL D +++ G Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQ----EIYRVLARLMQTD---------LTLMITG 167 Query: 120 PTGSGKTLLAETL 132 +G+GK L+A L Sbjct: 168 ESGTGKELVARAL 180
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 34.3 bits (78), Expect = 0.002 Identities = 34/133 (25%), Positives = 68/133 (51%), Gaps = 15/133 (11%) Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPD- 249 LE A +E + +L R +++ ++ S+ +Q++A ++L E + + Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344 Query: 250 ENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWNARSK 308 ++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ + Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397 Query: 309 VKKDLRQAQEILD 321 V+K L +A L Sbjct: 398 VEKALEEANSKLA 410
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 117 bits (294), Expect = 3e-38 Identities = 49/88 (55%), Positives = 67/88 (76%) Query: 2 NKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTFAVKERAARTGR 61 NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F V+ERAAR GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEITIAAAKVPSFRAGKALKDAV 89 NPQTG+EI I A+KVP+F+AGKALKDAV Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90
>PF08280#M protein trans-acting positive regulator Length = 530 Score = 27.1 bits (60), Expect = 0.021 Identities = 24/138 (17%), Positives = 41/138 (29%), Gaps = 20/138 (14%) Query: 1 MQTQIKVRGYHLDVYQHVNNARYL-------EFLEEARWHGLENSDSFHWMTAH------ 47 +Q I + Y N Y E++ + N FH + Sbjct: 361 LQHFIPETNLFVSPYYKGNQKLYTSLKLIVEEWMAKLPGKRYLNHKHFHLFCHYVEQILR 420 Query: 48 ------NIAFVVVN-ININYRRPAVLSDLLTITSQLQQLNGKSGILSQVITLEPEGQVVA 100 + FV N IN + + + + Q+ L+P+ + Sbjct: 421 NIQPPLVVVFVASNFINAHLLTDSFPRYFSDKSIDFHSYYLLQDNVYQIPDLKPDLVITH 480 Query: 101 DALITFVCIDLKTQKALA 118 LI FV +L A+A Sbjct: 481 SQLIPFVHHELTKGIAVA 498
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.019 Identities = 12/64 (18%), Positives = 24/64 (37%), Gaps = 10/64 (15%) Query: 193 LTVLTQHLGLSLRDCMAFGDAMNDREMLGSVGSGFIMGN----------AMPQLRAELPH 242 TVL Q L + D +A + + ++ + +P+++ P Sbjct: 16 RTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPD 75 Query: 243 LPVI 246 LPV+ Sbjct: 76 LPVL 79
>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature. Length = 171 Score = 31.4 bits (71), Expect = 0.002 Identities = 18/66 (27%), Positives = 30/66 (45%), Gaps = 7/66 (10%) Query: 37 TKEHLLPHFL-EHLGNNHLDI------GVGTGFYLTHVPESSLISLMDLNEASLNAASTR 89 T EHL F+ HL + ++I G TGFY++ + S + D A++ Sbjct: 54 TLEHLYAGFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKV 113 Query: 90 AGESKI 95 ++KI Sbjct: 114 ENQNKI 119
>OMPTIN#Omptin serine protease signature. Length = 317 Score = 528 bits (1360), Expect = 0.0 Identities = 314/317 (99%), Positives = 317/317 (100%) Query: 1 MRAKLLGIVLTTPIAISSFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGGRKVS 60 MRAKLLGIVLTTPIAISSFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGGRKVS Sbjct: 1 MRAKLLGIVLTTPIAISSFASTETLSFTPDNINADISLGTLSGKTKERVYLAEEGGRKVS 60 Query: 61 QLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVDRDWMDSSNPGTWTDESR 120 QLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVD+DWMDSSNPGTWTDESR Sbjct: 61 QLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVDQDWMDSSNPGTWTDESR 120 Query: 121 HPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRDDI 180 HPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRDDI Sbjct: 121 HPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRDDI 180 Query: 181 GSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVEASDNDEHYDPGKRIT 240 GSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVE+SDNDEHYDPGKRIT Sbjct: 181 GSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVESSDNDEHYDPGKRIT 240 Query: 241 YRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHNDNTSDYSKNGA 300 YRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHN+NTSDYSKNGA Sbjct: 241 YRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHNNNTSDYSKNGA 300 Query: 301 GIENYNFITTAGLKYTF 317 GIENYNFITTAGLKYTF Sbjct: 301 GIENYNFITTAGLKYTF 317
>PF06580#Sensor histidine kinase Length = 349 Score = 30.6 bits (69), Expect = 0.012 Identities = 29/184 (15%), Positives = 67/184 (36%), Gaps = 34/184 (18%) Query: 306 EELTRMAKMVSDML-FLAQADNNQLIPEKKMLNLADEVGKVFDFFEALAEDR-GVELRFV 363 + M +S+++ + + N + + LADE+ V + + LA + L+F Sbjct: 191 TKAREMLTSLSELMRYSLRYSNARQVS------LADELTVVDSYLQ-LASIQFEDRLQFE 243 Query: 364 GDECQVAGDPLMLRRALSNLLSNALRY----TPTGETIVVRCQTVDHLVQVTVENPGTPI 419 D + + L+ N +++ P G I+++ + V + VEN G+ Sbjct: 244 NQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA 303 Query: 420 APEHLPRLFDRFYRVDPSRQRKGEGSGIGLAIVK---SIVVAHKGTVAVTSDVRGTKFVI 476 E +G GL V+ ++ + + ++ ++ Sbjct: 304 LKN------------------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 477 ILPA 480 ++P Sbjct: 346 LIPG 349
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 85.7 bits (212), Expect = 2e-21 Identities = 35/117 (29%), Positives = 62/117 (52%) Query: 2 KLLIVEDEKKTGEYLTKGLTEAGFVVDLADNGLNGYHLAMTGDYDLIILDIMLPDVNGWD 61 +L+ +D+ L + L+ AG+ V + N + GD DL++ D+++PD N +D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 IVRMLRSANKGMPILLLTALGTIEHRVKGLELGADDYLVKPFAFAELLARVRTLLRR 118 ++ ++ A +P+L+++A T +K E GA DYL KPF EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 39.0 bits (91), Expect = 3e-05 Identities = 25/189 (13%), Positives = 60/189 (31%), Gaps = 13/189 (6%) Query: 254 QAQTVNSDSLQSVKLPA-GLPSQILLQRPDIMEAEHALM-----AANANIGAARAAFFPS 307 + +S + +K + +I+++ + + L+ A A+ ++ Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQS----- 141 Query: 308 ISLTSGISTASSDLSSLFNASSGMWNFIPKIEIPIFNAGRNQANLDIAEIRQQQSVVNYE 367 SL + + + + P F + L + + ++Q Sbjct: 142 -SLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQN 200 Query: 368 QKIQNAFKEVADALALRQSLNDQISAQQRYLASLQITLQRARTLYQHGAVSYLEVLDAER 427 QK Q + A R ++ +I+ + + L +L A++ VL+ E Sbjct: 201 QKYQ-KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQEN 259 Query: 428 SLFATRQTL 436 L Sbjct: 260 KYVEAVNEL 268
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 697 bits (1801), Expect = 0.0 Identities = 214/1059 (20%), Positives = 441/1059 (41%), Gaps = 54/1059 (5%) Query: 1 MIEWIIRRSVANRFLVLMGALFLSIWGTWTIINTPVDALPDLSDVQVIIKTSYPGQAPQI 60 M + IRR + A+ L + G I+ PV P ++ V + +YPG Q Sbjct: 1 MANFFIRR----PIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQT 56 Query: 61 VENQVTYPLTTTMLSVPGAKTVRGFSQ-FGDSYVYVIFEDGTDPYWARSRVLEYLNQVQG 119 V++ VT + M + + S G + + F+ GTDP A+ +V L Sbjct: 57 VQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116 Query: 120 KLPAGVSAELGP-DATGVGWIYEYALVDRSGKHDLADLRSLQDWFLKYELKTIPDVAEVA 178 LP V + + + ++ V + D+ +K L + V +V Sbjct: 117 LLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQ 176 Query: 179 SVGGVVKEYQVVIDPQRLAQYGISLAEVKSALDASNQEAGGSSIELA------EAEYMVR 232 G ++ +D L +Y ++ +V + L N + + + + Sbjct: 177 LFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235 Query: 233 ASGYLQTLDDFNHIVLKASENGVPVYLRDVAKVQVGPEMRRGIAELNGEGEVAGGVVILR 292 A + ++F + L+ + +G V L+DVA+V++G E IA +NG+ AG + L Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLA 294 Query: 293 SGKNAREVIAAVKDKLETLKSSLPEGVEIVTTYDRSQLIDRAIDNLSGKLLEEFIVVAVV 352 +G NA + A+K KL L+ P+G++++ YD + + +I + L E ++V +V Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354 Query: 353 CALFLWHVRSALVAIISLPLGLCIAFIVMHFQGLNANIMSLGGIAIAVGAMVDAAIVMIE 412 LFL ++R+ L+ I++P+ L F ++ G + N +++ G+ +A+G +VD AIV++E Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414 Query: 413 NAHKRLEEWQHQHPDATLDNKTRWQVITDASVEVGPALFISLLIITLSFIPIFTLEGQEG 472 N + + E D + + ++ AL ++++ FIP+ G G Sbjct: 415 NVERVMME----------DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 473 RLFGPLAFTKTYAMAGAALLAIVVIPILMGYWIRGKIPPESSNPLNRF----------LI 522 ++ + T AMA + L+A+++ P L ++ + E F + Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKP-VSAEHHENKGGFFGWFNTTFDHSV 523 Query: 523 RVYHPLLLKVLHWPKTTLLVAALSVLTVLWPLNKVGGEFLPQINEGDLLYMPSTLPGISA 582 Y + 
K+L LL+ AL V ++ ++ FLP+ ++G L M G + Sbjct: 524 NHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQ 583 Query: 583 AEAASMLQKTDKLIM--SVPEVARVFGKTGKAETATDSAPLEMVETTIQLKPQDQW-RPG 639 +L + + V VF G + + + LKP ++ Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQN---AGMAFVSLKPWEERNGDE 640 Query: 640 MTMDKIIEELDNTVRLPGLANLWVPPIRNRIDMLSTGIKSPIGIKVSGTVLADI-DTMAE 698 + + +I + + + +++ + I +G + + Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQ 700 Query: 699 QIEEVARTVPGVASALAERLEGGRYINVEINREKAARYGMTVADVQLFVTSAVGGAMVGE 758 + A+ + S LE +E+++EKA G++++D+ +++A+GG V + Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760 Query: 759 TVEGIARYPINLRYPQSWRDSPQALRQLPILTPMKQQITLADVADVKVSTGPSMLKTENA 818 ++ + ++ +R P+ + +L + + + + + G L+ N Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNG 820 Query: 819 RPTSWIYIDARDRDMVSVVHDLQKAIAEKVQLKPGTSVAFSGQFELLERANHKLKLMVPM 878 P+ I +A L + +A K L G ++G + ++ +V + Sbjct: 821 LPSMEIQGEAAPGTSSGDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAI 878 Query: 879 TLMIIFVLLYLAFRRVSEALLIISSVPFALVGGIWLLWWMGFHLSVATGTGFIALAGVAA 938 + +++F+ L + S + ++ VP +VG + V G + G++A Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938 Query: 939 EFGVVMLMYLRHAIEAEPSLNNPQTFSEQKLDEALYHGAVLRVRPKAMTVAVIIAGLLPI 998 + ++++ + + +E E + + EA +R+RP MT I G+LP+ Sbjct: 939 KNAILIVEFAKDLMEKE----------GKGVVEATLMAVRMRLRPILMTSLAFILGVLPL 988 Query: 999 LWGTGAGSEVMSRIAAPMIGGMITAPLLSLFIIPAAYKL 1037 GAGS + + ++GGM++A LL++F +P + + Sbjct: 989 AISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 43.2 bits (102), Expect = 8e-07 Identities = 41/201 (20%), Positives = 64/201 (31%), Gaps = 34/201 (16%) Query: 16 AFLFLFAPTAFAAEQTVEAPSVDARAW----------ILMDYASGKVLAEGNADEKLDPA 65 + L A A P + I MD ASG+ L ADE+ Sbjct: 7 CIISLLATLPLAV-HASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMM 65 Query: 66 SLTKIMTSYVVGQALKADKIKLTDMVTVGKDAWATGNPALRGSSVMFLKPGDQVSVADLN 125 S K++ V + A +L + + +P V D ++V +L Sbjct: 66 STFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSP------VSEKHLADGMTVGELC 119 Query: 126 KGVIIQSGNDACIALADYVAGSQESFIGLMNGYAKKLGLTNTT---FQTVHGLDAPGQF- 181 I S N A L V G + + +++G T ++T PG Sbjct: 120 AAAITMSDNSAANLLLATVGGPAG-----LTAFLRQIGDNVTRLDRWETELNEALPGDAR 174 Query: 182 --STARDMA------LLGKAL 194 +T MA L + L Sbjct: 175 DTTTPASMAATLRKLLTSQRL 195
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 39.8 bits (93), Expect = 1e-05 Identities = 58/269 (21%), Positives = 106/269 (39%), Gaps = 23/269 (8%) Query: 71 LLGPLSDRIGRRPVMLAGVVWFIVTCLAILLAQNIEQFTLLRFLQGISLCFIGAVGYAAI 130 +LG LSDR GRRPV+L + V + A + + R + GI+ GAV A I Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYI 120 Query: 131 QESFEEAVCIKITALMANVALIAPLLGPLVG---AAWIHVLPWEGMFVLFAALAAISFFG 187 + + + M+ + GP++G + P F AAL ++F Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAP----FFAAAALNGLNFLT 176 Query: 188 LQRAMPETATRIGEKLSLKELGRDYKLVLKNG-RFVAGALALGFVSLPLLAWIAQSP--I 244 +PE+ L + L G VA +A+ F ++ + Q P + Sbjct: 177 GCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFF----IMQLVGQVPAAL 232 Query: 245 IIITGEQLSSYEYGLLQVPIFGALIAGNL----LLARLTSRRTVRSLIIMGGWPIMIGLL 300 +I GE ++ + + + I +L + + +R R +++G G + Sbjct: 233 WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI 292 Query: 301 VAAAATVISSHAYLWMTAGLSIYAFGIGL 329 + A AT ++ + + + GIG+ Sbjct: 293 LLAFAT----RGWMAFPIMVLLASGGIGM 317
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.7 bits (77), Expect = 0.001 Identities = 34/150 (22%), Positives = 65/150 (43%), Gaps = 6/150 (4%) Query: 218 LLIGVVVLAMAFAEGSANDWL-PLLMVDGHGFSP-TSGSLIYAGFTLGMTVGRFTGGWFI 275 +IGV+ + F + + P +M D H S GS+I T+ + + + GG + Sbjct: 258 FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILV 317 Query: 276 DRYSRVAVVR-ASALM--GALGIGLIIFVDSAWVA-GVSVVLWGLGASLGFPLTISAASD 331 DR + V+ + L ++ S ++ + VL GL + TI ++S Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSL 377 Query: 332 TGPDAPTRVSVVATTGYLAFLVGPPLLGYL 361 +A +S++ T +L+ G ++G L Sbjct: 378 KQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 51.9 bits (124), Expect = 9e-11 Identities = 14/84 (16%), Positives = 32/84 (38%) Query: 2 RRANDPQRREKIIQATLEAVKLYGIHAVTHRKIAALAGVPLGSMTYYFSGIDELLLEAFS 61 + + R+ I+ L G+ + + +IA AGV G++ ++F +L E + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64 Query: 62 RFTEIMSRQYQAFFSDVSDAPGAR 85 + + + P + Sbjct: 65 LSESNIGELELEYQAKFPGDPLSV 88
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.1 bits (73), Expect = 0.006 Identities = 21/106 (19%), Positives = 34/106 (32%), Gaps = 6/106 (5%) Query: 394 LMIGMITFQFSTFSFGMGNAAGLLFAGIML-GFMRANHPTFG-YIPQ--GALSMVKEFGL 449 L++ + +L+ G ++ G A G YI + FG Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF 135 Query: 450 MVFMAGVGLSAGSGINNGLGAIGGQM--LIAGLIVSLVPVVICFLF 493 M G G+ AG + +G A + L + CFL Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 0.010 Identities = 9/18 (50%), Positives = 12/18 (66%) Query: 31 LVLLGPSGAGKSSLLRVL 48 +VL G G GKS+L+ L Sbjct: 599 VVLEGTGGIGKSTLINTL 616
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 28.7 bits (64), Expect = 0.026 Identities = 20/54 (37%), Positives = 26/54 (48%), Gaps = 9/54 (16%) Query: 2 RRVFWLVAAALLLAGCTGEKGIVEKEGYQLDTRRQAQAAYPRIKVLVIHYTADD 55 R+V LV ALL AG I K+G +LD Y ++ L HY +DD Sbjct: 3 RKVLALVIPALLAAGAAHAAEIYNKDGNKLDL-------YGKVDGL--HYFSDD 47
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 75.2 bits (185), Expect = 2e-17 Identities = 70/363 (19%), Positives = 123/363 (33%), Gaps = 65/363 (17%) Query: 13 MKVLVTGATSGLGRNAVEFLCQKGISVRA---------TGRNEAMGKLLEKMGAEFVPAD 63 MK LVTGA +G + + L + G V +A +LL + G +F D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 64 LTELVSSQAKVMLAGIDTLWHCS-------SFTSPWGTQQAFDLANVRATRRLGEWAVAW 116 L + + ++ S +P A+ +N+ + E Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116 Query: 117 GVRNFIHISSPSLYFDYHHHRDIKEDFRPHRFANEFARSKAASEEVINMLSQANPQTRFT 176 +++ ++ SS S+Y + D + +A +K A+E + + S T Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSH-LYGLPAT 174 Query: 177 ILRPQSLFGPHDK--VFIPRLAHMMHHYGSILLPHGGSALVDMTYYENAVHAMWLASQEA 234 LR +++GP + + + + M SI + + G D TY ++ A+ Sbjct: 175 GLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVI 234 Query: 235 CDKLPS--------------GRVYNITNGEHRTLRSIVQKLIDELNIDCRIRSVPYPMLD 280 RVYNI N L +Q L D L I+ + +P D Sbjct: 235 PHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGD 294 Query: 281 MIARSMERLGRKSAKEPPLTHYGVSKLNFDFTLDITRAQEELGYQPVLTLDEGIEKTAAW 340 + T D E +G+ P T+ +G++ W Sbjct: 295 V----------------LETS-----------ADTKALYEVIGFTPETTVKDGVKNFVNW 327 Query: 341 LRD 343 RD Sbjct: 328 YRD 330
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 55.6 bits (134), Expect = 1e-10 Identities = 29/125 (23%), Positives = 52/125 (41%), Gaps = 17/125 (13%) Query: 4 RILVLGASGYIGQHLVRTLSQQGHQILA---------AARHVDRLAKLQLANVSCHKVDL 54 + LV GA+G+IG H+ + L + GHQ++ + RL L HK+DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 55 NWPDNLPALLQN--IDTVYFLVH------SMGEGGDFIAQERQVALNVRDALREVPVKQL 106 + + L + + V+ H S+ + LN+ + R ++ L Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121 Query: 107 IFLSS 111 ++ SS Sbjct: 122 LYASS 126
>INTIMIN#Intimin signature. Length = 939 Score = 256 bits (656), Expect = 1e-78 Identities = 119/378 (31%), Positives = 195/378 (51%), Gaps = 21/378 (5%) Query: 79 GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDN 138 G+ AK ALG + Q + +++WL +G A V+++ N F GS + +P D+ Sbjct: 184 GDYAKDTALGIAGN----QASSQLQAWLQHYGTAEVNLQSGNN--FDGSSLDFLLPFYDS 237 Query: 139 DRYLTWSQLGLTQQDDGLVSNVGVGQRWARGSWLVGYNTFYDNLLDENLQRAGFGAEAWG 198 ++ L + Q+G D +N+G GQR+ ++GYN F D + R G G E W Sbjct: 238 EKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWR 297 Query: 199 EYLRLSANFYQPFAAWHE--QTATQEQRMARGYDLTARMRMPFYQHLNTSVSVEQYFGDR 256 +Y + S N Y + WHE ++R A G+D+ +P Y L + EQY+GD Sbjct: 298 DYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDN 357 Query: 257 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 316 V LFNS NP A ++G+NYTP+PLVT+ ++ G EN + Y+F P + Sbjct: 358 VALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQ 417 Query: 317 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQI 376 Q+ V E ++L GSRYD QRNN LEY+++ L++ + + T ++L + Sbjct: 418 QIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTERSTQKIQLIV 476 Query: 377 RSRYGIRQLIWQGDTQILS-----LTPGAQANSVEGWTLIMPDWQNGEGASNHWRLSVVV 431 +S+YG+ +++W D+ + S G+Q S + + I+P + +G SN ++++ Sbjct: 477 KSKYGLDRIVWD-DSALRSQGGQIQHSGSQ--SAQDYQAILPAYV--QGGSNVYKVTARA 531 Query: 432 EDNQGQRVSSNEITLTLV 449 D G SSN + LT+ Sbjct: 532 YDRNGN--SSNNVLLTIT 547
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 73.7 bits (181), Expect = 2e-17 Identities = 32/117 (27%), Positives = 56/117 (47%), Gaps = 2/117 (1%) Query: 7 ATILLIDDHPMLRTGVKQLISMAPDITVVGEASNGEQGIELAESLDPDLILLDLNMPGMN 66 ATIL+ DD +RT + Q +S A + SN + D DL++ D+ MP N Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 67 GLETLDKLREKSLSGRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALHQA 123 + L ++++ ++V S N + A ++GA YL K + +L+ + +A Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118
>PF06580#Sensor histidine kinase Length = 349 Score = 53.3 bits (128), Expect = 1e-09 Identities = 36/172 (20%), Positives = 73/172 (42%), Gaps = 23/172 (13%) Query: 424 PESSRELLSQIRNELNASWAQLRELLTTFRLQLTEPGLRPALEASCEEYSAKFGFPVKLD 483 P +RE+L+ + + S + +LT +++ + S +F ++ + Sbjct: 190 PTKAREMLTSLSELMRYSLRYSNARQVSLADELT------VVDSYLQLASIQFEDRLQFE 243 Query: 484 YQLPPRL----VPSHQAIHLLQIAREALSNALKH-----SQASEVVVTVAQNDNQVKLTV 534 Q+ P + VP L+Q E N +KH Q ++++ +++ V L V Sbjct: 244 NQINPAIMDVQVPPM----LVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296 Query: 535 QDNGCGVPENAIRSNHYGMIIMRDRAQSLRG-DCRVRRRESGGTEVVVTFIP 585 ++ G +N S G+ +R+R Q L G + +++ E G + IP Sbjct: 297 ENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 31.0 bits (70), Expect = 0.011 Identities = 35/166 (21%), Positives = 60/166 (36%), Gaps = 22/166 (13%) Query: 258 IMSLLYLATFGSFIGFSAGFAMLSKTQFPDVQILQYAFFGPFIGALARSA---GGALSDR 314 I+S + L+ + I A A L K + + FFG F S ++ Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533 Query: 315 LGGTRVTLVNFILMAIFSGLLFLTLPTD----GQGGSFMAFFAVFLALFLTAGLGSGSTF 370 LG T L+ + L+ +LFL LP+ G F+ L +G+T Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTM----------IQLPAGATQ 583 Query: 371 QMISVIFRKLTMDRVKAEGGSDER-----AMREAATDTAAALGFIS 411 + + ++T +K E + E + A + F+S Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVS 629
>PF08280#M protein trans-acting positive regulator Length = 530 Score = 29.8 bits (67), Expect = 0.043 Identities = 21/105 (20%), Positives = 36/105 (34%), Gaps = 2/105 (1%) Query: 526 PIDVELTESCLIENDELALSVIQQFSRLGAQVHLDDFGTGYSSLSQLARFPIDAIKLDQV 585 P+ V S I L S + FS + + ++ Q+ D + Sbjct: 425 PLVVVFVASNFINAHLLTDSFPRYFS--DKSIDFHSYYLLQDNVYQIPDLKPDLVITHSQ 482 Query: 586 FVRDIHKQPVSQSLVRAIVAVAQALNLQVIAEGVESAKEDAFLTK 630 + +H + V I L++Q + V+ K A LTK Sbjct: 483 LIPFVHHELTKGIAVAEISFDESILSIQELMYQVKEEKFQADLTK 527
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 49.3 bits (117), Expect = 4e-09 Identities = 50/260 (19%), Positives = 97/260 (37%), Gaps = 22/260 (8%) Query: 4 LSGKRILVTGVASKLSIAYGIAQAMHREGAEL-AFTYQNDKLKGRVEEFAAQLGSDIVLQ 62 + GK +TG A I +A+ + +GA + A Y +KL+ V A+ Sbjct: 6 IEGKIAFITGAAQ--GIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63 Query: 63 CDVAEDTSIDTMFAELGKVWPKFDGFVHSIGF---APGDQLDGDYVNAVTREGFKIAHDI 119 DV + +ID + A + + D V+ G L + A F + Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEAT----FSVN--- 116 Query: 120 SSYSFVAMAKACRSMLNP-GSALLTLSYLGAERAIPNYNVMGLAKASLEANVRYMANAMG 178 S+ F A + M++ +++T+ A + +KA+ + + + Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176 Query: 179 PEGVRVNAISAGPIRTLAASGI--------KDFRKMLAHCEAVTPIRRTVTIEDVGNSAA 230 +R N +S G T + + + L + P+++ D+ ++ Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVL 236 Query: 231 FLCSDLSAGISGEVVHVDGG 250 FL S + I+ + VDGG Sbjct: 237 FLVSGQAGHITMHNLCVDGG 256
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 55.0 bits (132), Expect = 1e-11 Identities = 17/65 (26%), Positives = 32/65 (49%) Query: 1 MTSKLEIRHKQRQDEIINAARRCFRLCGFHAASMSQIASEAQLSVGQIYRYFANKDAIIE 60 M K + ++ + I++ A R F G + S+ +IA A ++ G IY +F +K + Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EMVRR 65 E+ Sbjct: 61 EIWEL 65
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 48.3 bits (115), Expect = 2e-08 Identities = 28/133 (21%), Positives = 55/133 (41%), Gaps = 10/133 (7%) Query: 41 PVSVVSELTGR-TSAALSAEVRPQVGGIIQKRLFKEGDLVKAGQPLYQIDAASYQAAWNE 99 V +V+ G+ T + S E++P I+++ + KEG+ V+ G L ++ A +A + Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138 Query: 100 ARAALQQAQALVKADCQKAQRYARLVKENGVSQQDADDAQSTCAQDKASV--------EA 151 +++L QA+ Q R L K + D Q+ ++ + Sbjct: 139 TQSSLLQARLEQ-TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFST 197 Query: 152 KKAALETARINLD 164 + +NLD Sbjct: 198 WQNQKYQKELNLD 210 Score = 31.7 bits (72), Expect = 0.005 Identities = 15/116 (12%), Positives = 32/116 (27%), Gaps = 9/116 (7%) Query: 83 QPLYQIDAASYQAAWN--EARAALQQAQALVKADCQKAQRYARLVKENGVSQQDADDAQS 140 L A + A + K+ ++ + KE Q ++ Sbjct: 241 SSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEE--YQLVTQLFKN 298 Query: 141 TCAQDKASVEAKKAALET----ARINLDWTTVTAPISGRI-GISSVTPGALVTASQ 191 L + + AP+S ++ + T G +VT ++ Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1161 bits (3005), Expect = 0.0 Identities = 583/1033 (56%), Positives = 760/1033 (73%), Gaps = 6/1033 (0%) Query: 3 SRFFVRRPVFAWVIAILIMLAGILAIRTLPVAQYPDVAPPTIKISATYTGASAETLENSV 62 + FF+RRP+FAWV+AI++M+AG LAI LPVAQYP +APP + +SA Y GA A+T++++V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 63 TQVIEQQLTGLDNLLYFSSTSSSDGSVSINVTFEQGTDPDTAQVQVQNKIQQAESRLPSE 122 TQVIEQ + G+DNL+Y SSTS S GSV+I +TF+ GTDPD AQVQVQNK+Q A LP E Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 123 VQQTGVTVEKSQSNFLLIAAVYDTTDKASSSDIADWLVSNVQDPLARVEGVGSLQVFGAE 182 VQQ G++VEKS S++L++A + DI+D++ SNV+D L+R+ GVG +Q+FGA+ Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ 181 Query: 183 YAMRIWLDPAKLASYSLMPSDVQSAIEAQNVQVTAGKIGALPSPNTQQLTATVRAQSRLQ 242 YAMRIWLD L Y L P DV + ++ QN Q+ AG++G P+ QQL A++ AQ+R + Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 243 TVDQFKNIIVKSQSDGAVVRIKDVARVEMGSEDYTAIGKLNGHPSAGVAVMLSPGANALN 302 ++F + ++ SDG+VVR+KDVARVE+G E+Y I ++NG P+AG+ + L+ GANAL+ Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 303 TATLVKDKIAEFQRNMPQGYDIAYPKDSTEFIKISVEDVIQTLFEAIVLVVCVMYLFLQN 362 TA +K K+AE Q PQG + YP D+T F+++S+ +V++TLFEAI+LV VMYLFLQN Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 363 LRATLIPALAVPVVLLGTFGVLALFGYSINTLTLFAMVLAIGLLVDDAIVVVENVERIMR 422 +RATLIP +AVPVVLLGTF +LA FGYSINTLT+F MVLAIGLLVDDAIVVVENVER+M Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 423 DEGLPAREATEKSMGEISGALVAIALVLSAVFLPMAFFGGSTGVIYRQFSITIISAMLLS 482 ++ LP +EATEKSM +I GALV IA+VLSAVF+PMAFFGGSTG IYRQFSITI+SAM LS Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 483 VVVALTLTPALCGSVL----QHVPPHKKGFFGAFNRFYRRTEDKYQRGVIYVLRRAARTM 538 V+VAL LTPALC ++L +K GFFG FN + + + Y V +L R + Sbjct: 482 
VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 539 GLYVVLGGGMALMMWKLPGSFLPTEDQGEIMVQYTLPAGATAARTAEVNRQIVDWFLINE 598 +Y ++ GM ++ +LP SFLP EDQG + LPAGAT RT +V Q+ D++L NE Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 599 KANTDVIFTVDGFSFSGSGQNTGMAFVSLKNWSQRKGAENTAQAIALRATKELGTIRDAT 658 KAN + +FTV+GFSFSG QN GMAFVSLK W +R G EN+A+A+ RA ELG IRD Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 659 VFAMTPPAVDGLGQSNGFTFELLANGGTDRETLLQMRNQLIEKANQSP-ELHSVRANDLP 717 V PA+ LG + GF FEL+ G + L Q RNQL+ A Q P L SVR N L Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721 Query: 718 QMPQLQVDIDSNKAVSLGLSLNDVTDTLSSAWGGTYVNDFIDRGRVKKVYIQGDSEFRSA 777 Q ++++D KA +LG+SL+D+ T+S+A GGTYVNDFIDRGRVKK+Y+Q D++FR Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781 Query: 778 PSDLGKWFVRGSDNAMTPFSAFATTRWLYGPERLVRYNGSAAYEIQGENATGFSSGDAMT 837 P D+ K +VR ++ M PFSAF T+ W+YG RL RYNG + EIQGE A G SSGDAM Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841 Query: 838 KMEELANSLPAGTTWAWSGLSLQEKLASGQALSLYAVSILVVFLCLAALYESWSVPFSVI 897 ME LA+ LPAG + W+G+S QE+L+ QA +L A+S +VVFLCLAALYESWS+P SV+ Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901 Query: 898 LVIPLGLLGAALAAWMRDLNNDVYFQVALLTTIGLSSKNAILIVEFA-EAAVAEGYSLSR 956 LV+PLG++G LAA + + NDVYF V LLTTIGLS+KNAILIVEFA + EG + Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961 Query: 957 AALRAAQTRLRPIIMTSLAFIAGVMPLAIATGAGANSRIAIGTGIIGGTLTATLLAIFFV 1016 A L A + RLRPI+MTSLAFI GV+PLAI+ GAG+ ++ A+G G++GG ++ATLLAIFFV Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021 Query: 1017 PLFFVLVKRLFAG 1029 P+FFV+++R F G Sbjct: 1022 PVFFVVIRRCFKG 1034 Score = 75.3 bits (185), Expect = 1e-15 Identities = 53/330 (16%), Positives = 117/330 (35%), Gaps = 19/330 (5%) Query: 721 QLQVDIDSNKAVSLGLSLNDVTDTLSSA----WGGTYVNDFIDRGRVKKVYIQGDSEFRS 776 +++ +D++ L+ 
DV + L G G+ I + F++ Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242 Query: 777 APSDLGKWFVRGSDN-AMTPFSAFATTRWLYGPER--LVRYNGSAA-----YEIQGENAT 828 P + GK +R + + ++ A L G + R NG A G NA Sbjct: 243 -PEEFGKVTLRVNSDGSVVRLKDVARVE-LGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 829 GFSSGDAMTKMEELANSLPAG--TTWAWSGLSLQEKLASGQALSLYAVSILVVFLCLAAL 886 + K+ EL P G + + + +L+ +LV + + Sbjct: 301 DTAKA-IKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV-MYLF 358 Query: 887 YESWSVPFSVILVIPLGLLGAALAAWMRDLNNDVYFQVALLTTIGLSSKNAILIVEFAEA 946 ++ + +P+ LLG + + ++ IGL +AI++VE E Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418 Query: 947 AVAEGYSLSRAALRAAQTRLR-PIIMTSLAFIAGVMPLAIATGAGANSRIAIGTGIIGGT 1005 + E + A + ++++ ++ ++ A +P+A G+ I+ Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478 Query: 1006 LTATLLAIFFVPLFFVLVKRLFAGKPRRQE 1035 + L+A+ P + + + + + Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENK 508
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.0 bits (65), Expect = 0.048 Identities = 24/166 (14%), Positives = 49/166 (29%), Gaps = 11/166 (6%) Query: 70 DVQKAIADIDSARALYGQTNASLFPTVNAALSSTRSRSLANGTGTTAEADGTVSSYTLDL 129 A AD ++ Q +RS L + + + + Sbjct: 128 TALGAEADTLKTQSSLLQARL----EQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEE 183 Query: 130 FGRNQSLSRAARETWLASEFTAQNTRLTLIAEISTAWLTLAADNSNLALAKETMASAENS 189 R SL + TW ++ + AE T + + + ++ Sbjct: 184 VLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTV-------LARINRYENLSRVEKSR 236 Query: 190 LKIIQRQQQVGTAAATDVSEAMSVYQQARASVASYQTQVMQDKNAL 235 L A V E + Y +A + Y++Q+ Q ++ + Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEI 282
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 67.9 bits (166), Expect = 1e-14 Identities = 68/312 (21%), Positives = 114/312 (36%), Gaps = 18/312 (5%) Query: 5 SLSWALILGLLAGIGPMCTDLYLPALPEMSEQLAATTTITQLTLTASLIGLGVGQLLFGP 64 L L L +G L +P LP + L + +T L + Q P Sbjct: 6 PLIVILSTVALDAVG---IGLIMPVLPGLLRDLVHSNDVTA-HYGILLALYALMQFACAP 61 Query: 65 ----LSDKIGRKRPLILSLLLFIVSSILCATTNNIYWLVVWRFIQGIAGAGGSVLSRSIA 120 LSD+ GR+ L++SL V + AT ++ L + R + GI GA G+V IA Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA 121 Query: 121 RDKYQGVTLTQFFALLMTVNGLAPVLSPVLGGYIVSTFDWRTLFWVMAEISTVLLLGCLL 180 D G + F + G V PVLGG + F F+ A ++ + L Sbjct: 122 -DITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPFFAAAALNGLNFLTGCF 179 Query: 181 FINETLPENKRGSSL----LLTGRSVVQNRRFMRFCLIQSFMLAGLFAYIGSSSFVL--Q 234 + E+ +R L + + + F + L + ++ +V+ + Sbjct: 180 LLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFF-IMQLVGQVPAALWVIFGE 238 Query: 235 KEFGFSPMQFSLVFGLNGI-GLIIASWIFSRLARRINAMTLLRGGLIAAILCALLTVLCA 293 F + + GI + + I +A R+ L G+IA +L Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298 Query: 294 WVQLPIPALVAL 305 + P +V L Sbjct: 299 RGWMAFPIMVLL 310
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.007 Identities = 9/16 (56%), Positives = 14/16 (87%) Query: 38 LVGESGSGKSLIAKAI 53 + GESG+GK L+A+A+ Sbjct: 165 ITGESGTGKELVARAL 180
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 424 bits (1093), Expect = e-151 Identities = 95/346 (27%), Positives = 178/346 (51%), Gaps = 2/346 (0%) Query: 5 SDDKTEAPTPHRLEKAREEGQIPRSRELTSLLILLVGVSVIWFGGVSLARRLSGMLSAGL 64 S +KTE PTP ++ AR++GQ+ +S+E+ S +++ +++ S ++ + Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLML--I 59 Query: 65 HFDHSIINDPNLILGQIILLIREAMLALLPLISGVVLVAIISPVMLGGLVFSGKSLQPKF 124 + S + + + ++ E PL++ L+AI S V+ G + SG++++P Sbjct: 60 PAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDI 119 Query: 125 SKLNPLPGIKRMFSAQTGAELLKAILKTILVGSVTGFFLWHHWPQMMRLMAESPITAMGN 184 K+NP+ G KR+FS ++ E LK+ILK +L+ + + + +++L Sbjct: 120 KKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPL 179 Query: 185 AMDLVGLCALLVVLGVIPMVGFDVFFQIFSHLKKLRMSRQDIRDEFKQSEGDPHVKGRIR 244 ++ ++ +G + + D F+ + ++K+L+MS+ +I+ E+K+ EG P +K + R Sbjct: 180 LGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRR 239 Query: 245 QMQRAAARRRMMADVPKADVIVNNPTHYSVALQYDENKMSAPKVVAKGAGLVALRIREIG 304 Q + R M +V ++ V+V NPTH ++ + Y + P V K +R+I Sbjct: 240 QFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIA 299 Query: 305 AENNVPTLEAPPLARALYRHAEIGQQIPGQLYAAVAEVLAWVWQLK 350 E VP L+ PLARALY A + IP + A AEVL W+ + Sbjct: 300 EEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 88.3 bits (219), Expect = 9e-24 Identities = 30/105 (28%), Positives = 51/105 (48%), Gaps = 3/105 (2%) Query: 7 KFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGLDALNKLQAGGYGFVISDWNMPNMDGL 66 LV DD + +R ++ L G++ V + + AG V++D MP+ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 67 ELLKTIRADGAMSALPVLMVTAEAKKENIIAAAQAGASGYVVKPF 111 +LL I+ LPVL+++A+ I A++ GA Y+ KPF Sbjct: 64 DLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 66.4 bits (162), Expect = 3e-14 Identities = 35/188 (18%), Positives = 73/188 (38%), Gaps = 23/188 (12%) Query: 1 MSKIRVLSVDDSALMRQIMTEIINSHSDMEMVATAPDPLVARDLIKKFNPDVLTLDVEMP 60 M+ +L DD A +R ++ + ++ V + I + D++ DV MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 RMDGLDFLEKLMRLRPMPVVMVSSLTGKGS-EVTLRALELGAIDFVTKPQLGIREGMLAY 119 + D L ++ + RP V+V ++ + + ++A E GA D++ KP + E + Sbjct: 59 DENAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGII 115 Query: 120 SEMIAEKVRTAAKASLAAHKPLSVPTTLKAGPLLSSEKLIAIGASTGGTEAIRHVLQPLP 179 +AE R +K + + + +G S E R + + + Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMPL-----------------VGRSAAMQEIYRVLARLMQ 158 Query: 180 LSSPALLI 187 ++ Sbjct: 159 TDLTLMIT 166
>PF06580#Sensor histidine kinase Length = 349 Score = 42.9 bits (101), Expect = 3e-06 Identities = 22/151 (14%), Positives = 49/151 (32%), Gaps = 52/151 (34%) Query: 359 ELDKSLIERIIDPLT--HLVRNSLDHGIELPEKRLAAGKNSVGNLILSAEHQGGNICIEV 416 +++ ++++ + P+ LV N + HGI G ++L G + +EV Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEV 296 Query: 417 TDDGAGLNRERILAKAASQGLTVSENMSDDEVAMLIFAPGFSTAEQVTDVSGRGVGMDVV 476 + G+ + G G+ V Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318 Query: 477 KRNIQEM---DGHVEIQSKQGTGTTIRILLP 504 + +Q + + +++ KQG +L+P Sbjct: 319 RERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.8 bits (69), Expect = 0.009 Identities = 22/93 (23%), Positives = 35/93 (37%), Gaps = 11/93 (11%) Query: 46 LISISSPKELIQIAEYFRTPLATAVTGGDRISNSESPIPGGGDDYTQSQGEVNKQPNIEE 105 L +SSP A P + G + ++ PGGGDD GE +++ Sbjct: 384 LADVSSPTAAAGGAGGGEPPKKRDPSAG---AGTDPGGPGGGDD-----GEDPFGEWLDD 435 Query: 106 LKKRM---EQSRLRKLRGDLDQLIESDPKLRAL 135 R+ + L+ R L + + S P L Sbjct: 436 EVARLRLRGRWLLKPRRAALIEALRSAPALAGC 468
>PF05844#YopD protein Length = 295 Score = 33.1 bits (75), Expect = 0.001 Identities = 12/28 (42%), Positives = 22/28 (78%), Gaps = 2/28 (7%) Query: 76 MDLLALLYRLMAKSRQMGMFSLERDIEN 103 ++LL +L+R+ K+R++G+ L+RD EN Sbjct: 74 VELLLILFRIAQKARELGV--LQRDNEN 99
>FLAGELLIN#Flagellin signature. Length = 507 Score = 224 bits (571), Expect = 1e-68 Identities = 244/553 (44%), Positives = 298/553 (53%), Gaps = 46/553 (8%) Query: 2 AQVINTNSLSLITQNNINKNQSALSSSIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 61 AQVINTNSLSL+TQNN+NK+QS+LSS+IERLSSGLRINSAKDDAAGQAIANRFTSNIKGL Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60 Query: 62 TQAARNANDGISVAQTTEGALSEINNNLQRIRELTVQASTGTNSDSDLDSIQDEIKSRLD 121 TQA+RNANDGIS+AQTTEGAL+EINNNLQR+REL+VQA+ GTNSDSDL SIQDEI+ RL+ Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120 Query: 122 EIDRVSGQTQFNGVNVLAKDGSMKIQVGANDGQTITIDLKKIDSDTLGLSGFNVNGKGAV 181 EIDRVS QTQFNGV VL++D MKIQVGANDG+TITIDL+KID +LGL GFNVNG Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180 Query: 182 ANTAATKDDLVAASVSAAVGNEYTVSAGLSKSTAADVIASLTDGATVTAAGVSNGFAAGA 241 ++ + T V + Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240 Query: 242 TGNAYKFNQANNTFTYNTTSTAAELQSYLTPKAGDTATFSVEIGSTKQDVVLASDGKITA 301 N + T + T+ A A G + D T Sbjct: 241 AENNTAVDLFKTTKSTAGTAEA-------------KAIAGAIKGGKEGDTFDYKGVTFTI 287 Query: 302 KDGSKLYIDTTGNLTQNGGGTLEEATLNGLAFNHSGPAAAVQSTITTADGTSIVLAGSGD 361 +K D G ++ G T T A+ + L S + Sbjct: 288 --DTKTGNDGNGKVSTTINGEKVTLT-------------VADITAGAANVDAATLQSSKN 332 Query: 362 FGTTKTAGAINVTGAVISADALLSASKATGFTSGAYTVGTDGVVKSGGNDVYNKADGTGL 421 T+ G + A LS +A G + +G + Sbjct: 333 VYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKT 392 Query: 422 TTDNTTKYYLQDDGSVTNGSGKAVYVDATGKLTTDAETKAATTADPLKALDEAISSIDKF 481 + + DA +TA+PL ++D A+S +D Sbjct: 393 MF------------------IDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAV 434 Query: 482 RSSLGAVQNRLDSAVTNLNNTTTNLSEAQSRIQDADYATEVSNMSKAQIIQQAGNSVLAK 541 RSSLGA+QNR DSA+TNL NT TNL+ A+SRI+DADYATEVSNMSKAQI+QQAG SVLA+ Sbjct: 435 RSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQ 494 Query: 542 ANQVPQQVLSLLQ 554 ANQVPQ VLSLL+ Sbjct: 495 ANQVPQNVLSLLR 507
>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature. Length = 538 Score = 32.7 bits (74), Expect = 0.003 Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 2/72 (2%) Query: 214 NGMEVSVAAQNAQLTVNNVAIENSSNTISDALENITLNLNDVTTGNQTLTITQDTSKAQT 273 N E +VAA+N + + A+ + +S AL T++L V+T LT T T ++ Sbjct: 236 NSSERAVAARNKAEELVSAALYSRPELLSQALSGKTVDLKIVSTS--LLTPTSLTGGEES 293 Query: 274 AIKDWVNAYNSL 285 +KD VNA L Sbjct: 294 MLKDQVNALKGL 305
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 49.0 bits (117), Expect = 2e-08 Identities = 32/129 (24%), Positives = 58/129 (44%), Gaps = 20/129 (15%) Query: 132 AMMLH-IRQQAQAQLPEAITQAVIGRPINFQGLGGDEANTQAQGILERAAKRAGFKDVVF 190 M+ H I+Q + ++ P+ + + + I E +A+ AG ++V Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQV-------ERRAIRE-SAQGAGAREVFL 140 Query: 191 QYEPVAAGLDYEATLQEEKRVLVVDIGGGTTDCSLLLMGPQWRARLDREASLLGHSGCRI 250 EP+AA + + E +VVDIGGGTT+ +++ + ++ S RI Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189 Query: 251 GGNDLDIAL 259 GG+ D A+ Sbjct: 190 GGDRFDEAI 198 Score = 34.7 bits (80), Expect = 6e-04 Identities = 32/137 (23%), Positives = 56/137 (40%), Gaps = 23/137 (16%) Query: 332 RLSYRLV---RSAEESKIALSSV--AETRASLPFISDELAT------LISQQGLESALSQ 380 R +Y + +AE K + S + + LA ++ + AL + Sbjct: 203 RRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQE 262 Query: 381 PLARILEQVQLALDNAQEKPDV--------IYLTGGSARSPLIKKALAEQLPGIPIAGGD 432 PL I+ V +AL+ Q P++ + LTGG A + + L E+ GIP+ + Sbjct: 263 PLTGIVSAVMVALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIPVVVAE 319 Query: 433 D-FGSVTAGLARWAEVV 448 D V G + E++ Sbjct: 320 DPLTCVARGGGKALEMI 336
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 43.6 bits (103), Expect = 4e-07 Identities = 42/195 (21%), Positives = 77/195 (39%), Gaps = 18/195 (9%) Query: 4 MPKFRVSLFSLALMLAVPFAPQAVAKTVAATTASQPEIASGSAMI-VDLNTNKVIYSNHP 62 M R+ + SL + +P A A + + S+ +++ MI +DL + + + + Sbjct: 1 MRYIRLCIISL--LATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRA 58 Query: 63 DLVRPIASISKLMTAMVVLDARLPLDEKLKVDISQTPEMKGVYSRV---RLNSEISRKDM 119 D P+ S K++ VL DE+L+ I + YS V L ++ ++ Sbjct: 59 DERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGEL 118 Query: 120 LLLALMSSENRAAASLAHHYPGGYKAFIKAMNAKAKSLGMNNTRFV--EPTGLS-----V 172 A+ S+N +AA+L GG + A + +G N TR E Sbjct: 119 CAAAITMSDN-SAANLLLATVGG----PAGLTAFLRQIGDNVTRLDRWETELNEALPGDA 173 Query: 173 HNVSTARDLTKLLIA 187 + +T + L Sbjct: 174 RDTTTPASMAATLRK 188
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 28.3 bits (63), Expect = 0.018 Identities = 5/33 (15%), Positives = 16/33 (48%), Gaps = 2/33 (6%) Query: 152 WLHNLDQHLKHW-VWLILVVVL-VVGVRWWLKR 182 L + ++ + W++L ++ + R L++ Sbjct: 215 VLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQ 247
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 115 bits (290), Expect = 2e-33 Identities = 72/253 (28%), Positives = 119/253 (47%), Gaps = 12/253 (4%) Query: 3 QVAIITASDSGIGKECALLLAQQGFDIGITWHSDEEGAKDTAREVVSHGVRAEIVQLDLG 62 ++A IT + GIG+ A LA QG I ++ E+ K + AE D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKA-EARHAEAFPADVR 67 Query: 63 NLPEGAQALEKLIQRLGRIDVLVNNAGAMTKAPFLDMAFDEWRKIFTVDVDGAFLCSQIA 122 + + ++ + +G ID+LVN AG + ++ +EW F+V+ G F S+ Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 123 ARQMVKQGQGGRIINITSVHEHTPLPDASAYTAAKHALGGLTKAMALELVRHKILVNAVA 182 ++ M+ + + G I+ + S P +AY ++K A TK + LEL + I N V+ Sbjct: 128 SKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 183 PGAIATPMN-----GMDGGD--VKPDAEP---SIPLRRFGTTHEIASLVVWLCSEGANYT 232 PG+ T M +G + +K E IPL++ +IA V++L S A + Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 233 TGQSLIVDGGFML 245 T +L VDGG L Sbjct: 247 TMHNLCVDGGATL 259
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 29.3 bits (66), Expect = 0.018 Identities = 32/127 (25%), Positives = 53/127 (41%), Gaps = 5/127 (3%) Query: 122 GAKAMREAVPAHLPVSVKVRLGWDSGEK-KFEIADAVQQAGATELVVHGRTKEQGY-RAE 179 G EA+ ++ + +G + E+ K EI A E+ V GR +G R Sbjct: 190 GGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGF 249 Query: 180 HIDWQAIGE-IRQRLNIPVIANGEIWDWQSAQQCMAISGCDAVMIGRGALNIPNLSRVVK 238 ++ I E +++ L V A + + IS V+ G GAL + NL R++ Sbjct: 250 TLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL-LRNLDRLL- 307 Query: 239 YNEPRMP 245 E +P Sbjct: 308 MEETGIP 314
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.4 bits (68), Expect = 0.027 Identities = 19/70 (27%), Positives = 28/70 (40%), Gaps = 6/70 (8%) Query: 503 LHVSTPASEYSQGQ-DLF---NPQRRHYWVTAADNDTLAITTPKKTLVLNNNGKYRTYNL 558 L V+ E + + LF QR H V+ +T+ + K L N NG+Y YN Sbjct: 926 LQVADKTGEPNHNELTLFDASKAQRDHLNVSLV-GNTVDLGAWKYKLR-NVNGRYDLYNP 983 Query: 559 RGERVKDEKP 568 E+ Sbjct: 984 EVEKRNQTVD 993
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 535 bits (1380), Expect = 0.0 Identities = 256/386 (66%), Positives = 294/386 (76%), Gaps = 14/386 (3%) Query: 1 MKVKVLSLLVPALLVAGAANAAEVYNKDGNKLDLYGKVDGLHYFSDDKSVDGDQTYMRLG 60 MK KVL+L++PALL AGAA+AAE+YNKDGNKLDLYGKVDGLHYFSDD S DGDQTYMR+G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 FKGETQVTDQLTGYGQWEYQIQGNAPESE-NNSWTRVAFAGLKFQDIGSFDYGRNYGVVY 119 FKGETQ+ DQLTGYGQWEY +Q N E E NSWTR+AFAGLKF D GSFDYGRNYGV+Y Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 120 DVTSWTDVLPEFGGDTYG-SDNFMQQRGNGFATYRNTDFFGLVDGLNFAVQYQGQNGSVS 178 DV WTD+LPEFGGD+Y +DN+M R NG ATYRNTDFFGLVDGLNFA+QYQG+N S S Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 179 GENDPDFTGHGITNNGRKALRQNGDGVGGSITYDY-EGFGVGAAVSSSKRTDAQN-TAAY 236 ++ G NNG NGDG G S TYD GF GAA ++S RT+ Q Sbjct: 181 ADDVN--IGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGT 238 Query: 237 IGNGDRAETYTGGLKYDANNIYLAAQYTQTYNATRVGSL------GWANKAQNFEAVAQY 290 I GD+A+ +T GLKYDANNIYLA Y++T N T G G ANK QNFE AQY Sbjct: 239 IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQY 298 Query: 291 QFDFGLRPSVAYLQSKGKNLGTIGTRNYDDEDILKYVDVGATYYFNKNMSTYVDYKINLL 350 QFDFGLRP+V++L SKGK+L N DD+D++KY DVGATYYFNKN STYVDYKINLL Sbjct: 299 QFDFGLRPAVSFLMSKGKDLTY-NNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLL 357 Query: 351 D-DNQFTRDAGINTDNIVALGLVYQF 375 D D+ F +DAGI+TD+IVALG+VYQF Sbjct: 358 DDDDPFYKDAGISTDDIVALGMVYQF 383
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 47.9 bits (114), Expect = 9e-09 Identities = 26/145 (17%), Positives = 60/145 (41%), Gaps = 20/145 (13%) Query: 1 MNNMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMP 60 M +++ADD + + ++L + + + ++ L + D +++TD+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GDKYGDGITLIKYIKRHFPSLSIIVLTMNNNPAILSAVLDLDIEGIVLKQGA------PT 114 + L+ IK+ P L ++V++ N +A+ ++GA P Sbjct: 59 D---ENAFDLLPRIKKARPDLPVLVMSAQNTFM--TAIKA-------SEKGAYDYLPKPF 106 Query: 115 DLPKALAALQKGKKFTPESVSRLLE 139 DL + + + + S+L + Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLED 131
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 81.0 bits (200), Expect = 6e-18 Identities = 29/106 (27%), Positives = 47/106 (44%) Query: 827 ILVVDDHPINRRLLADQLGSLGYQCKTANDGVDALNVLNKNHIDIVLSDVNMPNMDGYRL 886 ILV DD R +L L GY + ++ + D+V++DV MP+ + + L Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 887 TQRIRQLGLTLPVIGVTANALAEEKQRCLESGMDSCLSKPVTLDVI 932 RI++ LPV+ ++A + E G L KP L + Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 562 bits (1449), Expect = 0.0 Identities = 181/484 (37%), Positives = 269/484 (55%), Gaps = 35/484 (7%) Query: 1 MTAINRILIVDDEDNVRRMLSTAFALQGFETHCANNGRTALHLFADIHPDVVLMDIRMPE 60 MT IL+ DD+ +R +L+ A + G++ +N T A D+V+ D+ MP+ Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59 Query: 61 MDGIKALKEMRSHETRTPVILMTAYAEVETAVEALRCGAFDYVIKPFDLDELNLIVQRAL 120 + L ++ PV++M+A TA++A GA+DY+ KPFDL EL I+ RAL Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119 Query: 121 QLQSMKKEIRHLHQALSTSWQWGH-ILTNSPAMMDICKDTAKIALSQASVLISGESGTGK 179 + L Q G ++ S AM +I + A++ + +++I+GESGTGK Sbjct: 120 AEP------KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGK 173 Query: 180 ELIARAIHYNSRRAKGPFIKVNCAALPESLLESELFGHEKGAFTGAQTLRQGLFERANEG 239 EL+ARA+H +R GPF+ +N AA+P L+ESELFGHEKGAFTGAQT G FE+A G Sbjct: 174 ELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGG 233 Query: 240 TLLLDEIGEMPLVLQAKLLRILQEREFERIGGHQTIKVDIRIIAATNRDLQAMVKEGTFR 299 TL LDEIG+MP+ Q +LLR+LQ+ E+ +GG I+ D+RI+AATN+DL+ + +G FR Sbjct: 234 TLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFR 293 Query: 300 EDLFYRLNVIHLILPPLRDRREDISLLANHFLQKFSSENQRDIIDIDPMAMSLLTAWSWP 359 EDL+YRLNV+ L LPPLRDR EDI L HF+Q+ E + D A+ L+ A WP Sbjct: 294 EDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWP 352 Query: 360 GNIRELSNVIERAVVMNSGPIIFSEDLPPQIRQPV---------CNAGEAKTAPVGERN- 409 GN+REL N++ R + +I E + ++R + +G + E N Sbjct: 353 GNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENM 412 Query: 410 ----------------LKEEIKRVEKRIIMEVLEQQEGNRTRTALMLGISRRALMYKLQE 453 + +E +I+ L GN+ + A +LG++R L K++E Sbjct: 413 RQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472 Query: 454 YGID 457 G+ Sbjct: 473 LGVS 476
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 43.7 bits (103), Expect = 8e-07 Identities = 32/165 (19%), Positives = 70/165 (42%), Gaps = 2/165 (1%) Query: 34 LDTIARNFSLSASSAGFIVTAAQLGYAAGLLFLVPLGDMFERRRLIVSMTLLAAGGMLIT 93 L IA +F+ +S ++ TA L ++ G L D +RL++ ++ G +I Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 94 ASSQSLA-MMILGTALTGLFSVVAQILVPLA-ATLASPDKRGKVVGTIMSGLLLGILLAR 151 S ++I+ + G + LV + A + RGK G I S + +G + Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 152 TVAGLLANLGGWRTVFWVASMLMALMALALWRGLPQMKSETHLNY 196 + G++A+ W + + + + + + +++ + H + Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.018 Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 12/94 (12%) Query: 23 PYQEILLTRLCMHMQSKLLENRNKMLKAQGINETLFMALITLESQENHSIQPSELSCALG 82 P QE+ L + + L R A+G + + T + ++L ALG Sbjct: 756 PEQELRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTFVTI-------ADLVQALG 808 Query: 83 -----SSRTNATRIADELEKRGWIERRESDNDRR 111 SS ++ D L + GW RE+ RR Sbjct: 809 ADPGKSSPMLEGQVRDWLNENGWEYLRETSGQRR 842
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 78.7 bits (194), Expect = 5e-18 Identities = 64/412 (15%), Positives = 120/412 (29%), Gaps = 97/412 (23%) Query: 25 LLLTLLFIIIAVAIGIYWFLVLRHFEETDDA----YVAGNQIQIMSQVSGSVTKVWADNT 80 L FI+ + I VL E A +G +I + V ++ Sbjct: 57 PRLVAYFIMGFLVIAFILS-VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115 Query: 81 DFVKEGDVLVTLDPTDARQAFEKA------------------------------------ 104 + V++GDVL+ L A K Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPY 175 Query: 105 ----------------KTALASSVRQTHQLMINSKQLQANIEVQKIALAKA-------QS 141 K ++ Q +Q +N + +A + + +S Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235 Query: 142 DYNRRVPLGNANLIGREELQHARDAVTSAQAQLDVAIQQYNANQAMILGTKLEDQPAVQQ 201 + L + I + + + A +L V Q ++ IL K E Q Q Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295 Query: 202 AATEVRN------------------AWLALERTRIVSPMTGYVSRRAVQ-PGAQISPTTP 242 E+ + + + I +P++ V + V G ++ Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355 Query: 243 LMAVVPA-TNMWVDANFKETQIANMRIGQPVTITTDIYGDDVKY---TGKVVGLDMGTGS 298 LM +VP + V A + I + +GQ I + + +Y GKV + + Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF-PYTRYGYLVGKVKNI-----N 409 Query: 299 AFSLLPAQNATGNWIKVVQRLPVRIELDQKQLEQYPLRIGLSTLVSVNTTNR 350 ++ G V+ + + PL G++ + T R Sbjct: 410 LDAIE--DQRLGLVFNVIISIEENCLST--GNKNIPLSSGMAVTAEIKTGMR 457
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 132 bits (333), Expect = 9e-36 Identities = 97/405 (23%), Positives = 169/405 (41%), Gaps = 23/405 (5%) Query: 17 IALSLATFMQVLDSTIANVAIPTIAGNLGSSLSQGTWVITSFGVANAISIPLTGWLAKRV 76 I L + +F VL+ + NV++P IA + + WV T+F + +I + G L+ ++ Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76 Query: 77 GEVKLFLWSTIAFAIASWACGVS-SSLNMLIFFRVIQGIVAGPLIPLSQSLLLNNYPPAK 135 G +L L+ I S V S ++LI R IQG A L ++ P Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136 Query: 136 RSIALALWSMTVIVAPICGPILGGYISDNYHWGWIFFINVPIGVAVVLMTLQTLRGRETR 195 R A L V + GP +GG I+ HW + + +P+ + + L L +E R Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVR 194 Query: 196 TERRRIDAVGLALLVIGIGSLQIMLDRGKELDWFSSQEIIILTVVAVVAICFLIVWELTD 255 + D G+ L+ +GI + ML F++ I +V+V++ + Sbjct: 195 I-KGHFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKV 243 Query: 256 DNPIVDLSLFKSRNFTIGCLCISLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGI 315 +P VD L K+ F IG LC + + G + ++P ++++V+ + G G Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303 Query: 316 IPVILS-PIIGRFAHKLDMRRLVTFSFIMYAVCFYWRAYTFEPGMDFGASAWPQFIQGF- 373 + VI+ I G + ++ +V F ++ S + I F Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFMTIIIVFV 358 Query: 374 --AVACFFMPLTTITLSGLPPERLAAASSLSNFTRTLAGSIGTSI 416 ++ ++TI S L + A SL NFT L+ G +I Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature. Length = 171 Score = 293 bits (751), Expect = e-105 Identities = 132/170 (77%), Positives = 148/170 (87%) Query: 2 PLLDSFTVDHTRMEAPAVRVAKTMNTPHGDAITVFDLRFCVPNKEVMPERGIHTLEHLFA 61 PLLDSFTVDHTRM APAVRVAKTM TP GD ITVFDLRF PNK+++ E+GIHTLEHL+A Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60 Query: 62 GFMRNHLNGNGVEIIDISPMGCRTGFYMSLIGTPDEQRVADAWKAAMEDVLKVQDQNQIP 121 GFMRNHLNG+ VEIIDISPMGCRTGFYMSLIGTP EQ+VADAW AAMEDVLKV++QN+IP Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120 Query: 122 ELNVYQCGTYQMHSLQEAQDIARNILERDVRINSNEELALPKEKLQELHI 171 ELN YQCGT MHSL EA+ IA+NILE V +N N+ELALP+ L+EL I Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLRELRI 170
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 29.1 bits (65), Expect = 0.002 Identities = 27/114 (23%), Positives = 43/114 (37%), Gaps = 29/114 (25%) Query: 8 QQGFSLPEVMLAMVLMVMIVTA----------------LSGFQRTLMNSLASRNQYQQLW 51 Q+GF+L E+ML ++LM + L+ F+ L Q Q + Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQFF 62 Query: 52 -----RHGWQ--QTQLRAISPPA----NWQVNRMQTSQAGCVSISVTLVSPGGR 94 WQ + R + PA W R +AG V+ S ++ GG+ Sbjct: 63 GVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGSI--AGGK 114
>PilS_PF08805#PilS N terminal Length = 185 Score = 28.7 bits (64), Expect = 0.015 Identities = 14/51 (27%), Positives = 28/51 (54%), Gaps = 3/51 (5%) Query: 72 ALSARRNRRMPVKEQGFSLLEVLIAMAISSVLLLGAARFLPALQRESLTNT 122 +LSARR + +++G +L+EVL+ + + VL A + +Q ++ Sbjct: 15 SLSARRKKE---QDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSN 62
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 29.5 bits (66), Expect = 0.003 Identities = 9/24 (37%), Positives = 18/24 (75%) Query: 1 MKTQRGYTLIETLVAMLILVMLSA 24 QRG+TL+E +V ++I+ +L++ Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLAS 27
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 611 bits (1578), Expect = 0.0 Identities = 189/571 (33%), Positives = 314/571 (54%), Gaps = 7/571 (1%) Query: 168 QTRIRALPAAPGVAIAEGWQDATLPLMEQVYQASTLDPALERERLTGALEEAANEFRRYS 227 +I + A+ GVAIA+ + + + + S D + E E+LT ALE++ E R Sbjct: 2 HHKITGIAASSGVAIAKAFIHLEPNV--DIEKTSITDVSTEIEKLTAALEKSKEELRAIK 59 Query: 228 KRFAAGAQKETAAIFDLYSHLLSDTRLRRELFAEVDKGSV-AEWAVKTVIEKFAEQFAAL 286 + A + A IF + +L D L + +++ + AE+A+K V + F F ++ Sbjct: 60 DQTEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESM 119 Query: 287 SDNYLKERAGDLRALGQRLLFHLDDANQGPNAW-PERFILVADELSATTLAELPQDRLVG 345 + Y+KERA D+R + +R+L HL G A E +++A++L+ + A+L + + G Sbjct: 120 DNEYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKG 179 Query: 346 VVVRDGAANSHAAIMVRALGIPTVMGA-DIQPSVLHRRTLIVDGYRGELLVDPEPVLLQE 404 G SH+AIM R+L IP V+G ++ + H +IVDG G ++V+P ++ Sbjct: 180 FATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKA 239 Query: 405 YQRLISEEIELSRLAEDDVNLPAQLKSGERIKVMLNAGLSPEHEEKLGSRIDGIGLYRTE 464 Y+ + + + V P+ K G +++ N G + + L + +GIGLYRTE Sbjct: 240 YEEKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTE 299 Query: 465 IPFMLQSGFPSEEEQVAQYQGMLQMFNDKPVTLRTLDVGADKQLPYMPISEE-NPCLGWR 523 +M + P+EEEQ Y+ ++Q + KPV +RTLD+G DK+L Y+ + +E NP LG+R Sbjct: 300 FLYMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFR 359 Query: 524 GIRITLDQPEIFLIQVRAMLRANAATGNLNILLPMVTSLDEVDEARRLIERAGREVEEMI 583 IR+ L++ +IF Q+RA+LRA + GNL ++ PM+ +L+E+ +A+ +++ ++ Sbjct: 360 AIRLCLEKQDIFRTQLRALLRA-STYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEG 418 Query: 584 GYEIPKPRIGIMLEVPSMVFMLPHLAKRVDFISVGTNDLTQYILAVDRNNTRVANIYDSL 643 +GIM+E+PS AK VDF S+GTNDL QY +A DR N RV+ +Y Sbjct: 419 VDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPY 478 Query: 644 HPAMLRALAMIAREAEIHGIDLRLCGEMAGDPMCVAILIGLGYRHLSMNGRSVARVKYLL 703 HPA+LR + M+ + A G + +CGEMAGD + + +L+GLG SM+ S+ + L Sbjct: 479 HPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQL 538 Query: 704 
RRIDFAEAENLAQRSLEAQLATEVRHQVAAF 734 ++ E + AQ++L A EV V Sbjct: 539 LKLSKEELKPFAQKALMLDTAEEVEQLVKKT 569
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 452 bits (1165), Expect = e-161 Identities = 224/406 (55%), Positives = 300/406 (73%), Gaps = 1/406 (0%) Query: 1 MALFYYQALERNGRKTKGMIEADSARHARQLLRGKDLIPVHI-EARLNASAGGMLQRRRH 59 MA ++YQAL+ G+K +G EADSAR ARQLLR + L+P+ + E R + G Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60 Query: 60 AHRRVAAADLALFTRQLATLVQAAMPLETCLQAVSEQSEKLHVKSLGMALRSRIQEGYTL 119 R++ +DLAL TRQLATLV A+MPLE L AV++QSEK H+ L A+RS++ EG++L Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120 Query: 120 SDSLREHPRVFDSLFCSMVAAGEKSGHLDVVLNRLADYTEQRQRLKSRLLQAMLYPLVLL 179 +D+++ P F+ L+C+MVAAGE SGHLD VLNRLADYTEQRQ+++SR+ QAM+YP VL Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180 Query: 180 VVATGVVTILLTAVVPEIIEQFDHLGHALPASTRMLIAMSDTLQTSGVYWLAGLLGLLVL 239 VVA VV+ILL+ VVP+++EQF H+ ALP STR+L+ MSD ++T G + L LL + Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240 Query: 240 GQRLLKNPAMRLRWDKTLLRLPVTGRVARGLNTARFSRTLSILTASSVPLLEGIQTAAAV 299 + +L+ R+ + + LL LP+ GR+ARGLNTAR++RTLSIL AS+VPLL+ ++ + V Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300 Query: 300 SANRYVEQQLLLAADRVREGSSLRAALADLRLFPPMMLYMIASGEQSGELETMLEQAAVN 359 +N Y +L LA D VREG SL AL LFPPMM +MIASGE+SGEL++MLE+AA N Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360 Query: 360 QEREFDTQVGLALGLFEPALVVMMAGVVLFIVIAILEPMLQLNNMV 405 Q+REF +Q+ LALGLFEP LVV MA VVLFIV+AIL+P+LQLN ++ Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 574 bits (1482), Expect = 0.0 Identities = 295/668 (44%), Positives = 431/668 (64%), Gaps = 34/668 (5%) Query: 24 LLPLVLAAALCSSPVWAEEATFTANFKDTDLKSFIETVGANLNKTIIMGPGVQGKVSIRT 83 L L++ AAL P AEE F+A+FK TD++ FI TV NLNKT+I+ P V+G +++R+ Sbjct: 11 SLTLLIFAALLFRPAAAEE--FSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRS 68 Query: 84 MTPLNERQYYQLFLNLLEAQGYAVVPMENDVLKVVKSSAAKVEPLPLVGEGSDNYAGDEM 143 LNE QYYQ FL++L+ G+AV+ M N VLKVV+S AK +P+ + + GDE+ Sbjct: 69 YDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPG-IGDEV 127 Query: 144 VTKVVPVRNVSVRELAPILRQMIDSAGSGNVVNYDPSNVIMLTGRASVVERLTEVIQRVD 203 VT+VVP+ NV+ R+LAP+LRQ+ D+AG G+VV+Y+PSNV+++TGRA+V++RL +++RVD Sbjct: 128 VTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVD 187 Query: 204 HAGNRTEEVIPLDNASASEIARVLESLTKNSGENQ-PATLKSQIVADERTNSVIVSGDPA 262 +AG+R+ +PL ASA+++ +++ L K++ ++ P ++ + +VADERTN+V+VSG+P Sbjct: 188 NAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPN 247 Query: 263 TRDKMRRLIRRLDSEMERSGNSQVFYLKYSKAEDLVDVLKQVSGTLTAAKEEAEGTVGSG 322 +R ++ +I++LD + GN++V YLKY+KA DLV+VL +S T+ + K+ A+ + Sbjct: 248 SRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPV-AAL 306 Query: 323 REVVSIAASKHSNALIVTAPQDIMQSLQSVIEQLDIRRAQVHVEALIVEVAEGSNINFGV 382 + + I A +NALIVTA D+M L+ VI QLDIRR QV VEA+I EV + +N G+ Sbjct: 307 DKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGI 366 Query: 383 QWASKDAGLMQFANGTQIPIGTLGAAISQAKPQKGSTVISENGATTINPDTNGDLST-LA 441 QWA+K+AG+ QF N + +PI T A + +G +S+ LA Sbjct: 367 QWANKNAGMTQFTN-SGLPISTAIAG-------------------ANQYNKDGTVSSSLA 406 Query: 442 QLLSGFSGTAVGVVKGDWMALVQAVKNDSSSNVLSTPSITTLDNQEAFFMVGQDVPVLTG 501 LS F+G A G +G+W L+ A+ + + +++L+TPSI TLDN EA F VGQ+VPVLTG Sbjct: 407 SALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466 Query: 502 STVGSNNSNPFNTVERKKVGIMLKVTPQINEGNAVQMVIEQEVSKVEGQTS-----LDVV 556 S S N FNTVERK VGI LKV PQINEG++V + IEQEVS V S L Sbjct: 467 
SQTTSG-DNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGAT 525 Query: 557 FGERKLKTTVLANDGELIVLGGLMDDQAGESVAKVPLLGDIPLIGNLFKSTADKKEKRNL 616 F R + VL GE +V+GGL+D ++ KVPLLGDIP+IG LF+ST+ K KRNL Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585 Query: 617 MVFIRPTILRDGMAADGVSQRKYNYMRAEQIYR--DEQGLSLMPHTAQPVLPAQNQALPP 674 M+FIRPT++RD S +Y Q + E +++ + P Q+ A Sbjct: 586 MLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIYPRQDTAAFR 645 Query: 675 EVRAFLNA 682 +V A ++A Sbjct: 646 QVSAAIDA 653
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 118 bits (298), Expect = 9e-34 Identities = 72/287 (25%), Positives = 117/287 (40%), Gaps = 40/287 (13%) Query: 40 IARGMFWLMLLIISAKVAHSLWRYFSFSAEYTA-VSPSANKPLRADAKAFDKNDVQLISQ 98 I R +F+L++L+ ++A WR A VS P +A + ND L Sbjct: 14 IRRILFYLLMLLFCQQLAMIFWR---IGLPDNAPVSSVQITPAQARQQPVTLNDFTL--- 67 Query: 99 QNWFGKYQPV--ATPVKQPEPAPVAETRLNVVLRGIAFG---ARPGAVIEEGGKQQVYLQ 153 FG A + + + + + LN+ L G+ G +R A+I + +Q Sbjct: 68 ---FGVSPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGV 124 Query: 154 GETLGSHNAVIEEINRDHVMLRYQGKIERLSLAEEERSTVAVTNKKAVSDEAKQAVAEPA 213 E + +NA I I D V+L+YQG+ E L L +E S Sbjct: 125 NEEVPGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDS------------------GSDG 166 Query: 214 VSAPVEIPAAVRQAL-AKDPQKIFNYIQLTPVRKEG-IVGYAVKPGADRSLFDASGFKEG 271 V A V + L + + +Y+ +P+ + + GY + PG F G ++ Sbjct: 167 VPG-----AQVNEQLQQRASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDN 221 Query: 272 DIAIALNQQDFTDPRAMIALMRQLPSMDSIQLTVLRKGARYDISIAL 318 D+A+ALN D D M ++ + + LTV R G R DI + Sbjct: 222 DMAVALNGLDLRDAEQAKKAMERMADVHNFTLTVERDGQRQDIYMEF 268
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 278 bits (714), Expect = 2e-96 Identities = 110/274 (40%), Positives = 149/274 (54%), Gaps = 12/274 (4%) Query: 1 MFFDVFQQYPAAMPVLATVGGLIIGSFLNVVIWRYPIML-RQQMAEFHGEMPSTQSKI-- 57 + ++ P L + L+IGSFLNVVI R PIML R+ AE+ + Sbjct: 3 LLLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDE 62 Query: 58 ---SLALPRSHCPHCQQTIRVRDNIPLLSWLMLKGRCRDCQAKISKRYPLVELLTALAFL 114 +L +PRS CPHC I +NIPLLSWL L+GRCR CQA IS RYPLVELLTAL + Sbjct: 63 PPYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSV 122 Query: 115 LASLVWPESGWGLAVMILSAWLIAASVIDLDNQWLPDVFTQGVLWTGLIAAWAQQSPLTL 174 ++ LA ++L+ L+A + IDLD LPD T +LW GL+ ++L Sbjct: 123 AVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNL-LGGFVSL 181 Query: 175 QDAVTGVLVGFITFYSLRWIAGIVLRKEALGMGDVLLFAALGGWVGALSLPNVALIASCC 234 DAV G + G++ +SL W ++ KE +G GD L AALG W+G +LP V L++S Sbjct: 182 GDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLV 241 Query: 235 GLIYAVI-----TKRGSTTLPFGPCLSLGGIATL 263 G + S +PFGP L++ G L Sbjct: 242 GAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIAL 275
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 48.8 bits (116), Expect = 5e-08 Identities = 23/60 (38%), Positives = 29/60 (48%), Gaps = 3/60 (5%) Query: 32 SSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTPD---PEPTPEPEPEPVP 88 S T + V+P P P EP PEP P PEP E P+P P+P+P+PV Sbjct: 50 ISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 109 Score = 43.0 bits (101), Expect = 4e-06 Identities = 17/88 (19%), Positives = 25/88 (28%), Gaps = 2/88 (2%) Query: 33 SDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTPDPEPTPEPEPEPVPTKTG 92 +D P + PE +P P PEP PEP + E + V Sbjct: 58 ADLEPPQAVQ-PPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKR 116 Query: 93 YLTLGGSQRITGATCNGESSDGFTFTPG 120 + R N + + T Sbjct: 117 DVK-PVESRPASPFENTAPARPTSSTAT 143 Score = 41.9 bits (98), Expect = 1e-05 Identities = 21/116 (18%), Positives = 37/116 (31%), Gaps = 5/116 (4%) Query: 35 TPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTPDP--EPTPEPEPEPVPTKTG 92 P + +P +P PEP +PEP PEP P+P E E K Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK 104 Query: 93 YLTLGGSQRITGATCNGESSDGFTFTPGDKVTCVAGNNTTIATFDTQSEAARSLRA 148 + ++ ES +P + ++T ++ + + Sbjct: 105 PKPVKKVEQPKRDVKPVESRPA---SPFENTAPARPTSSTATAATSKPVTSVASGP 157 Score = 39.2 bits (91), Expect = 7e-05 Identities = 20/97 (20%), Positives = 30/97 (30%), Gaps = 9/97 (9%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPE-PTPDPEPTPEPTPDPEPTPEPEPEPV 87 + P V PE +P P P E P P+P P+P + +P+ + Sbjct: 65 AVQPPPEPVV----EPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP-KPVKKVEQPKRDVK 119 Query: 88 P---TKTGYLTLGGSQRITGATCNGESSDGFTFTPGD 121 P R T +T +S T Sbjct: 120 PVESRPASPFENTAPARPTSSTATAATSKPVTSVASG 156 Score = 35.7 bits (82), Expect = 8e-04 Identities = 17/40 (42%), Positives = 17/40 (42%) Query: 50 PDPTPNPEPTPEPTPDPEPTPEPTPDPEPTPEPEPEPVPT 89 P P T D EP P PEP EPEPEP P Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI 83 Score = 30.7 bits (69), Expect = 0.034 Identities = 11/40 (27%), Positives = 13/40 (32%) Query: 52 PTPNPEPTPEPTPDPEPTPEPTPDPEPTPEPEPEPVPTKT 91 P P + + P P P P 
EPEP P Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI 83
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 70.0 bits (171), Expect = 2e-15 Identities = 29/184 (15%), Positives = 62/184 (33%), Gaps = 38/184 (20%) Query: 90 GLGSGVIINANKGYVLTKNHVINQAQKISIQL------------NDGREFDAKLIGSDDQ 137 + SGV++ + +LT HV++ L +G ++ + Sbjct: 102 FIASGVVVGKDT--LLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 138 SDIALLQIQN-------PSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIISALG 190 D+A+++ + ++++ + +V G P ++ + Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VATMW 212 Query: 191 RSGLNLEGLEN-FIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSVGIGFAIPSN 249 S + L+ +Q D S GNSG + N E+IGI+ G+ Sbjct: 213 ESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFNGA 263 Query: 250 MART 253 + Sbjct: 264 VFIN 267
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 53.1 bits (127), Expect = 6e-10 Identities = 33/160 (20%), Positives = 61/160 (38%), Gaps = 26/160 (16%) Query: 77 RTLGSGVIMDQRGYIITNKHVINDADQIIVALQ------------DGRVFEALLVGSDSL 124 + SGV++ + ++TNKHV++ AL+ +G + Sbjct: 101 TFIASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 125 TDLAVLKINATGGLPTIP--INPRRVPH-----IGDVVLAIGNPYNLGQTITQGIISATG 177 DLA++K + I + P + + + + G P + T + G Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKG 216 Query: 178 RIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINT 217 +I + +Q D S GNSG + N E++GI+ Sbjct: 217 KI---TYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 28.1 bits (62), Expect = 0.043 Identities = 37/167 (22%), Positives = 61/167 (36%), Gaps = 27/167 (16%) Query: 3 VAVLGAAGGIGQALALLLKTQLPSGSELSLYDIAPVTPGVAVDLSHIPTAVKIKGFSGED 62 + GAA GIG+A+A L G+ ++ D P V S A + F + Sbjct: 11 AFITGAAQGIGEAVARTL---ASQGAHIAAVDYNP-EKLEKVVSSLKAEARHAEAFPADV 66 Query: 63 ATPA------------LEGADVVLISAGVARK------PGMDRSDLFNVNAGIVKNLVQQ 104 A + D+++ AGV R + F+VN+ V N + Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 105 VAKTCPK----ACIGIITNPVNTT-VAIAAEVLKKAGVYDKNKLFGV 146 V+K + + + +NP ++AA KA K G+ Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGL 173
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 168 bits (428), Expect = 9e-57 Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 5/141 (3%) Query: 15 KALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELG 74 + ++ + +Q E+V L++ G+ N+ Q+ VSR + + V+ Y LPA+ Sbjct: 11 REIITANEIETQDELVDILKKDGY-NVTQATVSRDIKELHLVKVPTNNGSYKYSLPADQR 69 Query: 75 VPTTSSPLKNLV---LDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEGILGTIAGDDTI 131 S ++L+ + ID ++V+ T PG AQ I L+D+L E I+GTI GDDTI Sbjct: 70 FNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEE-IMGTICGDDTI 128 Query: 132 FTTPANGFTVKELYEAILELF 152 K + + ILEL Sbjct: 129 LIICRTHDDTKVVQKKILELL 149
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 54.4 bits (131), Expect = 2e-10 Identities = 29/163 (17%), Positives = 59/163 (36%), Gaps = 16/163 (9%) Query: 6 RKFSRTAITVVLVILAFIAIFNAWVYYTE----SPWTRDARFSADVVAIAPDVSGLITQV 61 SR V I+ F+ I + + S I P + ++ ++ Sbjct: 51 TPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEI 110 Query: 62 NVHDNQLVKKGQVLFTIDQPR-------YQKALEEAQADVAYYQVLAQEKRQEAGRRNRL 114 V + + V+KG VL + Q +L +A+ + YQ+L++ E + L Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRS--IELNKLPEL 168 Query: 115 GVQAMSREEIDQANNVL---QTVLHQLAKAQATRDLAKLDLER 154 + + VL + Q + Q + +L+L++ Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211 Score = 51.4 bits (123), Expect = 2e-09 Identities = 28/147 (19%), Positives = 54/147 (36%), Gaps = 15/147 (10%) Query: 100 LAQEKRQEAGRRNRLGVQ-AMSREEIDQANNVLQT-VLHQLAKAQAT-------RDLAKL 150 E R + ++ + ++EE + + +L +L + + Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE 323 Query: 151 DLERTVIRAPADGWVTNLNVYT-GEFITRGSTAVALVKQNSFY-VLAYMEETKLEGVRPG 208 + +VIRAP V L V+T G +T T + +V ++ V A ++ + + G Sbjct: 324 RQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVG 383 Query: 209 YRAEIT----PLGSNKVLKGTVDSVAA 231 A I P L G V ++ Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNINL 410
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 28.4 bits (63), Expect = 0.043 Identities = 12/72 (16%), Positives = 20/72 (27%), Gaps = 3/72 (4%) Query: 296 MMPQVLPSPDAMGPKLPEPATGITQPTPQQPATGNAVTAPAAPTQPAANRSPQRATPPQS 355 P+ +P P P + E +P P+ V P +P +R Sbjct: 78 PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK---KVEQPKRDVKPVESRPASPFENTAP 134 Query: 356 GAQPPARAPGGQ 367 + A Sbjct: 135 ARPTSSTATAAT 146
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 576 bits (1487), Expect = 0.0 Identities = 347/347 (100%), Positives = 347/347 (100%) Query: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAK 60 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAK Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAK 60 Query: 61 QMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQ 120 QMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQ Sbjct: 61 QMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQ 120 Query: 121 VERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG 180 VERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG Sbjct: 121 VERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG 180 Query: 181 VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN 240 VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN Sbjct: 181 VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN 240 Query: 241 LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALL 300 LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALL Sbjct: 241 LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALL 300 Query: 301 RNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347 RNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE Sbjct: 301 RNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.0 bits (65), Expect = 0.026 Identities = 11/28 (39%), Positives = 17/28 (60%) Query: 150 VVVTGASGGVGSTAVALLHKLGYQVVAV 177 +VTGA+G +G L + G+QVV + Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGI 30
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 128 bits (322), Expect = 5e-39 Identities = 77/209 (36%), Positives = 122/209 (58%), Gaps = 3/209 (1%) Query: 1 MAKRTKAEALKTRQELIETAIAQFAQHGVSKTTLNDIADAANVTRGAIYWHFENKTQLFN 60 MA++TK EA +TRQ +++ A+ F+Q GVS T+L +IA AA VTRGAIYWHF++K+ LF+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EMW-LQQPSLRELIQDYLTAGLEHDPFQQLREKLIVGLQYIAKIPRQQALLKILYHKCEF 119 E+W L + ++ EL + A DP LRE LI L+ R++ L++I++HKCEF Sbjct: 61 EIWELSESNIGELELE-YQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119 Query: 120 NDEM-LAEGVIREKMGFNPQTLREVLQACQQQGCVANNLDLDVVMIIIDGAFSGIVQNWL 178 EM + + R + + + L+ C + + +L II+ G SG+++NWL Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179 Query: 179 MNMACYDLYKQAPALVDNVLRMFMPDENI 207 +DL K+A V +L M++ + Sbjct: 180 FAPQSFDLKKEARDYVAILLEMYLLCPTL 208
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 43.7 bits (103), Expect = 8e-07 Identities = 39/217 (17%), Positives = 70/217 (32%), Gaps = 38/217 (17%) Query: 98 ATYQASYDSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADA-RQADATV 156 K +L + E+ A + Q + I D RQ + Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLV----------TQLFKNEILDKLRQTTDNI 311 Query: 157 IAAKATVESARINLAYTKVTAPISGRIGK-STVTEGALVTNGQTTELATVQQLDPIYVDV 215 + + + AP+S ++ + TEG +VT +T + V + D + V Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTA 370 Query: 216 TQSSND--FMRLKQSVEQGNLHKENATSNVELVMENGQTYP-LKGTLQ--FSDVTVDEST 270 + D F+ + Q+ +++ Y L G ++ D D+ Sbjct: 371 LVQNKDIGFINVGQNAI------------IKVEAFPYTRYGYLVGKVKNINLDAIEDQRL 418 Query: 271 GSIT--LRAI------FPNPQHTLLPGMFVRARIDEG 299 G + + +I N L GM V A I G Sbjct: 419 GLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455 Score = 34.0 bits (78), Expect = 0.001 Identities = 22/127 (17%), Positives = 43/127 (33%), Gaps = 13/127 (10%) Query: 46 TAPLEVKTELPGR-TNAYRIAEVRPQVSGIVLNRNFTEGSDVQAGQSLYQIDPATYQASY 104 +E+ G+ T++ R E++P + IV EG V+ G L ++ +A Sbjct: 77 LGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA-- 134 Query: 105 DSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADARQADATVIAAKATVE 164 + K++++ A L RY L E ++ + Sbjct: 135 -----DTLKTQSSLLQARLEQTRYQIL-----SRSIELNKLPELKLPDEPYFQNVSEEEV 184 Query: 165 SARINLA 171 +L Sbjct: 185 LRLTSLI 191 Score = 29.0 bits (65), Expect = 0.030 Identities = 11/34 (32%), Positives = 15/34 (44%), Gaps = 1/34 (2%) Query: 65 AEVRPQVSGIVLNRN-FTEGSDVQAGQSLYQIDP 97 + +R VS V TEG V ++L I P Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVP 361
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1402 bits (3631), Expect = 0.0 Identities = 1027/1034 (99%), Positives = 1030/1034 (99%) Query: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGISVEKSSSSYLMVPGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 EVQQQGISVEKSSSSYLMV GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRL 240 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIQEVVKTLFEAIMLVFLVMYLFLQ 360 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSI EVVKTLFEAIMLVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 MEDKLPPREATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 MEDKLPP+EATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 
SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERSGDENSAEAVIHRAKMELGKIRDG 660 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEER+GDENSAEAVIHRAKMELGKIRDG Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKVYVQADAKFRM 780 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKK+YVQADAKFRM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPRTSSGDAM 840 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP TSSGDAM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1021 VPVFFVVIRRCFKG 1034 VPVFFVVIRRCFKG Sbjct: 1021 VPVFFVVIRRCFKG 1034
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 83.9 bits (207), Expect = 4e-21 Identities = 53/200 (26%), Positives = 94/200 (47%), Gaps = 15/200 (7%) Query: 59 EFSLAALWRNENHAGVKDANPVAVNQETPKLSIALNGIVLTSNDETSFVLINEGNEQKRY 118 +F+L + +N AG DA N L+++L G++ +D S +I++ NEQ Sbjct: 64 DFTLFGVSPEKNKAGALDA-SQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSR 122 Query: 119 SLNEALESAPGT--FIRKINKTSVVFETHGHYEKVTLH-------PGLP--DIIKQPDSE 167 +NE + PG I I VV + G YE + L+ G+P + +Q Sbjct: 123 GVNEEV---PGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVPGAQVNEQLQQR 179 Query: 168 NQNVLADYIIATPIRDGEQIYGLRLNPRKGLNAFTTSLLQPGDIALRINNLSLTHPDEVS 227 ++DY+ +PI + ++ G RLNP ++F LQ D+A+ +N L L ++ Sbjct: 180 ASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVALNGLDLRDAEQAK 239 Query: 228 QALSLLLTQQSAQFTIRRNG 247 +A+ + + T+ R+G Sbjct: 240 KAMERMADVHNFTLTVERDG 259
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 716 bits (1850), Expect = 0.0 Identities = 344/629 (54%), Positives = 466/629 (74%), Gaps = 11/629 (1%) Query: 11 ITCCLLAALLMPCAGHAENEQYGANFNNADIRQFVEIVGQHLGKTILIDPSVQGTISVRS 70 +T + AALL A E++ A+F DI++F+ V ++L KT++IDPSV+GTI+VRS Sbjct: 12 LTLLIFAALLF---RPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRS 68 Query: 71 NDTFSQQEYYQFFLSILDLYGYSVITLDNGFLKVVRSANVKTSPGMIADSSRPGVGDELV 130 D ++++YYQFFLS+LD+YG++VI ++NG LKVVRS + KT+ +A + PG+GDE+V Sbjct: 69 YDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVV 128 Query: 131 TRIVPLENVPARDLAPLLRQMMDAGSVGNVVHYEPSNVLILTGRASTINKLIEVIKRVDV 190 TR+VPL NV ARDLAPLLRQ+ D VG+VVHYEPSNVL++TGRA+ I +L+ +++RVD Sbjct: 129 TRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDN 188 Query: 191 IGTEKQQIIHLEYASAEDLAEILNQLISESHGKSQMPALLSAKIVADKRTNSLIISGPEK 250 G + L +ASA D+ +++ +L ++ KS +P + A +VAD+RTN++++SG Sbjct: 189 AGDRSVVTVPLSWASAADVVKLVTEL-NKDTSKSALPGSMVANVVADERTNAVLVSGEPN 247 Query: 251 ARQRITSLLKSLDVEESEEGNTRVYYLKYAKATNLVEVLTGVSEKLKDEKGNSRKPSSTS 310 +RQRI +++K LD +++ +GNT+V YLKYAKA++LVEVLTG+S ++ EK ++ + Sbjct: 248 SRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAK--PVAA 305 Query: 311 AMDNVAITADEQTNSLVITADQSVQEKLATVIARLDIRRAQVLVEAIIVEVQDGNGLNLG 370 N+ I A QTN+L++TA V L VIA+LDIRR QVLVEAII EVQD +GLNLG Sbjct: 306 LDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLG 365 Query: 371 VQWANKNVGAQQFTNTGLPVFNAAQGVADYKKNGGITSANPAWDMFSAYNGMAAGFFNGD 430 +QWANKN G QFTN+GLP+ A G Y K+G ++S+ S++NG+AAGF+ G+ Sbjct: 366 IQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLA--SALSSFNGIAAGFYQGN 423 Query: 431 WGVLLTALASNNKNDILATPSIVTLDNKLASFNVGQDVPVLSGSQTTSGDNVFNTVERKT 490 W +LLTAL+S+ KNDILATPSIVTLDN A+FNVGQ+VPVL+GSQTTSGDN+FNTVERKT Sbjct: 424 WAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKT 483 Query: 491 VGTKLKVTPQVNEGDAVLLEIEQEVSSVD---SSSNSTLGPTFNTRTIQNAVLVKTGETV 547 VG KLKV PQ+NEGD+VLLEIEQEVSSV SS++S LG TFNTRT+ NAVLV +GETV Sbjct: 484 
VGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETV 543 Query: 548 VLGGLLDDFSKEQVSKVPLLGDIPLVGQLFRYTSTERAKRNLMVFIRPTIIRDDDVYRSL 607 V+GGLLD + KVPLLGDIP++G LFR TS + +KRNLM+FIRPT+IRD D YR Sbjct: 544 VVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQA 603 Query: 608 SKEKYTRYRQEQQLRIDGKSKALVGSEDL 636 S +YT + Q + ++ + ++DL Sbjct: 604 SSGQYTAFNDAQSKQRGKENNDAMLNQDL 632
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 512 bits (1321), Expect = 0.0 Identities = 195/405 (48%), Positives = 283/405 (69%), Gaps = 8/405 (1%) Query: 2 NYRYRAMTQDGQKLQGIIDANDERQARLRLREEGLFLLDIRPQK-------SSGVKTRRP 54 Y Y+A+ G+K +G +A+ RQAR LRE GL L + + S+G+ RR Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62 Query: 55 -RISHSELTLFTRQLATLSAAALPLEESLAVIGQQSSNNRLADVLNQVRSAILEGHPLSD 113 R+S S+L L TRQLATL AA++PLEE+L + +QS L+ ++ VRS ++EGH L+D Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122 Query: 114 ALQHFPTLFDSLYRTLVKAGEKSGLLAPVLEKLADYNENRQKIRSKLIQSLIYPCMLTTV 173 A++ FP F+ LY +V AGE SG L VL +LADY E RQ++RS++ Q++IYPC+LT V Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182 Query: 174 AIVVVIILLTAVVPKITEQFVHMKQQLPLSTRILLGLSDTLQRTGPTLLATVFIVAVGFW 233 AI VV ILL+ VVPK+ EQF+HMKQ LPLSTR+L+G+SD ++ GP +L + + F Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242 Query: 234 LWLKRGNNRHRFHAMLLRVALIGPLICAINSARYLRTLSILQSSGVPLLDGMNLSTESLN 293 + L++ R FH LL + LIG + +N+ARY RTLSIL +S VPLL M +S + ++ Sbjct: 243 VMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVMS 302 Query: 294 NLEIRQRLANAAENVRQGNSIHLSLEQTAIFPPMMLYMVASGEKSGQLGTLMVRAADNQE 353 N R RL+ A + VR+G S+H +LEQTA+FPPMM +M+ASGE+SG+L +++ RAADNQ+ Sbjct: 303 NDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQD 362 Query: 354 TLQQNRIALTLSIFEPALIITMALIVLFIVVSVLQPLLQLNSMIN 398 +++ L L +FEP L+++MA +VLFIV+++LQP+LQLN++++ Sbjct: 363 REFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 249 bits (636), Expect = 1e-88 Identities = 144/145 (99%), Positives = 144/145 (99%) Query: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60 Query: 61 LDNHRYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120 LDNH YPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL Sbjct: 61 LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120 Query: 121 LSAGPDGEMGTEDDITNWGLSKKKK 145 LSAGPDGEMGTEDDITNWGLSKKKK Sbjct: 121 LSAGPDGEMGTEDDITNWGLSKKKK 145
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 141 bits (357), Expect = 2e-45 Identities = 50/154 (32%), Positives = 76/154 (49%), Gaps = 18/154 (11%) Query: 3 QQRGFTLLEMMLVLALVAITASVVLFTYGREDAASTRARETAARFTAALELAIDRATLSG 62 +QRGFTLLEMML+L L+ ++A +VL + + A +T ARF A L R +G Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFP--ASRDDSAAQTLARFEAQLRFVQQRGLQTG 59 Query: 63 QPVGIHFSDSAWRIMV----PGKTP-------SAWRWVPLQEDAADESKNDWGEELSIQL 111 Q G+ W+ +V G P S +RW+PL+ S + G +L++ Sbjct: 60 QFFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGSIAGGKLNLAF 119 Query: 112 ---QPFKPDDSNQPQVVILADGQITPFSLLMANA 142 + + P D P V+I G++TPF L + A Sbjct: 120 AQGEAWTPGD--NPDVLIFPGGEMTPFRLTLGEA 151
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 31.0 bits (70), Expect = 9e-04 Identities = 18/91 (19%), Positives = 42/91 (46%), Gaps = 4/91 (4%) Query: 14 MNKQSGMTLLEVLLAMSIFTAVALTLMSSMQGQ--RTAIERMRNETLALWIADNQLQSQD 71 +KQ G TLLE+++ + I +A ++ ++ G + ++ ++ +AL A + + D Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK-LD 62 Query: 72 SFDEENTSSSGKELINGEELINGEEWNWRSD 102 + T+ + L+ L N+ + Sbjct: 63 NHHYPTTNQGLESLVEAPTL-PPLAANYNKE 92
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 33.8 bits (77), Expect = 2e-04 Identities = 12/47 (25%), Positives = 25/47 (53%), Gaps = 2/47 (4%) Query: 4 RQQGFTLLEVMAALAIFSMLSVLAFMIFSQASELHQRSQKEIQQFNQ 50 RQ+GFTLLE+M L + + + + + F + + + + + +F Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRD--DSAAQTLARFEA 46
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 156 bits (397), Expect = 3e-49 Identities = 76/166 (45%), Positives = 98/166 (59%), Gaps = 2/166 (1%) Query: 55 VPLILCVAAAIACALAPFTPIVTGALFLYFCFALTLSVIDFRTQLLPDKLTLPLLWLGLV 114 V L+ + + AL L + + L+ ID LLPD+LTLPLLW GL+ Sbjct: 113 VELLTALLSVAVAMTLAPGWGTLAALLLTWVL-VALTFIDLDKMLLPDQLTLPLLWGGLL 171 Query: 115 FNAQSGLIDLHDAVYGAVAGYGVLWCVYWGVWLVCHKEGLGYGDFKLLAAAGAWCGWQTL 174 FN G + L DAV GA+AGY VLW +YW L+ KEG+GYGDFKLLAA GAW GWQ L Sbjct: 172 FNLLGGFVSLGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQAL 231 Query: 175 PMILLIASLGGIGYAIVSQLLQRRTITT-IAFGPWLALGSMINLGY 219 P++LL++SL G I LL+ + I FGP+LA+ I L + Sbjct: 232 PIVLLLSSLVGAFMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLW 277
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 35.2 bits (81), Expect = 3e-05 Identities = 28/150 (18%), Positives = 59/150 (39%), Gaps = 24/150 (16%) Query: 5 TKVINYLNKLLGNE---LVAINQYFLHARMFKNWGLKRLNDVEYHESIDEM-----KHAD 56 T V N LN L N ++++ +W +K + HE +E+ + D Sbjct: 11 TLVENSLNTQLSNWFLLYSKLHRF--------HWYVKGPHFFTLHEKFEELYDHAAETVD 62 Query: 57 RYIERILFLEGLPN--LQDLGKL------NIGEDVEEMLRSDLALELDGAKNLREAIGYA 108 ER+L + G P +++ + EM+++ + + + IG A Sbjct: 63 TIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLA 122 Query: 109 DSVHDYVSRDMMIEILRDEEGHIDWLETEL 138 + D + D+ + ++ + E + L + L Sbjct: 123 EENQDNATADLFVGLIEEVEKQVWMLSSYL 152
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 79.5 bits (196), Expect = 3e-18 Identities = 57/198 (28%), Positives = 87/198 (43%), Gaps = 13/198 (6%) Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGAARAFDQIDNAPEEKARGITINTS 66 +N+G + HVD GKTTLT ++ T L G R DN E+ RGITI T Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59 Query: 67 HVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126 + +D PGH D++ + + +DGAIL+++A DG QTR R++ Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119 Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQYDFPGDDTPIVRGSALKALEGDAEWE 186 G+P I F+NK D + L V +++E LS + + +W+ Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176 Query: 187 AKILELAGFLDSYIPEPE 204 I L+ Y+ Sbjct: 177 TVIEGNDDLLEKYMSGKS 194
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 613 bits (1583), Expect = 0.0 Identities = 178/698 (25%), Positives = 304/698 (43%), Gaps = 81/698 (11%) Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68 + NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 69 TAFWSGMAKQYEPHRINIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128 + W ++NIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++ Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114 Query: 129 QANKYKVPRIAFVNKMDRMGANFLKVVNQIKTRLGANPVPLQLAIGAEEHFTGVVDLVKM 188 K +P I F+NK+D+ G + V IK +L A V Q V M Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164 Query: 189 KAINWNDADQGVTFEYEDIPADMVELANEWHQNLIESAAEASEELMEKYLGGEELTEAEI 248 N+ +++Q ++ E +++L+EKY+ G+ L E+ Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200 Query: 249 KGALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308 + R N + V GSA N G+ +++ + + S Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243 Query: 309 RHASDDEPFSALAFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKAARERFGRIVQMHA 368 FKI L + R+YSGV++ D+V S K + + + Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSIN 299 Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCDPDAPIILERMEFPEPVISIAVEPKT 424 + +I + +G+I L V GDT P ER+E P P++ VEP Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354 Query: 425 KADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVEANVG 484 +E + AL ++ DP R + D +++ I++ +G++ +++ ++ +++VE + Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414 Query: 485 KPQVAYRETIRQKVTDVEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGV 544 +P V Y E +K E + + + + + PL GS G ++ + + G Sbjct: 415 EPTVIYMERPLKK---AEYTIHIEVPPNPFWASIGLSVSPLPLGS---GMQYESSVSLGY 468 Query: 545 IPGEYIPAVDKGIQEQLKAGPLAGYPVVDMGIRLHFGSYHDVDSSELAFKLAASIAFKEG 604 + + AV +GI+ 
+ G L G+ V D I +G Y+ S+ F++ A I ++ Sbjct: 469 LNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQV 527 Query: 605 FKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRGMLKGQESEVTGVKIHAEVPLSEMF 664 KKA LLEP + ++ P+E D + + + + V + E+P + Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQ 587 Query: 665 GYATQLRSLTKGRASYTMEFLKYDEAPSNVAQAVIEAR 702 Y + L T GR+ E Y + V + R Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQPR 622
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 29.0 bits (65), Expect = 0.021 Identities = 14/62 (22%), Positives = 29/62 (46%), Gaps = 1/62 (1%) Query: 160 ASSVEDLVTQTLEFTIEEVNADRNV-SNNAKNRQIVLNLYEKGIFDIKDAINQVADRLNI 218 A +V+D VTQ +E + ++ + S + + + L + D A QV ++L + Sbjct: 54 AQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQL 113 Query: 219 SK 220 + Sbjct: 114 AT 115
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 134 bits (339), Expect = 9e-41 Identities = 85/240 (35%), Positives = 131/240 (54%), Gaps = 12/240 (5%) Query: 14 MAVVLHAPITFAAEAAKPATTADSKAAFKNDDQKSAYALGASLGRYMENSLKEQEKLGIK 73 M +V A + A A AT A S D K +Y++GA LG K + GI Sbjct: 3 MKLVTAAIMGLAMSTAMAATDATS---LTTDKDKLSYSIGADLG-------KNFKNQGID 52 Query: 74 LDKDQLIAGVQDAFA-DKSKLSDQEIEQTLQAFEARVKSSAQAKMEKDAADNEAKGKEYR 132 ++ D L G+QD + + L++++++ L F+ + + A+ K A +N+AKG + Sbjct: 53 INPDVLAKGMQDGMSGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFL 112 Query: 133 EKFAKEKGVKTSSTGLVYQVVEAGKGEAPKDSDTVVVNYKGTLIDGKEFDNSYTRGEPLS 192 + G+ +GL Y++++AG G P SDTV V Y GTLIDG FD++ G+P + Sbjct: 113 SANKSKPGIVVLPSGLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPAT 172 Query: 193 FRLDGVIPGWTEGLKNIKKGGKIKLVIPPELAYGKAGVPG-IPPNSTLVFDVELLDVKPA 251 F++ VIPGWTE L+ + G ++ +P +LAYG V G I PN TL+F + L+ VK A Sbjct: 173 FQVSQVIPGWTEALQLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVKKA 232
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 29.6 bits (66), Expect = 0.006 Identities = 32/135 (23%), Positives = 50/135 (37%), Gaps = 16/135 (11%) Query: 11 YAHPESQDSVANWVLLKPATQLSNVTVHDLYAHYPDFFIDIPREQALLREHEVIVFQH-- 68 Y P + D N V P + + +HD+ ++ D F L + + Sbjct: 9 YQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCV 68 Query: 69 ----PLYTYSCPALLKEWLDRVLSRGFASGPGGNQLAGKYWRSVITTGEPESA------Y 118 P+ + P DR L F GPG N +G Y +IT PE + Sbjct: 69 QLGIPVVYTAQPGSQNP-DDRALLTDFW-GPGLN--SGPYEEKIITELAPEDDDLVLTKW 124 Query: 119 RYDALNRYPMSDVLR 133 RY A R + +++R Sbjct: 125 RYSAFKRTNLLEMMR 139
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 32.7 bits (74), Expect = 0.005 Identities = 28/152 (18%), Positives = 54/152 (35%), Gaps = 22/152 (14%) Query: 504 KVEPFDGDLEDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPLRKE 563 + D + ++ E + + ++ R+ +R R + L E Sbjct: 272 AMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAE 331 Query: 564 IARLEKEME---------------------KLNAQLAQAEEKLGDSELYDQSRKAELTAC 602 +LE++ + +L A+ + EE+ SE QS + +L A Sbjct: 332 HQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDAS 391 Query: 603 LQQQASAKSGLEECEMAWLEAQEQLEQMLLEG 634 + + + LEE L A E+L + L E Sbjct: 392 REAKKQVEKALEEANSK-LAALEKLNKELEES 422 Score = 32.0 bits (72), Expect = 0.008 Identities = 13/125 (10%), Positives = 39/125 (31%), Gaps = 7/125 (5%) Query: 513 EDYQQWLSDVQKQENQTDEAPKENANSAQARKDQKRREAELRAQTQPLRKEIARLEKEME 572 + + ++ + E A A + D ++ + +++ Sbjct: 127 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST-------ADSAKIK 179 Query: 573 KLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLL 632 L A+ A E + + E + TA + + ++ + ++ LE + Sbjct: 180 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMN 239 Query: 633 EGQSN 637 ++ Sbjct: 240 FSTAD 244
>PF07299#Fibronectin-binding protein (FBP) Length = 219 Score = 31.8 bits (72), Expect = 0.002 Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 2/46 (4%) Query: 71 PEANDFGLLEQTFIEYGQSGKGKSRKYLHTYDEAVPWNQVPGTFTP 116 P+ + + E ++ KG SRK++ ++ + + GTF Sbjct: 112 PDMEELDMKELSY--LSWIDKGSSRKFIIAKNDKNKFVGLQGTFQS 155
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 48.0 bits (114), Expect = 2e-08 Identities = 38/171 (22%), Positives = 70/171 (40%), Gaps = 6/171 (3%) Query: 200 REREHGTVEHLLVMPITPFEIMMAKV-WSMGLVVLVVSGLSLVLMVKGVLGVPIEGSIPL 258 R T E +L + +I++ ++ W+ L +G+ +V G + L Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY----TQWLSLL 148 Query: 259 FMLGV-ALSLFATTSIGIFMGTIARSMPQLGLLVILVLLPLQMLSGGSTPRESMPQMVQD 317 + L V AL+ A S+G+ + +A S LV+ P+ LSG P + +P + Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 318 IMLTMPTTHFVSLAQAILYRGAGFEIVWPQFLTLMAIGGAFFTIAQLRFRK 368 +P +H + L + I+ ++ + I FF L R+ Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.045 Identities = 9/26 (34%), Positives = 14/26 (53%) Query: 37 ARCMVGLIGPDGVGKSSLLSLISGAR 62 V L G G+GKS+L++ + G Sbjct: 595 FDYSVVLEGTGGIGKSTLINTLVGLD 620
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 81.8 bits (202), Expect = 3e-19 Identities = 72/408 (17%), Positives = 139/408 (34%), Gaps = 81/408 (19%) Query: 6 RHLAWWGVGALAVAAVVAWLLLRPAGVP-EGFAVSNGRIEATEVDIASKIAGRIDTILVK 64 R +A++ +G L +A +++ L G +GR + I + I+VK Sbjct: 58 RLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKE----IKPIENSIVKEIIVK 113 Query: 65 EGQFVREGEVLAKMDTRV----------------LQEQRLEAI----------------- 91 EG+ VR+G+VL K+ L++ R + + Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173 Query: 92 -------------------AQIKEAQSAVAAAQALLEQRQSETRAAQSLVNQRQAELDSV 132 Q Q+ + L+++++E + +N+ + Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 133 AKRHTRSRSLAQRGAISAQQLDDDRAAAESARAALESAKAQVSASKAAIEAARTNIIQ-- 190 R SL + AI+ + + A L K+Q+ ++ I +A+ Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 191 -----------AQTRVEAAQATERRIAADID--DSELKAPRDGRV-QYRVAEPGEVLAAG 236 QT T + S ++AP +V Q +V G V+ Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 237 GRVLNMVDLSDVY-MTFFLPTEQAGTLKLGGEARLILDAAPDLRIPATISFVASVAQFTP 295 ++ +V D +T + + G + +G A + ++A P R V V Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYG---YLVGKVKNINL 410 Query: 296 KTVETSDERLKLMFRVKARIPPELLQQHLEYV--KTGLPGVAWVRVNE 341 +E D+RL L+F V I L + + +G+ A ++ Sbjct: 411 DAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 29.0 bits (65), Expect = 0.033 Identities = 23/98 (23%), Positives = 38/98 (38%), Gaps = 18/98 (18%) Query: 226 ENLLFTHRGLSGPAVLQISSYWQPGEFVSINLLPDVDLETFL--NEQRNAHPNQSLKNTL 283 E + RG GP +L + ++ + + + L T + N Q A N LK L Sbjct: 63 EAITLRERGWKGP-ILMLEGFFHAQD---LEIYDQHRLTTCVHSNWQLKALQNARLKAPL 118 Query: 284 AVHL------------PKRLVERLQQLGQIPDVSLKQL 309 ++L P R++ QQL + +V L Sbjct: 119 DIYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTL 156
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 38.7 bits (90), Expect = 3e-05 Identities = 34/208 (16%), Positives = 72/208 (34%), Gaps = 13/208 (6%) Query: 33 IIVEFLPVSLLTP----MAQDLGISEGVA---GQSVTVTAFVAMFASLFITQTIQATDRR 85 + ++ + + L+ P + +DL S V G + + A + + + RR Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRR 73 Query: 86 YVVILFAVLLTLSCLLVSFANSFSLLLIGRACLGVALGGFWAISASLTMRLVPPRTVPKA 145 V+++ + +++ A +L IGR G+ G A++ + + + Sbjct: 74 PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARH 132 Query: 146 LSVIFGAVSIALVIAAPLGSFLGELIGWRNVFNAAAAMG----VLCIFWIIKSLPSLPGE 201 + +V LG +G F AAAA+ + F + +S Sbjct: 133 FGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP 191 Query: 202 PSHQKQNTFRLLQRPGVMAGMIAIFMSF 229 + N + M + A+ F Sbjct: 192 LRREALNPLASFRWARGMTVVAALMAVF 219
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 37.8 bits (88), Expect = 1e-04 Identities = 30/105 (28%), Positives = 43/105 (40%), Gaps = 17/105 (16%) Query: 22 AVSRGDAVADYIIDNVSILDLINGGEISGPIVIKGRYIAGVG-AEYAD---------APA 71 V+R D +I N ILD + G + I +K IA +G A D P Sbjct: 60 QVTREGGAVDTVITNALILD--HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117 Query: 72 LQRIDARGATAVPGFIDAHLHIESSMMTPVTFETATLPRGLTTVI 116 + I G G +D+H+H + P E A L GLT ++ Sbjct: 118 TEVIAGEGKIVTAGGMDSHIH----FICPQQIEEA-LMSGLTCML 157
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.1 bits (78), Expect = 0.001 Identities = 28/168 (16%), Positives = 61/168 (36%), Gaps = 17/168 (10%) Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108 N++ D+ + + + F +T+ +G + +D K+ L F +I++ C Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--C 90 Query: 109 MLGFSASMGSGSVSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164 +G SL +M + F Q G + + + ++ P+ RG G Sbjct: 91 FGSVIGFVGHSFFSLLIM------ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144 Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRY 212 +G + A+Y+ + + + P + I+ L Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFLMK 187
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 40.2 bits (94), Expect = 1e-05 Identities = 65/408 (15%), Positives = 137/408 (33%), Gaps = 60/408 (14%) Query: 29 RHILLTIWLGYALFY--FTRKSFNAAVPEILANGVLSRSDIGLLATLFYITYGVSKFVSG 86 RH + IWL F+ N ++P+I + + + T F +T+ + V G Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 87 IVSDRSNARYFMGIGLIATGIINILFGFSTSLWAFAVLWVLNAFFQGWGS---PVCARLL 143 +SD+ + + G+I +++ S F L ++ F QG G+ P ++ Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127 Query: 144 TAWY-SRTERGGWWALWNTAHNVGGALIPIVMAASALHYGWRAGMMIAGCMAIVVGIFLC 202 A Y + RG + L + +G + P + A + W ++ M ++ + Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPF- 184 Query: 203 WRLRDRPQALGLPAVGEWRHDALEIAQQQEGAGLTRKEILTKYVLLNPYIWLLSFCYVLV 262 + L +I G L I+ + Y VL Sbjct: 185 --------LMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232 Query: 263 YVV-----RAAINDWGNLYMSETLGVDLVTANTAVTMFELGGFI-----------GALVA 306 +++ R + + + + + + + + + GF+ A Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292 Query: 307 GWGSDKLFNGNRGPMNLIFAAGILL-SVGSLWLMPFASYVMQATCFFTIGFFVFGPQMLI 365 GS +F G + + GIL+ G L+++ + + F T F + + Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL-SVSFLTASFLLETTSWFM 351 Query: 366 ---------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 395 G++ + ++ AGA + ++L Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399
>PF06580#Sensor histidine kinase Length = 349 Score = 39.5 bits (92), Expect = 2e-05 Identities = 28/142 (19%), Positives = 57/142 (40%), Gaps = 11/142 (7%) Query: 365 LRPRQLDDLTLEQAIRSLMREMELEGRGIVSHLEWRIDESALSENQRVTLFRVCQEGLNN 424 LR ++L + + ++L L++ + + +V + Q + N Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266 Query: 425 IVKHA-----DASAVTLQGWQHDERLMLVIEDDGSGLPPDSGQ-HGFGLTGMRERVTALG 478 +KH + L+G + + + L +E+ GS ++ + G GL +RER+ L Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326 Query: 479 G---TLTISCLHG-TRVSVSLP 496 G + +S G V +P Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 61.4 bits (149), Expect = 2e-13 Identities = 29/174 (16%), Positives = 59/174 (33%), Gaps = 20/174 (11%) Query: 2 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGRGVQVCICDISMPDIS 61 T+ + DD +R+ Q L V + + + + D+ MPD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVHTVATG 118 +LL ++ K + +++S ++ +A GA +L K ELI + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---- 117 Query: 119 GCYLTPDIAIKLASGRQDPLTKRERQVAEKLAQG---MAVKEIAAELGLSPKTV 169 A+ R L + + + + + A L + T+ Sbjct: 118 --------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 58.7 bits (142), Expect = 1e-11 Identities = 41/184 (22%), Positives = 80/184 (43%), Gaps = 1/184 (0%) Query: 7 RNVNLLLMLVLLVAVGQMAQTIYIPAIADMARDLNVREGAVQSVMGAYLLTYGVSQLFYG 66 R+ +L+ L +L + + + ++ D+A D N + V A++LT+ + YG Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 67 PISDRVGRRPVILVGMSIFMLATLVA-VTTSSLTVLIAASAMQGMGTGVGGVMARTLPRD 125 +SD++G + ++L G+ I +++ V S ++LI A +QG G + + Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130 Query: 126 LYERTQLRHANSLLNMGILVSPLLAPLIGGLLDTMWNWRACYLFLLALCAGVTFSMARWM 185 + A L+ + + + P IGG++ +W L + V F M Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLK 190 Query: 186 PETR 189 E R Sbjct: 191 KEVR 194
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 120 bits (302), Expect = 2e-39 Identities = 50/89 (56%), Positives = 66/89 (74%) Query: 2 NKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTFKVNHRAERTGR 61 NK LI +AE EL+K + AA+++ +A++ L +G+ VQL+GFG F+V RA R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEIKIAAANVPAFVSGKALKDAVK 90 NPQTG+EIKI A+ VPAF +GKALKDAVK Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>PF06580#Sensor histidine kinase Length = 349 Score = 38.7 bits (90), Expect = 3e-05 Identities = 49/262 (18%), Positives = 105/262 (40%), Gaps = 43/262 (16%) Query: 197 ILFALATVLLA-SVLSFFW-YRRYLRSRQLLQDEMKRKEKLVALGHLAAGV-AHEIRNPL 253 I+F + V S+L F W + + + ++ Q +M + L L A + H + N L Sbjct: 120 IIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNAL 179 Query: 254 SSIKGLAKYFAERASAGGEAHQLAQVM---AKEADRLNRVVSELLELVKPTHLALQAVEL 310 ++I+ L +A L+++M + ++ +++ L +V ++L L +++ Sbjct: 180 NNIRALILEDPTKAREM--LTSLSELMRYSLRYSNARQVSLADELTVVD-SYLQLASIQF 236 Query: 311 NTLINHSLQLVSQDANSREIQLRFTANDTLPEIQADPDRLTQVLL-NLYLNAIQAIGQHG 369 + Q+ + ++Q+ P L Q L+ N + I + Q G Sbjct: 237 EDRLQFENQI---NPAIMDVQV--------------PPMLVQTLVENGIKHGIAQLPQGG 279 Query: 370 VISVTASESGAGVKISVTDSGKGIAADQLEAIFTPYFTTKAEGTGLGLAVVHNIVEQHGG 429 I + ++ V + V ++G + E TG GL V ++ G Sbjct: 280 KILLKGTKDNGTVTLEVENTGSLALKNT------------KESTGTGLQNVRERLQMLYG 327 Query: 430 ---TIQVASQEGKGATFTLWLP 448 I+++ ++GK + +P Sbjct: 328 TEAQIKLSEKQGKV-NAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 527 bits (1359), Expect = 0.0 Identities = 184/468 (39%), Positives = 253/468 (54%), Gaps = 35/468 (7%) Query: 8 ILVVDDDISHCTILQALLRGWGYNVALANSGRQALEQVREQVFDLVLCDVRMAEMDGIAT 67 ILV DDD + T+L L GY+V + ++ + DLV+ DV M + + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 68 LKEIKTLNPAIPVLIMTAYSSVETAVEALKTGALDYLIKPLDFDNLQATLEKALAHTHSV 127 L IK P +PVL+M+A ++ TA++A + GA DYL KP D L + +ALA Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125 Query: 128 DAETPAVSASQFGMVGKSPAMQHLLSEIALVAPSEATVLIHGDSGTGKELVARAIHASSA 187 ++ S +VG+S AMQ + +A + ++ T++I G+SGTGKELVARA+H Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGK 185 Query: 188 RSEKPLVTLNCAALNESLLESELFGHEKGAFTGADKRREGRFVEADGGTLFLDEIGDISP 247 R P V +N AA+ L+ESELFGHEKGAFTGA R GRF +A+GGTLFLDEIGD+ Sbjct: 186 RRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPM 245 Query: 248 MMQVRLLRAIQEREVQRVGSNQTISVDVRLIAATHRDLAAEVNAGRFRQDLYYRLNVVAI 307 Q RLLR +Q+ E VG I DVR++AAT++DL +N G FR+DLYYRLNVV + Sbjct: 246 DAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPL 305 Query: 308 EVPSLRQRREDIPLLANHFLQRFAERNRKAVKGFTPQAMDLLIHYDWPGNIRELENAVER 367 +P LR R EDIP L HF+Q+ + VK F +A++L+ + WPGN+RELEN V R Sbjct: 306 RLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWPGNVRELENLVRR 364 Query: 368 AVVLLTGEYISERELPLAIASTPIPLVQSQDIQP-------------------------- 401 L + I+ + + S + Sbjct: 365 LTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALP 424 Query: 402 --------LVEVEKEVILAALEKTGGNKTEAARQLGITRKTLLAKLSR 441 L E+E +ILAAL T GN+ +AA LG+ R TL K+ Sbjct: 425 PSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 31.7 bits (72), Expect = 6e-04 Identities = 23/62 (37%), Positives = 32/62 (51%), Gaps = 9/62 (14%) Query: 37 IANFFVAEKVLQDLVLQLHPRSTWHSFLPAKRMDIVVSALEMNEGGLSQVEERILHEVVA 96 IA+FFV EK+LQ + Q+H S P+ R+ + V G +QVE R + E Sbjct: 81 IADFFVTEKMLQHFIKQVHSNSF---MRPSPRVLVCVPV------GATQVERRAIRESAQ 131 Query: 97 GA 98 GA Sbjct: 132 GA 133
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.8 bits (77), Expect = 1e-04 Identities = 16/54 (29%), Positives = 21/54 (38%), Gaps = 5/54 (9%) Query: 78 IDPDVRGCGVGRMLVKHALSMAPE-----LTTNVNEQNEQAVGFYKKVGFKVTG 126 + D R GVG L+ A+ A E L + N A FY K F + Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 32.3 bits (73), Expect = 0.004 Identities = 14/58 (24%), Positives = 23/58 (39%) Query: 289 ETSTPDLELARRFAQAIHAKYPGKLLAYNCSPSFNWQKNLDDKTIASFQQQLSDMGYK 346 ET+ PD+ L A P L Y + N D +T + + QL+++ Sbjct: 544 ETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQLAELNAT 601
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 31.8 bits (72), Expect = 0.019 Identities = 19/87 (21%), Positives = 37/87 (42%), Gaps = 17/87 (19%) Query: 348 SGLEPLNIGEDSLFVNVGERTN---VTGSA----KFKRLIKEEKYSEALDVARQQVENGA 400 +P+ + ++ + +TN VT + +R+I + LD+ R QV A Sbjct: 298 QAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQ------LDIRRPQVLVEA 351 Query: 401 QIIDINMDEGMLDAEAAMVRFLNLIAG 427 I ++ D L+ +++ N AG Sbjct: 352 IIAEVQ-DADGLNLG---IQWANKNAG 374
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.020 Identities = 12/22 (54%), Positives = 13/22 (59%) Query: 32 MVALLGPSGSGKSTLLRHLSGL 53 V L G G GKSTL+ L GL Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 26.5 bits (58), Expect = 0.045 Identities = 11/42 (26%), Positives = 23/42 (54%) Query: 66 IAGSDIMMSDAIPSGKASYSGFTLVLDSQQVEEGKRWFDNLA 107 I+GS+ + S + + Y G + ++Q + +R+F NL+ Sbjct: 189 ISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLS 230
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 43.3 bits (102), Expect = 2e-06 Identities = 57/290 (19%), Positives = 105/290 (36%), Gaps = 55/290 (18%) Query: 85 FFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYDTIGIWAPILLLICKMAQGFSVGGE 144 G L D++GR+ +L +++ ++ + P +W +L I ++ G + G Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIVAGIT-GAT 112 Query: 145 YTGASIFVAEYSPDRKR----GFMGSWLDFGSIAGFVLGAGVVVLISTIVGEANFLDWGW 200 A ++A+ + +R GFM + FG +AG VLG G++ S Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSP------------ 159 Query: 201 RIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDREGLQDGPKVSFKEIATKYWRS 260 PFF A L + L K E+ P SF+ W Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESH------KGERRPLRREALNPLASFR------WAR 207 Query: 261 LLTCIGLVIATNVTYYML----LTYMPSYLSHNLHYS-EDHGVLIIIAIMIGMLFVQPVM 315 +T + ++A ++ + H+ G+ + ++ L + Sbjct: 208 GMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMIT 267 Query: 316 GLLSDRFGRRPFVLLG----SVALFVLA--------IPAFILINSNVIGL 353 G ++ R G R ++LG +LA P +L+ S IG+ Sbjct: 268 GPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGM 317 Score = 39.0 bits (91), Expect = 4e-05 Identities = 39/164 (23%), Positives = 73/164 (44%), Gaps = 16/164 (9%) Query: 286 LSHNLHYSEDHGVLI-IIAIMIGMLFVQPVMGLLSDRFGRRPFVLLGSVALFVLAIPAFI 344 L H+ + +G+L+ + A+M PV+G LSDRFGRRP +L+ L A+ I Sbjct: 35 LVHSNDVTAHYGILLALYALM--QFACAPVLGALSDRFGRRPVLLVS---LAGAAVDYAI 89 Query: 345 LINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIR---YSALAAAFNISVLVAG 401 + + + +++ G ++A I V + + + R + ++A F +VAG Sbjct: 90 MATAPFLWVLYIG-RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFG-MVAG 147 Query: 402 LTPTLAAWLVESSQNLMMPAYYLMVVAVVGLITG-VTMKETANR 444 P L + S + P + + + +TG + E+ Sbjct: 148 --PVLGGLMGGFSPH--APFFAAAALNGLNFLTGCFLLPESHKG 187
>PF06580#Sensor histidine kinase Length = 349 Score = 36.8 bits (85), Expect = 1e-04 Identities = 40/182 (21%), Positives = 80/182 (43%), Gaps = 34/182 (18%) Query: 181 ARLDQMMESVSQLLQLARAGQSFSSGNYQHVKLLEDV-ILPSYDELSTML--DQRQQTLL 237 + +M+ S+S+L++ S N + V L +++ ++ SY +L+++ D+ Q Sbjct: 191 TKAREMLTSLSELMR-----YSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245 Query: 238 LPESAADITVQGDATLLRMLLRNLVENAHRY----SPQGSNIMIKLQEDGGAV-MAVEDE 292 + + D+ V ML++ LVEN ++ PQG I++K +D G V + VE+ Sbjct: 246 INPAIMDVQV------PPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299 Query: 293 GPGIDESKCGELSKAFVRMDSRYGGIGLGLSIV-SRITQLHHGQFFLQNRQETSGTRAWV 351 G + + G GL V R+ L+ + ++ ++ A V Sbjct: 300 GSLA--------------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 352 RL 353 + Sbjct: 346 LI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 90.7 bits (225), Expect = 2e-23 Identities = 41/121 (33%), Positives = 60/121 (49%) Query: 2 KILIVEDDTLLLQGLILAAQTEGYACDGVTTARMAEQSLEAGHYSLVVLDLGLPDEDGLH 61 IL+ +DD + L A GY + A + + AG LVV D+ +PDE+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 FLARIRQKKYTLPVLILTARDTLTDKIAGLDVGADDYLVKPFALEELHARIRALLRRHNN 121 L RI++ + LPVL+++A++T I + GA DYL KPF L EL I L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 Q 122 + Sbjct: 125 R 125
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 93.0 bits (231), Expect = 5e-27 Identities = 37/138 (26%), Positives = 70/138 (50%), Gaps = 3/138 (2%) Query: 21 LITEKSYLSQEEIRRDLQNHGFDSISQSTVSRLLKLLGVIKIRNTKGQKIYSVNPQLL-- 78 +IT +Q+E+ L+ G++ ++Q+TVSR +K L ++K+ G YS+ Sbjct: 13 IITANEIETQDELVDILKKDGYN-VTQATVSRDIKELHLVKVPTNNGSYKYSLPADQRFN 71 Query: 79 PTPDAGRSVAEMVLSVEHNGEFILIHTVAGYGRAVARILDFHALPEILGVIAGSNIVWVA 138 P RS+ + + ++ I++ T+ G +A+ ++D EI+G I G + + + Sbjct: 72 PLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEIMGTICGDDTILII 131 Query: 139 PRVVKRTALVHKQINYLL 156 R T +V K+I LL Sbjct: 132 CRTHDDTKVVQKKILELL 149
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 371 bits (954), Expect = e-132 Identities = 136/309 (44%), Positives = 180/309 (58%), Gaps = 13/309 (4%) Query: 6 TLVIALGGNALLKRGEPLEAEIQRKNIDLAAKTIAQL-TQHWRVVLVHGNGPQVGLLALQ 64 +VIALGGNAL +RG+ E N+ A+ IA++ + + VV+ HGNGPQVG L L Sbjct: 4 RVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLLH 63 Query: 65 NSA---YAHVAPYPLDILGAESQGMIGYMLQQALKNQLPQREISV----LLTQVEVDAND 117 A + P+D+ GA SQG IGYM+QQALKN+L +R + ++TQ VD ND Sbjct: 64 MDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKND 123 Query: 118 PAFSNPTKYIGPIYDHAQTQVLQAEKGWVFKAD-GHSFRRVVPSPQPKRIVERDAIQTLI 176 PAF NPTK +GP YD + L EKGW+ K D G +RRVVPSP PK VE + I+ L+ Sbjct: 124 PAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKLV 183 Query: 177 AHDHLVICNGAGGVPVVEKADGYHGIEAVIDKDLSAALLASQIHADALLILTDADAVYLD 236 +VI +G GGVPV+ + G+EAVIDKDL+ LA +++AD +ILTD + L Sbjct: 184 ERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAALY 243 Query: 237 WGKPTQRPLAQVTPE----LLNEMQFDAGSMGPKVTACAKFVSQCRGIAGIGSLADGPEI 292 +G ++ L +V E E F AGSMGPKV A +F+ A I L E Sbjct: 244 YGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVEA 303 Query: 293 LAGDKGTLI 301 L G GT + Sbjct: 304 LEGKTGTQV 312
>ARGDEIMINASE#Bacterial arginine deiminase signature. Length = 409 Score = 418 bits (1075), Expect = e-147 Identities = 140/407 (34%), Positives = 226/407 (55%), Gaps = 13/407 (3%) Query: 6 VGSEIGQLCSVMLHRPNLSLKRLTPSNCQELLFDDVLSVERAGEEHDIFANTLRQQGIEV 65 + SEIG+L V+LHRP L+ LTP + LFDD+ +E A +EH++FA+ L+ +E+ Sbjct: 10 IFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNNLVEI 69 Query: 66 LLLTDLLTQTLDIPEA-KSWLLETQISDYRLGPTFATD-VRTWLAEMSHRDLARHLSGGL 123 + DL+++ L A ++ + I + + F + ++ + + ++ ++ + G+ Sbjct: 70 EYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSSLTIDNMISKMISGV 129 Query: 124 TYSEIPASIKNMVVDTHDINDFIMKPLPNHLFTRDTSCWIYNGVSINPMAKPARQRETNN 183 E+ ++ + N FI+ P+PN LFTRD I NGV+IN M RQRET Sbjct: 130 VTEELKNYTSSLDDLVNGANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTKVRQRETIF 189 Query: 184 LRAIYRWHPQFAGGEFIKYFGDENINYDHATLEGGDVLVIGRGAVLIGMSERTTPQGVEF 243 I+++HP + + + A+LEGGD LV+ +G ++IG+SERT + VE Sbjct: 190 AEYIFKYHPVY-KENVPIWLNRW----EEASLEGGDELVLNKGLLVIGISERTEAKSVEK 244 Query: 244 LAQALFKHRQA-ERVIAVELPKHRYCMHLDTVMTHIDIDTFSVYPEVVRPDVNCWTLTPD 302 LA +LFK++ + + ++A ++PK+R MHLDTV T ID F+ + + + LT + Sbjct: 245 LAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSDDM-YFSIYVLTYN 303 Query: 303 GHGG--LKRTQESTLLHAIEKALGIDQVRLI-TTGGDAFEAEREQWNDANNVLTLRPGVV 359 + +++ + + LG ++ +I GGD REQWND NVL + PG + Sbjct: 304 PSSSKIHIKKEKARIKDVLSFYLG-RKIDIIKCAGGDLIHGAREQWNDGANVLAIAPGEI 362 Query: 360 VGYERNIWTNEKYDKAGITVLPIPGDELGRGRGGARCMSCPLHRDGI 406 + Y RN TN+ +++ GI V IP EL RGRGG RCMS PL R+ I Sbjct: 363 IAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 32.2 bits (73), Expect = 5e-04 Identities = 15/48 (31%), Positives = 18/48 (37%) Query: 97 PAIRGKGLAKKLALMAMEQAREMGFKRCYLETTAFLKEAIALYEHLGF 144 R KG+ L A+E A+E F LET A Y F Sbjct: 99 KDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 151 bits (383), Expect = 1e-46 Identities = 46/110 (41%), Positives = 65/110 (59%) Query: 1 MIEVPAAIMIAEKLASEVDFFSIGTNDLTQYIMAADRGNSTVAKLVDYCNDAVINAIAMV 60 M+E+P+ + A A EVDFFSIGTNDL QY MAADR N V+ L + A++ + MV Sbjct: 430 MVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPAILRLVDMV 489 Query: 61 CQAGRNNEIPVSMCGEMAGDIQQTARLLTMGIDKLSASPSRLPALKAAIR 110 +A + V MCGEMAGD LL +G+D+ S S + + ++ + Sbjct: 490 IKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLL 539
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 35.1 bits (81), Expect = 4e-04 Identities = 30/129 (23%), Positives = 48/129 (37%), Gaps = 33/129 (25%) Query: 26 CDVLIANGKIIAVASNIPSDIVPDCT--------VVDLSGQILCPGFIDQHVHLIGG--- 74 D+ + +G+I A+ D+ P T V+ G+I+ G +D H+H I Sbjct: 86 ADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFICPQQI 145 Query: 75 ------------GGEAGP------TTRTP-EVALSRLTEA--GITSVVGLLGTDSISRHP 113 GG GP TT TP ++R+ EA + G + S P Sbjct: 146 EEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIEAADAFPMNLAFAGKGNAS-LP 204 Query: 114 ESLLAKTRA 122 +L+ Sbjct: 205 GALVEMVLG 213
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.8 bits (67), Expect = 0.018 Identities = 65/317 (20%), Positives = 113/317 (35%), Gaps = 26/317 (8%) Query: 82 RPFLLASALASGLLILAMAWLLPFILVLLIRVLAGV-----ASAGMLIFGSTLIMQHTRH 136 RP LL S + + MA ++ + R++AG+ A AG I T + RH Sbjct: 73 RPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARH 132 Query: 137 PFVLAALFSGVGIGIALGNEYVLAGLHFDLSSQTLWQGAGALSGMMLIALTLLMP-SKKH 195 ++A F G G+ G VL GL S + A AL+G+ + L+P S K Sbjct: 133 FGFMSACF---GFGMVAGP--VLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKG 187 Query: 196 AITPMPLAKTEQQIMSWW---------LLAILYGLAGFGYIIVATYLPLMAKDAGSPLLT 246 P+ W L+A+ + + G + A ++ T Sbjct: 188 ERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATT 247 Query: 247 AHLWTLVGLSIVPGCFGWLWA---AKRWGALPCLTANLLVQAI-CVLLTLASDSPLLLII 302 + +L I+ + A R G L ++ +LL A+ + I Sbjct: 248 IGI-SLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPI 306 Query: 303 SSLGFGGTFMGTTSLVMTIARQLSVPGNLNLLGFVTLIYGIGQILGPALTSMLGNGTSAL 362 L +G +L ++RQ+ L G + + + I+GP L + + + Sbjct: 307 MVL-LASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITT 365 Query: 363 ASATLCGAAALFIAALI 379 + A A + Sbjct: 366 WNGWAWIAGAALYLLCL 382
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 28.8 bits (64), Expect = 0.017 Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 4/45 (8%) Query: 145 PLLVSHGIALGCLVSTILGLPAWAERRLRLRNCSISRVDYQESLW 189 P +V GIA G V T+ GL W ++ N D + +W Sbjct: 42 PAIVG-GIATGAAVGTVSGLLGWGLKQAEEAN---KTPDKPDKVW 82
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 89.5 bits (222), Expect = 9e-23 Identities = 34/139 (24%), Positives = 61/139 (43%) Query: 1 MQRETVWLVEDEQGIADTLVYMLQQEGFDVEVFERGLPVLDKARQQVPDVMILDVGLPDI 60 M T+ + +D+ I L L + G+DV + + D+++ DV +PD Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 SGFELCRQLLALHPALPVLFLTARSEEVDRLLGLEIGADDYVAKPFSPREVCARVRTLLR 120 + F+L ++ P LPVL ++A++ + + E GA DY+ KPF E+ + L Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 RVKKFSTPSPVIRIGHFEL 139 K+ + L Sbjct: 121 EPKRRPSKLEDDSQDGMPL 139
>PF06580#Sensor histidine kinase Length = 349 Score = 32.9 bits (75), Expect = 0.003 Identities = 43/182 (23%), Positives = 73/182 (40%), Gaps = 40/182 (21%) Query: 312 LRQARLENRQEVVLTAVDVAALFR---RVSEARTVQLAE--KNITLHV----------MP 356 +R LE+ + ++ L R R S AR V LA+ + ++ + Sbjct: 182 IRALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQ 241 Query: 357 TEINVAAE------PALLEQAL-GNLLDNAIDFTPESGRITLSAEVDQEYVTLKVLDTGS 409 E + P +L Q L N + + I P+ G+I L D VTL+V +TGS Sbjct: 242 FENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGS 301 Query: 410 GIPDYALSRIFERFYSLPRANGQKSSGLGLAFVSE-VARLFNGEVTLR-NVQEGGVLASL 467 N ++S+G GL V E + L+ E ++ + ++G V A + Sbjct: 302 LALK----------------NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 468 RL 469 + Sbjct: 346 LI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 81.8 bits (202), Expect = 4e-20 Identities = 30/122 (24%), Positives = 60/122 (49%), Gaps = 1/122 (0%) Query: 1 MQTPHILIVEDELVTRNTLKSIFEAEGYDVFEATDGAEMHQILSEYDINLVIMDINLPGK 60 M IL+ +D+ R L GYDV ++ A + + ++ D +LV+ D+ +P + Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 NGLLLARELRE-QANVALMFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLS 119 N L +++ + ++ ++ ++ ++ + I E GA DY+ KPF+ EL L+ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 120 RT 121 Sbjct: 121 EP 122
Database: VIFASCDB
Posted date: Jun 1, 2014 9:04 PM
Number of letters in database: 79,683
Number of sequences in database: 213
Lambda K H
0.322 0.138 0.400
Gapped
Lambda K H
0.267 0.0533 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 213
Number of Hits to DB: 207,414,809
Number of extensions: 9395514
Number of successful extensions: 36166
Number of sequences better than 5.0e-02: 1164
Number of HSP's gapped: 33567
Number of HSP's successfully gapped: 2291
Length of database: 79,683
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)