>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 141 bits (357), Expect = 3e-39 Identities = 84/388 (21%), Positives = 152/388 (39%), Gaps = 86/388 (22%) Query: 5 IGIDLGTTNSCVAIMDGTQARVLENAEGDRTTPSIIAYTQDGET------LVGQPAKRQA 58 + IDLGT N+ + + Q VL PS++A QD VG AK+ Sbjct: 13 LSIDLGTANTLIYVKG--QGIVLNE-------PSVVAIRQDRAGSPKSVAAVGHDAKQML 63 Query: 59 VTNPQNTLFAIKRLIGRRFQDEEVQRDVSIMPYKIIGADNGDAWLDVKGQKMAPPQISAE 118 P N + AI+ +K +A ++ + Sbjct: 64 GRTPGN-IAAIR---------------------------------PMKDGVIADFFVTEK 89 Query: 119 VLKK-MKKTAEDYLGEPVTEAVITVPAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAAL 177 +L+ +K+ + P ++ VP +R+A +++ + AG +I EP AAA+ Sbjct: 90 MLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAI 149 Query: 178 AYGL--DKEVGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEVLATNGDTHLGGEDFDTR 235 GL + G+ V D+GGGT ++++I ++ V + +GG+ FD Sbjct: 150 GAGLPVSEATGS---MVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEA 197 Query: 236 LINYLVDEFKKDQGIDLRNDPLAMQRLKEAAEKAKIELSSA----QQTDVNLPYITADAT 291 +INY+ + G + AE+ K E+ SA + ++ + Sbjct: 198 IINYVRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEG 244 Query: 292 GPKHMNIKVTRAKLESLVEDLVNRSIEPLKVALQD-AGLSVSDIND--VILVGGQTRMPM 348 P+ + + LE+L E + + + VAL+ SDI++ ++L GG + Sbjct: 245 VPRGFTLN-SNEILEALQEP-LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRN 302 Query: 349 VQKKVAEFFGKEPRKDVNPDEAVAIGAA 376 + + + E G +P VA G Sbjct: 303 LDRLLMEETGIPVVVAEDPLTCVARGGG 330
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 836 bits (2160), Expect = 0.0 Identities = 440/864 (50%), Positives = 582/864 (67%), Gaps = 21/864 (2%) Query: 17 PVLTPLALAIALAPAPGWAENYFNPAFLSDDPSAVADLSTFSR-NAQAAGMYRVDVYLNN 75 V +A A A AE YFNP FL+DDP AVADLS F G YRVD+YLNN Sbjct: 27 FVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNN 86 Query: 76 TFLATRDIAFQAVKTTGKSAPTDDSGLRACLTPEMLKNMGVNTGAFPLLAKAAAGSCPDL 135 ++ATRD+ F + G+ CLT L +MG+NT + + A +C L Sbjct: 87 GYMATRDVTFNTGD--------SEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPL 138 Query: 136 ASAIPAARTRFDFAQQRLDISIPQAAMVASARGYIPPKYWDEGINALLLNYTFTGANSQD 195 S I A + D QQRL+++IPQA M ARGYIPP+ WD GINA LLNY F+G + Q+ Sbjct: 139 TSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQN 198 Query: 196 RSPGGSAENSYFLGLNSGLNLGAWRLRDYSTWNANSGDQNSDS--DWQHISTYLERDVVF 253 R G S + +L L SGLN+GAWRLRD +TW+ NS D +S S WQHI+T+LERD++ Sbjct: 199 RIGGNS--HYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIP 256 Query: 254 LQGELTAGDSYTPSALFDSLPFRGLQLASDDNMLPDSMKGFAPTIHGIARSNAQVTIRQN 313 L+ LT GD YT +FD + FRG QLASDDNMLPDS +GFAP IHGIAR AQVTI+QN Sbjct: 257 LRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQN 316 Query: 314 GYIINQRYVPPGAFTINDLYPTAASGDLTVEVKESDGSINRYNVPYSAVPILQREGRLKY 373 GY I VPPG FTIND+Y SGDL V +KE+DGS + VPYS+VP+LQREG +Y Sbjct: 317 GYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRY 376 Query: 374 AATVAEYRSDSSQKEKVKFSQATLIWGLPHGFTLYGGTQLSSHYHALAIGSGANLGDWGA 433 + T EYRS ++Q+EK +F Q+TL+ GLP G+T+YGGTQL+ Y A G G N+G GA Sbjct: 377 SITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGA 436 Query: 434 VSLDVTQATSTLADNNTYQGQSLRFLYAKSLAQSGTNLQLMGYRYSTSGFYTLDDTTWKR 493 +S+D+TQA STL D++ + GQS+RFLY KSL +SGTN+QL+GYRYSTSG++ DTT+ R Sbjct: 437 LSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSR 496 Query: 494 MSGYDDDSRTDSDKSRPEWADYYNLYYTRRGKVQLDINQQLGGLGSLFITGSQQSYWHTD 553 M+GY+ +++ + +P++ DYYNL Y +RGK+QL + QQLG +L+++GS Q+YW T Sbjct: 497 MNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTS 556 Query: 554 EKDSLLQVGYSDTLAGIAWSVSYNNNKSAGDAERDQIFALNISVPLSQWLQHGDEVTRHH 613 D Q G + I W++SY+ K+A RDQ+ ALN+++P S WL D ++ Sbjct: 557 NVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWL-RSDSKSQWR 615 Query: 614 NVYATFSTSTDKQHNVTQNAGLSGTLLDENNLSYNIQQGYQNHGIGESGA---ASLEYDG 670 + A++S S D +T AG+ GTLL++NNLSY++Q GY G G SG+ A+L Y G Sbjct: 616 HASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRG 675 Query: 671 AKGNANIGYNVSDNGDYQQVNYGLSGGLVAHAHGVTLSQPLGNTNILIAAPGAANVGVVD 730 GNANIGY S + D +Q+ YG+SGG++AHA+GVTL QPL +T +L+ APGA + V + Sbjct: 676 GYGNANIGY--SHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVEN 733 Query: 731 QPGIHTDARGYAVVPYATTYRQNRMALDVNAMADDVDIDDAVTRVVPTEGALVLARFKAR 790 Q G+ TD RGYAV+PYAT YR+NR+ALD N +AD+VD+D+AV VVPT GA+V A FKAR Sbjct: 734 QTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKAR 793 Query: 791 VGARALVTLNHNGKPVPFGATVTVNDRHAEAIVDEAGEVYLSGLSAQGVLHVRWGNLPDQ 850 VG + L+TL HN KP+PFGA VT + IV + G+VYLSG+ G + V+WG + Sbjct: 794 VGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENA 853 Query: 851 QCVASYHL--SSSRQILSRQHAEC 872 CVA+Y L S +Q+L++ AEC Sbjct: 854 HCVANYQLPPESQQQLLTQLSAEC 877
>PF06580#Sensor histidine kinase Length = 349 Score = 29.8 bits (67), Expect = 0.025 Identities = 15/79 (18%), Positives = 27/79 (34%), Gaps = 3/79 (3%) Query: 4 RRQPLIPGWLIPGLCAAALMITVSLAAFLALWLNAPSGAWSTIWRDSYLWHVVRFSFWQA 63 R GWL + L + + +W A + W + +++ Sbjct: 60 RSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLL---AFINTKPVAFTLPL 116 Query: 64 FLSAVLSVVPAVFLARALY 82 LS + +VV F+ LY Sbjct: 117 ALSIIFNVVVVTFMWSLLY 135
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 499 bits (1286), Expect = 0.0 Identities = 245/296 (82%), Positives = 267/296 (90%) Query: 1 MRDLYPLTRRRLLTAMALSPLLWQMNTAQAAAIDPRRIVALEWLPVELLLALGITPYGVA 60 M L ++RRRLLTAMALSPLLWQMNTA AAAIDP RIVALEWLPVELLLALGI PYGVA Sbjct: 1 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60 Query: 61 DVPNYKLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEKLARIAPGR 120 D NY+LWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPE LARIAPGR Sbjct: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR 120 Query: 121 GFDFSDGKKPLAVARRSLVELAQTLNLEAAAEKHLAQYDRFIASQKPHFIRRGGRPLLMT 180 GF+FSDGK+PLA+AR+SL E+A LNL++AAE HLAQY+ FI S KP F++RG RPLL+T Sbjct: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT 180 Query: 181 TLIDPRHMLVLGPNCLFQEVLDEYGIVNAWQGETNFWGSTAVSIDRLAMYKEADVICFDH 240 TLIDPRHMLV GPN LFQE+LDEYGI NAWQGETNFWGSTAVSIDRLA YK+ DV+CFDH Sbjct: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240 Query: 241 GNNTDMNALMATPLWQAMPFVRAGRFHRVPAVWFYGATLSTMHFVRILNNVLGGKA 296 N+ DM+ALMATPLWQAMPFVRAGRF RVPAVWFYGATLS MHFVR+L+N +GGKA Sbjct: 241 DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 32.9 bits (75), Expect = 0.006 Identities = 38/185 (20%), Positives = 58/185 (31%), Gaps = 31/185 (16%) Query: 529 ALREWQGDAPVV----FPEVSAAVVA--AIVADWTGIP--AGRMVKDEASQVLELPARLA 580 +++ + D PV+ A+ A D+ P ++ + E R + Sbjct: 68 RIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPS 127 Query: 581 QRVTGQDGALAQIGE--RIQTAR---AGLGDPRKPVGVFMLAGPSGVGKTETALALAEAI 635 + + +G +Q A L + M+ G SG GK A AL + Sbjct: 128 KLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL---MITGESGTGKELVARALHDYG 184 Query: 636 YGGEQNLVTINMSEFQEAHTVSTLKGAPPGYVGYGEGGVLTEAVRRHPWSV-------VL 688 V INM+ S L G E G T A R + Sbjct: 185 KRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTRSTGRFEQAEGGTLF 236 Query: 689 LDEIE 693 LDEI Sbjct: 237 LDEIG 241
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 70.3 bits (172), Expect = 2e-15 Identities = 36/128 (28%), Positives = 58/128 (45%), Gaps = 16/128 (12%) Query: 317 QHSRVVFRGDAMFVPGQKTVSDAIRPVINKAAREIARVG---GAVTVTGHTDSQPIHSAE 373 Q + D +F + T+ + +++ +++ + G+V V G+TD I S Sbjct: 211 QTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR--IGSDA 268 Query: 374 FPSNLVLSEKRAAEVAALLTSGGVPAGRVHIVGKGDTVPVADN---------GSKAGRAK 424 + N LSE+RA V L S G+PA ++ G G++ PV N A Sbjct: 269 Y--NQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAP 326 Query: 425 NRRVEILV 432 +RRVEI V Sbjct: 327 DRRVEIEV 334
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 830 bits (2145), Expect = 0.0 Identities = 310/872 (35%), Positives = 453/872 (51%), Gaps = 52/872 (5%) Query: 4 KQPALLLFIAGVVHCANAHT-------YTFDASML-GDAAKGVDMSLFNQG-VQQPGTYR 54 K F+ V CA A F+ L D D+S F G PGTYR Sbjct: 20 KHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYR 79 Query: 55 VDVMVNGKRVDTRDVVFKLEKDGQGTPVLAPCLTVSQLSRYGVKTEDYPQLWKAAKPPDE 114 VD+ +N + TRDV F QG + PCLT +QL+ G+ T + D Sbjct: 80 VDIYLNNGYMATRDVTFNTGDSEQG---IVPCLTRAQLASMGLNTASVSGMNLL--ADDA 134 Query: 115 CADLT-AIPQAKAVLDINNQQLQLSIPQLALRPEFKGIAPEDLWDDGIPAFLMNYSARTT 173 C LT I A A LD+ Q+L L+IPQ + +G P +LWD GI A L+NY+ Sbjct: 135 CVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGN 194 Query: 174 QTDYKMDMVGRDNSSWVQLQPGINIGAWRVRNATSWQR-----SSQLSGKWQAAYTYAER 228 ++ G + +++ LQ G+NIGAWR+R+ T+W SS KWQ T+ ER Sbjct: 195 SVQNRIG--GNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLER 252 Query: 229 GLYSLKSRLTLGQKTSQGEIFDSVPFTGVMLASDDNMVPYSERQFAPVVRGIARTQARVE 288 + L+SRLTLG +QG+IFD + F G LASDDNM+P S+R FAPV+ GIAR A+V Sbjct: 253 DIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVT 312 Query: 289 VKQNGYTIYNTTVAPGPFALRDLSVTDSSGDLHVTVWEADGSTQMFVVPYQTPAIALHQG 348 +KQNGY IYN+TV PGPF + D+ +SGDL VT+ EADGSTQ+F VPY + + +G Sbjct: 313 IKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREG 372 Query: 349 YLKYSLLAGRYRSSDSATDKAQIAQATLMYGLPWNLTAYGGIQSATHYQAALLGLGGSLG 408 + +YS+ AG YRS ++ +K + Q+TL++GLP T YGG Q A Y+A G+G ++G Sbjct: 373 HTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMG 432 Query: 409 RWGSLSVDGSDTHSQRQGEAVQQGASWRLRYSNQLTATGTNFFLTRWQYASQGYNTLSDV 468 G+LSVD + +S ++ G S R Y+ L +GTN L ++Y++ GY +D Sbjct: 433 ALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADT 492 Query: 469 LDSYRHNGNRL-------------WSWRENLQPSSRTTLMLSQSWGRHLGNLSLTGSRTD 515 S + N + + L ++Q GR L L+GS Sbjct: 493 TYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQT 551 Query: 516 WRNRPGHDDSYGLSWGTSIGGGSLSLNWNQNRTLWRNGAHRKENITSLWFSMPLSRWTGN 575 + D+ + T+ + +L+++ + W+ G ++ + +L ++P S W + Sbjct: 552 YWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKG---RDQMLALNVNIPFSHWLRS 608 Query: 576 -------NVSASWQMTSPSHGGQTQQVGVNGEAFSQ-QLDWEVRQSYRADAPPGGGNNSA 627 + SAS+ M+ +G T GV G L + V+ Y G+ Sbjct: 609 DSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGY 668 Query: 628 LHLAWNGDYGLLGGDYSYSRAMRQMGVNIAGGIVIHHHGVTLGQPLQGSVALVEAPGASG 687 L + G YG YS+S ++Q+ ++GG++ H +GVTLGQPL +V LV+APGA Sbjct: 669 ATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKD 728 Query: 688 VPVGGWPGVKTDFRGDTTVGNLNVYQENTVSLDPSRLPDDAEVTQTDVRVVPTEGAVVEA 747 V GV+TD+RG + Y+EN V+LD + L D+ ++ VVPT GA+V A Sbjct: 729 AKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRA 788 Query: 748 KFHTRIGARALMTLKREDGSAIPFGAQVTVNGQDGSAALVDTDSQVYLTGLADKGELTVK 807 +F R+G + LMTL + +PFGA VT + S+ +V + QVYL+G+ G++ VK Sbjct: 789 EFKARVGIKLLMTLTH-NNKPLPFGAMVT-SESSQSSGIVADNGQVYLSGMPLAGKVQVK 846 Query: 808 WGA---QQCRVNYRLPAHKGIAGLYQMSGLCR 836 WG C NY+LP L Q+S CR Sbjct: 847 WGEEENAHCVANYQLPPESQQQLLTQLSAECR 878
>PF05775#Enterobacteria AfaD invasin protein Length = 142 Score = 90.4 bits (224), Expect = 4e-26 Identities = 37/132 (28%), Positives = 66/132 (50%), Gaps = 2/132 (1%) Query: 14 SVSLLVAASSLVPIANAAEKLQTTLRVGTYFRAGHVPDGMVLAQGWVTYHGSHSGFRVWS 73 S+SL + L+ + + ++ TL Y + DG+ LA G + +HSGFRVW Sbjct: 4 SISLTLCGILLMLMGSFSQAADITLMNHKYM-GNLLHDGVKLATGRIICQDTHSGFRVWI 62 Query: 74 DEQKAGNTPTVLLLSGQQDPRHHIQVRLEGEGWQPDTVSGRGAILRTAADNAS-FSVVVD 132 + ++ G ++ + P+H++++R+ G GW G + T ++AS F + VD Sbjct: 63 NARQEGGGAGKYIVQSTEGPQHNLRIRISGNGWSSFVEKGIQGVFNTIKEDASIFYIEVD 122 Query: 133 GNQEVPADTWTL 144 GNQ+V + Sbjct: 123 GNQQVQPGKYLF 134
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 49.1 bits (117), Expect = 2e-08 Identities = 57/261 (21%), Positives = 92/261 (35%), Gaps = 24/261 (9%) Query: 70 FSGTLSDRFGRKPIIFYSLLAGGILTLLCATASSWPMLVVYRALLGIAVSGITAAVTVYI 129 G LSDRFGR+P++ SL + + ATA +L + R + GI +G T AV Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI--TGATGAVAGAY 119 Query: 130 SEEVSPA---------LAGIVTGYFIFGNSLGSMSGRVFATLMMEHVSIDTIFFIFGGVL 180 +++ ++ + G LG + G S FF + Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG---------FSPHAPFFAAAALN 170 Query: 181 IAMALAVKLFLPTSRQFVPTPSLQLGAVLKGGLEHFKNIR-VSLCFVIGFI--LFGSFTS 237 L LP S + P + + + V+ + FI L G + Sbjct: 171 GLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPA 230 Query: 238 IFNFLAFYLHRPPYELSYTWIGLIPVSFSLTFFLAPYAARVALNIGSMNALSMLIICMMV 297 ++ F R ++ + I L + A VA +G AL + +I Sbjct: 231 AL-WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGT 289 Query: 298 GAFLTLIAPSLWVFISGIVLL 318 G L A W+ +VLL Sbjct: 290 GYILLAFATRGWMAFPIMVLL 310
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 758 bits (1958), Expect = 0.0 Identities = 260/880 (29%), Positives = 412/880 (46%), Gaps = 63/880 (7%) Query: 4 TINLNRKS-LALLIAIVCSGSAQG----EEYYFDPALLQGATYGQ-NIARFNER-QTPSG 56 I +R + + + + C+ +AQ E YF+P L +++RF + P G Sbjct: 17 HIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPG 76 Query: 57 DYLADVYVNGTLVTSSTNIRFNAVKEGQQTEPCLPLSVMKAAQIKSLPATDAA----TEC 112 Y D+Y+N + + ++ FN Q PCL + + + + + + C Sbjct: 77 TYRVDIYLNNGYMAT-RDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDAC 135 Query: 113 RPLREWVPHAGWQFDSATLRLLLTIPMTELTHKPRGYISPSEWDSGALALFLRHNTNWTH 172 PL + A Q D RL LTIP ++++ RGYI P WD G A L +N + Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNS 195 Query: 173 TENTDSHYRYQYLWSGLNMGVNLGLWQVRHQSNLRYANSNQS-GSAWRYNSVRTWVQRPV 231 +N Y + L G+N+G W++R + Y +S+ S GS ++ + TW++R + Sbjct: 196 VQN-RIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDI 254 Query: 232 ASVNSILSLGDSYTDSSLFGSLSFNGAKLVTDERMRPQGKRGYAPEVRGVAASSAHVVVK 291 + S L+LGD YT +F ++F GA+L +D+ M P +RG+AP + G+A +A V +K Sbjct: 255 IPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIK 314 Query: 292 QLGKVIYETNVPPGPFYIDDLYNTRYQGDLEVEVIEASGKTSRFTVPYSSVPDSVRPGNW 351 Q G IY + VPPGPF I+D+Y GDL+V + EA G T FTVPYSSVP R G+ Sbjct: 315 QNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHT 374 Query: 352 HYSLAFGRVRQYY--DIENRFFEGTFQHGVNNTITLNLGSRIAQRYQAWLAGGVWATGM- 408 YS+ G R + RFF+ T HG+ T+ G+++A RY+A+ G G Sbjct: 375 RYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGAL 434 Query: 409 GAFGLNATWSNARAEHNDRQQGWRAELSYSKTFT-TGTNLVLAAYRYSTNGFRDLQDVLG 467 GA ++ T +N+ + + G Y+K+ +GTN+ L YRYST+G+ + D Sbjct: 435 GALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTY 494 Query: 468 VRREAKTGI-------------DYYSDTLHQRNRLSATVSQPLGRLGTLNLSASTADYYN 514 R DYY+ ++R +L TV+Q LGR TL LS S Y+ Sbjct: 495 SRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWG 554 Query: 515 NQSRITQLQMGYSNQWRNISYGVNIARQRTTWDYDRFYHGVNEPLDVSSRQKYTETTMSF 574 + Q Q G + + +I++ ++ + + W + ++ Sbjct: 555 TSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKG------------------RDQMLAL 596 Query: 575 NVSIPLDWGENRTSVA------MNYNQSSQSRSST---VSMTGSSGENSDLSWSVYGGYE 625 NV+IP S + +Y+ S + G+ E+++LS+SV GY Sbjct: 597 NVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYA 656 Query: 626 RYRNSNSDSSAPTTFGGNLQQNTRFGALRANYDQGDNYRQEGLGASGTLVLHPGGLTAGP 685 + NS S+ L +G Y D+ +Q G SG ++ H G+T G Sbjct: 657 GGGDGNSGSTG----YATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQ 712 Query: 686 YTSDTFALIHADGAQGAIVQNGQGAVVDRFGYAILPSLSPYRVNNVTLDTRKMRSDAELT 745 +DT L+ A GA+ A V+N G D GYA+LP + YR N V LDT + + +L Sbjct: 713 PLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLD 772 Query: 746 GGSQQIVPYAGAIARVNFATISGKAVLISVKMPDGGIPPMGADVFNGEGTNIGMVGQSGQ 805 +VP GAI R F G +L+++ + P GA V + + G+V +GQ Sbjct: 773 NAVANVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQ 831 Query: 806 IYARIAHPSGSLLVRWGTGANQRCRVAYQLDLHTKEPFLY 845 +Y +G + V+WG N C YQL +++ L Sbjct: 832 VYLSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLT 871
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 26.1 bits (57), Expect = 0.030 Identities = 17/73 (23%), Positives = 31/73 (42%), Gaps = 8/73 (10%) Query: 4 SKRTLLIIFVIFIAIVAIGSIIALEWVNRDSYRTNNLICTTKSEAYIAPRKLYMDGGLVL 63 S+R LL+I ++F++ + I W +D TT++ A D G+ Sbjct: 3 SQRNLLVIALLFVSFM-----IWQAW-EQDKNPQPQAQQTTQTTTTAAGSAA--DQGVPA 54 Query: 64 DLKSGRITLHYDV 76 + I++ DV Sbjct: 55 SGQGKLISVKTDV 67
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 134 bits (339), Expect = 7e-43 Identities = 59/183 (32%), Positives = 88/183 (48%), Gaps = 21/183 (11%) Query: 1 MKRRSSFLVFLGLLLASPLALANDQHTVSFGYAQTHLSSLKNSDSKDLRGFNFKYRYEFN 60 MK+ + +L + TV+ GYAQ+ N + GFN KYRYE + Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAATSTVTGGYAQSDAQGQMN----KMGGFNLKYRYEED 56 Query: 61 ET-WGMLGSFTATRNEMENYTWKEGKLHKNGSDSVDYGSLMFGPTYRFNDYVSLYGNAGI 119 + G++GSFT T + K Y + GP YR ND+ S+YG G+ Sbjct: 57 NSPLGVIGSFTYTEKSRTASSGDYNK--------NQYYGITAGPAYRINDWASIYGVVGV 108 Query: 120 ATMKF--------NKHSKEDSFAYGAGVIFNPVKSISIDASWEASRFFAVDTNTFGVSVG 171 KF + + F+YGAG+ FNP++++++D S+E SR +VD T+ VG Sbjct: 109 GYGKFQTTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVG 168 Query: 172 YRF 174 YRF Sbjct: 169 YRF 171
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1044 bits (2702), Expect = 0.0 Identities = 434/1040 (41%), Positives = 660/1040 (63%), Gaps = 19/1040 (1%) Query: 6 FFIARPIFAIVLSLLMLLAGAIAFLKLPLSEYPAVTPPTVQVSASYPGANPQVIADTVAA 65 FFI RPIFA VL++++++AGA+A L+LP+++YP + PP V VSA+YPGA+ Q + DTV Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLEQVINGVDGMLYMNTQMAIDGRMVISIAFEQGTDPDMAQIQVQNRVSRALPRLPEEVQ 125 +EQ +NG+D ++YM++ G + I++ F+ GTDPD+AQ+QVQN++ A P LP+EVQ Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 126 RIGVVTEKTSPDMLMVVHLVSPQKRYDSLYLSNFAIRQVRDELARLPGVGDVLVWGAGEY 185 + G+ EK+S LMV VS +S++ V+D L+RL GVGDV ++G +Y Sbjct: 124 QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-AQY 182 Query: 186 AMRVWLDPAKIANRGLTASDIVTALREQNVQVAAGSVGQQPEASA-AFQMTVNTLGRLTS 244 AMR+WLD + LT D++ L+ QN Q+AAG +G P ++ R + Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242 Query: 245 EEQFGEIVVKIGADGEVTRLRDVARVTLGADAYTLRSLLNGEAAPALQIIQSPGANAIDV 304 E+FG++ +++ +DG V RL+DVARV LG + Y + + +NG+ A L I + GANA+D Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302 Query: 305 SNAIRGKMDELQQNFPQDIEYRIAYDPTVFVRASLQSVAITLLEALVLVVLVVVMFLQTW 364 + AI+ K+ ELQ FPQ ++ YD T FV+ S+ V TL EA++LV LV+ +FLQ Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362 Query: 365 RASIIPLVAVPVSLVGTFALMHLFGFSLNTLSLFGLVLSIGIVVDDAIVVVENVERHISQ 424 RA++IP +AVPV L+GTFA++ FG+S+NTL++FG+VL+IG++VDDAIVVVENVER + + Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422 Query: 425 GKSPG-EAAKKAMDEVTGPILSITSVLTAVFIPSAFLAGLQGEFYRQFALTIAISTILSA 483 K P EA +K+M ++ G ++ I VL+AVFIP AF G G YRQF++TI + LS Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482 Query: 484 INSLTLSPALAAILLRPHHDTAKADWLTRLMGTVTGGFFHRFNRFFDSASNRYVSAVRRA 543 + +L L+PAL A LL+P ++ GGFF FN FD + N Y ++V + Sbjct: 483 LVALILTPALCATLLKP---------VSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533 Query: 544 VRGSVIVMVLYAGFVGLTWLGFHQVPNGFVPAQDKYYLVGIAQLPSGASLDRTEAVVKQM 603 + + +++YA V + F ++P+ F+P +D+ + + QLP+GA+ +RT+ V+ Q+ Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593 Query: 604 SAIALA--EPGVESVVVFPGLSVNGPVNVPNSALMFAMLKPFDEREDPSLSANAIAGKLM 661 + L + VESV G S +G N+ + F LKP++ER SA A+ + Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFSG--QAQNAGMAFVSLKPWEERNGDENSAEAVIHRAK 651 Query: 662 HKFSHIPDGFIGIFPPPPVPGLGATGGFKLQIEDRAELGFEAMTKVQSEIMSKAMQTP-E 720 + I DGF+ F P + LG GF ++ D+A LG +A+T+ +++++ A Q P Sbjct: 652 MELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPAS 711 Query: 721 LANMLASFQTNAPQLQVDIDRVKAKSMGVSLTDIFETLQINLGSLYVNDFNRFGRAWRVM 780 L ++ + + Q ++++D+ KA+++GVSL+DI +T+ LG YVNDF GR ++ Sbjct: 712 LVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLY 771 Query: 781 AQADAPFRMQQEDIGLLKVRNAKGEMIPLSAFVTIMRQSGPDRIIHYNGFPSVDISGGPA 840 QADA FRM ED+ L VR+A GEM+P SAF T G R+ YNG PS++I G A Sbjct: 772 VQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAA 831 Query: 841 PGFSSGQATDAIEKIVRETLPEGMVFEWTDLVYQEKQAGNSALAIFALAVLLAFLILAAQ 900 PG SSG A +E + + LP G+ ++WT + YQE+ +GN A A+ A++ ++ FL LAA Sbjct: 832 PGTSSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAAL 890 Query: 901 YNSWSLPFAVLLIAPMSLLSAIVGVWVSGGDNNIFTQIGFVVLVGLAAKNAILIVEFAR- 959 Y SWS+P +V+L+ P+ ++ ++ + N+++ +G + +GL+AKNAILIVEFA+ Sbjct: 891 YESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKD 950 Query: 960 AKEHDGADPLTAVLEASRLRLRPILMTSFAFIAGVVPLVLATGAGAEMRHAMGIAVFAGM 1019 E +G + A L A R+RLRPILMTS AFI GV+PL ++ GAG+ ++A+GI V GM Sbjct: 951 LMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGM 1010 Query: 1020 LGVTLFGLLLTPVFYVVVRR 1039 + TL + PVF+VV+RR Sbjct: 1011 VSATLLAIFFVPVFFVVIRR 1030 Score = 88.4 bits (219), Expect = 8e-20 Identities = 68/427 (15%), Positives = 143/427 (33%), Gaps = 36/427 (8%) Query: 643 FDEREDPSLSANAIAGKLMHKFSHIPDGFIGIFPPPPVPGLGATGGFKLQIEDRAELGFE 702 F DP ++ + KL +P + ++ + + ++ Sbjct: 94 FQSGTDPDIAQVQVQNKLQLATPLLPQEV----QQQGISVEKSSSSYLMVAGFVSDNP-- 147 Query: 703 AMTKVQSEIMSKAMQTPELANM--LASFQTNAPQLQVDI--DRVKAKSMGVSLTDIFETL 758 T+ + L+ + + Q Q + I D ++ D+ L Sbjct: 148 GTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQYAMRIWLDADLLNKYKLTPVDVINQL 207 Query: 759 QIN--------LGSLYVNDFNRFGRAWRVMAQADAPFRMQQEDIGLLKVR-NAKGEMIPL 809 ++ LG + + + P E+ G + +R N+ G ++ L Sbjct: 208 KVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP-----EEFGKVTLRVNSDGSVVRL 262 Query: 810 SAFVTIMRQSGPDRII-HYNGFPSVDISGGPAPGFSSGQATDAIEKIV---RETLPEGM- 864 + +I NG P+ + A G ++ AI+ + + P+GM Sbjct: 263 KDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMK 322 Query: 865 ---VFEWTDLVYQEKQAGNSALAIFALAVLLAFLILAAQYNSWSLPFAVLLIAPMSLLSA 921 ++ T V + + + + A++L FL++ + + P+ LL Sbjct: 323 VLYPYDTTPFV---QLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGT 379 Query: 922 IVGVWVSGGDNNIFTQIGFVVLVGLAAKNAILIVE-FARAKEHDGADPLTAVLEASRLRL 980 + G N T G V+ +GL +AI++VE R D P A ++ Sbjct: 380 FAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQ 439 Query: 981 RPILMTSFAFIAGVVPLVLATGAGAEMRHAMGIAVFAGMLGVTLFGLLLTPVFYVVVRRM 1040 ++ + A +P+ G+ + I + + M L L+LTP + + Sbjct: 440 GALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKP 499 Query: 1041 ALKRENR 1047 + Sbjct: 500 VSAEHHE 506
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 49.4 bits (118), Expect = 1e-08 Identities = 19/112 (16%), Positives = 37/112 (33%), Gaps = 7/112 (6%) Query: 74 ELRSRVGGTLDAISVPEGRLVSRGQLLFQIDPRPFEVALDTAVAQLRQAEVLARQAQADF 133 E++ + I V EG V +G +L ++ E + L QA + + Q Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157 Query: 134 DRIQR-------LVASGAVSRKNADDVTATRNARQAQMQSAKAAVAAARLEL 178 I+ L + ++V + + Q + + L L Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNL 209 Score = 34.4 bits (79), Expect = 7e-04 Identities = 20/106 (18%), Positives = 37/106 (34%), Gaps = 13/106 (12%) Query: 112 LDTAVAQLRQAEVLARQAQADFDRIQRLVASGAVSRKNADDVTATRNARQAQMQSAKAAV 171 L +QL Q E A+ ++ + +L + ++ + + Sbjct: 268 LRVYKSQLEQIESEILSAKEEYQLVTQLFKN---------EILDKLRQTTDNIGLLTLEL 318 Query: 172 AAARLELSWTRITAPIAGRVDRILVTRGNLVSGGVAGNATLLTTIV 217 A + I AP++ +V ++ V GGV A L IV Sbjct: 319 AKNEERQQASVIRAPVSVKVQQLKVHT----EGGVVTTAETLMVIV 360
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.043 Identities = 7/21 (33%), Positives = 12/21 (57%) Query: 46 VLALIGPSGSGKTTVLRAVAG 66 + L G G GK+T++ + G Sbjct: 598 SVVLEGTGGIGKSTLINTLVG 618
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 84.5 bits (209), Expect = 6e-20 Identities = 85/383 (22%), Positives = 155/383 (40%), Gaps = 26/383 (6%) Query: 11 GLGTVFSLRMLGMFMVLPVLTTY--GMALQGASEALIGIAIGIYGLAQAIFQIPFGLLSD 68 L TV L +G+ +++PVL + A GI + +Y L Q G LSD Sbjct: 10 ILSTVA-LDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSD 68 Query: 69 RIGRKPLIVGGLAVFVAGSVIAALSHSIWGIILGRALQG-SGAIAAAVMALLSDLTREQN 127 R GR+P+++ LA I A + +W + +GR + G +GA A A ++D+T Sbjct: 69 RFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDE 128 Query: 128 RTKAMAFIGVSFGITFAIAMVLGPIVTHSLGLNALFWMIAALATLGILLTIWVVPNSTNH 187 R + F+ FG VLG ++ +A F+ AAL L L +++P S Sbjct: 129 RARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPESHKG 187 Query: 188 VLNRESGMVKGSFSKVLAEPRLLKLNFGIMCLHILLMSTFVA-LPGQLADAGFPAAEHWK 246 + + G+ + L+ F+ L GQ+ A + + Sbjct: 188 ERRPLRREALNPLAS-------FRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDR 240 Query: 247 VYLATMVIAFA--------AVVPFIIYAEVKRRMKQVFLFCVGLI--VVAEIVLWGAGQH 296 + I + ++ +I V R+ + +G+I I+L A + Sbjct: 241 FHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRG 300 Query: 297 FWELVIGVQLFFLAFNL--MEALLPSLISKESPAGYKGTAMGVYSTSQFLGVALGGSLGG 354 + I V L + ++A+L + +E +G+ + S + +G L ++ Sbjct: 301 WMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360 Query: 355 WIDGTFDGQTVFLAGAVLAMVWL 377 T++G ++AGA L ++ L Sbjct: 361 ASITTWNG-WAWIAGAALYLLCL 382
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 41.0 bits (96), Expect = 8e-06 Identities = 40/190 (21%), Positives = 76/190 (40%), Gaps = 15/190 (7%) Query: 221 RNNAWLI-LLLIVLYKLGDAFAMSLTTTFLIRGVGFDAGEVGVVNKTLGLLATIVGALYG 279 R+N LI L ++ + + + ++++ + VN L +I A+YG Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 280 GILMQRLSLFRALLIFGILQGASNAGYWLLSITDKNMFSMGAAVFFENLCGGMGTAAFVA 339 L +L + R LL I+ + ++ + FS+ + G G AAF A Sbjct: 71 K-LSDQLGIKRLLLFGIIINCFGS----VIGFVGHSFFSL---LIMARFIQGAGAAAFPA 122 Query: 340 LLM----TLCNKSFSATQFALLSALSAVGRVYVGPVAGWFVEAH-GWPTFYLFSVVAAVP 394 L+M K F L+ ++ A+G VGP G + + W L ++ + Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMG-EGVGPAIGGMIAHYIHWSYLLLIPMITIIT 181 Query: 395 GLLLLLVCRQ 404 L+ + ++ Sbjct: 182 VPFLMKLLKK 191
>PF06291#Lambda prophage Bor protein Length = 102 Score = 26.9 bits (59), Expect = 0.023 Identities = 11/37 (29%), Positives = 18/37 (48%) Query: 18 NMLKKLLFPLVALFMLAGCATPPTTIDVAPKITLPQQ 54 N +KK+LF ++ GCA T+ P P++ Sbjct: 4 NKMKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKE 40
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 52.1 bits (125), Expect = 2e-09 Identities = 52/326 (15%), Positives = 113/326 (34%), Gaps = 24/326 (7%) Query: 62 AYCSMQIPC----GILVDKFGQKIMLMAGFTLFIIGTLCIAKANGLTMIYIGSLMAGGGC 117 Y MQ C G L D+FG++ +L+ + +A A L ++YIG ++AG Sbjct: 51 LYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITG 110 Query: 118 ASFFSSAYSLSSANVPQARRA----LANAIINSGSAIGMGIGLIGSSILVKNMSMAWQNV 173 A+ + A + + RA +A G G +G + Sbjct: 111 AT-GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPH--------A 161 Query: 174 LYIVAAILVIMLCVFTLIIRGKAKSDSAQAEKQTQTVTEDEKRAPLFSGLLCSVYFLYFC 233 + AA L + + + ++ + ++ R ++ ++ ++F Sbjct: 162 PFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFI 221 Query: 234 TCYGYYLIVTWLPSYLQTERGFDGGAIGLASALVAVVG-VPGALFFSHLSDKFR-NSKVK 291 + + + +D IG++ A ++ + A+ ++ + + Sbjct: 222 MQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM 281 Query: 292 VILGLEIVAAAMLAFTVLSPNTTMLMVSLTLYGLLGKMAVDPILISFVSEQASAKSLGRA 351 + + + +LAF +MV L G+ P L + +S Q + G+ Sbjct: 282 LGMIADGTGYILLAFATRGWMAFPIMVLLASGGI-----GMPALQAMLSRQVDEERQGQL 336 Query: 352 FSLFNFFGMSSAVVAPTLTGFISDVT 377 +++V P L I + Sbjct: 337 QGSLAALTSLTSIVGPLLFTAIYAAS 362
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 49.7 bits (119), Expect = 1e-08 Identities = 37/163 (22%), Positives = 57/163 (34%), Gaps = 32/163 (19%) Query: 4 DLIIKNGTVILENEARVIDIAVQGGKIAAIGEN------------LGEAKNVLDATGLIV 51 D +I N ++ DI ++ G+IAAIG+ +G V+ G IV Sbjct: 69 DTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIV 128 Query: 52 SPGMVDAHTHISEPGRTHWEGYETGTRAAAKGGITTMIEMPLNQLPATVDRET------- 104 + G +D+H H P + A G+T M+ PA T Sbjct: 129 TAGGMDSHIHFICPQQIE---------EALMSGLTCMLGGGTG--PAHGTLATTCTPGPW 177 Query: 105 -IELKFDAAKGKLTIDAAQLGGLVSYNLDRLHELDEVGVVGFK 146 I +AA ++ A G + L E+ G K Sbjct: 178 HIARMIEAADA-FPMNLAFAGKGNASLPGALVEMVLGGATSLK 219
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 359 bits (922), Expect = e-127 Identities = 126/310 (40%), Positives = 176/310 (56%), Gaps = 16/310 (5%) Query: 2 KTLVVALGGNALLQRGEALTAENQYRNIADAVPALARL-ARSYRLAIVHGNGPQVGLLAL 60 K +V+ALGGNAL QRG+ + E N+ +A + AR Y + I HGNGPQVG L L Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62 Query: 61 QNLAWKA---VEPYPLDVLVAESQGMIGYMLAQRLALEPDM----PPVTAVLTRIKVSAD 113 A +A + P+DV A SQG IGYM+ Q L E V ++T+ V + Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKN 122 Query: 114 DPAFLEPEKFIGPVYSPEEQMALEATYGWHMKRD-GKYLRRVVASPAPRQIIESAAIELL 172 DPAF P K +GP Y E L GW +K D G+ RRVV SP P+ +E+ I+ L Sbjct: 123 DPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKL 182 Query: 173 LKEGHVVICSGGGGVPVAGEG---EGVEAVIDKDLAAALLAEQIAADGLIILTDADAVYE 229 ++ G +VI SGGGGVPV E +GVEAVIDKDLA LAE++ AD +ILTD + Sbjct: 183 VERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAAL 242 Query: 230 HWGTPQQRAIRQASPDELAPFAKAD----GAMGPKVTAVSGYVKRCGKPAWIGALSRIDD 285 ++GT +++ +R+ +EL + + G+MGPKV A +++ G+ A I L + + Sbjct: 243 YYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVE 302 Query: 286 TLAGRAGTCI 295 L G+ GT + Sbjct: 303 ALEGKTGTQV 312
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 837 bits (2163), Expect = 0.0 Identities = 406/859 (47%), Positives = 562/859 (65%), Gaps = 19/859 (2%) Query: 18 LSSVALSVLAALCPLTSRGESYFNPAFLSADTASVADLSRFEKGNHQPPGIYRVDIWRND 77 + ++ A S E YFNP FL+ D +VADLSRFE G PPG YRVDI+ N+ Sbjct: 27 FVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNN 86 Query: 78 EFVATQDIRFEAGAVGTGDKSGGLMPCFTPEWIKRLGVNTAAFPVSDKGVDTTCIHLPEK 137 ++AT+D+ F G D G++PC T + +G+NTA+ + D C+ L Sbjct: 87 GYMATRDVTFNTG-----DSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSM 141 Query: 138 IPGAEVAFDFASMRLNISLPQASLLNSARGYIPPEEWDEGIPAALINYSFTGSR-----G 192 I A D RLN+++PQA + N ARGYIPPE WD GI A L+NY+F+G+ G Sbjct: 142 IHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIG 201 Query: 193 TDSDSYFLSLLSGLNYGPWRLRNNGAWNYSKGDG--YHSQRWNNIGTWVQRAIIPLKSEL 250 +S +L+L SGLN G WRLR+N W+Y+ D +W +I TW++R IIPL+S L Sbjct: 202 GNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRL 261 Query: 251 VMGDSNTGNDVFDSVGFRGARLYSSDNMYPDSLQGYAPTVRGIARTAAKLTIRQNGYVIY 310 +GD T D+FD + FRGA+L S DNM PDS +G+AP + GIAR A++TI+QNGY IY Sbjct: 262 TLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIY 321 Query: 311 QSYVSPGAFAITDLNPTSSSGDLEVTVDEKDGSQQRYTVPYSTVPLLQREGRVKYDLVAG 370 S V PG F I D+ +SGDL+VT+ E DGS Q +TVPYS+VPLLQREG +Y + AG Sbjct: 322 NSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAG 381 Query: 371 DFRSGNSQQSSPFFFQGTVIAGLPAGLTAYGGTQLADRYRAVVVGAGRNLGDWGAVSVDV 430 ++RSGN+QQ P FFQ T++ GLPAG T YGGTQLADRYRA G G+N+G GA+SVD+ Sbjct: 382 EYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDM 441 Query: 431 THARSQLADDSTHQGQSLRFLYAKSLNNYGTNFQLLGYRYSTRGFYTLDDVAYRSMEGYD 490 T A S L DDS H GQS+RFLY KSLN GTN QL+GYRYST G++ D Y M GY+ Sbjct: 442 TQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYN 501 Query: 491 YEYDSDGRRHKVPVAQSYHNLRYSKKGRFQVNISQNLGDYGSLYLSGSQQNYWNTADTNT 550 E DG P Y+NL Y+K+G+ Q+ ++Q LG +LYLSGS Q YW T++ + Sbjct: 502 IE-TQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDE 560 Query: 551 WYQLGYASGWQGISYSLSWSWSESVGSSGADRILAFNMSVPFSVLTGRRYARDTILDRTY 610 +Q G + ++ I+++LS+S +++ G D++LA N+++PFS R + Sbjct: 561 QFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFS--HWLRSDSKSQWRHAS 618 Query: 611 ATFNANRNRDGDNSWQTGVGGTLLEGRNLSYSVTQGRS----SSNGYSGSASASWQATYG 666 A+++ + + +G + GV GTLLE NLSYSV G + ++G +G A+ +++ YG Sbjct: 619 ASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYG 678 Query: 667 TLGVGYNYDRDQHDYNWQLSGGVVGHADGITFSQPLGDTNVLIKAPGAKGVRIENQTGVK 726 +GY++ D + +SGGV+ HA+G+T QPL DT VL+KAPGAK ++ENQTGV+ Sbjct: 679 NANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVR 738 Query: 727 TDWRGYAVMPYATVYRYNRVALDTNTMDNHTDVENNVSSVVPTEGALVRAAFDTRIGVRA 786 TDWRGYAV+PYAT YR NRVALDTNT+ ++ D++N V++VVPT GA+VRA F R+G++ Sbjct: 739 TDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKL 798 Query: 787 IITARLGGRPLPFGAIVRETASGITSMVGDDGQIYLSGLPLKGELFIQWGEGKNARCIAP 846 ++T +PLPFGA+V +S + +V D+GQ+YLSG+PL G++ ++WGE +NA C+A Sbjct: 799 LMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVAN 858 Query: 847 YALAEDSLKQAITIASATC 865 Y L +S +Q +T SA C Sbjct: 859 YQLPPESQQQLLTQLSAEC 877
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 70.2 bits (172), Expect = 2e-16 Identities = 29/122 (23%), Positives = 58/122 (47%), Gaps = 2/122 (1%) Query: 1 MKPASVIIMDEHPIVRMSIEVLLGKNSNIQVVLKTDDSRTAIEYLRTYPVDLVILDIELP 60 M A++++ D+ +R + L + V T ++ T ++ DLV+ D+ +P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYD--VRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GTDGFTLLKRIKSIQEHTRILFLSSKSEAFYAGRAIRAGANGFVSKRKDLNDIYNAVKMI 120 + F LL RIK + +L +S+++ A +A GA ++ K DL ++ + Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 121 LS 122 L+ Sbjct: 119 LA 120
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 24.3 bits (52), Expect = 0.033 Identities = 7/27 (25%), Positives = 11/27 (40%) Query: 7 HLAPNHLTEHARKIDDIFGDDVPNMSH 33 H L+E + + G+ VP S Sbjct: 114 HKELQDLSEEEKNSMNSRGEKVPFASR 140
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 36.0 bits (83), Expect = 3e-06 Identities = 8/45 (17%), Positives = 18/45 (40%), Gaps = 1/45 (2%) Query: 4 KRYPEEFKIEAVRQVVER-GHSVSSAATHLDITTHSLYARIKKYG 47 R E + + + + AA L + ++L +I++ G Sbjct: 430 DRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELG 474
>PF06580#Sensor histidine kinase Length = 349 Score = 31.0 bits (70), Expect = 6e-04 Identities = 15/89 (16%), Positives = 28/89 (31%), Gaps = 21/89 (23%) Query: 20 GKTITLRIKEADDQIHIIVENPGTPIAPEHLPLLFDRFYRVDPSRQRKGEGSGIGLAIVK 79 G I L+ + + + + VEN G+ E +G GL V+ Sbjct: 278 GGKILLKGTKDNGTVTLEVENTGSLALKN------------------TKESTGTGLQNVR 319 Query: 80 SIVTAHRGK---ISVTSDTRATRFILTFP 105 + G I ++ ++ P Sbjct: 320 ERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 368 bits (946), Expect = e-133 Identities = 234/234 (100%), Positives = 234/234 (100%) Query: 1 MLTSHFPLPFAGHRLHIVDFDASSFREHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA 60 MLTSHFPLPFAGHRLHIVDFDASSFREHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA Sbjct: 1 MLTSHFPLPFAGHRLHIVDFDASSFREHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA 60 Query: 61 LREVGVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATEL 120 LREVGVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATEL Sbjct: 61 LREVGVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATEL 120 Query: 121 APSIIDSDERQILQASLLPFPLALTLAFSAKESVYKAFSDRVTLPGFNSAKVTSLTATHI 180 APSIIDSDERQILQASLLPFPLALTLAFSAKESVYKAFSDRVTLPGFNSAKVTSLTATHI Sbjct: 121 APSIIDSDERQILQASLLPFPLALTLAFSAKESVYKAFSDRVTLPGFNSAKVTSLTATHI 180 Query: 181 SLHLLPAFAATMAERTVRTEWFQRDNSVITLVSAITRVPHDRSAPASILSAIPR 234 SLHLLPAFAATMAERTVRTEWFQRDNSVITLVSAITRVPHDRSAPASILSAIPR Sbjct: 181 SLHLLPAFAATMAERTVRTEWFQRDNSVITLVSAITRVPHDRSAPASILSAIPR 234
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 28.7 bits (64), Expect = 0.048 Identities = 69/397 (17%), Positives = 130/397 (32%), Gaps = 66/397 (16%) Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQVGLSVTLTGGAMFIGLMVGGVLADRYERKKVIL 86 F S+++ +L V++P T IG V G L+D+ K+++L Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 87 LARGTCGIGFIGLCVNALLPEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGRENLMQ 146 G + V ++A ++ G F +L ++ + +EN + Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL----VMVVVARYIPKENRGK 139 Query: 147 AGAITMLTVRLGSVISPMLGGILLASGGVAWNYGLAAAGTFITLLPLLTLPRLPVPPQPR 206 A + V +G + P +GG++ + W+Y L IT++ + L +L Sbjct: 140 AFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIP--MITIITVPFLMKLLKKEVRI 195 Query: 207 ------------------------------------------------ENPFIAL-LAAF 217 +PF+ L Sbjct: 196 KGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKN 255 Query: 218 RFLLASPLIGGIALLGGLVTMASAVRVLYPALAMSWQMSAAQIGLLYAAI-PLGAAIGAL 276 + L GGI T+A V ++ + Q+S A+IG + + I Sbjct: 256 IPFMIGVLCGGIIF----GTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311 Query: 277 TSGQLAHSVRPGLIMLVSTVG---SFLAVGLFAIMPVWIAGVICLALFGWLSAISSLLQY 333 G L P ++ + SFL W +I + + G LS +++ Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371 Query: 334 TLLQTQTPENMLGRMNGLWTAQNVTGDAIGAALLGGL 370 + + + M L + + G A++GGL Sbjct: 372 IVSSSLKQQEAGAGM-SLLNFTSFLSEGTGIAIVGGL 407
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 60.3 bits (146), Expect = 1e-12 Identities = 47/210 (22%), Positives = 82/210 (39%), Gaps = 21/210 (10%) Query: 105 EPNAETVAAQMPDLILISATGGDSALALYDQLSAIAPTLVINYDDKS-----WQSLLTQL 159 EPN E + P ++ SA G S + L+ IAP N+ D + LT++ Sbjct: 86 EPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPLAMARKSLTEM 141 Query: 160 GEITGQEKQAAARIAEFETQLTTVKQRIALPPQPVSALVYTPAAHSANLWTPESAQGKLL 219 ++ + A +A++E + ++K R L ++ P S ++L Sbjct: 142 ADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEIL 201 Query: 220 TQLGFTLATLPRGLQTSKSQGKRHDIIQLGGENLAAGLNGESLFLFAGDNKDVAALYANP 279 + G A + + + + LAA + + L ++KD+ AL A P Sbjct: 202 DEYGIPNAW--------QGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATP 253 Query: 280 LLAHLPAVQNKRVYALGTETFRLDYYSATL 309 L +P V+ R + F Y ATL Sbjct: 254 LWQAMPFVRAGRFQRVPAVWF----YGATL 279
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 425 bits (1095), Expect = e-154 Identities = 148/299 (49%), Positives = 192/299 (64%), Gaps = 18/299 (6%) Query: 1 MAIPKLQSYALPTALDIPTNKVNWAFEPERAALLIHDMQDYFVSFWGRNCPMMDQVIANI 60 MAIP +Q Y +PTA D+P NKV+W +P RA LLIHDMQ+YFV + + ++ ANI Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60 Query: 61 AALRQYCKEHHIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVEALTPDEADTV 120 L+ C + IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P++ D V Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120 Query: 121 LVKWRYSAFHRSPLEQMLKDTGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180 L KWRYSAF R+ L +M++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180 Query: 181 FSREEHLMALNYVAGRSGRVVMTESLL------PTPVPASKA-----------ALRALIL 223 FS E+H MAL Y AGR VMT+SLL P V + A +R I Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240 Query: 224 PLLDETDEPLD-DENLIDYGLDSVRMMGLAARWRKVHGDIDFVMLAKNPTIDAWWALLS 281 LL ET E + E+L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W LL+ Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 338 bits (867), Expect = e-120 Identities = 105/257 (40%), Positives = 148/257 (57%), Gaps = 20/257 (7%) Query: 9 KTVWVTGAGKGIGYATALAFVDAGARVIGFDRE---------------FTQENYPFATEV 53 K ++TGA +GIG A A GA + D E +P Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP----- 63 Query: 54 MDVADAAQVAQVCQRVLQKTTRLDVLVNAAGILRMGATDALSVDDWQQTFAVNVGGAFNL 113 DV D+A + ++ R+ ++ +D+LVN AG+LR G +LS ++W+ TF+VN G FN Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 114 FSQTMAQFRRQQGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALTVGLELAGCGVRCN 173 ++ G+IVTV S+ A PR M+AY +SKAA +GLELA +RCN Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 174 VVSPGSTDTDMQRTLWVSEDAEQQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDLA 233 +VSPGST+TDMQ +LW E+ +Q I+G E FK GIPL K+A+P +IA+ +LFL S A Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243 Query: 234 SHITLQDIVVDGGSTLG 250 HIT+ ++ VDGG+TLG Sbjct: 244 GHITMHNLCVDGGATLG 260
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 91.8 bits (228), Expect = 1e-23 Identities = 37/123 (30%), Positives = 58/123 (47%), Gaps = 1/123 (0%) Query: 2 TNVLIVEDEQAIRRFLRAALEGDGLRVYEAETLQRGLLEAATRKPDLIILDLGLPDGDGI 61 +L+ +D+ AIR L AL G V A DL++ D+ +PD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 DFIRDLRQWSA-IPVIVLSARSEESDKIAALDAGADDYLSKPFGIGELQARLRVALRRHA 120 D + +++ +PV+V+SA++ I A + GA DYL KPF + EL + AL Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 121 ASP 123 P Sbjct: 124 RRP 126
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 30.4 bits (68), Expect = 0.007 Identities = 16/87 (18%), Positives = 26/87 (29%), Gaps = 8/87 (9%) Query: 20 LTLVSSANIACGFHAGDAQTMLT---CVREALKNGVAIGAHPSFPDRDN--FGRT--AMV 72 + + IA G G T+LT V + A+ A PS ++DN G + Sbjct: 95 VEAPTGTFIASGVVVGK-DTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQI 153 Query: 73 LPPETVYAQTLYQIGALGAIVQAQGGV 99 + + V Sbjct: 154 TKYSGEGDLAIVKFSPNEQNKHIGEVV 180
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.8 bits (69), Expect = 0.006 Identities = 11/33 (33%), Positives = 16/33 (48%), Gaps = 6/33 (18%) Query: 50 KIDFTLTEGNRLALIGHNGSGKTTLLRVLAGAY 82 K D+++ L G G GK+TL+ L G Sbjct: 594 KFDYSVV------LEGTGGIGKSTLINTLVGLD 620
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 31.0 bits (70), Expect = 0.004 Identities = 12/41 (29%), Positives = 23/41 (56%), Gaps = 1/41 (2%) Query: 15 VDNAPRMQDYTLEGEEGRDM-MLLDALIQLKEKDPSLSFRR 54 ++N + T+E + + MLLDAL+++ + DP L + Sbjct: 339 IENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 60.5 bits (146), Expect = 6e-12 Identities = 29/198 (14%), Positives = 67/198 (33%), Gaps = 6/198 (3%) Query: 64 YNRQQDQQASARRAEEERKKLQQQQAEELQQKQAAEQERLKQLEKERLAAQEQQKQAEEA 123 YN + +++ Q E R+ + A + E Sbjct: 981 YNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETV 1040 Query: 124 AKLAQQQQQQAEEAAKAAADAKKKAEAEAAKAAADAKKKAEAEAVKAAADAKKKAEAEAA 183 A+ ++Q+ + E+ + A + + A +A ++ K + V A+ +E + Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEV-----AQSGSETKET 1095 Query: 184 KAAADAKKKAEAEAAKAAAEAKKKAEAEAAKAAAEAKKKADAEAAKAAAEAKKKADAAAA 243 + + + KA E +K E + + K+ +E + AE ++ D Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQE-VPKVTSQVSPKQEQSETVQPQAEPARENDPTVN 1154 Query: 244 KAAAEAKKKADAAAAKAA 261 +++ A + A Sbjct: 1155 IKEPQSQTNTTADTEQPA 1172 Score = 55.1 bits (132), Expect = 4e-10 Identities = 39/254 (15%), Positives = 80/254 (31%), Gaps = 11/254 (4%) Query: 55 VDPGAVVQQYNRQQDQQASARRAEEERKKLQQQQAEELQQKQAAEQERLKQLEKERLAAQ 114 VD + N Q D S EE ++ + +E E + ++ Sbjct: 992 VDTTNITTPNNIQADVP-SVPSNNEEIARVDEAPVPPPAPATPSETTETVA-ENSKQESK 1049 Query: 115 EQQKQAEEAAKLAQQQQQQAEEAAKAAADAKKKAEAEAAKAAADAKKKAEAEAVKAAADA 174 +K ++A + Q ++ A+EA + E A++ ++ K+ E Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQ--TNEVAQSGSETKETQTTE-------T 1100 Query: 175 KKKAEAEAAKAAADAKKKAEAEAAKAAAEAKKKAEAEAAKAAAEAKKKADAEAAKAAAEA 234 K+ A E + A +K + + + K+ ++E + AE ++ D ++ Sbjct: 1101 KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQS 1160 Query: 235 KKKADAAAAKAAAEAKKKADAAAAKAAADAKKKAAAEKAAAAEGVDDLLGDLSSGKNAPK 294 + A + A E + ++ + E S N PK Sbjct: 1161 QTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPK 1220 Query: 295 TGGGAKGNGQPSKD 308 P Sbjct: 1221 NRHRRSVRSVPHNV 1234 Score = 52.0 bits (124), Expect = 3e-09 Identities = 28/239 (11%), Positives = 79/239 (33%), Gaps = 27/239 (11%) Query: 66 RQQDQQASARRAEEERKKLQQQQAEE--LQQKQAAEQER------LKQLEKERLAAQEQQ 117 + A + E + + +Q A E Q ++ A++ + + E + ++ ++ Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094 Query: 118 KQ-----------AEEAAKLAQQQQQQAEEAAKAAADAKKKAEAEAAKAAADAKKKAEAE 166 Q EE AK+ ++ Q E + + K+ ++E + A+ ++ + Sbjct: 1095 TQTTETKETATVEKEEKAKVETEKTQ--EVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152 Query: 167 AVKAAADAKKKAEAEAAKAAADAKKKAEAEAAKAAAEAKKKAEAEAAKAAAEAKKKADAE 226 ++ A+ + A + E ++ + E + A + Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212 Query: 227 AAKAAAEAKKKADAAAAK------AAAEAKKKADAAAAKAAADAKKKAAAEKAAAAEGV 279 + + + + + A + ++ A + ++ A A+ V Sbjct: 1213 SESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFV 1271 Score = 45.4 bits (107), Expect = 3e-07 Identities = 25/234 (10%), Positives = 65/234 (27%), Gaps = 6/234 (2%) Query: 59 AVVQQYNRQQDQQASARRAEEERKKLQQQQAEELQQKQAAEQERLKQLEKERLAAQEQQK 118 A Q Q + E K+ + EE + + + + E ++ +Q K Sbjct: 1078 ANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQ-----EVPKVTSQVSPK 1132 Query: 119 QAEEAAKLAQQQQQQAEEAAKAAADAKKKAEAEAAKAAADAKKKAEAEAVKAAADAKKKA 178 Q + Q + + + + + + A + + E + Sbjct: 1133 QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTG 1192 Query: 179 EAEAAKAAADAKKKAEAEAAKAAAEAKKKAEAEAAKAAAEAKKKADAEAAKAAAEAKKKA 238 + + ++ K + ++ + A + + A Sbjct: 1193 NSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDL 1252 Query: 239 DAAAAKAA-AEAKKKADAAAAKAAADAKKKAAAEKAAAAEGVDDLLGDLSSGKN 291 + A ++A+ KA A + + + + + + S KN Sbjct: 1253 TSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQYNVWVSNTSMNKN 1306
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 115 bits (289), Expect = 2e-33 Identities = 36/119 (30%), Positives = 55/119 (46%), Gaps = 4/119 (3%) Query: 56 EEQARLQMQQLQQNNIVYFDLDKYDIRSDFAAMLDAHANFLRSN--PSYKVTVEGHADER 113 +Q + + V F+ +K ++ + A LD + L + V V G+ D Sbjct: 205 APAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI 264 Query: 114 GTPEYNISLGERRANAVKMYLQGKGVSADQISIVSYGKEKPAVLGHDEAAYAKNRRAVL 172 G+ YN L ERRA +V YL KG+ AD+IS G+ P V G+ K R A++ Sbjct: 265 GSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP-VTGN-TCDNVKQRAALI 321
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.0 bits (65), Expect = 0.021 Identities = 2/39 (5%), Positives = 19/39 (48%) Query: 56 LTQLQQQLSDNQSDIDSLRGQIQENQYQLNQVMERQKQI 94 + + + + + +++ + Q+++ + ++ E + + Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV 292
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 30.1 bits (68), Expect = 0.020 Identities = 14/51 (27%), Positives = 24/51 (47%), Gaps = 9/51 (17%) Query: 37 VREGHICDI----VPETQLPV-----SGDNIHDMQGRLVTPGLIDCHTHLV 78 +++G I I P+ Q V G + +G++VT G +D H H + Sbjct: 90 LKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFI 140
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 47.1 bits (112), Expect = 5e-08 Identities = 49/207 (23%), Positives = 78/207 (37%), Gaps = 25/207 (12%) Query: 1 MTQYASSLRSLAAGSVLLFLFASPVKAEEQTIAPPGVDAR-AWILMDYASGKVLAEGNAD 59 M + SL A ++ L + ASP E+ ++ + R I MD ASG+ L AD Sbjct: 1 MRYIRLCIISLLA-TLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRAD 59 Query: 60 EKLDPASLTKIMTSYVVGQALKAGKIKLTDMVTVGKDAWATGNPALRGSSVMFLKPGDQV 119 E+ S K++ V + AG +L + + +P V D + Sbjct: 60 ERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSP------VSEKHLADGM 113 Query: 120 SVADLNKGIIIQSGNDACIALADYVAGSQESFIGLMNAYAKRLGLTNTT---FQTVHGLD 176 +V +L I S N A L V G + A+ +++G T ++T Sbjct: 114 TVGELCAAAITMSDNSAANLLLATVGGPAG-----LTAFLRQIGDNVTRLDRWETELNEA 168 Query: 177 APGQF---STARDMA------LLGKAL 194 PG +T MA L + L Sbjct: 169 LPGDARDTTTPASMAATLRKLLTSQRL 195
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 42.2 bits (99), Expect = 3e-06 Identities = 66/356 (18%), Positives = 127/356 (35%), Gaps = 51/356 (14%) Query: 48 QAGLDWVPTSMTAYLAGGMFLQWLLGPLSDRIGRRPVMLAGVVWFIVTCLATLLAKNIEQ 107 A +WV T+ + G + G LSD++G + ++L G++ + + + Sbjct: 48 PASTNWVNTAFMLTFSIGTAV---YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFS 104 Query: 108 FT-FLRFLQGISLCFIGAVGYAAIQESFEEAVCIKITALMANVALIAPLLGPLVGAAWVH 166 RF+QG A+ + + K L+ ++ + +GP +G H Sbjct: 105 LLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAH 164 Query: 167 VLPWEGMFILFAALAAIAFFGLQRAMTETATRRGE------------------------- 201 + W +L + I L + + + +G Sbjct: 165 YIHW-SYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSI 223 Query: 202 ------TLSFKALGRDYRLV---------IKNRRFVAGALALGFVSLPLLAWIAQSPIII 246 LSF + R V KN F+ G L G + + +++ P ++ Sbjct: 224 SFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMM 283 Query: 247 ISGEQLSSYEYG-LLQVPVFGALIAGNLVLARLTSRRTVRSLIVMGGWPIVAGLIIAAAA 305 QLS+ E G ++ P ++I + L RR ++ +G + + A+ Sbjct: 284 KDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTAS-- 341 Query: 306 TVVSSHAYLWMTAGLSVYAFGIGLANAGLVRLTLFSSDMSKGTVSAAMGMLQMLIF 361 + +MT + V+ G GL+ V T+ SS + + A M +L F Sbjct: 342 -FLLETTSWFMTIII-VFVLG-GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSF 394
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.3 bits (76), Expect = 0.002 Identities = 33/150 (22%), Positives = 65/150 (43%), Gaps = 6/150 (4%) Query: 218 LLIGVVVLAMAFAEGSANDWL-PLLMVDGHGFSP-TSGSLIYAGFTLGMTVGRFTGGWFI 275 +IGV+ + F + + P +M D H S GS+I T+ + + + GG + Sbjct: 258 FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILV 317 Query: 276 DRYSRVTVVR-ASALM--GALGIGLIIFVDSDWVA-GVSVILWGLGASLGFPLTISAASD 331 DR + V+ + L ++ S ++ + +L GL + TI ++S Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSL 377 Query: 332 TGPDAPTRVSVVATTGYLAFLVGPPLLGYL 361 +A +S++ T +L+ G ++G L Sbjct: 378 KQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407 Score = 28.7 bits (64), Expect = 0.042 Identities = 32/135 (23%), Positives = 56/135 (41%), Gaps = 11/135 (8%) Query: 38 IRDILSVSTAEMGAVLFGLSIGSMSGILCS---AWLVKRFGTRKVIRTTMTCAVTGMVIL 94 ++D+ +STAE+G+V+ + G+MS I+ LV R G V+ +T + Sbjct: 283 MKDVHQLSTAEIGSVI--IFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTA 340 Query: 95 SVALWCASPLIFALGLAVFGASFGAAEVAINVEGAAVERELNKTVLPMMHGFYSFGTLAG 154 S L S + + + V G + I+ V L + +F + Sbjct: 341 SFLLETTSWFMTIIIVFVLG-GLSFTKTVIS---TIVSSSLKQQEAGAGMSLLNFTSFLS 396 Query: 155 AGVGMALTA--LSVP 167 G G+A+ LS+P Sbjct: 397 EGTGIAIVGGLLSIP 411
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 47.3 bits (112), Expect = 6e-09 Identities = 17/80 (21%), Positives = 33/80 (41%) Query: 7 RRANDPKRREKIIQATLEAVKTYGVHAVTHRKIAAIAQVPLGSMTYYFAGMDALLSEAFT 66 + + R+ I+ L GV + + +IA A V G++ ++F L SE + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64 Query: 67 LFTENMSRQYQDFFAQVTDA 86 L N+ ++ A+ Sbjct: 65 LSESNIGELELEYQAKFPGD 84
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 31.3 bits (71), Expect = 0.011 Identities = 21/106 (19%), Positives = 34/106 (32%), Gaps = 6/106 (5%) Query: 394 LMIGMITFQFSNFSFGIGNAAGLLFAGIML-GFLRANHPTFG-YIPQ--GALNMVKEFGL 449 L++ + +L+ G ++ G A G YI + FG Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF 135 Query: 450 MVFMAGVGLSAGSGISNGLGAVGGQM--LIAGLVVSLVPVVICFLF 493 M G G+ AG + +G A + L + CFL Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 30.5 bits (68), Expect = 0.004 Identities = 34/119 (28%), Positives = 50/119 (42%), Gaps = 31/119 (26%) Query: 81 FDAVMAG--MDITPEREKQVLFTTPYYDNSALFVGQQGKYTSVDQLKGKKVGVQNGTTHQ 138 D+V+A M + E +QV+ TP DNSAL + QL Q Sbjct: 112 LDSVIASRLMQMALEAARQVIGQTPTVDNSALI-------KQIQQL-----------LQQ 153 Query: 139 KFIMDKHPEITTVPYDSYQNAKLDLQNGRIDAVFGDTAVVTEW-LKANPKLAPVGDKVT 196 + + P++ P DLQ R+D + G T + W L+ +P L P G KV+ Sbjct: 154 EPLFSGKPQLRVHPD--------DLQ--RVDDMLGATLSLHGWRLRGDPTLHPGGCKVS 202
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.007 Identities = 16/50 (32%), Positives = 22/50 (44%), Gaps = 1/50 (2%) Query: 31 LVLLGPSGAGKSSLLRVLNLLEMPRSGTLTIAGNHFDFTKTPSDKAIREL 80 +VL G G GKS+L+ L L+ S T G D + + EL Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDF-FSDTHFDIGTGKDSYEQIAGIVAYEL 647
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 36.0 bits (83), Expect = 9e-04 Identities = 17/170 (10%), Positives = 50/170 (29%), Gaps = 25/170 (14%) Query: 639 FKDPTPPKTAATRNDEASRLLLR-----YSQQQAQVEGQIAAAKQSSGLNTDRMTEAHKQ 693 D + + L++ + Q+ Q E + + R+ Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229 Query: 694 LLALQQRISDLSGKKLTADEKSVLAHKTELVQALTLLDLKQQELQRQSALNDLKKKAVQL 753 + R+ D L ++++ H +L+ + + ++ + L K + Q+ Sbjct: 230 SRVEKSRLDDF--SSL-LHKQAIAKHA--------VLEQENKYVEAVNELRVYKSQLEQI 278 Query: 754 ESQLKLEEQSQRQRHDLDLATAGMGDVQKQRYQEQLTLRQKFAQQQEQLE 803 ES++ ++ + + + + +L Sbjct: 279 ESEILSAKEEYQLVT---------QLFKNEILDKLRQTTDNIGLLTLELA 319
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 141 bits (358), Expect = 1e-45 Identities = 61/175 (34%), Positives = 90/175 (51%), Gaps = 10/175 (5%) Query: 2 KQVLIAVLIVSVTGGLAQAAGVKNTVSLGYAYTDLGGALSGNARGLNLKYNAEDLNSGFG 61 K ++ L + + +TV+ GYA +D G ++ G NLKY E+ NS G Sbjct: 3 KIACLSALAAVLAFTAGTSVAATSTVTGGYAQSDAQGQMN-KMGGFNLKYRYEEDNSPLG 61 Query: 62 GMGSFTFTTADIQSYSGGKAGDMDYSSFLVGPSYRFNEYLNAYVMAGLGHGHIDDKR--- 118 +GSFT+T + S G Y GP+YR N++ + Y + G+G+G Sbjct: 62 VIGSFTYTEKSRTA-SSGDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQTTEYPT 120 Query: 119 -DNSGKKTGFAYGAGVQINPVENIAINASYEYSRFSAYDSKVNAGTWVLGVGYSF 172 + GF+YGAG+Q NP+EN+A++ SYE SR + D GTW+ GVGY F Sbjct: 121 YKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVD----VGTWIAGVGYRF 171
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 38.1 bits (88), Expect = 1e-04 Identities = 54/231 (23%), Positives = 85/231 (36%), Gaps = 11/231 (4%) Query: 126 AAGQASEQAQTSAGQAAESATAAASAAGAADASATQAASSAASAESSAGTA------TTK 179 + G + S AA ATA S A A QAA + A+AE+ A T + Sbjct: 34 SGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQR 93 Query: 180 AGEASASAASADTARTAAAASEAAAKTSEANADASRTAAGDSAAAAAASATAAQTSAERA 239 + A + +RT +A A A + A+ R + A A AA+ + + A Sbjct: 94 LKDIVNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEA 153 Query: 240 G--ASETAAKTSETQAASSAGDAGASATAAAASEKAAAASAAEAKTSETNAATSASTAAA 297 E + +ET+ +A AA SE+A A A+ K S + Sbjct: 154 EQRRKEIEREKAETERQLKLAEA-EEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGEI 212 Query: 298 SATAASSSASEASTHAAASDTSASLA--AQSRAAAGESATRAEEAAKRAED 346 + S+S + A + AQ+ A E ++ + RA D Sbjct: 213 KTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRAND 263 Score = 31.2 bits (70), Expect = 0.018 Identities = 48/262 (18%), Positives = 84/262 (32%), Gaps = 12/262 (4%) Query: 104 RRFEAMVEEVARQASEASRNATAAGQASEQAQTSAGQAAESATAAASAAGAADASATQAA 163 +R + +V E R + + +AT A+ A QA + A A A A A Sbjct: 92 QRLKDIVNEALRHNASRTPSATELAHANNAAM----QAEDERLRLAKAEEKARKEAEAAE 147 Query: 164 SSAASAESSAG-TATTKAGEASASAASADTARTAAAASEAA-------AKTSEANADASR 215 + AE KA + + AA SE A K S A ++ + Sbjct: 148 KAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVK 207 Query: 216 TAAGDSAAAAAASATAAQTSAERAGASETAAKTSETQAASSAGDAGASATAAAASEKAAA 275 + S++ AE + + ++ A D + A++ Sbjct: 208 MDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRANDPLQN 267 Query: 276 ASAAEAKTSETNAATSASTAAASATAASSSASEASTHAAASDTSASLAAQSRAAAGESAT 335 EA A TA+ + + + + S + +R A Sbjct: 268 RPFFEATRRRVGAGKIREEKQKQVTASETRINRINADITQIQKAISQVSNNRNAGIARVH 327 Query: 336 RAEEAAKRAEDIADVISLEDAS 357 AEE K+A++ ++DA Sbjct: 328 EAEENLKKAQNNLLNSQIKDAV 349
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 66.3 bits (162), Expect = 2e-14 Identities = 69/370 (18%), Positives = 122/370 (32%), Gaps = 71/370 (19%) Query: 1 MKVLVTGATSGLGRNAVEFLRNKGISVRA---------TGRNEAMGKLLEKMGAEFVHAD 51 MK LVTGA +G + + L G V +A +LL + G +F D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 52 LTELVSSQAKVMLAGIDTLWHCS-------SFTSPWGTQQAFDLANVRATRRLGEWAVAW 104 L + + ++ S +P A+ +N+ + E Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116 Query: 105 GVRNFIHISSPSLYFDYHHHRDIKEDFRPHRFANEFARSKAAGEEVINLLAQANPQT--- 161 +++ ++ SS S+Y + D + +A +K A E L+A Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANE----LMAHTYSHLYGL 171 Query: 162 RFTVLRPQSLFGPHDK--VFIPRLAHMMHHYGSVLLPHGGSALVDMTYYENAIHAMWLAS 219 T LR +++GP + + + + M S+ + + G D TY ++ A+ Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231 Query: 220 QPGCDHLPS--------------GRAYNITNGENRTLRSIVQKLIDELTIDCRIRSVPYP 265 R YNI N L +Q L D L I+ + +P Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ 291 Query: 266 MLDMIARSMERFGKKSAKEPPLTHYGVSKLNFDFTLDTTRAQDELGYQPIVTLDEGIERT 325 D+ T DT + +G+ P T+ +G++ Sbjct: 292 PGDV----------------LETS-----------ADTKALYEVIGFTPETTVKDGVKNF 324 Query: 326 AAWLRDHGNL 335 W RD + Sbjct: 325 VNWYRDFYKV 334
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 55.2 bits (133), Expect = 2e-10 Identities = 30/125 (24%), Positives = 49/125 (39%), Gaps = 17/125 (13%) Query: 4 RILVLGASGYIGQHLVFALSQQGHQVRA---------AARRIERLEKQRLANVSCHKVDL 54 + LV GA+G+IG H+ L + GHQV + + RLE HK+DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 55 HWPENLPALLRD--IDTVYYLVH------GMGEGGDFIAHERQAALNVRDALRQTPVKQL 106 E + L + V+ H + + LN+ + R ++ L Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121 Query: 107 IFLSS 111 ++ SS Sbjct: 122 LYASS 126
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 52.8 bits (126), Expect = 1e-08 Identities = 43/276 (15%), Positives = 80/276 (28%), Gaps = 47/276 (17%) Query: 574 PQLPRPNRVR-----VPTRRELASYGIKLPSQRIAE-------EKAREAERNQYETGAQL 621 + PN ++ VP+ E + + P A E E + + +T + Sbjct: 995 TNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN 1054 Query: 622 TDEEIDAMHQDELARQFAQSQQHRYGETYQHDTQQAEDDDTAAEAELARQFAASQQQRYS 681 + + Q R+ A+ + + +TQ E + +E + + + Sbjct: 1055 EQDATETTAQ---NREVAKEAK----SNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107 Query: 682 GEQPAGAQPFSLDDLDFSPMKVLVDEGPHEPLFTPGVMPESTPVQQPVAPQPQPQYQQPQ 741 E+ A KV ++ P T V P+ +Q QPQ + + Sbjct: 1108 KEEKA---------------KVETEKTQEVPKVTSQVSPKQ---EQSETVQPQAEPAREN 1149 Query: 742 QPV--APQPQYQQPQQPVAPQPQYQQPQQPVAP----QPQYQQPQQPVAPQPQYQQPQQP 795 P +PQ Q QP + P P+ QP Sbjct: 1150 DPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQP 1209 Query: 796 VAPQPQYQQPQ----QPVAPQPQYQQPQQPVAPQPQ 827 +P+ + V P +P + Sbjct: 1210 TVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRS 1245 Score = 38.1 bits (88), Expect = 3e-04 Identities = 23/188 (12%), Positives = 46/188 (24%), Gaps = 27/188 (14%) Query: 720 PESTPVQQPVAPQPQPQYQ-----QPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQP 774 + PV P P + Q+ + Q + A + + + Sbjct: 1020 VDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKAN 1079 Query: 775 QYQQPQQPVAPQPQYQQPQQPVAPQ------------------PQYQQPQQPVAPQPQYQ 816 + + Q + P+ P Q + Sbjct: 1080 TQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETV 1139 Query: 817 QPQQPVAPQPQ----YQQPQQPTAPQDSLIHPLLMRNGDSRPLQRPTTPLPSLDLLTPPP 872 QPQ A + ++PQ T P + + +T + + + + P Sbjct: 1140 QPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP 1199 Query: 873 SEVEPVDT 880 P T Sbjct: 1200 ENTTPATT 1207 Score = 37.4 bits (86), Expect = 4e-04 Identities = 44/307 (14%), Positives = 78/307 (25%), Gaps = 57/307 (18%) Query: 296 RATQPEYDEYDPLLNGHSVTEPVAAAAAATAVTQTWAASADP--IMQTPPMPGAEPVVAQ 353 PE ++ + ++ ++T P A +V A PP P + Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038 Query: 354 PTVEWQP--------------VPGPQTGE------PVIAPAPEGYQPHPQYAQPQEAQSA 393 E Q E + + + ++ +E Q+ Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098 Query: 394 PWQQPVPVASAPQYAATPATAAEYDSLA---------PQETQPQWQPEPTHQPT------ 438 ++ V + E + + QPQ +P + PT Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP 1158 Query: 439 ------PVYQPEPIAAEPSHMPPPVIE-QPVATEPEP-------DTEETRPARPPLYYFE 484 +P S++ PV E V T T+P + Sbjct: 1159 QSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNK 1218 Query: 485 EVEEKRAREREQLAAWYQPIPEPVKENVPVKPTVSVAPSIPPVEAVAAAASLDAGIKSGA 544 R R EP + + TV++ A + A + A Sbjct: 1219 PKNRHRRSVRSVPHN-----VEPATTSSNDRSTVALCDLT-STNTNAVLSDARAKAQFVA 1272 Query: 545 LAAGAAA 551 L G A Sbjct: 1273 LNVGKAV 1279 Score = 37.4 bits (86), Expect = 5e-04 Identities = 60/341 (17%), Positives = 101/341 (29%), Gaps = 32/341 (9%) Query: 365 QTGEPVIAPAPEGYQP-HPQYAQPQEAQSAPWQQPVPVASAPQYAATPATAAEYDSLAPQ 423 QT + P Q P E + + PVP P ATP+ E + + Sbjct: 990 QTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVP----PPAPATPSETTETVAENSK 1045 Query: 424 ETQPQWQPEPTHQPTPVYQPEPIAAEPSHMPPPVIEQPVATEPEPDTEETRPARPPLYYF 483 + + Q +A E + + +T+ET+ Sbjct: 1046 QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETAT 1105 Query: 484 EEVEEKRAREREQLAAWYQPIPEPVKENVP-------VKPTVSVAPSIPPVEAVAAAASL 536 E EEK E E+ Q +P+ + P V+P A P + S Sbjct: 1106 VEKEEKAKVETEKT----QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161 Query: 537 DAGIKSGALAAGAAAAAPAFSLATGGAPRPQVKEGIGPQLPRPNRVRVPTRRELASYGIK 596 A ++ + P+ P + PT +S K Sbjct: 1162 TNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQ-PTVNSESSNKPK 1220 Query: 597 LPSQRIAEEKAREAE-----RNQYETGAQ--LTDEEIDAMHQDELARQFAQSQQHRYGET 649 +R E N T A LT +A+ D A+ AQ G+ Sbjct: 1221 NRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAK--AQFVALNVGKA 1278 Query: 650 YQHDTQQAEDDDTAA------EAELARQFAASQQQRYSGEQ 684 Q E ++ + + +++SQ +R+S + Sbjct: 1279 VSQHISQLEMNNEGQYNVWVSNTSMNKNYSSSQYRRFSSKS 1319
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.1 bits (78), Expect = 8e-04 Identities = 39/158 (24%), Positives = 62/158 (39%), Gaps = 6/158 (3%) Query: 8 VMLLLCGLLLLT-LAIAVLNTLVPLWLAQANLPTWQVGMVSSSYFTGNLVGTLFTGYLIK 66 +++ LC L + L VLN +P N P V++++ +GT G L Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 67 RIGFNRSYYLASLIFAAGCVGLGVMVGFWSWMSW-RFIAGIGCAMIWVVVESALMCSGTS 125 ++G R +I G V V F+S + RFI G G A +V + Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 126 HNRGRLLAAYMMVYYMGTFLGQLLVSKVSGELLHVLPW 163 NRG+ + MG +G + G + H + W Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPA----IGGMIAHYIHW 168
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 481 bits (1240), Expect = e-173 Identities = 216/387 (55%), Positives = 265/387 (68%), Gaps = 29/387 (7%) Query: 2 MKRKILAAVIPALLAAATANAAEIYNKDGNKLDLYGKAVGRHVWTTTGDSKNADQTYAQI 61 MKRK+LA VIPALLAA A+AAEIYNKDGNKLDLYGK G H + + SK+ DQTY ++ Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLH-YFSDDSSKDGDQTYMRV 59 Query: 62 GFKGETQINTDLTGFGQWEYRTKADRAEGEQQNSNLVRLAFAGLKYAEVGSIDYGRNYGI 121 GFKGETQIN LTG+GQWEY +A+ EGE NS RLAFAGLK+ + GS DYGRNYG+ Sbjct: 60 GFKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNYGV 118 Query: 122 VYDVESYTDMAPYFSGETWGGAYTDNYMTSRAGGLLTYRNSDFFGLVDGLSFGIQYQGKN 181 +YDVE +TDM P F G+++ Y DNYMT RA G+ TYRN+DFFGLVDGL+F +QYQGKN Sbjct: 119 LYDVEGWTDMLPEFGGDSY--TYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKN 176 Query: 182 QDNHS---------------INSQNGDGVGYTMAYEFD-GFGVTAAYSNSKRTNDQQDRD 225 + + I NGDG G + Y+ GF AAY+ S RTN+Q + Sbjct: 177 ESQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAG 236 Query: 226 G---NGDRAESWAVGAKYDANNVYLAAVYAETRNMSIVENTVTD-TVEMANKTQNLEVVA 281 G GD+A++W G KYDANN+YLA +Y+ETRNM+ T +ANKTQN EV A Sbjct: 237 GTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTA 296 Query: 282 QYQFDFGLRPAISYVQSKGKQL---NGAGGSADLAKYIQAGATYYFNKNMNVWVDYRFNL 338 QYQFDFGLRPA+S++ SKGK L N G DL KY GATYYFNKN + +VDY+ NL Sbjct: 297 QYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINL 356 Query: 339 LDEND--YSSSYVGTDDQAAVGITYQF 363 LD++D Y + + TDD A+G+ YQF Sbjct: 357 LDDDDPFYKDAGISTDDIVALGMVYQF 383
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 27.0 bits (59), Expect = 0.033 Identities = 11/41 (26%), Positives = 15/41 (36%) Query: 83 TRTHQSNCNTRSQTHSSSTSKTRSSSVGFSVGGPVGASIGL 123 QS NT SQT + S + + S + V G Sbjct: 302 KNEDQSTQNTDSQTRTISKNTSTSRTHTSEVHGNAEVHASF 342
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 34.7 bits (79), Expect = 0.002 Identities = 44/300 (14%), Positives = 93/300 (31%), Gaps = 29/300 (9%) Query: 453 DTLDEKIATLQEKIARARKTPWTVSSSQTEYDQQQLNELQEQKRQKDLLDAKAQAERNYQ 512 D + + TL+ K + + E ++ N ++ ++ L KA + + Sbjct: 60 DKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELE 119 Query: 513 KTQKRRNEQNAALNRDNETESLRHQREVARITAMQYADAAVRNAALERENERHKKAMARQ 572 + + T + + A A A ALE A+ Sbjct: 120 ARKADLEKALEGAMNF-STADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKI 178 Query: 573 KEKPKAYYNDEAGRLLLQYSQQQAQTEGLIAAAKLSTTEKMTEAHKQLLSFQQRIADLSG 632 K EA + L+ + + A +AK+ T E A Sbjct: 179 KTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARK----------- 227 Query: 633 KKLTADEQSVLAHKDEIALALQKLDISQQDLQHQNAFNELKKKTLTLTSQLADEESRVRQ 692 L + + + ++ L+ + L+ + A E + S + + + Sbjct: 228 ADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLE 287 Query: 693 QHALALATMGMGDQQRGRYEEHLKIQQHYQEQLEQLKRDSKAKGTYGSDEYRQAEQELQA 752 AL + ++ E ++ + L+RD A R+A+++L+A Sbjct: 288 AEKAAL------EAEKADLEHQSQVLN---ANRQSLRRDLDAS--------REAKKQLEA 330 Score = 33.1 bits (75), Expect = 0.005 Identities = 47/248 (18%), Positives = 76/248 (30%), Gaps = 19/248 (7%) Query: 457 EKIATLQEKIARARKTPWTVSSSQTEYDQQQLNELQEQKRQKDLLDAKAQAERNYQKTQK 516 E + KT ++ + L+ AK + + Sbjct: 165 EGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALA 224 Query: 517 RRNEQNAALNRDNETESLRHQREVARITAMQYADAAVRNAALERENERHKKAMA------ 570 R S ++ + A + A R A LE+ E Sbjct: 225 ARKADLEKALEGAMNFSTADSAKIKTLEA-EKAALEARQAELEKALEGAMNFSTADSAKI 283 Query: 571 RQKEKPKAYYNDEAGRLLLQYSQQQAQTEGLIAAAKLSTTEKM-TEAHKQLLSFQQRIAD 629 + E KA E L Q A + L S K EA Q L Q +I++ Sbjct: 284 KTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISE 343 Query: 630 LSGKKLTADEQSVLAHKDEIALALQKL-------DISQQDLQHQ-NAFNELKKKTLTLTS 681 S + L D + K ++ QKL + S+Q L+ +A E KK+ + Sbjct: 344 ASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQ---VEK 400 Query: 682 QLADEESR 689 L + S+ Sbjct: 401 ALEEANSK 408
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 159 bits (404), Expect = 1e-52 Identities = 51/156 (32%), Positives = 83/156 (53%), Gaps = 8/156 (5%) Query: 1 MKKIV-VAVLVGLALGSIGVANAAGYKNTVSIGYAYTDLSGWLSGNANGANIKYNWEDLD 59 MKKI ++ L + + G + AA +TV+ GYA +D G ++ G N+KY +E+ + Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAA--TSTVTGGYAQSDAQGQMN-KMGGFNLKYRYEEDN 57 Query: 60 SGFGAMGSVTYTSADVNNYGYKVGDADYTSLLVGPSYRFNDYLNAYVMIGAANGHIKDN- 118 S G +GS TYT Y + GP+YR ND+ + Y ++G G + Sbjct: 58 SPLGVIGSFTYTEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQTTE 117 Query: 119 ---WGNSDNKTAFAYGAGIQLNPVENIAVNASYEHT 151 + + + F+YGAG+Q NP+EN+A++ SYE + Sbjct: 118 YPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQS 153
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 33.5 bits (76), Expect = 0.004 Identities = 56/239 (23%), Positives = 85/239 (35%), Gaps = 16/239 (6%) Query: 123 NATAAGQASEQAQTSAGQASES-----ATAAVNAAGAAEASATQAASSAASAESSAGTA- 176 N T G S G SES ATA + A + A QAA + A+AE+ A Sbjct: 26 NGTPDGSGSGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKA 85 Query: 177 -----TTKAGEASASAASADTARTAAAASAAAAKTSEANADASRTAAGDSAAAAAASATA 231 T + + A + +RT +A A A + A+ R + A A A Sbjct: 86 NRDALTQRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEA 145 Query: 232 AQTSAERAG--ASETAAKTSETQAASSAGDAGASATAAAASEKAAAASAAAAKTSETNAA 289 A+ + + A E + +ET+ +A AA SE+A A A K S + Sbjct: 146 AEKAFQEAEQRRKEIEREKAETERQLKLAEA-EEKRLAALSEEAKAVEIAQKKLSAAQSE 204 Query: 290 TSASTAAASATAASSSASEASTHAAASDTSASLA--AQSSTAAGAAATRAEDAAKRAED 346 + S+S + A + AQ+S + + RA D Sbjct: 205 VVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRAND 263
>PF07824#Type III secretion chaperone Length = 120 Score = 165 bits (419), Expect = 1e-56 Identities = 33/114 (28%), Positives = 63/114 (55%), Gaps = 1/114 (0%) Query: 1 MESLLNRLYDALGLDAPE-DEPLLIIDDGIQVYFNESDHTLEMCCPFMPLPDDILTLQHF 59 ME L + + ALG+ + + D+ +++DD + +Y + ++ + CPF LP++I L + Sbjct: 1 MEDLADVICRALGIPSIDTDDQAIMLDDDVLIYIEKEGDSINLLCPFCALPENINDLIYA 60 Query: 60 LRLNYTSAVTIGADADNTALVALYRLPQTSTEEEALTGFELFISNVKQLKEHYA 113 L LNY+ + + D + +L+A L + E+ E +IS V+ LK+ +A Sbjct: 61 LSLNYSEKICLATDDEGGSLIARLDLTGINEFEDIYVNTEYYISRVRWLKDEFA 114
>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature. Length = 538 Score = 665 bits (1717), Expect = 0.0 Identities = 186/396 (46%), Positives = 254/396 (64%), Gaps = 5/396 (1%) Query: 166 LNNQPWQTIKNTLTHNGHHYTNTQLPAAEMKIGAKDIFPSAYEGKGVCSWDTKNIHHANN 225 LNN+ W + ++H+G +Y PA+ MKIG K+IF Y GKG+C T+ H N Sbjct: 146 LNNKNWGPVNKNISHHGKNYGFQLTPASHMKIGNKNIFVKEYNGKGICCASTRESDHIAN 205 Query: 226 LWMSTVSVHEDGKDKTLFCGIRHGVLSPYH-EKDPLLRHVGAENKAKEVLTAALFSKPEL 284 +W+S V V ++GK+ +F GIRHGV+S Y +K+ R V A NKA+E+++AAL+S+PEL Sbjct: 206 MWLSKV-VDDEGKE--IFSGIRHGVISAYGLKKNSSERAVAARNKAEELVSAALYSRPEL 262 Query: 285 LNKALAGEAVSLKLVSVGLLTASNIFGKEGTMVEDQMRAWQSL-TQPGKMIHLKIRNKDG 343 L++AL+G+ V LK+VS LLT +++ G E +M++DQ+ A + L ++ G+ L IRN DG Sbjct: 263 LSQALSGKTVDLKIVSTSLLTPTSLTGGEESMLKDQVNALKGLNSKRGEPTKLLIRNSDG 322 Query: 344 DLQTVKIKPDVAAFNVGVNELALKLGFGLKASDSYNAEALHQLLGNDLRPEARPGGWVGE 403 L+ V + V FN GVNELALK+G G + D N E++ LLG++ GGW E Sbjct: 323 LLKEVSVNLKVVTFNFGVNELALKMGLGWRNVDKLNDESICSLLGDNFLKNGVIGGWAAE 382 Query: 404 WLAQYPDNYEVVNTLARQIKDIWKNNQHHKDGGEPYKLAQRLAMLAHEIDAVPAWNCKSG 463 + + P V LA QIK+I D GEPYKL+QR+ +LA+ I AVP WNCKSG Sbjct: 383 AIEKNPPCKNDVIYLANQIKEIINKKLQKNDNGEPYKLSQRMTLLAYTIGAVPCWNCKSG 442 Query: 464 KDRTGMMDSEIKREIISLHQTHMLSAPGSLPDSGGQKIFQKVLLNSGNLEIQKQNTGGAG 523 KDRTGM D+EIKREII H+T S S S +++F +L+NSGN+EIQ+ NTG G Sbjct: 443 KDRTGMQDAEIKREIIRKHETGQFSQLNSKLSSEEKRLFSTILMNSGNMEIQEMNTGVPG 502 Query: 524 NKVMKNLSPEVLNLSYQKRVGDENIWQSVKGISSLI 559 NKVMK L L LSY +R+GD IW VKG SS + Sbjct: 503 NKVMKKLPLSSLELSYSERIGDSKIWNMVKGYSSFV 538
>PF06580#Sensor histidine kinase Length = 349 Score = 33.7 bits (77), Expect = 0.001 Identities = 18/102 (17%), Positives = 38/102 (37%), Gaps = 15/102 (14%) Query: 348 ILLQRVLSNLLTNAIRYSDENAVIRIESAYDDNVAEIRVANPGSHPADADKLFRRFWRGD 407 +L+Q ++ N + + I + I ++ D+ + V N GS K Sbjct: 258 MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE-------- 309 Query: 408 NARHTAGFGLGLSLVNA-IALLHGGSASYRYADEHNIFSVRL 448 G GL V + +L+G A + +++ + + Sbjct: 310 ------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 83.7 bits (207), Expect = 1e-20 Identities = 30/117 (25%), Positives = 56/117 (47%), Gaps = 1/117 (0%) Query: 24 KILLIEDNQKTIEWVRQGLTEAGYVVDYACDGRDGLHLALQEHYSLIILDIMLPGLDGWQ 83 IL+ +D+ + Q L+ AGY V + L++ D+++P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 84 VLRALRTAHQS-PVICLTARDSVEDRVKGLEAGANDYLVKPFSFAELLARVRAQLRQ 139 +L ++ A PV+ ++A+++ +K E GA DYL KPF EL+ + L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 61.2 bits (148), Expect = 8e-14 Identities = 30/158 (18%), Positives = 58/158 (36%), Gaps = 8/158 (5%) Query: 20 RQLILTAALAVFSQYGIHGARLEQVAERAGVSKTNLLYYYPSKEALYVAVMRQILDVWLA 79 RQ IL AL +FSQ G+ L ++A+ AGV++ + +++ K L+ + Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72 Query: 80 PLKAFRAEF--SPLEAIKEYIRLKLEVSRDYPQASRLF-CMEMLAGAPLLMEELTGDLKA 136 ++A+F PL ++E + LE + + L + M + + Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRN 132 Query: 137 LIDEKSALIAGWVHSG-----KLAPISPHHLIFMIWAA 169 L E I + A + ++ Sbjct: 133 LCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGY 170
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 51.4 bits (123), Expect = 4e-09 Identities = 51/253 (20%), Positives = 90/253 (35%), Gaps = 24/253 (9%) Query: 56 AFLATAAFIGRPFGGALFGLLADKFGRKPLMMWSIVAYSVGTGLSGLASGVIMLTLSRFI 115 A A F P GAL +D+FGR+P+++ S+ +V + A + +L + R + Sbjct: 50 ALYALMQFACAPVLGAL----SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 116 VGMGMAGEYACASTYAVESWPKHLKSKASAFLVSGFGIGNIIAAYFMPSFAEAYGWRAAF 175 G+ A + A + +++ F+ + FG G ++A + + A F Sbjct: 106 AGITGATGAVAGAYIA-DITDGDERARHFGFMSACFGFG-MVAGPVLGGLMGGFSPHAPF 163 Query: 176 FV-GLLPVLLVIYIRARAPESKEWEE--AKLSGPGKHSQSAWSVFSLSMKGLFNRA---- 228 F L L + PES + E + + W+ + L Sbjct: 164 FAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQ 223 Query: 229 ---QFPLTLCVFIVLFSIFGANWPIFGLLPTYLTGEGFDTGVVSNLMTAAAFGTVLGN-- 283 Q P L V+F +W + L G ++ M LG Sbjct: 224 LVGQVPAAL---WVIFGEDRFHWDA-TTIGISLAAFGI-LHSLAQAMITGPVAARLGERR 278 Query: 284 -IVWGLCADRIGL 295 ++ G+ AD G Sbjct: 279 ALMLGMIADGTGY 291
>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature. Length = 130 Score = 28.5 bits (63), Expect = 0.010 Identities = 16/34 (47%), Positives = 20/34 (58%), Gaps = 2/34 (5%) Query: 44 LKNQDPTNPLQNNELTTQLAQISTVSGIEKLNTT 77 L N+ P N L NN L TQL + V G E+L T+ Sbjct: 89 LWNRQPLNSLDNNSLYTQLEML--VQGAERLQTS 120
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 41.1 bits (96), Expect = 7e-06 Identities = 17/48 (35%), Positives = 29/48 (60%) Query: 356 LTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 403 L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546 Score = 37.6 bits (87), Expect = 9e-05 Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 4/60 (6%) Query: 2 SFSQAVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57 + A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+ Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 43.8 bits (103), Expect = 4e-07 Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%) Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62 S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47 Query: 63 PSGLQIGTGVRPVATERLHSQ 83 +G +G GV +R + Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68 Score = 41.1 bits (96), Expect = 3e-06 Identities = 11/41 (26%), Positives = 21/41 (51%) Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMLQKLTQL 260 S VN+ EE N+ + Q+ Y N++ + T + + L + Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 353 bits (908), Expect = e-127 Identities = 211/232 (90%), Positives = 223/232 (96%) Query: 1 MQKYALHAYPVMALMVATLTGCAWIPAKPLVQGATTAQPIPGPVPVANGSIFQSAQPINY 60 MQK A H Y + +L+V +LTGCAWIP+ PLVQGAT+AQP+PGP PVANGSIFQSAQPINY Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60 Query: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTSFGFDTVPRYLQGLFGNS 120 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKT+FGFDTVPRYLQGLFGN+ Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120 Query: 121 RADMEASGGNSFNGKGGANASNTFSGTLTVTVDQVLANGNLHVVGEKQIAINQGTEFIRF 180 RAD+EASGGN+FNGKGGANASNTFSGTLTVTVDQVL NGNLHVVGEKQIAINQGTEFIRF Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180 Query: 181 SGVVNPRTISGSNSVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232 SGVVNPRTISGSN+VPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 429 bits (1104), Expect = e-153 Identities = 153/362 (42%), Positives = 215/362 (59%), Gaps = 9/362 (2%) Query: 5 LAGIVLALVATLAHAERIRDLTSVQGVRENSLIGYGLVVGLDGTGDQTTQTPFTTQTLNN 64 A L+ A RI+D+ S+Q R+N LIGYGLVVGL GTGD +PFT Q++ Sbjct: 14 SALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRA 73 Query: 65 MLSQLGITVPTGTNMQLKNVAAVMVTASYPPFARQGQTIDVVVSSMGNAKSLRGGTLLMT 124 ML LGIT G + KN+AAVMVTA+ PPFA G +DV VSS+G+A SLRGG L+MT Sbjct: 74 MLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMT 132 Query: 125 PLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAIIERELPTQFGAGNT 184 L G D Q+YA+AQG ++V G A +++ R+ NGAIIERELP++F Sbjct: 133 SLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVN 192 Query: 185 INLQLNDEDFTMAQQITDAINRAR----GYGSATALDARTVQVRVPSGNSSQVRFLADIQ 240 + LQL + DF+ A ++ D +N G A D++ + V+ P + R +A+I+ Sbjct: 193 LVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEIE 251 Query: 241 NMEVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQLNVNQPNTPFGGG 300 N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF G Sbjct: 252 NLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSRG 309 Query: 301 QTVVTPQTQIDLRQSGGSLQSVRSSANLNSVVRALNALGATPMDLMSILQSMQSAGCLRA 360 QT V PQT I Q G + ++ +L ++V LN++G +++ILQ ++SAG L+A Sbjct: 310 QTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQA 368 Query: 361 KL 362 +L Sbjct: 369 EL 370
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 499 bits (1285), Expect = 0.0 Identities = 263/316 (83%), Positives = 289/316 (91%), Gaps = 3/316 (0%) Query: 1 MIGDGKLLASAAWDAQSLNELKAKAGQDPAANIRPVARQVEGMFVQMMLKSMREALPKDG 60 MI D KLLASAAWDAQSLNELKAKAG+DPAANIRPVARQVEGMFVQMMLKSMR+ALPKDG Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60 Query: 61 LFSSDQTRLYTSMYDQQIAQQMTAGKGLGLADMMVKQMTSGQTMPADDAPQVPLKFSLET 120 LFSS+ TRLYTSMYDQQIAQQMTAGKGLGLA+MMVKQMT Q +P + P P+KF LET Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120 Query: 121 VNSYQNQALTQLVRKAIPKTPDSSDAPLSGDSKDFLARLSLPARLASEQSGVPHHLILAQ 180 V YQNQAL+QLV+KA+P+ D S L GDSK FLA+LSLPA+LAS+QSGVPHHLILAQ Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDS---LPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQ 177 Query: 181 AALESGWGQRQILRENGEPSYNVFGVKATASWKGPVTEITTTEYENGEAKKVKAKFRVYS 240 AALESGWGQRQI RENGEPSYN+FGVKA+ +WKGPVTEITTTEYENGEAKKVKAKFRVYS Sbjct: 178 AALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYS 237 Query: 241 SYLEALSDYVALLTRNPRYAAVTTAATAEQGAVALQNAGYATDPNYARKLTSMIQQLKAM 300 SYLEALSDYV LLTRNPRYAAVTTAA+AEQGA ALQ+AGYATDP+YARKLT+MIQQ+K++ Sbjct: 238 SYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSI 297 Query: 301 SEKVSKTYSANLDNLF 316 S+KVSKTYS N+DNLF Sbjct: 298 SDKVSKTYSMNIDNLF 313
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 664 bits (1714), Expect = 0.0 Identities = 438/553 (79%), Positives = 487/553 (88%), Gaps = 8/553 (1%) Query: 2 SSLINHAMSGLNAAQAALNTVSNNINNYNVAGYTRQTTILAQANSTLGAGGWIGNGVYVS 61 SSLIN+AMSGLNAAQAALNT SNNI++YNVAGYTRQTTI+AQANSTLGAGGW+GNGVYVS Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60 Query: 62 GVQREYDAFITNQLRGAQNQSSGLTTRYEQMSKIDNLLADKSSSLSGSLQSFFTSLQTLV 121 GVQREYDAFITNQLR AQ QSSGLT RYEQMSKIDN+L+ +SSL+ +Q FFTSLQTLV Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120 Query: 122 SNAEDPAARQALIGKAEGLVNQFKTTDQYLRDQDKQVNIAIGSSVAQINNYAKQIANLND 181 SNAEDPAARQALIGK+EGLVNQFKTTDQYLRDQDKQVNIAIG+SV QINNYAKQIA+LND Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180 Query: 182 QISRMTGVGAGASPNDLLDQRDQLVSELNKIVGVEVSVQDGGTYNLTMANGYTLVQGSTA 241 QISR+TGVGAGASPN+LLDQRDQLVSELN+IVGVEVSVQDGGTYN+TMANGY+LVQGSTA Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240 Query: 242 RQLAAVPSSADPTRTTVAYVDEAAGNIEIPEKLLNTGSLGGLLTFRSQDLDQTRNTLGQL 301 RQLAAVPSSADP+RTTVAYVD AGNIEIPEKLLNTGSLGG+LTFRSQDLDQTRNTLGQL Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300 Query: 302 ALAFADAFNAQHTKGYDADGNKGKDFFSIGSPVVYSNSNNADKTVSLTAKVVDSTKVQAT 361 ALAFA+AFN QH G+DA+G+ G+DFF+IG P V N+ N V++ A V D++ V AT Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGD-VAIGATVTDASAVLAT 359 Query: 362 DYKIVFDGTDWQVTRTADNTTFTATKDADGKLEIDGLKVTVGTGAQKNDSFLLKPVSNAI 421 DYKI FD WQVTR A NTTFT T DA+GK+ DGL++T NDSF LKPVS+AI Sbjct: 360 DYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAI 419 Query: 422 VDMNVKVTNEAEIAMASESKLDPDVDTGDSDNRNGQALLDLQ-NSNVVGGNKTFNDAYAT 480 V+M+V +T+EA+IAMASE D GDSDNRNGQALLDLQ NS VGG K+FNDAYA+ Sbjct: 420 VNMDVLITDEAKIAMASEE------DAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYAS 473 Query: 481 LVSDVGNKTSTLKTSSTTQANVVKQLYKQQQSVSGVNLDEEYGNLQRYQQYYLANAQVLQ 540 LVSD+GNKT+TLKTSS TQ NVV QL QQQS+SGVNLDEEYGNLQR+QQYYLANAQVLQ Sbjct: 474 LVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQ 533 Query: 541 TANALFDALLNIR 553 TANA+FDAL+NIR Sbjct: 534 TANAIFDALINIR 546
>FLAGELLIN#Flagellin signature. Length = 507 Score = 41.2 bits (96), Expect = 4e-06 Identities = 30/138 (21%), Positives = 59/138 (42%) Query: 1 MRISTQMMYEQNMSGITNSQAEWMKLGEQMSTGKRVTNPSDDPIAASQAVVLSQAQAQNS 60 I+T + + + SQ+ E++S+G R+ + DD + A + + Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61 Query: 61 QYALARTFATQKVSLEESVLSQVTTAIQTAQEKIVYAGNGTLSDDDRASLATDLQGIRDQ 120 Q + E L+++ +Q +E V A NGT SD D S+ ++Q ++ Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121 Query: 121 LMNLANSTDGNGRYIFAG 138 + ++N T NG + + Sbjct: 122 IDRVSNQTQFNGVKVLSQ 139
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 55.8 bits (134), Expect = 8e-10 Identities = 50/259 (19%), Positives = 93/259 (35%), Gaps = 26/259 (10%) Query: 513 PSEEEYAERKRPEQPALATFAMPDVPPAPTPVEPAVSVATAKKDNVAAAQPAQPGLFSRF 572 P E+ + DVP P+ + A+ D PA P S Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSN-----NEEIARVDEAPVPPPA-PATPSET 1036 Query: 573 LNALKQLFSGEETKTVETAAPKAEEKAERQQDRRKPRQNNRRDRNERRDTRDNRAGRDGG 632 S +E+KTVE A E + ++ K ++N + +T+ N + G Sbjct: 1037 TE-TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKA-----NTQTNEVAQSGS 1090 Query: 633 ESRDDNRRNRRQTQQQNAEAR---DTRQQETAEKVKTGDEQQQTPRRERSRRRNDDKRQA 689 E+++ ++T E + +T + + KV + Q +P++E+S A Sbjct: 1091 ETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS----QVSPKQEQSETVQPQAEPA 1146 Query: 690 QQEVKALNREEQPVQETEQEERVQQVQPRRKQRQLNQKVRFTNSAVVETVDTPVVVDEPR 749 ++ +N +E Q + QP ++ N + T S V T ++ V E Sbjct: 1147 RENDPTVNIKEPQSQTNTTAD---TEQPAKETSS-NVEQPVTESTTVNTGNSVVENPENT 1202 Query: 750 PVENVEQPVPAPRTELAKV 768 + P +E + Sbjct: 1203 TPATTQ---PTVNSESSNK 1218 Score = 40.0 bits (93), Expect = 5e-05 Identities = 53/372 (14%), Positives = 89/372 (23%), Gaps = 47/372 (12%) Query: 630 DGGESRDDNRRNRRQTQQQNAEARDTRQQETAEKVKTGDEQQQTPRRERSRRRNDDKRQA 689 D G + R + N E Q + T + Q S + Sbjct: 963 DLGAWKYKLRNVNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDE 1022 Query: 690 QQEVKALNREEQPVQETEQEERVQQVQPRRKQRQLNQKVRFTNSAVVETVDTPVVVDEPR 749 ET E Q+ + K Q + N V + V + Sbjct: 1023 APVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKE-AKSNVKANTQ 1081 Query: 750 PVENVEQPVPAPRTELAKVDLPVVADIAPEQDDSVEPRDNTGMPRRSRRSPRHLRVSGQR 809 E + T+ + A + E+ VE +++ P+ + Sbjct: 1082 TNEVAQSGSETKETQTTETKET--ATVEKEEKAKVETE-------KTQEVPKVTSQVSPK 1132 Query: 810 RRRYRDERYPTQSPMPLTVACASPEMASGKVWIRYPIVRPQETQVVDEQREADLALPQPV 869 + + + + P V +E Q AD P Sbjct: 1133 QEQSETVQPQAEPARE-----------------NDPTVNIKEPQS-QTNTTADTEQP--- 1171 Query: 870 VAEQQVIAATAALEPQASVQAVENVAVEPQTVAEPQAPEVVEVETTHPEVIAAPVDEQPQ 929 A Q V + + PE TT P V + ++ Sbjct: 1172 ----------AKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKN 1221 Query: 930 LIAESDTPVAQEVIA------DAEPVAETADASITVAENVADVVVVEPEEETKAEAAVVE 983 S V V D VA S ++D AV + Sbjct: 1222 RHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQ 1281 Query: 984 HTAEETVIAPAQ 995 H ++ + Q Sbjct: 1282 HISQLEMNNEGQ 1293
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 197 bits (503), Expect = 2e-67 Identities = 67/187 (35%), Positives = 94/187 (50%), Gaps = 18/187 (9%) Query: 1 MKNIILSTLVITTSVLVVNVAQADTNAFSVGYAQSKVQDFKN-IRGVNVKYRYE-DDSPV 58 MK I + + + A T+ + GYAQS Q N + G N+KYRYE D+SP+ Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAATSTVTGGYAQSDAQGQMNKMGGFNLKYRYEEDNSPL 60 Query: 59 SFISSLSYLYGDRQASGSVEPEGIHYHDKFEVKYGSLMVGPAYRLSDNFSLYALAGVGTV 118 I S +Y R AS D + +Y + GPAYR++D S+Y + GVG Sbjct: 61 GVIGSFTYTEKSRTASSG---------DYNKNQYYGITAGPAYRINDWASIYGVVGVGYG 111 Query: 119 KATFKEHSTQDGDSFSNKISSRKTGFAWGAGVQMNPLENIVVDVGYEGSNISSTKINGFN 178 K E+ T D+ GF++GAG+Q NP+EN+ +D YE S I S + + Sbjct: 112 KFQTTEYPTYKHDT-------SDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWI 164 Query: 179 VGVGYRF 185 GVGYRF Sbjct: 165 AGVGYRF 171
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.4 bits (63), Expect = 0.002 Identities = 8/37 (21%), Positives = 17/37 (45%), Gaps = 5/37 (13%) Query: 4 LSWIIFGLIAGILAKWIMPG-----KDGGGFFMTIIL 35 + I+ G I+G++ W+ K ++ I+L Sbjct: 163 AAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL 199
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 28.5 bits (63), Expect = 0.008 Identities = 17/59 (28%), Positives = 25/59 (42%) Query: 49 QGLTVGIIILTIGVMAPIASGTLPPSTLIHSFVNWKSLVAIAVGVFVSWLGGRGITLMG 107 Q + L IG + + LPPS ++ N ++ A VS LG +TL G Sbjct: 174 QRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDG 232
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 119 bits (299), Expect = 6e-39 Identities = 34/89 (38%), Positives = 55/89 (61%) Query: 4 TKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFGNFDLRDKNQRPGR 63 K ++ + + L+K+D+ V+ F + L GE+V+L GFGNF++R++ R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 64 NPKTGEDIPITARRVVTFRPGQKLKSRVE 92 NP+TGE+I I A +V F+ G+ LK V+ Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 27.2 bits (60), Expect = 0.007 Identities = 14/29 (48%), Positives = 19/29 (65%) Query: 6 QLILGAVVLGSTLLAGCSSNAKIDQLSSD 34 +L L A+ LG+TLL GC+S+ Q SD Sbjct: 2 KLRLSALALGTTLLVGCASSGTDQQGRSD 30
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 27.2 bits (60), Expect = 0.006 Identities = 17/45 (37%), Positives = 26/45 (57%), Gaps = 1/45 (2%) Query: 5 KLVLGAVILGSTLLAGCSSNAKIDQLSSD-VQTLNAKVDQLSNDV 48 KL L A+ LG+TLL GC+S+ Q SD ++ N + + +V Sbjct: 2 KLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNV 46
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 84.5 bits (209), Expect = 2e-21 Identities = 31/127 (24%), Positives = 56/127 (44%) Query: 2 ATIHLLDDDTAVTNACAFLLESLGYDVKCWTQGADFLAQASLYQAGVVLLDMRMPVLDGQ 61 ATI + DDD A+ L GYDV+ + A + +V+ D+ MP + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 GVHDALRQCGSTLAVVFLTGHGDVPMAVEQMKRGAVDFLQKPVSVKPLQAALERALTVSS 121 + +++ L V+ ++ A++ ++GA D+L KP + L + RAL Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 122 AAVARRE 128 ++ E Sbjct: 124 RRPSKLE 130
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 66.0 bits (161), Expect = 6e-15 Identities = 28/119 (23%), Positives = 50/119 (42%), Gaps = 2/119 (1%) Query: 1 MKEYKILLVDDHEIIINGIMNALLPWPHFKIVEHVKNGLEVYNACCAYEPDILILDLSLP 60 M IL+ DD I + AL + V N ++ A + D+++ D+ +P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GINGLDIIPQLHQRWPAMNILVYTAYQQEYMTIKTLAAGANGYVLKSSSQQVLLAALQT 119 N D++P++ + P + +LV +A IK GA Y+ K L+ + Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 67.5 bits (165), Expect = 9e-14 Identities = 31/156 (19%), Positives = 57/156 (36%), Gaps = 13/156 (8%) Query: 691 ILLVDDADINRDIIGKMLVSLGQHVTIAASSNEALTLSQQQRFDLVLIDIRMPEIDGIEC 750 IL+ DD R ++ + L G V I +++ DLV+ D+ MP+ + + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 751 VRLWHDEPNNLDPDCMFVALSASVATEDIHRCKKNGIHHYITKPVTLATLARYISIAAEY 810 + PD + +SA + + G + Y+ KP L L Sbjct: 66 LP----RIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG-------- 113 Query: 811 QLLRNIELQEQDPSRCSALLAT-DDMVINSKIFQSL 845 + R + ++ PS+ +V S Q + Sbjct: 114 IIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEI 149
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 583 bits (1504), Expect = 0.0 Identities = 157/500 (31%), Positives = 261/500 (52%), Gaps = 15/500 (3%) Query: 11 LLFILNTAKSDELSWKGNDFTLYARQMPLAEVLHLLSENYDTAITISPLITATFSGKIPP 70 LL + + + + EL W + A+ L ++L NYD + +S I SG+ Sbjct: 17 LLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDKVSGQFEH 76 Query: 71 GPPVDILNNLAAQYDLLTWFDGSMLYVYPASLLKHQVITFNILSTGRFIHYLRSQNILSS 130 P D L ++A+ Y+L+ ++DG++LY++ S + ++I L+ I Sbjct: 77 DNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQRSGIWE- 135 Query: 131 PGCEVKEITGTKAVEVSGVPSCLTRISQLASVLDNALIKR--KDSAVSVSIYTLKYATAM 188 P + + V VSG P L + Q A+ L+ R K A+++ I+ LKYA+A Sbjct: 136 PRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIFPLKYASAS 195 Query: 189 DTQYQYRDQSVVVPGVVSVL-REMSKTSVPTSSTNN-----GSPATQALPMFAADPRQNA 242 D YRD V PGV ++L R +S ++ + +N + A ADP NA Sbjct: 196 DRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAATRASAQARVEADPSLNA 255 Query: 243 VIVRDYAANMAGYRKLITELDQRQQMIEISVKIIDVNAGDINQLGIDWGTAVSLGG---- 298 +IVRD M Y++LI LD+ IE+++ I+D+NA + +LG+DW + G Sbjct: 256 IIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQV 315 Query: 299 --KKIAFNTGLNDGGASGFSTVISDTSNFMVRLNALEKSSQAYVLSQPSVVTLNNIQAVL 356 K + + GA G + R+N LE A V+S+P+++T N QAV+ Sbjct: 316 VIKTTGDQSNIASNGALGSLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVI 375 Query: 357 DKNITFYTKLQGEKVAKLESITTGSLLRVTPRLLNDNGTQKIMLNLNIQDGQQSDTQSET 416 D + T+Y K+ G++VA+L+ IT G++LR+TPR+L +I LNL+I+DG Q S Sbjct: 376 DHSETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGI 435 Query: 417 DPLPEVQNSEIASQATLLAGQSLLLGGFKQGKQIHSQNKIPLLGDIPVVGHLFRNDTTQV 476 + +P + + + + A + GQSL++GG + + + +K+PLLGDIP +G LFR + Sbjct: 436 EGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELT 495 Query: 477 HSVIRLFLIKASVVNNGISH 496 +RLF+I+ +++ GI+H Sbjct: 496 RRTVRLFIIEPRIIDEGIAH 515
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 27.3 bits (60), Expect = 0.048 Identities = 15/44 (34%), Positives = 22/44 (50%) Query: 78 SNEMDEVIAKAAKGDAKTKEEVPEDVIKYMRDNGILIDGMTIDD 121 + E I K K +E+PED +KY+ + L DG ID+ Sbjct: 368 NTEEQAKINNKIKEAIKMFKELPEDFVKYINSDKALKDGNKIDN 411
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 88.1 bits (218), Expect = 8e-25 Identities = 40/154 (25%), Positives = 67/154 (43%), Gaps = 7/154 (4%) Query: 6 TLQQAHDTMRFFRRGGSLRMLL---DDDVTQPLNTLYRYATQLMEVKEFAGAARLFQLLT 62 T + F + GG++ ML D L LY A + ++ A ++FQ L Sbjct: 8 TQEYQLAMESFLKGGGTIAMLNEISSDT----LEQLYSLAFNQYQSGKYEDAHKVFQALC 63 Query: 63 IYDAWSFDYWFRLGECCQAQKHWGEAIYAYGRAAQIKIDAPQAPWAAAECYLACDNVCYA 122 + D + ++ LG C QA + AI++Y A + I P+ P+ AAEC L + A Sbjct: 64 VLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEA 123 Query: 123 IKALKAVVRICGEVSEHQILRQRAEKMLQQLSDR 156 L + + +E + L R ML+ + + Sbjct: 124 ESGLFLAQELIADKTEFKELSTRVSSMLEAIKLK 157
>PF05844#YopD protein Length = 295 Score = 29.2 bits (65), Expect = 0.009 Identities = 29/149 (19%), Positives = 48/149 (32%), Gaps = 19/149 (12%) Query: 9 VLPAPSL-LTPSSTPSPSGEGMGTESMLLLFDDIWMKLMELAKKLRDIMRSYNVEKQRLA 67 L AP L P + E + +LL+ I K EL RD + Q+ Sbjct: 50 ELNAPRQVLDPVRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQAIIHAQK-- 107 Query: 68 WELQVNVLQTQMKTIDEAFRASMITAGGAMLSGVLTIGLGAVGGETGLIAGQAVGHTAGG 127 +DE + + A+++GV + VG L G+A+ Sbjct: 108 ------------AQVDEMRSGATLMIAMAVIAGVGALASAVVGSLGALKNGKAISQEK-- 153 Query: 128 VMGLGAGVAQRQSDQDKAIADLQQNGAQS 156 L + R D + L + + Sbjct: 154 --TLQKNIDGRNELIDAKMQALGKTSDED 180
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 79.2 bits (195), Expect = 1e-21 Identities = 26/127 (20%), Positives = 49/127 (38%) Query: 16 LKQLLSVDPETVYASGYASWQEGDYSRAVIDFSWLVMAQPWSWRAHIALAGTWMMLKEYT 75 L ++ S E +Y+ + +Q G Y A F L + + R + L + +Y Sbjct: 28 LNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYD 87 Query: 76 TAINFYGHALMLDASHPEPVYQTGVCLKMMGEPGLAREAFQTAIKMSYADASWSEIRQNA 135 AI+ Y + ++D P + CL GE A A ++ + E+ Sbjct: 88 LAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRV 147 Query: 136 QIMVDTL 142 M++ + Sbjct: 148 SSMLEAI 154
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 52.3 bits (125), Expect = 5e-10 Identities = 29/183 (15%), Positives = 70/183 (38%), Gaps = 15/183 (8%) Query: 23 LYRSLPEDEANQMLALLMQHHIDAEKKQEEDGVTLRVEQSQFINAVELLRLNGYPHRQFT 82 L+ +L + + ++A L Q +I + V + L G P + Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNIPYRFA--NGSGAIEVPADKVHELRLRLAQQGLP-KGGA 109 Query: 83 TADKMFPANQLVVSPQEEQQKINFLK--EQRIEGMLSQMEGVINAKVTIALPTYDEGS-- 138 ++ + +S EQ +N+ + E + + + V +A+V +A+P + S Sbjct: 110 VGFELLDQEKFGISQFSEQ--VNYQRALEGELARTIETLGPVKSARVHLAMP---KPSLF 164 Query: 139 --NASPSSVAVFIKYSPQVNMEAFRVK-IKDLIEMSIPGLQYSKISILMQPAEFRMVADV 195 S +V + P ++ ++ + L+ ++ GL ++++ Q ++ Sbjct: 165 VREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNT 224 Query: 196 PAR 198 R Sbjct: 225 SGR 227
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 53.9 bits (129), Expect = 2e-10 Identities = 17/69 (24%), Positives = 32/69 (46%) Query: 247 LEQIPQQVLFEVGRASLEIGQLRQLKTGDVLPVGGCFAPEVTIRVNDRIIGQGELIACGN 306 L Q+P ++ F + R ++ + +L + +L + V I N ++G GEL+ + Sbjct: 227 LNQLPVKLEFVLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMND 286 Query: 307 EFMVRITRW 315 V I W Sbjct: 287 TLGVEIHEW 295
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 231 bits (592), Expect = 9e-80 Identities = 79/215 (36%), Positives = 130/215 (60%), Gaps = 8/215 (3%) Query: 8 LQLIGILFLLSILPLIIVMGTSFLKLAVVFSILRNALGIQQVPPNIALYGLALVLSLFIM 67 + LI +L ++LP II GT F+K ++VF ++RNALG+QQ+P N+ L G+AL+LS+F+M Sbjct: 5 ISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMFVM 64 Query: 68 GPTLLAVKERWHPVQVAGAPFWT-SEWDSKALAPYRQFLQKNSEEKEANYFRNLIKRTWP 126 P + + V + S+ + L YR +L K S+ + +F N + Sbjct: 65 WPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKRQY 124 Query: 127 ED-------IKRKIKPDSLLILIPAFTVSQLTQAFRIGLLIYLPFLAIDLLISNILLAMG 179 + K +I+ S+ L+PA+ +S++ AF+IG +YLPF+ +DL++S++LLA+G Sbjct: 125 GEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLALG 184 Query: 180 MMMVSPMTISLPFKLLIFLLAGGWDLTLAQLVQSF 214 MMM+SP+TIS P KL++F+ GW L L+ + Sbjct: 185 MMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 72.5 bits (178), Expect = 9e-21 Identities = 30/85 (35%), Positives = 50/85 (58%) Query: 4 SELTQFVTQLLWIVLFTSMPVVLVASVVGVIVSLVQALTQIQDQTLQFMIKLLAIAITLM 63 +L + L++VL S +VA+++G++V L Q +TQ+Q+QTL F IKLL + + L Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 VSYPWLSGILLNYTRQIMLRIGEHG 88 + W +LL+Y RQ++ G Sbjct: 62 LLSGWYGEVLLSYGRQVIFLALAKG 86
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 164 bits (416), Expect = 6e-52 Identities = 55/229 (24%), Positives = 101/229 (44%), Gaps = 5/229 (2%) Query: 8 WLIALAVAFIRPLSLSLLLPLLKSGSLGAALLRNGVLMSLTFPILPIIYQQKIMMHIGKD 67 WL +R L+L P+L S+ ++ G+ M +TF I P + + + Sbjct: 12 WLNLYFWPLLRVLALISTAPILSERSVPK-RVKLGLAMMITFAIAPSLPANDVPVF---S 67 Query: 68 YSWLGLVTGEVIIGFSIGFCAAVPFWAVDMAGFLLDTLRGATMGTIFNSTIEAETSLFGL 127 + L L +++IG ++GF F AV AG ++ G + T + + Sbjct: 68 FFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLAR 127 Query: 128 LFSQFLCVIFFISGGMEFILNILYESYQYLPPGRTLLFDQQFLKYIQAEWRTLYQLCISF 187 + ++F G +++++L +++ LP G L FL +A ++ + Sbjct: 128 IMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSL-IFLNGLML 186 Query: 188 SLPAIICMVLADLALGLLNRSAQQLNVFFFSMPLKSILVLLTLLISFPY 236 +LP I ++ +LALGLLNR A QL++F PL + + + P Sbjct: 187 ALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPL 235
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 57.6 bits (139), Expect = 3e-11 Identities = 44/192 (22%), Positives = 86/192 (44%), Gaps = 8/192 (4%) Query: 36 LSDIAESFHMQTAQVGIMLTIYAWVVAVMSLPFMLLTSQMERRKLLIYLFVLFIASHVLS 95 L DIA F+ A + T + ++ + + L+ Q+ ++LL++ ++ V+ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 96 FLAWN-FTVLVISRIGIAFAHAIFWSITASLAIRLAPAGKRAQALSLIATGTALAMVLGL 154 F+ + F++L+++R A F ++ + R P R +A LI + A+ +G Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 155 PIGRVVGQYFGWRTTFFAIGMGALITLLCLIKLLPKLPSEHSGSLKSLPLLFRRPALMSL 214 IG ++ Y W + I M +IT+ L+KLL K + LMS+ Sbjct: 157 AIGGMIAHYIHW-SYLLLIPMITIITVPFLMKLLKK------EVRIKGHFDIKGIILMSV 209 Query: 215 YVLTVVVVTAHY 226 ++ ++ T Y Sbjct: 210 GIVFFMLFTTSY 221
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 30.5 bits (69), Expect = 0.008 Identities = 12/51 (23%), Positives = 21/51 (41%), Gaps = 1/51 (1%) Query: 22 GQGKVADYIPALASVEGSKLGI-AICTVDGQHYQAGDAHERFSIQSISKVL 71 + + I S ++G+ + G+ A A ERF + S KV+ Sbjct: 21 ASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVV 71
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 470 bits (1211), Expect = e-169 Identities = 235/386 (60%), Positives = 275/386 (71%), Gaps = 20/386 (5%) Query: 3 KVVVLSAVAAAVMMAGAANAAEIYNKDGNKLDLYGKVDGLHYFSSNHSTDGDQSYIRMGI 62 K VL+ V A++ AGAA+AAEIYNKDGNKLDLYGKVDGLHYFS + S DGDQ+Y+R+G Sbjct: 2 KRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVGF 61 Query: 63 KGETQITDQLTGFGQWEYQVNANRPEDGDSSGSPQSWTRLGFAGLAFADMGSVDYGRNYG 122 KGETQI DQLTG+GQWEY V AN E SWTRL FAGL F D GS DYGRNYG Sbjct: 62 KGETQINDQLTGYGQWEYNVQANTTE----GEGANSWTRLAFAGLKFGDYGSFDYGRNYG 117 Query: 123 VLYDIGSWTDVLPEFGNDSYEASDNFMTGRANGVLTYRNNDFFGLVDGLNIALQYQGKND 182 VLYD+ WTD+LPEFG DSY +DN+MTGRANGV TYRN DFFGLVDGLN ALQYQGKN+ Sbjct: 118 VLYDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNE 177 Query: 183 GLSKEGDPLSNNAR---KSIAYQNGDGFGASATYDLGMGVSLGAAYTSSKRTLDQMTQDK 239 S + + N R I Y NGDGFG S TYD+GMG S GAAYT+S RT +Q+ Sbjct: 178 SQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG 237 Query: 240 YD-NGDRAEAWTGGVKYDANNIYLAANYTRTYDMTYMGDTL----GGFAHKTDNWEMVGQ 294 GD+A+AWT G+KYDANNIYLA Y+ T +MT G T GG A+KT N+E+ Q Sbjct: 238 TIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQ 297 Query: 295 YQFDNGLRPSLAFLQSRANDVD----GLGSFDLVKYIDVGSYYYFNKNMSAYVDYKINLL 350 YQFD GLRP+++FL S+ D+ DLVKY DVG+ YYFNKN S YVDYKINLL Sbjct: 298 YQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLL 357 Query: 351 KDGNP----SNPNTDNTVALGLVYEF 372 D +P + +TD+ VALG+VY+F Sbjct: 358 DDDDPFYKDAGISTDDIVALGMVYQF 383
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 55.2 bits (133), Expect = 2e-10 Identities = 67/396 (16%), Positives = 138/396 (34%), Gaps = 30/396 (7%) Query: 10 IVFLLFIVYMLNYMDRSALSITAPLIEKELGFN---AAEMGMIFSAFFIGYALFNFIGGW 66 ++ +L V L+ + + P + ++L + A G++ + + + + G Sbjct: 7 LIVILSTVA-LDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65 Query: 67 ASDKVGPKTVFLIAALLWSVFCGLTGLVTGLWTMLIVRVLFGMAEGPVSAAGNKIINNWI 126 SD+ G + V L++ +V + LW + I R++ G+ + AG I + Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGA-YIADIT 124 Query: 127 SRKESATAIGIFSAGSPLGGAVSGPIVGLLALSLGWRPAFGIIFLFGLVWVLLWYFIVSD 186 E A G SA G V+GP++G L F + L F++ + Sbjct: 125 DGDERARHFGFMSA-CFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183 Query: 187 KPTMSKRLAPEERIDFENHEDVILSDDGRATPSLGYYMKQPMVWATTLAFFSYNYILFFF 246 +R E + + + FF + Sbjct: 184 SHKGERRPLRREAL-------------NPLASFRWARGMTVVAALMAV-FFIMQLVGQVP 229 Query: 247 LTWFPSYLNHSLHLDIKEISIATVIPWVIGAIGMVLGGVCSDVIYRITGNALLSRRLILG 306 + + H D I I+ G + + + + + L R L Sbjct: 230 AALWVIFGEDRFHWDATTIGISLA---AFGILHSLAQAMITGPVAA-----RLGERRALM 281 Query: 307 VCLAGAAVCVAVSGTVSTIGSAITLMSVSLFLLYLTGPIYWAVIQDVVHKDKVGSVGGAM 366 + + + + A +M V L + P A++ V +++ G + G++ Sbjct: 282 LGMIADGTGYILLAFATRGWMAFPIM-VLLASGGIGMPALQAMLSRQVDEERQGQLQGSL 340 Query: 367 HGLANISGIIGPLVTGFIVQFS-GKYDYAFYLAGAI 401 L +++ I+GPL+ I S ++ ++AGA Sbjct: 341 AALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAA 376 Score = 32.9 bits (75), Expect = 0.002 Identities = 31/121 (25%), Positives = 49/121 (40%), Gaps = 13/121 (10%) Query: 299 LSRRLILGVCLAGAAVCVAVSGTVST-----IGSAITLMSVSLFLLYLTGPIYWAVIQDV 353 RR +L V LAGAAV A+ T IG + ++ + TG + A I D+ Sbjct: 70 FGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA------TGAVAGAYIADI 123 Query: 354 VHKDKVGSVGGAMHGLANISGIIGPLVTGFIVQFSGKYDYAFYLAGAIAIVSSLLVFVFV 413 D+ G M + GP++ G + FS F+ A A+ ++ L + Sbjct: 124 TDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHA--PFFAAAALNGLNFLTGCFLL 181 Query: 414 K 414 Sbjct: 182 P 182
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 151 bits (382), Expect = 1e-43 Identities = 98/369 (26%), Positives = 169/369 (45%), Gaps = 6/369 (1%) Query: 20 RRILPVFLLVGLYAASTAAVMSVLPFYIREMGGSPLII---GIIIATEAFSQFCAAPLIG 76 R ++ + V L A +M VLP +R++ S + GI++A A QF AP++G Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64 Query: 77 HLSDRVGRKRILIVTLAIAAISLLLLANAQCILFILLARTLFGISAGNLSAAAAYIADCT 136 LSDR GR+ +L+V+LA AA+ ++A A + + + R + GI+ + A AYIAD T Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124 Query: 137 HVRNRRQAIGILTGCIGLGGIVGAGVSGWLSRISLSAPIYAAFILVLGSALVAIWGLKDP 196 R + G ++ C G G + G + G + S AP +AA L + L + L + Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 197 STTSRTTDKIAAFSARAILKMPVLRVLIIVMLCHFFAYGMYSSQLPVFLSDTFIWNGLPF 256 R + A + A + ++ ++ FF + Q+P L F + + Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLV-GQVPAALWVIFGEDRFHW 243 Query: 257 GPKALSYLLMADGVINIFVQLFLLGWVSQYFSERKLIILIFALLCTGFLTAGIATTIPVL 316 + L A G+++ Q + G V+ ER+ ++L TG++ AT + Sbjct: 244 DATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWM- 302 Query: 317 VFAIVCISIADALAKPTYLAALSVHVSPARQGIVIGTAQALIAIADFISPVLGGFVLGYA 376 F I+ + + + P A LS V RQG + G+ AL ++ + P+L + + Sbjct: 303 AFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS 362 Query: 377 LYGVWIGIA 385 + W G A Sbjct: 363 I-TTWNGWA 370
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 30.5 bits (69), Expect = 0.005 Identities = 21/147 (14%), Positives = 50/147 (34%), Gaps = 22/147 (14%) Query: 49 NIARS--LFHAISLMAIFIIAWGVGILLFFLVKQKARIHDISFLRLFLAAVLFFIPIVIE 106 +A+S + ++A+ + G+ F + I F A+ + +V Sbjct: 22 QVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSY---VVDN 78 Query: 107 FSLLTESFLWELFFIILLVALC---LSVGMRF--------YSKLMPVICFTQLSWVR--- 152 L + L + L+A+ + G K+ P+ ++ ++ Sbjct: 79 VLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLV 138 Query: 153 ---RHCFTIVMLGFIIYFFIFSFFVGI 176 + +V+L +I+ I V + Sbjct: 139 EFLKSILKVVLLSILIWIIIKGNLVTL 165
>INTIMIN#Intimin signature. Length = 939 Score = 214 bits (547), Expect = 1e-61 Identities = 117/412 (28%), Positives = 186/412 (45%), Gaps = 27/412 (6%) Query: 29 SDNEIQSWIAGTASSISPHLQEGTLE-DYAKGKIKALPGQAANHLVNEGIKSAFPEIIFR 87 +D++ ++ A A+S+ LQ +L DYAK + G A+ S + Sbjct: 158 TDDKALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQAS--------SQLQAWLQH 209 Query: 88 GG---VNLEDGAKYRSSEFDMFIPVQETTSSLLFGQLGFRDHDNSSFDGRTYVNVGMGYR 144 G VNL+ G + S D +P ++ L FGQ+G R D+ R N+G G R Sbjct: 210 YGTAEVNLQSGNNFDGSSLDFLLPFYDSEKMLAFGQVGARYIDS-----RFTANLGAGQR 264 Query: 145 QEVNGWLLGVNTFLDADIRYSHLRGGIGGEVYKDSLAFSGNYYFPLTGWKMSAAHELHDE 204 + +LG N F+D D + R GIGGE ++D S N YF ++GW S + +DE Sbjct: 265 FFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDE 324 Query: 205 RPAYGFDLRTKGTLPDFPWFSGELTYEQYYGDKVDLLGNGTLSRNPRAAGAALVWNPVPL 264 RPA GFD+R G LP +P +L YEQYYGD V L + L NP AA + + P+PL Sbjct: 325 RPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPL 384 Query: 265 LEVRAGYRDAGNGGSQAEGGLRVNYSFGTPLHEQLDYRNV-GAPSNTTNRRAFVDRNYDI 323 + + YR + ++ Y F P +Q++ + V + + +R V RN +I Sbjct: 385 VTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNI 444 Query: 324 VMAYREQAS-KIRITAMPVSGLSGTLVTLMATVDSRYPVEKVEWSGDAELLAGLQLQGSL 382 ++ Y++Q + I ++G + + V S+Y ++++ W A G Q+Q S Sbjct: 445 ILEYKKQDILSLNIPH-DINGTERSTQKIQLIVKSKYGLDRIVWDDSALRSQGGQIQHSG 503 Query: 383 GSG-----LILPQLPLTATDGQEYSLYLTVTDSRGTRVTSERIPVRVTQDET 429 ILP Y + D G + + + V + Sbjct: 504 SQSAQDYQAILP--AYVQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQ 553
>MPTASEINHBTR#Metalloprotease inhibitor signature. Length = 122 Score = 25.7 bits (56), Expect = 0.015 Identities = 6/43 (13%), Positives = 14/43 (32%) Query: 30 AGRGELSQSEQQRLLQLTDDAQRMRERIQALEDILDAEHPNWR 72 AG+ + + + A + + E L + +W Sbjct: 37 AGQLGIEATGSGVCAGPAEQANALAGDVACAEQWLGDKPVSWS 79
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 27.5 bits (61), Expect = 0.044 Identities = 20/105 (19%), Positives = 46/105 (43%), Gaps = 7/105 (6%) Query: 40 LVEVRSNSARALAEKKQLSR-RIEQATAQQIEWQEKAELA-LRKDKDDLARAALIEKQKL 97 + + R + +L K+ +++ + + + +E EL + + + L K++ Sbjct: 232 VEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAV--NELRVYKSQLEQIESEILSAKEEY 289 Query: 98 TDLIATLEQEVTLVDDTLARMKKEIGELENKLSETRARQQALMLR 142 + + E+ D L + IG L +L++ RQQA ++R Sbjct: 290 QLVTQLFKNEIL---DKLRQTTDNIGLLTLELAKNEERQQASVIR 331
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 342 bits (880), Expect = e-118 Identities = 124/345 (35%), Positives = 179/345 (51%), Gaps = 22/345 (6%) Query: 2 AEFKDNLLGEANRFLEVLEQVSRLAPLDKPVLIIGERGTGKELIANRLHYLSSRWQGPLI 61 ++ L+G + E+ ++RL D ++I GE GTGKEL+A LH R GP + Sbjct: 133 SQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFV 192 Query: 62 SLNCAALNENLLDSELFGHEAGAFTGAQKRHPGRFERADGGTLFLDELATAPMLVQEKLL 121 ++N AA+ +L++SELFGHE GAFTGAQ R GRFE+A+GGTLFLDE+ PM Q +LL Sbjct: 193 AINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLL 252 Query: 122 RVIEYGELERVGGSQPLQVNVRLVCATNADLPAMVKEGTFRADLLDRLAFDVVQLPPLRE 181 RV++ GE VGG P++ +VR+V ATN DL + +G FR DL RL ++LPPLR+ Sbjct: 253 RVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRD 312 Query: 182 RQSDIMLMAEHFAIQMCRELRLPLFPGFTDRAKETLLHYAWPGNVRELKNVVERSVYRHG 241 R DI + HF Q +E F A E + + WPGNVREL+N+V R + Sbjct: 313 RAEDIPDLVRHFVQQAEKEGLDVK--RFDQEALELMKAHPWPGNVRELENLVRRLTALYP 370 Query: 242 SSE--------QPLDEIVIDPFQRHPAEPPAPALPAASVT------------PDLPLKLR 281 + EI P ++ A + ++ A Sbjct: 371 QDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYD 430 Query: 282 EFQLQQEKALLQRSLQQAKFNQKRAADLLALTYHQFRALLKKHQL 326 + E L+ +L + NQ +AADLL L + R +++ + Sbjct: 431 RVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.007 Identities = 9/16 (56%), Positives = 14/16 (87%) Query: 38 LVGESGSGKSLIAKAI 53 + GESG+GK L+A+A+ Sbjct: 165 ITGESGTGKELVARAL 180
>SOPEPROTEIN#Salmonella type III secretion SopE effector protein signature. Length = 239 Score = 406 bits (1044), Expect = e-148 Identities = 165/237 (69%), Positives = 194/237 (81%) Query: 2 TNITLSTQHYRIHRSDVEPVKEKTTEKDIFAKSITAVRNSFISLSTSLSDRFSLHQQTDI 61 T ITLS Q++RI + + +KEK+TEK+ AKSI AV+N FI L + LS+RF H+ T+ Sbjct: 1 TKITLSPQNFRIQKQETTLLKEKSTEKNSLAKSILAVKNHFIELRSKLSERFISHKNTES 60 Query: 62 PTTHFHRGNASEGRAVLTSKTVKDFMLQKLNSLDIKGNASKDPAYARQTCEAILSAVYSN 121 THFHRG+ASEGRAVLT+K VKDFMLQ LN +DI+G+ASKDPAYA QT EAILSAVYS Sbjct: 61 SATHFHRGSASEGRAVLTNKVVKDFMLQTLNDIDIRGSASKDPAYASQTREAILSAVYSK 120 Query: 122 NKDQCCKLLISKGVSITPFLKEIGEAAQNAGLPGEIKNGVFTPGGAGANPFVVPLIASAS 181 NKDQCC LLISKG++I PFL+EIGEAA+NAGLPG KN VFTP GAGANPF+ PLI+SA+ Sbjct: 121 NKDQCCNLLISKGINIAPFLQEIGEAAKNAGLPGTTKNDVFTPSGAGANPFITPLISSAN 180 Query: 182 IKYPHMFINHNQQVSFKAYAEKIVMKEVTPLFNKGTMPTPQQFQLTIENIANKYLQN 238 KYP MFIN +QQ SFK YAEKI+M EV PLFN+ MPTPQQFQL +ENIANKY+QN Sbjct: 181 SKYPRMFINQHQQASFKIYAEKIIMTEVAPLFNECAMPTPQQFQLILENIANKYIQN 237
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 112 bits (282), Expect = 3e-36 Identities = 89/103 (86%), Positives = 95/103 (92%) Query: 2 AAIQGIEGVISQLQATAMAARGQDTHSQSTVSFAGQLHAALDRISDRQAAARVQAEKFTL 61 +AIQGIEGVISQLQATAM+AR Q++ Q T+SFAGQLHAALDRISD Q AAR QAEKFTL Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60 Query: 62 GEPGIALNDVMADMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104 GEPG+ALNDVM DMQKASVSMQMGIQVRNKLVAAYQEVMSMQV Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 787 bits (2033), Expect = 0.0 Identities = 559/559 (100%), Positives = 559/559 (100%) Query: 2 SATASTATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQ 61 SATASTATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQ Sbjct: 1 SATASTATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQ 60 Query: 62 DGGAIVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKF 121 DGGAIVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKF Sbjct: 61 DGGAIVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKF 120 Query: 122 GISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLE 181 GISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLE Sbjct: 121 GISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLE 180 Query: 182 PGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDV 241 PGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDV Sbjct: 181 PGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDV 240 Query: 242 ESRIQRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNIS 301 ESRIQRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNIS Sbjct: 241 ESRIQRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNIS 300 Query: 302 EQVGAGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRN 361 EQVGAGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRN Sbjct: 301 EQVGAGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRN 360 Query: 362 ETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMG 421 ETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMG Sbjct: 361 ETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMG 420 Query: 422 FSDKRGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVR 481 FSDKRGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVR Sbjct: 421 FSDKRGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVR 480 Query: 482 PQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSD 541 PQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSD Sbjct: 481 PQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSD 540 Query: 542 NDPRVVALVIRQWMSNDHE 560 NDPRVVALVIRQWMSNDHE Sbjct: 541 NDPRVVALVIRQWMSNDHE 559
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 339 bits (870), Expect = e-118 Identities = 114/329 (34%), Positives = 196/329 (59%), Gaps = 2/329 (0%) Query: 1 MSNLSGTDKSVILLMTIGEDRAAEVFKHLSTREVQALSTAMANVRQISNKQLTDVLSEFE 60 +S L+G K+ ILL++IG + +++VFK+LS E+++L+ +A + I+++ +VL EF+ Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71 Query: 61 QEAEQFAALNINANEYLRSVLVKALGEERASSLLEDILETRDTTSGIETLNFMEPQSAAD 120 + + +Y R +L K+LG ++A ++ + L + + E + +P + + Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILN 130 Query: 121 LIRDEHPQIIATILVHLKRSQAADILALFDERLRHDVMLRIATFGGVQPAALAELTEVLN 180 I+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190 Query: 181 GLLDGQ-NLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENL 239 L + + GGV EIIN+ + E+ +I ++ E D ELA++I +MF+FE++ Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250 Query: 240 VDVDDRSIQRLLQEVDSESLLIALKGAEPPLREKFLRNMSQRAADILRDDLANRGPVRLS 299 V +DDRSIQR+L+E+D + L ALK + P++EK +NMS+RAA +L++D+ GP R Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310 Query: 300 QVENEQKAILLIVRRLAETGEMVIGSGED 328 VE Q+ I+ ++R+L E GE+VI G + Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGE 339
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 368 bits (945), Expect = e-133 Identities = 193/235 (82%), Positives = 209/235 (88%), Gaps = 7/235 (2%) Query: 1 MSNELPWQVWTPDDLAPPPETFVPVEADNVTLTEDTPEPELTAEQQLEQELAQLKIQAHE 60 MS+ LPW+ WTPDDLAPP FVP+ T+ E+ AE LEQ+LAQL++QAHE Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEE-------AEPSLEQQLAQLQMQAHE 53 Query: 61 QGYNAGLAEGRQKGHAQGYQEGLAQGLEQGQAQAQTQQAPIHARMQQLVSEFQNTLDALD 120 QGY AG+AEGRQ+GH QGYQEGLAQGLEQG A+A++QQAPIHARMQQLVSEFQ TLDALD Sbjct: 54 QGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALD 113 Query: 121 SVIASRLMQMALEAARQVIGQTPAVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRV 180 SVIASRLMQMALEAARQVIGQTP VDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRV Sbjct: 114 SVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRV 173 Query: 181 EEMLGATLSLHGWRLRGDPTLHHGGCKVSADEGDLDASVATRWQELCRLAAPGVL 235 ++MLGATLSLHGWRLRGDPTLH GGCKVSADEGDLDASVATRWQELCRLAAPGV+ Sbjct: 174 DDMLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228
>FLGFLIJ#Flagellar FliJ protein signature. Length = 147 Score = 206 bits (526), Expect = 4e-72 Identities = 130/147 (88%), Positives = 138/147 (93%) Query: 1 MAQHGALETLKDLAEKEVDDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRSNLNTDMGNG 60 MA+HGAL TLKDLAEKEV+DAARLLGEMRRGCQQAEEQLKMLIDYQNEYR+NLN+DM G Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 Query: 61 IASNRWINYQQFIQTLEKAIEQHRLQLTQWTQKVDLALKSWREKKQRLQAWQTLQDRQTA 120 I SNRWINYQQFIQTLEKAI QHR QL QWTQKVD+AL SWREKKQRLQAWQTLQ+RQ+ Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120 Query: 121 AALLAENRMDQKKMDEFAQRAAMRKPE 147 AALLAENR+DQKKMDEFAQRAAMRKPE Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147
>FLGHOOKFLIK#Flagellar hook-length control protein signature. Length = 375 Score = 414 bits (1064), Expect = e-146 Identities = 193/409 (47%), Positives = 233/409 (56%), Gaps = 38/409 (9%) Query: 1 MITLPQLITTDTDMTAGLTSGKTTGSAEDFLALLAGALGADGAQGKDARITLADLQAAGG 60 MI L LIT D D T L GK + +A+DFLALL+ AL + K A L Sbjct: 1 MIRLAPLITADVDTTT-LPGGKASDAAQDFLALLSEALAGETTTDKAAPQLL-------- 51 Query: 61 KLSKELLTQHGEPGQAVKLADLLAQKAN---ATDETLTDLTQAQHLLSTLTPSLKTSALA 117 ++ + T GEP + ++D AQ+AN DET + Q + LT + + A Sbjct: 52 -VATDKPTTKGEPLISDIVSD--AQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAA 108 Query: 118 ALSKTAQHDEKTPALSDEDLASLSALFAMLPGQPVATPVAGETPAENHIALPSLLRGDMP 177 K DEK L+++ ASLSALFAMLPG V D P Sbjct: 109 VADKNTTKDEKADDLNEDVTASLSALFAMLPGFDNTPKVT-----------------DAP 151 Query: 178 SAPQEETHTLSFSEHEKGKTEASLARASDDRATGPALTPLVVAAAATSAKVEVDSPPAPV 237 S F++ T L A D A G PL A +K EV S P+PV Sbjct: 152 STVLPTEKPTLFTK----LTSEQLTTAQPDDAPGTPAQPLTPLVAEAQSKAEVISTPSPV 207 Query: 238 THGAAMPTLSSATAQPLPVASAPVLSAPLGSHEWQQTFSQQVMLFTRQGQQSAQLRLHPE 297 T AA P ++ QPLP +APVLSAPLGSHEWQQ+ SQ + LFTRQGQQSA+LRLHP+ Sbjct: 208 T-AAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQSLSQHISLFTRQGQQSAELRLHPQ 266 Query: 298 ELGQVHISLKLDDNQAQLQMVSPHSHVRAALEAALPMLRTQLAESGIQLGQSSISSESFA 357 +LG+V ISLK+DDNQAQ+QMVSPH HVRAALEAALP+LRTQLAESGIQLGQS+IS ESF+ Sbjct: 267 DLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPVLRTQLAESGIQLGQSNISGESFS 326 Query: 358 GQQQ-SSSQQQSSRAQHTDAFGAEDDIALAAPASLQAAARGNGAVDIFA 405 GQQQ +S QQQS R + + EDD L P SLQ GN VDIFA Sbjct: 327 GQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQGRVTGNSGVDIFA 375
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 384 bits (987), Expect = e-136 Identities = 86/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%) Query: 5 ILSQAEIDALLNGDS--DTKDEPTPGIASDSDIRPYDPNTQRRVVRERLQALEIINERFA 62 +LSQ EID LL S D E I+ I YD + +E+++ L +++E FA Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63 Query: 63 RQFRMGLFNLLRRSPDITVGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 122 R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+ Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123 Query: 123 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 182 F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L + Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181 Query: 183 EMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 240 E +F I P+++VV ++G G N C+P+ IEP+ L + +S R Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240 Query: 241 HEDQNWRDNLVRQVQHSELELVANFADIPLRLSQILKLKPGDVLPIEKP---DRIIAHVD 297 + L ++ ++++VA + L + IL L+ GD++ + D + + Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300 Query: 298 GVPVLTSQYGTVNGQYALRVEHLI 321 Q G V + A ++ I Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 211 bits (538), Expect = 4e-74 Identities = 137/137 (100%), Positives = 137/137 (100%) Query: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60 Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 Query: 121 RITDIITPSERMRRLSR 137 RITDIITPSERMRRLSR Sbjct: 121 RITDIITPSERMRRLSR 137
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 27.2 bits (60), Expect = 0.010 Identities = 10/41 (24%), Positives = 15/41 (36%) Query: 20 VFYTFCGYFIWAMARCVWLMSAIQTEPVLGPISTPGSATEK 60 +FY F +A W + PV TP A ++ Sbjct: 18 LFYLLMLLFCQQLAMIFWRIGLPDNAPVSSVQITPAQARQQ 58
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 553 bits (1426), Expect = 0.0 Identities = 273/399 (68%), Positives = 317/399 (79%), Gaps = 17/399 (4%) Query: 1 MNRKVLALLVPALLVAGAANAAEVYNKNGNKLDLYGKVDGLRYFSDNAGDDGDQSYARFG 60 M RKVLAL++PALL AGAA+AAE+YNK+GNKLDLYGKVDGL YFSD++ DGDQ+Y R G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 FKGETQINDMLTGYGQWEYNIKVNTTEGEGANSWTRLGFAGLKFGEYGSFDYGRNYGVIY 120 FKGETQIND LTGYGQWEYN++ NTTEGEGANSWTRL FAGLKFG+YGSFDYGRNYGV+Y Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 121 DIEAWTDALPEFGGDTYTQTDVYMLGRTNGVATYRNTDFFGLVEGLNFALQYQGNNEDPG 180 D+E WTD LPEFGGD+YT D YM GR NGVATYRNTDFFGLV+GLNFALQYQG NE Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 181 AGEGTANGSDADSGTRKLARENGDGFGMSTSYDFDFGLSLGAAYSSSDRTDNQVASGRGD 240 A + ++ ++G + +NGDGFG+ST+YD G S GAAY++SDRT+ QV + Sbjct: 181 ADDVNIGTNNRNNG-DDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNA---- 235 Query: 241 GHHYYGNSYAGGETAEAWTVGVKYDAYNVYLAAMYAETRNMTYYGGGDGG-DGGIANKTQ 299 G + AGG+ A+AWT G+KYDA N+YLA MY+ETRNMT YG D G DGG+ANKTQ Sbjct: 236 -----GGTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQ 290 Query: 300 NFEVVAQYQFDFGLRPSIAYLQSKGKDLGGQDMDSRGNYRYTDKDLVKYVDVGMTYYFNK 359 NFEV AQYQFDFGLRP++++L SKGKDL N DKDLVKY DVG TYYFNK Sbjct: 291 NFEVTAQYQFDFGLRPAVSFLMSKGKDLT------YNNVNGDDKDLVKYADVGATYYFNK 344 Query: 360 NMSTYVDYKINLLDEDDDFYANNGIATDDIVGVGLVYQF 398 N STYVDYKINLLD+DD FY + GI+TDDIV +G+VYQF Sbjct: 345 NFSTYVDYKINLLDDDDPFYKDAGISTDDIVALGMVYQF 383
>PF03627#PapG Length = 336 Score = 36.9 bits (85), Expect = 1e-04 Identities = 22/93 (23%), Positives = 34/93 (36%), Gaps = 8/93 (8%) Query: 327 DDHVLDAVLPPDIP-------IPSIAEVQRALYDATKAVSGMPGEEVKQRLRTGTVVTTD 379 DD + LP D+P IP + +QR A +P K R ++ Sbjct: 152 DDIIFKVALPADLPLGDYSVTIPYTSGMQRHFASYLGARFKIPYNVAKTLPRENEMLFLF 211 Query: 380 DRNWELRYSASALRFNLSRAVAIDMESATIAAQ 412 R SA +L ++I+ + AAQ Sbjct: 212 KNIGGCRPSAQSLEIKHGD-LSINSANNHYAAQ 243
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 26.6 bits (58), Expect = 0.017 Identities = 14/32 (43%), Positives = 21/32 (65%), Gaps = 1/32 (3%) Query: 39 QAIAPQYKPWFQPLYEPASGEIESLLFTLQGS 70 + IAP+YK +FQ L E + +++ LL T Q S Sbjct: 739 KQIAPEYKNYFQYLKERITNQVQ-LLLTHQKS 769
>BONTOXILYSIN#Bontoxilysin signature. Length = 1196 Score = 30.6 bits (69), Expect = 0.011 Identities = 8/39 (20%), Positives = 17/39 (43%) Query: 190 SDFTDALAEKAAKLVFQYLPTAVEKGDCVATRGKMHNAS 228 SDF+ ++ K LV+ +L + + + G + Sbjct: 518 SDFSKVVSSKDKSLVYSFLDNLMSYLETIKNDGPIDTDK 556
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 31.6 bits (71), Expect = 0.005 Identities = 17/84 (20%), Positives = 36/84 (42%), Gaps = 1/84 (1%) Query: 138 STTAEGAQRRLAEYIQQVDEEVAKELEVDLKDNITLQTKTLQESLETQEVVAQEQKDLRI 197 +T R +A+ + + + EV + T +T+T E+ ET V +E+ + Sbjct: 1058 ATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT-TETKETATVEKEEKAKVET 1116 Query: 198 KQIEEALRYADEAKITQPQIQQTQ 221 ++ +E + + Q Q + Q Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQ 1140
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 80.6 bits (199), Expect = 1e-19 Identities = 62/332 (18%), Positives = 126/332 (37%), Gaps = 56/332 (16%) Query: 8 VIVSGASGFIGKHLLEALKKSGISVVAITRDVIKNNSNAL---ANVRWCSWDNIEL---- 60 +V+GA+GFIG H+ + L ++G VV I D + + + A + + + Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGI--DNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 61 -----LVEELSIDSALIGIIHLATEYGHKTSSLINIE------DANVIKPLKLLDLAIKY 109 + +L + + S +E D+N+ L +L+ Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYS----LENPHAYADSNLTGFLNILEGCRHN 116 Query: 110 RADIF----------LNTDSFFAKKDFNYQHMRPYIITKRHFDEIGHYYANMHDISFVNM 159 + LN F+ D + Y TK+ + + H Y++++ + + Sbjct: 117 KIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGL 176 Query: 160 RLEHVYGP-GDGENKFIPYIIDCLNKKQSCVKCTTGEQIRDFIFVDDVVNAYLTILEN-- 216 R VYGP G + + L K V G+ RDF ++DD+ A + + + Sbjct: 177 RFFTVYGPWGRPDMALFKFTKAMLEGKSIDVY-NYGKMKRDFTYIDDIAEAIIRLQDVIP 235 Query: 217 ------RKEVPS-------YTEYQVGTGAGVSLKDFLVYLQNTMMPGSSSIFEFGAIEQR 263 E + Y Y +G + V L D++ L++ + + + + Sbjct: 236 HADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM----LPLQ 291 Query: 264 DNEIMFSVANNKNL-KAMGWKPNFDYKKGIEE 294 +++ + A+ K L + +G+ P K G++ Sbjct: 292 PGDVLETSADTKALYEVIGFTPETTVKDGVKN 323
>PERTACTIN#Pertactin signature. Length = 922 Score = 30.8 bits (69), Expect = 0.011 Identities = 23/82 (28%), Positives = 34/82 (41%), Gaps = 9/82 (10%) Query: 209 GDIGTVSFYPAHHITMGEGGAVFTKSGELKKIIESFRDWGRDCYCAPGCDNTCGKRFGQQ 268 G +G S + E A+ + GEL+ ++ WGR DN G+RF Q+ Sbjct: 629 GGVGLAS-----TLWYAESNALSKRLGELRLNPDAGGAWGRGFAQRQQLDNRAGRRFDQK 683 Query: 269 LGSLPQGYDHKYTYS----HLG 286 + G DH + HLG Sbjct: 684 VAGFELGADHAVAVAGGRWHLG 705
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 70.6 bits (173), Expect = 1e-15 Identities = 62/352 (17%), Positives = 122/352 (34%), Gaps = 48/352 (13%) Query: 11 RVFVTGHTGFKGSWLSLWLTEMGAIVKGYALDAPTVPSLFEIVRLNDLMES----HIGDI 66 + VTG GF G +S L E G V G + RL L + H D+ Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 67 RDFEKLRNSIAEFKPEIVFHMAAQPLVRLSYEQPIETYSTNVMGTVHLLETVKQVGNIKA 126 D E + + A E VF + VR S E P +N+ G +++LE + I+ Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN-KIQH 120 Query: 127 VVNITSDKCYDNREWVWGYRENEPMGGYD-------PYSNSKGCAELVASAFRNSFFNPA 179 ++ +S V+G P D Y+ +K EL+A + + + Sbjct: 121 LLYASSSS-------VYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY---- 169 Query: 180 NYEQHGVGLASVRAGNVIGGGDWAK-DRLIPDILRSFENNQQVIIRNPYSI-RPWQHVLE 237 G+ +R V G W + D + ++ + + + N + R + ++ + Sbjct: 170 -----GLPATGLRFFTVY--GPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDD 222 Query: 238 PLSGYIVVAQRLYTEGAKFSEG-------------WNFGPRDEDAKTVEFIVDKMVTLWG 284 I + + +++ +N G + + + + G Sbjct: 223 IAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIG--NSSPVELMDYIQALEDALG 280 Query: 285 DDASWLLDGENHPHEAHYLKLDCSKANMQLGWHPRWGLTETLGRIVKWHKAW 336 +A + P + D +G+ P + + + V W++ + Sbjct: 281 IEAKKNML-PLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 41.7 bits (98), Expect = 2e-06 Identities = 25/162 (15%), Positives = 58/162 (35%), Gaps = 27/162 (16%) Query: 1 MNILLFGKTGQVGWELQRSLAPVGN-LIALDV-----------HSKEFC---------GD 39 M L+ G G +G+ + + L G+ ++ +D E D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 40 FSNPKGVAETVRKLRPDVIVNAAAHTAVDKAESEPEL---AQLLNATSVEAIAKAANETG 96 ++ +G+ + + + + AV + P + L ++ + Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 97 AWVVHYSTDYVFPGTGDIPWQETDATS-PLNVYGKTKLAGEK 137 +++ S+ V+ +P+ D+ P+++Y TK A E Sbjct: 121 --LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANEL 160
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 176 bits (447), Expect = 2e-54 Identities = 83/359 (23%), Positives = 142/359 (39%), Gaps = 50/359 (13%) Query: 1 MKILITGGAGFIGSAVVRHIIKNTQDTVVNIDKLT--YAGNLE--SLSDISESNRYNFEH 56 MK L+TG AGFIG V + +++ VV ID L Y +L+ L +++ + F Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPG-FQFHK 58 Query: 57 ADICDSAEITRIFEQYQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYALLEVARKYWS 116 D+ D +T +F + V V S+ P A+ ++N+ G +LE R Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN-- 116 Query: 117 ALGEDKKNNFRFHHISTDEVYGDLPHPDEVENSVTLPLFTETTAYAPSSPYSASKASSDH 176 + S+ VYG +P T+ + P S Y+A+K +++ Sbjct: 117 -------KIQHLLYASSSSVYGLNRK---------MPFSTDDSVDHPVSLYAATKKANEL 160 Query: 177 LVRAWRRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWLYV 236 + + YGLP YGP+ P+ + LEGK + +Y G RD+ Y+ Sbjct: 161 MAHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYI 220 Query: 237 EDHA----RALHMVVTEGKA--------------GETYNIGGHNEKKNLDVVFTICDLLD 278 +D A R ++ YNIG + + +D + + D L Sbjct: 221 DDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG 280 Query: 279 EIVPKATSYREQITYVADRPGHDRRYAIDAGKISRELGWKPLETFESGIRKTVEWYLAN 337 +A + + +PG + D + +G+ P T + G++ V WY Sbjct: 281 ---IEA-----KKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 87.9 bits (218), Expect = 7e-22 Identities = 64/344 (18%), Positives = 128/344 (37%), Gaps = 47/344 (13%) Query: 5 RIFVAGHRGMVGSAIVRQLAQRG-------------DVEL------VLRTRD----ELDL 41 + V G G +G + ++L + G DV L +L ++DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 42 LDGRAVQAFFAGAGIDQVYLAAAKVGGIVANNTYPADFIYENMMIESNIIHAAHLHNVNK 101 D + FA ++V+++ + + + P + N+ NI+ + + Sbjct: 62 ADREGMTDLFASGHFERVFISPHR-LAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 102 LLFLGSSCIYPKLARQPMAESELLQGTLEPTNEPYAIAKIAGIKLCESYNRQYGRDYRSV 161 LL+ SS +Y + P + P + YA K A + +Y+ YG + Sbjct: 121 LLYASSSSVYGLNRKMPFSTD---DSVDHPVS-LYAATKKANELMAHTYSHLYGLPATGL 176 Query: 162 MPTNLYGPHDNFHPDNSHVIPALLRRFHEAAQSHAPEVVVWGSGTPMREFLHVDDMAAAS 221 +YGP PD AL + + + +V + G R+F ++DD+A A Sbjct: 177 RFFTVYGPWGR--PDM-----ALFKFTKAMLEGKSIDV--YNYGKMKRDFTYIDDIAEAI 227 Query: 222 IHVMELA----REVWQENTAPMLSH-----INVGTGVDCTIRELAQTIAKVVGYQGRVVF 272 I + ++ + E P S N+G + + Q + +G + + Sbjct: 228 IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM 287 Query: 273 DAAKPDGTPRKLLDVTRLHQ-LGWYHEISLEAGLAGTYQWFLEN 315 +P D L++ +G+ E +++ G+ W+ + Sbjct: 288 LPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 107 bits (268), Expect = 2e-28 Identities = 81/361 (22%), Positives = 127/361 (35%), Gaps = 58/361 (16%) Query: 6 LITGVTGQDGSYLAEFLLEKGYEVHGIKRRASSFNTERVDHIYQDPH--------SCNPK 57 L+TG G G ++++ LLE G++V GI + N Y D P Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGI----DNLND------YYDVSLKQARLELLAQPG 53 Query: 58 FHLHYGDLTDASNLTRILQEVQPDEVYNLGAMSHVAVSFESPEYTADVDAMGTLRLLEAI 117 F H DL D +T + + V+ V S E+P AD + G L +LE Sbjct: 54 FQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGC 113 Query: 118 RFLGLEKKTRFYQASTSELYGLVQEIPQKETTPF-YPRSPYAVAKLYAYWITVNYRESYG 176 R ++ AS+S +YGL +++P +P S YA K + Y YG Sbjct: 114 RHNKIQ---HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170 Query: 177 IYACNGILFNHESPRRGETFVTRKITRAIANIAQGLESCLYLGNMDSLRDWGHAKDYVRM 236 + A F P K T+A+ G +Y RD+ + D Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLE---GKSIDVY-NYGKMKRDFTYIDD---- 222 Query: 237 QWMMLQQEQPEDFVIATGVQYSVRQFVELAAAQLGIKLRFEGEGINEKGIVVSVTGHDAP 296 IA + +R + A + + V G+ +P Sbjct: 223 --------------IAEAI---IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSP 265 Query: 297 GVKPGDVIVAV--------DPRY--FRPAEVETLLGDPSKAHEKLGWKPEITLSEMVSEM 346 V+ D I A+ +P +V D +E +G+ PE T+ + V Sbjct: 266 -VELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNF 324 Query: 347 V 347 V Sbjct: 325 V 325
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 50.5 bits (121), Expect = 5e-09 Identities = 32/129 (24%), Positives = 56/129 (43%), Gaps = 20/129 (15%) Query: 132 AMMVHIRHTAHSQ-LPEAITQAVIGRPINFQGLGGDDANRQAQGILERAAKRAGFQEVVF 190 M+ H HS + ++ P+ R+A + +A+ AG +EV Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGA-----TQVERRA---IRESAQGAGAREVFL 140 Query: 191 QYEPVAAGLDYEATLREEKRVLVVDIGGGTTDCSMLLMGPQWRQRADRENSLLGHSGCRV 250 EP+AA + + E +VVDIGGGTT+ +++ + ++ S R+ Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189 Query: 251 GGNDLDIAL 259 GG+ D A+ Sbjct: 190 GGDRFDEAI 198 Score = 35.1 bits (81), Expect = 5e-04 Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 12/81 (14%) Query: 377 ALDQPLTRILEQVQLALDSAQEKPDV--------IYLTGGSARSPLIKKALSEQLPGIPV 428 AL +PLT I+ V +AL+ Q P++ + LTGG A + + L E+ GIPV Sbjct: 259 ALQEPLTGIVSAVMVALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIPV 315 Query: 429 AGGDD-FGSVTAGLARWAEVV 448 +D V G + E++ Sbjct: 316 VVAEDPLTCVARGGGKALEMI 336
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 41.7 bits (98), Expect = 4e-06 Identities = 49/369 (13%), Positives = 105/369 (28%), Gaps = 88/369 (23%) Query: 4 SNTFRWAIAIGVVVAAAAF-WFWHSRSESPTAAPGV--------AAQAPHTASAGRRGMR 54 S R + AF + E A G + + ++ Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113 Query: 55 DG-------PLA---PVQAATATTQAVPRYLSGLGTVTAANTVTVRSRVDG--QLIALHF 102 +G L + A T + L T ++ ++ +L Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173 Query: 103 QEGQQVNAGDLLAQI------------DPSQFKVALAQAQGQLAKDNATLANARR----- 145 Q V+ ++L Q ++ L + + + A + Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 146 --DLARYQQLAKTNLVSRQELDAQQALVNETQGTIKADEANVA----------------- 186 L + L +++ + Q+ E ++ ++ + Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 187 --------------------------SAQLQLDWSRITAPVSGRV-GLKQVDVGNQISSS 219 + + S I APVS +V LK G ++++ Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 220 DTAGIVVITQTHPIDLIFTLPESDIATVVQAQKAGKALVVEAWDRTNSHKL-SEGVLLSL 278 +T +V++ + +++ + DI + Q A + VEA+ T L + ++L Sbjct: 354 ETL-MVIVPEDDTLEVTALVQNKDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINL 410 Query: 279 DNQIDPTTG 287 D D G Sbjct: 411 DAIEDQRLG 419
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 888 bits (2296), Expect = 0.0 Identities = 291/1036 (28%), Positives = 505/1036 (48%), Gaps = 29/1036 (2%) Query: 13 SRLFILRPVATTLLMAAILLAGIIGYRFLPVAALPEVDYPTIQVVTLYPGASPDVMTSAV 72 + FI RP+ +L +++AG + LPVA P + P + V YPGA + V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVVTLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131 T +E+ + L MSS S S G+ +TL FQ D+A+ +VQ + AT LLP + Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 132 LPNPPIYSKVNPADPPIMTLAVTSNSMPMTQVE--DMVETRVAQKISQVSGVGLVTLAGG 189 + I S + +M S++ TQ + D V + V +S+++GVG V L G Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 190 QRPAVRVKLNAQAVAALGLTSETVRTAITGANVNSAKGSLDGP------ERAVTLSANDQ 243 Q A+R+ L+A + LT V + N A G L G + ++ A + Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 244 MQSADEYRRLII-AYQNGAPVRLGDVATVEQGAENSWLGAWANQAPAIVMNVQRQPGANI 302 ++ +E+ ++ + +G+ VRL DVA VE G EN + A N PA + ++ GAN Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 303 IATADSIRQMLPQLTESLPKSVKVTVLSDRTTNIRASVRDTQFELMLAIALVVMIIYLFL 362 + TA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 363 RNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422 +N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAVTLAVAIL 481 + E P A K +I ++ + L AV IP+ F G G ++R+F++T+ A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 482 ISAVVSLTLTPMMCARML---SQQSLRKQNRFSRACERMFDRVIASYGRGLAKVLNHPWL 538 +S +V+L LTP +CA +L S + + F FD + Y + K+L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 539 TLSVAFATLLLSVMLWIVIPKGFFPVQDNGIIQGTLQAPQSSSYASMAQRQRQVAERILQ 598 L + + V+L++ +P F P +D G+ +Q P ++ + QV + L+ Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 599 DPA--VQSLTTFVGVDGANPTLNSARLQINLKPLDARDDR---VQQVISRLQTAVATIPG 653 + V+S+ T G + N+ ++LKP + R+ + VI R + + I Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR- 658 Query: 654 VELYLQPTQDLTIDTQVSRTQYQFTLQ---ATTLDALSHWVPKL-QNALQSLPQLSEVSS 709 + ++ P I + T + F L DAL+ +L A Q L V Sbjct: 659 -DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 710 DWQDRGLAAWVNVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTA 769 + + + VD++ A LG+S++D++ + A G ++ + ++ ++ + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 770 SMPGLAALETIRLTSRDGGTVPLSAIARIEQRFAPLSINHLDQFPVTTFSFNVPESYSLG 829 ++ + + S +G VP SA + + + P S G Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 830 DAVQAILDTEKTLALPADITTQFQGSTLAFQAALGSTVWLIVAAVVAMYIVLGVLYESFI 889 DA+ + + LPA I + G + + + L+ + V +++ L LYES+ Sbjct: 838 DAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 890 HPITILSTLPTAGVGALLALIIAGSELDIIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949 P++++ +P VG LLA + + D+ ++G++ IG+ KNAI++++FA ++ Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 950 GMSPRDAIFQACLLRFRPILMTTLAALLGALPLMLSTGVGTELRRPLGIAMVGGLLVSQV 1009 G +A A +R RPILMT+LA +LG LPL +S G G+ + +GI ++GG++ + + Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015 Query: 1010 LTLFTTPVIYLLFDRL 1025 L +F PV +++ R Sbjct: 1016 LAIFFVPVFFVVIRRC 1031
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 880 bits (2276), Expect = 0.0 Identities = 283/1035 (27%), Positives = 504/1035 (48%), Gaps = 36/1035 (3%) Query: 6 LFIYRPVATILIAAAITLCGILGFRLLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65 FI RP+ ++A + + G L LPVA P + P + VSA+ PGA +T+ +V Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLERSLGRIAGVNEMTSSS-SLGSTRIILEFNFDRDINGAARDVQAAINAAQSLLPGGMP 124 +E+++ I + M+S+S S GS I L F D + A VQ + A LLP + Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 125 SRPTYRKANPSDAPIMILTLTSES--WSQGKLYDFASTQLAQTIAQIDGVGDVDVGGSSL 182 + S + +M+ S++ +Q + D+ ++ + T+++++GVGDV + G+ Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182 Query: 183 PAVRVGLNPQALFNQGVSLDEVREAIDSANVRRPQGAIEDSV------HRWQIQTNDELK 236 A+R+ L+ L ++ +V + N + G + + I K Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 237 TAAEYQPLIIHYN-NGAAVRLGDVASVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295 E+ + + N +G+ VRL DVA V ++ N KPA L I+ AN + Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 296 TVDGIRAKLPELRAMIPAAIDLQIAQDRSPTIRASLQEVEETLAISVALVILVVFLFLRS 355 T I+AKL EL+ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++ Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 356 GRATLIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414 RATLIP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R + Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 415 EAGMKPLQAALQGTREVGFTVISMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474 E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 475 LVVSLTLTPMMCGWMLKSSKPRTQPRKRGVG----RLLVALQQGYGTSLKWVLNHTRLVG 530 ++V+L LTP +C +LK K G Y S+ +L T Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 531 VVFLGTVALNIWLYIAIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586 +++ VA + L++ +P +F PE+D GV + IQ + + + ++K Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 587 RD-DPAVNNVTGFT-GGSRVNSGMMFITLKPRGER---KETAQQIIDRLRVKLAKEPGAR 641 + +V V GF+ G N+GM F++LKP ER + +A+ +I R +++L K Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 642 LFLMAVQDIRVGGRQANASYQYTLLSDSLAALREWEPKIRKALSAL-----PQLADVNSD 696 + + I G ++ L D + + R L + L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 697 QQDNGAEMNLIYDRDTMSRLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRY 756 ++ A+ L D++ LG+ + N ++ A G ++ K+ ++ D ++ Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 757 SQDISALEKMFVINRDGKAIPLSYFAQWRPANAPLSVNHQGLSAASTIAFNLPTGTSLSQ 816 ++K++V + +G+ +P S F + + I GTS Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838 Query: 817 ATEAINRTMTQLGVPSTVRGSFSGTAQVFQQTMNSQLILIVAAIATVYIVLGILYESYVH 876 A + ++L P+ + ++G + + + N L+ + V++ L LYES+ Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 877 PLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRSGG 936 P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V+FA + G Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 937 LTPEQAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996 +A A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 997 TLYTTPVVYLFFDRL 1011 ++ PV ++ R Sbjct: 1017 AIFFVPVFFVVIRRC 1031
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 123 bits (311), Expect = 5e-33 Identities = 95/450 (21%), Positives = 198/450 (44%), Gaps = 25/450 (5%) Query: 20 FMQSLDTTIVNTALPSMAKSLGESPLHMHMVVVSYVLTVAVMLPASGWLADKIGVRNIFF 79 F L+ ++N +LP +A + P + V +++LT ++ G L+D++G++ + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 80 AAIVLFTLGSLFCALSGTLNQ-LVLARVLQGVGGAMMVPVGRLTVMKIVPRAQYMAAMTF 138 I++ GS+ + + L++AR +QG G A + + V + +P+ A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 139 VTLPGQIGPLLGPALGGVLVEYASWHWIFLINIP-VGIVGAMATFMLMPNYIIETRRFDL 197 + +G +GPA+GG++ Y HW +L+ IP + I+ L+ + FD+ Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201 Query: 198 PGFLLLAIGMAVLTLALDGSKSMGISPWTLAGLAAGGAAAILLYLFHAKKNSGALFSLRL 257 G +L+++G+ L + L + L+++ H +K + L Sbjct: 202 KGIILMSVGIVFFMLFTTSYSISFLIVSVL---------SFLIFVKHIRKVTDPFVDPGL 252 Query: 258 FRTPTFSLGLLGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316 + F +G+L M P ++ S G +++ P + + I Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312 Query: 317 VVQIVNRFGYRRVLVATTLGLALVSLLFMSVALL----GWYYLLPLVLLLQGMVNSARFS 372 +V+R G VL +G+ +S+ F++ + L W+ + +V +L G+ S + Sbjct: 313 GGILVDRRGPLYVL---NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKT 367 Query: 373 SMNTLTLKDLPDTLASSGNSLLSMIMQLSMSIGVTIAGMLL--GMFGQQHIGIDSSATHH 430 ++T+ L A +G SLL+ LS G+ I G LL + Q+ + ++ + + Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTY 427 Query: 431 VFMYTWLCMAVIIALPAIIFARVPNDTQQN 460 ++ L + II + ++ V +Q++ Sbjct: 428 LYSNLLLLFSGIIVISWLVTLNVYKHSQRD 457
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 31.0 bits (70), Expect = 0.010 Identities = 20/66 (30%), Positives = 26/66 (39%), Gaps = 14/66 (21%) Query: 187 RGLLAPVKRLVEGTHRLAAGDFTTRVTPTSADEL-----------GKLAQDFNQLASTLE 235 L+A V+ V H LA + P S + L G L N+LA E Sbjct: 104 SQLMAAVRSKVMEGHSLAD---AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTE 160 Query: 236 KNQQMR 241 + QQMR Sbjct: 161 QRQQMR 166
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.3 bits (185), Expect = 8e-18 Identities = 27/140 (19%), Positives = 65/140 (46%), Gaps = 2/140 (1%) Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLINHGDKVLPYVRQTPPDLILLDLMLPGTDGL 70 IL+ +D+ + +L L A Y + ++ + ++ DL++ D+++P + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 71 TLCREIR-RFSDIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTIL-RRC 128 L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 129 KPQRELQQQDAESPLMIDES 148 + +L+ + ++ S Sbjct: 124 RRPSKLEDDSQDGMPLVGRS 143
>PF05932#Tir chaperone protein (CesT) Length = 127 Score = 87.2 bits (216), Expect = 3e-25 Identities = 30/118 (25%), Positives = 45/118 (38%), Gaps = 3/118 (2%) Query: 6 DRLLRQFSLKLNTDSIVFDENRLCSFIIDNRYRI-LLTSTNSEYIMIYGFCGKPPDNNNL 64 LL FS L +VFD++ C+ IIDN + + L E +++ G D Sbjct: 7 KTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLEPHKDI--P 64 Query: 65 AFEFLNANLWFAENNGPHLCYDNNSQSLLLALNFSLNESSVEKLECEIEVVIRSMENL 122 L L N GP L D S + + SV L+ E+ ++ M Sbjct: 65 QQCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMRGW 122
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 25.8 bits (56), Expect = 0.044 Identities = 12/46 (26%), Positives = 19/46 (41%) Query: 60 SGAVASVSSGAAYTTALTVLGASFGMGGIGMMGICAGLYLSANGIR 105 SG++ +V S A ++ + M C GL L+ IR Sbjct: 136 SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 37.1 bits (86), Expect = 1e-04 Identities = 33/153 (21%), Positives = 52/153 (33%), Gaps = 20/153 (13%) Query: 253 FSEIFFMLALPFFTKRFGIKKVLLLGLITAAIRYGFFVYGGAETYFTYALLFLGILLHGV 312 + L + RFG + VLL+ L AA+ Y +L++G ++ G+ Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-----FLWVLYIGRIVAGI 108 Query: 313 SYDFYYVTAYIYVDKKAPVHMRTAAQGLITLCCQGFGSLLGYRLGGVMMEKMFAYPQPVN 372 + V D R G ++ C GFG + G LGG+M P Sbjct: 109 TGATGAVAGAYIADI-TDGDERARHFGFMS-ACFGFGMVAGPVLGGLMGGFSPHAP---- 162 Query: 373 GLTFNWAGMWTFGAVMIAVIALLFMIFFRESDK 405 + A + + L ES K Sbjct: 163 ---------FFAAAALNGLNFLTGCFLLPESHK 186 Score = 33.3 bits (76), Expect = 0.002 Identities = 55/286 (19%), Positives = 93/286 (32%), Gaps = 17/286 (5%) Query: 29 LNKSGFSAGEIGWSYACTAIAAILSPILVGSVTDRFFSAQKVLAVLMFAGAVLMYFAAQQ 88 L S G A A+ ++G+++DRF ++ + ++ AGA + Y Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAI--- 89 Query: 89 TTFAGFFPLLLAYSLTYMPTIALTNSIAFANVPDVERDFPRIRVMGTIG-WIASGLACGF 147 A F +L + T A T ++A A + D+ R R G + G+ G Sbjct: 90 MATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG- 147 Query: 148 LPQMLGY-NDISPTNIPLLITAASSALLGVFAFCLPDTPPKSTGKMDIKVMLGLDALVLL 206 P + G SP + P AA + L + L K + + L A Sbjct: 148 -PVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRW 205 Query: 207 RDKN------FLVFFFCSFLFAMPLAFYYIFANGYLTEVGMKNATGWMTLGQFSEIFFML 260 VFF + +P A + IF G + + Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM 265 Query: 261 ALPFFTKRFGIKKVLLLGLITAAIRYGFFVYGGAETYFTYALLFLG 306 R G ++ L+LG+I Y + ++ L Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLA 311
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 26.8 bits (59), Expect = 0.019 Identities = 10/43 (23%), Positives = 17/43 (39%), Gaps = 7/43 (16%) Query: 3 SKLLPCALLLATSFAWAAPA-------TTGIDQYELKSFIADF 38 ++L LLL +S++WA L+ + DF Sbjct: 10 KRVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDF 52
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 680 bits (1756), Expect = 0.0 Identities = 247/839 (29%), Positives = 389/839 (46%), Gaps = 26/839 (3%) Query: 2 LRMTPIASLVLLTLFTWQTQAIATETFDTHFMVGGMRDQKITNFHLDENKPIPGQYELDI 61 L + V + A F+ F+ + + + + PG Y +DI Sbjct: 23 LAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDI 82 Query: 62 YVNNQWRGKYDIIVADDPGST----CISTELLKNIGVISDGLQPQ---GATDCIALKDVV 114 Y+NN + D+ C++ L ++G+ + + C+ L ++ Sbjct: 83 YLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMI 142 Query: 115 RSGGYTFNIGVFRLDLSVPQAYVNEVEAGYVLPENWDRGINAFYTSYYASQYYSDYKNSG 174 ++G RL+L++PQA+++ GY+ PE WD GINA +Y S + G Sbjct: 143 HDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGG 202 Query: 175 SSESTYVRFNSGFNLLGWQAHADTTFNKTD-----GSSGEWKSNTLYLERGIAELLGTLR 229 +S Y+ SG N+ W+ +TT++ GS +W+ +LER I L L Sbjct: 203 NSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLT 262 Query: 230 AGDQYTSSEIFDSVRFTGVRLFRDMQMLPNSKQNFTPLVQGIAQTNALVTIEQNGFVVYQ 289 GD YT +IFD + F G +L D MLP+S++ F P++ GIA+ A VTI+QNG+ +Y Sbjct: 263 LGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYN 322 Query: 290 KEVPPGPFSIADLQLAGGGADLDVTVREADGSINTWLVPYASVPNMLQPGVSKYDFSAGR 349 VPPGPF+I D+ AG DL VT++EADGS + VPY+SVP + + G ++Y +AG Sbjct: 323 STVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGE 382 Query: 350 SHIEGADNQAD-FTQISYQYGLNNLLTLYGGTMLSNHYNAFTLGTGWNT-RIGAISLDAT 407 A + F Q + +GL T+YGGT L++ Y AF G G N +GA+S+D T Sbjct: 383 YRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMT 442 Query: 408 RAHSKQDNGDVFDGQSYQIAYNKYLTQTLTRFGLAAYRYSSQDYRTFNDHVWANNKNNYR 467 +A+S + DGQS + YNK L ++ T L YRYS+ Y F D ++ Sbjct: 443 QANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNI 502 Query: 468 RDKNDVYDI----ADYYQNDFGRKNTFSANVSQSLPEGWGAVSLSALWRDYWGRSGTSKD 523 ++ V + DYY + ++ V+Q L + LS + YWG S + Sbjct: 503 ETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSNVDEQ 561 Query: 524 YQISYSNTFQKINYTLSASQTYDE-DHNEDKRFNLFISIPFD--WGDGITTPRRHLNVSN 580 +Q + F+ IN+TLS S T + D+ L ++IPF + RH + S Sbjct: 562 FQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASY 621 Query: 581 STTFDDDGFTSNNIGLTGTAGSRDQFNYGVSVSH---QRHDSETTAGTNLTWNTPVATLN 637 S + D +G +N G+ GT + +Y V + +S +T L + N Sbjct: 622 SMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNAN 681 Query: 638 GSYSQSSNYTQTGGSISGGVVAWSGGLNLSSRLSDTFAIMQAPGLEGAYVNGQKYRTTNK 697 YS S + Q +SGGV+A + G+ L L+DT +++APG + A V Q T+ Sbjct: 682 IGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDW 741 Query: 698 KGTVVYDNLTPYRENHLMLDVSQSSSETELRGNRKVAAPYRGAVVLVNFDTDQRKPWFIK 757 +G V T YREN + LD + + +L P RGA+V F + Sbjct: 742 RGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMT 801 Query: 758 AQRPDGSPLIFGYDVVDHHGHNVGIVGQGSQLFIRTNDIPPEVSVPVDKEQGLSCSITF 816 + PL FG V + GIV Q+++ + +V V +E+ C + Sbjct: 802 L-THNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANY 859
>PF06291#Lambda prophage Bor protein Length = 102 Score = 27.7 bits (61), Expect = 0.012 Identities = 12/32 (37%), Positives = 19/32 (59%) Query: 7 MALPLFALSLSVSITGCDQKNDTLQGKQNNMT 38 M LF+ +L++ ITGC Q+ T+ K +T Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVT 37
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.6 bits (186), Expect = 6e-18 Identities = 49/215 (22%), Positives = 87/215 (40%), Gaps = 19/215 (8%) Query: 2 IKVLIVDDEPLARENLRILLQGQDDIEIVGECANAVEAIGAVHKLRPDVLFLDIQMPRIS 61 +L+ DD+ R L L V +NA + D++ D+ MP + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLEMVGMLDPEHRPYI--VFLTAFD--EYAIKAFEEHAFDYLLKPIEEKRLEKTLHRLRQ 117 +++ + + RP + + ++A + AIKA E+ A+DYL KP + L + R Sbjct: 62 AFDLLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 118 ERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMDDVAFVSSRMSGVYVT--SSEGKEGFT 175 E ++ L ++Q + + G S +A + + +T S GK Sbjct: 121 EPKRRPSKLEDDSQDGMPLV---GRSAAMQEIYRVLARLMQTDLTLMITGESGTGK---- 173 Query: 176 ELTLRTLESRTPLLRCHRQFL-VNMAHLQEIRLED 209 EL R L R + F+ +NMA + +E Sbjct: 174 ELVARALHDYGK--RRNGPFVAINMAAIPRDLIES 206
>PF06580#Sensor histidine kinase Length = 349 Score = 219 bits (559), Expect = 2e-68 Identities = 60/216 (27%), Positives = 116/216 (53%), Gaps = 3/216 (1%) Query: 343 LGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQA 402 L G + + + ++ ++++ L AQ+NPHF+FNALN I+A+I D +A Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKA 193 Query: 403 SQLVQYLSTFFRKNLKR-PSEIVTLADEIEHVNAYLQIEKARFQSRLQVQLDVPSTLSRQ 461 +++ LS R +L+ + V+LADE+ V++YLQ+ +F+ RLQ + + + Sbjct: 194 REMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDV 253 Query: 462 KLPAFTLQPIVENAIKHGTSQLLDTGNVAIRARREGQHLMLDIEDNAGLYQPSAG-SSGL 520 ++P +Q +VEN IKHG +QL G + ++ ++ + L++E+ L + S+G Sbjct: 254 QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313 Query: 521 GMSLVDKRLREHFGDDYGISVACEPDCFTRITLRLP 556 G+ V +RL+ +G + I ++ + + +P Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 37.5 bits (87), Expect = 5e-05 Identities = 38/180 (21%), Positives = 68/180 (37%), Gaps = 6/180 (3%) Query: 11 LALMLAVPFAPQAVAKTAATTAASQPEIASGSAMI-VDLNTNKVIYSNHPDLVRPIASIT 69 ++L+ +P A A + S+ +++ MI +DL + + + + D P+ S Sbjct: 9 ISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTF 68 Query: 70 KLMTAMVVLDARLPLDEILKVDISQTPEMKGVYSRV---RLNSEISRKNMLLLALMSSEN 126 K++ VL DE L+ I + YS V L ++ + A+ S+N Sbjct: 69 KVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDN 128 Query: 127 RAAASLAHYY--PGGYNAFIKAMNAKAKALGMTHTRFVEPTGLSIHNVSTARDLTKLLIA 184 AA L P G AF++ + L T E + +T + L Sbjct: 129 SAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPASMAATLRK 188
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 27.5 bits (61), Expect = 0.034 Identities = 8/39 (20%), Positives = 16/39 (41%), Gaps = 1/39 (2%) Query: 161 WLHDLDQHLRH-GVWLILAIVLVVGVRWWLKRRGKAEAR 198 L + +R G W++LA++ + R+ K Sbjct: 215 VLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQEKRRVS 253
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 112 bits (282), Expect = 3e-32 Identities = 68/253 (26%), Positives = 116/253 (45%), Gaps = 12/253 (4%) Query: 3 KVAIVTASDSGIGKACALLLAQNGFDIGITWHSDERGAQETAKKAAQFGVRAETIHLDLS 62 K+A +T + GIG+A A LA G I ++ E+ + + A+ AE D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE-ARHAEAFPADVR 67 Query: 63 QLPEGAQAIEHLIQRLGRVDVLVNNAGAMTKSAFIDMPFTQWRQIFTVDVDGAFLCAQIA 122 + + + +G +D+LVN AG + + +W F+V+ G F ++ Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 123 ARHMIKQGEGGRIINITSVHEHTPLPQASAYTAAKHALGGLTKSMALELIEHHILVNAVA 182 +++M+ + G I+ + S P +AY ++K A TK + LEL E++I N V+ Sbjct: 128 SKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 183 PGAIATPM-------NDMDDSDIEPGSEP---SIPIARPGSTHEIASLVAWLCSEGASYT 232 PG+ T M + + I+ E IP+ + +IA V +L S A + Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 233 TGQSLIVDGGFML 245 T +L VDGG L Sbjct: 247 TMHNLCVDGGATL 259
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 50.6 bits (121), Expect = 7e-09 Identities = 65/402 (16%), Positives = 142/402 (35%), Gaps = 19/402 (4%) Query: 22 RVIICCFLVVMLDGFDTAAIGFIAPDIRTHWQLSASELAPLFGAGLLGLTAGALLCGPLA 81 +++I ++ + + PDI + + + A +L + G + G L+ Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73 Query: 82 DRFGRKRVIELCVALFGALSLLSAFS-PDIETLVLLRFLTGLGLGGAMPNTIT-MTSEYL 139 D+ G KR++ + + S++ L++ RF+ G G A P + + + Y+ Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG-AAAFPALVMVVVARYI 132 Query: 140 PARRRGALVTLMFCGFTLGSAMGGIVSAQLVPLIGWHGILALGGILPLMLFFGLLFALPE 199 P RG L+ +G +G + + I W +L + I + + F L+ L + Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPF-LMKLLKK 191 Query: 200 SPRWQVRRQLPQAV---------VARTVSAITGERYHDTQFFLHETAAVAKGSI----RQ 246 R + + + + T S FL + K + Sbjct: 192 EVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPG 251 Query: 247 LFAGRQLVITLMLWVVFFMSLLIIYLLSSWMPTLLNHRGIDLQQASWVTAAFQVGGTLGA 306 L +I ++ + F ++ + +M ++ + + G Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFG- 310 Query: 307 LLLGVLMDRLNPFRVLAVSYALGAVCIVMIGLSENG-LWLMALAIFGTGIGISGSQVGLN 365 + G+L+DR P VL + +V + W M + I G+S ++ ++ Sbjct: 311 YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIS 370 Query: 366 ALTATLYPTQSRATGVSWSNAIGRCGAIVGSLSGGMMMALNF 407 + ++ Q G+S N G G ++++ Sbjct: 371 TIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPL 412 Score = 41.8 bits (98), Expect = 4e-06 Identities = 40/169 (23%), Positives = 73/169 (43%), Gaps = 1/169 (0%) Query: 251 RQLVITLMLWVVFFMSLLIIYLLSSWMPTLLNHRGIDLQQASWVTAAFQVGGTLGALLLG 310 R I + L ++ F S+L +L+ +P + N +WV AF + ++G + G Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 311 VLMDRLNPFRVLAVSYALGAVCIVMIGLSENGLWLMALAIFGTGIGISGSQVGLNALTAT 370 L D+L R+L + V+ + + L+ +A F G G + + + A Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130 Query: 371 LYPTQSRATGVSWSNAIGRCGAIVGSLSGGMMM-ALNFSFDTLFFVIAI 418 P ++R +I G VG GGM+ +++S+ L +I I Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITI 179
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 30.8 bits (69), Expect = 0.026 Identities = 35/146 (23%), Positives = 59/146 (40%), Gaps = 21/146 (14%) Query: 557 AQLAEDEALRANTFAMATEATSSCE---DRVTFFLHQMKNVQLVHNAEKGQYDNDLA--- 610 AQL + +A +A A EA + + D +T L + N L HNA + +LA Sbjct: 60 AQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIVNEALRHNASRTPSATELAHAN 119 Query: 611 -ALVATGREMFRLGKLEQIAREKVRTLALVDEIEVW-LAYQNKLKKSLGLTSVTSE---- 664 A + E RL K E+ AR+ E E A+Q ++ + +E Sbjct: 120 NAAMQAEDERLRLAKAEEKARK---------EAEAAEKAFQEAEQRRKEIEREKAETERQ 170 Query: 665 MRFFDVSGVTVTDLQDAELQVKAAEK 690 ++ + + L + V+ A+K Sbjct: 171 LKLAEAEEKRLAALSEEAKAVEIAQK 196
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 32.9 bits (75), Expect = 5e-04 Identities = 19/97 (19%), Positives = 33/97 (34%), Gaps = 4/97 (4%) Query: 77 VRKLIAALVGSVLEPLDTLQELADALGNDPNFATTVLNKLAGKQPLDETLTALSGKSVDG 136 + + L E +DT+ E A+G P + A +A V Sbjct: 46 LHEKFEELYDHAAETVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASE--MVQA 103 Query: 137 LIEYVGLRETISRAADALQKSQNGGDIPDKDLFVRRI 173 L+ ++ S + + ++ D DLFV I Sbjct: 104 LVN--DYKQISSESKFVIGLAEENQDNATADLFVGLI 138
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 67.2 bits (164), Expect = 3e-15 Identities = 23/114 (20%), Positives = 48/114 (42%), Gaps = 2/114 (1%) Query: 9 VLIVDDHPLMRRGIRQLLELDPAFHVVAEAGDGASAIDLANRIEPDLILLDLNMKGLSGL 68 +L+ DD +R + Q L A + V + A+ + DL++ D+ M + Sbjct: 6 ILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 69 DTLNALRRDGVTAQIIILTVSDSASDIYALIDAGADGYLLKDSDPEVLLEAIRK 122 D L +++ +++++ ++ + GA YL K D L+ I + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>PF04335#VirB8 type IV secretion protein Length = 227 Score = 27.9 bits (62), Expect = 0.014 Identities = 9/30 (30%), Positives = 11/30 (36%) Query: 1 MNLRRKNRLWVVCAVLAGLGLTTALVLYAL 30 R K WVV V L + + AL Sbjct: 27 AAERSKKLAWVVAGVAGALATAGVVAVAAL 56
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 540 bits (1392), Expect = 0.0 Identities = 261/389 (67%), Positives = 298/389 (76%), Gaps = 17/389 (4%) Query: 1 MKVKVLSLLVPALLVAGAANAAEIYNKDGNKLDLFGKVDGLHYFSDDKGSDGDQTYMRIG 60 MK KVL+L++PALL AGAA+AAEIYNKDGNKLDL+GKVDGLHYFSDD DGDQTYMR+G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 FKGETQVNDQLTGYGQWEYQIQGNQTEG-SNDSWTRVAFAGLKFADAGSFDYGRNYGVTY 119 FKGETQ+NDQLTGYGQWEY +Q N TEG +SWTR+AFAGLKF D GSFDYGRNYGV Y Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 120 DVTSWTDVLPEFGGDTYG-ADNFMQQRGNGYATYRNTDFFGLVDGLDFALQYQGKNGSVS 178 DV WTD+LPEFGGD+Y ADN+M R NG ATYRNTDFFGLVDGL+FALQYQGKN S S Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 179 GEN--------TNGRSLLNQNGDGYGGSLTYAIGEGFSVGGAITTSKRTADQNNTANARL 230 ++ NG + NGDG+G S TY IG GFS G A TTS RT +Q N Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG--T 238 Query: 231 YGNGDRATVYTGGLKYDANNIYLAAQYSQTYNATRFGTSNGSNPSTSYGFANKAQNFEVV 290 GD+A +T GLKYDANNIYLA YS+T N T +G ++ G ANK QNFEV Sbjct: 239 IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDK---GYDGGVANKTQNFEVT 295 Query: 291 AQYQFDFGLRPSVAYLQSKGKDISNGYGASYGDQDIVKYVDVGATYYFNKNMSTYVDYKI 350 AQYQFDFGLRP+V++L SKGKD++ + D+D+VKY DVGATYYFNKN STYVDYKI Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYN-NVNGDDKDLVKYADVGATYYFNKNFSTYVDYKI 354 Query: 351 NLLDKND-FTRDAGINTDDIVALGLVYQF 378 NLLD +D F +DAGI+TDDIVALG+VYQF Sbjct: 355 NLLDDDDPFYKDAGISTDDIVALGMVYQF 383
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 47.9 bits (114), Expect = 8e-09 Identities = 26/145 (17%), Positives = 60/145 (41%), Gaps = 20/145 (13%) Query: 1 MNNMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMP 60 M +++ADD + + ++L + + + ++ L + D +++TD+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GDKYGDGITLIKYIKRHFPSLSIIVLTMNNNPAILSAVLDLDIEGIVLKQGA------PT 114 + L+ IK+ P L ++V++ N +A+ ++GA P Sbjct: 59 D---ENAFDLLPRIKKARPDLPVLVMSAQNTFM--TAIKA-------SEKGAYDYLPKPF 106 Query: 115 DLPKALAALQKGKKFTPESVSRLLE 139 DL + + + + S+L + Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLED 131
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.9 bits (197), Expect = 1e-17 Identities = 29/104 (27%), Positives = 47/104 (45%) Query: 827 ILVVDDHPINRRLLADQLGSLGYQCKTANDGVDALNVLSKNAIDIVLSDVNMPNMDGYRL 886 ILV DD R +L L GY + ++ ++ D+V++DV MP+ + + L Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 887 TQRIRQLGLTLPVVGVTANALAEEKQRCLESGMDSCLSKPVTLD 930 RI++ LPV+ ++A + E G L KP L Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 30.6 bits (69), Expect = 0.012 Identities = 36/179 (20%), Positives = 68/179 (37%), Gaps = 12/179 (6%) Query: 25 ILYFFNYMDRVNIGFAALRMNESLGITPEDFANISSIFFISYLIFQIPSSIGLQKLGARK 84 IL FF+ ++ + + + + P +++ F +++ I +LG ++ Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80 Query: 85 W--ISSIIIGWGAVTGLIFFAKDTQHIL-LARIFLGVFEAGFFPGMVYYLACWFPARERG 141 II +G+V G F +L +AR G A F ++ +A + P RG Sbjct: 81 LLLFGIIINCFGSVIG--FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 142 KVNSFFMLSIAVASVLAAPMSGWIIEHLNTPDYEGWRWLFAIEGIPTVFLGILTFYLLP 200 K +A+ + + G I ++ W +L I I T+ LL Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYI------HWSYLLLIPMI-TIITVPFLMKLLK 190
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 28.2 bits (63), Expect = 0.043 Identities = 16/75 (21%), Positives = 29/75 (38%), Gaps = 15/75 (20%) Query: 196 AERDTQAYLKLDHDFHYVFVKYADNKYISQAHLLISARLLAIRYRLDFTAEYITSSNRGH 255 A+R+ L F VF+ + A+RY L+ Y S+ G Sbjct: 62 ADREGMTDLFASGHFERVFISPH------RL---------AVRYSLENPHAYADSNLTGF 106 Query: 256 ATILDMLKNNNVEGV 270 IL+ ++N ++ + Sbjct: 107 LNILEGCRHNKIQHL 121
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 30.2 bits (68), Expect = 0.002 Identities = 11/74 (14%), Positives = 27/74 (36%), Gaps = 12/74 (16%) Query: 1 MIDWQDLHHSELTVPQLYALLKLRCAVFV--------VEQRCPYLDVDGDDLVGDNRHIL 52 M++ D++H+ L+ + L LR F + D + + ++ Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNN----NTTYLF 56 Query: 53 GWHQDELVAYARIL 66 G + ++ R + Sbjct: 57 GIKDNTVICSLRFI 70
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 47.9 bits (114), Expect = 3e-08 Identities = 32/135 (23%), Positives = 56/135 (41%), Gaps = 16/135 (11%) Query: 185 PGAVAIVAEDSKVARAMLEKGLNAMGIPHQMHVTGKDAWERIQQLAQEAEAEGKPISEKI 244 GA +VA+D R +L + L+ G ++ W I + Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA-------------AGDG 48 Query: 245 ALVLTDLEMPEMDGFTLTRKIKTDERLKKIPVVIHSSLSGSANEDHIRKVKADGYVAK-F 303 LV+TD+ MP+ + F L +IK + +PV++ S+ + + A Y+ K F Sbjct: 49 DLVVTDVVMPDENAFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106 Query: 304 EINELSSVIQEVLER 318 ++ EL +I L Sbjct: 107 DLTELIGIIGRALAE 121
>PERTACTIN#Pertactin signature. Length = 922 Score = 29.7 bits (66), Expect = 0.010 Identities = 20/60 (33%), Positives = 23/60 (38%), Gaps = 4/60 (6%) Query: 99 PIPVETPKPKPVEKPKPQPKPQQPVVAASTPTPAPQPVADDKPAPTGKAYVVQLGALKNA 158 P P P+P P P+P PQ P P QP A P G+ L A NA Sbjct: 569 PAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRE----LSAAANA 624 Score = 28.5 bits (63), Expect = 0.024 Identities = 16/49 (32%), Positives = 17/49 (34%) Query: 106 KPKPVEKPKPQPKPQQPVVAASTPTPAPQPVADDKPAPTGKAYVVQLGA 154 K P KP PQP PQ P P P P +A Q A Sbjct: 566 KAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPA 614
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 28.7 bits (64), Expect = 0.026 Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 6/63 (9%) Query: 204 VRNIVGS-LLEVGAHNQPESWIAELLAARDRTLAAATAKAEGLYLVAVDYPDRFDLPKPP 262 +NI GS ++ + PES + E AA LAA +K++ V VDY + ++ KP Sbjct: 496 TKNIPGSHVIVKNIMDIPESTLLE--AAN---LAAYYSKSQNSSNVPVDYTEVKNVKKPN 550 Query: 263 MGP 265 Sbjct: 551 GAK 553
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 44.8 bits (106), Expect = 4e-07 Identities = 80/362 (22%), Positives = 135/362 (37%), Gaps = 30/362 (8%) Query: 14 NFSLFRIAFAVFLTYMTVGLPLPVIPLFVHHELGYSNTMV---GIAVGIQFFATVLTRGY 70 N L I V L + +GL +PV+P + +L +SN + GI + + Sbjct: 4 NRPLIVILSTVALDAVGIGLIMPVLPGLL-RDLVHSNDVTAHYGILLALYALMQFACAPV 62 Query: 71 AGRLADQYGAKRSALQGMLACGLAGAAWLLAALLPVSAPVKFALLIVGRLILGFGESQLL 130 G L+D++G + +L LAGAA + + +AP +L +GR++ G + Sbjct: 63 LGALSDRFGRRP-----VLLVSLAGAA--VDYAIMATAPF-LWVLYIGRIVAGITGATGA 114 Query: 131 TGTLTWGLGLVGPTRSGKVMSWNGMAIYGALAAGAPLGLL---IHSHFGFAALAGTTMVL 187 G R+ + + + AG LG L H F A A + Sbjct: 115 VAGAYIADITDGDERA-RHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLN 173 Query: 188 PLLAWAFNGTVRKVPAYTGERPSLWSVVGLIWKPGL-----------GLALQGVGFAVIG 236 L K R +L + W G+ + L G A + Sbjct: 174 FLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALW 233 Query: 237 TFISLYFVSNGWTMAGFTLTAFGGAFVLMRI-LFGWMPDRFGGVKVAVVSLLVETAGLLL 295 T G +L AFG L + + G + R G + ++ ++ + G +L Sbjct: 234 VIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYIL 293 Query: 296 LWLAPTAWIALVGAALTGAGCSLIFPALGVEVVKRVPAQVRGTALGGYAAFQDISYGVTG 355 L A W+A L +G + PAL + ++V + +G G AA ++ + G Sbjct: 294 LAFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLT-SIVG 351 Query: 356 PL 357 PL Sbjct: 352 PL 353 Score = 30.2 bits (68), Expect = 0.016 Identities = 32/142 (22%), Positives = 47/142 (33%), Gaps = 8/142 (5%) Query: 252 GFTLTAFGGAFVLMRILFGWMPDRFGGVKVAVVSLLVETAGLLLLWLAPTAWIALVG--- 308 G L + + G + DRFG V +VSL ++ AP W+ +G Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 309 AALTGAGCSLIFPALGVEVVKRVPAQVRGTALGGYAAFQDISYGVTGPLAGMLATSYGYP 368 A +TGA G + R G +A V GP+ G L + Sbjct: 106 AGITGA----TGAVAGAYIADITDGDERARHFGFMSACFGFGM-VAGPVLGGLMGGFSPH 160 Query: 369 SVFLAGAISAVVGILVTILSFR 390 + F A A + L Sbjct: 161 APFFAAAALNGLNFLTGCFLLP 182
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.8 bits (77), Expect = 2e-04 Identities = 16/102 (15%), Positives = 39/102 (38%), Gaps = 4/102 (3%) Query: 61 LRPWNDPEMDIERKVNHDVSLFLVAEVSGEVVG--TVMGGYDGHRGSAYYLGVHPEFRGR 118 + + D +MD+ + FL + +G + ++G + V ++R + Sbjct: 47 FKQYEDDDMDVSYVEEEGKAAFL-YYLENNCIGRIKIRSNWNG-YALIEDIAVAKDYRKK 104 Query: 119 GIANALLNRLEKKLIARGCPKIQIMVRDDNDVVLGMYERLGY 160 G+ ALL++ + + + +D N Y + + Sbjct: 105 GVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 48.2 bits (115), Expect = 1e-08 Identities = 32/116 (27%), Positives = 51/116 (43%), Gaps = 9/116 (7%) Query: 64 VRDGIVWDFFGAVTLVRRHLDTLEQQLGCRFT-HAATSFPPGTDP---RISINVLESAGL 119 ++DG++ DFF +++ + + R + P G R + AG Sbjct: 76 MKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGA 135 Query: 120 EVSHVLDEPTAVA---DLLALDNAG--VVDIGGGTTGIAIVKQGKVTYSADEATGG 170 +++EP A A L + G VVDIGGGTT +A++ V YS+ GG Sbjct: 136 REVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGG 191
>PF06580#Sensor histidine kinase Length = 349 Score = 41.8 bits (98), Expect = 5e-06 Identities = 34/165 (20%), Positives = 65/165 (39%), Gaps = 23/165 (13%) Query: 406 SRALNDAYRQ---LRELLTTFRLTLQQADLPS-ALHEMLEDLQS-------QTPAKLTLD 454 + L D + L L R +L+ ++ +L + L + S Q +L + Sbjct: 184 ALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFE 243 Query: 455 CRLPTLALDAQMQVHLLQIVREAVLNAIKHA-----NASEIAVSCVTAPDGDHTVYIRDN 509 ++ +D Q+ L+Q + E N IKH +I + T +G T+ + + Sbjct: 244 NQINPAIMDVQVPPMLVQTLVE---NGIKHGIAQLPQGGKILLK-GTKDNGTVTLEVENT 299 Query: 510 GIGIGEPHEPAGHYGLNIMRERAERLGG---TLNFSQPSGGGTLV 551 G + + + GL +RER + L G + S+ G + Sbjct: 300 GSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM 344
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1285 bits (3326), Expect = 0.0 Identities = 651/1032 (63%), Positives = 801/1032 (77%), Gaps = 2/1032 (0%) Query: 1 MANFFIDRPIFAWVLAILLCLTGALAIFSLPVEQYPDLAPPNVRITANYPGASAQTLENT 60 MANFFI RPIFAWVLAI+L + GALAI LPV QYP +APP V ++ANYPGA AQT+++T Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMTGLDNLMYMSSQSSGTGQATITLSFIAGTDPDEAVQQVQNQLQSAMRKLPQ 120 VTQVIEQNM G+DNLMYMSS S G TITL+F +GTDPD A QVQN+LQ A LPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 AVQDQGVTVRKTGDTNILTIAFVSTDGSMDKQDIADYVASNIQDPLSRVNGVGDIDAYGS 180 VQ QG++V K+ + ++ FVS + + DI+DYVASN++D LSR+NGVGD+ +G+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYSMRIWLDPAKLNSFQMTTKDVTDAIESQNAQIAVGQLGGTPSVDKQALNATINAQSLL 240 QY+MRIWLD LN +++T DV + ++ QN QIA GQLGGTP++ Q LNA+I AQ+ Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 QTPQQFRDITLRVNQDGSEVKLGDVATVELGAEKYDYLSRFNGNPASGLGVKLASGANEM 300 + P++F +TLRVN DGS V+L DVA VELG E Y+ ++R NG PA+GLG+KLA+GAN + Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 ATAKLVLDRLNELAQYFPHGLEYKIAYETTSFVKASIIDVVKTLLEAIALVFLVMYLFLQ 360 TAK + +L EL +FP G++ Y+TT FV+ SI +VVKTL EAI LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLMGTFSVLYAFGYSINTLTMFAMVLAIGLLVDDAIVVVENVERIM 420 N RATLIPTIAVPVVL+GTF++L AFGYSINTLTMF MVLAIGLLVDDAIVVVENVER+M Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 SEEGLTPREATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGTTGAIYRQFSITIVSAMVL 480 E+ L P+EAT KSM QIQGALVGIAMVLSAVF+PMAFFGG+TGAIYRQFSITIVSAM L Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVAMILTPALCATLLKPLHKGEQHGQRGFFGWFNRTFNRNAERYEKGVAKILHRSLRW 540 SVLVA+ILTPALCATLLKP+ + GFFGWFN TF+ + Y V KIL + R+ Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 ILIYVLLLGGMVFLFLRLPTSFLPQEDRGMFTTSIQLPSGSTQQQTLKVVEKVENYYFTH 600 +LIY L++ GMV LFLRLP+SFLP+ED+G+F T IQLP+G+TQ++T KV+++V +YY + Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKDNIMSVFSTVGSGPGGNGQNVARMFVRLKDWDARDPTTGSSFAIIERATKAFNQIKEA 660 EK N+ SVF+ G G QN FV LK W+ R+ S+ A+I RA +I++ Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 RVFASSPPAISGLGSSAGFDMELQDHAGAGHDALMAARDQLIELAGKN-SSLTRVRHNGL 719 V + PAI LG++ GFD EL D AG GHDAL AR+QL+ +A ++ +SL VR NGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 720 DDSPQLQIDIDQRKAQALGVSIDDINDTLQTAWGSSYVNDFMDRGRVKKVYVQAAAKYRM 779 +D+ Q ++++DQ KAQALGVS+ DIN T+ TA G +YVNDF+DRGRVKK+YVQA AK+RM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 780 LPDDINLWYVRNKDGGMVPFSAFATSRWETGSPRLERYNGYSAVEIVGEAAPGVSTGTAM 839 LP+D++ YVR+ +G MVPFSAF TS W GSPRLERYNG ++EI GEAAPG S+G AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 840 DVMESLVHQLPGGFGLEWTAMSYQERLSGAQAPALYAISLLVVFLCLAALYESWSVPFSV 899 +ME+L +LP G G +WT MSYQERLSG QAPAL AIS +VVFLCLAALYESWS+P SV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 900 MLVVPLGVIGALLATWMRGLENDVYFQVGLLTVIGLSAKNAILIVEFANE-MNQKGHALL 958 MLVVPLG++G LLA + +NDVYF VGLLT IGLSAKNAILIVEFA + M ++G ++ Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 959 DATLYASRQRLRPILMTSLAFIFGVLPMATSTGAGSGSQHAVGTGVMGGMISATVLAIFF 1018 +ATL A R RLRPILMTSLAFI GVLP+A S GAGSG+Q+AVG GVMGGM+SAT+LAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1019 VPLFFVLIRRRF 1030 VP+FFV+IRR F Sbjct: 1021 VPVFFVVIRRCF 1032
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 31.7 bits (72), Expect = 0.007 Identities = 24/122 (19%), Positives = 49/122 (40%), Gaps = 13/122 (10%) Query: 459 SRIAVHPARQREGIGQQLIACACMQAAQCDYLSVSFGYT-------PKLWRFWQRCGFVL 511 SR V +R ++ +G + + + + +Y S GY + +R G+ Sbjct: 100 SRFFVDKSRAKDILGNEYPISSMLFLSMINY-SKDKGYDGIYTIVSHPMLTILKRSGWG- 157 Query: 512 VRMGNHREASSGCYTAMALLPLSDAG-KRLAQQEHRRLRRDADILTQWNGEAIPLAALRE 570 +R+ + + LP+ D + LA++ +R ++ L QW + + A Sbjct: 158 IRVVEQGLSEKEERVYLVFLPVDDENQEALARRINRSGTFMSNELKQW---PLRVPAAIA 214 Query: 571 QA 572 QA Sbjct: 215 QA 216
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 89.0 bits (220), Expect = 1e-19 Identities = 128/547 (23%), Positives = 195/547 (35%), Gaps = 62/547 (11%) Query: 1504 VNVASGATFGGSNGTTVNGKVTNEGTLVFGNSEETGAIFTLNGDLINMGTMTSGSSSSTP 1563 V +AS A + G+ + + N ++ NS +G + +G Sbjct: 415 VALASQARWTGATRAVDSLSIDNATWVMTDNSNVGALRLASDGSVDFQQPAEAGRFKVLT 474 Query: 1564 GNTLYVDGNYTGNGGSLYLNTVLGDDDSATDKLVITGDASGTTDLYINGIGDGAQTTNGI 1623 NTL G+ G +N D +DKLV+ DASG L++ G + N + Sbjct: 475 VNTL--AGS-----GLFRMNVFA--DLGLSDKLVVMQDASGQHRLWVRNSGSEPASANTL 525 Query: 1624 EVVDVGGVSTSDAFELKN---EVNAGLYTYRLYWNESDNDWYLASKAQSDDDDSGGDDTP 1680 +V S + F L N +V+ G Y YRL N + W L Sbjct: 526 LLVQTPLGSAA-TFTLANKDGKVDIGTYRYRLAAN-GNGQWSLV---------------- 567 Query: 1681 SDGGDDGGNVTPPDDGGDGGNVTPPDDGGDGGDVTPPDHGGDVAPQYRADIGAYMGNQ-- 1738 G P G P P A + G Sbjct: 568 --GAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTL 625 Query: 1739 WMARNLQMQTLYDREGSQYRNAD-GSVWARFKAGKAESEAVSGNIDMDSNYSQFQLGGDI 1797 W A + L R G N D G W R A + + + +G D + F+LG D Sbjct: 626 WYA---ESNALSKRLGELRLNPDAGGAWGRGFAQRQQLDNRAGR-RFDQKVAGFELGAD- 680 Query: 1798 LAWGNGQQSVTVGVMASYINADTDSTGNRGADGSQFTSSGNVDGYNLGVYATWFADAQTH 1857 A +G +A Y D TG+ G G+ D ++G YAT+ AD Sbjct: 681 HAVAVAGGRWHLGGLAGYTRGDRGFTGDGG---------GHTDSVHVGGYATYIAD---- 727 Query: 1858 SGAYVDSWYQYGFYNN--SVESGDAGSESYDSTANAV--SLETGYRYDIALSNGNTVSLT 1913 SG Y+D+ + N V D + + V SLE G R+ A + L Sbjct: 728 SGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRFTHA----DGWFLE 783 Query: 1914 PQAQVVWQNYSADSVKDNYGTRIDGQDGDSWTTRLGLRVDGKLYKGSRTVIQPFAEANWL 1973 PQA++ + + G R+ + G S RLGL V ++ +QP+ +A+ L Sbjct: 784 PQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGRQVQPYIKASVL 843 Query: 1974 HTSD-DVSVSFDDATVKQDLPANRAELKVGLQADIDKQWSVRAQVAGQTGSNDFGDLNGS 2032 D +V + + +L RAEL +G+ A + + S+ A G Sbjct: 844 QEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFH 903 Query: 2033 LNLRYNW 2039 RY+W Sbjct: 904 AGYRYSW 910 Score = 37.3 bits (86), Expect = 7e-04 Identities = 76/404 (18%), Positives = 136/404 (33%), Gaps = 59/404 (14%) Query: 112 TGTGLVIETSGGGA-----DDPDGGKYVSNAISLDHYAILELTDAKITTTGIYTQGISAA 166 T +G I+ SG A ++P N + + + T G A Sbjct: 63 TASGTTIKVSGRQAQGILLENPAAELQFRNGSVTSSGQLSDDGIRRFLGTVTVKAGKLVA 122 Query: 167 DGSTLRLTDSTLTIDGNFGVMTLYTGSEATLDGTIVEAANSSSAQVQQGSTLNVLDGSTI 226 D +TL T DG + ++A++ + ++ A Q+++G+ + V S I Sbjct: 123 DHATLANVGDTWDDDG-IALYVAGEQAQASIADSTLQGA--GGVQIERGANVTV-QRSAI 178 Query: 227 TLAQGQINVVAGNTATDEG-STLNLSDSSVSS---AGTMSTIQGTNKAALNLTNATITHT 282 I + D S + L D++V++ +G + + + L L IT Sbjct: 179 VDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDGGHITGG 238 Query: 283 NASGAAVQANNATTLD---ITGGNITSAGT----------------------------GV 311 A+G A L I G+ + G GV Sbjct: 239 RAAGVAAMQGAVVHLQRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPGGFGPVLDGWYGV 298 Query: 312 YILASDARIDGATINADGDGIFITSKRKLDGYEDLNALTVSDANVTSKTVALNIDG---- 367 + S + + + A G I R +L+ NV A Sbjct: 299 DVSGSSVELAQSIVEAPELGAAIRVGRGARVTVSGGSLSAPHGNVIETGGARRFAPQAAP 358 Query: 368 -STTINDPIELTNSTFTA---PTAIKLGSKATIQAEKTMLTGNIVQTDASSSS----LSL 419 S T+ P +KL A+ ++ + + +S ++L Sbjct: 359 LSITLQAGAHAQGKALLYRVLPEPVKLTLTGGADAQGDIVATEL-PSIPGTSIGPLDVAL 417 Query: 420 SQGSTLTGSVDAMFTTLSLDDTSQWNMTDPSTVGNLTNDGDITL 463 + + TG+ A+ +LS+D+ + W MTD S VG L D ++ Sbjct: 418 ASQARWTGATRAV-DSLSIDNAT-WVMTDNSNVGALRLASDGSV 459
>INTIMIN#Intimin signature. Length = 939 Score = 329 bits (845), Expect = e-102 Identities = 178/587 (30%), Positives = 267/587 (45%), Gaps = 59/587 (10%) Query: 53 KGKSFKEQGADYFINSATQGFDNLTPEALES-QARSYLQNQITSSAQSYLEGVMSPYGKI 111 K ++ +Y A L +L A+ + A S L+ + YG Sbjct: 154 KSNMTDDKALNYAAQQAASLGSQLQSRSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTA 213 Query: 112 RTSLSVGEGGDLDGSSLDYFIPWYDNQSTLFFSQISAQRKEDRTIGNFGLGVRQNVGNWL 171 +L G + DGSSLD+ +P+YD++ L F Q+ A+ + R N G G R + + Sbjct: 214 EVNLQSGN--NFDGSSLDFLLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENM 271 Query: 172 LGGNAFYDYDFTRGHRRLGLGTEAWTDYLKFSGNYYHPLSDWKDSEDFDFYEERPARGWD 231 LG N F D DF+ + RLG+G E W DY K S N Y +S W +S + Y+ERPA G+D Sbjct: 272 LGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFD 331 Query: 232 IRMESWLPFYPQLGAKLVYEQYYGDEVALFGTDNLQKDPHAVTLGLEYTPVPLVTVGTDY 291 IR +LP YP LGAKL+YEQYYGD VALF +D LQ +P A T+G+ YTP+PLVT+G DY Sbjct: 332 IRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDY 391 Query: 292 KAGTGDSNDFSVNATVNYQIGTPLAAQLDPENVKIQHSLMGSRTDFVDRNNFIILEYREK 351 + GTG+ ND + YQ P + Q++P+ V +L GSR D V RNN IILEY+++ Sbjct: 392 RHGTGNENDLLYSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQ 451 Query: 352 DPLDVTLWLKADATNEHPECVIEDTPEAAVGLEKCKWTVNALINHHYKIISASWQAKNNA 411 D L + + + T E I+ ++ GL++ W +AL + +I + Q+ + Sbjct: 452 DILSLNIPHDINGT-ERSTQKIQLIVKSKYGLDRIVWDDSALRSQGGQIQHSGSQSAQD- 509 Query: 412 ARTLVMPVVKANALTEGNNNSWNLVLPAWVNADTEEQRTALNTWKVRMTLEDEKGNKQNS 471 + +LPA+V + N +KV D GN N+ Sbjct: 510 ---------------------YQAILPAYV-------QGGSNVYKVTARAYDRNGNSSNN 541 Query: 472 GVVEITVQQDRKIELIVDNIADTDRSDHSHEASALADGEDGVVMDLLITDSFGDSTDRNG 531 ++ ITV + +VD + TD + + + SA ADG + + + Sbjct: 542 VLLTITVLSN---GQVVDQVGVTDFT--ADKTSAKADGTEAITYTATVK----------- 585 Query: 532 NELVDDAMTPVLYDSNDKKVTLAQTPCTTETPCVF--IASRDKEAGTVTLSSTLPGTFRW 589 V A PV ++ L+ T DK V + T T Sbjct: 586 KNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSAL 645 Query: 590 KAKADAYGDSNY--------VDVTFIGDNLSALNAVIYQVKAANPVN 628 A A + D T + + A+ + +K PV+ Sbjct: 646 NANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVS 692
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 31.7 bits (72), Expect = 0.007 Identities = 71/362 (19%), Positives = 119/362 (32%), Gaps = 52/362 (14%) Query: 44 SHAINLFSAYA-SLVYVTPILGGWLADRLLGNRVAVITGALLMTLGHVVLGLESDSTLSL 102 +H L + YA P+LG +DR G R ++ + + ++ L + Sbjct: 43 AHYGILLALYALMQFACAPVLGAL-SDRF-GRRPVLLVSLAGAAVDYAIMATAP--FLWV 98 Query: 103 YAALAIIICGYGLFKSNISCLLGELYAPDDNRRDGGFSLLYAAGNIGSIAAPIACGLAAQ 162 I+ G + + ++ D+ R GF + A G +A P+ GL Sbjct: 99 LYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF--MSACFGFGMVAGPVLGGLMGG 156 Query: 163 WYGWHVGFALAGVGMFIGLLIFLSGHRHFQQTRGVNRPALRAVKFALPT-WGWLVLMLCI 221 + H F A + L FL+G ++ R LR + W M + Sbjct: 157 -FSPHAPFFAAA---ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVV 212 Query: 222 APVFFTLLLENNWSGYVLAIVCVFAAQLI----ARIMVKFPEHRRALWQIVLLMITGTLF 277 A + F QL+ A + V F E R W + I+ F Sbjct: 213 AALMAVF----------------FIMQLVGQVPAALWVIFGEDRFH-WDATTIGISLAAF 255 Query: 278 WVLAQQGGSSISLFIDRFVNRHWLNMTVPTALFQSVNAIAVMAAGVVLAWLSSPKESARS 337 +L S I V IA ++LA+ + Sbjct: 256 GIL----HSLAQAMITGPVAARLGERRALMLGM-----IADGTGYILLAFAT-------- 298 Query: 338 VLRVWLKFAVGLVLMGGGFMLLALNAHQARLDGQASMGMMIAGLALMGFAELFIDPVAMA 397 R W+ F + ++L GG + AL A +R + G + LA + + P+ Sbjct: 299 --RGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356 Query: 398 QI 399 I Sbjct: 357 AI 358
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 481 bits (1240), Expect = e-170 Identities = 166/480 (34%), Positives = 247/480 (51%), Gaps = 42/480 (8%) Query: 7 AHLLLVDDDPGLLKLLGMRLTSEGYSVVTAESGQEGLRVLHREKVDLVISDLRMDEMDGM 66 A +L+ DDD + +L L+ GY V + R + DLV++D+ M + + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 67 QLFTEIQKVQPGMPVIILTAHGSIPDAVAATQKGVFSFLTKPIDRDALYKAIDEALE--- 123 L I+K +P +PV++++A + A+ A++KG + +L KP D L I AL Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 124 --QSAPATDDSWRNAIVTRSPLMLRLLEQARMVAQSDVSVLINGQSGTGKEIFAQAIHNA 181 S D +V RS M + + Q+D++++I G+SGTGKE+ A+A+H+ Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183 Query: 182 SPRSNKPFVAINCGALPEQLLESELFGHARGAFTGAVSNREGLFQAAEGGTLFLDEIGDM 241 R N PFVAIN A+P L+ESELFGH +GAFTGA + G F+ AEGGTLFLDEIGDM Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243 Query: 242 PAPLQVKLLRVLQERKVRPLGSNRDIDIDVRIISATHRDLPKAMARGEFREDLYYRLNVV 301 P Q +LLRVLQ+ + +G I DVRI++AT++DL +++ +G FREDLYYRLNVV Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303 Query: 302 SLKIPALAERTEDIPLLANHLLRQSAQRHKPFVRAFSTDAMKRLMTASWPGNVRQLVNVI 361 L++P L +R EDIP L H ++Q+ + V+ F +A++ + WPGNVR+L N++ Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWPGNVRELENLV 362 Query: 362 EQCVALTSSPVISDALVEQALEGENTALPT------------------------------ 391 + AL VI+ ++E L E P Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422 Query: 392 ------FAEARNQFELNYLRKLLQITKGNVTHAARMAGRNRTEFYKLLSRHELDANDFKE 445 + + E + L T+GN AA + G NR K + + Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSSR 482
>PF06580#Sensor histidine kinase Length = 349 Score = 33.3 bits (76), Expect = 0.002 Identities = 20/127 (15%), Positives = 45/127 (35%), Gaps = 26/127 (20%) Query: 354 DVDLEAERCIAEPMLLMSVLDNLYSNAVHYG----AESGNICIRSRSQGSTVYIDVVNSG 409 ++ PML+ + L N + +G + G I ++ TV ++V N+G Sbjct: 245 QINPAIMDVQVPPMLVQT----LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300 Query: 410 EPIPQTEREMIFEPFFQGSHQRKGAVKGSGLGLSIARDCIRRMQGEIQLVDDNAQEVCFR 469 + +E +G GL R+ ++ + G + + ++ Sbjct: 301 SLALKNTKE------------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN 342 Query: 470 ISLPLPA 476 + +P Sbjct: 343 AMVLIPG 349
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.3 bits (76), Expect = 0.002 Identities = 33/177 (18%), Positives = 69/177 (38%), Gaps = 3/177 (1%) Query: 213 FWLLFMILALGVFSGMVISSSSAQIGMTQYGLLSGAL-VVSLVSIFNSIGRLFWGGLTDK 271 WL + V + MV++ S I + V + + SIG +G L+D+ Sbjct: 17 IWLCILSF-FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75 Query: 272 LGGYNTLVIVYLFTCVCMLLLLFFNGNTSVFYFSALGVGFAYAGILVIFPGLTSQNFGMR 331 LG L+ + C ++ + S+ + G A + + ++ Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135 Query: 332 NQGLNYGFMYFGFAVGAVIAPYVTSAIAKYTGSYNTVFILTTVLLLIGVVLTLITKK 388 N+G +G + A+G + P + IA Y ++ + ++ + ++ L + KK Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIH-WSYLLLIPMITIITVPFLMKLLKK 191
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 31.0 bits (70), Expect = 0.007 Identities = 30/121 (24%), Positives = 50/121 (41%), Gaps = 15/121 (12%) Query: 7 YCGFIAIVGRPNVGKSTLLNKLLGQKISITSRKAQTTRHRIVGIHTEGPYQAIYVDTPGL 66 G I +G + G + N LL ++ IT + T+ E I +DTPG Sbjct: 26 NSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS------FQWENTKVNI-IDTPG- 77 Query: 67 HMEEKRAINRLMNKAASSSIGDVELVIFVVEGTRWTPDDEMVLNKLRDGKAPVILAVNKV 126 HM+ + R + S + L+I +G + ++ + LR P I +NK+ Sbjct: 78 HMDFLAEVYRSL-----SVLDGAILLISAKDGVQ--AQTRILFHALRKMGIPTIFFINKI 130 Query: 127 D 127 D Sbjct: 131 D 131
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 153 bits (388), Expect = 8e-42 Identities = 95/458 (20%), Positives = 181/458 (39%), Gaps = 91/458 (19%) Query: 3 NIRNFSIIAHIDHGKSTLSDRIIQICGG---LSDREMEAQVLDSMDLERERGITIKAQSV 59 I N ++AH+D GK+TL++ ++ G L + D+ LER+RGITI+ Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 60 TLDFKASDGETYQLNFIDTPGHVDFSYEVSRSLAACEGALLVVDAGQGVEAQTLANCYTA 119 + + E ++N IDTPGH+DF EV RSL+ +GA+L++ A GV+AQT + Sbjct: 62 SFQW-----ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116 Query: 120 MEMDLEVVPVLNKIDLPAADPERVAEEIED-----------------IVGIDATDAVRCS 162 +M + + +NKID D V ++I++ + + T++ + Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176 Query: 163 AKTGVGVTDVLERLVRDIPP---------------------------------------- 182 G D+LE+ + Sbjct: 177 T-VIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVIT 235 Query: 183 -----PQGDPDGPLQALIIDSWFDNYLGVVSLVRIKNGTMRKGDKIKVMSTGQTYNADRL 237 L + + ++ +R+ +G + D +++ + Sbjct: 236 NKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI---KIT 292 Query: 238 GIFTPKQVDRTELKCGEVGWLVCAIKDIL--GAPVGDTLTSARNPAEKALPGFKKVKPQV 295 ++T + ++ G +V + L + +GDT P + + + P + Sbjct: 293 EMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTK---LLPQRERI---ENPLPLL 346 Query: 296 YAGLFPVSSDDYESFRDALGKLSLNDASL-FYEPESSSALGFGFRCGFLGLLHMEIIQER 354 + P E DAL ++S +D L +Y ++ + FLG + ME+ Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIIL----SFLGKVQMEVTCAL 402 Query: 355 LEREYDLDLITTAPTVVYEVET---TAKETIYVDSPSK 389 L+ +Y +++ PTV+Y +E A+ TI+++ P Sbjct: 403 LQEKYHVEIEIKEPTVIY-MERPLKKAEYTIHIEVPPN 439 Score = 35.2 bits (81), Expect = 7e-04 Identities = 18/81 (22%), Positives = 30/81 (37%), Gaps = 1/81 (1%) Query: 398 ELREPIAECHMLLPQAYLGNVITLCIEKRGVQTNMVYHGNQVALTYEIPMAEVVLDFFDR 457 EL EP + PQ YL T + + N+V L+ EIP + ++ Sbjct: 534 ELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARC-IQEYRSD 592 Query: 458 LKSTSRGYASLDYNFKRFQAS 478 L + G + K + + Sbjct: 593 LTFFTNGRSVCLTELKGYHVT 613
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 33.5 bits (76), Expect = 0.003 Identities = 52/231 (22%), Positives = 81/231 (35%), Gaps = 11/231 (4%) Query: 126 AAGQASEQAQTSAGQAAESATAAVNAAGAAEASATQAASSAASAESSAGTA------TTK 179 + G + S AA ATA + A + A QAA + A+AE+ A T + Sbjct: 34 SGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQR 93 Query: 180 AGEASASAASADTARTAAAASAAAA--KTSEANADVSRTAAGDSAAAAAASATAAQTSAA 237 + A + +RT +A A A +A + R A + A A A A Sbjct: 94 LKDIVNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEA 153 Query: 238 RAGASETAAKTSETQAASSAGDAGASATAAAASEKAAAASAAAAKISETNAATSASTAAA 297 E + +ET+ +A AA SE+A A A K+S + Sbjct: 154 EQRRKEIEREKAETERQLKLAEA-EEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGEI 212 Query: 298 SATAASSSASEASNHAAASDTSASLA--AQSSTAAGAAATRAEDAAKRAED 346 + S+S + A + AQ+S + + RA D Sbjct: 213 KTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRAND 263
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 37.7 bits (87), Expect = 2e-04 Identities = 40/227 (17%), Positives = 72/227 (31%), Gaps = 25/227 (11%) Query: 516 TEYDQQQLNELQEQKRQKDLLDAKAQAERNYQETQKRRNEQNAALNRDNETESLRHQREV 575 + L+ AK + + R S ++ Sbjct: 189 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKI 248 Query: 576 ARITAMQYADAAVRNAALERENERHKKALSQQAKKPKTYHNDEARRLLLQYSQQQAQTEG 635 + A + A R A LE+ E + EA + L+ + + + Sbjct: 249 KTLEA-EKAALEARQAELEKALEGAMNFSTA---DSAKIKTLEAEKAALEAEKADLEHQS 304 Query: 636 QITAAKLSTTE----------KMTEAHKQLLSFQQRIADLSGKKLTADEQSVLAHKDEIA 685 Q+ A + K EA Q L Q +I++ S + L D + K ++ Sbjct: 305 QVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLE 364 Query: 686 LALQKL-------DISQQDLQHQ-NALNELKKKTLTLTSQLADEESR 724 QKL + S+Q L+ +A E KK+ + L + S+ Sbjct: 365 AEHQKLEEQNKISEASRQSLRRDLDASREAKKQ---VEKALEEANSK 408
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.3 bits (71), Expect = 0.009 Identities = 31/198 (15%), Positives = 64/198 (32%), Gaps = 36/198 (18%) Query: 177 QQQSQERAARAELLQYQLKELNDFNPQAGEFEQIDEEYKRLANSGQLLTTSQNALALLAD 236 + QS AR E +YQ+ + + E + DE Y + + ++L + + Sbjct: 138 KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS- 196 Query: 237 GEDVNLQSQLYSAKQLVSELVGMDSKLSGILDMLEEATIQLTEASDELRHYCERLDLDPN 296 Q+Q Y Q L ++ +L A I E + Sbjct: 197 ----TWQNQKY---QKELNLDKKRAERLTVL-----ARINRYENLSRV------------ 232 Query: 297 RLFELEQRIAKQISLARKHHVSPEALPQLYQSLLEEQQQLDDQADSLETLTLAVNKHHQQ 356 + R+ SL K ++ ++LE++ + + + L + + + Sbjct: 233 ----EKSRLDDFSSLLHKQAIA-------KHAVLEQENKYVEAVNELRVYKSQLEQIESE 281 Query: 357 ALETAQALHQQRQFYAQE 374 L + Q + E Sbjct: 282 ILSAKEEYQLVTQLFKNE 299
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 27.2 bits (60), Expect = 0.027 Identities = 16/78 (20%), Positives = 30/78 (38%), Gaps = 8/78 (10%) Query: 49 GSRVLESSPAQMTAAVDVSKAGISKTFTTRNQLTRNQSILMHLVDGPFKKLIGGWK---- 104 G+ VLE P+ + +D G + + LT I +++G +++ + Sbjct: 113 GNAVLEVDPSITFSIIDRLFGGTGQAAKVQRDLT---DIENSVMEGVIVRILANVRESWT 169 Query: 105 -FTPLSPEACRIEFQLDF 121 L P +IE F Sbjct: 170 QVIDLRPRLGQIETNPQF 187
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 35.2 bits (81), Expect = 5e-04 Identities = 32/224 (14%), Positives = 63/224 (28%), Gaps = 32/224 (14%) Query: 209 DVVQTEARIESARSQLAQYQANLDSAKASLMSWLGWNSLNGINNDFPAKLARSCETATPD 268 + EA +S L Q + + S D P S E Sbjct: 128 TALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187 Query: 269 DRLVPAVLAAW-AQANVARANLDYASAQ---MTPTISLEPSVQHYLNDKYPSHEVLDKTQ 324 L+ + W Q NLD A+ + I+ ++ + L Q Sbjct: 188 TSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQ 247 Query: 325 YSTWVKVEMPLYQGGGLTARRNAASHAVDAAQSTIQRTRLDVRQKLMEARSQAMSLASAL 384 V + ++ +L +SQ + S Sbjct: 248 AIAKHAVL-------------------------EQENKYVEAVNELRVYKSQLEQIES-- 280 Query: 385 QILRRQQQLSERTRELYQQQYLDLGSRPLLDVLNAEQEVYQARF 428 +IL +++ T +L++ + LD + ++ E+ + Sbjct: 281 EILSAKEEYQLVT-QLFKNEILDKLRQTTDNIGLLTLELAKNEE 323
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 244 bits (624), Expect = 1e-78 Identities = 98/432 (22%), Positives = 179/432 (41%), Gaps = 56/432 (12%) Query: 9 ERAFSGAGRIVLICSLLFLILGI-WAWFGRLDEVSTGNGKVIPSSREQVLQSLDGGILAQ 67 E S R+V + FL++ + G+++ V+T NGK+ S R + ++ ++ I+ + Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109 Query: 68 LTVREGDRVQANQIVARLDPTRLASNVGESAAKYRASLASSARLTA-----EVNDLPL-- 120 + V+EG+ V+ ++ +L ++ ++ + + R E+N LP Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169 Query: 121 --AFPAELNGWPDLIAAETRLYKSR-----------RAQLADTEAELRDALASVNK---- 163 P N + + T L K + L AE LA +N+ Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229 Query: 164 ------ELAITQRLEKSGAASHVEVLRLQRQKSDLG---------------------LKI 196 L L A + VL + + + + Sbjct: 230 SRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEY 289 Query: 197 TDLRSQYYVQAREALSKANAEVDMLSAILKGREDSVTRLTVRSPVRGIVKNIQVTTIGGV 256 + + + + L + + +L+ L E+ +R+PV V+ ++V T GGV Sbjct: 290 QLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGV 349 Query: 257 IPPNGEMMEIVPVDDRLLIETRLSPRDIAFIHPGQRALVKITAYDYAIYGGLDGVVETIS 316 + +M IVP DD L + + +DI FI+ GQ A++K+ A+ Y YG L G V+ I+ Sbjct: 350 VTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN 409 Query: 317 PDTIQDKVKPEIFYYRVFIRTHQDYLQNKSGRRFSIVPGMIATVDIKTGEKTIVDYLIKP 376 D I+D+ + V I ++ L + + GM T +IKTG ++++ YL+ P Sbjct: 410 LDAIEDQRLGL--VFNVIISIEENCLSTG-NKNIPLSSGMAVTAEIKTGMRSVISYLLSP 466 Query: 377 F-NRAKEALRER 387 E+LRER Sbjct: 467 LEESVTESLRER 478
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 31.6 bits (71), Expect = 0.019 Identities = 27/177 (15%), Positives = 62/177 (35%), Gaps = 8/177 (4%) Query: 15 VDKLTRPFRSAQASSKELATAIQQSRARLKELDAQAGRID--------GFRKASAQLAVT 66 +L + A S + I+ A L+A+ ++ + L + Sbjct: 262 QAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDAS 321 Query: 67 GNSLKAAREEAAKLATQFSATNRPTAAQARLLEQAKNRVTELQSKYNGLRQSVQRQRLAL 126 + K E KL Q + + R L+ ++ +L++++ L + + + Sbjct: 322 REAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASR 381 Query: 127 NEAGLDTKKLSSAQRELRQNADETRQALDRQQKSLKRLGEQQARMNAVRDQYSRRLE 183 D A++++ + +E L +K K L E + + + +LE Sbjct: 382 QSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLE 438
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 24.9 bits (54), Expect = 0.035 Identities = 11/32 (34%), Positives = 16/32 (50%) Query: 15 AVLLAWLGDLSLKDASTVGGVLIGVLMLAINW 46 A W+ DLS +D + +L+GV M I Sbjct: 449 APFALWIHDLSAQDPYYILPILMGVTMFFIQK 480
>SECA#SecA protein signature. Length = 901 Score = 39.9 bits (93), Expect = 1e-05 Identities = 14/27 (51%), Positives = 18/27 (66%) Query: 18 FVENDKVKLGRNDKCWCKSGLKYKKCH 44 + + K+GRND C C SG KYK+CH Sbjct: 871 AAQTGERKVGRNDPCPCGSGKKYKQCH 897
>FLAGELLIN#Flagellin signature. Length = 507 Score = 278 bits (712), Expect = 5e-90 Identities = 267/515 (51%), Positives = 314/515 (60%), Gaps = 18/515 (3%) Query: 2 AQVINTNSLSLLTQNNLNKSQSALGTAIERLSSGLRINSAKDDAAGQAIANRFTANIKGL 61 AQVINTNSLSLLTQNNLNKSQS+L +AIERLSSGLRINSAKDDAAGQAIANRFT+NIKGL Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60 Query: 62 TQASRNANDGISIAQTTEGALNEINNNLQRVRELAVQSANSTNSQSDLDSIQAEITQRLN 121 TQASRNANDGISIAQTTEGALNEINNNLQRVREL+VQ+ N TNS SDL SIQ EI QRL Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120 Query: 122 EIDRVSGQTQFNGVKVLAQDNTLTIQVGANDGETIDIDLKQINSQTLGLDSLNVQKAYDV 181 EIDRVS QTQFNGVKVL+QDN + IQVGANDGETI IDL++I+ ++LGLD NV + Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180 Query: 182 KDTAVTTKAYANNGTTLDVSGLDDAAIKAATGGTNGTASVTGGAVKFDADNNKYFVTIGG 241 + + G G + + +G A VT D G Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSG-----AVVTDTTAPTVPDKVYVNAANGQ 235 Query: 242 FTGADAAKNGDYEVNVATDGTVTLAAGATKTTMPAGATTKTEVQELKDTPAVVSADAKNA 301 T DA N ++ T T A A + E + D K Sbjct: 236 LTTDDAENNTAVDLFKTTKST---AGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTG 292 Query: 302 LIAGGVDATDANGAELVKMSYTDKNGKTIEGGYALKAGDKYYAA------DYDEATGAIK 355 G +T NG ++ G L++ Y + +D+ T Sbjct: 293 NDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNES 352 Query: 356 AKTTSYTAADGTTKTAANQLGGVDG----KTEVVTIDGKTYNASKAAGHDFKAQPELAEA 411 AK + A + + + G + + VT+ GKT K A E A A Sbjct: 353 AKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAA 412 Query: 412 AAKTTENPLQKIDAALAQVDALRSDLGAVQNRFNSAITNLGNTVNNLSEARSRIEDSDYA 471 A K+T NPL ID+AL++VDA+RS LGA+QNRF+SAITNLGNTV NL+ ARSRIED+DYA Sbjct: 413 AKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYA 472 Query: 472 TEVSNMSRAQILQQAGTSVLAQANQVPQNVLSLLR 506 TEVSNMS+AQILQQAGTSVLAQANQVPQNVLSLLR Sbjct: 473 TEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507
>PF05272#Virulence-associated E family protein Length = 892 Score = 35.4 bits (81), Expect = 0.002 Identities = 46/219 (21%), Positives = 68/219 (31%), Gaps = 49/219 (22%) Query: 989 IIPPG----TVVAVVGRSGAGKSTLIKLLAGLYSPGSGQIRVGER-----------LIDA 1033 ++ PG V + G G GKSTLI L GL +G + Sbjct: 588 VMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYEL 647 Query: 1034 ASLSDYRRQTGLVTQDVALFSGDIAENI-RYPRPDSSDTEVESAARRAGLFETV---QHL 1089 + ++ +RR D + RY V+ R+ ++ T Q+L Sbjct: 648 SEMTAFRR------ADAEAVKAFFSSRKDRYRGA--YGRYVQDHPRQVVIWCTTNKRQYL 699 Query: 1090 --PLGFRT--PVNNGG----TDLSAGQRQLIALA--------RAHLA--QAHILLLDEAT 1131 G R PV G L + QL A A R + I E Sbjct: 700 FDITGNRRFWPVLVPGRANLVWLQKFRGQLFAEALHLYLAGERYFPSPEDEEIYFRPEQE 759 Query: 1132 AR-IDRSAEERLMTSLTRVTHTEKRIALIVAHRLTTARR 1169 R ++ + RL LTR A A + + Sbjct: 760 LRLVETGVQGRLWALLTREG---APAAEGAAQKGYSVNT 795
>PF06580#Sensor histidine kinase Length = 349 Score = 33.7 bits (77), Expect = 0.001 Identities = 23/101 (22%), Positives = 38/101 (37%), Gaps = 21/101 (20%) Query: 370 LLDNALKY----TPEQGIVTARLERDGDAVTLVVEDSGPGIDDEHIHLALQPFHRLDNVG 425 L++N +K+ P+ G + + +D VTL VE++G L N Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA--------------LKNTK 308 Query: 426 NVAGAGIGLALVND-IARLHRTHPHFSRSEALGGLYVRIRF 465 G GL V + + L+ T SE G + + Sbjct: 309 E--STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 96.8 bits (241), Expect = 2e-25 Identities = 35/122 (28%), Positives = 61/122 (50%), Gaps = 1/122 (0%) Query: 2 RLLLAEDNRELAHWLEKALVQNGFAVDCVFDGLAADHLLHSEMYALAVLDINMPGMDGLE 61 +L+A+D+ + L +AL + G+ V + + + L V D+ MP + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VVQRLRKRGQTLPVLLLTARSAVADRVKGLNVGADDYLPKPFELEE-LDARLRALLRRSA 120 ++ R++K LPVL+++A++ +K GA DYLPKPF+L E + RAL Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 GQ 122 Sbjct: 125 RP 126
>INTIMIN#Intimin signature. Length = 939 Score = 27.3 bits (60), Expect = 0.030 Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 6/69 (8%) Query: 82 SVDDQVKTTTPAAESQFYTVKSGDTLSAISKQVYGNANLYNKIFEANKPMLKSPE---KI 138 D ++ T FYT+K+G+T++ +SK N + I+ NK + S K Sbjct: 48 GSDSKLLTHNSYQNRLFYTLKTGETVADLSKSQDINLST---IWSLNKHLYSSESEMMKA 104 Query: 139 YPGQVLRIP 147 PGQ + +P Sbjct: 105 EPGQQIILP 113
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 27.1 bits (60), Expect = 0.044 Identities = 10/45 (22%), Positives = 18/45 (40%), Gaps = 5/45 (11%) Query: 1 MKPRQRQAAILEHLQKQGKCSVEEL-----AQYFDTTGTTIRKDL 40 M QR I E + + +EL ++ T T+ +D+ Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDI 45
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 375 bits (964), Expect = e-128 Identities = 123/340 (36%), Positives = 180/340 (52%), Gaps = 21/340 (6%) Query: 187 MIGLSPAMTQLKKEIEIVAGSDLNVLIGGETGTGKELVAKAIHQGSPRAVNPLVYLNCAA 246 ++G S AM ++ + + + +DL ++I GE+GTGKELVA+A+H R P V +N AA Sbjct: 139 LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAA 198 Query: 247 LPESVAESELFGHVKGAFTGAISNRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYG 306 +P + ESELFGH KGAFTGA + +G+FE A+ GTLFLDEIG++ + Q +LLRVLQ G Sbjct: 199 IPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQG 258 Query: 307 DIQRVGDDRSLRVDVRVLAATNRDLREEVLAGRFRADLFHRLSVFPLFVPPLRERGDDVV 366 + VG +R DVR++AATN+DL++ + G FR DL++RL+V PL +PPLR+R +D+ Sbjct: 259 EYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIP 318 Query: 367 LLAGYFCEQCRLRLGLSRVVLSPGARRHLLNYGWPGNVRELEHAIHRAVVLARATRAGDE 426 L +F +Q + GL A + + WPGNVRELE+ + R L E Sbjct: 319 DLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITRE 377 Query: 427 VIL-----EAQHFALS---------------EDVLPAPPAESFLALPTCRNLRESTENFQ 466 +I E + E+ + A ALP + Sbjct: 378 IIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEME 437 Query: 467 REMIRQALAQNNHNWAASARALETDVANLHRLAKRLGLKD 506 +I AL N +A L + L + + LG+ Sbjct: 438 YPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSV 477
>PF04183#IucA / IucC family Length = 580 Score = 29.8 bits (67), Expect = 0.047 Identities = 17/89 (19%), Positives = 34/89 (38%), Gaps = 13/89 (14%) Query: 342 HDIADGFLLHNRDIVQRMDDSV-----VRDSGEMLRRSRGYVPDAIALPPGFRDV----- 391 + +A + H ++I M + V ++D +R + P+ +LP RDV Sbjct: 415 YGVA--LIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDSLPQEVRDVTSRLS 472 Query: 392 PPILCLGADLKNTFCLVRGAQAVVSQHLG 420 L + ++R + + LG Sbjct: 473 ADYLIHDLQTGHFVTVLRFI-SPLMVRLG 500
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 27.0 bits (59), Expect = 0.011 Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 8/75 (10%) Query: 12 IDGNQAKVD--VCGIQRDVDLTLVGSCDENGQPRLGQWVLVHVGFAMSVINEAEARDTLD 69 I GNQ + D G+ D L ++NG+P G W+ + + F + ++ ++ D + Sbjct: 171 IIGNQIRTDQKFMGV-FDESLKERQEAEKNGEPTGGDWLDIFLSF---IFDKKQSSDVKE 226 Query: 70 ALQN--MFDVEPDVG 82 A+ + V+PD+ Sbjct: 227 AINQEPVPHVQPDIA 241
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 382 bits (983), Expect = e-128 Identities = 142/373 (38%), Positives = 206/373 (55%), Gaps = 39/373 (10%) Query: 350 YQEIHRLKERLVDENLALTEQLNNVDSEFGEIIGRSEAMYNVLKQVEMVAQSDSTVLILG 409 E+ + R + E +L + + ++GRS AM + + + + Q+D T++I G Sbjct: 108 LTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITG 167 Query: 410 ETGTGKELIARAIHNLSGRSGRRMVKMNCAAMPAGLLESDLFGHERGAFTGASAQRIGRF 469 E+GTGKEL+ARA+H+ R V +N AA+P L+ES+LFGHE+GAFTGA + GRF Sbjct: 168 ESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRF 227 Query: 470 ELADKSSLFLDEVGDMPLELQPKLLRVLQEQEFERLGSNKLIQTDVRLIAATNRDLKKMV 529 E A+ +LFLDE+GDMP++ Q +LLRVLQ+ E+ +G I++DVR++AATN+DLK+ + Sbjct: 228 EQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSI 287 Query: 530 ADREFRNDLYYRLNVFPIQLPPLRERPEDIPLLVKAFTFKIARRMGRNIDSIPAETLRTL 589 FR DLYYRLNV P++LPPLR+R EDIP LV+ F + A + G ++ E L + Sbjct: 288 NQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFV-QQAEKEGLDVKRFDQEALELM 346 Query: 590 SSMEWPGNVRELENVVERAVLLTRGNVLQLS-LPDITAVTPDTSPVATESDKEG------ 642 + WPGNVRELEN+V R L +V+ + + SP+ + + G Sbjct: 347 KAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQ 406 Query: 643 ----------------------------EDEYQLIIRVLKETNGVVAGPKGAAQRLGLKR 674 E EY LI+ L T G AA LGL R Sbjct: 407 AVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQI---KAADLLGLNR 463 Query: 675 TTLLSRMKRLGID 687 TL +++ LG+ Sbjct: 464 NTLRKKIRELGVS 476
>adhesinb#Adhesin B signature. Length = 310 Score = 321 bits (824), Expect = e-112 Identities = 89/309 (28%), Positives = 164/309 (53%), Gaps = 14/309 (4%) Query: 4 LHRLKTLLIAGIVAILAL-------SPAYAKEKFKVITTFTVIADMAKNVAGDAAEVSSI 56 + + + L++ + + S K V+ T ++IAD+ KN+AGD + SI Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSI 60 Query: 57 TKPGAEIHEYQPTPGDIKRAQGAQLILANGLNLER----WFARFYQHLSGVPE---VVVS 109 G + HEY+P P D+K+ A LI NG+NLE WF + ++ VS Sbjct: 61 VPVGQDPHEYEPLPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVS 120 Query: 110 TGVKPMGITEGPYNGKPNPHAWMSAENALIYVDNIRDALVKYDPDNAQIYKQNAERYKAK 169 GV + + GK +PHAW++ EN +IY NI L + DP N + Y++N + Y K Sbjct: 121 EGVDVIYLEGQSEKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEK 180 Query: 170 IRQMADPLRAELEKIPADQRWLVTSEGAFSYLARDNDMKELYLWPINADQQGTPKQVRKV 229 + + + + IP +++ +VTSEG F Y ++ ++ Y+W IN +++GTP Q++ + Sbjct: 181 LSALDKEAKEKFNNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTL 240 Query: 230 IDTIKKHHIPAIFSESTVSDKPARQVARESGAHYGGVLYVDSLSAADGPVPTYLDLLRVT 289 ++ ++K +P++F ES+V D+P + V++++ ++ DS++ +Y +++ Sbjct: 241 VEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYN 300 Query: 290 TETIVNGIN 298 E I G++ Sbjct: 301 LEKIAEGLS 309
>BORPETOXINA#Bordetella pertussis toxin A subunit signature. Length = 269 Score = 30.5 bits (68), Expect = 0.007 Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 8/57 (14%) Query: 201 IISDLTRKWSQAEVAGKLFMSVSSLKRKLAAEEVSFSKIYLDARMNQAIKLLRMGAG 257 ++ LT + Q + F+S SS +R ++++YL+ RM +A++ R G G Sbjct: 66 VLDHLTGRSCQVGSSNSAFVSTSSSRR--------YTEVYLEHRMQEAVEAERAGRG 114
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 42.6 bits (100), Expect = 7e-07 Identities = 33/167 (19%), Positives = 63/167 (37%), Gaps = 10/167 (5%) Query: 23 LLKGLDQEQANEVIAVLQMHNIEANKIDSGKLGYSITVAEPDFTAAVYWIKTYQLPPRPR 82 L L + ++A L NI + +G +I V + LP Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNI-PYRFANG--SGAIEVPADKVHELRLRLAQQGLPKGGA 109 Query: 83 VEIAQMFPADSLVSSPRAEKARLYSAIEQRLEQSLQTMEGVLSARVHISYDIDA---GEN 139 V + + S +E+ A+E L ++++T+ V SARVH++ + E Sbjct: 110 VGFE-LLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQ 168 Query: 140 GRPPKPVHLSALAVYERGSPLAHQISDIKRFLKNSFADVDYDNISVV 186 P V ++ QIS + + ++ A + N+++V Sbjct: 169 KSPSASVTVTLEPGRALDEG---QISAVVHLVSSAVAGLPPGNVTLV 212
>PF07212#Hyaluronoglucosaminidase Length = 336 Score = 28.1 bits (62), Expect = 0.044 Identities = 12/39 (30%), Positives = 21/39 (53%) Query: 234 MSTSTLKRKLAEEGTSFSDIYLSARMNQAAKLLRIGNHN 272 +S +K++ +GT+ IY+++ KLLRI N Sbjct: 241 LSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLG 279
>BACYPHPHTASE#Salmonella/Yersinia modular tyrosine phosphatase signature. Length = 468 Score = 304 bits (779), Expect = e-100 Identities = 67/212 (31%), Positives = 102/212 (48%), Gaps = 17/212 (8%) Query: 340 GKPVALAGSYPKNTPDALEAHMKMLLEKECSCLVVLTSEDQMQAKQ--LPPYFRGSYTFG 397 G +A YP LE+H +ML E L VL S ++ ++ +P YFR S T+G Sbjct: 252 GNTRTIACQYP--LQSQLESHFRMLAENRTPVLAVLASSSEIANQRFGMPDYFRQSGTYG 309 Query: 398 EVHTNSQKVSSASQGEAI--DQYNMQL-SCGEKRYTIPVLHVKNWPDHQPLPS--TDQLE 452 + S+ G+ I D Y + + G+K ++PV+HV NWPD + S T L Sbjct: 310 SITVESKMTQQVGLGDGIMADMYTLTIREAGQKTISVPVVHVGNWPDQTAVSSEVTKALA 369 Query: 453 YLADRVKNSNQNGAPGRSSS-----DKHLPMIHCLGGVGRTGTMAAALVLKDNPHSNL-- 505 L D+ + +N + SS K P+IHC GVGRT + A+ + D+ +S L Sbjct: 370 SLVDQTAETKRNMYESKGSSAVGDDSKLRPVIHCRAGVGRTAQLIGAMCMNDSRNSQLSV 429 Query: 506 EQVRADFRDSRNNRMLEDASQF-VQLKAMQAQ 536 E + + R RN M++ Q V +K + Q Sbjct: 430 EDMVSQMRVQRNGIMVQKDEQLDVLIKLAEGQ 461
>PF05932#Tir chaperone protein (CesT) Length = 127 Score = 44.0 bits (104), Expect = 1e-08 Identities = 17/128 (13%), Positives = 46/128 (35%), Gaps = 8/128 (6%) Query: 2 QAHQDIIANIGEKLGL-PLTFDDNNQCLLLLDSDIFTSIEAK--DDIWLLNGMIIPLSPV 58 ++ ++ + L + PL FDD+ C +++D+ ++ + LL G++ P Sbjct: 4 LFYKTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLEPH--- 60 Query: 59 CGDSIWRQIMVINGELAANNEGTLAYIDAAETLLLIHAI-TDLTNTYHIISQLESFVNQQ 117 D + ++ N L + + +I + + + ++ + Sbjct: 61 -KDIPQQCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWM 119 Query: 118 EALKNILQ 125 + Q Sbjct: 120 RGWREASQ 127
>BACINVASINC#Salmonella/Shigella invasin protein C signature. Length = 409 Score = 515 bits (1327), Expect = 0.0 Identities = 407/409 (99%), Positives = 408/409 (99%) Query: 1 MLISNVGINPAAYLNNHSVENSSQTASQSVSAKDILNSIGISSSKVSDLGLSPTLSAPAP 60 MLISNVGINPAAYLNNHSVENSSQTASQSVSAKDILNSIGISSSKVSDLGLSPTLSAPAP Sbjct: 1 MLISNVGINPAAYLNNHSVENSSQTASQSVSAKDILNSIGISSSKVSDLGLSPTLSAPAP 60 Query: 61 GVLTQTPGTITSFLKASIQNTDMNQDLNALANNVTTKANEVVQTQLREQQAEVGKFFDIS 120 GVLTQTPGTITSFLKASIQNTDMNQDLNALANNVTTKANEVVQTQLREQQAEVGKFFDIS Sbjct: 61 GVLTQTPGTITSFLKASIQNTDMNQDLNALANNVTTKANEVVQTQLREQQAEVGKFFDIS 120 Query: 121 GMSSSAVALLAAANTLMLTLNQADSKLSGKLSLVSFDAAKTTASSMMREGMNALSGSISQ 180 GMSSSAVALLAAANTLMLTLNQADSKLSGKLSLVSFDAAKTTASSMMREGMNALSGSISQ Sbjct: 121 GMSSSAVALLAAANTLMLTLNQADSKLSGKLSLVSFDAAKTTASSMMREGMNALSGSISQ 180 Query: 181 SALQLGITGVGAKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQNSVKLGAEGV 240 SALQLGITGVGAKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQNSVKLGAEGV Sbjct: 181 SALQLGITGVGAKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQNSVKLGAEGV 240 Query: 241 DSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIKDSNKQISPEHQAILSKRLESV 300 DSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIK+SNKQISPEHQAILSKRLESV Sbjct: 241 DSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIKNSNKQISPEHQAILSKRLESV 300 Query: 301 ESDIRLEQNTMDMTRIDARKMQMTGDLIMKNSVTVGGIAGASGQYAATQERSEQQISQVN 360 ESDIRLEQNTMDMTRIDARKMQMTGDLIMKNSVTVGGIAGAS QYAATQERSEQQISQVN Sbjct: 301 ESDIRLEQNTMDMTRIDARKMQMTGDLIMKNSVTVGGIAGASRQYAATQERSEQQISQVN 360 Query: 361 NRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASALAAIAGNIRA 409 NRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASALAAIAGNIRA Sbjct: 361 NRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASALAAIAGNIRA 409
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 842 bits (2176), Expect = 0.0 Identities = 593/593 (100%), Positives = 593/593 (100%) Query: 1 MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGE 60 MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGE Sbjct: 1 MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGE 60 Query: 61 SAINTVGLKPPTDAAREKLSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKE 120 SAINTVGLKPPTDAAREKLSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKE Sbjct: 61 SAINTVGLKPPTDAAREKLSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKE 120 Query: 121 MGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAATKKLTQAQNKLQSLDPADPG 180 MGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAATKKLTQAQNKLQSLDPADPG Sbjct: 121 MGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAATKKLTQAQNKLQSLDPADPG 180 Query: 181 YAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN 240 YAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN Sbjct: 181 YAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN 240 Query: 241 QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEF 300 QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEF Sbjct: 241 QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEF 300 Query: 301 QEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAA 360 QEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAA Sbjct: 301 QEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAA 360 Query: 361 TGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMV 420 TGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMV Sbjct: 361 TGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMV 420 Query: 421 AVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG 480 AVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG Sbjct: 421 AVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG 480 Query: 481 NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFML 540 NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFML Sbjct: 481 NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFML 540 Query: 541 ARFAMDQIQQWLKQSVEIFGENQKVTAELQKAMSSAVQQNADASRFILRQSRA 593 ARFAMDQIQQWLKQSVEIFGENQKVTAELQKAMSSAVQQNADASRFILRQSRA Sbjct: 541 ARFAMDQIQQWLKQSVEIFGENQKVTAELQKAMSSAVQQNADASRFILRQSRA 593
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 128 bits (322), Expect = 2e-40 Identities = 39/160 (24%), Positives = 72/160 (45%), Gaps = 4/160 (2%) Query: 4 QNNVSEERVAEMIWDAVSEGATLKDVHGIPQDMMDGLYAHAYEFYNQGRLDEAETFFRFL 63 Q + + + G T+ ++ I D ++ LY+ A+ Y G+ ++A F+ L Sbjct: 3 QETTDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQAL 62 Query: 64 CIYDFYNPDYTMGLAAVCQLKKQFQKACDLYAVAFTLLKNDYRPVFFTGQCQLLMRKAAK 123 C+ D Y+ + +GL A Q Q+ A Y+ + + R F +C L + A+ Sbjct: 63 CVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAE 122 Query: 124 ARQCF----ELVNERTEDESLRAKALVYLEALKTAETEQH 159 A EL+ ++TE + L + LEA+K + +H Sbjct: 123 AESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEH 162
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 340 bits (875), Expect = e-118 Identities = 120/360 (33%), Positives = 205/360 (56%), Gaps = 19/360 (5%) Query: 1 MSSNKTEKPTKKRLEDSAKKGQSFKSKDLIIACLTLGGIAYLVSYGSFN-EFMGIIKIII 59 MS KTE+PT K++ D+ KKGQ KSK+++ L + A L+ + E + +I Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60 Query: 60 ADNFDQSMADYSLAVFGIGLKYLIPFMLLCL---VCSALPAL----LQAGFVLATEALKP 112 +QS +S A+ + L+ F LC +AL A+ +Q GF+++ EA+KP Sbjct: 61 ---AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKP 117 Query: 113 NLSALNPVEGAKKLFSMRTVKDTVKTLLYLSSFVVAAIICWKKYKVEIFSQLNGNIVGIA 172 ++ +NP+EGAK++FS++++ + +K++L + V+ +I+ W K + + L GI Sbjct: 118 DIKKINPIEGAKRIFSIKSLVEFLKSILKV---VLLSILIWIIIKGNLVTLLQLPTCGIE 174 Query: 173 VIWRELLLALVLTCLACA---LIVLLLDAIAEYFLTMKDMKMDKEEVKREMKEQEGNPEV 229 I L L + C +++ + D EY+ +K++KM K+E+KRE KE EG+PE+ Sbjct: 175 CITPLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEI 234 Query: 230 KSKRREVHMEILSEQVKSDIENSRLIVANPTHITIGIYFKPELMPIPMISVYETNQRALA 289 KSKRR+ H EI S ++ +++ S ++VANPTHI IGI +K P+P+++ T+ + Sbjct: 235 KSKRRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQT 294 Query: 290 VRAYAEKVGVPVIVDIKLARSLFKTHRRYDLVSLEEIDEVLRLLVWLE--EVENAGKDVI 347 VR AE+ GVP++ I LAR+L+ + E+I+ +L WLE +E +++ Sbjct: 295 VRKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQHSEML 354
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 188 bits (478), Expect = 3e-61 Identities = 48/237 (20%), Positives = 104/237 (43%), Gaps = 4/237 (1%) Query: 12 LVASAALGFARVAPIFFFLPFLNSGVLSGAPRNAIIILVALGVWPHALNEAPPFLSVAMI 71 + RV + P L+ + + + +++ + P P S + Sbjct: 12 WLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFAL 71 Query: 72 PLVLQEAAVGVMLGCLLSWPFWVMHALGCIIDNQRGATLSSSIDPANGIDTSEMANFLNM 131 L +Q+ +G+ LG + + F + G II Q G + ++ +DPA+ ++ +A ++M Sbjct: 72 WLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDM 131 Query: 132 FAAVVYLQNGGLVTMVDVLNKSYQLCDPMNEC--TPSLPPLLTFINQVAQNALVLASPVV 189 A +++L G + ++ +L ++ E + + L + + N L+LA P++ Sbjct: 132 LALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLI 191 Query: 190 LVLLLSEVFLGLLSRFAPQMNAFAISLTVKSGIAVLIMLLYFS--PVLPDNVLRLSF 244 +LL + LGLL+R APQ++ F I + + + +M +++ F Sbjct: 192 TLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIF 248
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 88.7 bits (220), Expect = 4e-27 Identities = 86/86 (100%), Positives = 86/86 (100%) Query: 1 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCL 60 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCL Sbjct: 1 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCL 60 Query: 61 FLLSGWYGEVLLSYGRQVIFLALAKG 86 FLLSGWYGEVLLSYGRQVIFLALAKG Sbjct: 61 FLLSGWYGEVLLSYGRQVIFLALAKG 86
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 303 bits (777), Expect = e-107 Identities = 224/224 (100%), Positives = 224/224 (100%) Query: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS Sbjct: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60 Query: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL Sbjct: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120 Query: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL Sbjct: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180 Query: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYMDIAT 224 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYMDIAT Sbjct: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYMDIAT 224
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 539 bits (1389), Expect = 0.0 Identities = 303/303 (100%), Positives = 303/303 (100%) Query: 1 MSLRVRQIDRREWLLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIKPGDWL 60 MSLRVRQIDRREWLLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIKPGDWL Sbjct: 1 MSLRVRQIDRREWLLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIKPGDWL 60 Query: 61 EHVSPALAGAAVSAGAEHLVVPWLAATERPFELPVPHLSCRRLCVENPVPGSALPEGKLL 120 EHVSPALAGAAVSAGAEHLVVPWLAATERPFELPVPHLSCRRLCVENPVPGSALPEGKLL Sbjct: 61 EHVSPALAGAAVSAGAEHLVVPWLAATERPFELPVPHLSCRRLCVENPVPGSALPEGKLL 120 Query: 121 HIMSDRGGLWFEHLPELPAVGGGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTS 180 HIMSDRGGLWFEHLPELPAVGGGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTS Sbjct: 121 HIMSDRGGLWFEHLPELPAVGGGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTS 180 Query: 181 RAEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYR 240 RAEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYR Sbjct: 181 RAEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYR 240 Query: 241 KNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESG 300 KNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESG Sbjct: 241 KNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESG 300 Query: 301 NGE 303 NGE Sbjct: 301 NGE 303
>SSPANPROTEIN#Salmonella invasion protein InvJ signature. Length = 336 Score = 597 bits (1539), Expect = 0.0 Identities = 336/336 (100%), Positives = 336/336 (100%) Query: 1 MGDVSAVSSSGNILLPQQDEVGGLSEALKKAVEKHKTEYSGDKKDRDYGDAFVMHKETAL 60 MGDVSAVSSSGNILLPQQDEVGGLSEALKKAVEKHKTEYSGDKKDRDYGDAFVMHKETAL Sbjct: 1 MGDVSAVSSSGNILLPQQDEVGGLSEALKKAVEKHKTEYSGDKKDRDYGDAFVMHKETAL 60 Query: 61 PLLLAAWRHGAPAKSEHHNGNVSGLHHNGKSELRIAEKLLKVTAEKSVGLISAEAKVDKS 120 PLLLAAWRHGAPAKSEHHNGNVSGLHHNGKSELRIAEKLLKVTAEKSVGLISAEAKVDKS Sbjct: 61 PLLLAAWRHGAPAKSEHHNGNVSGLHHNGKSELRIAEKLLKVTAEKSVGLISAEAKVDKS 120 Query: 121 AALLSSKNRPLESVSGKKLSADLKAVESVSEVTDNATGISDDNIKALPGDNKAIAGEGVR 180 AALLSSKNRPLESVSGKKLSADLKAVESVSEVTDNATGISDDNIKALPGDNKAIAGEGVR Sbjct: 121 AALLSSKNRPLESVSGKKLSADLKAVESVSEVTDNATGISDDNIKALPGDNKAIAGEGVR 180 Query: 181 KEGAPLARDVAPARMAAANTGKPEDKDHKKVKDVSQLPLQPTTIADLSQLTGGDEKMPLA 240 KEGAPLARDVAPARMAAANTGKPEDKDHKKVKDVSQLPLQPTTIADLSQLTGGDEKMPLA Sbjct: 181 KEGAPLARDVAPARMAAANTGKPEDKDHKKVKDVSQLPLQPTTIADLSQLTGGDEKMPLA 240 Query: 241 AQSKPMMTIFPTADGVKGEDSSLTYRFQRWGNDYSVNIQARQAGEFSLIPSNTQVEHRLH 300 AQSKPMMTIFPTADGVKGEDSSLTYRFQRWGNDYSVNIQARQAGEFSLIPSNTQVEHRLH Sbjct: 241 AQSKPMMTIFPTADGVKGEDSSLTYRFQRWGNDYSVNIQARQAGEFSLIPSNTQVEHRLH 300 Query: 301 DQWQNGNPQRWHLTRDDQQNPQQQQHRQQSGEEDDA 336 DQWQNGNPQRWHLTRDDQQNPQQQQHRQQSGEEDDA Sbjct: 301 DQWQNGNPQRWHLTRDDQQNPQQQQHRQQSGEEDDA 336
>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type M signature. Length = 147 Score = 169 bits (429), Expect = 3e-57 Identities = 141/147 (95%), Positives = 143/147 (97%) Query: 1 MHSLTRIKVLQRRCTVFHSQCESILLRYQDEDRGLQAEEEAILEQIAGLKLLLDTLRAEN 60 MHSLTRIKVLQRRCTVFHSQCESILLRYQDEDR LQ EEEAI+EQIAGLKLLLDTLRAEN Sbjct: 1 MHSLTRIKVLQRRCTVFHSQCESILLRYQDEDRRLQVEEEAIVEQIAGLKLLLDTLRAEN 60 Query: 61 RQLSREEIYTLLRKQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQKKSKYWLRKEGNY 120 RQLSREEIY LLRKQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQ+KSKYWLRKEGNY Sbjct: 61 RQLSREEIYALLRKQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQEKSKYWLRKEGNY 120 Query: 121 QRWIIRQKRFYIQREIQQEEAESEEII 147 QRWIIRQKR YIQREIQQEEAESEEII Sbjct: 121 QRWIIRQKRLYIQREIQQEEAESEEII 147
>SSPAKPROTEIN#Invasion protein B family signature. Length = 133 Score = 206 bits (525), Expect = 3e-72 Identities = 43/133 (32%), Positives = 76/133 (57%) Query: 1 MQHLDIAELVRSALEVSGCDPSLIGGIDSHSTIVLDLFALPSICISVKDDDVWIWAQLGA 60 M ++++ +LVR +L GC PS+I +DSHS I + L ++P+I I++ ++ V +WA A Sbjct: 1 MSNINLVQLVRDSLFTIGCPPSIITDLDSHSAITISLDSMPAINIALVNEQVMLWANFDA 60 Query: 61 DSMVVLQQRAYEILMTIMEGCHFARGGQLLLGEQNGELTLKALVHPDFLSDGEKFSTALN 120 S V LQ AY IL ++ ++ + L + L L+ ++ D++ DG F+ L+ Sbjct: 61 PSDVKLQSSAYNILNLMLMNFSYSINELVELHRSDEYLQLRVVIKDDYVHDGIVFAEILH 120 Query: 121 GFYNYLEVFSRSL 133 FY +E+ + L Sbjct: 121 EFYQRMEILNGVL 133
>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature. Length = 372 Score = 604 bits (1558), Expect = 0.0 Identities = 371/372 (99%), Positives = 371/372 (99%) Query: 1 MIPGSTSGISFSRILSRQTSHQDATQHTDAQQAEIQQAAEDSSPGAEVQKFVQSTDEMSA 60 MIPGSTSGISFSRILSRQ SHQDATQHTDAQQAEIQQAAEDSSPGAEVQKFVQSTDEMSA Sbjct: 1 MIPGSTSGISFSRILSRQASHQDATQHTDAQQAEIQQAAEDSSPGAEVQKFVQSTDEMSA 60 Query: 61 ALAQFRNRRDYEKKSSNLSNSFERVLEDEALPKAKQILKLISVHGGALEDFLRQARSLFP 120 ALAQFRNRRDYEKKSSNLSNSFERVLEDEALPKAKQILKLISVHGGALEDFLRQARSLFP Sbjct: 61 ALAQFRNRRDYEKKSSNLSNSFERVLEDEALPKAKQILKLISVHGGALEDFLRQARSLFP 120 Query: 121 DPSDLVLVLRELLRRKDLEEIVRKKLESLLKHVEEQTDPKTLKAGINCALKARLFGKTLS 180 DPSDLVLVLRELLRRKDLEEIVRKKLESLLKHVEEQTDPKTLKAGINCALKARLFGKTLS Sbjct: 121 DPSDLVLVLRELLRRKDLEEIVRKKLESLLKHVEEQTDPKTLKAGINCALKARLFGKTLS 180 Query: 181 LKPGLLRASYRQFIQSESHEVEIYSDWIASYGYQRRLVVLDFIEGSLLTDIDANDASCSR 240 LKPGLLRASYRQFIQSESHEVEIYSDWIASYGYQRRLVVLDFIEGSLLTDIDANDASCSR Sbjct: 181 LKPGLLRASYRQFIQSESHEVEIYSDWIASYGYQRRLVVLDFIEGSLLTDIDANDASCSR 240 Query: 241 LEFGQLLRRLTQLKMLRSADLLFVSTLLSYSFTKAFNAEESSWLLLMLSLLQQPHEVDSL 300 LEFGQLLRRLTQLKMLRSADLLFVSTLLSYSFTKAFNAEESSWLLLMLSLLQQPHEVDSL Sbjct: 241 LEFGQLLRRLTQLKMLRSADLLFVSTLLSYSFTKAFNAEESSWLLLMLSLLQQPHEVDSL 300 Query: 301 LADIIGLNALLLSHKEHASFLQIFYQVCKAIPSSLFYEEYWQEELLMALRSMTDIAYKHE 360 LADIIGLNALLLSHKEHASFLQIFYQVCKAIPSSLFYEEYWQEELLMALRSMTDIAYKHE Sbjct: 301 LADIIGLNALLLSHKEHASFLQIFYQVCKAIPSSLFYEEYWQEELLMALRSMTDIAYKHE 360 Query: 361 MAEQRRTIEKLS 372 MAEQRRTIEKLS Sbjct: 361 MAEQRRTIEKLS 372
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 576 bits (1486), Expect = 0.0 Identities = 169/540 (31%), Positives = 271/540 (50%), Gaps = 57/540 (10%) Query: 4 HILLARVLACAALVLVTPGYSSE----KIPVTGSGFVAKDDSLRTFFDAMALQLKEPVIV 59 H RVL L+L + ++ E IP +VAK +SLR V+V Sbjct: 6 HSFFKRVLTGTLLLLSSYSWAQELDWLPIPYV---YVAKGESLRDLLTDFGANYDATVVV 62 Query: 60 SKMAARKKITGNFEFHDPNALLEKLSLQLGLIWYFDGQAIYIYDASEMRNAVVSLRNVSL 119 S K++G FE +P L+ ++ L+WY+DG +YI+ SE+ + ++ L+ Sbjct: 63 SD-KINDKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEA 121 Query: 120 NEFNNFLKRSGLYNKNYPLRGDNRKGTFYVSGPPVYVDMVVNAATMMDKQND--GIELGR 177 E L+RSG++ + R D YVSGPP Y+++V A +++Q + G Sbjct: 122 AELKQALQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGA 181 Query: 178 QKIGVMRLNNTFVGDRTYNLRDQKMVIPGIATAIERLLQGEEQPLGNIVSSEPPAMPAFS 237 I + L DRT + RD ++ PG+AT ++R+L + + P Sbjct: 182 LAIEIFPLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIP------ 235 Query: 238 ANGEKGKAANYAGGMSLQEALKQNAAAGNIKIVAYPDTNSLLVKGTAEQVHFIEMLVKAL 297 Q A + +A A ++ A P N+++V+ + E++ + L+ AL Sbjct: 236 -----------------QAATRASAQA---RVEADPSLNAIIVRDSPERMPMYQRLIHAL 275 Query: 298 DVAKRHVELSLWIVDLNKSDLERLGTSWSGSI-----------TIGDKLGVSLNQSSIST 346 D +E++L IVD+N L LG W I T GD+ ++ N + S Sbjct: 276 DKPSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQVVIKTTGDQSNIASNGALGSL 335 Query: 347 LDG---SRFIAAVNALEEKKQATVVSRPVLLTQENVPAIFDNNRTFYTKLIGERNVALEH 403 +D +A VN LE + A VVSRP LLTQEN A+ D++ T+Y K+ G+ L+ Sbjct: 336 VDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDHSETYYVKVTGKEVAELKG 395 Query: 404 VTYGTMIRVLPRFSADG---QIEMSLDIEDGNDKTPQSDTTTSVDALPEVGRTLISTIAR 460 +TYGTM+R+ PR G +I ++L IEDGN Q ++ ++ +P + RT++ T+AR Sbjct: 396 ITYGTMLRMTPRVLTQGDKSEISLNLHIEDGN----QKPNSSGIEGIPTISRTVVDTVAR 451 Query: 461 VPHGKSLLVGGYTRDANTDTVQSIPFLGKLPLIGSLFRYSSKNKSNVVRVFMIEPKEIVD 520 V HG+SL++GG RD + + +P LG +P IG+LFR S+ VR+F+IEP+ I + Sbjct: 452 VGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRIIDE 511
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 78.4 bits (193), Expect = 6e-18 Identities = 66/387 (17%), Positives = 150/387 (38%), Gaps = 48/387 (12%) Query: 16 FLDLINLFIASVAFPAMSVDLHTSISALAWVSNGYIAGLTLIVPFSAFLSRYLGARRLII 75 F ++N + +V+ P ++ D + ++ WV+ ++ ++ LS LG +RL++ Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 76 FSLILFSVAAAAAGFADSLHS-LVFWRIVQGAGGGLLIPVGQALTWQQFKPHERAGVSSV 134 F +I+ + S S L+ R +QGAG + + + R + Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 135 VMMVALLAPACSPAIGGLLVETCGWRWIFF------------------------------ 164 + + + PAIGG++ W ++ Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKG 203 Query: 165 -ATLPVAVLTLLL---AYRWLNVASTTMA------SARLLHLPLLTDRLLRFAMIVYLCV 214 + V ++ +L +Y + + ++ R + P + L + + + Sbjct: 204 IILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVL 263 Query: 215 PGMFIGISVVGM-----FYLQNVAQLSPAAAGS-LMIPWSIASFVAIMLTGRYFNRLGPR 268 G I +V G + +++V QLS A GS ++ P +++ + + G +R GP Sbjct: 264 CGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPL 323 Query: 269 PLIIVGCLLQAAGILLLTNVTPATSHRVLMMIFALMGAGGSLCSSTAQSGAFLTIARRDM 328 ++ +G + L + + TS + ++I ++G G S + + ++ +++ Sbjct: 324 YVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVISTIVSSSLKQQEA 382 Query: 329 PDASALWNLNRQISFFLGATLLTLLLN 355 +L N +S G ++ LL+ Sbjct: 383 GAGMSLLNFTSFLSEGTGIAIVGGLLS 409
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 84.1 bits (208), Expect = 1e-20 Identities = 55/217 (25%), Positives = 92/217 (42%), Gaps = 31/217 (14%) Query: 1 MQIIITGGGGFLGQKLASALLNSSL------AFNELLLVDLKMPARLS--DSPRLRCLEA 52 M+ ++TG GF+G ++ LL + N+ V LK ARL P + + Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQ-ARLELLAQPGFQFHKI 59 Query: 53 DLT-QPGVLESVITANTSVVYHLAA-------IVSSHAEDDFDLGWKVNLDLTRQLLEAC 104 DL + G+ + + + V+ + + HA D NL +LE C Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYAD------SNLTGFLNILEGC 113 Query: 105 RRQPQKIRFVFSSSLAVYGG--TLPECVTDTTALTPRSSYGAQKAACELLVNDYTRKGYV 162 R + +++SS +VYG +P D+ P S Y A K A EL+ + Y+ + Sbjct: 114 RHNKIQ-HLLYASSSSVYGLNRKMPFSTDDSVD-HPVSLYAATKKANELMAHTYSHLYGL 171 Query: 163 DGLALRLPTICVRPGKPNRAASSFVSAIIREPLQGET 199 LR T+ G+P+ A F A+ L+G++ Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAM----LEGKS 204
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 32.1 bits (73), Expect = 0.003 Identities = 17/84 (20%), Positives = 36/84 (42%), Gaps = 12/84 (14%) Query: 290 IVATADGRVVYAGNALRGYGNLIIIKHNDDYLSAYAHNDTMLVREQQEVKAGQKIATMGS 349 IVATA+G++ ++G + IK ++ ++V+E + V+ G + + + Sbjct: 82 IVATANGKLTHSGRSK-------EIKP---IENSIV--KEIIVKEGESVRKGDVLLKLTA 129 Query: 350 TGTSSTRLHFEIRYKGKSVNPLRY 373 G + L + + RY Sbjct: 130 LGAEADTLKTQSSLLQARLEQTRY 153
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 630 bits (1627), Expect = 0.0 Identities = 230/856 (26%), Positives = 383/856 (44%), Gaps = 66/856 (7%) Query: 19 SQATEFNASLLDSGNLSNVDLTAFSREGYVAPGNYILDIWLNDQPVREQYPVRVVPVAGL 78 S FN L + DL+ F + PG Y +DI+LN+ + + V Sbjct: 44 SAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYMATRD-VTFNTGDSE 102 Query: 79 DAAVICVTTDMVAMLGLKDKIIHGLKPVTGIPDGQCLELRSA--DSQVRYSAENQRLTFI 136 V C+T +A +GL + + + D C+ L S D+ + QRL Sbjct: 103 QGIVPCLTRAQLASMGLNTA---SVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLT 159 Query: 137 IPQAWMRYQDPDWVPPSRWSDGVTAGLLDYSLMVNRYMPQQGETSTSYSLYGTAGFNLGA 196 IPQA+M + ++PP W G+ AGLL+Y+ N + G S L +G N+GA Sbjct: 160 IPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGA 219 Query: 197 WRLRSDYQYSRFDS-GQGASQSDFYLPQTYLFRALPALRSKLTLGQTYLSSAIFDSFRFA 255 WRLR + +S S S++ + T+L R + LRS+LTLG Y IFD F Sbjct: 220 WRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFR 279 Query: 256 GLTLASDERMLPPSLQGYAPKISGIANSNAQVTVSQNGRILYQTRVSPGPFELPDLSQ-N 314 G LASD+ MLP S +G+AP I GIA AQVT+ QNG +Y + V PGPF + D+ Sbjct: 280 GAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAG 339 Query: 315 ISGNLDVSVRESDGSVRTWQVNTASVPFMARQGQVRYKVAAGRPLYGGTHNNSTVSPDFL 374 SG+L V+++E+DGS + + V +SVP + R+G RY + AG G N P F Sbjct: 340 NSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSG---NAQQEKPRFF 396 Query: 375 LGEATWGAFNNTSLYGGLIASTGDYQSAALGIGQNMGLLGALSADVTRSDARLPHGQKQS 434 G ++YGG + Y++ GIG+NMG LGALS D+T++++ LP + Sbjct: 397 QSTLLHGLPAGWTIYGGTQLA-DRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHD 455 Query: 435 GYSYRINYAKTFDKTGSTLAFVGYRFSDRHFLSMPEYLQRRTTDGGD------------- 481 G S R Y K+ +++G+ + VGYR+S + + + R Sbjct: 456 GQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKF 515 Query: 482 ------AWHEKQSYTVTYSQSVPVLNMSAALSVSRLNYWNAQ-SNNNYMLSLNKVFSLGD 534 A++++ +T +Q + + LS S YW + + LN F Sbjct: 516 TDYYNLAYNKRGKLQLTVTQQLG-RTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAF---- 570 Query: 535 LQGLSASVSFARNQYTGG-GSQNQVYATISIPWGDSR-----------QVSYSVQKDNRG 582 + ++ ++S++ + G + ++IP+ SYS+ D G Sbjct: 571 -EDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNG 629 Query: 583 GLQQTVNYSD--FHNPDTTWNISAGHNRYDTGSN-SSFSGSVQSRLPWGQAAADATLQPG 639 + + + ++++ G+ G++ S+ ++ R +G A + Sbjct: 630 RMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDD 689 Query: 640 QYRSLGLSWYGSVTATAHGAAFSQSMAGNEPRMMIDTGDVAGVPVNGNSGV-TNRFGVGV 698 + L G V A A+G Q + N+ +++ V +GV T+ G V Sbjct: 690 -IKQLYYGVSGGVLAHANGVTLGQPL--NDTVVLVKAPGAKDAKVENQTGVRTDWRGYAV 746 Query: 699 VSAGSSYRRSDISVDVAALPEDVDVSSSVISQVLTEGAVGYRKIDANQGEQVLGHIRLAD 758 + + YR + +++D L ++VD+ ++V + V T GA+ + A G ++L + + Sbjct: 747 LPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLT-HN 805 Query: 759 GASPPFGALVVSWKTGRTAGMVGDGGLAYLTGLSGEDRRTLNVSW--DGRVQCRLTLPET 816 PFGA+V S ++ +++G+V D G YL+G+ + V W + C Sbjct: 806 NKPLPFGAMVTS-ESSQSSGIVADNGQVYLSGM--PLAGKVQVKWGEEENAHCVANYQLP 862 Query: 817 VTLSRGPL---LLPCR 829 + L CR Sbjct: 863 PESQQQLLTQLSAECR 878
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 93.8 bits (233), Expect = 8e-27 Identities = 51/183 (27%), Positives = 76/183 (41%), Gaps = 17/183 (9%) Query: 1 MNKMLLAGSTGIVLLSAAASPVWADDNASTFSLGYAQSH-TNHAGTLRGVRLANNYEMSP 59 M K+ S +L+ A A ST + GYAQS + G L YE Sbjct: 1 MKKIACL-SALAAVLAFTAGTSVAA--TSTVTGGYAQSDAQGQMNKMGGFNLKYRYEEDN 57 Query: 60 D-WGLTTSFAWLNGSQRYSDESSNGRVTTRYYSLLAGPSWKINNQLSLYSQVGPVLLHQR 118 G+ SF + S SS +YY + AGP+++IN+ S+Y VG + Sbjct: 58 SPLGVIGSFTYTEKS---RTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQ 114 Query: 119 DH---GINESDSKVGYGYSAGVAYTPVSSVAITLGYEGADFDATHNSGSLNSNGFNLGVG 175 S G+ Y AG+ + P+ +VA+ YE + S++ + GVG Sbjct: 115 TTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQS------RIRSVDVGTWIAGVG 168 Query: 176 YRF 178 YRF Sbjct: 169 YRF 171
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 29.4 bits (66), Expect = 0.011 Identities = 38/160 (23%), Positives = 70/160 (43%), Gaps = 12/160 (7%) Query: 11 LVLIVIAIAINMIGGQLISMLKLPIFLDSIGTLISAVLLGPFIGMLTGLLTNLLWGLLTD 70 LV +V+ + + + LI + +P+ L +GT G I LT L GLL D Sbjct: 350 LVFLVMYLFLQNMRATLIPTIAVPVVL--LGTFAILAAFGYSINTLTMFGMVLAIGLLVD 407 Query: 71 PIAAAFAPVAMVIGLVAGWLARAGWFRTLPKVIVSGVVITLAVTLVAVPLRTALFGGVTG 130 V V+ + + +++ ++ G ++ +A+ L AV + A FGG TG Sbjct: 408 DAIVVVENVERVM-MEDKLPPKEATEKSMSQI--QGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 131 SGADLFVAWMHSMGQNLVESVAITVIGANLVDKILTAIIV 170 A +V ++A++V+ A ++ L A ++ Sbjct: 465 -------AIYRQFSITIVSAMALSVLVALILTPALCATLL 497
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.1 bits (62), Expect = 0.029 Identities = 10/21 (47%), Positives = 13/21 (61%) Query: 32 LALTGDNGAGKSTLLRIMAGL 52 + L G G GKSTL+ + GL Sbjct: 599 VVLEGTGGIGKSTLINTLVGL 619
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 68.1 bits (166), Expect = 1e-14 Identities = 34/187 (18%), Positives = 64/187 (34%), Gaps = 32/187 (17%) Query: 90 GLGSGVIIDAAKGYVLTNNHVINQAQKISIQL------------NDGREFDAKQIGGDDQ 137 + SGV++ K +LTN HV++ L +G + + Sbjct: 102 FIASGVVV--GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 138 SDIALLQIQN-------PSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIISALG 190 D+A+++ + ++++ + +V G P ++ + Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VATMW 212 Query: 191 RSGLNLEGLEN-FIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSIGIGFAIPSN 249 S + L+ +Q D S GNSG + N E+IGI+ I N Sbjct: 213 ESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW---GGVPNEFNGAVFINEN 269 Query: 250 MAQTLAQ 256 + L Q Sbjct: 270 VRNFLKQ 276
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 53.5 bits (128), Expect = 4e-10 Identities = 31/160 (19%), Positives = 59/160 (36%), Gaps = 26/160 (16%) Query: 77 RTLGSGVIMDQRGYIITNKHVINDADQIIVALQ------------DGRVFEALLVGSDSL 124 + SGV++ + ++TNKHV++ AL+ +G + Sbjct: 101 TFIASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 125 TDLAVLKI-------NATGGLPTIPINTKRTPHIGDVVLAIGNPYNLGQTITQGIISATG 177 DLA++K + + ++ + + G P + T + G Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKG 216 Query: 178 RIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINT 217 +I + +Q D S GNSG + N E++GI+ Sbjct: 217 KI---TYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.0 bits (70), Expect = 0.014 Identities = 17/67 (25%), Positives = 29/67 (43%), Gaps = 7/67 (10%) Query: 508 ASSAPVQAAAPA-------GAGTPVTAPLAGNIWKVIATEGQTVAEGDVLLILEAMKMET 560 + V+ A A G + + ++I EG++V +GDVLL L A+ E Sbjct: 75 SVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA 134 Query: 561 EIRAAQA 567 + Q+ Sbjct: 135 DTLKTQS 141 Score = 29.4 bits (66), Expect = 0.048 Identities = 15/56 (26%), Positives = 22/56 (39%), Gaps = 10/56 (17%) Query: 535 KVIATEGQTVAEGDVLLILEAMKMETEIRAAQAGTVRGIAVKSGDAVSVGDTLMTL 590 V G+ G EI+ + V+ I VK G++V GD L+ L Sbjct: 82 IVATANGKLTHSGRSK----------EIKPIENSIVKEIIVKEGESVRKGDVLLKL 127
>FLAGELLIN#Flagellin signature. Length = 507 Score = 35.8 bits (82), Expect = 0.002 Identities = 39/301 (12%), Positives = 67/301 (22%), Gaps = 2/301 (0%) Query: 427 TTNFAGDIAVSGGGTAIIIDGDNATIKNTGTSDISGAGSTGTVIDGNNARVNNDGDMTIT 486 T F G +S I G N T T D+ +DG N + + Sbjct: 128 QTQFNGVKVLSQDNQMKIQVGANDG--ETITIDLQKIDVKSLGLDGFNVNGPKEATVGDL 185 Query: 487 DGGTGGHITGDNVVIDNAGSTTVSGADATALYIEGDNALVINEGNQTISGGAVGTRIDGD 546 D + + A N + Sbjct: 186 KSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNT 245 Query: 547 DAHTTNTGDIAVDGAGSAAVIINGDNGSLTQAGDLLVTDGAMGIITYGTGNEAKNTGNAT 606 T A + A+ G D + T GN +T Sbjct: 246 AVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTING 305 Query: 607 VRDADSVGFVVAGEKNTFKNKGDIDVSLNGTGALVSGDMSQVTLDGDINVVSVQDSEGVF 666 + +V + AG N ++ + T + + ++ + V Sbjct: 306 EKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVK 365 Query: 667 SSATGVSVSGDSNAVDITGNVNISADYGQDDLAAGAPPLTGVVVGGNGNTVTLNGALNID 726 + + A V ++ D A T N +ID Sbjct: 366 GESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASID 425 Query: 727 D 727 Sbjct: 426 S 426
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 40.0 bits (93), Expect = 2e-05 Identities = 36/197 (18%), Positives = 61/197 (30%), Gaps = 18/197 (9%) Query: 146 ANATQPAPGATSAEQTAGNTSQDISLPPISSTPTQGQSPVVADGQQRVEVQGDLNNALTQ 205 N Q + + + +PP + + VA+ ++ + N Sbjct: 1000 PNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDAT 1059 Query: 206 NPEQMNNVAVN---STLPTEPATVAPVRNGSTTRQAAVSEPAERHTTRPERKQAV----- 257 N S + T ++GS T++ +E E T E K V Sbjct: 1060 ETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKT 1119 Query: 258 ---------IEPKKPQTTAKTTTAEPKKPVAP-VKRTEPAAPAATPKATTTTAAPTATAS 307 + PK+ Q+ AEP + P V EP + T T A T++ Sbjct: 1120 QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179 Query: 308 AAPVQTAKPAQASTTPV 324 PV + + V Sbjct: 1180 EQPVTESTTVNTGNSVV 1196
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 32.5 bits (74), Expect = 6e-04 Identities = 27/91 (29%), Positives = 40/91 (43%), Gaps = 18/91 (19%) Query: 32 FYDSDQEIEKRTGADVGWVFDVEGEDGFRN----------REEKVINELTEKQGIVLATG 81 FYD + KR + GW+ + G+R E + I +L E+ IV+A+G Sbjct: 136 FYDEETA--KRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKLVERGVIVIASG 193 Query: 82 GGSVKSRETRNRLSARGVVVYLETTIEKQLA 112 GG V + +GV E I+K LA Sbjct: 194 GGGVPVILEDGEI--KGV----EAVIDKDLA 218
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 268 bits (687), Expect = 2e-86 Identities = 82/301 (27%), Positives = 134/301 (44%), Gaps = 18/301 (5%) Query: 117 LENRSISLQYADAGELAKAGEKLLSAKGTIMVDKRTNRLLLRDNRAALAELEKWVSQMDL 176 L + +I D + +A SA+ + D N +++RD+ + ++ + +D Sbjct: 219 LSDATIQQVTVDNQRIPQAAT-RASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDK 277 Query: 177 PVAQVELAAHIVTINEKSLRELGVKWTLADATQAGSVGDVTTLSSDLSVAAATSRVGFNI 236 P A++E+A IV IN L ELGV W + T + T ++A+ G Sbjct: 278 PSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQVVIKTTGDQSNIASN----GALG 333 Query: 237 GRINGRLLDL---ELSALEQKQLLDIIASPRLLASHLQPASIKQGSEIPYQVSSGESGAT 293 ++ R LD ++ LE + +++ P LL A I SE Y +G+ A Sbjct: 334 SLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDH-SETYYVKVTGKEVA- 391 Query: 294 SVEFKEAVLG--MEVTPTVLQKG---RIRLKLHISQNVPGQVLQQADGEVLAIDKQEIET 348 E K G + +TP VL +G I L LHI +G + I + ++T Sbjct: 392 --ELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIEG-IPTISRTVVDT 448 Query: 349 QVEVKSGETLALGGIFSRKNKSGSDSVPLLGDIPWLGQLFRHDGKEDERRELVVFITPRL 408 V G++L +GGI+ + VPLLGDIP++G LFR + R + I PR+ Sbjct: 449 VARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRI 508 Query: 409 V 409 + Sbjct: 509 I 509 Score = 29.5 bits (66), Expect = 0.029 Identities = 26/139 (18%), Positives = 47/139 (33%), Gaps = 24/139 (17%) Query: 1 MKRWIAIILIALMPAAQAG----KAAKVTLVVDDVPVVQVLQTLAEQERQNLVVSPDVSG 56 KR + L+ L + A V + +L +VVS ++ Sbjct: 9 FKRVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKIND 68 Query: 57 TLSLHLTDVPWKQALQTVVNSAGLVLRQEGNILHVHSQAWQKEHSARQDAERLRFQANLP 116 +S + LQ + + LV +GN+L++ + Sbjct: 69 KVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVA------------------ 110 Query: 117 LENRSISLQYADAGELAKA 135 +R I LQ ++A EL +A Sbjct: 111 --SRLIRLQESEAAELKQA 127
>PF06580#Sensor histidine kinase Length = 349 Score = 33.7 bits (77), Expect = 0.001 Identities = 27/188 (14%), Positives = 71/188 (37%), Gaps = 45/188 (23%) Query: 270 INKDIEECNAIIEQFIDYLR------TGQEMPM--EMADLNSVL-------GEVIAAESG 314 I +D + ++ + +R +++ + E+ ++S L + + Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQ---- 241 Query: 315 YEREINTALQAGSIQVKMHPLSIKRAVANMVVNA--ARYGNGWIKVSSGTESHRAWFQVE 372 +E +IN A+ V++ P+ ++ V N + + G I + ++ +VE Sbjct: 242 FENQINPAIM----DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVE 297 Query: 373 DDGPGIKPEQRKHLFQPFVRGDSARSTSGTGLGLAIV-QRIIDNH--NGMLEIGTSERGG 429 + G ++ TG GL V +R+ + +++ + ++G Sbjct: 298 NTGSLALKNTKE----------------STGTGLQNVRERLQMLYGTEAQIKL-SEKQGK 340 Query: 430 LSIRAWLP 437 ++ +P Sbjct: 341 VNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 98.4 bits (245), Expect = 6e-26 Identities = 39/136 (28%), Positives = 72/136 (52%), Gaps = 3/136 (2%) Query: 6 KILVVDDDMRLRALLERYLTEQGFQVRSVANAEQMDRLLTRESFHLMVLDLMLPGEDGLS 65 ILV DDD +R +L + L+ G+ VR +NA + R + L+V D+++P E+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 66 ICRRLRSQSNPMPIIMVTAKGEEVDRIVGLEIGADDYIPKPFNPRELLARIRAVL---RR 122 + R++ +P+++++A+ + I E GA DY+PKPF+ EL+ I L +R Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 123 QANELPGAPSQEEAVI 138 + ++L ++ Sbjct: 125 RPSKLEDDSQDGMPLV 140
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 41.8 bits (98), Expect = 9e-06 Identities = 42/142 (29%), Positives = 66/142 (46%), Gaps = 30/142 (21%) Query: 1 MKKLTIGLIGNPNSGKTTLFNQL---TGARQRVGNW-AGVTV------ERKEG---QFAT 47 MK + IG++ + ++GKTTL L +GA +G+ G T ER+ G Q Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60 Query: 48 T-----DHQVTLVDLPGTYSLTTISSQTSLDEQIACHYILSGDADLLINVVDASNLE-RN 101 T + +V ++D PG + SL +L G A LLI+ D + R Sbjct: 61 TSFQWENTKVNIIDTPG-HMDFLAEVYRSLS-------VLDG-AILLISAKDGVQAQTRI 111 Query: 102 LYLTLQLLELGIPCIVALNMLD 123 L+ L+ ++GIP I +N +D Sbjct: 112 LFHALR--KMGIPTIFFINKID 131
>PF06580#Sensor histidine kinase Length = 349 Score = 27.5 bits (61), Expect = 0.038 Identities = 12/26 (46%), Positives = 16/26 (61%), Gaps = 5/26 (19%) Query: 96 AKMRKVADDAPLMERVEYALQSQINP 121 KM +A +A LM AL++QINP Sbjct: 152 WKMASMAQEAQLM-----ALKAQINP 172
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.2 bits (68), Expect = 0.041 Identities = 18/115 (15%), Positives = 42/115 (36%), Gaps = 6/115 (5%) Query: 534 QSEIQFAQGFLQAAWETQERAFQLIKEQHLEQLPMHEFLVRIRAQLL------WAWARLD 587 +++ Q L A Q R L + L +LP + Q + + + Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192 Query: 588 EAEASARSGIAVLSTFQPQQQLQCLTLLVQCSLARGDLDNARSQLNRLENLLGNG 642 E ++ ++ +++ + LT+L + + +S+L+ +LL Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQ 247
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 229 bits (585), Expect = 3e-71 Identities = 86/268 (32%), Positives = 138/268 (51%), Gaps = 10/268 (3%) Query: 173 QEREETLNFLKSGIATRNPCFNRMIEQIERVAIRSRSPILLNGPTGAGKSFLARRIYELK 232 + E + + R+ + + R+ ++ +++ G +G GK +AR +++ Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLM-QTDLTLMITGESGTGKELVARALHDYG 184 Query: 233 LARHQFSGPFVEVNCATLRGDTAMSALFGHVKGAFTGAREERAGLLRSADGGMLFLDEVG 292 R+ GPFV +N A + D S LFGH KGAFTGA+ G A+GG LFLDE+G Sbjct: 185 KRRN---GPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIG 241 Query: 293 ELGADEQAMLLKAIEEKRFYPFGSDQQVSSDFQLIAGTVRDLRQRVAEGTFREDLYARIN 352 ++ D Q LL+ +++ + G + SD +++A T +DL+Q + +G FREDLY R+N Sbjct: 242 DMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLN 301 Query: 353 LWTFELPGLRQRQEDIEPNLDYELERHAALTGDSVRFNTEARRAWLSFATSPQAAWRGNF 412 + LP LR R EDI + + +++ D RF+ EA + W GN Sbjct: 302 VVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAH------PWPGNV 355 Query: 413 RELSASVTRMATLADNGRITVETVDDEI 440 REL V R+ L IT E +++E+ Sbjct: 356 RELENLVRRLTALYPQDVITREIIENEL 383
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 32.1 bits (73), Expect = 0.001 Identities = 14/48 (29%), Positives = 26/48 (54%), Gaps = 5/48 (10%) Query: 1 MKQTQRHDAIIELVKKQGYVSTEELV-----EHFSVSPQTIRRDLNDL 43 M + QRH I E++ + +ELV + ++V+ T+ RD+ +L Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL 48
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.0 bits (67), Expect = 0.024 Identities = 15/114 (13%), Positives = 34/114 (29%), Gaps = 2/114 (1%) Query: 17 DKEQKQEQTEEQQIVEEQRPVEPPVETAADVDAQTPAHSKAETEAFAEEVVDVTEKVQES 76 +++ K E + Q++ + V P E + V Q + + +E T ++ Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADT 1168 Query: 77 EKP-QPVEPEPAAAIETAAPQIAVEREELPLPEEVKDEAISPEEWQAEAETVEV 129 E+P + + + PE P + + Sbjct: 1169 EQPAKETSSNVEQPVTESTTVNTGNSVV-ENPENTTPATTQPTVNSESSNKPKN 1221
>SHIGARICIN#Ribosome inactivating protein family signature. Length = 289 Score = 26.7 bits (59), Expect = 0.027 Identities = 6/29 (20%), Positives = 16/29 (55%) Query: 7 FFIIIIALIVVAASFRFVQQRREKAANEA 35 +++I AA ++F++Q+ K ++ Sbjct: 173 ALMVLIQSTSEAARYKFIEQQIGKRVDKT 201
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 30.2 bits (68), Expect = 0.038 Identities = 17/78 (21%), Positives = 34/78 (43%), Gaps = 3/78 (3%) Query: 336 AEERRAPIERFIDRFSRIYTPVIMVIALLVTLIPPLMFDGGWQEWIYKGLTLLLIGCPCA 395 E++ P E S+I ++ + +L + P+ F GG IY+ ++ ++ A Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVS---A 477 Query: 396 LVISTPAAITSGLAAAAR 413 + +S A+ A A Sbjct: 478 MALSVLVALILTPALCAT 495
>PF01206#SirA family protein Length = 76 Score = 101 bits (254), Expect = 2e-32 Identities = 28/72 (38%), Positives = 42/72 (58%) Query: 9 DHTLDALGLRCPEPVMMVRKTVRNMQTGETLLIIADDPATTRDIPGFCTFMEHDLLAQET 68 D +LDA GL CP P++ +KT+ M GE L ++A DP + +D F H+LL Q+ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 69 EGLPYRYLLRKA 80 E Y + L++A Sbjct: 65 EDGTYHFRLKRA 76
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 50.2 bits (120), Expect = 7e-09 Identities = 75/399 (18%), Positives = 141/399 (35%), Gaps = 34/399 (8%) Query: 27 LRLNLRIVSIVMFNFASYLTIGLPLAVLPGYVHD--AMGFSAFWAGLIISLQYFATLLSR 84 ++ N ++ I+ + IGL + VLPG + D G++++L Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60 Query: 85 PHAGRYADVLGPKKIVVFGLCGCFLSGLGYLLADIASAWPMISLLLLGLGRVILGI-GQS 143 P G +D G + +++ L G + + Y + A L +L +GR++ GI G + Sbjct: 61 PVLGALSDRFGRRPVLLVSLAG---AAVDYAIMATAPF-----LWVLYIGRIVAGITGAT 112 Query: 144 FAGTGSTLWGVGVVGSLHIGRVISWNGIVTYGAMAMGAPLGVLCYAWGGLQGL--ALTVM 201 A G+ + + R + M G LG L + A + Sbjct: 113 GAVAGAYIADITDGDER--ARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALN 170 Query: 202 GVALLAVLLALPRPSVK----ANKGKPLPFRAVLGRVWLYGMALALA-----SAGFGVIA 252 G+ L LP + P + + +A +A V A Sbjct: 171 GLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPA 230 Query: 253 TFITLFYDAK-GWDGAAFALTLFSVAFVGT---RLLFPNGINRLGGLNVAMICFGVEIIG 308 +F + + WD ++L + + + ++ RLG M+ + G Sbjct: 231 ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTG 290 Query: 309 LLLVGTAAMPWMAKIGVLLTGMGFSLVFPALGVVAVKAVPPQNQGAALATYTVFMDMSLG 368 +L+ A WMA ++L + PAL + + V + QG + ++ Sbjct: 291 YILLAFATRGWMAFPIMVLLA-SGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLT-S 348 Query: 369 VTGPLAGLVMTWAGVPV----IYLAAAGLVAMALLLTWR 403 + GPL + A + ++A A L + L R Sbjct: 349 IVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 32.7 bits (74), Expect = 6e-04 Identities = 25/93 (26%), Positives = 44/93 (47%), Gaps = 6/93 (6%) Query: 30 RRASWLAGRVLLSRALSPL---PEMVYGEQGKPAFSAGAPLWFNLSHSGDTIALLLSDEG 86 R+A LAGR+ AL + G++ +P + G L+ ++SH T ++S + Sbjct: 46 RKAEHLAGRIAAVHALREVGVRTVPGMGDKRQPLWPDG--LFGSISHCATTALAVISRQ- 102 Query: 87 EVGCDIEVIRPRDNWRSLANAVFSLGEHAEMEA 119 +G DIE I + LA ++ E ++A Sbjct: 103 RIGIDIEKIMSQHTATELAPSIIDSDERQILQA 135
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 48.0 bits (114), Expect = 2e-08 Identities = 43/171 (25%), Positives = 73/171 (42%), Gaps = 7/171 (4%) Query: 200 REREHGTVEHLLVMPVTPFEIMMAKV-WSMGLVVLVVSGLSLMLMVKGVLGVPIEGSIPL 258 R T E +L + +I++ ++ W+ L +G + +V LG + L Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAG---IGVVAAALGY-TQWLSLL 148 Query: 259 FMLGV-ALSLFATTSIGIFMGTIARSMPQLGLLMILVLLPLQMLSGGSTPRESMPQAVQD 317 + L V AL+ A S+G+ + +A S LV+ P+ LSG P + +P Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 318 IMLTMPTTHFVSLAQAILYRGAGLSIVWPQFLTLLAIGGVFFL-IALLRFR 367 +P +H + L + I+ + + + I FFL ALLR R Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259
>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature. Length = 591 Score = 32.8 bits (74), Expect = 0.001 Identities = 45/160 (28%), Positives = 64/160 (40%), Gaps = 30/160 (18%) Query: 93 DFFTRHHLLASVNVDGPTLIAMRRQPDILAAMERLPWLRFELV----EHIRLPKDSSFAS 148 DF+ H +++ G T A R D AA WL E V EHI ++ Sbjct: 157 DFWLLHDSNGILHLLGKT--AAARLSDPQAASHTAQWLVEESVTPAGEHI------YYSY 208 Query: 149 MCEFGPLWLDDFGTGMANFSA---LSEVRYDYIKVARELFVMLRQSPEGRNLFTLLLQLM 205 + E G + + SA LS+V+Y A +L++ +P + LFTL+ Sbjct: 209 LAENGDNVDLNGNEAGRDRSAMRYLSKVQYGNATPAADLYLWTSATPAVQWLFTLVFDYG 268 Query: 206 NRYCRGVIVEGVETLEEWRDVQRSPAFAAQGYFLSRPVPL 245 RGV D Q PAF AQ +L+R P Sbjct: 269 E---RGV------------DPQVPPAFTAQNSWLARQDPF 293
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.2 bits (70), Expect = 0.008 Identities = 14/42 (33%), Positives = 19/42 (45%) Query: 162 VVKEVNRDGEVVWEWRAWEHLNPEDFPIHDIFDRRHWPMING 203 V RDG W+WR W+ P FP H + R ++ G Sbjct: 194 VYSRSQRDGSEAWKWRGWDDPRPLYFPSHRAPESRTVVLVEG 235
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 802 bits (2074), Expect = 0.0 Identities = 374/863 (43%), Positives = 521/863 (60%), Gaps = 56/863 (6%) Query: 19 ALALMIAGTLPAYAGTFNPRFLEDVPGIDQHVDLSMYESNKAEHLPGKYRVSVVVNEKKM 78 A A L + FNPRFL D P DLS +E N E PG YRV + +N M Sbjct: 33 ACAFAAQAPLSSAELYFNPRFLADDPQ--AVADLSRFE-NGQELPPGTYRVDIYLNNGYM 89 Query: 79 ESRTLEFKAATEAQRAKMGESLVPCLSRVQLEDMGVRIDSFPALKMAPPEACVAFDDIIP 138 +R + F + +VPCL+R QL MG+ S + + +ACV +I Sbjct: 90 ATRDVTFNTG------DSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIH 143 Query: 139 QAASHFDFADQTLIMSFPQAAMKQTARGTVPESQWDEGVNALLVDYNFSGSNASYDAHDS 198 A + D Q L ++ PQA M ARG +P WD G+NA L++YNFSG++ + Sbjct: 144 DATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSV-----QN 198 Query: 199 ETSYNSDSYYLNLRSGMNLGAWRLRNYSTWTRNDGNNT------WDNIGTSLSRAIVPLK 252 NS YLNL+SG+N+GAWRLR+ +TW+ N +++ W +I T L R I+PL+ Sbjct: 199 RIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLR 258 Query: 253 SQLTLGDTSTAGDIFDSVQMRGVQLTSDEEMLPDSQRGFAPVIRGIAKSNAEVTVEQNNY 312 S+LTLGD T GDIFD + RG QL SD+ MLPDSQRGFAPVI GIA+ A+VT++QN Y Sbjct: 259 SRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGY 318 Query: 313 VIYRTFVQPGAFEINDLYPTSNSGDLTVTIKESDGSEQKFVQPFSSVALLQREGHLKYSL 372 IY + V PG F IND+Y NSGDL VTIKE+DGS Q F P+SSV LLQREGH +YS+ Sbjct: 319 DIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSI 378 Query: 373 SAGEYRAGNYNSAEPKFGQLDAMYGLPYGFTVYGGAIFSDDYYSLAGGLGKNFGYIGAIS 432 +AGEYR+GN +P+F Q ++GLP G+T+YGG +D Y + G+GKN G +GA+S Sbjct: 379 TAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALS 438 Query: 433 IDVTQAKSKLANEENSEGQSYRFLYSKSFN-SGTDFRLLGYKYSTSGYYTFQEATDVRSD 491 +D+TQA S L ++ +GQS RFLY+KS N SGT+ +L+GY+YSTSGY+ F + T R + Sbjct: 439 VDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMN 498 Query: 492 A-----------------DSSYSQYHKRSQIQGNVTQQLGAWGSVYFNVTQQDYWNDEGK 534 D Y+KR ++Q VTQQLG ++Y + + Q YW Sbjct: 499 GYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNV 558 Query: 535 QRSLNAGYNGRIGRVNYSVAYTWTKSPEWDESDRLLSFSMSIPLGR------------VW 582 AG N +N++++Y+ TK+ D++L+ +++IP Sbjct: 559 DEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHAS 618 Query: 583 SNYHLTTDQHGRTNQQLGVSGTALEDHNLNYSVQEGY---GSNGVGNSGSVNLDYQGGVG 639 ++Y ++ D +GR GV GT LED+NL+YSVQ GY G G++G L+Y+GG G Sbjct: 619 ASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYG 678 Query: 640 SASLGYNYNRDGQQVNYGLRGGVIAHSEGITLSQPLGESMAIISAPGARGAHVINNGGVE 699 +A++GY+++ D +Q+ YG+ GGV+AH+ G+TL QPL +++ ++ APGA+ A V N GV Sbjct: 679 NANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVR 738 Query: 700 VDWMGNAVVPYLTPYRETEVSLRSDSLNNQVDLDTASVNVVPTRGAIVRARFDTRVGYRV 759 DW G AV+PY T YRE V+L +++L + VDLD A NVVPTRGAIVRA F RVG ++ Sbjct: 739 TDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKL 798 Query: 760 LMNLTQANGKAVPFGATATLLDTTKESSSIVGEDGQLYISGMPEKGALQVNWGKDQAQQC 819 LM LT N K +PFGA T + +SS IV ++GQ+Y+SGMP G +QV WG+++ C Sbjct: 799 LMTLTH-NNKPLPFGAMVT--SESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHC 855 Query: 820 RVAFTLPEQQDNTGVVMANAVCR 842 + LP + + +A CR Sbjct: 856 VANYQLPPESQQQLLTQLSAECR 878
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.1 bits (78), Expect = 8e-05 Identities = 20/52 (38%), Positives = 26/52 (50%), Gaps = 5/52 (9%) Query: 76 VAPDALRHGIGKALL----EYVQQR-FPLLSLEVYQKNQSAVNFYHALGFRI 122 VA D + G+G ALL E+ ++ F L LE N SA +FY F I Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 116 bits (293), Expect = 1e-33 Identities = 43/124 (34%), Positives = 64/124 (51%), Gaps = 11/124 (8%) Query: 108 LNMPNNVTFDSSSATLKPAGANTLTGVAMVLKEY--PKTAVNVVGYTDSTGSHDLNMRLS 165 + ++V F+ + ATLKP G L + L +V V+GYTD GS N LS Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274 Query: 166 QQRADSVASSLITQGVDASRIRTSGMGPANPIASNSTAEGK---------AQNRRVEITL 216 ++RA SV LI++G+ A +I GMG +NP+ N+ K A +RRVEI + Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334 Query: 217 SPLQ 220 ++ Sbjct: 335 KGIK 338
>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature. Length = 170 Score = 234 bits (598), Expect = 2e-82 Identities = 91/153 (59%), Positives = 118/153 (77%), Gaps = 4/153 (2%) Query: 3 EQNNTEMAFQIQRIYTKDVSFEAPNAPHVFQKDWQPEVKLDLDTASSQLADDVYEVVLRV 62 Q + QIQRIY KDVSFEAPN PH+FQ+DW+P++ DL T + Q+ DD+YEV L + Sbjct: 12 TQATQQPVLQIQRIYVKDVSFEAPNLPHIFQQDWEPKLSFDLSTEAKQVGDDLYEVCLNI 71 Query: 63 TVTASLGEE--TAFLCEVQQAGIFSISGIEGTQMAHCLGAYCPNILFPYARECITSLVSR 120 +V ++ AF+CEV+QAG+F+ISG+E QMAHCL + CPN+LFPYARE ++SLV+R Sbjct: 72 SVETTMESSGDVAFICEVKQAGVFTISGLEEMQMAHCLTSQCPNMLFPYARELVSSLVNR 131 Query: 121 GTFPQLNLAPVNFDALFMNYL--QQQAGEGTEE 151 GTFP LNL+PVNFDALFM+YL Q+QA + TEE Sbjct: 132 GTFPALNLSPVNFDALFMDYLQRQEQAEQTTEE 164
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 47.1 bits (112), Expect = 7e-08 Identities = 25/196 (12%), Positives = 62/196 (31%), Gaps = 21/196 (10%) Query: 45 RDQLKSIQADIAAKERDVRQQQQQRASLLAQLKAQEEAISAAARKLRETQSTLDQLNAQI 104 ++ + + Q +L +E S + + + LD+ A+ Sbjct: 157 SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER 216 Query: 105 DEMNASIAKLEQQKASQERNLAAQLDAAFRQGEHTGIQLILSGEESQRGQRLQAYFGYLN 164 + A I + E ++ L + + ++ Q + ++A Sbjct: 217 LTVLARINRYENLSRVEKSRLDD-FSSLLHKQAIAKHAVL-----EQENKYVEA-----V 265 Query: 165 QARQETIAELKQTREQVATQKAELEEKQSQQQTLLYEQRAQ-QAKLEQARNERKKTLAGL 223 + ++L+Q ++ + K E + + + ++ Q + E K Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE-- 323 Query: 224 ESSIQQGQQQLSELRA 239 +QQ S +RA Sbjct: 324 -------RQQASVIRA 332
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 99.5 bits (248), Expect = 3e-26 Identities = 75/348 (21%), Positives = 125/348 (35%), Gaps = 67/348 (19%) Query: 2 IIVTGGAGFIGSNIVKALNDKGITDILVVDNLKD--------------GTKFVNLVDLNI 47 +VTG AGFIG ++ K L + G ++ +DNL D +++ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 48 ADYMDKEDFLIQIMSGEELGDIEAIFHEGACSSTTEWDGKYMMDNNYQYSK-------EL 100 AD + + + G E +F + +Y ++N + Y+ + Sbjct: 62 ADR----EGMTDLF---ASGHFERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNI 109 Query: 101 LHYCLEREIP-FLYASSAATYGGRTSD-FIESREYEKPLNVYGYSKFLFDEYVRQILPEA 158 L C +I LYASS++ YG F + P+++Y +K + Sbjct: 110 LEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169 Query: 159 NSQIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGESPKLFEGSENFKRDFVYVGDVA 218 G R+F VYGP + MA F + G+S ++ KRDF Y+ D+A Sbjct: 170 GLPATGLRFFTVYGPWG--RPDMA--LFKFTKAMLEGKSIDVY-NYGKMKRDFTYIDDIA 224 Query: 219 AVNL------------WFLESGKSG-------IFNLGTGRAESFQAVADATLAY-HKKGS 258 + W +E+G ++N+G A + Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284 Query: 259 IEYIPFPDKLKGRYQAFTQADLTNLRNA-GYDKPFKTVAEGVTEYMAW 305 +P G T AD L G+ P TV +GV ++ W Sbjct: 285 KNMLPLQ---PGDVL-ETSADTKALYEVIGF-TPETTVKDGVKNFVNW 327
>SECA#SecA protein signature. Length = 901 Score = 41.0 bits (96), Expect = 2e-05 Identities = 27/79 (34%), Positives = 38/79 (48%), Gaps = 7/79 (8%) Query: 291 MRLVQGDV-----GSGKTLVAALAA-LRAIAHGKQVALMAPTELLAEQHANNFRNWFEPL 344 M L + + G GKTL A L A L A+ GK V ++ + LA++ A N R FE L Sbjct: 92 MVLNERCIAEMRTGEGKTLTATLPAYLNALT-GKGVHVVTVNDYLAQRDAENNRPLFEFL 150 Query: 345 GVEVGWLAGKQKGKARQAQ 363 G+ VG A++ Sbjct: 151 GLTVGINLPGMPAPAKREA 169
>PERTACTIN#Pertactin signature. Length = 922 Score = 118 bits (296), Expect = 4e-29 Identities = 160/718 (22%), Positives = 278/718 (38%), Gaps = 95/718 (13%) Query: 230 TGDSSEGLRTGQSGSLIRLGDDATIETSGASSTGIYAASSSRTELGNNATITVNGASAHA 289 TG + G+ G+++ L ATI A + G + + Sbjct: 236 TGGRAAGV-AAMDGAIVHL-QRATIRRGDAPAGGAVPGGAVPGGAVPGG-FGPLLDGWYG 292 Query: 290 VYATNATVNLGENATISVNSASKAASYSKAPAGLYALSRGAINLAGGAAITMAGDNSSES 349 V +++TV+L A V + A+ +S G+++ G I G Sbjct: 293 VDVSDSTVDL---AQSIVEAPQLGAAIRAGRGARVTVSGGSLSAPHGNVIETGGGARRFP 349 Query: 350 YAISTETGGIVDGS--SGGRFVIDGDIRAAGATAASGTLPQ--------------QNSTI 393 S + + G+ G + T A G Q + + Sbjct: 350 PPASPLSITLQAGARAQGRALLYRVLPEPVKLTLAGGAQGQGDIVATELPPIPGASSGPL 409 Query: 394 KLNMTDNSRWDGASYITSATAGTGVISVQMSDATWNMTSSSTLTDLTLNSGATINFSH-- 451 + + +RW GA+ V S+ + +ATW MT +S + L L S +++F Sbjct: 410 DVALASQARWTGATRA--------VDSLSIDNATWVMTDNSNVGALRLASDGSVDFQQPA 461 Query: 452 EDGEPWQTLTINEDYVGNGGKLVFNTVLNDDDSETDRLQVLGNTSGNTFVAVNNIGGAGA 511 E G ++ L ++ G+G +F + D +D+L V+ + SG + V N G A Sbjct: 462 EAGR-FKVLMVDT-LAGSG---LFRMNVFADLGLSDKLVVMRDASGQHRLWVRNSGSEPA 516 Query: 512 QTIEGIEIVNVAGNSNGTFEKASR---IVAGAYDYNVVQKGKNWYLTSYIEPDEPIIPDP 568 + + +V S TF A++ + G Y Y + G + S + P P P Sbjct: 517 -SGNTMLLVQTPRGSAATFTLANKDGKVDIGTYRYRLAANGNGQW--SLVGAKAPPAPKP 573 Query: 569 VDPVIPDPVVPDPVDPDPVDPVIPDPVIPDPVDPDPVDPEPVDPVIPDPTIPDIGQSDTP 628 P P P P P P P P P +P P P ++ + Sbjct: 574 APQPGPQPGPQPPQPPQPPQP---------PQPPQPPQRQPEAPAPQPPAGRELSAAANA 624 Query: 629 PITEHQFRPEVGSYLANNYAANTLFMTRLHDRLGETQYTDMLTGEKKVTSLWMRNVGAHT 688 + + A + A L RLGE + G W R Sbjct: 625 AVNTGGVGLASTLWYAESNA--------LSKRLGELRLNPDAGG------AWGRGFAQRQ 670 Query: 689 RFNDGSGQLKTRINSYVLQLGGDLAQWSTDGLDRWHIGAMAGYANSQNRTLSSVSDYHSR 748 + ++ +G+ + +LG D A + G RWH+G +AGY +R + Sbjct: 671 QLDNRAGRRFDQ-KVAGFELGADHA-VAVAG-GRWHLGGLAGYTRG-DRGFTG----DGG 722 Query: 749 GQVTGYSVGLYGTWYANNIDRSGAYVDTWMLYNWFDN--KVMGQDQAA--EKYKSKGITA 804 G VG Y T+ AN+ G Y+D + + +N KV G D A KY++ G+ Sbjct: 723 GHTDSVHVGGYATYIANS----GFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGV 778 Query: 805 SVEAGYSFRLGESAHQSYWLQPKAQVVWMGVQADDNREANGTLVKDDTAGNLLTRMGVKA 864 S+EAG F ++L+P+A++ V R ANG V+D+ ++L R+G++ Sbjct: 779 SLEAGRRFAH----ADGWFLEPQAELAVFRVGGGAYRAANGLRVRDEGGSSVLGRLGLEV 834 Query: 865 YINGHNAIDNDKSREFQPFVEANWIHNTQPA-SVKMDDVS--SDMRGTKNIGELKVGI 919 I+ R+ QP+++A+ + A +V+ + ++ +++RGT+ EL +G+ Sbjct: 835 ----GKRIELAGGRQVQPYIKASVLQEFDGAGTVRTNGIAHRTELRGTR--AELGLGM 886
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.7 bits (85), Expect = 1e-04 Identities = 40/208 (19%), Positives = 77/208 (37%), Gaps = 13/208 (6%) Query: 33 ITVEFLPVSLLTP----MAQDLGISEGVAGQSVTVTAFVAMFSSLFITQIIQATDR--RY 86 + ++ + + L+ P + +DL S V + A A+ + +DR R Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRR 73 Query: 87 IVILFAVLLTA-SCLMVSFANSFTLLLLGRACLGLALGGFWAMSASLTMRLVPARTVPKA 145 V+L ++ A +++ A +L +GR G+ G A++ + + + Sbjct: 74 PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARH 132 Query: 146 LSVIFGAVSIALVIAAPLGSFLGGIIGWRNVFNAAAVMGVLCVIWVVKSLP-SLPGEPSH 204 + +V LG +GG F AAA + L + LP S GE Sbjct: 133 FGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP 191 Query: 205 QKQ---NMFSLLQRPGVMAGMIAIFMSF 229 ++ N + + M + A+ F Sbjct: 192 LRREALNPLASFRWARGMTVVAALMAVF 219
>CABNDNGRPT#NodO calcium binding signature. Length = 479 Score = 28.4 bits (63), Expect = 0.030 Identities = 13/69 (18%), Positives = 27/69 (39%), Gaps = 9/69 (13%) Query: 51 TVKKAVDQLVREGVLVQVQGKGTFVKKENVAYPLGEGLLSFAEALASQKINFTTSVITSR 110 ++ +A Q+ RE V G F K N+ + F ++++S T V + Sbjct: 49 SIDQAAAQITREN--VSWNGTNVFGKSANLTF-------KFLQSVSSIPSGDTGFVKFNA 99 Query: 111 LEPANRFVA 119 + ++ Sbjct: 100 EQIEQAKLS 108
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 36.0 bits (83), Expect = 3e-04 Identities = 27/168 (16%), Positives = 64/168 (38%), Gaps = 16/168 (9%) Query: 49 FNIAQNDMISTYGLSMTELGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108 N++ D+ + + + F +T+ +G + +D K+ L F +I++ Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--- 89 Query: 109 MLGFSASMGAGSTSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164 F + +G S F ++ + F Q G + + + ++ P+ RG G Sbjct: 90 --CFGSVIGFVGHSFFSLLIM---ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144 Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRF 212 +G + A+Y+ + + + P +I +I ++ Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP-MITIITVPFLMKL 188
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 41.0 bits (96), Expect = 7e-06 Identities = 72/408 (17%), Positives = 138/408 (33%), Gaps = 60/408 (14%) Query: 29 RHILITIWLGYALFY--FTRKSFNAAAPEILASGILTRSDIGLLATLFYITYGVSKFVSG 86 RH I IWL F+ N + P+I + + T F +T+ + V G Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 87 IVSDRSNARYFMGIGLIATGVVNILFGFSTSLWAFALLWALNAFFQGFGS---PVCARLL 143 +SD+ + + G+I +++ S F L + F QG G+ P ++ Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127 Query: 144 TAWY-SRTERGGWWALWNTAHNVGGALIPLVMAAVALHYGWRVGMMVAGLLAIGVGMVLC 202 A Y + RG + L + +G + P + +A + W +++ + I V Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITV----- 182 Query: 203 WRLRDRPQAIGLPPVGDWRHDALEVAQQQEGAGLSRKEILAKYVLLNPYIWLLSLCYVLV 262 P + L ++ G L I+ + Y + VL Sbjct: 183 ------PFLMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232 Query: 263 YVV-----RAAINDWGNLYMSETLGVDLVTANTAVSMFELGGFI-----------GALVA 306 +++ R + + + + + + + + + GF+ A Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292 Query: 307 GWGSDKLFNG----------------NRGPMNLIFAAGILLSVGSL---WLMPFASYVMQ 347 GS +F G RGP+ ++ LSV L +L+ S+ M Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMT 352 Query: 348 AACFFTTGFFVFGPQMLIGMAAAECSHKEAAGAATGFVGLFAYLGASL 395 F G F + +I + ++ AGA + ++L Sbjct: 353 IIIVFVLGGLSF-TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399
>PF06580#Sensor histidine kinase Length = 349 Score = 38.7 bits (90), Expect = 3e-05 Identities = 30/142 (21%), Positives = 55/142 (38%), Gaps = 11/142 (7%) Query: 365 LRPRQLDDLTLAQAIRSLLREMELESRGIVSHLDWRIDETALSESQRVTLFRVCQEGLNN 424 LR ++LA + + ++L S L + +V + Q + N Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266 Query: 425 IVKHA-----NASAVTLQGWQQDERLMLVIEDDGSGLPPGSHQ-QGFGLTGMRERVSALG 478 +KH + L+G + + + L +E+ GS + + G GL +RER+ L Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326 Query: 479 G---TLTISCTHG-TRVSVSLP 496 G + +S G V +P Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 61.4 bits (149), Expect = 2e-13 Identities = 24/116 (20%), Positives = 45/116 (38%), Gaps = 5/116 (4%) Query: 2 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGRGVQVCICDISMPDIS 61 T+ + DD +R+ Q L V + + + + D+ MPD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLELLSQLPK---GMATIVLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVHT 114 +LL ++ K + +V+S ++ +A GA +L K ELI + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 862 bits (2229), Expect = 0.0 Identities = 522/548 (95%), Positives = 537/548 (97%) Query: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQTQQTTQTTTTAAGSAADQGVPASGQGKM 60 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQ QQTTQTTTTAAGSAADQGVPASGQGK+ Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKL 60 Query: 61 ITVKTDVLDLTINTRGGDVEQALLPAYPKELGSNEPFQLLETTPQFIYQAQSGLTGRDGP 120 I+VKTDVLDLTINTRGGDVEQALLPAYPKEL S +PFQLLET+PQFIYQAQSGLTGRDGP Sbjct: 61 ISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGP 120 Query: 121 DNPANGPRPLYNVEKEAFVLADGQNELQVPMTYTDAAGNTFTKTFVFKRGDYAVNVNYSV 180 DNPANGPRPLYNVEK+A+VLA+GQNELQVPMTYTDAAGNTFTKTFV KRGDYAVNVNY+V Sbjct: 121 DNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNV 180 Query: 181 QNAGEKPLEVSTFGQLKQSVNLPPHRDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD 240 QNAGEKPLE+S+FGQLKQS+ LPPH DTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD Sbjct: 181 QNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEKYKFDTIAD 240 Query: 241 NENLNVSSKGGWVAMLQQYFATAWIPRNDGTNNFYTANLGNGIVAIGYKAQPVLVQPGQT 300 NENLN+SSKGGWVAMLQQYFATAWIP NDGTNNFYTANLGNGI AIGYK+QPVLVQPGQT Sbjct: 241 NENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQT 300 Query: 301 GAMTSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIII 360 GAM STLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIII Sbjct: 301 GAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIII 360 Query: 361 ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRQSQEMMALYKAEKVNPL 420 ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQR SQEMMALYKAEKVNPL Sbjct: 361 ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNPL 420 Query: 421 GGCFPLIIQMPIFLALYYMLMGSIELRHAPFALWIHDLSAQDPYYILPILMGVTMFFIQK 480 GGCFPL+IQMPIFLALYYMLMGS+ELR APFALWIHDLSAQDPYYILPILMGVTMFFIQK Sbjct: 421 GGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQK 480 Query: 481 MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGL 540 MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGL Sbjct: 481 MSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRGL 540 Query: 541 HSREKKKS 548 HSREKKKS Sbjct: 541 HSREKKKS 548
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 26.7 bits (58), Expect = 0.043 Identities = 15/42 (35%), Positives = 21/42 (50%) Query: 70 AEAQVIIEQANKRRAQILDEAKTEAEQERTKIVAQAQAEIEA 111 A+A + ANK R Q EAK +AE++ + A A A Sbjct: 210 AKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRAANTYA 251
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 29.4 bits (66), Expect = 0.010 Identities = 18/90 (20%), Positives = 32/90 (35%), Gaps = 9/90 (10%) Query: 24 SH-ALNYLFADGQLKQGTLVAINAEKLLTAEDNPEVRALIAAAEFKYADGISVVRSIRKK 82 S A Y + + IN E+ T E + + + + V S+ + Sbjct: 204 SEGAFKYFSKAYGVPSAYIWEINTEEEGTPEQIKTLVEKLRQTKVP---SLFVESSVDDR 260 Query: 83 FPQAQVSRVAGADLWEAL----MARAGKEG 108 P VS+ ++ + +A GKEG Sbjct: 261 -PMKTVSQDTNIPIYAQIFTDSIAEQGKEG 289
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 28.9 bits (64), Expect = 0.041 Identities = 16/41 (39%), Positives = 23/41 (56%) Query: 66 TETSDALATQLTALQKAQESQKAELEGIIKKQAAQLDDANR 106 TE L+ QL LQ+ QES KA+L +I + + D A + Sbjct: 598 TEAKITLSQQLNTLQQQQESAKAQLSILINRSGSWADVARQ 638
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 177 bits (450), Expect = 4e-50 Identities = 99/448 (22%), Positives = 170/448 (37%), Gaps = 87/448 (19%) Query: 4 NLRNIAIIAHVDHGKTTLVDKLLQQSGTFDARAETQE--RVMDSNDLEKERGITILAKNT 61 + NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAHGL 121 + +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + G+ Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121 Query: 122 KPIVVINKVDRPGARPDWVVDQVFD-------------LFVNLDATDEQLD--------- 159 I INK+D+ G V + + L+ N+ T+ Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181 Query: 160 --------------------------------FPIIYASALNGIAGLDHEDMAEDMTPLY 187 FP+ + SA N I G+D+ L Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNI-GIDN---------LI 231 Query: 188 QAIVDHVPAPDVDLDGPLQMQISQLDYNNYVGVIGIGRIKRGKVKPNQQVTIIDSEGKTR 247 + I + + L ++ +++Y+ + R+ G + V I + E Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-- 289 Query: 248 NAKVGKVLTHLGLERIDSNIAEAGDIIAITGLG-ELN--ISDTICDPQNVEALPALSVDE 304 K+ ++ T + E + A +G+I+ + +LN + DT PQ + Sbjct: 290 --KITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQR----ERIENPL 343 Query: 305 PTVSMFFCVNTSPFCGKEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGEL 364 P + + + D L LR +S G++ Sbjct: 344 PLLQTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKV 394 Query: 365 HLSVLIENMRRE-GFELAVSRPKVIFRE 391 + V ++ + E+ + P VI+ E Sbjct: 395 QMEVTCALLQEKYHVEIEIKEPTVIYME 422 Score = 32.5 bits (74), Expect = 0.005 Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 1/75 (1%) Query: 398 EPYENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 457 EPY + + +++ + ++ + V L IP+R + +RS+ Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595 Query: 458 MTSGTGLLYSTFSHY 472 T+G + + Y Sbjct: 596 FTNGRSVCLTELKGY 610
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 30.7 bits (69), Expect = 0.004 Identities = 16/53 (30%), Positives = 25/53 (47%), Gaps = 6/53 (11%) Query: 40 MLVTGRSHREAY-AYYQTLALTEPMICCNGSYI----YQPAQQQILHPLPLTH 87 M+VT + Y +YQTL +T P++ +G+ Q LPL+H Sbjct: 166 MVVTALAPSYDYFIFYQTLVIT-PILFLSGAVFPVDQLPIVFQTAARFLPLSH 217
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 266 bits (682), Expect = 8e-87 Identities = 87/425 (20%), Positives = 175/425 (41%), Gaps = 25/425 (5%) Query: 9 LMMIIISLTILIIILTYFIEINSVVHGQGVITTKDNAQLISLSKGGTIQDIYVAEGDTVK 68 + I+ ++ IL+ ++ V G +T ++ I + +++I V EG++V+ Sbjct: 60 VAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVR 119 Query: 69 KGELLAKVVNLDLQKEYQRYRTQKGYLDKDVNEI-------SFILDKENESGLITLDGTR 121 KG++L K+ L E +TQ L + + S L+K E L + Sbjct: 120 KGDVLLKLT--ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQ 177 Query: 122 SLSNKEVKANIELVHSQIRA-------KELKKTSLDSEISGLQEKLSSKEKELALLAEEI 174 ++S +EV L+ Q KEL +E + +++ E + + Sbjct: 178 NVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRL 237 Query: 175 NILSPLVKKGISPYTNFLNKKQAYIKVKSEINDIESSITLKKDDIELVVNDIEALNNELR 234 + S L+ K L ++ Y++ +E+ +S + + +I + + + + Sbjct: 238 DDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFK 297 Query: 235 LSLSKIISKNLQELEVVNSTLKVIEKQINEEDIYSPVDGVIYKINKSATTHGGVIQAADL 294 + + + + ++ L E++ I +PV + ++ T GGV+ A+ Sbjct: 298 NEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQL--KVHTEGGVVTTAET 355 Query: 295 LFEIKPKVRTMLADVKILPKYRDQIYVDEAVKLDVQSIIQPKIKSYNATIDNISPDSYEE 354 L I P+ T+ + K I V + + V++ + + NI+ D+ E+ Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIED 415 Query: 355 NTGGTIQRYYKVIIAFDVNE----DDLRWLKPGMTVDASVITGKHSIMEYLLSPLMKGVD 410 G + VII+ + N + L GM V A + TG S++ YLLSPL + V Sbjct: 416 QRLGL---VFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVT 472 Query: 411 KAFSE 415 ++ E Sbjct: 473 ESLRE 477
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 49.7 bits (118), Expect = 3e-07 Identities = 49/190 (25%), Positives = 76/190 (40%), Gaps = 30/190 (15%) Query: 96 DSAQVEKKGNGKRRNKKEEEELKKQLDDAENAKK--EADKAK-EEAEKAKEAAEKALNEA 152 A+ + + + L++ LD + AKK EA+ K EE K EA+ ++L Sbjct: 293 LEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD 352 Query: 153 FEVQNSSK-QIEEMLQNFLADNVAKDNLAQQSDASQQNTQA---KATQASKQNDAEKVLP 208 + +K Q+E Q N + S+AS+Q+ + + +A KQ + Sbjct: 353 LDASREAKKQLEAEHQKLEEQN-------KISEASRQSLRRDLDASREAKKQVEKALEEA 405 Query: 209 QPI-------NKNTSTGK--SNSSKNEEN-KLDAESVKEPLKVTLALAAES----NSGSK 254 NK K + K E KL+AE+ + LK LA AE +G Sbjct: 406 NSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEA--KALKEKLAKQAEELAKLRAGKA 463 Query: 255 DDSITNFTKP 264 DS T KP Sbjct: 464 SDSQTPDAKP 473 Score = 48.1 bits (114), Expect = 8e-07 Identities = 35/136 (25%), Positives = 63/136 (46%), Gaps = 4/136 (2%) Query: 98 AQVEKKGNGKRRNKKEEEELKKQLDDAENAKKEADKAKEEAEKAKEAAEKALNEAFEVQN 157 A+ +K + ++ + L++ LD + AKK+ +KA EEA A EK E E + Sbjct: 365 AEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKK 424 Query: 158 SSKQIEEMLQNFL-ADNVA-KDNLAQQSD--ASQQNTQAKATQASKQNDAEKVLPQPINK 213 +++ + LQ L A+ A K+ LA+Q++ A + +A +Q K +P Sbjct: 425 LTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQA 484 Query: 214 NTSTGKSNSSKNEENK 229 + K N +K + Sbjct: 485 PQAGTKPNQNKAPMKE 500 Score = 43.5 bits (102), Expect = 3e-05 Identities = 17/115 (14%), Positives = 42/115 (36%), Gaps = 19/115 (16%) Query: 101 EKKGNGKRRNKKEEEELKKQLDDAENAKKEAD-------KAKEEAEKAKEAAEKALNEAF 153 ++ ++ + + + E + + + A ++L Sbjct: 260 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQ-SQVLNANRQSLRRDL 318 Query: 154 EVQNSSK-QIEEMLQNFLADNVAKDNLAQQSDASQQNTQAK---ATQASKQNDAE 204 + +K Q+E Q + + N + S+AS+Q+ + + +A KQ +AE Sbjct: 319 DASREAKKQLEAEHQ-----KLEEQN--KISEASRQSLRRDLDASREAKKQLEAE 366
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 29.9 bits (67), Expect = 0.003 Identities = 17/65 (26%), Positives = 29/65 (44%), Gaps = 6/65 (9%) Query: 89 LAVDKSLHGQGVARALVRDAGLRVIQVAETIGIRGMLVHALSDE--AREFYQRVGFVPSP 146 +AV K +GV AL+ A I+ A+ G+++ A FY + F+ Sbjct: 95 IAVAKDYRKKGVGTALLHKA----IEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150 Query: 147 MDPMM 151 +D M+ Sbjct: 151 VDTML 155
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 28.8 bits (64), Expect = 0.033 Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 7/64 (10%) Query: 130 PPPPPPVVAKRVESAPRPTEPARNPFKSSDDRLTGVTSSNTVTRPAARASAGAGDKVVIA 189 P P P K+VE R +P + S + + RP + + A K V + Sbjct: 100 KPKPKPKPVKKVEQPKRDVKPVESRPASPFE-------NTAPARPTSSTATAATSKPVTS 152 Query: 190 IDAG 193 + +G Sbjct: 153 VASG 156
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 30.1 bits (68), Expect = 0.027 Identities = 26/161 (16%), Positives = 57/161 (35%), Gaps = 18/161 (11%) Query: 31 VENSLDAGATRVDIDIER---GGAKLIR-IRDNGCGIKKEELALALARHATSKIASLDDL 86 ++ SLD A + ++ I R A++ ++ N G E + A+ + +L++ Sbjct: 5 IQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLEEA 64 Query: 87 EAIISLGFRGEAL----------ASISSVSRLTLTSRTAEQAEAWQAYAEGRDMDVTVK- 135 + G++G L I RLT + Q +A Q +D+ +K Sbjct: 65 ITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYLKV 124 Query: 136 -PAAHPVGTTLEVLDLFYNTPARRKFMRTEK--TEFNHIDE 173 + +G + + + + + F + Sbjct: 125 NSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEH 165
>SECA#SecA protein signature. Length = 901 Score = 33.3 bits (76), Expect = 0.002 Identities = 26/144 (18%), Positives = 55/144 (38%), Gaps = 6/144 (4%) Query: 282 HVVDAADVRVQENIEAVNTVLEEIDAHEIPTLMVMNKIDMLDDFEPRIDRDEENK-PIRV 340 ++D +DV N + IDA+ P + ++ + + R+ D + PI Sbjct: 665 ELLDVSDVSETINSIREDVFKATIDAYIPPQSL--EEMWDIPGLQERLKNDFDLDLPIAE 722 Query: 341 WLSAQSGVGIPQLFQALTERLSGEVAQHTLRLPPQEGRLRSRFYQLQAIEKEWMEEDGSV 400 WL + + L + + + + + + R + LQ ++ W E ++ Sbjct: 723 WLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAM 782 Query: 401 SLQVRMPIVDWRRLCKQEPALIEY 424 +R I R +++P EY Sbjct: 783 D-YLRQGIH-LRGYAQKDP-KQEY 803
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 29.0 bits (64), Expect = 0.030 Identities = 18/65 (27%), Positives = 30/65 (46%), Gaps = 3/65 (4%) Query: 225 NRMRAEREAVARRHRSQGQEEAEKLRAAADYEVTK---TLAEAERQGRIMRGEGDAEAAK 281 N+ R + A A+R + + +RAA Y + +A A +G I +G A A+ Sbjct: 220 NKAREQAAAEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLAQ 279 Query: 282 LFADA 286 +DA Sbjct: 280 AISDA 284
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 44.9 bits (106), Expect = 5e-07 Identities = 69/394 (17%), Positives = 147/394 (37%), Gaps = 32/394 (8%) Query: 30 DTAVISGAIGSLTSYFHLSPAETGWAVSCVVVGCVIGSFSAGYLSKRFGRKKSLMVSALL 89 + V++ ++ + + F+ PA T W + ++ IG+ G LS + G K+ L+ ++ Sbjct: 29 NEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIII 88 Query: 90 FTISAVGTSLSYTFTHFVIY-RIIGGLAVGLAATVSPMYMSEVSPKNMRGRALSMQQFAI 148 +V + ++F +I R I G + + ++ PK RG+A + + Sbjct: 89 NCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIV 148 Query: 149 VFGQILIFYVNYKIASIAADTWLIELGWRYMFAAGIIPCILFCILVFLIPESPRW----- 203 G+ V I + A + W Y+ +I I L+ L+ + R Sbjct: 149 AMGE----GVGPAIGGMIAHY----IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFD 200 Query: 204 -----MMMIGREEETLKILTKISNEEHARHLLADIKTSLQNDQLNAHQKLNYRDGNVRFI 258 +M +G + T + + +++ + ++ G Sbjct: 201 IKGIILMSVGI--VFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPF 258 Query: 259 LILGCMIAMLQQVTGVNVMMYYAPIVLKDVTG-SAQEALFQTIWIGVIQ-LIGSIIGAMI 316 +I ++ V M P ++KDV S E I+ G + +I IG ++ Sbjct: 259 MIGVLCGGIIFGTVAGFVSM--VPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGIL 316 Query: 317 MDKMGRLSLMRKGTIGSIIGLLLTSWALYSQATGYFALFGMLFFMIFYALSWGVGAWVLI 376 +D+ G L ++ G + L + + T +F ++F + + + +I Sbjct: 317 VDRRGPLYVLNIGVTFLSVSFLTA--SFLLETTSWFMTIIIVFVLGGLSFT-----KTVI 369 Query: 377 SEIFPNRMRSQGMSISVGFMWMANFLVSQFFPMI 410 S I + ++ Q + + +FL I Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 57.6 bits (139), Expect = 4e-11 Identities = 83/403 (20%), Positives = 160/403 (39%), Gaps = 30/403 (7%) Query: 2 SQRSKYNSAYVYVLCCIAALAGLMFGYSTAVITGVVLP-LQQYYQLTPTETGWAVSSIVI 60 SQ + ++ + LC ++ F ++ V LP + + P T W ++ ++ Sbjct: 6 SQSNLRHNQILIWLCILS-----FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFML 60 Query: 61 GCIIGALVGGKIADKLGRKPALLIIAIIFIASSLGAAMSES-FMIFSLSRIVCGFAVGMA 119 IG V GK++D+LG K LL II S+ + S F + ++R + G Sbjct: 61 TFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAF 120 Query: 120 GTASTMYMSELAPAEIRGKALGIYNISVVSGQVIVFIVNYLIAKGMPADVLVSQGWKTML 179 + ++ P E RGKA G+ V G+ + + +IA + W +L Sbjct: 121 PALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI--------HWSYLL 172 Query: 180 FAQVVPSIAMLAITLFLPESPAWCARNNRSEA--RSIKVLTRIYSGLTAT------DVAA 231 ++ I + + L + + S+ ++ + + + V + Sbjct: 173 LIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLS 232 Query: 232 IFDSMKETVRSQDNVAGGERTNLKSSPVLRYILLVGCCIAVLQQFTGVNVMNYYAPLVLQ 291 +K + D K+ P + +L G + F V+++ Y V Q Sbjct: 233 FLIFVKHIRKVTDPFVDPGL--GKNIPFMIGVLCGGIIFGTVAGF--VSMVPYMMKDVHQ 288 Query: 292 NSSTEVVMFQTIFIAVCNVVGSFIGMILFDRYGRIPIMKIGTIGSIVGLLIASYGLYTHD 351 S+ E+ + ++ +IG IL DR G + ++ IG V L AS+ L T Sbjct: 289 LSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLET-- 346 Query: 352 TGYITIFGILFFMLLFAVSWSVGAWVLISEVFPEKIKGFGMGL 394 T + I+F + + + +V + ++ S + ++ G GM L Sbjct: 347 TSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQE-AGAGMSL 388
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.026 Identities = 11/59 (18%), Positives = 23/59 (38%), Gaps = 4/59 (6%) Query: 64 DKDVEVVIITASNEAHADVAVAALNANKYVFCEKP--LAVTAADCQRVIEAEQKNGKRM 120 D+ V++++A N A+ A Y + KP L R + ++ ++ Sbjct: 73 RPDLPVLVMSAQNTF--MTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKL 129
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 37.2 bits (86), Expect = 9e-06 Identities = 22/106 (20%), Positives = 39/106 (36%), Gaps = 7/106 (6%) Query: 39 KGYTVADPNLDELYQVYSQPGAAYWVVEQNGCVVGGGGVAPLSCSEPDICELQKMYFLPV 98 K Y D ++ V + AA+ +N C+ G + + ++ + Sbjct: 48 KQYEDDD---MDVSYVEEEGKAAFLYYLENNCI----GRIKIRSNWNGYALIEDIAVAKD 100 Query: 99 IRGQGLAKKLALMALDHAREQGFKRCYLETTAFLREAIALYERLGF 144 R +G+ L A++ A+E F LET A Y + F Sbjct: 101 YRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 34.7 bits (79), Expect = 7e-04 Identities = 18/82 (21%), Positives = 29/82 (35%), Gaps = 8/82 (9%) Query: 117 KGLQYQAMMTSLNGVHFGFQCSMRRAWWYMFALPVLLMVALYIVLYIISLVTIAVGGLVF 176 K L+ ++ V W P+ L + +S V L+F Sbjct: 433 KYLKITGHVSFGYDVVSDILKIKDTGDWK----PLFLTLEKKAADAGVSYVVA----LLF 484 Query: 177 SIVFLGLLAIIGIGVINGITYS 198 S++ L I GI ++ GI S Sbjct: 485 SLLAGTTLGIWGIAIVTGILCS 506
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 140 bits (353), Expect = 1e-42 Identities = 85/256 (33%), Positives = 133/256 (51%), Gaps = 8/256 (3%) Query: 7 LAGKNILITGAAQGIGYLLATGLGRYGARIIVNDITPERAETAVTKLQQEGIKAIAAPFN 66 + GK ITGAAQGIG +A L GA I D PE+ E V+ L+ E A A P + Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 67 VTHKQDIEAAVEHIEKDIGVIDVLINNAGIQRRHPFTEFPEQEWNDVIAVNQTAVFLVSQ 126 V I+ IE+++G ID+L+N AG+ R ++EW +VN T VF S+ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 127 AVTRRMVARQAGKVINICSMQSELGRDTITPYAASKGAVKMLTRGMCVELARHNIQVNGI 186 +V++ M+ R++G ++ + S + + R ++ YA+SK A M T+ + +ELA +NI+ N + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 187 APGYFKTEMTKALVEDE--------AFTSWLCKRTPAARWGDPQELIGAAVFLSSKASDF 238 +PG +T+M +L DE P + P ++ A +FL S + Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 239 VNGHLLFVDGGMLVAV 254 + H L VDGG + V Sbjct: 246 ITMHNLCVDGGATLGV 261
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 32.0 bits (72), Expect = 0.017 Identities = 14/126 (11%), Positives = 31/126 (24%), Gaps = 2/126 (1%) Query: 372 STRKAEAAKKYQTEDFFNQVESKEYVEDALLFYLEKAKAAFPEKECSSPEKVIELLHGQL 431 E A + + +E +A+ A EK ++ Sbjct: 156 RKADLEKALEGAMNFSTADSAKIKTLEAEKA--ALEARQAELEKALEGAMNFSTADSAKI 213 Query: 432 AAKSEQLVRLNATWQTLSQVRATRELIDNDIEQYLDNLNKLLSGQEQKVTQLKSAKAEWK 491 + L A L + + L + E + +L+ A Sbjct: 214 KTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAM 273 Query: 492 KYRASE 497 + ++ Sbjct: 274 NFSTAD 279
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.4 bits (66), Expect = 0.021 Identities = 8/21 (38%), Positives = 14/21 (66%) Query: 30 ESNVLITGPNGCGKSTLLRAI 50 + ++ITG +G GK + RA+ Sbjct: 160 DLTLMITGESGTGKELVARAL 180
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 39.3 bits (92), Expect = 2e-05 Identities = 34/129 (26%), Positives = 51/129 (39%), Gaps = 33/129 (25%) Query: 26 CDVLLANGKIIAVG-AGIPG-----DIV--PDCTVINLSGRMLCPGFIDQHVHLIGG--- 74 D+ L +G+I A+G AG P I+ P VI G+++ G +D H+H I Sbjct: 86 ADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFICPQQI 145 Query: 75 ------------GGEAGP------TTRTP-EVSLSRLTEA--GITTVVGLLGTDSVSRHP 113 GG GP TT TP ++R+ EA + G + S P Sbjct: 146 EEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIEAADAFPMNLAFAGKGNAS-LP 204 Query: 114 ASLLAKTRA 122 +L+ Sbjct: 205 GALVEMVLG 213
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.0 bits (96), Expect = 6e-06 Identities = 83/396 (20%), Positives = 142/396 (35%), Gaps = 31/396 (7%) Query: 9 PRHPIFTALFGMMVLTLGMGVGRFLYTPMLPVMLAEKQLTFNQLSWIASANYAGYLAGSL 68 P P+ L + + +G+G L P+LP +L + + N ++ A Y Sbjct: 3 PNRPLIVILSTVALDAVGIG----LIMPVLPGLLRDLVHS-NDVTAHYGILLALYALMQF 57 Query: 69 LFSFGLFHLPSRL--RPMLLASAVATGILILAMAIFTQPAVVMLVRFLAGVASAGMMIFG 126 + L L R RP+LL S + MA V+ + R +AG+ A + G Sbjct: 58 ACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG 117 Query: 127 SMI-----VLHHTRHPFVIAALFSGVGAGIALGNEYVIGGLHYALSAHSLWLGAGALAGI 181 + I RH ++A F G G+ G V+GGL S H+ + A AL G+ Sbjct: 118 AYIADITDGDERARHFGFMSACF---GFGMVAGP--VLGGLMGGFSPHAPFFAAAALNGL 172 Query: 182 LLLIVAILIPPRAHALPPAPLARIENQPMPWWQLA-LLYGFAGFGYIIVATYLPLMAKSA 240 L L+P +H PL R P+ ++ A + A + L +A Sbjct: 173 NFLTGCFLLPE-SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAA 231 Query: 241 GSPLLTAHL--WSLVGLAIIPGCFGWLWA----------AKHWGVLPCLTANLLIQSACV 288 + W + I FG L + A G L ++ Sbjct: 232 LWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGY 291 Query: 289 LLSLASDSLLLLILSSIGFGATFMGTTSLVMPLARQLSAPGNINLLGLVTLTYGIGQILG 348 +L + + + + +G +L L+RQ+ L G + + I+G Sbjct: 292 ILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG 351 Query: 349 PLAASLSGNGASAIINATLCGAAALFFAALISAVQQ 384 PL + + N A A + + A+++ Sbjct: 352 PLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 47.6 bits (113), Expect = 5e-08 Identities = 35/141 (24%), Positives = 64/141 (45%), Gaps = 5/141 (3%) Query: 58 SLYLAGGMALQWLLGPLSDRIGRRPVLIAGALIFTLACAATLLTTSMTQFLV-ARFVQGT 116 L + G A+ G LSD++G + +L+ G +I + S L+ ARF+QG Sbjct: 59 MLTFSIGTAV---YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGA 115 Query: 117 SICFIATVGYVTVQEAFGQTKAIKLMAIITSIVLVAPVIGPLSGAALMHFVHWKVLFGII 176 + V V + K +I SIV + +GP G + H++HW L +I Sbjct: 116 GAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL-LI 174 Query: 177 AVMGLLALCGLLLAMPETVQR 197 ++ ++ + L+ + + V+ Sbjct: 175 PMITIITVPFLMKLLKKEVRI 195
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 32.5 bits (73), Expect = 5e-04 Identities = 22/71 (30%), Positives = 32/71 (45%), Gaps = 8/71 (11%) Query: 107 AADRRKGERQGRQVGLEEGLEKGLEKGQ--------RVAALRIAKQMLADGLDRETVQRF 158 A R++G +QG Q GL +GLE+GL + + R+ L Q D LD R Sbjct: 61 AEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120 Query: 159 TGLTAEELQDV 169 + E + V Sbjct: 121 MQMALEAARQV 131
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 31.1 bits (70), Expect = 0.019 Identities = 13/61 (21%), Positives = 28/61 (45%), Gaps = 5/61 (8%) Query: 773 ADEIWGYLPGEREKYVFTGEWYDGLFGLEENEEFNDAFWDDVRYIK---DQVNKELENQK 829 AD+ W + ++EK++ E + EE + N+ + ++ K + K + + K Sbjct: 344 ADKKWSHFGTQKEKWIGVAE--NHFSNTEEQAKINNKIKEAIKMFKELPEDFVKYINSDK 401 Query: 830 A 830 A Sbjct: 402 A 402
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 28.8 bits (64), Expect = 0.007 Identities = 12/32 (37%), Positives = 19/32 (59%) Query: 8 NSAILVHFTLKLDDGSTAESTRNNGKPALFRL 39 + + V +T L DG+ +ST GKPA F++ Sbjct: 144 SDTVTVEYTGTLIDGTVFDSTEKAGKPATFQV 175
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 68.7 bits (168), Expect = 1e-15 Identities = 28/141 (19%), Positives = 48/141 (34%), Gaps = 2/141 (1%) Query: 1 MDSITTLIVEDEPMLAEILVDTIKIFPQFSIVGIADKLESAKKQIRLYQPQLILLDNFLP 60 M T L+ +D+ + +L + V I + + I L++ D +P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 DGKGIDLIRHTISTNYTGRIIFITADNHMDTISDALRMGVFDYLIKPVHYQRLQHTLERF 120 D DL+ ++ ++A N T A G +DYL KP L + R Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 121 TRYRSSLRSSEQANQTHVDAL 141 S + + L Sbjct: 119 LAEPKRRPSKLEDDSQDGMPL 139
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 30.2 bits (68), Expect = 0.017 Identities = 19/81 (23%), Positives = 30/81 (37%), Gaps = 13/81 (16%) Query: 91 DATYITVGNEKGQRLYHVNPDEIGKYMEGGDSDDALYNAKSYVSVRKGSLGSSLRGKSPI 150 + + G EK Q L V +E+ KY E G + GS+G + Sbjct: 238 NGAALYYGTEKEQWLREVKVEELRKYYEEG-------------HFKAGSMGPKVLAAIRF 284 Query: 151 QDSTGKVIGIVSVGYTLEQLE 171 + G+ I + +E LE Sbjct: 285 IEWGGERAIIAHLEKAVEALE 305
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 38.3 bits (89), Expect = 1e-05 Identities = 21/102 (20%), Positives = 43/102 (42%), Gaps = 4/102 (3%) Query: 158 NPFTLGHRYLVEQAAAACDWLHLFVVKEDAS--FFSYTDRWALIEQGIAGIDNVTLHSGS 215 +P T GH ++E+ D +++ V++ FS +R I + IA + N + S Sbjct: 10 DPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLPNAQVDSFE 69 Query: 216 AYMISRATFPGYFLKEKGV--VDDCHCQIDLQLFREHLAPAL 255 ++ A +G+ + D ++ + + LA L Sbjct: 70 GLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDL 111
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 695 bits (1796), Expect = 0.0 Identities = 243/885 (27%), Positives = 385/885 (43%), Gaps = 67/885 (7%) Query: 3 HYKKFRLSTLAAVVGIVLAVGPENSYAEAPIQFNTRFLDVKDDASLDLSRFSRKGYIMPG 62 H +K RL+ + + A + + A + FN RFL A DLSRF + PG Sbjct: 17 HIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPG 76 Query: 63 SYHLQVLVNQSQIVQDNVITYSVDNNDPDNTYPCLSPELVSLLGLKPEIADKMIWINAGQ 122 +Y + + +N + +V + D+ PCL+ ++ +GL M + Sbjct: 77 TYRVDIYLNNGYMATRDVTFNTGDSEQ--GIVPCLTRAQLASMGLNTASVSGMNLLADDA 134 Query: 123 CLQPDQL-EGMETQTDLSQSTLTVIIPQAYLEYSDEEWDPPSRWDEGIPGVLFDYNVNSQ 181 C+ + Q D+ Q L + IPQA++ + PP WD GI L +YN + Sbjct: 135 CVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGN 194 Query: 182 WRHAEHDDGDEYDISGNGTVGANLGAWRLRADWQANYRHENDSEDKDNFGSSSEQNWDWN 241 Y N G N+GAWRLR + +Y + S S S+ W Sbjct: 195 SVQNRIGGNSHY-AYLNLQSGLNIGAWRLRDNTTWSYNSSDSS-------SGSKNKWQHI 246 Query: 242 RYYAWRAIPQLRAQLTLGEGSLESDIFDGFNYVGGSLITDDQMLPPNLRGYAPDISGVAR 301 + R I LR++LTLG+G + DIFDG N+ G L +DD MLP + RG+AP I G+AR Sbjct: 247 NTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIAR 306 Query: 302 TNAKVTVTQRGRVIYESQVPAGPFRIQDINET-VSGDLHVKIEEQSGQVQEYDVSTASIP 360 A+VT+ Q G IY S VP GPF I DI SGDL V I+E G Q + V +S+P Sbjct: 307 GTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVP 366 Query: 361 FLTRPGQVRYKLAAGRPQDWDHNMEGGFFTSAEASWGIANGWSLYGGAIGEQDYQALALG 420 L R G RY + AG + + E F + G+ GW++YGG Y+A G Sbjct: 367 LLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFG 426 Query: 421 LGRDLALLGAFSVDVTHSRATLPEGSAYGDGTIQGNSFRASYAKDFDDIDSRLTFAGYRF 480 +G+++ LGA SVD+T + +TLP D G S R Y K ++ + + GYR+ Sbjct: 427 IGKNMGALGALSVDMTQANSTLP-----DDSQHDGQSVRFLYNKSLNESGTNIQLVGYRY 481 Query: 481 SEENYMTMDEFIDTHNDDNDR-----------------QRTGHDKEMYTLTYSQNFSAIN 523 S Y + + + + + + LT +Q Sbjct: 482 STSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-T 540 Query: 524 VNAYINYTHRTYWNQPNQD-SYNLTLSHYFDVGEVRGISLSVNGFRNEYDNERDDGVYVS 582 Y++ +H+TYW N D + L+ F+ +LS + +N + RD + ++ Sbjct: 541 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINW---TLSYSLTKNAWQKGRDQMLALN 597 Query: 583 LSIPWGN-----------NRTLSYNGSFSDDNN-SNQVGYYERI--DDRNNYQINAGRAD 628 ++IP+ + + + SY+ S + +N G Y + D+ +Y + G A Sbjct: 598 VNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAG 657 Query: 629 -----NGATLDGYYRYQASYADIDVSANYQEGDYTSGGLNIQGGATLTAKGGALHRTSVN 683 +G+T Y+ Y + ++ ++ D + GG A G L + Sbjct: 658 GGDGNSGSTGYATLNYRGGYGNANIGYSHS-DDIKQLYYGVSGGVLAHANGVTLGQPL-- 714 Query: 684 GGSRLLVDVGDEANVPISGYSTPVYTNAFGKAVIVDVNDYYRNQVKIDITQLPEDAEAIL 743 + +LV + + + V T+ G AV+ +Y N+V +D L ++ + Sbjct: 715 NDTVVLVKAPGAKDAKVENQTG-VRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDN 773 Query: 744 SIAQATLTEGAIGYRRMEVLSGKKAMASIRLRDGGTPPFGAEVYNSRQQQLGIVGEDGSV 803 ++A T GAI + G K + ++ + PFGA V + Q GIV ++G V Sbjct: 774 AVANVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQV 832 Query: 804 YLIGINPGERLQVTW--EGKTQCEAT--LPDPLPGDLFSGLLLPC 844 YL G+ ++QV W E C A LP L + L C Sbjct: 833 YLSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 38.9 bits (90), Expect = 2e-06 Identities = 36/136 (26%), Positives = 61/136 (44%), Gaps = 10/136 (7%) Query: 43 PPCTVTGGE---VEFGNVLTTKVDGVNYRQAVGYRLSCNGRVSDYLKLQIQGNAVTINGE 99 PPCT+ G+ V+FGN+ VD +SC + S L +++ GN + + Sbjct: 32 PPCTINNGQNIVVDFGNINPEHVDNSRGEVTKNISISCPYK-SGSLWIKVTGNTMGVGQN 90 Query: 100 SVLQTDVDGLGIRL-QTATDGALISPGNTQWLSFQYSGGSGPA-----IEAIPVKNNGVT 153 +VL T++ GI L Q ++ GN ++ + G A ++P +N Sbjct: 91 NVLATNITHFGIALYQGKGMSTPLTLGNGSGNGYRVTAGLDTARSTFTFTSVPFRNGSGI 150 Query: 154 LTGGAFNAGATLVVDY 169 L GG F A++ + Y Sbjct: 151 LNGGDFRTTASMSMIY 166
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 44.3 bits (104), Expect = 3e-08 Identities = 55/184 (29%), Positives = 78/184 (42%), Gaps = 39/184 (21%) Query: 1 MKKI---VLTMLMGGSLAAQ---AADNLKFHGTLISPPNCTINNDQTIDVKFGNLLINKI 54 MKKI L +++G L +Q AADNL F G LI P CT+ N +V +G++ I + Sbjct: 1 MKKIRGLCLPVMLGAVLMSQHVHAADNLTFKGKLIIPA-CTVQN---AEVNWGDIEIQNL 56 Query: 55 DGTRYAQ-------NVPYEITCDSTVRDETMAMTLTLSGSVSD--FNPAAVNTSVAGLGI 105 + Q N PY + TM +T+T +G + P S GL I Sbjct: 57 VQSGGNQKDFTVDMNCPYSLG--------TMKVTITSNGQTGNSILVPNTSTASGDGLLI 108 Query: 106 ELRQNDQ-----------PFTLGS-TITVNEQSIPVLKAIPVKKSGASLKEGGFDATATL 153 L ++ T G T T + I + + K + SL+ G F ATATL Sbjct: 109 YLYNSNNSGIGNAVTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSATATL 168 Query: 154 QVDY 157 Y Sbjct: 169 VASY 172
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 35.4 bits (81), Expect = 5e-05 Identities = 42/170 (24%), Positives = 74/170 (43%), Gaps = 16/170 (9%) Query: 14 ILCGALILP--VSAADNLHFSGSLVASPCTLTMQGTGIAEVDFSSLDSSDFTPDGQSARK 71 ++ GA+++ V AADNL F G L+ CT+ AEV++ ++ + G +K Sbjct: 11 VMLGAVLMSQHVHAADNLTFKGKLIIPACTVQN-----AEVNWGDIEIQNLVQSG-GNQK 64 Query: 72 PLVFELTDCDSALSNGVQVTFTGTEATGMRGILAIDSYSGASGIGIGIETLSGVPVGMND 131 ++ C +L ++VT T TG ++ S + G+ I + + +G Sbjct: 65 DFTVDMN-CPYSLGT-MKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNAV 122 Query: 132 KEGAIFT--LVTGKN---TLNLNAWV-QRLPGEDLIPGRFSASALATFEY 175 G+ T +TG + L A + + + L G FSA+A Y Sbjct: 123 TLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASY 172
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 342 bits (879), Expect = e-114 Identities = 120/376 (31%), Positives = 188/376 (50%), Gaps = 57/376 (15%) Query: 192 ALDMTRLTRRQRVDYPSGKGLQTRYELGDIRGQSPQMEQLRQTITLYARSRAAVLIQGET 251 AL + + + + G+S M+++ + + ++ ++I GE+ Sbjct: 118 ALAEPKRRPSKL--------EDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGES 169 Query: 252 GTGKELAAQAIHQTFFHRQPHRQNKPSPPFVAVNCGAITESLLEAELFGYEEGAFTGSRR 311 GTGKEL A+A+H R+N P FVA+N AI L+E+ELFG+E+GAFTG++ Sbjct: 170 GTGKELVARALHD-----YGKRRNGP---FVAINMAAIPRDLIESELFGHEKGAFTGAQT 221 Query: 312 GGRAGLFEIAHGGTLFLDEIGEMPLPLQTRLLRVLEEKAVTRVGGHQPIPVDVRVISATH 371 G FE A GGTLFLDEIG+MP+ QTRLLRVL++ T VGG PI DVR+++AT+ Sbjct: 222 R-STGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATN 280 Query: 372 CDLDREIMQGRFRPDLFYRLSILRLTLPPLRERQADILPLAESFLKQSLAAMEIPFTESI 431 DL + I QG FR DL+YRL+++ L LPPLR+R DI L F++Q + + Sbjct: 281 KDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLD----V 335 Query: 432 RHGLTQCQPLLLAWRWPGNIRELRNMMERLALFLS------------------------- 466 + + L+ A WPGN+REL N++ RL Sbjct: 336 KRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKA 395 Query: 467 -VDPAPTLDRQFMRQLLPELMVNTAELTPST---------VDANALQDVLARFKGDKTAA 516 Q + + + + + + P + ++ + L +G++ A Sbjct: 396 AARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKA 455 Query: 517 ARYLGISRTTLWRRLK 532 A LG++R TL ++++ Sbjct: 456 ADLLGLNRNTLRKKIR 471
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 32.3 bits (73), Expect = 0.003 Identities = 19/69 (27%), Positives = 29/69 (42%) Query: 254 DVLREIRERTELPLGAYQVSGEYAMIKFAAMAGAIDEEKVVLESLGSIKRAGADLIFSYF 313 + E+ + +L L QV G A F +D E L I+ A +IF+ Sbjct: 466 NQFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQETTARIIFNGK 525 Query: 314 ALDLAEKNI 322 L+L E+ I Sbjct: 526 DLNLVERRI 534
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 124 bits (312), Expect = 4e-31 Identities = 108/491 (21%), Positives = 182/491 (37%), Gaps = 77/491 (15%) Query: 556 VLNADLVNDRTWDTTQANYGYGTIAMNSDGHL-----------------TINGNGDINNG 598 +++ +++ TW N G + + SDG + T+ G+G Sbjct: 429 AVDSLSIDNATW-VMTDNSNVGALRLASDGSVDFQQPAEAGRFKVLTVNTLAGSGLFRMN 487 Query: 599 DELDNSSVDNVVA---ATGNYKVRIDNATGAGSVADYKGNELIYVNDVNTDATFSAAN-- 653 D D +V A+G +++ + N+ GS L+ + + ATF+ AN Sbjct: 488 VFADLGLSDKLVVMQDASGQHRLWVRNS---GSEPASANTLLLVQTPLGSAATFTLANKD 544 Query: 654 -KADLGAYTYQAKQEGNTV------------------------------------VLEQM 676 K D+G Y Y+ GN Sbjct: 545 GKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGR 604 Query: 677 ELTDYANMALSIP--SANTNIWNLEQDTVGTRLTNARHGLADNGGAWVSYFGGNFNGDNG 734 EL+ AN A++ + +W E + + RL R D GGAW F DN Sbjct: 605 ELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRL-NPDAGGAWGRGFAQRQQLDNR 663 Query: 735 TIN-YDQDVNGIMVGVDTKVDGNNAKWIVGAAAGFAKGDLS---DRTGQVDQDSQSAYIY 790 +DQ V G +G D V +W +G AG+ +GD D G D S ++ Sbjct: 664 AGRRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTD----SVHVG 719 Query: 791 SSARFANN--IFVDGNLSYSHFNNDLSANMSDGTYVDGNTSSDAWGFGLKLGYDLKLGDA 848 A + + ++D L S ND SDG V G + G L+ G D Sbjct: 720 GYATYIADSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRFTHADG 779 Query: 849 GYVTPYGSVSGLFQSGDDYQLSNDMKVDGQSYDSMRYELGVDAGYTFTYSEDQALTPYFK 908 ++ P ++ G Y+ +N ++V + S+ LG++ G + + + PY K Sbjct: 780 WFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGRQVQPYIK 839 Query: 909 LAYVYD-DSNNDADVNGDSIDNGVEGSAVRVGLGTQFSFTKNFSAYTDANYLGGGDVDQD 967 + + + D NG + + G+ +GLG + + S Y Y G + Sbjct: 840 ASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSKGPKLAMP 899 Query: 968 WSANVGVKYTW 978 W+ + G +Y+W Sbjct: 900 WTFHAGYRYSW 910
>PF06291#Lambda prophage Bor protein Length = 102 Score = 30.0 bits (67), Expect = 0.002 Identities = 21/68 (30%), Positives = 30/68 (44%), Gaps = 11/68 (16%) Query: 28 VNDKEIICSPDESNTHTFVILEGVVSLVRGDKVLIGIVQAPFIFGLADGVAKKEAQYKLI 87 V +K +P E+ TH F VS + K V A I G A+ V K E Q + Sbjct: 29 VGNKPTAVTPKETITHHFF-----VSGIGQKKT----VDAAKICGGAENVVKTETQQTFV 79 Query: 88 AESGCIGY 95 +G +G+ Sbjct: 80 --NGLLGF 85
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 50.2 bits (120), Expect = 7e-09 Identities = 68/356 (19%), Positives = 121/356 (33%), Gaps = 35/356 (9%) Query: 5 IFSLALGTFGLGMAEFGIMGVLTELARDVGITIPAAGH---MISFYAFGVVLGAPVMALF 61 + ++AL G+G+ IM VL L RD+ + H +++ YA APV+ Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 62 SSRFSLKHILLFLVMLCVMGNAIFTFSSSYLMLAVGRLVSGFPHGAFFGVGAIVLSKIIR 121 S RF + +LL + + AI + +L +GR+V+G GA + I Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD--IT 124 Query: 122 PGKVTAAVAGMVSGMTVANLVGIPFGTYLSQEFSWRYTFLLIAVFNIAVLTAIFFWVPDI 181 G A G +S +V P L FS F A N F +P+ Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 182 RDKAQGSLREQ----------FHFLRSPSPWLI--FAATMFGNAGVFAWFSYIKPFMMYI 229 + LR + + + + F + G W + Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG------E 238 Query: 230 SGFSETSMTFIMMLVGLGM---VLGNLLSGKLSGRYTPLRIAVVTDLVIVLSLMALFFFS 286 F + T + L G+ + +++G ++ R R ++ ++ + Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLG---MIADGTGYILLA 295 Query: 287 GYKTTSLTFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAF--NLGSAIG 340 + F + + P +L E G G +A +L S +G Sbjct: 296 FATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG 351
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 49.1 bits (117), Expect = 7e-08 Identities = 32/198 (16%), Positives = 71/198 (35%), Gaps = 13/198 (6%) Query: 373 TQQSHDRAQLSQWQQQLLSDTRQRDALPPLTLDLTPQALAEARALHTRQRPLRHRLAALQ 432 TQ S +A+L Q + Q+LS + + + LP L L P + R L ++ Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL------IK 192 Query: 433 GQILPKQKRQAQLQAAIARHHQEQAQYTQRLADKRLSYKTKAQELADVRTICEQ----EA 488 Q Q ++ Q + + + E+ R+ + + L D ++ + + Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKH 252 Query: 489 RIKDLESQRAHLQS--GQPCPLCGSTTHPAIAAYQALELSANQTRRDALEKEVKTLAEEG 546 + + E++ + ++A + +L + + L+K +T Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNI- 311 Query: 547 AALRGQLDALTQQLQRDE 564 L +L ++ Q Sbjct: 312 GLLTLELAKNEERQQASV 329
>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature. Length = 405 Score = 29.3 bits (65), Expect = 0.026 Identities = 14/70 (20%), Positives = 29/70 (41%), Gaps = 4/70 (5%) Query: 149 KQQQLLHAIADYYQQQYQEACQLRGERKLPVIATGHLTTVGASKSDAVRDIYIGTLDAFP 208 K+ Q+++ IA++Y +++ + E++ T D + + I A Sbjct: 135 KEAQMMNEIAEFYAAPFKKTRAIN-EKEAFECIYDSRTRSA--GKD-IVSVKINIDKAKK 190 Query: 209 AQHFPPADYI 218 + P DYI Sbjct: 191 ILNLPECDYI 200
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 98.0 bits (244), Expect = 7e-26 Identities = 34/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%) Query: 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNKLNEPWPDLILLDWMLPGGSGLQ 63 ILV +D+A IR ++ L + G+ + + + DL++ D ++P + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 FIKHLKREAMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRI 123 + +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I + Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120 Query: 124 SPMAVEEVIEMQGLSLDPGSHRVMTGDSP 152 E L D + G S Sbjct: 121 -----EPKRRPSKLEDDSQDGMPLVGRSA 144
>PF06580#Sensor histidine kinase Length = 349 Score = 36.8 bits (85), Expect = 1e-04 Identities = 17/123 (13%), Positives = 38/123 (30%), Gaps = 28/123 (22%) Query: 300 TFTFEVDDSLSVLGNEEQLRSAISNLVYNAVNH----TPAGTHITVSWRRVAHGAEFCIQ 355 F +++ ++ + + + LV N + H P G I + + ++ Sbjct: 241 QFENQINPAIM---DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVE 297 Query: 356 DNGPGIAAEHIPRLTERFYRVDKARSRQTGGSGLGLAIVKHALNH---HESRLEIDSSPG 412 + G +G GL V+ L E+++++ G Sbjct: 298 NTGSLAL------------------KNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG 339 Query: 413 KGT 415 K Sbjct: 340 KVN 342
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 69.5 bits (170), Expect = 6e-15 Identities = 35/165 (21%), Positives = 79/165 (47%), Gaps = 4/165 (2%) Query: 433 IQIVEERTIGPTLGMQNIKQGLEACLAGLVVSILFMIF-FYKKFGLIATSALVANLVLIV 491 ++I ++GP + + + + + LA VV + ++ F +F L A ALV +++L V Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194 Query: 492 GIMSLLPGATLSMPGIAGIVLTLAVAVDANVLINERIKEEL--SNGRTVQQAINEGYAGA 549 G+ ++L + +A ++ +++ V++ +R++E L ++ +N Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253 Query: 550 FSSIFDANITTLIKVIILYAVGTGAIKGFAITTGIGVATSMFTAI 594 S +TTL+ ++ + G I+GF GV T ++++ Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSV 298
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 352 bits (904), Expect = e-124 Identities = 104/309 (33%), Positives = 176/309 (56%), Gaps = 12/309 (3%) Query: 17 YDFMRWDFWAFGISGLLLIAAIVIMGVRGFNWGLDFTGGTVIEITLEKPAEMDVMREALQ 76 +DF RW + FG + +++IA++++ V G N+G+DF GGT I ++ V R AL+ Sbjct: 14 FDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAALE 73 Query: 77 KAGYEEPQLQNFGS------SHDIMVRMPPTEGETGGQVLGSKVVTIINE------ATNQ 124 + + H M+R+ E G + G++ ++N+ A + Sbjct: 74 PLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAVDP 133 Query: 125 NAAVKRIEFVGPSVGADLAQTGAMALLVALISILVYVGFRFEWRLAAGVVIALAHDVIIT 184 + E VGP V +L T +LL A + I+ Y+ RFEW+ A G V+AL HDV++T Sbjct: 134 ALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLT 193 Query: 185 LGILSLFHIEIDLTIVASLMSVIGYSLNDSIVVSDRIRENFRKIRRGTPYEIFNVSLTQT 244 +G+ ++ ++ DLT VA+L+++ GYS+ND++VV DR+REN K + ++ N+S+ +T Sbjct: 194 VGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253 Query: 245 LHRTLITSGTTLVVILMLYLFGGPVLEGFSLTMLIGVSIGTASSIYVASALALKLGMKRE 304 L RT++T TTL+ ++ + ++GG V+ GF M+ GV GT SS+YVA + L +G+ R Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRN 313 Query: 305 HMLQQKVEK 313 + +K Sbjct: 314 KEKKDPSDK 322
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 32.9 bits (75), Expect = 4e-04 Identities = 14/56 (25%), Positives = 24/56 (42%), Gaps = 5/56 (8%) Query: 3 RRADRLFQIVQILRGRRLTT-----AALLAQRLAVSERTIYRDIRDLSLSGVPVEG 53 + R +I +I+ + T L V++ T+ RDI++L L VP Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNN 57
>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature. Length = 294 Score = 493 bits (1270), Expect = e-180 Identities = 240/295 (81%), Positives = 254/295 (86%), Gaps = 9/295 (3%) Query: 1 MKKTLLAVSAALALTSSFTANAAENDQPQYLSDWWHQSVNVVGSYHTRFSPKLNNDVYLE 60 MKKTLLA A +AL+++F A AAEND+PQYLSDWWHQSVNVVGSYHTRF P++ ND YLE Sbjct: 1 MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE 60 Query: 61 YEAFAKKDWFDFYGYIDIPKTFDWGNGNDKGIWSDGSPLFMEIEPRFSIDKLTGADLSFG 120 YEAFAKKDWFDFYGYID P F GN KGIW+ GSPLFMEIEPRFSIDKLT DLSFG Sbjct: 61 YEAFAKKDWFDFYGYIDAPVFFG-GNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFG 119 Query: 121 PFKEWYFANNYIYDMGDNKASRQSTWYMGLGTDIDTGLPMGLSLNVYAKYQWQNYGASNE 180 PFKEWYFANNYIYDMG N + QSTWYMGLGTDIDTGLPM LSLNVYAKYQWQNYGASNE Sbjct: 120 PFKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNE 179 Query: 181 NEWDGYRFKVKYFVPITDLWGGKLSYIGFTNFDWGSDLGDDP--------NRTSNSIASS 232 NEWDGYRFKVKYFVP+TDLWGG LSYIGFTNFDWGSDLGDD RTSNSIASS Sbjct: 180 NEWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSIASS 239 Query: 233 HILALNYDHWHYSVVARYFHNGGQWQNGAKLNWGDGDFSAKSTGWGGYLVVGYNF 287 HILALNY HWHYS+VARYFHNGGQW + AKLN+GDG FS +STGWGGY VVGYNF Sbjct: 240 HILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF 294
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 34.3 bits (78), Expect = 0.002 Identities = 34/133 (25%), Positives = 68/133 (51%), Gaps = 15/133 (11%) Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPD- 249 LE A +E + +L R +++ ++ S+ +Q++A ++L E + + Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344 Query: 250 ENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWNARSK 308 ++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ + Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397 Query: 309 VKKDLRQAQEILD 321 V+K L +A L Sbjct: 398 VEKALEEANSKLA 410
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 115 bits (291), Expect = 8e-38 Identities = 49/88 (55%), Positives = 67/88 (76%) Query: 2 NKSQLIEKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTFAVKERAARTGR 61 NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F V+ERAAR GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEITIAAAKVPSFRAGKALKDAV 89 NPQTG+EI I A+KVP+F+AGKALKDAV Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1368 bits (3542), Expect = 0.0 Identities = 810/1033 (78%), Positives = 918/1033 (88%), Gaps = 1/1033 (0%) Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISATYPGADAKTVQDT 60 M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA YPGADA+TVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120 VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDPISRTSGVGDVQLFGS 180 EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWMNPTELTKYQLTPVDVINAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240 QYAMRIW++ L KY+LTPVDVIN +K QN Q+AAGQLGGTP + GQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 TSTDEFGKILLKVNQDGSQVRLRDVAKIELGGENYDVIAKFNGQPASGLGIKLATGANAL 300 + +EFGK+ L+VN DGS VRL+DVA++ELGGENY+VIA+ NG+PA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTATAIRAELKKMEPFFPPGMKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360 DTA AI+A+L +++PFFP GMK++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 TEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 E+ LPPKEAT KSM QIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATMLKPVAKGDHGEGKKGFFGWFNRLFDKSTHHYTDSVGNILRSTGR 540 SVLVALILTPALCAT+LKPV+ H E K GFFGWFN FD S +HYT+SVG IL STGR Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 541 YLLLYLIIVVGMAYLFVRLPSSFLPDEDQGVFLTMVQLPAGATQERTQKVLDEVTDYYLN 600 YLL+Y +IV GM LF+RLPSSFLP+EDQGVFLTM+QLPAGATQERTQKVLD+VTDYYL Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 601 KEKANVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEKNKVEAITQRATAAFSQIKD 660 EKANVESVF VNGF F+G+ QN G+AFVSLK W +R G++N EA+ RA +I+D Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLFGEVAKYPDLLVGVRPNG 720 V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQL G A++P LV VRPNG Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719 Query: 721 LEDTPQFKIDIDQEKAQALGVSISDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780 LEDT QFK+++DQEKAQALGVS+SDIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779 Query: 781 MLPDDINDWYVRGSDGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840 MLP+D++ YVR ++G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839 Query: 841 MAMMEELASKLPSGIGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFS 900 MA+ME LASKLP+GIGYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP S Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960 VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+ Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 961 VEATLEAVRMRLRPILMTSLAFMLGVMPLVISSGAGSGAQNAVGTGVLGGMVTATVLAIF 1020 VEATL AVRMRLRPILMTSLAF+LGV+PL IS+GAGSGAQNAVG GV+GGMV+AT+LAIF Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1021 FVPVFFVVVRRRF 1033 FVPVFFVV+RR F Sbjct: 1020 FVPVFFVVIRRCF 1032
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 43.3 bits (102), Expect = 1e-06 Identities = 33/216 (15%), Positives = 75/216 (34%), Gaps = 27/216 (12%) Query: 100 TYQATYDSAKGDLAKAQAAANIAELTVKRYQKLLGTQYISKQEYDQALADAQQATAAVVA 159 + Y A +L + + ++ + Q +++ ++ L +Q T + Sbjct: 256 EQENKYVEAVNELR--VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313 Query: 160 AKAAVETARINLAYTKVTSPISGRIGKSSV-TEGALVQNGQASALATVQQLDPIYVDVTQ 218 + + + +P+S ++ + V TEG +V + + V + D + V Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTALV 372 Query: 219 SSNDFLRLKQELA------------NGSLKQENGKAKVDLVTSDGIKFPQSGTLEFSDVT 266 + D + G L KV + D I+ + G + ++ Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYL-----VGKVKNINLDAIEDQRLGLVFNVIIS 427 Query: 267 VDQTTGSITLRAIFPNPDHTLLPGMFVRARLQEGTK 302 +++ S + I L GM V A ++ G + Sbjct: 428 IEENCLSTGNKNIP------LSSGMAVTAEIKTGMR 457 Score = 32.9 bits (75), Expect = 0.002 Identities = 24/133 (18%), Positives = 45/133 (33%), Gaps = 10/133 (7%) Query: 49 PLQITTELPGR-TVAYRIAEVRPQVSGIILKRNFV-EGSDIEAGVSLYQIDP-------A 99 ++I G+ T + R E++P + I+ K V EG + G L ++ Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEADTL 137 Query: 100 TYQATYDSAKGDLAKAQAAANIAELTVKRYQKLLGTQYISKQEYDQALADAQQATAAVVA 159 Q++ A+ + + Q + EL KL Y ++ L Sbjct: 138 KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFST 197 Query: 160 AKAAVETARINLA 172 + +NL Sbjct: 198 WQNQKYQKELNLD 210
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 204 bits (519), Expect = 8e-69 Identities = 187/214 (87%), Positives = 199/214 (92%) Query: 1 MARKTKQQALETRQHILDVALRLFSQQGVSATSLAEIANAAGVTRGAIYWHFKNKSDLFS 60 MARKTKQ+A ETRQHILDVALRLFSQQGVS+TSL EIA AAGVTRGAIYWHFK+KSDLFS Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EIWELSESNIGELEIEYQAKFPDDPLSVLREILVHILEATVTEERRRLLMEIIFHKCEFV 120 EIWELSESNIGELE+EYQAKFP DPLSVLREIL+H+LE+TVTEERRRLLMEIIFHKCEFV Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 121 GEMVVVQQAQRSLCLESYDRIEQTLKHCINAKMLPENLLTRRAAILMRSFISGLMENWLF 180 GEM VVQQAQR+LCLESYDRIEQTLKHCI AKMLP +L+TRRAAI+MR +ISGLMENWLF Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 181 APQSFDLKKEARAYVTILLEMYQLCPTLRASTVN 214 APQSFDLKKEAR YV ILLEMY LCPTLR N Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATN 214
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 36.2 bits (83), Expect = 7e-04 Identities = 41/219 (18%), Positives = 81/219 (36%), Gaps = 18/219 (8%) Query: 92 RQKVAQAPEKMRQ-ATAALNALSDVDNDDEMRKTLSALSLRQLELRVA--QVLDDLQNSQ 148 R ++A+A EK R+ A AA A + + + + A + RQL+L A + L L Sbjct: 129 RLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEA 188 Query: 149 NDLAAYNSQLVSLQTQPERVQNAMYTASQQI-------QQIRNRLDGNNVGEAALRPSQQ 201 + +L + Q++ ++ + T + ++ L G A + Sbjct: 189 KAVEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYK 248 Query: 202 VLLQAQQALLNAQID--------QQRKSLEGNTVLQDTLQKQRDYVTANSNRLEHQLQLL 253 L + + L D + + G +++ QKQ NR+ + + Sbjct: 249 ELDELVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQVTASETRINRINADITQI 308 Query: 254 QEAVNSKRLTLTEKTAQEAISPDETARIQANPLVKQELD 292 Q+A++ A+ + + + Q N L Q D Sbjct: 309 QKAISQVSNNRNAGIARVHEAEENLKKAQNNLLNSQIKD 347
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 35.9 bits (82), Expect = 1e-04 Identities = 29/64 (45%), Positives = 39/64 (60%), Gaps = 4/64 (6%) Query: 222 AEPGALIRQLAQGAPQYKEQLMT--IAEWLEEKGRTEGLQKGLQKGLEQGLAQGREAEAR 279 AEP +L +QLAQ Q EQ IAE ++G +G Q+GL +GLEQGLA+ + +A Sbjct: 36 AEP-SLEQQLAQLQMQAHEQGYQAGIAEG-RQQGHKQGYQEGLAQGLEQGLAEAKSQQAP 93 Query: 280 AIAR 283 AR Sbjct: 94 IHAR 97 Score = 30.1 bits (67), Expect = 0.009 Identities = 20/71 (28%), Positives = 32/71 (45%), Gaps = 12/71 (16%) Query: 233 QGAPQYKEQLMTIAEWLEEKGRTEGLQKGLQKGLEQGLAQGREAEARAIARKMLANGLEP 292 + P ++QL + E+G G+ +G Q+G +QG +G LA GLE Sbjct: 35 EAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEG------------LAQGLEQ 82 Query: 293 GLIASVTGITP 303 GL + + P Sbjct: 83 GLAEAKSQQAP 93
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 27.0 bits (59), Expect = 0.049 Identities = 19/61 (31%), Positives = 25/61 (40%), Gaps = 2/61 (3%) Query: 112 QEFERRLLAMTQE--RKIRLAQATGLVEQQTLQKEVEIYEGRLARCRHALEKIENVLARL 169 E ER LA +E RK A E + +KE+E + R E E LA L Sbjct: 125 AEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAAL 184 Query: 170 T 170 + Sbjct: 185 S 185
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 43.9 bits (103), Expect = 2e-06 Identities = 52/275 (18%), Positives = 85/275 (30%), Gaps = 34/275 (12%) Query: 366 PEPETPRQSFAPVAPTAVMTPP--QLQQPSAP-----------APQTSPAPLPASTSQVL 412 PE E Q V T + TP Q PS P AP PAP S + Sbjct: 983 PEVEKRNQ---TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039 Query: 413 AARNQLQRAQGVTKTKK--SEPAAASRARPVNNSALERLASVSERVQARPAPSALETAPV 470 A N Q ++ V K ++ +E A + R + + + E A Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQN-----------REVAKEAKSNVKANTQTNEVAQS 1088 Query: 471 KKEAYRWKATTPVVQTKEVVATPKALKKALEHEKTPELAAKLAAEAIERDPWAAQVSQLS 530 E T +TKE K K +E EKT E K+ ++ + + V + Sbjct: 1089 GSET----KETQTTETKETATVEKEEKAKVETEKTQE-VPKVTSQVSPKQEQSETVQPQA 1143 Query: 531 LPKLVEQVALNAWKEQNGNAVCLHLRSTQRHLNSSGAQQKLAQALSDLTGTTVELTIVED 590 P +N + Q+ + +S+ Q + + VE Sbjct: 1144 EPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTT 1203 Query: 591 DNPAVRTPLEWRQAIYEEKLAQARESIIADNNIQT 625 T + + ++ S+ + T Sbjct: 1204 PATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT 1238
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 46.8 bits (111), Expect = 3e-10 Identities = 17/57 (29%), Positives = 28/57 (49%), Gaps = 9/57 (15%) Query: 1 MRPKAMTVAVIIAGLLPVLWGTGAGSEVMSRI---------TAPLLSLFIIPAAYKL 48 +RP MT I G+LP+ GAGS + + +A LL++F +P + + Sbjct: 971 LRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 29.1 bits (65), Expect = 0.039 Identities = 30/151 (19%), Positives = 47/151 (31%), Gaps = 15/151 (9%) Query: 128 GAESVIPAITGLTTTAGVFDSTGLLSLSQRPARLG--ILGGGYIGLEFASMFANFGTKVT 185 GA+ I A+ F + G + + + G I E S F V Sbjct: 136 GADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPS---KFKDSVN 192 Query: 186 IFEAAPQFLPREDRDIAQAITRILQEKGVELILNANVQAVSSKEGAVQVETPEGAHLVDA 245 + L D A + ++ + + S+E + V+ P A L Sbjct: 193 LVLQ----LRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQE--IAVQKPRVADLTRL 246 Query: 246 LLVASGRKPATAGLQLQNAGVAVNERGGIIV 276 + T A V +NER G IV Sbjct: 247 MAEIENLTVETD----TPAKVVINERTGTIV 273
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 36.7 bits (85), Expect = 1e-04 Identities = 25/192 (13%), Positives = 68/192 (35%), Gaps = 6/192 (3%) Query: 62 LQAQRALSTWQDPSIAASKLELKALEAQLRDLIDKRNARVQILDDFNDLYHLRLGPLMSR 121 L A+ Q S+ ++LE + L I+ L D ++ ++ Sbjct: 130 LGAEADTLKTQS-SLLQARLEQTRYQI-LSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187 Query: 122 ILELRKQLAVSMQRKQEAE--IKRREKDYQSCLQFISQAVDQLATLKQQWTGLNAASREA 179 +++Q + +K + E + ++ + + L I++ + K + ++ + Sbjct: 188 TSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQ 247 Query: 180 VGIRQRIQQQTELITALLAEIRELEADFSHQDDSAFRQAQENAEQDYHQYREQQQEAQFR 239 + + +Q + E+R ++ Q +S A+E + ++ + + + R Sbjct: 248 AIAKHAVLEQENKYVEAVNELRVYKSQLE-QIESEILSAKEEYQLVTQLFKNEILD-KLR 305 Query: 240 YARDQRLSADER 251 D Sbjct: 306 QTTDNIGLLTLE 317
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 31.4 bits (71), Expect = 0.006 Identities = 11/60 (18%), Positives = 25/60 (41%), Gaps = 5/60 (8%) Query: 373 RYQQLRQGENPVGSLGMFGGYSGGYQRYDNNEADGNGNHNNLTVGVDYQLNEQVLLGGLI 432 RY++ +P+G +G F + + +T G Y++N+ + G++ Sbjct: 52 RYEED---NSPLGVIGSFTYTEKSRTASSGD--YNKNQYYGITAGPAYRINDWASIYGVV 106
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 145 bits (367), Expect = 4e-39 Identities = 72/254 (28%), Positives = 120/254 (47%), Gaps = 19/254 (7%) Query: 89 QALCAERQDSLELLIGAQGSLQESLRQCKAAISYPGAGLPLLLRGPTGTGKSFLARQLWQ 148 L + QD + L +G ++QE R + L L++ G +GTGK +AR Sbjct: 127 SKLEDDSQDGMPL-VGRSAAMQEIYRVLARLMQTD---LTLMITGESGTGKELVAR---- 178 Query: 149 YAIEQGILPPEAPFTVFNCAEYANNPELLTSKLFGHAKGAFTGADRAVSGLIETSNGGVL 208 A+ PF N A A +L+ S+LFGH KGAFTGA +G E + GG L Sbjct: 179 -ALHDYGKRRNGPFVAINMA--AIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTL 235 Query: 209 FIDEVHRLPPEGQEKLFHFMDNGTWRRLGESSDERSATVRLIFASTEDLEK-----HFLA 263 F+DE+ +P + Q +L + G + +G + VR++ A+ +DL++ F Sbjct: 236 FLDEIGDMPMDAQTRLLRVLQQGEYTTVG-GRTPIRSDVRIVAATNKDLKQSINQGLFRE 294 Query: 264 TFIRRIPVI-VKILPIAERGQYERLAFIHHFFRREAQRLHHDLSLDSEIISQLMQETLEG 322 R+ V+ +++ P+ +R + + + HF ++ + D E + + G Sbjct: 295 DLYYRLNVVPLRLPPLRDRAE-DIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPG 353 Query: 323 NVGGLENLIRNICA 336 NV LENL+R + A Sbjct: 354 NVRELENLVRRLTA 367
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 45.7 bits (108), Expect = 1e-07 Identities = 35/139 (25%), Positives = 60/139 (43%), Gaps = 5/139 (3%) Query: 197 AREREQGTLDQLLVSPLTTWQIFVGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256 R Q T + +L + L I +G+ A A IG+ A + + L+L Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148 Query: 257 YFTMVI--YGLSLVGFGLLISSLCATQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314 Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 315 LTWINPIRHFTDITKQIYL 333 P+ H D+ + I L Sbjct: 209 AARFLPLSHSIDLIRPIML 227
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.009 Identities = 22/89 (24%), Positives = 29/89 (32%), Gaps = 21/89 (23%) Query: 294 PRFEDAFIDLLGGAGTSESPLGSILHTVEGTAGETVIEAQELTKKFGDFAATDHVNFVVQ 353 PR E + +LG P + Q + K HV V++ Sbjct: 548 PRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYI----LMGHVARVME 590 Query: 354 RGEIFG----LLGPNGAGKSTTFKMMCGL 378 G F L G G GKST + GL Sbjct: 591 PGCKFDYSVVLEGTGGIGKSTLINTLVGL 619 Score = 29.7 bits (66), Expect = 0.044 Identities = 11/23 (47%), Positives = 13/23 (56%) Query: 34 YVTGLVGPDGAGKTTLMRMLAGL 56 Y L G G GK+TL+ L GL Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 60.6 bits (147), Expect = 2e-12 Identities = 44/262 (16%), Positives = 97/262 (37%), Gaps = 27/262 (10%) Query: 79 YENALMQAKAGVSVAQAQYDLMLAGYRDEEIAQAAAAVRQAQAAYDYAQNFYNRQQGLWK 138 ++N Q + + +A+ +LA E R + + + L + Sbjct: 198 WQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQ 257 Query: 139 SRTISA--NDLENARSSRDQAQATLKSAQDKLSQYRTGNREQDI----AQAKASLEQAKA 192 N+L +S +Q ++ + SA+++ T + +I Q ++ Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL-VTQLFKNEILDKLRQTTDNIGLLTL 316 Query: 193 QLAQAQLDLQDTTLIAPANGTLLTRAV-EPGSMLNAGSTVLTLSLT-RPVWVRAYVDERN 250 +LA+ + Q + + AP + + V G ++ T++ + + V A V ++ Sbjct: 317 ELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKD 376 Query: 251 LSQTQPGRDILLYTDGRPDKPYH---GKIGFVSPTAEFTPKTVETPDLRTDLVYRLRIIV 307 + G++ ++ + P Y GK+ ++ A D R LV+ + I + Sbjct: 377 IGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISI 428 Query: 308 T-------DADDALRQGMPVTV 322 + + L GM VT Sbjct: 429 EENCLSTGNKNIPLSSGMAVTA 450
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 65.8 bits (160), Expect = 2e-15 Identities = 32/224 (14%), Positives = 72/224 (32%), Gaps = 25/224 (11%) Query: 6 TTTKGEQAKSQLIAAALAQFGEYGLHATT-RDIAALAGQNIAAITYYFGSKEDLYLACAQ 64 T + ++ + ++ AL F + G+ +T+ +IA AG AI ++F K DL+ + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64 Query: 65 WIADFLGEKFRPHAEKAERLFSQPAPD-RDAIRELILLACKNMIMLLTQEDTVNLSKFIS 123 +GE E + P R+ + ++ L E + +F+ Sbjct: 65 LSESNIGELEL---EYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV- 120 Query: 124 REQLSPTSAYQLVHEQVIDPLHTHLTRLVAA---YTGCDANDTRMILHTHALLGEVLAFR 180 E A + + + D + L + A +I+ + ++ Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM--RGYISGLM--- 175 Query: 181 LGKETILLRTGWPQFDEEKAELIYQTVTCHIDLILHGLTQRSLD 224 W + + ++ ++L Sbjct: 176 ---------ENWLFAPQSFDLK--KEARDYVAILLEMYLLCPTL 208
>SECA#SecA protein signature. Length = 901 Score = 29.8 bits (67), Expect = 0.024 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%) Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGDIRVLVATDI 304 Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++ Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506 Query: 305 AARGLDI 311 A RG DI Sbjct: 507 AGRGTDI 513
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 387 bits (995), Expect = e-136 Identities = 126/350 (36%), Positives = 204/350 (58%), Gaps = 4/350 (1%) Query: 2 SEKTEQPTEKKLRDGRKEGQVVKSIEITSLFQLIALYLYFHFFTEKMILILIESITFTLQ 61 EKTEQPT KK+RD RK+GQV KS E+ S ++AL ++ + + + Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62 Query: 62 LVNKPFSYALTQL-SHALIESLTSALLFLGAGVIVATVGSVFLQVGVVIASKAIGFKSEH 120 PFS AL+ + + L+E L ++A + S +Q G +I+ +AI + Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMA-IASHVVQYGFLISGEAIKPDIKK 121 Query: 121 INPVSNFKQIFSLHSVVELCKSSLKVIMLSLIFAFFFYYYASTFRALPYCGLACGVLVVS 180 INP+ K+IFS+ S+VE KS LKV++LS++ T LP CG+ C ++ Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181 Query: 181 SLIKWLWVGVMVFYIVVGILDYSFQYYKIRKDLKMSKDDVKQEHKDLEGDPQMKTRRREM 240 +++ L V V ++V+ I DY+F+YY+ K+LKMSKD++K+E+K++EG P++K++RR+ Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241 Query: 241 QSEIQSGSLAQSVKQSVAVVRNPTHIAVCLGYHPTDMPIPRVLEKGSDAQANYIVNIAER 300 EIQS ++ ++VK+S VV NPTHIA+ + Y + P+P V K +DAQ + IAE Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301 Query: 301 NCIPVVENVELARSLFFEVERGDKIPETLFEPVAALLRMVMK--IDYAHS 348 +P+++ + LAR+L+++ IP E A +LR + + I+ HS Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQHS 351
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 76.5 bits (188), Expect = 3e-17 Identities = 48/194 (24%), Positives = 84/194 (43%), Gaps = 3/194 (1%) Query: 8 LVWLAGLSVLGFLATDMYLPAFAAIQADLQTPAAAVSASLSLFLAGFAVAQLLWGPLSDR 67 L+WL LS L + + I D P A+ + + F+ F++ ++G LSD+ Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75 Query: 68 YGRKPILLLGLSIFALGSLGMLWVESAAALLTL-RFVQAVGVCAATVIWQALVTDYYPSQ 126 G K +LL G+ I GS+ S +LL + RF+Q G A + +V Y P + Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135 Query: 127 KINRIFATIMPLVGLSPALAPLLGSWILTHFSWQAIFATLFVITLLLMLPALRLKPSGKA 186 + F I +V + + P +G I + W + L + ++ +P L + Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEV 193 Query: 187 RTEGQDKLTFATLL 200 R +G + L+ Sbjct: 194 RIKGHFDIKGIILM 207
>adhesinb#Adhesin B signature. Length = 310 Score = 27.9 bits (62), Expect = 0.004 Identities = 10/48 (20%), Positives = 18/48 (37%), Gaps = 6/48 (12%) Query: 7 MQKCSLITVLSLSVLMLAGCTTTYTMTTRTGEIIETQGKPEVDTATGM 54 M+KC + +L L+ + LA C++ K V + Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQ------KSSTETGSSKLNVVATNSI 42
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 45.4 bits (107), Expect = 3e-08 Identities = 14/115 (12%), Positives = 40/115 (34%), Gaps = 5/115 (4%) Query: 6 SRTPGRPRQFDPEQAIETAQHLFHSRGYDAVSVADLTKAFGINPPSFYAAFGSKLGLYTR 65 +T ++ + ++ A LF +G + S+ ++ KA G+ + Y F K L++ Sbjct: 3 RKTKQEAQE-TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61 Query: 66 VLK----RYRMTDAIPLGALLRHDRPTAKCLIDVLMEAARRYAADPDATGCLVLE 116 + + + + ++ ++E+ + + Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHK 116
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 86.3 bits (213), Expect = 2e-22 Identities = 66/248 (26%), Positives = 110/248 (44%), Gaps = 22/248 (8%) Query: 7 KSVLVLGGSRGIGAAIVRRFSADGASVV-FSYAG-------SREAAEKLATETGSIAIQT 58 K + G ++GIG A+ R ++ GA + Y S AE E ++ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 59 DSADRDAVISLVREYGPLDILVVNAGVALFGDALEQDSDAIDRLFRINIHAPYHASVEAA 118 +A + + RE GP+DILV AGV G + + F +N ++AS + Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 119 RNMP--EGGRIIIIGSVNGDRMPVPGMAAYAASKSALQGLARGLARDFGPRGITINVVQP 176 + M G I+ +GS N +P MAAYA+SK+A + L + I N+V P Sbjct: 129 KYMMDRRSGSIVTVGS-NPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 177 GPIDTDI--------NPEDGPMKELMHSF---MAIKRHGRPEEVAGMVAWLAGPEASFVT 225 G +TD+ N + +K + +F + +K+ +P ++A V +L +A +T Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 226 GAMHTIDG 233 +DG Sbjct: 248 MHNLCVDG 255
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.8 bits (67), Expect = 0.026 Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 1/58 (1%) Query: 128 TPFSTFIIISLLCGFAGANF-ASSMANISFFFPKQKQGGALGLNGGLGNMGVSVMQLV 184 + FS I+ + G A F A M ++ + PK+ +G A GL G + MG V + Sbjct: 101 SFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAI 158
>PF06580#Sensor histidine kinase Length = 349 Score = 51.4 bits (123), Expect = 4e-09 Identities = 30/123 (24%), Positives = 54/123 (43%), Gaps = 17/123 (13%) Query: 473 SARFGFTVKLDYQLPPRL----VPSHQAIHLLQIAREALSNALKH-----SHADDVVVTV 523 S +F ++ + Q+ P + VP L+Q E N +KH +++ Sbjct: 233 SIQFEDRLQFENQINPAIMDVQVPPM----LVQTLVE---NGIKHGIAQLPQGGKILLKG 285 Query: 524 TQCGKQVKLKVQDNGCGVPENAERSNHYGMIIMRDRAQSLRG-DCQVRRRETGGTEVTVT 582 T+ V L+V++ G +N + S G+ +R+R Q L G + Q++ E G + Sbjct: 286 TKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 583 FIP 585 IP Sbjct: 346 LIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 72.2 bits (177), Expect = 6e-17 Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 2/117 (1%) Query: 7 ATILLIDDHPMLRTGVKQLVSMAPDISVVGEASNGEQGIDLAESLDPDLILLDLNMPGMN 66 ATIL+ DD +RT + Q +S A V SN + D DL++ D+ MP N Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 67 GLETLDKLREKALSGRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALQQA 123 + L ++++ ++V S N + A ++GA YL K + +L+ + +A Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118
>INTIMIN#Intimin signature. Length = 939 Score = 247 bits (632), Expect = 4e-75 Identities = 126/444 (28%), Positives = 216/444 (48%), Gaps = 24/444 (5%) Query: 22 SFSLSLLLLAASGTIRAQAQDPFTQNRL----PDLGMMPESHEGEKHFAEMAKAFGEASM 77 F S L L S + A N+L PD+ + + ++A A + + Sbjct: 118 PFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYAAQQAASLGSQL 177 Query: 78 KNNDLDTGEQARQFAFGQVRDVVSEQVNQQLESWLSAWGSASVDINVDNEGHFNGSRGSW 137 ++ L+ G+ A+ A G + Q + QL++WL +G+A V++ N F+GS + Sbjct: 178 QSRSLN-GDYAKDTALG----IAGNQASSQLQAWLQHYGTAEVNLQSGNN--FDGSSLDF 230 Query: 138 FIPLQDKQRYLTWSQLGLTQQTDGLVSNIGVGQRWAQDGWLLGYNTFYDNLLDENLQRAG 197 +P D ++ L + Q+G +N+G GQR+ +LGYN F D + R G Sbjct: 231 LLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLG 290 Query: 198 FGAEAWGEYLRLSANYYQPFADWQT--HTATLEQRMARGYDINAQMRLPFYQHINTSVSL 255 G E W +Y + S N Y + W + ++R A G+DI LP Y + + Sbjct: 291 IGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMY 350 Query: 256 EQYFGDSVDLFDSGTGYHNPVALKLGLNYTPVPLLTMTAQHKQGESGVSQNNLGLTLNYR 315 EQY+GD+V LF+S NP A +G+NYTP+PL+TM ++ G + + Y+ Sbjct: 351 EQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQ 410 Query: 316 FGVPLKKQLAASEVAQSQSLRGSRYDTPQRNSLPTMEYRQRKTLTVFLATPPWDLTPGET 375 F P +Q+ V + ++L GSRYD QRN+ +EY+++ L++ + + T T Sbjct: 411 FDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTERST 469 Query: 376 VALKLQVRSVHGIRHLSWQGDTQALSLTAG----TDTRSTEGWTIIMPAWDHREGAANRW 431 ++L V+S +G+ + W D AL G + ++S + + I+PA+ +G +N + Sbjct: 470 QKIQLIVKSKYGLDRIVW--DDSALRSQGGQIQHSGSQSAQDYQAILPAY--VQGGSNVY 525 Query: 432 RLSVVVEDEKGQRVSSNEITLALT 455 +++ D G SSN + L +T Sbjct: 526 KVTARAYDRNGN--SSNNVLLTIT 547
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 420 bits (1082), Expect = e-149 Identities = 101/351 (28%), Positives = 179/351 (50%), Gaps = 14/351 (3%) Query: 7 DDKTEAPTPHRLEKAREEGQIPRSRELTSLLILLVGVCIIWFGGESLARQLAGMLSAGLH 66 +KTE PTP ++ AR++GQ+ +S+E+ S +++ ++ + + ++ + Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLML--IP 60 Query: 67 FDHRMVNDPNLILGQIILLIKAAMMALLPLIAGVVLVALISPVMLGGLIFSGKSLQPKFS 126 + + + + ++ PL+ L+A+ S V+ G + SG++++P Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120 Query: 127 KLNPLPGIKRMFSAQTGAELLKAVLKSTLVGCVTGFYLWHHWPQMMRLMAESPIVAMGNA 186 K+NP+ G KR+FS ++ E LK++LK L+ + + + +++L P + Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQL----PTCGIECI 176 Query: 187 LDLVGLCALLVVLGVIPMVGF------DVFFQIFSHLKKLRMSRQDIRDEFKESEGDPHV 240 L+G +L L VI VGF D F+ + ++K+L+MS+ +I+ E+KE EG P + Sbjct: 177 TPLLG--QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEI 234 Query: 241 KGKIRQMQRAAAQRRMMEDVPKADVIVTNPTHYSVALQYDENKMSAPKVVAKGAGLIALR 300 K K RQ + R M E+V ++ V+V NPTH ++ + Y + P V K Sbjct: 235 KSKRRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQT 294 Query: 301 IREIGAEHRVPTLEAPPLARALYRHAEIGQQIPGQLYAAVAEVLAWVWQLK 351 +R+I E VP L+ PLARALY A + IP + A AEVL W+ + Sbjct: 295 VRKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 88.7 bits (220), Expect = 7e-24 Identities = 29/105 (27%), Positives = 51/105 (48%), Gaps = 3/105 (2%) Query: 7 KFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGVDALNKLQAGGFGFIISDWNMPNMDGL 66 LV DD + +R ++ L G++ V + + AG +++D MP+ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 67 ELLKTIRADSAMSALPVLMVTAEAKKENIIAAAQAGASGYVVKPF 111 +LL I+ LPVL+++A+ I A++ GA Y+ KPF Sbjct: 64 DLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 65.6 bits (160), Expect = 7e-14 Identities = 31/142 (21%), Positives = 62/142 (43%), Gaps = 6/142 (4%) Query: 1 MSKIRVLSVDDSALMRQIMTEIINSHSDMEMVATAPDPLVARDLIKKFNPDVLTLDVEMP 60 M+ +L DD A +R ++ + ++ V + I + D++ DV MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 RMDGLDFLEKLMRLRPMPVVMVSSLTGKGS-EVTLRALELGAIDFVTKPQLGIREGMLAY 119 + D L ++ + RP V+V ++ + + ++A E GA D++ KP + E + Sbjct: 59 DENAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGII 115 Query: 120 SEMIAEKVRTAARARIAAHKPM 141 +AE R ++ + M Sbjct: 116 GRALAEPKRRPSKLEDDSQDGM 137
>PF06580#Sensor histidine kinase Length = 349 Score = 41.8 bits (98), Expect = 7e-06 Identities = 23/151 (15%), Positives = 49/151 (32%), Gaps = 52/151 (34%) Query: 378 ELDKSLIERIIDPLT--HLVRNSLDHGIEMPEKRLEAGKNVVGNLILSAEHQGGNICIEV 435 +++ ++++ + P+ LV N + HGI G ++L G + +EV Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEV 296 Query: 436 TDDGAGLNRERILAKAMSQGMAVNENMTDDEVGMLIFAPGFSTAEQVTDVSGRGVGMDVV 495 + G+ + G G+ V Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318 Query: 496 KRNIQEMGG---HVEIQSKQGSGTTIRILLP 523 + +Q + G +++ KQG +L+P Sbjct: 319 RERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 42.2 bits (99), Expect = 1e-06 Identities = 25/118 (21%), Positives = 46/118 (38%), Gaps = 11/118 (9%) Query: 162 FKTGSAEVEPYMRDILRAIAPVL---NGIPNRISLAGHTDDFPYANGEKGYSNWELSADR 218 F A ++P + L + L + + + G+TD G Y N LS R Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI----GSDAY-NQGLSERR 277 Query: 219 ANASRRELVAGGLDNGKVLRVVGMAATMRLSDRGPDDAINRR--ISLLVLNKQAEQAI 274 A + L++ G+ K+ GM + ++ D+ R I L +++ E + Sbjct: 278 AQSVVDYLISKGIPADKI-SARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334
>PF05844#YopD protein Length = 295 Score = 31.9 bits (72), Expect = 0.002 Identities = 12/28 (42%), Positives = 21/28 (75%), Gaps = 2/28 (7%) Query: 76 MDLLALLYRLMAKSRQQGMFSLERDIEN 103 ++LL +L+R+ K+R+ G+ L+RD EN Sbjct: 74 VELLLILFRIAQKARELGV--LQRDNEN 99
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.4 bits (66), Expect = 0.027 Identities = 10/53 (18%), Positives = 17/53 (32%), Gaps = 2/53 (3%) Query: 184 RFTLLPIFRIPVKMQKVSAASPLTQKPDQARRRF--RLGMLVFIGMIGWALLT 234 R L R + + + A L + P R R M + ++L Sbjct: 26 RKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLG 78
>PF01206#SirA family protein Length = 76 Score = 92.5 bits (230), Expect = 6e-29 Identities = 16/71 (22%), Positives = 37/71 (52%) Query: 7 DYRLDMVGEPCPYPAVATLEAMPQLKKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQ 66 D LD G CP P + + + + GE+L V++ P S+ + ++ G+ +L+ ++ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 67 DGPTIRYLIQK 77 + T + +++ Sbjct: 65 EDGTYHFRLKR 75
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 328 bits (843), Expect = e-117 Identities = 224/245 (91%), Positives = 233/245 (95%) Query: 1 MRRLLFLSLAGLWLFSPAAAAQLPGLISQPLAGGGQSWSLSVQTLVFITSLTFLPAILLM 60 MRRLL ++ LWL +P A AQLPG+ SQPL GGGQSWSL VQTLVFITSLTF+PAILLM Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60 Query: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEQK 120 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSE+K Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120 Query: 121 ISMQEALDKGAQPLRAFMLRQTREADLALFARLANSGPLQGPEAVPMRILLPAYVTSELK 180 ISMQEAL+KGAQPLR FMLRQTREADL LFARLAN+GPLQGPEAVPMRILLPAYVTSELK Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180 Query: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLMGSLA 240 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLL+GSLA Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 Query: 241 QSFYS 245 QSFYS Sbjct: 241 QSFYS 245
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 67.5 bits (165), Expect = 1e-18 Identities = 23/78 (29%), Positives = 42/78 (53%) Query: 4 ESVMMMGTEAMKVALALAAPLLLVALITGLIISILQAATQINEMTLSFIPKIVAVFIAII 63 + ++ G +A+ + L L+ +VA I GL++ + Q TQ+ E TL F K++ V + + Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 VAGPWMLNLLLDYVRTLF 81 + W +LL Y R + Sbjct: 62 LLSGWYGEVLLSYGRQVI 79
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 213 bits (543), Expect = 5e-71 Identities = 231/260 (88%), Positives = 246/260 (94%) Query: 1 MIQVTSEQWLYWLHLYFWPLLRVLALISTAPILSERAIPKRVKLGLGIMITLVIAPSLPA 60 M+QVTSEQWL WL+LYFWPLLRVLALISTAPILSER++PKRVKLGL +MIT IAPSLPA Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 Query: 61 NDTPLFSIAALWLAMQQILIGIALGFTMQFAFAAVRTAGEFIGLQMGLSFATFVDPGSHL 120 ND P+FS ALWLA+QQILIGIALGFTMQFAFAAVRTAGE IGLQMGLSFATFVDP SHL Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120 Query: 121 NMPVLARIMDMLAMLLFLTFNGHLWLISLLVDTFHTLPIGSNPVNSNAFMALARAGGLIF 180 NMPVLARIMDMLA+LLFLTFNGHLWLISLLVDTFHTLPIG P+NSNAF+AL +AG LIF Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180 Query: 181 LNGLMLALPVITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGIMLMAALMPLIAPFC 240 LNGLMLALP+ITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGI LMAALMPLIAPFC Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 Query: 241 EHLFSEIFNLLADIVSEMPI 260 EHLFSEIFNLLADI+SE+P+ Sbjct: 241 EHLFSEIFNLLADIISELPL 260
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 31.7 bits (72), Expect = 0.006 Identities = 23/133 (17%), Positives = 47/133 (35%), Gaps = 20/133 (15%) Query: 87 VLKAIRDAGICAEANSQYEVRKCLEIGFRGDQIVFNGVVKKPADLEYAIANDLYLINVDS 146 + AI A N + E E G++G ++ G DLE + L Sbjct: 46 IWSAIGATDGFALLNLE-EAITLRERGWKGPILMLEGFFH-AQDLEIYDQHRLTT----C 99 Query: 147 LYELEHIDAIS-RKLKKVANVCVRVEPNVPSATHAELVTAFHAKSGLDLEQAEETCRRIL 205 ++ + A+ +LK ++ ++V + + + G ++ +++ Sbjct: 100 VHSNWQLKALQNARLKAPLDIYLKVN------------SGMN-RLGFQPDRVLTVWQQLR 146 Query: 206 AMPYVHLRGLHMH 218 AM V L H Sbjct: 147 AMANVGEMTLMSH 159
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 348 bits (894), Expect = e-118 Identities = 121/371 (32%), Positives = 185/371 (49%), Gaps = 24/371 (6%) Query: 122 NMSGVRRLQEQVVELNQLLYADHHE---KHHAIITENPEMLSNIAKAKRLAASNIPVTIV 178 +++ + + + + + + + ++ + M RL +++ + I Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT 166 Query: 179 GETGTGKELFSRLIHQCSKRANKPFIALNCGALPPTLIESTLFGTVRGAYTGAENS-QGY 237 GE+GTGKEL +R +H KR N PF+A+N A+P LIES LFG +GA+TGA+ G Sbjct: 167 GESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGR 226 Query: 238 LELANGGTLFLDELNAMPIEMQSKLLRFLQDKTFWRLGGQQQLHSDVRIVAAMNEAPVKL 297 E A GGTLFLDE+ MP++ Q++LLR LQ + +GG+ + SDVRIVAA N+ + Sbjct: 227 FEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQS 286 Query: 298 IQQERLRADLFYRLSVGMLTLPPLRARPEDIPLLANYFIDKYRNDVPQDIHGLSETARAD 357 I Q R DL+YRL+V L LPPLR R EDIP L +F+ + + D+ + A Sbjct: 287 INQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALEL 345 Query: 358 LLNHAWPGNVRMLENAIVRSMIMQEKDGLLKHIIF-------------------EQDELN 398 + H WPGNVR LEN + R + +D + + II ++ Sbjct: 346 MKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSIS 405 Query: 399 LGVPETAPENPLPSSPDPQYEGSLEVRVANYERHLIETALDTHQGNIAAAARSLNVSRTT 458 V E + G + +A E LI AL +GN AA L ++R T Sbjct: 406 QAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNT 465 Query: 459 LQYKVQKYAIR 469 L+ K+++ + Sbjct: 466 LRKKIRELGVS 476
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 33.6 bits (76), Expect = 0.002 Identities = 13/37 (35%), Positives = 24/37 (64%), Gaps = 2/37 (5%) Query: 469 KDVDQQYLDFLDSLRND-DAKAVLFQNEM-ENLEMHN 503 K +D ++L+ + SL +D D+ +LF + E LE++N Sbjct: 186 KSLDPEFLNLIKSLSDDSDSSDLLFSQKFKEKLELNN 222
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 398 bits (1024), Expect = e-144 Identities = 237/251 (94%), Positives = 248/251 (98%) Query: 1 MKLRLSALALGTTLLVGCASSGTEQQGRSDPFEGFNRTMYNFNFNVLDPYVVRPVAVAWR 60 MKLRLSALALGTTLLVGCASSGT+QQGRSDP EGFNRTMYNFNFNVLDPY+VRPVAVAWR Sbjct: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60 Query: 61 DYVPQPARNGLSNFTGNLEEPAIMVNYFLQGDPYQGMVHFTRFFLNTLLGMGGFIDVAGM 120 DYVPQPARNGLSNFTGNLEEPA+MVNYFLQGDPYQGMVHFTRFFLNT+LGMGGFIDVAGM Sbjct: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120 Query: 121 ANPKLQRVEPHRFGSTLGHYGVGYGPYMQLPFYGSFTLREDGGDMADTLYPVLSWLTWPM 180 ANPKLQR EPHRFGSTLGHYGVGYGPY+QLPFYGSFTLR+DGGDMAD LYPVLSWLTWPM Sbjct: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTWPM 180 Query: 181 SIGKWTIEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGKLKPQENPNAQA 240 S+GKWT+EGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGG+LKPQENPNAQA Sbjct: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240 Query: 241 IQDELKEIDSE 251 IQD+LK+IDSE Sbjct: 241 IQDDLKDIDSE 251
>PF06580#Sensor histidine kinase Length = 349 Score = 28.7 bits (64), Expect = 0.030 Identities = 22/113 (19%), Positives = 46/113 (40%), Gaps = 12/113 (10%) Query: 199 WIIATMVWMFPAAGGAKIVVIILMTWLIALGDTTHIVVGSVEILYLV-FNGTLPWSDFLW 257 I W+ G I+ ++ +I + V + I L+ F T P + F Sbjct: 61 SFIKRQGWLKLNMG-QIILRVLPACVVIGM----VWFVANTSIWRLLAFINTKPVA-FTL 114 Query: 258 PFALPTLAGNICGGTFIFALMSHAQIRNDMSNKRKEEARLRGERLERERKKAE 310 P AL ++ N+ TF+++L+ K ++A + ++ ++A+ Sbjct: 115 PLAL-SIIFNVVVVTFMWSLLYFGWHFF----KNYKQAEIDQWKMASMAQEAQ 162
>OMPTIN#Omptin serine protease signature. Length = 317 Score = 477 bits (1230), Expect = e-174 Identities = 149/320 (46%), Positives = 213/320 (66%), Gaps = 11/320 (3%) Query: 1 MKKHAIAVMMIAVFSESVYAESALFIPDVSPDSVTTSLSVGVLNGKSRELVYD-TDTGRK 59 M+ + +++ + S +A + +PD++ +S+G L+GK++E VY + GRK Sbjct: 1 MRAKLLGIVLTTPIAISSFASTET--LSFTPDNINADISLGTLSGKTKERVYLAEEGGRK 58 Query: 60 LSQLDWKIKNVATLQGDLSWEPYSFMTLDARGWTSLASGSGHMVDHDWMSSEQPG-WTDR 118 +SQLDWK N A ++G ++W+ +++ A GWT+L S G+MVD DWM S PG WTD Sbjct: 59 VSQLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVDQDWMDSSNPGTWTDE 118 Query: 119 SIHPDTSVNYANEYDLNVKGWLLQGDNYKAGVTAGYQETRFSWTARGGSYIYDNGR---- 174 S HPDT +NYANE+DLN+KGWLL NY+ G+ AGYQE+R+S+TARGGSYIY + Sbjct: 119 SRHPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRD 178 Query: 175 YIGNFPHGVRGIGYSQRFEMPYIGLAGDYRINDFECNVLFKYSDWVNAHDNDEHY--MRK 232 IG+FP+G R IGY QRF+MPYIGL G YR DFE FKYS WV + DNDEHY ++ Sbjct: 179 DIGSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVESSDNDEHYDPGKR 238 Query: 233 LTFREKTENSRYYGASIDAGYYITSNAKIFAEFAYSKYEEGKGGTQIIDKTSGDTAYFGG 292 +T+R K ++ YY +++AGYY+T NAK++ E A+++ KG T + D + +T+ + Sbjct: 239 ITYRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHNN-NTSDYSK 297 Query: 293 DAAGIANNNYTVTAGLQYRF 312 + AGI N N+ TAGL+Y F Sbjct: 298 NGAGIENYNFITTAGLKYTF 317
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 248 bits (636), Expect = 4e-80 Identities = 120/474 (25%), Positives = 192/474 (40%), Gaps = 73/474 (15%) Query: 7 SILLIDDDVDVLDAYTQMLEQAGYRVRGFTHPFEAKEWVKADWEGIVLSDVCMPGCSGID 66 +IL+ DDD + Q L +AGY VR ++ W+ A +V++DV MP + D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 67 LMTLFHQDDDQLPILLITGHGDVPMAVDAVKKGAWDFLQKPVDPGKLLILIEDALRQRRS 126 L+ + LP+L+++ A+ A +KGA+D+L KP D +L+ +I AL + + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 127 VIARRQYCQQTLQVELIGRSEWMNQFRQRLQQLAETDIAVWFYGEHGTGRMTGARYLHQL 186 ++ + Q L+GRS M + + L +L +TD+ + GE GTG+ AR LH Sbjct: 125 RPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183 Query: 187 GRNAKGPFVRYELT--PENAGQLETF-----------------IDQAQGGTLVLSHPEYL 227 G+ GPFV + P + + E F +QA+GGTL L + Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243 Query: 228 TREQQHHLAR-LQSLEHRP----------FRLVGVGSASLVEQAAANQIAAELYYCFAMT 276 + Q L R LQ E+ R+V + L + +LYY + Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303 Query: 277 QIACQSLSQRPDDIEPLFRHYLRKACLRLNHPVPEIAGELLKGIMRRAWPSNVRELANAA 336 + L R +DI L RH++++A + V E L+ + WP NVREL N Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELENLV 362 Query: 337 ELFAV-----------------------------------GVLPLAETVNPQLL------ 355 + E Q Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422 Query: 356 LQEPTPLDRRVEEYERQIITEALNIHQGRINEVAEYLQIPRKKLYLRMKKYGLS 409 L DR + E E +I AL +G + A+ L + R L ++++ G+S Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 34.4 bits (79), Expect = 8e-04 Identities = 72/429 (16%), Positives = 140/429 (32%), Gaps = 45/429 (10%) Query: 28 IQALLSVFLGYLAYYIVRNNFTLSTPYLKEQLDLSATQI---GLLSSCMLIAYGISKGVM 84 I L +V L + ++ P L L S G+L + + V+ Sbjct: 8 IVILSTVALDAVGIGLI----MPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVL 63 Query: 85 SSLADKASPKVFMACGLVLCAIVNVGLGFSSAFWIFAALVVFNGLFQGMGVGPSFITIAN 144 +L+D+ + + L A+ + + W+ + G+ G IA+ Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIAD 122 Query: 145 WFPRRERGRVGAFWNISHNVGGGIVA-PIVGAAFAILGSEHWQSASYIVPACVAVIFALI 203 ER R F +S G G+VA P++G A + A + + L Sbjct: 123 ITDGDERAR--HFGFMSACFGFGMVAGPVLGGLMGGFSPH----APFFAAAALNGLNFLT 176 Query: 204 VLVLGKGSPRKEGLPSLEQMMPEEKVVLKTKNTAKAPENMSAWQIFCTYVLRNKNAWYIS 263 L +PE + + P A ++ + Sbjct: 177 GCFL----------------LPE------SHKGERRPLRREALNPLASFRWARGMTVVAA 214 Query: 264 LVDVFVYMVRFGMISWLPIYLLTVKHFSKEQMSVAFLFFEWA---AIPSTLLAGWLSDKL 320 L+ VF M G + + F + ++ + ++ ++ G ++ +L Sbjct: 215 LMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARL 274 Query: 321 FKGRRMPLAMICMALIFVCLIGYWKSESLLMVTIFAAIVGCLIYVPQFLASVQTMEIVPS 380 + R + L MI ++ L + + + A G + Q + S Q E Sbjct: 275 GERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQG 334 Query: 381 FAVGSAVGLRGFMSYIFGASLGTSLFGVMVDKLGWYGGFYLLMGGIVCCILFCYLSHRGA 440 GS L ++ I G L T+++ + + G+ + G + + L RG Sbjct: 335 QLQGSLAALTS-LTSIVGPLLFTAIYAA---SITTWNGWAWIAGAALYLLCLPAL-RRGL 389 Query: 441 LELERQRQN 449 QR + Sbjct: 390 WSGAGQRAD 398
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 29.8 bits (67), Expect = 0.014 Identities = 8/55 (14%), Positives = 17/55 (30%) Query: 277 FAIMKLPLADINAQNAMMHAGKSSEADVQGHVDGWINAHQQQFDGWVKEALAAQK 331 F + ++P + S +D + HV + + Q + Q Sbjct: 133 FVLNEMPARYRKNVLGAVLLSPSQSSDFEIHVSEMVTSDNQSARYLTLPEVNKQT 187
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 46.4 bits (110), Expect = 1e-07 Identities = 31/165 (18%), Positives = 66/165 (40%), Gaps = 2/165 (1%) Query: 34 LDTIAHHFSLSASSAGFIVTAAQLGYAAGLLFLVPLGDMFE-RRTLIVSMTLLAAGGMLI 92 L IA+ F+ +S ++ TA L ++ G L D +R L+ + + G ++ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 93 TASSQSLSMMILGTALTGLFSVVAQILVPLA-ATLATPATRGKVVGTIMSGLLLGILLAR 151 S++I+ + G + LV + A RGK G I S + +G + Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 152 TVAGLLANLGGWRTVFWVASALMALMAVALWRGLPKLKSDTHLNY 196 + G++A+ W + + + + + +++ H + Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 74.1 bits (182), Expect = 2e-16 Identities = 61/418 (14%), Positives = 125/418 (29%), Gaps = 97/418 (23%) Query: 19 KRKTALLLLTLLFVIIAVAYGIYWFLVLRHIEETDDA----YVAGNQVQIMAQVSGSVTK 74 + L F++ + VL +E A +G +I + V + Sbjct: 51 TPVSRRPRLVAYFIMGFLVIAFILS-VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109 Query: 75 VWADNTDFVKEGDVLVTLDQT--------------------------------------- 95 + + V++GDVL+ L Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169 Query: 96 -------------DAKQAFEKAKTALASSVRQTHQLMINSKQ-------LQANIDVQKTA 135 + + K ++ Q +Q +N + + A I+ + Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229 Query: 136 LAQAQSDLNRRVPLGNANLIGREELQHARDAVASAQAQLDVAIQQYNANQAMILNSNLED 195 +S L+ L + I + + + A +L V Q ++ IL++ E Sbjct: 230 SRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEY 289 Query: 196 QPAVQQAATEVRN------------------AWLALERTRIVSPMTGYVSRRAVQ-PGAQ 236 Q Q E+ + + + I +P++ V + V G Sbjct: 290 QLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGV 349 Query: 237 ISPTTPLMAVVPATD-LWVDANFKETQLANMRIGQPVTVITDIYGDDVKY---TGKVVGL 292 ++ LM +VP D L V A + + + +GQ + + + +Y GKV + Sbjct: 350 VTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF-PYTRYGYLVGKVKNI 408 Query: 293 DMGTGSAFSLLPAQNATGNWIKVVQRLPVRVELDARQLEQHPLRIGLSTLVTVDTANR 350 + ++ G V+ + + PL G++ + T R Sbjct: 409 -----NLDAIE--DQRLGLVFNVIISIEENCLSTGNK--NIPLSSGMAVTAEIKTGMR 457
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 129 bits (326), Expect = 7e-35 Identities = 94/405 (23%), Positives = 164/405 (40%), Gaps = 23/405 (5%) Query: 17 IALSLATFMQVLDSTIANVAIPTIAGNLGSSLSQGTWVITSFGVANAISIPLTGWLAKRF 76 I L + +F VL+ + NV++P IA + + WV T+F + +I + G L+ + Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76 Query: 77 GEVKLFMWSTVAFAAASWACGVS-SSLNMLIFFRVVQGVVAGPLIPLSQSLLLNNYPPAK 135 G +L ++ + S V S ++LI R +QG A L ++ P Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136 Query: 136 RSIALALWSMTVIVAPICGPILGGYISDNYHWGWIFFINVPIGIAVVLMTLHTLRGRETH 195 R A L V + GP +GG I+ HW ++ I + I I V + L+ Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVRI 195 Query: 196 TERRRIDAVGLALLVIGIGSLQIMLDRGKELDWFSSQEIIILTVVAVIAISFLIVWELTD 255 D G+ L+ +GI + ML F++ I +V+V++ + Sbjct: 196 KG--HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKV 243 Query: 256 DHPIVDLSLFKSRNFTIGCLCISLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGI 315 P VD L K+ F IG LC + + G + ++P ++++V+ + G G Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303 Query: 316 IPVILS-PIIGRFAHKLDMRRLVTFSFIMYAVCFYWRAWTFEPGMDFGASAWPQFIQGF- 373 + VI+ I G + ++ +V F ++ S + I F Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFMTIIIVFV 358 Query: 374 --AVACFFMPLTTITLSGLPPERLAAASSLSNFTRTLAGSIGTSI 416 ++ ++TI S L + A SL NFT L+ G +I Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature. Length = 171 Score = 287 bits (736), Expect = e-103 Identities = 130/170 (76%), Positives = 145/170 (85%) Query: 2 PLLDSFAVDHTRMQAPAVRVAKTMNTPHGDAITVFDLRFCIPNKEVMPEKGIHTLEHLFA 61 PLLDSF VDHTRM APAVRVAKTM TP GD ITVFDLRF PNK+++ EKGIHTLEHL+A Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60 Query: 62 GFMRDHLNGNGVEIIDISPMGCRTGFYMSLIGTPDEQRVADAWKAAMADVLKVQDQNQIP 121 GFMR+HLNG+ VEIIDISPMGCRTGFYMSLIGTP EQ+VADAW AAM DVLKV++QN+IP Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120 Query: 122 ELNVYQCGTYQMHSLSEAQDIARHILERDVRVNSNKELALPKEKLQELHI 171 ELN YQCGT MHSL EA+ IA++ILE V VN N ELALP+ L+EL I Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLRELRI 170
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 84.3 bits (208), Expect = 1e-21 Identities = 67/257 (26%), Positives = 120/257 (46%), Gaps = 7/257 (2%) Query: 3 QVAVVIGGGQTLGAFLCRGLAEEGYRVAVVDIQSDKAANVAQEINADFGEGMAYGFGADA 62 ++A + G Q +G + R LA +G +A VD +K V + A+ A F AD Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE--ARHAEAFPADV 66 Query: 63 TSEQSVLALSRGVDEIFGRVDLLVYSAGIAKAAFISDFQLGDFDRSLQVNLVGYFLCARE 122 ++ ++ ++ G +D+LV AG+ + I +++ + VN G F +R Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 123 FSRLMIRDGIQGRIIQINSKSGKVGSKHNSGYSAAKFGGVGLTQSLALDLAEYGITVHSL 182 S+ M D G I+ + S V + Y+++K V T+ L L+LAEY I + + Sbjct: 127 VSKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 183 MLGNLLKSPMFQSL-LPQYATKLGIKPDEVEQYYIDKVPLKRGCDYQDVLNMLLFYASPK 241 G+ ++ M SL + + IK +E + +PLK+ D+ + +LF S + Sbjct: 186 SPGS-TETDMQWSLWADENGAEQVIKGS-LETFKTG-IPLKKLAKPSDIADAVLFLVSGQ 242 Query: 242 ASYCTGQSINVTGGQVM 258 A + T ++ V GG + Sbjct: 243 AGHITMHNLCVDGGATL 259
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 38.1 bits (88), Expect = 3e-05 Identities = 26/92 (28%), Positives = 39/92 (42%), Gaps = 2/92 (2%) Query: 156 AQGCEGKNVIIVGAGT-IGLLALQCARELGARSVTAIDINPQKLELAKALGATHTCNSRE 214 A+G EGK I GA IG + GA + A+D NP+KLE + ++ Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAH-IAAVDYNPEKLEKVVSSLKAEARHAEA 61 Query: 215 MTADDIQTALSDIQFDQLVLETAGTPQTVSLA 246 AD +A D ++ E V++A Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVA 93
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 33.5 bits (76), Expect = 0.003 Identities = 23/120 (19%), Positives = 38/120 (31%), Gaps = 8/120 (6%) Query: 289 QAVEMQPAAAPDAPVEPGVEETQPQMTNGVASPSQASVSDLTDDAPAQSATPVSAPQTPP 348 Q+ +QP A P +P V +PQ + A + + PV+ T Sbjct: 1135 QSETVQPQAEPARENDPTVNIKEPQSQTN----TTADTEQPAKETSSNVEQPVTESTTVN 1190 Query: 349 ATASAPADPSAELKIYDTSSQPLD-QVLAQVQQDGASIVVGPLLKNNVEALMKSNTPLNV 407 S +P ++QP + ++ V + N A SN V Sbjct: 1191 TGNSVVENPENTT---PATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTV 1247
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 28.0 bits (62), Expect = 0.028 Identities = 26/111 (23%), Positives = 44/111 (39%), Gaps = 22/111 (19%) Query: 42 NKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIA-------NDRL 94 K+L GN + A T + IA + V AI+ D+ Sbjct: 277 TKVL--GNVGKGISQYIIAQRAAQGLSTSAAAAGLIA----SAVTLAISPLSFLSIADKF 330 Query: 95 HD----EVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVA 141 E Y+++ + LG+ GD LLA + A++A++T T++A Sbjct: 331 KRANKIEEYSQRFKKLGYDGDSLLAAFHKETG-----AIDASLTTISTVLA 376
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.8 bits (67), Expect = 0.007 Identities = 14/55 (25%), Positives = 22/55 (40%), Gaps = 16/55 (29%) Query: 15 VLITGATGLVGGHLLRMLINTPQVSAIAAPTRRPLTDIVGV--YNP-HDPQLTDA 66 L+TGA G +G H+ + L+ +VG+ N +D L A Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGH-------------QVVGIDNLNDYYDVSLKQA 44
>DNABINDNGFIS#DNA-binding protein FIS signature. Length = 98 Score = 157 bits (399), Expect = 3e-54 Identities = 98/98 (100%), Positives = 98/98 (100%) Query: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 Query: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 128 bits (324), Expect = 2e-39 Identities = 82/216 (37%), Positives = 130/216 (60%), Gaps = 3/216 (1%) Query: 1 MAKKTKADALKTRQHLIETAIAQFALRGVANTTLNDIADAADVTRGAIYWHFENKTQLFN 60 MA+KTK +A +TRQH+++ A+ F+ +GV++T+L +IA AA VTRGAIYWHF++K+ LF+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EVW-LQQPPLRELIQDRLTGCWNDNPLQDLREKFIAALQYIAAVPRQQALMQILYHKCEF 119 E+W L + + EL + +PL LRE I L+ R++ LM+I++HKCEF Sbjct: 61 EIWELSESNIGELELEYQAKF-PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119 Query: 120 HNGM-ISEQAIREKMGFHHQSLLEVLQRCMDKKLISGSLDLDVILIILHGSFSGIVKNWL 178 M + +QA R + + + L+ C++ K++ L II+ G SG+++NWL Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179 Query: 179 MNPTSYDLYKQAPALVDNVLKMLSPDGSVRQLMPNE 214 P S+DL K+A V +L+M ++R NE Sbjct: 180 FAPQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 42.5 bits (100), Expect = 2e-06 Identities = 24/137 (17%), Positives = 48/137 (35%), Gaps = 15/137 (10%) Query: 98 ATYQADYDSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADA-RQADAAV 156 + K +L + E+ A + Q + I D RQ + Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLV----------TQLFKNEILDKLRQTTDNI 311 Query: 157 VAAKAAVESARINLAYTKVTSPISGRIGKSNV-TEGALVTNGQSTELATVQQLDPIYVDV 215 + + + +P+S ++ + V TEG +VT + T + V + D + V Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTA 370 Query: 216 TQSSND--FMRLKQSVE 230 + D F+ + Q+ Sbjct: 371 LVQNKDIGFINVGQNAI 387 Score = 37.1 bits (86), Expect = 1e-04 Identities = 22/127 (17%), Positives = 41/127 (32%), Gaps = 13/127 (10%) Query: 46 TAPLAVTTELPGR-TSAFRIAEVRPQVSGIVLKRNFTEGSDVEAGQSLYQIDPATYQADY 104 + + G+ T + R E++P + IV + EG V G L ++ +AD Sbjct: 77 LGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD- 135 Query: 105 DSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADARQADAAVVAAKAAVE 164 K++++ A L RY L E ++ + Sbjct: 136 ------TLKTQSSLLQARLEQTRYQIL-----SRSIELNKLPELKLPDEPYFQNVSEEEV 184 Query: 165 SARINLA 171 +L Sbjct: 185 LRLTSLI 191
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1389 bits (3596), Expect = 0.0 Identities = 916/1032 (88%), Positives = 973/1032 (94%) Query: 1 MANFFIRRPIFAWVLAIILMMAGALAIMQLPVAQYPTIAPPAVSISATYPGADAQTVQDT 60 MANFFIRRPIFAWVLAIILMMAGALAI+QLPVAQYPTIAPPAVS+SA YPGADAQTVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGISVEKSSSSFLMVAGFVSDNPNTTQDDISDYVASNIKDSISRLNGVGDVQLFGA 180 EVQQQGISVEKSSSS+LMVAGFVSDNP TTQDDISDYVASN+KD++SRLNGVGDVQLFGA Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWLDANLLNKYQLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRL 240 QYAMRIWLDA+LLNKY+LTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 KDPEEFGKVTLRVNTDGSVVHLKDVARIELGGENYNVVARINGKPASGLGIKLATGANAL 300 K+PEEFGKVTLRVN+DGSVV LKDVAR+ELGGENYNV+ARINGKPA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTATAIKVKLAELQPFFPQGMKVVYPYDTTPFVKISIHEVVKTLFEAIILVFLVMYLFLQ 360 DTA AIK KLAELQPFFPQGMKV+YPYDTTPFV++SIHEVVKTLFEAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NIRATLIPTIAVPVVLLGTFAVLAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N+RATLIPTIAVPVVLLGTFA+LAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 MEDNLSPREATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 MED L P+EATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATLLKPVSAEHHEKKSGFFGWFNTRFDHSVNHYTNSVSGIVRNTGRY 540 SVLVALILTPALCATLLKPVSAEHHE K GFFGWFNT FDHSVNHYTNSV I+ +TGRY Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 LIIYLLIVVGMAVLFLRLPTSFLPEEDQGVFLTMIQLPSGATQERTQKVLDQVTHYYLNN 600 L+IY LIV GM VLFLRLP+SFLPEEDQGVFLTMIQLP+GATQERTQKVLDQVT YYL N Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKANVESVFTVNGFSFSGQGQNSGMAFVSLKPWEERNGEENSVEAVIARATRAFSQIRDG 660 EKANVESVFTVNGFSFSGQ QN+GMAFVSLKPWEERNG+ENS EAVI RA +IRDG Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 LVFPFNMPAIVELGTATGFDFELIDQGGLGHDALTKARNQLLGMVAKHPDLLVRVRPNGL 720 V PFNMPAIVELGTATGFDFELIDQ GLGHDALT+ARNQLLGM A+HP LV VRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 721 EDTPQFKLDVDQEKAQALGVSLSDINETISAALGGYYVNDFIDRGRVKKVYVQADAQFRM 780 EDT QFKL+VDQEKAQALGVSLSDIN+TIS ALGG YVNDFIDRGRVKK+YVQADA+FRM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 781 LPGDINNLYVRSANGEMVPFSTFSSARWIYGSPRLERYNGMPSMELLGEAAPGRSTGEAM 840 LP D++ LYVRSANGEMVPFS F+++ W+YGSPRLERYNG+PSME+ GEAAPG S+G+AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 841 SLMENLASQLPNGIGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFSV 900 +LMENLAS+LP GIGYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP SV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 901 MLVVPLGVVGALLAASLRGLNNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMEKEGRGLI 960 MLVVPLG+VG LLAA+L NDVYF VGLLTTIGLSAKNAILIVEFAKDLMEKEG+G++ Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 961 EATLEASRMRLRPILMTSLAFILGVMPLVISRGAGSGAQNAVGTGVMGGMLTATLLAIFF 1020 EATL A RMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGM++ATLLAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1021 VPVFFVVVKRRF 1032 VPVFFVV++R F Sbjct: 1021 VPVFFVVIRRCF 1032
>adhesinb#Adhesin B signature. Length = 310 Score = 29.0 bits (65), Expect = 0.001 Identities = 14/68 (20%), Positives = 26/68 (38%), Gaps = 10/68 (14%) Query: 1 MKR---LIPVALLTTLLAGCAHDSPCVPVYDDQGRLVHTNTCMKGTTQDNWETAGAIAGG 57 MK+ L+ + L LA C+ + +V TN+ + T++ IAG Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKN-------IAGD 53 Query: 58 AAAVAGLT 65 + + Sbjct: 54 KINLHSIV 61
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 141 bits (358), Expect = 2e-44 Identities = 59/143 (41%), Positives = 86/143 (60%), Gaps = 2/143 (1%) Query: 4 ALPFLIFYASFSLLLGIYDARTGLLPDRFTCPLLWGGLLYHQICLPERLPDALWGAIAGY 63 L L+ + L D LLPD+ T PLLWGGLL++ + L DA+ GA+AGY Sbjct: 134 TLAALLLT-WVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGY 192 Query: 64 GGFALIYWGYRLRYQKEGLGYGDVKYLAALGAWHCWETLPLLVFLAAMLACGGFGVALLV 123 +YW ++L KEG+GYGD K LAALGAW W+ LP+++ L++++ G+ L++ Sbjct: 193 LVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLV-GAFMGIGLIL 251 Query: 124 RGKSALINPLPFGPWLAVAGFIT 146 P+PFGP+LA+AG+I Sbjct: 252 LRNHHQSKPIPFGPYLAIAGWIA 274
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 36.8 bits (85), Expect = 1e-05 Identities = 18/103 (17%), Positives = 43/103 (41%), Gaps = 10/103 (9%) Query: 44 EYHESIDEMKHADKYIERILFLEGIPN--LQDLGKL------GIGEDVEEMLRSDLRLEL 95 E ++ E D ER+L + G P +++ + G EM+++ + Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109 Query: 96 EGAKDLREAIAYADSVHDYVSRDMMIEILADEEGHIDWLETEL 138 + + + + I A+ D + D+ + ++ + E + L + L Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 79.5 bits (196), Expect = 3e-18 Identities = 57/198 (28%), Positives = 87/198 (43%), Gaps = 13/198 (6%) Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGAARAFDQIDNAPEEKARGITINTS 66 +N+G + HVD GKTTLT ++ T L G R DN E+ RGITI T Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59 Query: 67 HVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126 + +D PGH D++ + + +DGAIL+++A DG QTR R++ Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119 Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQYDFPGDDTPIVRGSALKALEGDAEWE 186 G+P I F+NK D + L V +++E LS + + +W+ Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176 Query: 187 AKIIELAGFLDSYIPEPE 204 I L+ Y+ Sbjct: 177 TVIEGNDDLLEKYMSGKS 194
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 616 bits (1591), Expect = 0.0 Identities = 178/698 (25%), Positives = 305/698 (43%), Gaps = 81/698 (11%) Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68 + NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 69 TAFWSGMAKQYEPHRINIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128 + W ++NIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++ Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114 Query: 129 QANKYKVPRIAFVNKMDRMGANFLKVVGQIKTRLGANPVPLQLAIGAEEGFTGVVDLVKM 188 K +P I F+NK+D+ G + V IK +L A V Q V M Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164 Query: 189 KAINWNDADQGVTFEYEDIPADMQDLANEWHQNLIESAAEASEELMEKYLGGEELTEEEI 248 N+ +++Q ++ E +++L+EKY+ G+ L E+ Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200 Query: 249 KQALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308 +Q R N + V GSA N G+ +++ + + S Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243 Query: 309 RHASDDEPFSALAFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKTARERFGRIVQMHA 368 FKI L + R+YSGV++ D+V S K + + + Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSIN 299 Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCDPENPIILERMEFPEPVISIAVEPKT 424 + +I + +G+I L V GDT P+ ER+E P P++ VEP Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354 Query: 425 KADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVEANVG 484 +E + AL ++ DP R + D +++ I++ +G++ +++ ++ +++VE + Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414 Query: 485 KPQVAYREAIRAKVTDIEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGV 544 +P V Y E K E + + + + + PL GS G ++ + + G Sbjct: 415 EPTVIYMERPLKKA---EYTIHIEVPPNPFWASIGLSVSPLPLGS---GMQYESSVSLGY 468 Query: 545 IPGEYIPAVDKGIQEQLKSGPLAGYPVVDLGVRLHFGSYHDVDSSELAFKLAASIAFKEG 604 + + AV +GI+ + G L G+ V D + +G Y+ S+ F++ A I ++ Sbjct: 469 LNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQV 527 Query: 605 FKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRGMLKGQESEVTGVKIHAEVPLSEMF 664 KKA LLEP + ++ P+E D + + + + V + E+P + Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQ 587 Query: 665 GYATQLRSLTKGRASYTMEFLKYDDAPNNVAQAVIEAR 702 Y + L T GR+ E Y + V + R Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQPR 622
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 28.7 bits (64), Expect = 0.022 Identities = 14/62 (22%), Positives = 29/62 (46%), Gaps = 1/62 (1%) Query: 160 ASSVEDLVTQTLEFTIEEVNADRNV-SNNAKNRQIVLNLYEKGIFDIKDAINQVADRLNI 218 A +V+D VTQ +E + ++ + S + + + L + D A QV ++L + Sbjct: 54 AQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQL 113 Query: 219 SK 220 + Sbjct: 114 AT 115
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 128 bits (323), Expect = 2e-38 Identities = 80/226 (35%), Positives = 122/226 (53%), Gaps = 9/226 (3%) Query: 28 AAKPAATADSKAAFKNDDQKAAYALGASLGRYMENSLKEQEKLGIKLDKDQLIAGVQDAF 87 A A A + D K +Y++GA LG K + GI ++ D L G+QD Sbjct: 14 AMSTAMAATDATSLTTDKDKLSYSIGADLG-------KNFKNQGIDINPDVLAKGMQDGM 66 Query: 88 A-DKSKLSDQEIEQTLQTFEARVKSAAQAKMEKDAADNEAKGKTFRDAFAKEKGVKTSST 146 + + L++++++ L F+ + + A+ K A +N+AKG F A + G+ + Sbjct: 67 SGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPS 126 Query: 147 GLLYKVEKEGTGEAPKDSDTVVVNYKGTLIDGKEFDNSYTRGEPLSFRLDGVIPGWTEGL 206 GL YK+ GTG P SDTV V Y GTLIDG FD++ G+P +F++ VIPGWTE L Sbjct: 127 GLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEAL 186 Query: 207 KNIKKGGKIKLVIPPELAYGKTGVPG-IPANSTLVFDVELLDIKPA 251 + + G ++ +P +LAYG V G I N TL+F + L+ +K A Sbjct: 187 QLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVKKA 232
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 27.7 bits (61), Expect = 0.025 Identities = 35/138 (25%), Positives = 52/138 (37%), Gaps = 22/138 (15%) Query: 11 YAHPESQDSVANRVLLKPAIQHNNVTVHDLYARYPDFFID--TPYEQ-----ALLREHDV 63 Y P + D N+V P + +HD+ + D F +P + L+ V Sbjct: 9 YQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCV 68 Query: 64 IVFQH--PLYTYSCPALLKEWLDRVLSRGFASGPGGNQLVGKYWRSVITTGEPESA---- 117 Q P+ + P DR L F GPG N G Y +IT PE Sbjct: 69 ---QLGIPVVYTAQPGSQNP-DDRALLTDFW-GPGLNS--GPYEEKIITELAPEDDDLVL 121 Query: 118 --YRYDALNRYPMSDVLR 133 +RY A R + +++R Sbjct: 122 TKWRYSAFKRTNLLEMMR 139
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 30.9 bits (69), Expect = 0.020 Identities = 21/85 (24%), Positives = 33/85 (38%), Gaps = 7/85 (8%) Query: 522 VQKQENQADNAPKENNANSAQSRKDQKRREAELRTLT---QPLRKEITRLEKEMEKLNAQ 578 + E + A +E N N ++ RE E T + + I+ L+ M L A Sbjct: 151 TRTAEEIGEQAVREGNINGPEAYMRFLDREMEGLTAAYNVKLFTEAISSLQIRMNTLTAA 210 Query: 579 LA----QAEEKLGDSSLYDPSRKAE 599 A A K + + + RKAE Sbjct: 211 KASIEAAAANKAREQAAAEAKRKAE 235
>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature. Length = 293 Score = 32.4 bits (73), Expect = 0.004 Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 17/82 (20%) Query: 275 RTPISGDYRGYQVFSMPPPSSGGIHIVQILNI--LENFDMKKYGF-GSADAMQIMAEAEK 331 R P+ G+ R + SMPPP G H +I N+ + FD G+ G A I++E EK Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNLKFFKQFD----GYVGGQTAWGILSELEK 131 Query: 332 YAYADRSEYLGDPDFVKVPWQA 353 Y P F WQ+ Sbjct: 132 GRY---------PTFSYQDWQS 144
>PF04619#Dr-family adhesin Length = 160 Score = 27.6 bits (61), Expect = 0.032 Identities = 11/60 (18%), Positives = 21/60 (35%), Gaps = 4/60 (6%) Query: 29 VGARYGHTMIEFDAKLSKDGEIFLLHDDNLERTSNGWGVAGELNWQD----LLRVDAGGW 84 +G ++ D + G+ FL+ D+N ++ W + D G W Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.040 Identities = 10/29 (34%), Positives = 16/29 (55%) Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTSGDI 61 +V+ G G GKSTL+ + GL+ + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHF 627
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 43.2 bits (101), Expect = 1e-06 Identities = 46/176 (26%), Positives = 73/176 (41%), Gaps = 17/176 (9%) Query: 133 SGHLLSQPFNSSTPVLYYNKDAFKKAGLDPEQPPKTWQELADYTAKLRAAGMKCGYASGW 192 +G L++ P L YNKD L P PPKTW+E+ +L+A G + Sbjct: 126 NGKLIAYPIAVEALSLIYNKD------LLP-NPPKTWEEIPALDKELKAKGKSALMFNLQ 178 Query: 193 QGWIQLENFSAWNGLPFASKNNGFDGTDAVLEF--NKPEQVKHIALLEEMNKKGDFSYVG 250 + + +A G F +N +D D ++ K + L++ + D Y Sbjct: 179 EPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY-- 236 Query: 251 RKDESTEKFYNGDCAMTTASSGSLANIRQYAKFNYGVGMMPYDADIKGAPQNAIIG 306 + F G+ AMT + +NI +K NYGV ++P KG P +G Sbjct: 237 --SIAEAAFNKGETAMTINGPWAWSNIDT-SKVNYGVTVLP---TFKGQPSKPFVG 286
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 77.9 bits (192), Expect = 6e-18 Identities = 70/413 (16%), Positives = 136/413 (32%), Gaps = 82/413 (19%) Query: 3 KMKRHLVWWGAGILVAVAAIAWWMLRPAGIPEGFAASNGRI--EATEVDIATKIAGRIDT 60 + LV + + V A +L E A +NG++ +I + Sbjct: 54 SRRPRLVAY-FIMGFLVIAFILSVLGQV---EIVATANGKLTHSGRSKEIKPIENSIVKE 109 Query: 61 ILVSEGQFVRQGEVLAKMDTRV----------------LQEQRLEAI------------- 91 I+V EG+ VR+G+VL K+ L++ R + + Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169 Query: 92 -----------------------AQIKEAESAVAAARALLEQRQSEMRAAQSVVKQREAE 128 Q ++ L+++++E + + + E Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229 Query: 129 LDSVSKRHVRSRSLSQRGAVSVQQLDDDRAAAESARAALETAKAQVSAAKAAIEAARTSI 188 R SL + A++ + + A L K+Q+ ++ I +A+ Sbjct: 230 SRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEY 289 Query: 189 IQ-------------AQTRVEAAQATERRIVADID--DSELKAPRDGRV-QYRVAEPGEV 232 QT T + S ++AP +V Q +V G V Sbjct: 290 QLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGV 349 Query: 233 LSAGGRVLNMVDLSDVY-MTFFLPTEQAGLLKIGGEARLVLDAAPDLRIPATISFVASVA 291 ++ ++ +V D +T + + G + +G A + ++A P R V V Sbjct: 350 VTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYG---YLVGKVK 406 Query: 292 QFTPKTVETHDERLKLMFRVKARIPPELLRQHLEYV--KTGLPGMAWVRLDER 342 +E D+RL L+F V I L + + +G+ A ++ R Sbjct: 407 NINLDAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMR 457
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 30.1 bits (68), Expect = 0.022 Identities = 23/194 (11%), Positives = 57/194 (29%), Gaps = 40/194 (20%) Query: 12 TGLLLLLALAFVLFYEAINGFHDTANAVATVIY------TRAMRSQLAVVMAAVFNFFGV 65 L++AL+ +L + F + + ++A+ + V+ F Sbjct: 30 VSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFP 89 Query: 66 LLGGLSVAYAIVHML-------------------PTDLLLNMGSAHGLAMVFSMLLAAII 106 LL ++ H++ P + + S L +L ++ Sbjct: 90 LLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVL 149 Query: 107 WNLGTWYFGLPASSSHTLIGAIIGIGLTNAMMTGTSVVDALNIPKVINIFGSLIISPIVG 166 ++ W + ++ + T + T ++ + L++ VG Sbjct: 150 LSILIWIIIKG------NLVTLLQLP-TCGIECITPLLGQI--------LRQLMVICTVG 194 Query: 167 LVFAGGLIFLLRRY 180 V + Y Sbjct: 195 FVVISIADYAFEYY 208
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 35.9 bits (82), Expect = 4e-04 Identities = 24/89 (26%), Positives = 41/89 (46%), Gaps = 12/89 (13%) Query: 21 RQASIEILLLLGIHTTEGKEPRWFMEQLEQARLNLGGWGAVAKKLRINDAQLSQFMLQLR 80 R + +++ LG+H+ G++P+ F E + L +G GA K S L + Sbjct: 280 RASGEPVVIDLGLHSRSGEQPKGFTESFKAPELGVGNLGASEK---------SDVFLVVS 330 Query: 81 HLQQHVPQYDSGQEVSENQLLAALRFVTS 109 L + ++ E+ NQ LRF+TS Sbjct: 331 TLLHCIEGFEKNPEIKPNQ---GLRFITS 356
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.003 Identities = 22/138 (15%), Positives = 47/138 (34%), Gaps = 26/138 (18%) Query: 26 GISIQSVGQAEELWQKIESAPDALVMLDSGLDAEFCREVLQRTAQQFPEVK-IIITAMDG 84 G ++ A LW+ I + LV+ D + E ++L R + P++ ++++A Sbjct: 27 GYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLPVLVMSA--- 83 Query: 85 SQKWLHEVMQFNVQAVVPRDSDAETFVLALNAVARGMMFLPGDWLNSTELESRDIKALSA 144 + T + A A + P D + R AL+ Sbjct: 84 -------------------QNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---ALAE 121 Query: 145 RQREILQMLAAGESNKQI 162 +R ++ + + Sbjct: 122 PKRRPSKLEDDSQDGMPL 139
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.5 bits (79), Expect = 8e-04 Identities = 83/367 (22%), Positives = 126/367 (34%), Gaps = 59/367 (16%) Query: 79 IGSALFGHFGDRVGRKVTLVASLLTMGISTVIIGLLPGYATIGIFAPLLLALARFGQGLG 138 IG+A++G D++G K LL GI G + G+ F+ LL +ARF QG G Sbjct: 64 IGTAVYGKLSDQLGIK-----RLLLFGIIINCFGSVIGFVGHSFFS--LLIMARFIQGAG 116 Query: 139 LGGEWGGAALLATENAPPRKR----ALYGSFPQLGAPIGFFFANGTFLLLSW-------- 186 ++ P R L GS +G +G + W Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM 176 Query: 187 -----LLTDEQFMSWGWRV--PF-IFSAVLVIIG-------------LYVRVSLHETPVF 225 + + + R+ F I +L+ +G ++ VS+ +F Sbjct: 177 ITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIF 236 Query: 226 AKVAAAKKQVKIPLGTLLTKHVRVTVLGTFIMLATYTLFYIMTVYSMTYSTAAAPVGLGL 285 K + G + VL I+ T F M Y M + +G Sbjct: 237 VKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIG- 295 Query: 286 PRNEILWMLMMAVIGFGVMVPIAGLLADAFGRRKSMVIITTLIIL-FALFAFTPLLGSGN 344 + I++ M+VI FG I G+L D G + I T + + F +F S Sbjct: 296 --SVIIFPGTMSVIIFG---YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWF 350 Query: 345 PALVFVFLLLGLSLMGL---TFGPMGALLPELFPTEVRYTGASFS-YNVSSILGASVAPY 400 ++ VF+L GLS T L E GA S N +S L Sbjct: 351 MTIIIVFVLGGLSFTKTVISTIVSSS-----LKQQEA---GAGMSLLNFTSFLSEGTGIA 402 Query: 401 IAAWLQS 407 I L S Sbjct: 403 IVGGLLS 409
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 29.4 bits (66), Expect = 0.016 Identities = 11/62 (17%), Positives = 23/62 (37%), Gaps = 4/62 (6%) Query: 91 DELARRGYHILLCVAGYTEQTEAELVATLLSRRPDGVVLTGIHH----TIELKKVILNAA 146 E+ ++ +LC+ G + L + + L+G H ++ K+I Sbjct: 181 PEVNKQTTVPMLCLYGKEDDAPLHLCPEVKQPNVTVMELSGGHSFDDDYDKVVKLIKGWL 240 Query: 147 IP 148 P Sbjct: 241 KP 242
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 40.2 bits (94), Expect = 1e-05 Identities = 68/410 (16%), Positives = 129/410 (31%), Gaps = 65/410 (15%) Query: 35 VAPIMSKELGFDPEA---MGLAFSSFGIAYVIMQLPGGWLLDRYGSRLVYGCALIGWSLV 91 V P + ++L + G+ + + + G L DR+G R V +L G ++ Sbjct: 27 VLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAV- 85 Query: 92 TMFQGTIYLYGSPLIVLVILRLLMGAIEAPAFPANSRLS--------VQWFPNNERGFVT 143 I L VL I R++ G A A + ++ + F GF++ Sbjct: 86 ---DYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHF-----GFMS 137 Query: 144 SVYQAAQYISLGIITPLMTIILHNLSWHFVFYYIGAIGV---MLGIFWLMKVKDPMHHPK 200 + + + P++ ++ S H F+ A+ + G F L + P Sbjct: 138 ACFGFGM-----VAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPL 192 Query: 201 VNQAEIDYIRSGGGEPSLGCKKEPQKITFAQIKTVCVNRMMIGVYIGQFCVTSITWFFLT 260 + A + ++ + F + + Sbjct: 193 ---------------------RREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAA 231 Query: 261 WFPTYLYQAKGMSILKVGFVASIPAIAGFIGGLLGGVFSDWLLKRGYSLTVARKLPVICG 320 + + +G A G + L + + + R ++ G Sbjct: 232 LWVIFGEDRFHWDATTIGISL---AAFGILHSLAQAMITGPVAARLGERRA-----LMLG 283 Query: 321 MLLSCV--IVIANYTSSEFVVIAAMSLAFFAKGFGNLGWCVLSDTSPKEVLGIAGGVFNM 378 M+ I++A T + LA G L +LS +E G G Sbjct: 284 MIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQ-AMLSRQVDEERQGQLQGSLAA 342 Query: 379 CGNMASIVTPLVIGVILANTQSFDFAILYVGSMGLIGLISYLFIVGPLDR 428 ++ SIV PL+ I A + + + G + G YL + L R Sbjct: 343 LTSLTSIVGPLLFTAIYAASITT-----WNGWAWIAGAALYLLCLPALRR 387
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.0 bits (65), Expect = 0.028 Identities = 21/87 (24%), Positives = 30/87 (34%), Gaps = 13/87 (14%) Query: 8 MTVI---GAGSYGTALAITLARNGHQVVLWGHD---PKHIATLEHDRCNVAFLPDVPFPD 61 M + AG G ++ L GHQVV G D + +L+ R + P F Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVV--GIDNLNDYYDVSLKQARLELLAQPGFQF-- 56 Query: 62 TLHLESDLATALAASRNILVVVPSHVF 88 + DLA + VF Sbjct: 57 ---HKIDLADREGMTDLFASGHFERVF 80
>PF06872#EspG protein Length = 398 Score = 28.5 bits (63), Expect = 0.021 Identities = 14/54 (25%), Positives = 27/54 (50%) Query: 111 LLLEAGMEVNDDFKEPTDHLAIYLELLSHLHFSLGESFQQRRMNKLRQKTLSSL 164 L+L+A +++N D+K+P + + +LL L L + + Q L+ L Sbjct: 29 LVLDATIKINSDYKKPWNEMTCAEKLLKILTLGLWNPKYSQDERQQFQGLLTVL 82
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 76.8 bits (189), Expect = 2e-18 Identities = 28/115 (24%), Positives = 56/115 (48%), Gaps = 1/115 (0%) Query: 4 HIVIVEDEPVTQARLQAYFEQEGYRVSVTDSGAGLRDIMEHEHVSLILLDINLPDENGLM 63 I++ +D+ + L + GY V +T + A L + L++ D+ +PDEN Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 LTRALRER-STVGIILVTGRCDQIDRIVGLEMGADDYVTKPLELRELVVRVKNLL 117 L +++ + +++++ + + I E GA DY+ KP +L EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 55.2 bits (133), Expect = 8e-10 Identities = 28/162 (17%), Positives = 64/162 (39%), Gaps = 5/162 (3%) Query: 681 RLLLIEDNMLTQRITAEMLTGKGVKVSLAESANDALRCLAEGESFDVALVDFDLPDYDGL 740 +L+ +D+ + + + L+ G V + +A R +A G+ D+ + D +PD + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDENAF 63 Query: 741 TLAQQLMSLYPAMKRIGFSAH-VIDDNLRQRTAGLFCGIIQKPVPREELYRMIAHYLQGK 799 L ++ P + + SA ++ G + + KP EL +I L Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAY-DYLPKPFDLTELIGIIGRALAEP 122 Query: 800 SHNARAMLNEHQLAGDMASVGP--EKLRQWVALFKDSALPLV 839 + ++ Q + +++ + +A + L L+ Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM 164
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 47.1 bits (112), Expect = 8e-08 Identities = 65/384 (16%), Positives = 118/384 (30%), Gaps = 36/384 (9%) Query: 66 AEMGYVFSAFAWLYTLCQIPGGWFLDRIGSRLTYFIAIFGWSVATLLQGFATGLLSLIGL 125 A G + + +A + C G DR G R +++ G +V + A L L Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102 Query: 126 RAITGIFEAPAFPANNRMVTSWFPEHERASAVGFYTSGQFVGLAFLTPLLIWIQEMLSWH 185 R + GI A + ERA GF ++ G+ P+L + S H Sbjct: 103 RIVAGITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMV-AGPVLGGLMGGFSPH 160 Query: 186 WVFIVTGGIGIIWSLVWFKVYQPPRLTKSLSQAELEYIRDGGGLVDGDAPAKKEARQPLT 245 F + + L + + P ++EA PL Sbjct: 161 APFFAAAALNGLNFLTGCFLLPESHKGE-------------------RRPLRREALNPLA 201 Query: 246 KADWKLVFHRKLVGVYLGQFAVNSTLWFFLTWFPNYLTQEKGITALKAGFMTTV-PFLAA 304 W + + F + + + A G L + Sbjct: 202 SFRWARGM-TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHS 260 Query: 305 FFGVLLSGWLADKLVKKGFSLGVARKTPIICGLLISTC--IMGANYTNDPLWIMALMAIA 362 +++G +A +L + ++ G++ I+ A T + ++ +A Sbjct: 261 LAQAMITGPVAARL---------GERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLA 311 Query: 363 FFGNGFASITWSLISSLAPMRLIGLTGGMFNFIGGLGGISVPLVIGYL-AQSYGFAPALV 421 G G ++ +++S G G + L I PL+ + A S Sbjct: 312 SGGIGMPALQ-AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWA 370 Query: 422 YISVVALLGALSYILLVGDVKRVG 445 +I+ AL L G G Sbjct: 371 WIAGAALYLLCLPALRRGLWSGAG 394
>SECA#SecA protein signature. Length = 901 Score = 28.3 bits (63), Expect = 0.018 Identities = 14/74 (18%), Positives = 27/74 (36%) Query: 11 KAFGKQRRKTREELNQEARDRKRLKKHRGHAPGSRAAGGNSASGGGNQNQQKDPRIGSKT 70 K + + EE+ + + R+ + +SA+ Q + ++G Sbjct: 824 STLSKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRND 883 Query: 71 PVPLGVTEKVTQQH 84 P P G +K Q H Sbjct: 884 PCPCGSGKKYKQCH 897
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 597 bits (1542), Expect = 0.0 Identities = 204/478 (42%), Positives = 299/478 (62%), Gaps = 11/478 (2%) Query: 1 MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGNEVLAALASKTPDVLLSDIRMPGM 60 M + V DDD++IR VL +AL+ AG N + +A+ D++++D+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAIS 120 + LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 HYQEQQQPRNIEVNGPTTDMIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180 + + + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A Sbjct: 121 EPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179 Query: 181 LHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE 240 LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239 Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLERRVQEGKFREDLFHR 300 IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L++ + +G FREDL++R Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299 Query: 301 LNVIRIHLPPLRERREDIPRLARHFLQVAARELGVEAKLLHPETETALTRLAWPGNVRQL 360 LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K E + WPGNVR+L Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358 Query: 361 ENTCRWLTVMAAGQEVLIQDLPGELFEASTPDSPSHLPPDSWATLLAQWADRALRS---- 416 EN R LT + + + + EL S + ++Q + +R Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418 Query: 417 -----GHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469 L E+E L+ AL T+G++ +AA LLG RNTL +K++ELG+ Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>PF06580#Sensor histidine kinase Length = 349 Score = 28.7 bits (64), Expect = 0.034 Identities = 33/189 (17%), Positives = 71/189 (37%), Gaps = 39/189 (20%) Query: 171 IIEQADRLRNLVDRL-------LGPQHPGMHIT--ESIHKVAERVVALVSMELPDNVRLI 221 I+E + R ++ L L ++ + + V + + L S++ D ++ Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYS-NARQVSLADELTVV-DSYLQLASIQFEDRLQFE 243 Query: 222 RDYDPSLPELPHDPEQIEQVLL-NIVRNALQALGPEGGEITLRTRTAFQLTLHGERYRLA 280 +P++ ++ P + Q L+ N +++ + L P+GG+I L+ Sbjct: 244 NQINPAIMDVQV-PPMLVQTLVENGIKHGIAQL-PQGGKILLKGT------KDNGTVT-- 293 Query: 281 ARIDVEDNGPGIPPHLQDTLFYPMVSGREGGTGLGLSIARNLIDQHAGK---IEFTSWPG 337 ++VE+ G + ++ TG GL R + G I+ + G Sbjct: 294 --LEVENTGSLALKNTKE------------STGTGLQNVRERLQMLYGTEAQIKLSEKQG 339 Query: 338 HTEFSVYLP 346 V +P Sbjct: 340 KVNAMVLIP 348
>PF06580#Sensor histidine kinase Length = 349 Score = 29.1 bits (65), Expect = 0.037 Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 28/108 (25%) Query: 354 LENIVRNALRY------SHTKIEVGFSVDKDGITITVDDDGPGVSPEDREQIFRPFYRTD 407 ++ +V N +++ KI + + D +T+ V++ G +E Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE---------- 309 Query: 408 EARDRESGGTGLGLAIVESAMQQHRGWVKAD---DSPLGGLRLTLWLP 452 TG GL V +Q G +A G + + +P Sbjct: 310 --------STGTGLQNVRERLQMLYG-TEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 94.1 bits (234), Expect = 2e-24 Identities = 36/128 (28%), Positives = 64/128 (50%), Gaps = 2/128 (1%) Query: 3 KILLVDDDRELTSLLKELLEMEGFNVLVAHDGEQALELL-DDSIDLLLLDVMMPKKNGID 61 IL+ DDD + ++L + L G++V + + + DL++ DV+MP +N D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 TLKALRQTH-QTPVIMLTARGSELDRVLGLELGADDYLPKPFNDRELVARIRAILRRSHW 120 L +++ PV++++A+ + + + E GA DYLPKPF+ EL+ I L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 SEQQQSSD 128 + D Sbjct: 125 RPSKLEDD 132
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 28.4 bits (63), Expect = 0.031 Identities = 32/140 (22%), Positives = 58/140 (41%), Gaps = 27/140 (19%) Query: 10 SRAAIAATAMASALLLIKIFAWWYTGSVSILAALVD-SLVDIAASLTNLLVVRYSLQPAD 68 ++AA+A + + W S+L AL +L +A + ++V +L P+ Sbjct: 123 TKAALAGAGIGVVAAALGYTQWL-----SLLYALPVIALTGLAFASLGMVVT--ALAPSY 175 Query: 69 DEHTFGHGKAESLAALAQSMFISGSAL--------------FLFLTSIQNLIKPTPMNDP 114 D F ++L + +F+SG+ FL L+ +LI+P + P Sbjct: 176 DYFIF----YQTLV-ITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHP 230 Query: 115 GVGIGVTVIALICTIILVTF 134 V + V AL I++ F Sbjct: 231 VVDVCQHVGALCIYIVIPFF 250
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.1 bits (65), Expect = 0.041 Identities = 12/58 (20%), Positives = 24/58 (41%), Gaps = 1/58 (1%) Query: 16 MAINVVIIAMQLLLAYFYTDIYGLSAADVGVLFVVVRMIDAII-DPAMGVLTDKLNTR 72 I + ++ Y D++ LS A++G + + + II G+L D+ Sbjct: 266 GIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPL 323
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 120 bits (303), Expect = 1e-39 Identities = 49/89 (55%), Positives = 66/89 (74%) Query: 2 NKTQLIDVIADKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTFKVNHRAERTGR 61 NK LI +A+ EL+K + AA+++ +A++ L +G+ VQL+GFG F+V RA R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEIKIAAANVPAFVSGKALKDAVK 90 NPQTG+EIKI A+ VPAF +GKALKDAVK Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>PF06580#Sensor histidine kinase Length = 349 Score = 37.2 bits (86), Expect = 1e-04 Identities = 48/268 (17%), Positives = 102/268 (38%), Gaps = 53/268 (19%) Query: 205 IVLSALAAVLLATLLAFFWHQRYQRSHRELLDAMKRKEKLVAMGHLAAGVA----HEIRN 260 I+ + + + +LL F WH + +++++ + + L A A H + N Sbjct: 120 IIFNVVVVTFMWSLLYFGWH--FFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFN 177 Query: 261 PLSSIKGLAKYFAERTPAGGESHELAQVM---AKEADRLNRVVSELLELVKPAHLTLQTV 317 L++I+ L + A L+++M + ++ +++ L +V ++L L ++ Sbjct: 178 ALNNIRALILEDPTK--AREMLTSLSELMRYSLRYSNARQVSLADELTVVD-SYLQLASI 234 Query: 318 NLNDIITHSLNLVSQDAQSREIQLRFTANETLKRIQADPDRLTQVLLNLYLNAI-HAIGR 376 D + + + ++Q+ P L Q L+ N I H I + Sbjct: 235 QFEDRLQFENQI---NPAIMDVQV--------------PPMLVQTLVE---NGIKHGIAQ 274 Query: 377 Q---GTISVEAKESGTDRVIITVTDSGKGIAPDQLEAIFTPYFTTKADGTGLGLAVVQNI 433 G I ++ + V + V ++G + E TG GL V+ Sbjct: 275 LPQGGKILLKGTKDN-GTVTLEVENTGSLALKNTKE------------STGTGLQNVRER 321 Query: 434 IEQHGG---AIKVKSIEGKGAVFTIWLP 458 ++ G IK+ +GK + +P Sbjct: 322 LQMLYGTEAQIKLSEKQGKVNA-MVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 525 bits (1353), Expect = 0.0 Identities = 181/475 (38%), Positives = 256/475 (53%), Gaps = 37/475 (7%) Query: 1 MIRGKIDILVVDDDVSHCTILQALLRGWGYNVALAYSGHDALAQVREKVFDLVLCDVRMA 60 M I LV DDD + T+L L GY+V + + + DLV+ DV M Sbjct: 1 MTGATI--LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 EMDGIATLKEIKALNPAIPILIMTAFSSVETAVEALKAGALDYLIKPLDFDRLQETLEKA 120 + + L IK P +P+L+M+A ++ TA++A + GA DYL KP D L + +A Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 121 LAHTRETGAELPSASAAQFGMIGSSPAMQHLLNEIAMVAPSDATVLIHGDSGTGKELVAR 180 LA + ++L S ++G S AMQ + +A + +D T++I G+SGTGKELVAR Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178 Query: 181 ALHACSARSDRPLVTLNCAALNESLLESELFGHEKGAFTGADKRREGRFVEADGGTLFLD 240 ALH R + P V +N AA+ L+ESELFGHEKGAFTGA R GRF +A+GGTLFLD Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238 Query: 241 EIGDISPLMQVRLLRAIQEREVQRVGSNQTISVDVRLIAATHRDLAEEVSAGRFRQDLYY 300 EIGD+ Q RLLR +Q+ E VG I DVR++AAT++DL + ++ G FR+DLYY Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298 Query: 301 RLNVVAIEMPSLRQRREDIPLLADHFLRRFAERNRKVVKGFTPQAMDLLIHYDWPGNIRE 360 RLNVV + +P LR R EDIP L HF+++ + VK F +A++L+ + WPGN+RE Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWPGNVRE 357 Query: 361 LENAIERAVVLLTGEYISERELPLAIAATPIKTEYSGEIQP------------------- 401 LEN + R L + I+ + + + + Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417 Query: 402 ---------------LVDVEKEVILAALEKTGGNKTEAARQLGITRKTLLAKLSR 441 L ++E +ILAAL T GN+ +AA LG+ R TL K+ Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 38.0 bits (88), Expect = 4e-06 Identities = 15/54 (27%), Positives = 22/54 (40%), Gaps = 5/54 (9%) Query: 78 IDPDVRGQGIGKMLVEHALTLAP-----GLTTNVNEQNTQAVGFYKKMGFKVTG 126 + D R +G+G L+ A+ A GL + N A FY K F + Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 31.6 bits (71), Expect = 0.007 Identities = 14/58 (24%), Positives = 23/58 (39%) Query: 289 ETSTPDLELARRFADAIHAKYPGKLLAYNCSPSFNWQKNLDDKTIASFQQQLSDMGYK 346 ET+ PD+ L A P L Y + N D +T + + QL+++ Sbjct: 544 ETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQLAELNAT 601
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.8 bits (77), Expect = 1e-04 Identities = 20/86 (23%), Positives = 33/86 (38%), Gaps = 9/86 (10%) Query: 61 LALRNGEVVGMISLHMQFHLHHANWIG--EIQELVVLPPMRGQKIGSQLLAWAEEEARQA 118 L +G I + +NW G I+++ V R + +G+ LL A E A++ Sbjct: 69 LYYLENNCIGRIKIR-------SNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN 121 Query: 119 GAELTELSTNIKRRDAHRFYLREGYK 144 L T A FY + + Sbjct: 122 HFCGLMLETQDINISACHFYAKHHFI 147
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.5 bits (100), Expect = 2e-06 Identities = 53/284 (18%), Positives = 104/284 (36%), Gaps = 40/284 (14%) Query: 85 FFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYATIGIWAPILLLLCKMAQGFSVGGE 144 G L D++GR+ +L +++ ++ + P +W +L + ++ G + G Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIVAGIT-GAT 112 Query: 145 YTGASIFVAEYSPDRKR----GFMGSWLDFGSIAGFVLGAGVVVLISTIVGEENFLEWGW 200 A ++A+ + +R GFM + FG +AG VLG G++ S Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSP------------ 159 Query: 201 RIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDREGLQDGPKVSFKEIATKHWRS 260 PFF A L + L K E+ P SF+ + Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESH------KGERRPLRREALNPLASFRWARGMTVVA 213 Query: 261 LLSCIGLVIATNVTYYMLLTYMPSYLSHNLHYS-EDHGVLIIIAIMIGMLFVQPVMGLLS 319 L + ++ + + + H+ G+ + ++ L + G ++ Sbjct: 214 ALMAVFFIM--QLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVA 271 Query: 320 DRFGRRPFVIMGSIA-LFALAIPAFILINSNVIGLIFAGLLMLA 362 R G R +++G IA + AF + F +++LA Sbjct: 272 ARLGERRALMLGMIADGTGYILLAFA----TRGWMAFPIMVLLA 311 Score = 38.7 bits (90), Expect = 5e-05 Identities = 37/164 (22%), Positives = 73/164 (44%), Gaps = 16/164 (9%) Query: 286 LSHNLHYSEDHGVLI-IIAIMIGMLFVQPVMGLLSDRFGRRPFVIMGSIALFALAIPAFI 344 L H+ + +G+L+ + A+M PV+G LSDRFGRRP ++ ++L A+ I Sbjct: 35 LVHSNDVTAHYGILLALYALM--QFACAPVLGALSDRFGRRPVLL---VSLAGAAVDYAI 89 Query: 345 LINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIR---YSALAAAFNISVLIAG 401 + + + +++ G ++A I V + + + R + ++A F ++AG Sbjct: 90 MATAPFLWVLYIG-RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFG-MVAG 147 Query: 402 LTPTLAAWLVESSQDLMMPAYYLMVIAVIGLITGI-SMKETANR 444 P L + S P + + + +TG + E+ Sbjct: 148 --PVLGGLMGGFS--PHAPFFAAAALNGLNFLTGCFLLPESHKG 187
>PF06580#Sensor histidine kinase Length = 349 Score = 37.5 bits (87), Expect = 5e-05 Identities = 39/182 (21%), Positives = 78/182 (42%), Gaps = 34/182 (18%) Query: 181 ARLDQMMDSVSQLLQLARVGQSFSSGNYQEVKLLEDV-ILPSYDELNTM-LETR-QQTLL 237 + +M+ S+S+L++ S N ++V L +++ ++ SY +L ++ E R Q Sbjct: 191 TKAREMLTSLSELMR-----YSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245 Query: 238 LPESAADVVVRGDATLLRMLLRNLVENAHRY----SPEGTHITIHISADPDAI-MAVEDE 292 + + DV V ML++ LVEN ++ P+G I + + D + + VE+ Sbjct: 246 INPAIMDVQV------PPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299 Query: 293 GPGIDESKCGKLSEAFVRMDSRYGGIGLGLSIV-SRITQLHQGQFFLQNRTERTGTRAWV 351 G + + G GL V R+ L+ + ++ ++ A V Sbjct: 300 GSLA--------------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 352 LL 353 L+ Sbjct: 346 LI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 91.0 bits (226), Expect = 2e-23 Identities = 46/144 (31%), Positives = 69/144 (47%), Gaps = 1/144 (0%) Query: 2 KILIVEDDTLLLQGLILAAQTEGYACDGVSTARAAEHSLESGHYSLMVLDLGLPDEDGLH 61 IL+ +DD + L A GY S A + +G L+V D+ +PDE+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 FLTRIRQKKYTLPVLILTARDTLNDRITGLDVGADDYLVKPFALEELHARI-RALLRRHN 120 L RI++ + LPVL+++A++T I + GA DYL KPF L EL I RAL Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 NQGESELTVGNLTLNIGRHQAWRD 144 + E + +GR A ++ Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQE 148
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 32.1 bits (73), Expect = 0.005 Identities = 39/163 (23%), Positives = 60/163 (36%), Gaps = 13/163 (7%) Query: 80 CVFILVGAAAQYFILTYGIIIDRSMIANMMDTTPAETFALM-TPQMVLTLG---LSGVLA 135 CV +V A +L+ + +M P T LM V T G L +LA Sbjct: 177 CVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLA 236 Query: 136 AVIAFWVKIRPATPRLRSGLYRLASVLISILLVILVAAFFYKDYASLFRNNKQLIKALSP 195 +AF V +R R+ L LI + L A + + + L + L++A+ Sbjct: 237 GFMAFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRI 296 Query: 196 SNSIVASWSWYSHQRLANLPLVRIGEDAHRN--------PLML 230 S V S + H+ VR G H+ P+M Sbjct: 297 S-GDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMR 338
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 45.3 bits (107), Expect = 3e-07 Identities = 44/225 (19%), Positives = 93/225 (41%), Gaps = 9/225 (4%) Query: 14 LMFGLFVAYLDRSNLSVTLPTITHDLNIDGATASIVLTIFLIGYAFSNIFGGVFTQRYDP 73 L F + L+ L+V+LP I +D N A+ + V T F++ ++ G + + Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78 Query: 74 KKIVILMVLIWSIATVFVGFTSSVYVILI-CRLVLGITEGIYWPQQSRFASDWFSDKERT 132 K++++ ++I +V S + +LI R + G + + + + R Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 133 QANSIIQYYGQFLALGLGFMILSPLDAAFGWRNVFIITGVIGIVVVVPLYITMLKKQEEA 192 +A +I + G+G I + W + +I + ++ VP + +LKK+ Sbjct: 139 KAFGLIGSIVA-MGEGVGPAIGGMIAHYIHWSYLLLIP--MITIITVPFLMKLLKKEV-- 193 Query: 193 PYYRAPAPTEKTKLTLESLGGTPFLLLIFTYITQGMLFWGITLWI 237 R + + L S+G F+L +Y ++ ++ I Sbjct: 194 ---RIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLI 235
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 37.0 bits (86), Expect = 1e-04 Identities = 44/205 (21%), Positives = 69/205 (33%), Gaps = 55/205 (26%) Query: 3 KNDILITGGHI--IDPARNINEINNLRIINDIIVDADKYPVTSETRIIHADGMIVTPGLI 60 K DI + G I I A N + + II V T +I +G IVT G + Sbjct: 85 KADIGLKDGRIAAIGKAGNPDMQPGVTII-----------VGPGTEVIAGEGKIVTAGGM 133 Query: 61 DYHAHVF-----YDATEGGVRPDMYMPPNGVTTVVDAGSAGTANFDAFYRTVICASKVRI 115 D H H +A +G+T ++ G+ A T I Sbjct: 134 DSHIHFICPQQIEEALM-----------SGLTCMLGGGTGPAHGTLA---TTCTPGPWHI 179 Query: 116 KAFLTVSPPGQTWSQENYDPDNI------DENKIHALFRQYRNVLQGLKLKVQTEDIAEY 169 + + + + P N+ + + AL LKL ED + Sbjct: 180 ARMIE--------AADAF-PMNLAFAGKGNASLPGALVEMVLGGATSLKLH---ED---W 224 Query: 170 GLKP--LTESLRIANDLKCPVAIHS 192 G P + L +A++ V IH+ Sbjct: 225 GTTPAAIDCCLSVADEYDVQVMIHT 249 Score = 30.1 bits (68), Expect = 0.017 Identities = 16/67 (23%), Positives = 26/67 (38%), Gaps = 16/67 (23%) Query: 310 THTPAVLMGMAAEIGTLAPGAFADIAIFKLKNRHVEFADIHGETLTGTHVLVPQMTIKSG 369 T PA+ G++ EIG+L G AD+ ++ V+ P M + G Sbjct: 410 TINPAIAHGLSHEIGSLEVGKRADLVLWNPAFFGVK----------------PDMVLLGG 453 Query: 370 EILFRQI 376 I + Sbjct: 454 TIAAAPM 460
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 48.0 bits (114), Expect = 8e-10 Identities = 17/59 (28%), Positives = 28/59 (47%) Query: 62 DEATLFNIAVDPDFQRRGLGRMLLEHLIDELETRGVVTLWLEVRASNAAAIALYESLGF 120 A + +IAV D++++G+G LL I+ + L LE + N +A Y F Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 213 bits (545), Expect = 6e-64 Identities = 109/452 (24%), Positives = 209/452 (46%), Gaps = 44/452 (9%) Query: 12 KRRTFAIISHPDAGKTTITEKVLLFGQAIQTAGTVKGRGSSQHAKSDWMEMEKQRGISIT 71 K +++H DAGKTT+TE +L AI G+V ++D +E+QRGI+I Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGT----TRTDNTLLERQRGITIQ 57 Query: 72 TSVMQFPYHDCLVNLLDTPGHEDFSEDTYRTLTAVDCCLMVIDAAKGVEDRTRKLMEVTR 131 T + F + + VN++DTPGH DF + YR+L+ +D +++I A GV+ +TR L R Sbjct: 58 TGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALR 117 Query: 132 LRDTPILTFMNKLDRDIRDPMELLDEVENELKIGCAPITWPIGCGKLFKGVYHLYKDETY 191 P + F+NK+D++ D + +++ +L K + Sbjct: 118 KMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVI------------------KQKVE 159 Query: 192 LYQTGKGHTIQEVRIVKGLNNPDLDAAVGEDLAQQLRDELELVQGASNEFDEELFLAGEI 251 LY E + + D + + ++ + + LEL Q S F + Sbjct: 160 LYPNMCVTNFTESEQWDTVIEGN-DDLLEKYMSGKSLEALELEQEES-----IRFHNCSL 213 Query: 252 TPVFFGTALGNFGVDHMLDGLVAWAPAPMPRQTDTRTVEASEEKFTGFVFKIQANMDPKH 311 PV+ G+A N G+D++++ + + + + G VFKI+ K Sbjct: 214 FPVYHGSAKNNIGIDNLIEVITNKFYSS---------THRGQSELCGKVFKIE--YSEK- 261 Query: 312 RDRVAFMRVVSGKYEKGMKLRQVRTGKDVVISDALTFMAGDRSHVEEAYPGDILGLHNHG 371 R R+A++R+ SG +R K + I++ T + G+ +++AY G+I+ L N Sbjct: 262 RQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGELCKIDKAYSGEIVILQNEF 320 Query: 372 TIQIGDTFTQGEMMKFTGIPNFA-PELFRRIRLKDPLKQKQLLKGLVQLSEEG-AVQVFR 429 +++ +++ P L + P +++ LL L+++S+ ++ + Sbjct: 321 -LKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379 Query: 430 PISNNDLIVGAVGVLQFDVVVARLKSEYNVEA 461 + +++I+ +G +Q +V A L+ +Y+VE Sbjct: 380 DSATHEIILSFLGKVQMEVTCALLQEKYHVEI 411
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 27.0 bits (59), Expect = 0.004 Identities = 16/49 (32%), Positives = 20/49 (40%), Gaps = 8/49 (16%) Query: 10 WGIIFLVIALIA--------AALGFGGLAGTAAGAAKIVFVVGIVLFLV 50 W +FL + A AL F LAGT G I V GI+ + Sbjct: 460 WKPLFLTLEKKAADAGVSYVVALLFSLLAGTTLGIWGIAIVTGILCSYI 508
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 29.7 bits (67), Expect = 0.011 Identities = 32/141 (22%), Positives = 51/141 (36%), Gaps = 37/141 (26%) Query: 6 IDTHCHFDFPPFTGDERASIQRACEAGVEKIIVPATEAA-------------HFPRVLAL 52 +D+H HF P I+ A +G+ ++ T A H R++ Sbjct: 133 MDSHIHFICP-------QQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIEA 185 Query: 53 AARFPSLYAALGLHPIVIERHADDDSDKLQQALAQQQNVVAVGEIGLDLYRDDPQFARQE 112 A FP A G + L AL + V G L L+ D + Sbjct: 186 ADAFPMNLAFAG-----------KGNASLPGALVEM---VLGGATSLKLHED---WGTTP 228 Query: 113 RFLDAQLQLAKRYDLPVILHS 133 +D L +A YD+ V++H+ Sbjct: 229 AAIDCCLSVADEYDVQVMIHT 249