>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 141 bits (357), Expect = 3e-39 Identities = 84/388 (21%), Positives = 152/388 (39%), Gaps = 86/388 (22%) Query: 5 IGIDLGTTNSCVAIMDGTQARVLENAEGDRTTPSIIAYTQDGET------LVGQPAKRQA 58 + IDLGT N+ + + Q VL PS++A QD VG AK+ Sbjct: 13 LSIDLGTANTLIYVKG--QGIVLNE-------PSVVAIRQDRAGSPKSVAAVGHDAKQML 63 Query: 59 VTNPQNTLFAIKRLIGRRFQDEEVQRDVSIMPYKIIGADNGDAWLDVKGQKMAPPQISAE 118 P N + AI+ +K +A ++ + Sbjct: 64 GRTPGN-IAAIR---------------------------------PMKDGVIADFFVTEK 89 Query: 119 VLKK-MKKTAEDYLGEPVTEAVITVPAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAAL 177 +L+ +K+ + P ++ VP +R+A +++ + AG +I EP AAA+ Sbjct: 90 MLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAI 149 Query: 178 AYGL--DKEVGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEVLATNGDTHLGGEDFDTR 235 GL + G+ V D+GGGT ++++I ++ V + +GG+ FD Sbjct: 150 GAGLPVSEATGS---MVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEA 197 Query: 236 LINYLVDEFKKDQGIDLRNDPLAMQRLKEAAEKAKIELSSA----QQTDVNLPYITADAT 291 +INY+ + G + AE+ K E+ SA + ++ + Sbjct: 198 IINYVRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEG 244 Query: 292 GPKHMNIKVTRAKLESLVEDLVNRSIEPLKVALQD-AGLSVSDIND--VILVGGQTRMPM 348 P+ + + LE+L E + + + VAL+ SDI++ ++L GG + Sbjct: 245 VPRGFTLN-SNEILEALQEP-LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRN 302 Query: 349 VQKKVAEFFGKEPRKDVNPDEAVAIGAA 376 + + + E G +P VA G Sbjct: 303 LDRLLMEETGIPVVVAEDPLTCVARGGG 330
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 68.7 bits (168), Expect = 1e-15 Identities = 28/141 (19%), Positives = 48/141 (34%), Gaps = 2/141 (1%) Query: 1 MDSITTLIVEDEPMLAEILVDTIKIFPQFSIVGIADKLESAKKQIRLYQPQLILLDNFLP 60 M T L+ +D+ + +L + V I + + I L++ D +P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 DGKGIDLIRHTISTNYTGRIIFITADNHMDTISDALRMGVFDYLIKPVHYQRLQHTLERF 120 D DL+ ++ ++A N T A G +DYL KP L + R Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 121 TRYRSSLRSSEQANQTHVDAL 141 S + + L Sbjct: 119 LAEPKRRPSKLEDDSQDGMPL 139
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 30.2 bits (68), Expect = 0.018 Identities = 19/81 (23%), Positives = 30/81 (37%), Gaps = 13/81 (16%) Query: 104 DATYITVGNEKGQRLYHVNPDEIGKYMEGGDSDDALYNAKSYVSVRKGSLGSSLRGKSPI 163 + + G EK Q L V +E+ KY E G + GS+G + Sbjct: 238 NGAALYYGTEKEQWLREVKVEELRKYYEEG-------------HFKAGSMGPKVLAAIRF 284 Query: 164 QDSTGKVIGIVSVGYTLEQLE 184 + G+ I + +E LE Sbjct: 285 IEWGGERAIIAHLEKAVEALE 305
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.0 bits (70), Expect = 0.014 Identities = 17/67 (25%), Positives = 29/67 (43%), Gaps = 7/67 (10%) Query: 508 ASSAPVQAAAPA-------GAGTPVTAPLAGNIWKVIATEGQTVAEGDVLLILEAMKMET 560 + V+ A A G + + ++I EG++V +GDVLL L A+ E Sbjct: 75 SVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA 134 Query: 561 EIRAAQA 567 + Q+ Sbjct: 135 DTLKTQS 141 Score = 29.4 bits (66), Expect = 0.046 Identities = 15/56 (26%), Positives = 22/56 (39%), Gaps = 10/56 (17%) Query: 535 KVIATEGQTVAEGDVLLILEAMKMETEIRAAQAGTVRGIAVKSGDAVSVGDTLMTL 590 V G+ G EI+ + V+ I VK G++V GD L+ L Sbjct: 82 IVATANGKLTHSGRSK----------EIKPIENSIVKEIIVKEGESVRKGDVLLKL 127
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 38.3 bits (89), Expect = 1e-05 Identities = 21/102 (20%), Positives = 43/102 (42%), Gaps = 4/102 (3%) Query: 158 NPFTLGHRYLVEQAAAACDWLHLFVVKEDAS--FFSYTDRWALIEQGIAGIDNVTLHSGS 215 +P T GH ++E+ D +++ V++ FS +R I + IA + N + S Sbjct: 10 DPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLPNAQVDSFE 69 Query: 216 AYMISRATFPGYFLKEKGV--VDDCHCQIDLQLFREHLAPAL 255 ++ A +G+ + D ++ + + LA L Sbjct: 70 GLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDL 111
>PF06580#Sensor histidine kinase Length = 349 Score = 29.8 bits (67), Expect = 0.024 Identities = 15/79 (18%), Positives = 27/79 (34%), Gaps = 3/79 (3%) Query: 4 RRQPLIPGWLIPGLCAAALMITVSLAAFLALWLNAPSGAWSTIWRDSYLWHVVRFSFWQA 63 R GWL + L + + +W A + W + +++ Sbjct: 60 RSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLL---AFINTKPVAFTLPL 116 Query: 64 FLSAVLSVVPAVFLARALY 82 LS + +VV F+ LY Sbjct: 117 ALSIIFNVVVVTFMWSLLY 135
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 499 bits (1286), Expect = 0.0 Identities = 246/296 (83%), Positives = 266/296 (89%) Query: 1 MRDLYPLTRRRLLTAMALSPLLWQMNTAQAAAIDPRRIVALEWLPVELLLALGITPYGVA 60 M L ++RRRLLTAMALSPLLWQMNTA AAAIDP RIVALEWLPVELLLALGI PYGVA Sbjct: 1 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60 Query: 61 DVPNYKLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEKLARIAPGH 120 D NY+LWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPE LARIAPG Sbjct: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR 120 Query: 121 GFDFSDGKKPLAVARRSLVELAQTLNLEAAAEKHLAQYDRFIASQKPHFIRRGGRPLLMT 180 GF+FSDGK+PLA+AR+SL E+A LNL++AAE HLAQY+ FI S KP F++RG RPLL+T Sbjct: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT 180 Query: 181 TLIDPRHMLVLGPNCLFQEVLDEYGIVNAWQGETNFWGSTAVSIDRLAMYKEADVICFDH 240 TLIDPRHMLV GPN LFQE+LDEYGI NAWQGETNFWGSTAVSIDRLA YK+ DV+CFDH Sbjct: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240 Query: 241 GNSTDMNALMATPLWQAMPFVRAGRFHRVPAVWFYGATLSTMHFVRILDNVLGGKA 296 NS DM+ALMATPLWQAMPFVRAGRF RVPAVWFYGATLS MHFVR+LDN +GGKA Sbjct: 241 DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 69.6 bits (170), Expect = 3e-15 Identities = 32/98 (32%), Positives = 44/98 (44%), Gaps = 13/98 (13%) Query: 344 INKAAREIARVGGAVTVTGHTDSQPIHSAEFPSNLVLSEKRAAEVAALLTSGGVPAGRVH 403 + + G+V V G+TD I S + N LSE+RA V L S G+PA ++ Sbjct: 241 LYSQLSNLDPKDGSVVVLGYTDR--IGSDAY--NQGLSERRAQSVVDYLISKGIPADKIS 296 Query: 404 IVGKGDTVPVADN---------GSKAGRAKNRRVEILV 432 G G++ PV N A +RRVEI V Sbjct: 297 ARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334
>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature. Length = 52 Score = 27.5 bits (61), Expect = 0.004 Identities = 11/24 (45%), Positives = 18/24 (75%) Query: 9 SLLFMVLIVLFVILFFTWLGRENI 32 SL++ VLIV +L FT+L R+++ Sbjct: 7 SLVWCVLIVCLTLLIFTYLTRKSL 30
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 831 bits (2148), Expect = 0.0 Identities = 311/872 (35%), Positives = 455/872 (52%), Gaps = 52/872 (5%) Query: 4 KQPALLLFIAGVVHCANA-------HAYTFDASML-GDAAKGVDMSLFNQG-VQQPGTYR 54 K F+ V CA A F+ L D D+S F G PGTYR Sbjct: 20 KHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYR 79 Query: 55 VDVMVNGKRIDTRDVVFKLEKDGQGTPFLAPCLTVSQLSRYGVKTEDYPQLWKAAKTPDE 114 VD+ +N + TRDV F QG + PCLT +QL+ G+ T + A D Sbjct: 80 VDIYLNNGYMATRDVTFNTGDSEQG---IVPCLTRAQLASMGLNTASVSGMNLLAD--DA 134 Query: 115 CADLT-AIPQAKAVLDINNQQLQLSIPQLALRTKFKGIAPEDLWDDGIPAFLMNYSARTM 173 C LT I A A LD+ Q+L L+IPQ + + +G P +LWD GI A L+NY+ Sbjct: 135 CVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGN 194 Query: 174 QTDYKMDMGRRDNSSWVQLQPGINIGAWRVRNATSWQR-----SSQLSGKWQAAYTYAER 228 +G + +++ LQ G+NIGAWR+R+ T+W SS KWQ T+ ER Sbjct: 195 SVQN--RIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLER 252 Query: 229 GLYSLKSRLTLGQKTSQGEIFDSVPFTGVMLASDDNMVPYSERQFAPVVRGIARTQARVE 288 + L+SRLTLG +QG+IFD + F G LASDDNM+P S+R FAPV+ GIAR A+V Sbjct: 253 DIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVT 312 Query: 289 VKQNGYTIYNTTVAPGPFALRDLSVTDSSGDLHVTVWEADGSTQMFVVPYQTPAIALHQG 348 +KQNGY IYN+TV PGPF + D+ +SGDL VT+ EADGSTQ+F VPY + + +G Sbjct: 313 IKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREG 372 Query: 349 YLKYSLLAGRYRSSDSATDKAQIAQATLMYGLPWNLTAYGGIQSATHYQAASLGLGVSLG 408 + +YS+ AG YRS ++ +K + Q+TL++GLP T YGG Q A Y+A + G+G ++G Sbjct: 373 HTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMG 432 Query: 409 RWGSLSVDGSDTHSQRQGEAVQQGASWRLRYSNQLTATGTNFSLTRWQYASQGYNTLSDV 468 G+LSVD + +S ++ G S R Y+ L +GTN L ++Y++ GY +D Sbjct: 433 ALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADT 492 Query: 469 LDSYRHDGNRL-------------WSWRENLQPSSRTTLMLSQSWGRHLGNLSLTGSRTD 515 S + N + + L ++Q GR L L+GS Sbjct: 493 TYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQT 551 Query: 516 WRNRPGHDDSYGLSWGTSIGGGSLSLNWNQNRTLWRNGAHRKENITSLWFSMSLSRWTGN 575 + D+ + T+ + +L+++ + W+ G ++ + +L ++ S W + Sbjct: 552 YWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKG---RDQMLALNVNIPFSHWLRS 608 Query: 576 -------NVSASWQMTSPSHGGQTQQVGVNGEAFSQ-QLDWEVRQSYRADAPPGGGNNSA 627 + SAS+ M+ +G T GV G L + V+ Y G+ Sbjct: 609 DSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGY 668 Query: 628 LHLAWNGGYGLLGGDYSYSRAMRQMGVNIAGGIVIHHHGVTLGQPLQGSVALVEAPGASG 687 L + GGYG YS+S ++Q+ ++GG++ H +GVTLGQPL +V LV+APGA Sbjct: 669 ATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKD 728 Query: 688 VPVGGWPGVKTDFRGDTTVGNLSVYQENTVSLDPSRLPDDAEVTQTDVRVVPTEGAVVEA 747 V GV+TD+RG + + Y+EN V+LD + L D+ ++ VVPT GA+V A Sbjct: 729 AKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRA 788 Query: 748 KFHTRIGARALMTLKREDGSAIPFGAQVTVNGQDGSADLVDTDSQVYLTGLADKGELTVK 807 +F R+G + LMTL + +PFGA VT + S+ +V + QVYL+G+ G++ VK Sbjct: 789 EFKARVGIKLLMTLTH-NNKPLPFGAMVT-SESSQSSGIVADNGQVYLSGMPLAGKVQVK 846 Query: 808 WGA---QQCRVNYHLPAHKGIAGLYQMSGLCR 836 WG C NY LP L Q+S CR Sbjct: 847 WGEEENAHCVANYQLPPESQQQLLTQLSAECR 878
>PF05775#Enterobacteria AfaD invasin protein Length = 142 Score = 91.1 bits (226), Expect = 3e-26 Identities = 38/132 (28%), Positives = 66/132 (50%), Gaps = 2/132 (1%) Query: 14 SVSLLVTVSSLMPIANAAEKLQTTLRVGAYFRAGHVPDGMVLAQGWVTYHGSHSGFRVWS 73 S+SL + LM + + ++ TL Y + DG+ LA G + +HSGFRVW Sbjct: 4 SISLTLCGILLMLMGSFSQAADITLMNHKYM-GNLLHDGVKLATGRIICQDTHSGFRVWI 62 Query: 74 DEQKAGNTPTVLLLSGQQDPRHHIQVRLEGEGWQPDTVNGRGAILRTAADNAS-FSVVVD 132 + ++ G ++ + P+H++++R+ G GW G + T ++AS F + VD Sbjct: 63 NARQEGGGAGKYIVQSTEGPQHNLRIRISGNGWSSFVEKGIQGVFNTIKEDASIFYIEVD 122 Query: 133 GNQEVPADTWTL 144 GNQ+V + Sbjct: 123 GNQQVQPGKYLF 134
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 36.3 bits (84), Expect = 1e-04 Identities = 32/126 (25%), Positives = 48/126 (38%), Gaps = 12/126 (9%) Query: 130 VPVINENDAVATAEIKVGDNDNLSALAAILAGADKLLLLTDQQGLFTADPRSNPQAELIK 189 VPVI E+ + E V D D A AD ++LTD G + + ++ Sbjct: 197 VPVILEDGEIKGVE-AVIDKDLAGEKLAEEVNADIFMILTDVNGAALY--YGTEKEQWLR 253 Query: 190 DVYGVDDALRSIAGDSVSGLGTGGMSTKLQAA-DVACRAGIDTIIASGSKPGVIGDVMEG 248 +V V++ + G M K+ AA G IIA K + +EG Sbjct: 254 EV-KVEELRKYYEEG---HFKAGSMGPKVLAAIRFIEWGGERAIIAHLEK---AVEALEG 306 Query: 249 ISVGTR 254 GT+ Sbjct: 307 -KTGTQ 311 Score = 30.2 bits (68), Expect = 0.011 Identities = 16/76 (21%), Positives = 33/76 (43%), Gaps = 13/76 (17%) Query: 4 SQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQ----LHAAGHRIVIVTSG-------- 51 + +V+ LG + L ++ + +++ VR+ A+ + A G+ +VI Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61 Query: 52 -AIAAGREHLGYPELP 66 + AG+ G P P Sbjct: 62 LHMDAGQATYGIPAQP 77
>PF05272#Virulence-associated E family protein Length = 892 Score = 25.0 bits (54), Expect = 0.025 Identities = 7/33 (21%), Positives = 14/33 (42%) Query: 24 DIEEDLGISDDEWDSYSEGDKDEIMKDVAWERM 56 D+ + LG + EG + + + WE + Sbjct: 802 DLVQALGADPGKSSPMLEGQVRDWLNENGWEYL 834
>ICENUCLEATIN#Ice nucleation protein signature. Length = 1258 Score = 26.6 bits (58), Expect = 0.014 Identities = 30/92 (32%), Positives = 40/92 (43%), Gaps = 13/92 (14%) Query: 3 GNRSAATNTGDCS--AADVS----GSQSVAAAFGIEGKARASEGGAI------VLCYRDE 50 G+RS T DC A D S G S+ A G K S G + VL +R Sbjct: 1163 GDRSKLTAGNDCILMAGDRSKLTAGINSILTA-GCRSKLIGSNGSTLTAGENSVLIFRCW 1221 Query: 51 DGELIHIRASKVGENGIMPNTWYQLNEDGEFV 82 DG+ +K G+ GI + YQ++ED V Sbjct: 1222 DGKRYTNVVAKTGKGGIEADMPYQMDEDNNIV 1253
>PF05272#Virulence-associated E family protein Length = 892 Score = 53.2 bits (127), Expect = 3e-09 Identities = 30/82 (36%), Positives = 48/82 (58%), Gaps = 2/82 (2%) Query: 4 SELSDLLWAQVDRVAPHLLPNGKKEGHEWVAGNVNGDKGNSLKVNLSGKKKWADFAEGDG 63 + L+D L + + P LP G GHE+ G++ G KG+S KVN++ KW DF+ G+ Sbjct: 12 TSLADALLTRAKDLLPEWLPGGVLVGHEYECGSLAGGKGDSCKVNVT-TGKWCDFSTGES 70 Query: 64 G-DMLDLWMACRGINLHQAMQE 84 G D+LDL+ G+ + +A + Sbjct: 71 GRDLLDLYAEIHGLKVSKAAAQ 92
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 28.3 bits (63), Expect = 0.031 Identities = 15/110 (13%), Positives = 40/110 (36%), Gaps = 8/110 (7%) Query: 40 TDVSSIAEKAN-QAGGGAYDAQVRNDEQDVILDEHEKRITKTEEDISGIKVKLLEIENDV 98 DV + + N Q G Q + + K E+ + +++ + Sbjct: 201 VDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRV-----NS 255 Query: 99 NGLKIKVQDIDGKVSDIIVDYVSLSRTGTQTLASSINVSGSYFVNGTKVV 148 +G ++++D+ +V +Y ++R + A+ + + + N Sbjct: 256 DGSVVRLKDV-ARVELGGENYNVIARINGKP-AAGLGIKLATGANALDTA 303
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 50.6 bits (121), Expect = 6e-09 Identities = 57/265 (21%), Positives = 91/265 (34%), Gaps = 32/265 (12%) Query: 103 FSGTLSDRFGRKPIIFYSLLAGGILTLLCATASSWPMLVVYRALLGIAVSGITAAVTVYI 162 G LSDRFGR+P++ SL + + ATA +L + R + GI +G T AV Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI--TGATGAVAGAY 119 Query: 163 SEEVSPA---------LAGIVTGYFIFGNSLGSMSGRVFATLMMEHVSIDTIFFIFGGVL 213 +++ ++ + G LG + G F L Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAP----------FFAAAAL 169 Query: 214 IAMALAVKLFL---PTSRQFVPTPSLQLGAVLKGGLEHFKNIRVSLCFVIGFI--LFGSF 268 + FL + P L + + V+ + FI L G Sbjct: 170 NGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTV-VAALMAVFFIMQLVGQV 228 Query: 269 TSIFNFLAFYLHRPPYELSYTWIGLIPVSFSLT--FFLAPYAARVALNIGSMNALSMLII 326 + ++ F R + T IG+ +F + A VA +G AL + +I Sbjct: 229 PAAL-WVIFGEDR--FHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMI 285 Query: 327 CMMVGAFLTLIAPSLWVFISGIVLL 351 G L A W+ +VLL Sbjct: 286 ADGTGYILLAFATRGWMAFPIMVLL 310
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 762 bits (1968), Expect = 0.0 Identities = 261/880 (29%), Positives = 413/880 (46%), Gaps = 63/880 (7%) Query: 4 TINLNRKS-LALLIAIVCSGSAQG----EEYYFDPALLQGATYGQ-NIARFNE-QQTPSG 56 I +R + + + + C+ +AQ E YF+P L +++RF Q+ P G Sbjct: 17 HIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPG 76 Query: 57 DYLADVYVNGTLVTSSTNIRFNAVKEGQQTEPCLPLSVMKAAQIKSLPATDAA----TEC 112 Y D+Y+N + + ++ FN Q PCL + + + + + + C Sbjct: 77 TYRVDIYLNNGYMAT-RDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDAC 135 Query: 113 RPLREWVPHAGWQFDSATLRLLLTIPMTELTHKPRGYISPSEWDSGALALFLRHNTNWTH 172 PL + A Q D RL LTIP ++++ RGYI P WD G A L +N + Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNS 195 Query: 173 TENTDSHYRYQYLWSGLNMGVNLGLWQVRHQSNLRYANSNQS-GSAWRYNSVRTWVQRPV 231 +N Y + L G+N+G W++R + Y +S+ S GS ++ + TW++R + Sbjct: 196 VQN-RIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDI 254 Query: 232 ASINSILSLGDSYTDSSLFGSLSFNGAKLVTDERMRPQGKRGYAPEVRGVAASSAHVVVK 291 + S L+LGD YT +F ++F GA+L +D+ M P +RG+AP + G+A +A V +K Sbjct: 255 IPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIK 314 Query: 292 QLGKVIYETNVPPGPFYIDDLYNTRYQGDLEVEVIEASGKTSRFTVPYSSVPDSVRPGNW 351 Q G IY + VPPGPF I+D+Y GDL+V + EA G T FTVPYSSVP R G+ Sbjct: 315 QNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHT 374 Query: 352 HYSLAFGRVRQYY--DIENRFFEGTFQHGVNNTITLNLGSRIAQRYQAWLAGGVWATGM- 408 YS+ G R + RFF+ T HG+ T+ G+++A RY+A+ G G Sbjct: 375 RYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGAL 434 Query: 409 GAFGLNATWSNARAEHNERQQGWRAELSYSKTFT-TGTNLVLAAYRYSTNGFRDLQDVLG 467 GA ++ T +N+ + + G Y+K+ +GTN+ L YRYST+G+ + D Sbjct: 435 GALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTY 494 Query: 468 VRREAKTGI-------------DYYSDTLHQRNRLSATVSQPLGRLGTLNLSASTADYYN 514 R DYY+ ++R +L TV+Q LGR TL LS S Y+ Sbjct: 495 SRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWG 554 Query: 515 NQSRITQLQMGYSNQWRNISYGVNIARQRTTWDYDRFYHGVNEPLDVSSRQKYTETTMSF 574 + Q Q G + + +I++ ++ + + W + ++ Sbjct: 555 TSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKG------------------RDQMLAL 596 Query: 575 NVSIPLDWGENRTSVA------MNYNQSSQSRSST---VSMTGSSGENSDLSWSVYGGYE 625 NV+IP S + +Y+ S + G+ E+++LS+SV GY Sbjct: 597 NVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYA 656 Query: 626 RYRNSNSDSSAPTTFGGNLQQNTRFGALRANYDQGDNYRQEGLGASGTLVLHPGGLTAGP 685 + NS S+ L +G Y D+ +Q G SG ++ H G+T G Sbjct: 657 GGGDGNSGSTG----YATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQ 712 Query: 686 YTSDTFALIHADGAQGAIVQNGQGAVVDRFGYAILPSLSPYRVNNVTLDTRKMRSDAELT 745 +DT L+ A GA+ A V+N G D GYA+LP + YR N V LDT + + +L Sbjct: 713 PLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLD 772 Query: 746 GGSQQIVPYAGAIARVNFATISGKAVLISVKMPDGGIPPMGADVFNGEGTNIGMVGQSGQ 805 +VP GAI R F G +L+++ + P GA V + + G+V +GQ Sbjct: 773 NAVANVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQ 831 Query: 806 IYARIAHPSGSLLVRWGTGANQRCRVAYQLDLHTKEPFLY 845 +Y +G + V+WG N C YQL +++ L Sbjct: 832 VYLSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLT 871
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 134 bits (339), Expect = 7e-43 Identities = 59/183 (32%), Positives = 88/183 (48%), Gaps = 21/183 (11%) Query: 1 MKRRSSFLVFLGLLLASPLALANDQHTVSFGYAQTHLSSLKNSDSKDLRGFNFKYRYEFN 60 MK+ + +L + TV+ GYAQ+ N + GFN KYRYE + Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAATSTVTGGYAQSDAQGQMN----KMGGFNLKYRYEED 56 Query: 61 ET-WGMLGSFTATRNEMENYTWKEGKLHKNGSDSVDYGSLMFGPTYRFNDYVSLYGNAGI 119 + G++GSFT T + K Y + GP YR ND+ S+YG G+ Sbjct: 57 NSPLGVIGSFTYTEKSRTASSGDYNK--------NQYYGITAGPAYRINDWASIYGVVGV 108 Query: 120 ATMKF--------NKHSKEDSFAYGAGVIFNPVKSISIDASWEASRFFAVDTNTFGVSVG 171 KF + + F+YGAG+ FNP++++++D S+E SR +VD T+ VG Sbjct: 109 GYGKFQTTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWIAGVG 168 Query: 172 YRF 174 YRF Sbjct: 169 YRF 171
>cdtoxinb#Cytolethal distending toxin B signature. Length = 269 Score = 30.7 bits (69), Expect = 0.010 Identities = 21/70 (30%), Positives = 27/70 (38%), Gaps = 4/70 (5%) Query: 75 AIALRNNRDLRKAGLNVEAARALYRIQRAEMLPTLGIATAMDAGRTPADLSVTDEPEINR 134 AIA+RNN A VE +R R + L D R PADL + + R Sbjct: 155 AIAMRNN----DAPALVEEVYNFFRDSRDPVHQALNWMILGDFNREPADLEMNLTVPVRR 210 Query: 135 RYEMAGATTA 144 E+ A Sbjct: 211 ASEIISPAAA 220
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 48.3 bits (115), Expect = 4e-08 Identities = 18/112 (16%), Positives = 37/112 (33%), Gaps = 7/112 (6%) Query: 74 ELRSRVGGTLDAVSVPEGRLVSRGQLLFQIDPRPFEVALDTAVAQLRQAEVLARQAQADF 133 E++ + + V EG V +G +L ++ E + L QA + + Q Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157 Query: 134 DRIQR-------LVASGAVSRKNADDVTATRNARQAQMQSAKAAVAAARLEL 178 I+ L + ++V + + Q + + L L Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNL 209 Score = 34.4 bits (79), Expect = 7e-04 Identities = 20/106 (18%), Positives = 37/106 (34%), Gaps = 13/106 (12%) Query: 112 LDTAVAQLRQAEVLARQAQADFDRIQRLVASGAVSRKNADDVTATRNARQAQMQSAKAAV 171 L +QL Q E A+ ++ + +L + ++ + + Sbjct: 268 LRVYKSQLEQIESEILSAKEEYQLVTQLFKN---------EILDKLRQTTDNIGLLTLEL 318 Query: 172 AAARLELSWTRITAPIAGRVDRILVTRGNLVSGGVAGNATLLTTIV 217 A + I AP++ +V ++ V GGV A L IV Sbjct: 319 AKNEERQQASVIRAPVSVKVQQLKVHT----EGGVVTTAETLMVIV 360
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 52.1 bits (125), Expect = 2e-09 Identities = 70/356 (19%), Positives = 122/356 (34%), Gaps = 35/356 (9%) Query: 5 IFSLALGTFGLGMAEFSIMGVLTELARDVGITIPAAGH---MISFYAFGVVLGAPVMALF 61 + ++AL G+G+ IM VL L RD+ + H +++ YA APV+ Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 62 SSRFSLKHILLFLVTLCVMGNAIFTFSSSYLMLAVGRLVSGFPHGAFFGVGAIVLSKIIR 121 S RF + +LL + + AI + +L +GR+V+G GA + I Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD--IT 124 Query: 122 PGKVTAAVAGMVSGMTVANLVGIPVGTYLSQEFSWRYTFLLIAVFNIAVLTAIFFWVPDI 181 G A G +S +V PV L FS F A N F +P+ Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 182 RDKAQGSLREQ----------FHFLRSPAPWLI--FAATMFGNAGVFAWFSYIKPFMMYI 229 + LR + + A + F + G W + Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG------E 238 Query: 230 SGFSETSMTFIMMLVGLGM---VLGNLLSGKLSGRYTPLRIAVVTDLVIVLSLMALFFFS 286 F + T + L G+ + +++G ++ R R ++ ++ + Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLG---MIADGTGYILLA 295 Query: 287 GYKTASLTFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAF--NLGSAIG 340 + F + + P +L E G G +A +L S +G Sbjct: 296 FATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG 351
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 49.1 bits (117), Expect = 7e-08 Identities = 32/198 (16%), Positives = 71/198 (35%), Gaps = 13/198 (6%) Query: 373 TQQSHDRAQLSQWQQQLLSDTRQRDALPPLTLDLTPQALAEARALHTRQRPLRHRLAALQ 432 TQ S +A+L Q + Q+LS + + + LP L L P + R L ++ Sbjct: 139 TQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL------IK 192 Query: 433 GQILPKQKRQAQLQAAIARHHQEQAQYTQRLADKRLSYKTKAQELADVRTICEQ----EA 488 Q Q ++ Q + + + E+ R+ + + L D ++ + + Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKH 252 Query: 489 RIKDLESQRAHLQS--GQPCPLCGSTTHPAIAAYQALELSANQTRRDALEKEVKTLAEEG 546 + + E++ + ++A + +L + + L+K +T Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNI- 311 Query: 547 AALRGQLDALTQQLQRDE 564 L +L ++ Q Sbjct: 312 GLLTLELAKNEERQQASV 329
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.041 Identities = 7/21 (33%), Positives = 12/21 (57%) Query: 46 VLALIGPSGSGKTTVLRAVAG 66 + L G G GK+T++ + G Sbjct: 598 SVVLEGTGGIGKSTLINTLVG 618
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 28.2 bits (62), Expect = 0.047 Identities = 33/118 (27%), Positives = 52/118 (44%), Gaps = 14/118 (11%) Query: 118 PLVKNYLSFIYNSKLLKTAPASWQDL--LDAKFKNKLQYSTPGQAADGMAVMLQAFH-SF 174 P+ LS IYN LL P +W+++ LD + K K G++A + F Sbjct: 133 PIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAK------GKSALMFNLQEPYFTWPL 186 Query: 175 GSKDAGFAYL---GKLQANNVGPSASTGK--LTALVNKGEIYVANGDLQMNLAQMERN 227 + D G+A+ GK +VG + K LT LV+ + N D ++A+ N Sbjct: 187 IAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFN 244
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 83.7 bits (207), Expect = 1e-19 Identities = 81/373 (21%), Positives = 151/373 (40%), Gaps = 25/373 (6%) Query: 2 LGMFMVLPVLTTY--GMALQGASEALIGIAIGIYGLAQAIFQIPFGLLSDRIGRKPLIVG 59 +G+ +++PVL + A GI + +Y L Q G LSDR GR+P+++ Sbjct: 19 VGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLV 78 Query: 60 GLAVFVAGSVIAALSHSIWGIILGRALQG-SGAIAAAVMALLSDLTREQNRTKAMAFIGV 118 LA I A + +W + +GR + G +GA A A ++D+T R + F+ Sbjct: 79 SLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSA 138 Query: 119 SFGITFAIAMVLGPIVTHSLGLNALFWMIAALATLGILLTIWVVPNSTNHVLNRESGMVK 178 FG VLG ++ +A F+ AAL L L +++P S Sbjct: 139 CFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREAL 197 Query: 179 GSFSKVLAEPRLLKLNFGIMCLHILLMSTFVA-LPGQLADAGFPAAEHWKVYLATMVIAF 237 + + G+ + L+ F+ L GQ+ A + + + I Sbjct: 198 NPLAS-------FRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGI 250 Query: 238 A--------AVVPFIIYAEVKRRMKQVFLFCVGLI--VVAEIVLWGAGQHFWELVIGVQL 287 + ++ +I V R+ + +G+I I+L A + + I V L Sbjct: 251 SLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLL 310 Query: 288 FFLAFNL--MEALLPSLISKESPAGYKGTAMGVYSTSQFLGVALGGSLGGWIDGTFDGQT 345 + ++A+L + +E +G+ + S + +G L ++ T++G Sbjct: 311 ASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNG-W 369 Query: 346 VFLAGAVLAMVWL 358 ++AGA L ++ L Sbjct: 370 AWIAGAALYLLCL 382
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 41.0 bits (96), Expect = 9e-06 Identities = 40/190 (21%), Positives = 76/190 (40%), Gaps = 15/190 (7%) Query: 223 RNNAWLI-LLLIVLYKLGDAFAMSLTTTFLIRGVGFDAGEVGVVNKTLGLLATIVGALYG 281 R+N LI L ++ + + + ++++ + VN L +I A+YG Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 282 GILMQRLSLFRALLIFGILQGASNAGYWLLSITDKNMFSMGAAVFFENLCGGMGTAAFVA 341 L +L + R LL I+ + ++ + FS+ + G G AAF A Sbjct: 71 K-LSDQLGIKRLLLFGIIINCFGS----VIGFVGHSFFSL---LIMARFIQGAGAAAFPA 122 Query: 342 LLM----TLCNKSFSATQFALLSALSAVGRVYVGPVAGWFVEAH-GWPTFYLFSVVAAVP 396 L+M K F L+ ++ A+G VGP G + + W L ++ + Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMG-EGVGPAIGGMIAHYIHWSYLLLIPMITIIT 181 Query: 397 GLLLLLVCRQ 406 L+ + ++ Sbjct: 182 VPFLMKLLKK 191
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 50.2 bits (120), Expect = 7e-09 Identities = 51/326 (15%), Positives = 112/326 (34%), Gaps = 24/326 (7%) Query: 62 AYCSMQIPC----GILVDKFGQKIMLMAGFTLFIIGTLCIAKANGLAMIYTGSLMAGGGC 117 Y MQ C G L D+FG++ +L+ + +A A L ++Y G ++AG Sbjct: 51 LYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITG 110 Query: 118 ASFFSSAYSLSSANVPQARRA----LANAIINSGSAIGMGIGLIGSSILVKNMSMAWQNV 173 A+ + A + + RA +A G G +G + Sbjct: 111 AT-GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPH--------A 161 Query: 174 LYIVAAILVIMLCVFTLVIRGKAKSDSAQAEKQTQTVTEDEKRAPLFSGLLCSVYFLYFC 233 + AA L + + + ++ + ++ R ++ ++ ++F Sbjct: 162 PFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFI 221 Query: 234 TCYGYYLIVTWLPSYLQTERGFDGGAIGLASALVAVVG-VPGALFFSHLSDKFR-NSKVK 291 + + + +D IG++ A ++ + A+ ++ + + Sbjct: 222 MQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM 281 Query: 292 VILGLEIVAAAMLAFTVLSPNTTMLMVSLTLYGLLGKMAVDPILISFVSEQASAKSLGRA 351 + + + +LAF +MV L G+ P L + +S Q + G+ Sbjct: 282 LGMIADGTGYILLAFATRGWMAFPIMVLLASGGI-----GMPALQAMLSRQVDEERQGQL 336 Query: 352 FSLFNFFGMSSAVVAPTLTGFISDVT 377 +++V P L I + Sbjct: 337 QGSLAALTSLTSIVGPLLFTAIYAAS 362
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 46.3 bits (110), Expect = 2e-07 Identities = 36/163 (22%), Positives = 56/163 (34%), Gaps = 32/163 (19%) Query: 4 DLIIKNGTVILENEARVIDIAVQGGKIAAIGEN------------LEEAKNVLDATGLIV 51 D +I N ++ DI ++ G+IAAIG+ + V+ G IV Sbjct: 69 DTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIV 128 Query: 52 SPGMVDAHTHISEPGRTHWEGYETGTRAAAKGGITTMIEMPLNQLPATVDRET------- 104 + G +D+H H P + A G+T M+ PA T Sbjct: 129 TAGGMDSHIHFICPQQIE---------EALMSGLTCMLGGGTG--PAHGTLATTCTPGPW 177 Query: 105 -IELKFDAAKGKLTIDAAQLGGLVSYNLDRLHELDEVGVVGFK 146 I +AA ++ A G + L E+ G K Sbjct: 178 HIARMIEAADA-FPMNLAFAGKGNASLPGALVEMVLGGATSLK 219
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 361 bits (929), Expect = e-128 Identities = 127/310 (40%), Positives = 177/310 (57%), Gaps = 16/310 (5%) Query: 2 KTLVVALGGNALLQRGEALTAENQYRNIADAVPALARL-ARSYRLAIVHGNGPQVGLLAL 60 K +V+ALGGNAL QRG+ + E N+ +A + AR Y + I HGNGPQVG L L Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62 Query: 61 QNLAWKA---VEPYPLDVLVAESQGMIGYMLAQRLALEPDM----PPVTTVLTRIKVSAD 113 A +A + P+DV A SQG IGYM+ Q L E V T++T+ V + Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKN 122 Query: 114 DPAFLEPEKFIGPVYSPEEQMALEATYGWHMKRD-GKYLRRVVASPAPRQIIESAAIELL 172 DPAF P K +GP Y E L GW +K D G+ RRVV SP P+ +E+ I+ L Sbjct: 123 DPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKL 182 Query: 173 LKEGHVVICSGGGGVPVAGEG---EGVEAVIDKDLAAALLAEQIAADGLIILTDADAVYE 229 ++ G +VI SGGGGVPV E +GVEAVIDKDLA LAE++ AD +ILTD + Sbjct: 183 VERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAAL 242 Query: 230 HWGTPQQRAIRQASPDELAPFAKAD----GAMGPKVTAVSGYVKRCGKPAWIGALSRIDD 285 ++GT +++ +R+ +EL + + G+MGPKV A +++ G+ A I L + + Sbjct: 243 YYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVE 302 Query: 286 TLAGRAGTCI 295 L G+ GT + Sbjct: 303 ALEGKTGTQV 312
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 834 bits (2157), Expect = 0.0 Identities = 404/861 (46%), Positives = 559/861 (64%), Gaps = 21/861 (2%) Query: 18 LSSVALSVLVALCPLTSRGESYFNPAFLSADTASVADLSRFEKGNHQPPGIYRVDIWRND 77 + ++ A S E YFNP FL+ D +VADLSRFE G PPG YRVDI+ N+ Sbjct: 27 FVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNN 86 Query: 78 EFVATQDIRFEAGAVGTGDKSGGLMPCFTPEWIKRLGVNTAAFPVSDKGVDTTCIHLPEK 137 ++AT+D+ F G D G++PC T + +G+NTA+ + D C+ L Sbjct: 87 GYMATRDVTFNTG-----DSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSM 141 Query: 138 IPGAEVAFDFASMRLNISLPQASLLNSARGYIPPEEWDEGIPAALINYSFTGSR-----G 192 I A D RLN+++PQA + N ARGYIPPE WD GI A L+NY+F+G+ G Sbjct: 142 IHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIG 201 Query: 193 TDSDSYFLSLLSGLNYGPWRLRNNGAWNYSKGDG--YHSQRWNNIGTWVQRAIIPLKSEL 250 +S +L+L SGLN G WRLR+N W+Y+ D +W +I TW++R IIPL+S L Sbjct: 202 GNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRL 261 Query: 251 VMGDSNTGNDVFDSVGFRGARLYSSDNMYPDSLQGYAPTVRGIARTAAKLTIRQNGYVIY 310 +GD T D+FD + FRGA+L S DNM PDS +G+AP + GIAR A++TI+QNGY IY Sbjct: 262 TLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIY 321 Query: 311 QSYVSPGAFAITDLNPTSSSGDLEVTVDEKDGSQQRYTVPYSTVPLLQREGRVKYDLVAG 370 S V PG F I D+ +SGDL+VT+ E DGS Q +TVPYS+VPLLQREG +Y + AG Sbjct: 322 NSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAG 381 Query: 371 DFRSGNSQQSSPFFFQGTVIAGLPAGLTAYGGTQLADRYRAVVVGAGRNLGDWGAVSVDV 430 ++RSGN+QQ P FFQ T++ GLPAG T YGGTQLADRYRA G G+N+G GA+SVD+ Sbjct: 382 EYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDM 441 Query: 431 THARSQLADDSTHQGQSLRFLYAKSLNNYGTNFQLLGYRYSTRGFYTLDDVAYRSMEGYD 490 T A S L DDS H GQS+RFLY KSLN GTN QL+GYRYST G++ D Y M GY+ Sbjct: 442 TQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYN 501 Query: 491 YEYDSDGRRHKVPVAQSYHNLRYSKKGRFQVNISQNLGDYGSLYLSGSQQNYWNTADTNT 550 E DG P Y+NL Y+K+G+ Q+ ++Q LG +LYLSGS Q YW T++ + Sbjct: 502 IE-TQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDE 560 Query: 551 WYQLGYASGWQGISYSLSWSWNESVGISGADRILAFNMSVPFSVLTGRRYARDTLLDRTY 610 +Q G + ++ I+++LS+S ++ G D++LA N+++PF D+ + Sbjct: 561 QFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPF----SHWLRSDSKSQWRH 616 Query: 611 ATFNANRNRNRDGDNSWQTGVGGTLLEGRNLSYSVTQGRS----STNSYSGSASASWQAT 666 A+ + + + + +G + GV GTLLE NLSYSV G + + +G A+ +++ Sbjct: 617 ASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGG 676 Query: 667 YGTLGVGYNYDRDQHDYNWQLSGGVVGHADGITFSQPLGDTNVLIKAPGAKGVRIENQTG 726 YG +GY++ D + +SGGV+ HA+G+T QPL DT VL+KAPGAK ++ENQTG Sbjct: 677 YGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTG 736 Query: 727 VKTDWRGYAVMPYATVYRYNRVALDTNTMDNHTDVENNVSSVVPTEGALVRAAFDTRIGV 786 V+TDWRGYAV+PYAT YR NRVALDTNT+ ++ D++N V++VVPT GA+VRA F R+G+ Sbjct: 737 VRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGI 796 Query: 787 RAIITARLGGRPLPFGAIVRETASGITSMVGDDGQIYLSGLPLKGELFIQWGEGKNARCI 846 + ++T +PLPFGA+V +S + +V D+GQ+YLSG+PL G++ ++WGE +NA C+ Sbjct: 797 KLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCV 856 Query: 847 APYALAEDSLKQAITIASATC 867 A Y L +S +Q +T SA C Sbjct: 857 ANYQLPPESQQQLLTQLSAEC 877
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 69.1 bits (169), Expect = 9e-16 Identities = 29/122 (23%), Positives = 59/122 (48%), Gaps = 2/122 (1%) Query: 32 MKPASVIIMDEHPIVRMSIEVLLGKNSNIQVILKTDDSRTAIKYLRTYPVDLVILDIELP 91 M A++++ D+ +R + L + V T ++ T +++ DLV+ D+ +P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRA-GYDV-RITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 92 GTDGFTLLKRIKSIQEHTRILFLSSKSEAFYAGRAIRAGANGFVSKRKDLNDIYNAVKMI 151 + F LL RIK + +L +S+++ A +A GA ++ K DL ++ + Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 152 LS 153 L+ Sbjct: 119 LA 120
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 369 bits (948), Expect = e-133 Identities = 231/234 (98%), Positives = 233/234 (99%) Query: 1 MLTSHFPLPFAGHRLHIVDFDASSFHEHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA 60 MLTSHFPLPFAGHRLHIVDFDASSF EHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA Sbjct: 1 MLTSHFPLPFAGHRLHIVDFDASSFREHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA 60 Query: 61 LREMGVRTVPGIGDKRQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATEL 120 LRE+GVRTVPG+GDKRQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATEL Sbjct: 61 LREVGVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATEL 120 Query: 121 APSIIDSDERQILQASLLPFPLALTLAFSAKESVYKAFSDRVTLPGFNSAKVTSLTATHI 180 APSIIDSDERQILQASLLPFPLALTLAFSAKESVYKAFSDRVTLPGFNSAKVTSLTATHI Sbjct: 121 APSIIDSDERQILQASLLPFPLALTLAFSAKESVYKAFSDRVTLPGFNSAKVTSLTATHI 180 Query: 181 SLHLLPAFAATMAERTVRTEWFQRDNSVITLVSAITRVPHDRSAPASILSAIPR 234 SLHLLPAFAATMAERTVRTEWFQRDNSVITLVSAITRVPHDRSAPASILSAIPR Sbjct: 181 SLHLLPAFAATMAERTVRTEWFQRDNSVITLVSAITRVPHDRSAPASILSAIPR 234
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 60.0 bits (145), Expect = 2e-12 Identities = 47/210 (22%), Positives = 82/210 (39%), Gaps = 21/210 (10%) Query: 105 EPNAETVAAQMPDLILISATGGDSALALYDQLSAIAPTLVINYDDKS-----WQSLLTQL 159 EPN E + P ++ SA G S + L+ IAP N+ D + LT++ Sbjct: 86 EPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPLAMARKSLTEM 141 Query: 160 GEITGQEKQAAARIAEFEAQLTTVKQRIALPPQPVSALVYTPAAHSANLWTPESAQGKLL 219 ++ + A +A++E + ++K R L ++ P S ++L Sbjct: 142 ADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEIL 201 Query: 220 TQLGFTLATLPRGLQTSKSQGKRHDIIQLGGENLAAGLNGESLFLFAGDNKDVAALYANP 279 + G A + + + + LAA + + L ++KD+ AL A P Sbjct: 202 DEYGIPNAW--------QGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATP 253 Query: 280 LLAHLPAVQNKRVHALGTETFRLDYYSATL 309 L +P V+ R + F Y ATL Sbjct: 254 LWQAMPFVRAGRFQRVPAVWF----YGATL 279
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 424 bits (1090), Expect = e-153 Identities = 148/299 (49%), Positives = 192/299 (64%), Gaps = 18/299 (6%) Query: 1 MAIPKLQSYALPTALDIPTNKVNWAFEPERAALLIHDMQDYFVSFWGRNCPMMDQVIANI 60 MAIP +Q Y +PTA D+P NKV+W +P RA LLIHDMQ+YFV + + ++ ANI Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60 Query: 61 AALRQYCKEHHIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVEALTPDEADTV 120 L+ C + IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P++ D V Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120 Query: 121 LVKWRYSAFHRSPLEQMLKDIGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180 L KWRYSAF R+ L +M++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180 Query: 181 FSREEHLMALNYVAGRSGRVVMTESLL------PTPVPASKAE-----------LRALIL 223 FS E+H MAL Y AGR VMT+SLL P V + A +R I Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240 Query: 224 PLLDETDEPLD-DENLIDYGLDSVRMMGLAARWRKVHGDIDFVMLAKNPTIDAWWALLS 281 LL ET E + E+L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W LL+ Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 338 bits (867), Expect = e-120 Identities = 104/257 (40%), Positives = 148/257 (57%), Gaps = 20/257 (7%) Query: 9 KTVWVTGAGKGIGYATALAFVDAGARVIGFDRE---------------FTQESYPFATEV 53 K ++TGA +GIG A A GA + D E++P Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP----- 63 Query: 54 MDVADAGQVAQVCQRVLQKTPRLDVLVNAAGILRMGATDALSVDDWQQTFAVNVGGAFNL 113 DV D+ + ++ R+ ++ +D+LVN AG+LR G +LS ++W+ TF+VN G FN Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 114 FSQTMAQFRRQQGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALTVGLELAGCGVRCN 173 ++ G+IVTV S+ A PR M+AY +SKAA +GLELA +RCN Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 174 VVSPGSTDTDMQRTLWVSEDAEQQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDLA 233 +VSPGST+TDMQ +LW E+ +Q I+G E FK GIPL K+A+P +IA+ +LFL S A Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243 Query: 234 SHITLQDIVVDGGSTLG 250 HIT+ ++ VDGG+TLG Sbjct: 244 GHITMHNLCVDGGATLG 260
>STREPTOPAIN#Streptopain (C10) cysteine protease family signature. Length = 398 Score = 31.2 bits (70), Expect = 0.011 Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 1/73 (1%) Query: 2 LDTNMKTQLRAYLEKLTKPVELIATLDDS-AKSAEIKELLAEIAELSDKVTFKEDNTLPV 60 D N K + +++E + ++ LD + A +AEIK+ + + S + + + N + Sbjct: 109 FDANGKENIASFMESYVEQIKENKKLDTTYAGTAEIKQPVVKSLLDSKGIHYNQGNPYNL 168 Query: 61 RKPSFLITNPGSQ 73 P PG Q Sbjct: 169 LTPVIEKVKPGEQ 181
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 91.4 bits (227), Expect = 1e-23 Identities = 37/123 (30%), Positives = 58/123 (47%), Gaps = 1/123 (0%) Query: 2 TNVLIVEDEQAIRRFLRAALEGDGLRVYEAETLQRGLLEAATRKPDLIILDLGLPDGDGI 61 +L+ +D+ AIR L AL G V A DL++ D+ +PD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 DFIRDLRQWSA-IPVIVLSARSEESDKIAALDAGADDYLSKPFGIGELQARLRVALRRHA 120 D + +++ +PV+V+SA++ I A + GA DYL KPF + EL + AL Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 121 ASP 123 P Sbjct: 124 RRP 126
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 30.0 bits (67), Expect = 0.010 Identities = 15/87 (17%), Positives = 26/87 (29%), Gaps = 8/87 (9%) Query: 35 LTLVSSANIACGFHAGDAQTMLT---CVREALKNGVAIGAHPSFPDRDNFG----RTAMV 87 + + IA G G T+LT V + A+ A PS ++DN+ + Sbjct: 95 VEAPTGTFIASGVVVGK-DTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQI 153 Query: 88 LPPETVYAQTLYQIGALGAIVQAQGGV 114 + + V Sbjct: 154 TKYSGEGDLAIVKFSPNEQNKHIGEVV 180
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 31.0 bits (70), Expect = 0.004 Identities = 12/41 (29%), Positives = 23/41 (56%), Gaps = 1/41 (2%) Query: 15 VDNAPRMQDYTLEGEEGRDM-MLLDALIQLKEKDPSLSFRR 54 ++N + T+E + + MLLDAL+++ + DP L + Sbjct: 339 IENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 60.8 bits (147), Expect = 5e-12 Identities = 28/194 (14%), Positives = 59/194 (30%), Gaps = 5/194 (2%) Query: 64 YNRQQDQQASARRAEEERKKLQQQQAEELQQKQAAEQERLKQLEKERLAAQEQQKQAEEA 123 YN + +++ Q E R+ + A + E Sbjct: 981 YNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETV 1040 Query: 124 AKLAQQQQQAEEAAKAAADAKKKAEAEAAKAAADAKKKAEAEAVKAAADAKKKAEAEAAK 183 A+ ++Q+ + E + A E AK A K+ +A + + Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEA-----KSNVKANTQTNEVAQSGSETKET 1095 Query: 184 AAADAKKKAEAEAAKAAAEAKKKAEAEAAKAAAEAKKKADAEAAKAAAEAKKKADAAAAK 243 + K+ A E + A +K + + + K+ +E + AE ++ D Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155 Query: 244 AAADAKKKAAAAEG 257 ++ A Sbjct: 1156 KEPQSQTNTTADTE 1169 Score = 59.3 bits (143), Expect = 1e-11 Identities = 24/177 (13%), Positives = 58/177 (32%), Gaps = 2/177 (1%) Query: 61 VQQYNRQQDQQASARRAEEERKKLQQQQAEELQQKQAAEQERLKQLEKERLAAQEQQKQA 120 + N Q S EE ++ + +E ++ + ++ + Sbjct: 997 ITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQ 1056 Query: 121 EEAAKLAQQQQQAEEAAKAAADAKKKAEAEAAKAAADAKKKAEAEAVKAAADAKKKAEAE 180 + AQ ++ A+EA + E + + + E + A + ++KA+ E Sbjct: 1057 DATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE-TATVEKEEKAKVE 1115 Query: 181 AAKAAADAKKKAEAEAAKAAAEA-KKKAEAEAAKAAAEAKKKADAEAAKAAAEAKKK 236 K K ++ + +E + +AE K+ ++ A + Sbjct: 1116 TEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPA 1172 Score = 52.0 bits (124), Expect = 2e-09 Identities = 27/221 (12%), Positives = 75/221 (33%), Gaps = 20/221 (9%) Query: 66 RQQDQQASARRAEEERKKLQQQQAEE--LQQKQAAEQER------LKQLEKERLAAQEQQ 117 + A + E + + +Q A E Q ++ A++ + + E + ++ ++ Sbjct: 1035 ETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094 Query: 118 KQ-----------AEEAAKLAQQQQQAEEAAKAAADAKKKAEAEAAKAAADAKKKAEAEA 166 Q EE AK+ ++ Q E + + K+ ++E + A+ ++ + Sbjct: 1095 TQTTETKETATVEKEEKAKVETEKTQ-EVPKVTSQVSPKQEQSETVQPQAEPARENDPTV 1153 Query: 167 VKAAADAKKKAEAEAAKAAADAKKKAEAEAAKAAAEAKKKAEAEAAKAAAEAKKKADAEA 226 ++ A+ + A + E ++ + E + A + + Sbjct: 1154 NIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNS 1213 Query: 227 AKAAAEAKKKADAAAAKAAADAKKKAAAAEGVDDLLGDLSS 267 + + + + ++ + L DL+S Sbjct: 1214 ESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTS 1254 Score = 48.9 bits (116), Expect = 3e-08 Identities = 23/191 (12%), Positives = 57/191 (29%) Query: 65 NRQQDQQASARRAEEERKKLQQQQAEELQQKQAAEQERLKQLEKERLAAQEQQKQAEEAA 124 NR+ ++A + + Q E ++ Q E + +EKE A E +K E Sbjct: 1065 NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK 1124 Query: 125 KLAQQQQQAEEAAKAAADAKKKAEAEAAKAAADAKKKAEAEAVKAAADAKKKAEAEAAKA 184 +Q + E++ A+ E + + + + A + + E Sbjct: 1125 VTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVT 1184 Query: 185 AADAKKKAEAEAAKAAAEAKKKAEAEAAKAAAEAKKKADAEAAKAAAEAKKKADAAAAKA 244 + + + ++ K + ++ + A ++ Sbjct: 1185 ESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR 1244 Query: 245 AADAKKKAAAA 255 + A + Sbjct: 1245 STVALCDLTST 1255 Score = 42.4 bits (99), Expect = 3e-06 Identities = 23/198 (11%), Positives = 55/198 (27%), Gaps = 1/198 (0%) Query: 59 AVVQQYNRQQDQQASARRAEEERKKLQQQQAEELQQKQAAEQERLKQLEKERLAAQEQQK 118 A Q Q + E K+ + EE + + + + + ++ + QEQ + Sbjct: 1078 ANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSE 1137 Query: 119 QAEEAAKLAQQQQQAEEAAKAAADAKKKAEAEAAKAAADAKKKAEAEAVKAAADAKKKAE 178 + A+ A++ + + A+ E + + E Sbjct: 1138 TVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVE 1197 Query: 179 AEAAKAAADAKKKAEAEAAKAAAEAKKKA-EAEAAKAAAEAKKKADAEAAKAAAEAKKKA 237 A + +E++ +++ + D Sbjct: 1198 NPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNT 1257 Query: 238 DAAAAKAAADAKKKAAAA 255 +A + A A A+ A Sbjct: 1258 NAVLSDARAKAQFVALNV 1275
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 115 bits (289), Expect = 2e-33 Identities = 36/119 (30%), Positives = 55/119 (46%), Gaps = 4/119 (3%) Query: 56 EEQARLQMQQLQQNNIVYFDLDKYDIRSDFAAMLDAHANFLRSN--PSYKVTVEGHADER 113 +Q + + V F+ +K ++ + A LD + L + V V G+ D Sbjct: 205 APAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI 264 Query: 114 GTPEYNISLGERRANAVKMYLQGKGVSADQISIVSYGKEKPAVLGHDEAAYAKNRRAVL 172 G+ YN L ERRA +V YL KG+ AD+IS G+ P V G+ K R A++ Sbjct: 265 GSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP-VTGN-TCDNVKQRAALI 321
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.0 bits (65), Expect = 0.023 Identities = 2/39 (5%), Positives = 19/39 (48%) Query: 56 LTQLQQQLSDNQSDIDSLRGQIQENQYQLNQVMERQKQI 94 + + + + + +++ + Q+++ + ++ E + + Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV 292
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 30.4 bits (68), Expect = 0.014 Identities = 17/55 (30%), Positives = 24/55 (43%), Gaps = 5/55 (9%) Query: 230 VLQTAKALGIPVKGHVEQLSLLGGAQLVSRYQGLSADHIEYLDEAGVAAMRDGGT 284 VL+ +P G +S+LG ++L L HI AGVAAM+ Sbjct: 202 VLRDTNVTAVPASGAPAAVSVLGASELT-----LDGGHITGGRAAGVAAMQGAVV 251
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 46.7 bits (111), Expect = 5e-08 Identities = 49/207 (23%), Positives = 78/207 (37%), Gaps = 25/207 (12%) Query: 1 MTQYASSLRSLAAGSVLLFLFASPVKAEEQTIAPPGVDAR-AWILMDYASGKVLAEGNAD 59 M + SL A ++ L + ASP E+ ++ + R I MD ASG+ L AD Sbjct: 1 MRYIRLCIISLLA-TLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRAD 59 Query: 60 EKLDPASLTKIMTSYVVGQALKAGKIKLTDMVTVGKDAWATGNPALRGSSVMFLKPGDQV 119 E+ S K++ V + AG +L + + +P V D + Sbjct: 60 ERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSP------VSEKHLADGM 113 Query: 120 SVADLNKGIIIQSGNDACIALADYVAGSQESFIGLMNAYAKRLGLTNTT---FQTVHGLD 176 +V +L I S N A L V G + A+ +++G T ++T Sbjct: 114 TVGELCAAAITMSDNSAANLLLATVGGPAG-----LTAFLRQIGDNVTRLDRWETELNEA 168 Query: 177 APGQF---STARDMA------LLGKAL 194 PG +T MA L + L Sbjct: 169 LPGDARDTTTPASMAATLRKLLTSQRL 195
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 42.2 bits (99), Expect = 3e-06 Identities = 65/356 (18%), Positives = 126/356 (35%), Gaps = 51/356 (14%) Query: 48 QAGLDWVPTSMTAYLAGGMFLQWLLGPLSDRIGRRPVMLAGVVWFIVTCLATLLAKNIEQ 107 A +WV T+ + G + G LSD++G + ++L G++ + + + Sbjct: 48 PASTNWVNTAFMLTFSIGTAV---YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFS 104 Query: 108 FT-FLRFLQGISLCFIGAVGYAAIQESFEEAVCIKITALMANVALISPLLGPLVGAAWVH 166 RF+QG A+ + + K L+ ++ + +GP +G H Sbjct: 105 LLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAH 164 Query: 167 VLPWEGMFILFAALAAIAFFGLQRAMPETATRRGE------------------------- 201 + W +L + I L + + + +G Sbjct: 165 YIHW-SYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSI 223 Query: 202 ------TLSFKALGRDYRLV---------IKNRRFVAGALALGFVSLPLLAWIAQSPIII 246 LSF + R V KN F+ G L G + + +++ P ++ Sbjct: 224 SFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMM 283 Query: 247 ISGEQLSSYEYG-LLQVPVFGALIAGNLVLARLTSRRTVRSLIVMGGWPIVAGLIITAAA 305 QLS+ E G ++ P ++I + L RR ++ +G + + + Sbjct: 284 KDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTAS-- 341 Query: 306 TVVSSHAYLWMTAGLSVYAFGIGLANAGLVRLTLFSSDMSKGTVSAAMGMLQMLIF 361 + +MT + V+ G GL+ V T+ SS + + A M +L F Sbjct: 342 -FLLETTSWFMTIII-VFVLG-GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSF 394
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.3 bits (76), Expect = 0.002 Identities = 33/150 (22%), Positives = 65/150 (43%), Gaps = 6/150 (4%) Query: 218 LLIGVVVLAMAFAEGSANDWL-PLLMVDGHGFSP-TSGSLIYAGFTLGMTVGRFTGGWFI 275 +IGV+ + F + + P +M D H S GS+I T+ + + + GG + Sbjct: 258 FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILV 317 Query: 276 DRYSRVTVVR-ASALM--GALGIGLIIFVDSDWVA-GVSVILWGLGASLGFPLTISAASD 331 DR + V+ + L ++ S ++ + +L GL + TI ++S Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSL 377 Query: 332 TGPDAPTRVSVVATTGYLAFLVGPPLLGYL 361 +A +S++ T +L+ G ++G L Sbjct: 378 KQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 47.3 bits (112), Expect = 6e-09 Identities = 17/80 (21%), Positives = 33/80 (41%) Query: 7 RRANDPKRREKIIQATLEAVKTYGVHAVTHRKIAAIAQVPLGSMTYYFAGMDALLSEAFT 66 + + R+ I+ L GV + + +IA A V G++ ++F L SE + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64 Query: 67 LFTENMSRQYQDFFAQVTDA 86 L N+ ++ A+ Sbjct: 65 LSESNIGELELEYQAKFPGD 84
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.007 Identities = 16/50 (32%), Positives = 22/50 (44%), Gaps = 1/50 (2%) Query: 31 LVLLGPSGAGKSSLLRVLNLLEMPRSGTLTIAGNHFDFTKTPSDKAIREL 80 +VL G G GKS+L+ L L+ S T G D + + EL Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDF-FSDTHFDIGTGKDSYEQIAGIVAYEL 647
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 63.6 bits (155), Expect = 2e-13 Identities = 68/370 (18%), Positives = 122/370 (32%), Gaps = 71/370 (19%) Query: 1 MKVLVTGATSGLGRNAVEFLRNKGISVRA---------TGRNEAMGKLLEKMGAEFVHAD 51 MK LVTGA +G + + L G V +A +LL + G +F D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 52 LTELVSSQAKVMLAGIDTLWHCS-------SFTSPWGTQQAFDLANVRATRRLGEWAVAW 104 L + + ++ S +P A+ +N+ + E Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116 Query: 105 GVRNFIHISSPSLYFDYHHHRDIKEDFRPHRFANEFARSKAAGEEVINLLAQANPQT--- 161 +++ ++ SS S+Y + D + +A +K A E L+A Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANE----LMAHTYSHLYGL 171 Query: 162 RFTVLRPQSLFGPHDK--VFIPRLAHMMHHYGSVLLPHGGSALVDMTYYENAIHAMWLAS 219 T LR +++GP + + + + M S+ + + G D TY ++ A+ Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231 Query: 220 QPGCDHLPS--------------GRAYNITNGENRTLRSIVQKLIDELTIDCRIRSVPYP 265 R YNI N L +Q L D L I+ + +P Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ 291 Query: 266 MLDMIARSMERFGKKSAKEPPLTHYGVSKLNFDFTLNTTRAQDELGYQPIITLDEGIERT 325 D+ T +T + +G+ P T+ +G++ Sbjct: 292 PGDV----------------LETS-----------ADTKALYEVIGFTPETTVKDGVKNF 324 Query: 326 AAWLRDHGNL 335 W RD + Sbjct: 325 VNWYRDFYKV 334
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 55.2 bits (133), Expect = 2e-10 Identities = 30/125 (24%), Positives = 49/125 (39%), Gaps = 17/125 (13%) Query: 4 RILVLGASGYIGQHLVFALSQQGHQVRA---------AARRIERLEKQRLANVSCHKVDL 54 + LV GA+G+IG H+ L + GHQV + + RLE HK+DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 55 HWPENLPALLRD--IDTVYYLVH------GMGEGGDFIAHERQAALNVRDALRQTPVKQL 106 E + L + V+ H + + LN+ + R ++ L Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121 Query: 107 IFLSS 111 ++ SS Sbjct: 122 LYASS 126
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 52.8 bits (126), Expect = 1e-08 Identities = 45/279 (16%), Positives = 89/279 (31%), Gaps = 42/279 (15%) Query: 602 PQLPRPNRVR-----VPTRRELASYGIKLPSQRIAE-------EKAREAERNQYETGAQL 649 + PN ++ VP+ E + + P A E E + + +T + Sbjct: 995 TNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN 1054 Query: 650 TDEEIDAMHQDELARQFAQSQQHRYGETYQHDTQQAEDDDTAAEAELARQFAASQQQRYS 709 + + Q R+ A+ + + +TQ E + +E + + + Sbjct: 1055 EQDATETTAQ---NREVAKEAK----SNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107 Query: 710 GEQPAGAQPFSLDDLDFSPMKVLVDEGPHEPLFTPGVMPESTPVQQPVAPQPQYQQPQQP 769 E+ A KV ++ P T V P+ +Q QPQ +P + Sbjct: 1108 KEEKA---------------KVETEKTQEVPKVTSQVSPKQ---EQSETVQPQ-AEPARE 1148 Query: 770 VAPQPQYQQPQQPVASQPQYQQPQQPVAPQ-PQYQQPQQPVAPQPQYQQPQQPVAPQPQY 828 P ++PQ + +QP + + Q V + P P Sbjct: 1149 NDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSV--VENPENTTPAT 1206 Query: 829 QQPQQPVALQPQYQ-QPQQPVAPQPQYQQPQQPTAPQDS 866 QP + + + ++ V P +P ++ S Sbjct: 1207 TQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRS 1245 Score = 43.9 bits (103), Expect = 5e-06 Identities = 31/175 (17%), Positives = 55/175 (31%), Gaps = 17/175 (9%) Query: 405 QPQEAQSAPWQQPVPVASAPQYAATPATAAEYDSLAPQETQPQWQAPDAEQHWQPEPTHQ 464 P+ +Q PQ A PA + + D EQ + ++ Sbjct: 1122 VPKVTSQVSPKQEQSETVQPQ--AEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179 Query: 465 PEPVYQPEPIAAEPSHMPPPVIEQPVATEPEPNTEETRPARPPLYYFEEVEEKRAREREQ 524 +PV + + S + P P T+P N+E + + + + R Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESS----------NKPKNRHRRSVRS 1229 Query: 525 LAAWYQPIPEPVKENVPVKSTVSVAPSIPPVEAVAAAASLDAGIKSGALAAGAAA 579 + EP + +STV++ A + A + AL G A Sbjct: 1230 VPH----NVEPATTSSNDRSTVALCDLT-STNTNAVLSDARAKAQFVALNVGKAV 1279 Score = 37.7 bits (87), Expect = 4e-04 Identities = 22/188 (11%), Positives = 47/188 (25%), Gaps = 29/188 (15%) Query: 748 PESTPVQQPVAPQPQYQ-------QPQQPVAPQPQYQQPQQPVASQPQYQQPQQPVAPQP 800 + PV P P Q+ + Q + A + + + Sbjct: 1020 VDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKAN 1079 Query: 801 QYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVALQPQYQQPQ----------QPVAP 850 + + Q + ++ + V + + P+ Q Sbjct: 1080 TQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETV 1139 Query: 851 QPQYQ------------QPQQPTAPQDSLIHPLLMRNGDSRPLQRPTTPLPSLDLLTPPP 898 QPQ + +PQ T P + + +T + + + + P Sbjct: 1140 QPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP 1199 Query: 899 SEVEPVDT 906 P T Sbjct: 1200 ENTTPATT 1207 Score = 35.4 bits (81), Expect = 0.002 Identities = 20/173 (11%), Positives = 46/173 (26%), Gaps = 10/173 (5%) Query: 754 QQPVAPQPQYQQPQQPVAPQPQYQQPQQPVASQPQYQ---QPQQPVAPQPQYQQPQQPVA 810 Q Q ++ + + VA Q + ++ + V Sbjct: 1056 QDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVE 1115 Query: 811 PQPQYQQPQQPVAPQPQYQQPQQPVALQPQYQQPQQPV----APQPQYQQPQQPTAPQDS 866 + + P+ P+ +Q + +QPQ + ++ +PQ Q Q + Sbjct: 1116 TEKTQEVPKVTSQVSPKQEQSE---TVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPA 1172 Query: 867 LIHPLLMRNGDSRPLQRPTTPLPSLDLLTPPPSEVEPVDTFALEQMARLVEAR 919 + + T + P+ +P + R Sbjct: 1173 KETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRR 1225
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.1 bits (78), Expect = 8e-04 Identities = 39/158 (24%), Positives = 62/158 (39%), Gaps = 6/158 (3%) Query: 8 VMLLLCGLLLLT-LAIAVLNTLVPLWLAQANLPTWQVGMVSSSYFTGNLVGTLFTGYLIK 66 +++ LC L + L VLN +P N P V++++ +GT G L Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 67 RIGFNRSYYLASLIFAAGCVGLGVMVGFWSWMSW-RFIAGIGCAMIWVVVESALMCSGTS 125 ++G R +I G V V F+S + RFI G G A +V + Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 126 HNRGRLLAAYMMVYYMGTFLGQLLVSKVSGELLHVLPW 163 NRG+ + MG +G + G + H + W Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPA----IGGMIAHYIHW 168
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 474 bits (1221), Expect = e-170 Identities = 214/389 (55%), Positives = 264/389 (67%), Gaps = 31/389 (7%) Query: 2 MKRKILAAVIPALLAAATANANAAEIYNKDGNKLDLYGKAVGRHVWTTTGDSKNADQTYA 61 MKRK+LA VIPALLAA A+A AEIYNKDGNKLDLYGK G H + + SK+ DQTY Sbjct: 1 MKRKVLALVIPALLAAGAAHA--AEIYNKDGNKLDLYGKVDGLH-YFSDDSSKDGDQTYM 57 Query: 62 QIGFKGETQINTDLTGFGQWEYRTKADRAEGEQQNSNLVRLAFAGLKYAEVGSIDYGRNY 121 ++GFKGETQIN LTG+GQWEY +A+ EGE NS RLAFAGLK+ + GS DYGRNY Sbjct: 58 RVGFKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNY 116 Query: 122 GIVYDVESYTDMAPYFSGETWGGAYTDNYMTSRAGGLLTYRNSDFFGLVDGLSFGIQYQG 181 G++YDVE +TDM P F G+++ Y DNYMT RA G+ TYRN+DFFGLVDGL+F +QYQG Sbjct: 117 GVLYDVEGWTDMLPEFGGDSY--TYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQG 174 Query: 182 KNQDNHS---------------INSQNGDGVGYTMAYEFD-GFGVTAAYSNSKRTNDQQD 225 KN+ + I NGDG G + Y+ GF AAY+ S RTN+Q + Sbjct: 175 KNESQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVN 234 Query: 226 RDG---NGDRAESWAVGAKYDANNVYLAAVYAETRNMSIVENTVTD-TVEMANKTQNLEV 281 G GD+A++W G KYDANN+YLA +Y+ETRNM+ T +ANKTQN EV Sbjct: 235 AGGTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEV 294 Query: 282 VAQYQFDFGLRPAISYVQSKGKQLNGAD---GSADLAKYIQAGATYYFNKNMNVWVDYRF 338 AQYQFDFGLRPA+S++ SKGK L + DL KY GATYYFNKN + +VDY+ Sbjct: 295 TAQYQFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKI 354 Query: 339 NLLDEND--YSSSYVGTDDQAAVGITYQF 365 NLLD++D Y + + TDD A+G+ YQF Sbjct: 355 NLLDDDDPFYKDAGISTDDIVALGMVYQF 383
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 28.7 bits (64), Expect = 0.023 Identities = 20/108 (18%), Positives = 35/108 (32%), Gaps = 12/108 (11%) Query: 32 NSDAERLVDALFMQLKQI-------FPAATQTNLRSDADERVAKQQWIAAFSENGIRTRK 84 + AE ++ M+L +I F L + + + R Sbjct: 640 ENSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARN 699 Query: 85 QLSAGMQKARSSQSPFWPS-----PGQFISWCREGSGALGVSVDDIMD 127 QL + +S P+ + +E + ALGVS+ DI Sbjct: 700 QLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQ 747
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 34.7 bits (79), Expect = 0.002 Identities = 43/298 (14%), Positives = 92/298 (30%), Gaps = 29/298 (9%) Query: 477 LDEKIATLQEKIARARKTPWTVSSSQTEYDQQQLNELQEQKRQKDLLDAKAQAERNYQKT 536 + + TL+ K + + E ++ N ++ ++ L KA + + Sbjct: 62 FEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEAR 121 Query: 537 QKRRNEQNAALNRDNETESLRHQREVARITAMQYADAAVRNAALERENERHKKAMARQKE 596 + + T + + A A A ALE A+ K Sbjct: 122 KADLEKALEGAMNF-STADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 180 Query: 597 KPKAYYNDEAGRLLLQYSQQQAQTEGLIAAAKLSTTEKMTEAHKQLLSFQQRIADLSGKK 656 EA + L+ + + A +AK+ T E A Sbjct: 181 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARK-----------AD 229 Query: 657 LTADEQSVLAHKDEIALALQKLDISQQDLQHQNAFNELKKKTLTLTSQLADEESRVRQQH 716 L + + + ++ L+ + L+ + A E + S + + + Sbjct: 230 LEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAE 289 Query: 717 ALALATMGMGDQQRGRYEEHLKIQQHYQEQLEQLKRDSKAKGTYGSDEYRQAEQELQA 774 AL + ++ E ++ + L+RD A R+A+++L+A Sbjct: 290 KAAL------EAEKADLEHQSQVLN---ANRQSLRRDLDAS--------REAKKQLEA 330 Score = 33.9 bits (77), Expect = 0.004 Identities = 47/248 (18%), Positives = 76/248 (30%), Gaps = 19/248 (7%) Query: 479 EKIATLQEKIARARKTPWTVSSSQTEYDQQQLNELQEQKRQKDLLDAKAQAERNYQKTQK 538 E + KT ++ + L+ AK + + Sbjct: 165 EGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALA 224 Query: 539 RRNEQNAALNRDNETESLRHQREVARITAMQYADAAVRNAALERENERHKKAMA------ 592 R S ++ + A + A R A LE+ E Sbjct: 225 ARKADLEKALEGAMNFSTADSAKIKTLEA-EKAALEARQAELEKALEGAMNFSTADSAKI 283 Query: 593 RQKEKPKAYYNDEAGRLLLQYSQQQAQTEGLIAAAKLSTTEKM-TEAHKQLLSFQQRIAD 651 + E KA E L Q A + L S K EA Q L Q +I++ Sbjct: 284 KTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISE 343 Query: 652 LSGKKLTADEQSVLAHKDEIALALQKL-------DISQQDLQHQ-NAFNELKKKTLTLTS 703 S + L D + K ++ QKL + S+Q L+ +A E KK+ + Sbjct: 344 ASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQ---VEK 400 Query: 704 QLADEESR 711 L + S+ Sbjct: 401 ALEEANSK 408
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 159 bits (404), Expect = 1e-52 Identities = 51/156 (32%), Positives = 83/156 (53%), Gaps = 8/156 (5%) Query: 1 MKKIV-VAVLVGLALGSIGVANAAGYKNTVSIGYAYTDLSGWLSGNANGANIKYNWEDLD 59 MKKI ++ L + + G + AA +TV+ GYA +D G ++ G N+KY +E+ + Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAA--TSTVTGGYAQSDAQGQMN-KMGGFNLKYRYEEDN 57 Query: 60 SGFGAMGSVTYTSADVNNYGYKVGDADYTSLLVGPSYRFNDYLNAYVMIGAANGHIKDN- 118 S G +GS TYT Y + GP+YR ND+ + Y ++G G + Sbjct: 58 SPLGVIGSFTYTEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQTTE 117 Query: 119 ---WGNSDNKTAFAYGAGIQLNPVENIAVNASYEHT 151 + + + F+YGAG+Q NP+EN+A++ SYE + Sbjct: 118 YPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQS 153
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 33.5 bits (76), Expect = 0.004 Identities = 58/240 (24%), Positives = 86/240 (35%), Gaps = 18/240 (7%) Query: 123 NATAAGQASEQAQTSAGQASES-----ATAAVNAAGAAEASATQAASSAASAESSAGTA- 176 N T G S G SES ATA + A + A QAA + A+AE+ A Sbjct: 26 NGTPDGSGSGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKA 85 Query: 177 -----TTKAGEASASAASADTARTAAAASAAAAKTSEANADASRTA---AGDSAAAAAAS 228 T + + A + +RT +A A A + A+ R A + A A + Sbjct: 86 NRDALTQRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEA 145 Query: 229 ATAAQTSAERAGASETAAKTSETQAASSAGDAGASATAAAASEKAAAASAAAAKTSETNA 288 A A AE+ E + +ET+ +A AA SE+A A A K S + Sbjct: 146 AEKAFQEAEQR-RKEIEREKAETERQLKLAEA-EEKRLAALSEEAKAVEIAQKKLSAAQS 203 Query: 289 ATSASTAAASATAASSSASEASTHAAASDTSASLA--AQSSTAAGAAATRAEDAAKRAED 346 + S+S + A + AQ+S + + RA D Sbjct: 204 EVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRAND 263
>PF07824#Type III secretion chaperone Length = 120 Score = 165 bits (419), Expect = 1e-56 Identities = 33/114 (28%), Positives = 63/114 (55%), Gaps = 1/114 (0%) Query: 1 MESLLNRLYDALGLDAPE-DEPLLIIDDGIQVYFNESDHTLEMCCPFMPLPDDILTLQHF 59 ME L + + ALG+ + + D+ +++DD + +Y + ++ + CPF LP++I L + Sbjct: 1 MEDLADVICRALGIPSIDTDDQAIMLDDDVLIYIEKEGDSINLLCPFCALPENINDLIYA 60 Query: 60 LRLNYTSAVTIGADADNTALVALYRLPQTSTEEEALTGFELFISNVKQLKEHYA 113 L LNY+ + + D + +L+A L + E+ E +IS V+ LK+ +A Sbjct: 61 LSLNYSEKICLATDDEGGSLIARLDLTGINEFEDIYVNTEYYISRVRWLKDEFA 114
>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature. Length = 538 Score = 665 bits (1717), Expect = 0.0 Identities = 186/396 (46%), Positives = 254/396 (64%), Gaps = 5/396 (1%) Query: 166 LNNQPWQTIKNTLTHNGHHYTNTQLPAAEMKIGAKDIFPSAYEGKGVCSWDTKNIHHANN 225 LNN+ W + ++H+G +Y PA+ MKIG K+IF Y GKG+C T+ H N Sbjct: 146 LNNKNWGPVNKNISHHGKNYGFQLTPASHMKIGNKNIFVKEYNGKGICCASTRESDHIAN 205 Query: 226 LWMSTVSVHEDGKDKTLFCGIRHGVLSPYH-EKDPLLRHVGAENKAKEVLTAALFSKPEL 284 +W+S V V ++GK+ +F GIRHGV+S Y +K+ R V A NKA+E+++AAL+S+PEL Sbjct: 206 MWLSKV-VDDEGKE--IFSGIRHGVISAYGLKKNSSERAVAARNKAEELVSAALYSRPEL 262 Query: 285 LNKALAGEAVSLKLVSVGLLTASNIFGKEGTMVEDQMRAWQSL-TQPGKMIHLKIRNKDG 343 L++AL+G+ V LK+VS LLT +++ G E +M++DQ+ A + L ++ G+ L IRN DG Sbjct: 263 LSQALSGKTVDLKIVSTSLLTPTSLTGGEESMLKDQVNALKGLNSKRGEPTKLLIRNSDG 322 Query: 344 DLQTVKIKPDVAAFNVGVNELALKLGFGLKASDSYNAEALHQLLGNDLRPEARPGGWVGE 403 L+ V + V FN GVNELALK+G G + D N E++ LLG++ GGW E Sbjct: 323 LLKEVSVNLKVVTFNFGVNELALKMGLGWRNVDKLNDESICSLLGDNFLKNGVIGGWAAE 382 Query: 404 WLAQYPDNYEVVNTLARQIKDIWKNNQHHKDGGEPYKLAQRLAMLAHEIDAVPAWNCKSG 463 + + P V LA QIK+I D GEPYKL+QR+ +LA+ I AVP WNCKSG Sbjct: 383 AIEKNPPCKNDVIYLANQIKEIINKKLQKNDNGEPYKLSQRMTLLAYTIGAVPCWNCKSG 442 Query: 464 KDRTGMMDSEIKREIISLHQTHMLSAPGSLPDSGGQKIFQKVLLNSGNLEIQKQNTGGAG 523 KDRTGM D+EIKREII H+T S S S +++F +L+NSGN+EIQ+ NTG G Sbjct: 443 KDRTGMQDAEIKREIIRKHETGQFSQLNSKLSSEEKRLFSTILMNSGNMEIQEMNTGVPG 502 Query: 524 NKVMKNLSPEVLNLSYQKRVGDENIWQSVKGISSLI 559 NKVMK L L LSY +R+GD IW VKG SS + Sbjct: 503 NKVMKKLPLSSLELSYSERIGDSKIWNMVKGYSSFV 538
>PF06580#Sensor histidine kinase Length = 349 Score = 33.7 bits (77), Expect = 0.001 Identities = 18/102 (17%), Positives = 38/102 (37%), Gaps = 15/102 (14%) Query: 348 ILLQRVLSNLLTNAIRYSDENAVIRIESAYDDNVAEIRVANPGSHPADADKLFRRFWRGD 407 +L+Q ++ N + + I + I ++ D+ + V N GS K Sbjct: 258 MLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE-------- 309 Query: 408 NARHTAGFGLGLSLVNA-IALLHGGSASYRYADEHNIFSVRL 448 G GL V + +L+G A + +++ + + Sbjct: 310 ------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.5 bits (196), Expect = 2e-19 Identities = 29/117 (24%), Positives = 55/117 (47%), Gaps = 1/117 (0%) Query: 2 KILLIEDNQKTIEWVRQGLTEAGYVVDYACDGRDGLHLALQEHYSLIILDIMLPGLDGWQ 61 IL+ +D+ + Q L+ AGY V + L++ D+++P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VLRALRTAHQS-PAICLTARDSVEDRVKGLEAGANDYLVKPFSFAELLARVRAQLRQ 117 +L ++ A P + ++A+++ +K E GA DYL KPF EL+ + L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 62.3 bits (151), Expect = 4e-14 Identities = 30/158 (18%), Positives = 58/158 (36%), Gaps = 8/158 (5%) Query: 20 RQLILTAALAVFSQYGIHGARLEQVAERAGVSKTNLLYYYPSKEALYVAVMRQILDVWLA 79 RQ IL AL +FSQ G+ L ++A+ AGV++ + +++ K L+ + Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72 Query: 80 PLKAFRAEF--SPLEAIKEYIRLKLEVSRDYPQASRLF-CMEMLAGAPLLMEELTGDLKA 136 ++A+F PL ++E + LE + + L + M + + Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRN 132 Query: 137 LIDEKSALIAGWVHSG-----KLAPVSPHHLIFMIWAA 169 L E I + A + ++ Sbjct: 133 LCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGY 170
>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature. Length = 130 Score = 28.9 bits (64), Expect = 0.010 Identities = 16/34 (47%), Positives = 20/34 (58%), Gaps = 2/34 (5%) Query: 44 LKNQDPTNPLQNNELTTQLAQISTVSGIEKLNTT 77 L N+ P N L NN L TQL + V G E+L T+ Sbjct: 89 LWNRQPLNSLDNNSLYTQLEML--VQGAERLQTS 120
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 41.1 bits (96), Expect = 7e-06 Identities = 17/48 (35%), Positives = 29/48 (60%) Query: 356 LTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 403 L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546 Score = 37.6 bits (87), Expect = 9e-05 Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 4/60 (6%) Query: 2 SFSQAVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57 + A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+ Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 43.8 bits (103), Expect = 4e-07 Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%) Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62 S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47 Query: 63 PSGLQIGTGVRPVATERLHSQ 83 +G +G GV +R + Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68 Score = 41.1 bits (96), Expect = 3e-06 Identities = 11/41 (26%), Positives = 21/41 (51%) Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMLQKLTQL 260 S VN+ EE N+ + Q+ Y N++ + T + + L + Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 355 bits (911), Expect = e-128 Identities = 211/232 (90%), Positives = 223/232 (96%) Query: 4 MQKYALHAYPVMALMVATLTGCAWIPAKPLVQGATTAQPIPGPVPVANGSIFQSAQPINY 63 MQK A H Y + +L+V +LTGCAWIP+ PLVQGAT+AQP+PGP PVANGSIFQSAQPINY Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60 Query: 64 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTSFGFDTVPRYLQGLFGNS 123 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKT+FGFDTVPRYLQGLFGN+ Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120 Query: 124 RADMEASGGNSFNGKGGANASNTFSGTLTVTVDQVLANGNLHVVGEKQIAINQGTEFIRF 183 RAD+EASGGN+FNGKGGANASNTFSGTLTVTVDQVL NGNLHVVGEKQIAINQGTEFIRF Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180 Query: 184 SGVVNPRTISGSNSVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 235 SGVVNPRTISGSN+VPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 430 bits (1106), Expect = e-153 Identities = 153/362 (42%), Positives = 215/362 (59%), Gaps = 9/362 (2%) Query: 7 LAGIVLALVATLAHAERIRDLTSVQGVRENSLIGYGLVVGLDGTGDQTTQTPFTTQTLNN 66 A L+ A RI+D+ S+Q R+N LIGYGLVVGL GTGD +PFT Q++ Sbjct: 14 SALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRA 73 Query: 67 MLSQLGITVPTGTNMQLKNVAAVMVTASYPPFARQGQTIDVVVSSMGNAKSLRGGTLLMT 126 ML LGIT G + KN+AAVMVTA+ PPFA G +DV VSS+G+A SLRGG L+MT Sbjct: 74 MLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMT 132 Query: 127 PLKGVDSQVYALAQGNILVGGVGASAGGSSVQVNQLNGGRITNGAIIERELPTQFGAGNT 186 L G D Q+YA+AQG ++V G A +++ R+ NGAIIERELP++F Sbjct: 133 SLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVN 192 Query: 187 INLQLNDEDFTMAQQITDAINRAR----GYGSATALDARTVQVRVPSGNSSQVRFLADIQ 242 + LQL + DF+ A ++ D +N G A D++ + V+ P + R +A+I+ Sbjct: 193 LVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEIE 251 Query: 243 NMEVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQLNVNQPNTPFGGG 302 N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF G Sbjct: 252 NLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSRG 309 Query: 303 QTVVTPQTQIDLRQSGGSLQSVRSSANLNSVVRALNALGATPMDLMSILQSMQSAGCLRA 362 QT V PQT I Q G + ++ +L ++V LN++G +++ILQ ++SAG L+A Sbjct: 310 QTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQA 368 Query: 363 KL 364 +L Sbjct: 369 EL 370
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 499 bits (1285), Expect = 0.0 Identities = 263/316 (83%), Positives = 289/316 (91%), Gaps = 3/316 (0%) Query: 1 MIGDGKLLASAAWDAQSLNELKAKAGQDPAANIRPVARQVEGMFVQMMLKSMREALPKDG 60 MI D KLLASAAWDAQSLNELKAKAG+DPAANIRPVARQVEGMFVQMMLKSMR+ALPKDG Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60 Query: 61 LFSSDQTRLYTSMYDQQIAQQMTAGKGLGLADMMVKQMTGGQTMPADDAPQVPLKFSLET 120 LFSS+ TRLYTSMYDQQIAQQMTAGKGLGLA+MMVKQMT Q +P + P P+KF LET Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120 Query: 121 VNSYQNQALTQLVRKAIPKTPDSSDAPLSGDSKDFLARLSLPARLASEQSGVPHHLILAQ 180 V YQNQAL+QLV+KA+P+ D S L GDSK FLA+LSLPA+LAS+QSGVPHHLILAQ Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDS---LPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQ 177 Query: 181 AALESGWGQRQILRENGEPSYNVFGVKATASWKGPVTEITTTEYENGEAKKVKAKFRVYS 240 AALESGWGQRQI RENGEPSYN+FGVKA+ +WKGPVTEITTTEYENGEAKKVKAKFRVYS Sbjct: 178 AALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYS 237 Query: 241 SYLEALSDYVALLTRNPRYAAVTTAATAEQGAVALQNAGYATDPNYARKLTSMIQQLKAM 300 SYLEALSDYV LLTRNPRYAAVTTAA+AEQGA ALQ+AGYATDP+YARKLT+MIQQ+K++ Sbjct: 238 SYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSI 297 Query: 301 SEKVSKTYSANLDNLF 316 S+KVSKTYS N+DNLF Sbjct: 298 SDKVSKTYSMNIDNLF 313
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 664 bits (1714), Expect = 0.0 Identities = 438/553 (79%), Positives = 487/553 (88%), Gaps = 8/553 (1%) Query: 2 SSLINHAMSGLNAAQAALNTVSNNINNYNVAGYTRQTTILAQANSTLGAGGWIGNGVYVS 61 SSLIN+AMSGLNAAQAALNT SNNI++YNVAGYTRQTTI+AQANSTLGAGGW+GNGVYVS Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60 Query: 62 GVQREYDAFITNQLRGAQNQSSGLTTRYEQMSKIDNLLADKSSSLSGSLQSFFTSLQTLV 121 GVQREYDAFITNQLR AQ QSSGLT RYEQMSKIDN+L+ +SSL+ +Q FFTSLQTLV Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120 Query: 122 SNAEDPAARQALIGKAEGLVNQFKTTDQYLRDQDKQVNIAIGSSVAQINNYAKQIANLND 181 SNAEDPAARQALIGK+EGLVNQFKTTDQYLRDQDKQVNIAIG+SV QINNYAKQIA+LND Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180 Query: 182 QISRMTGVGAGASPNDLLDQRDQLVSELNKIVGVEVSVQDGGTYNLTMANGYTLVQGSTA 241 QISR+TGVGAGASPN+LLDQRDQLVSELN+IVGVEVSVQDGGTYN+TMANGY+LVQGSTA Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240 Query: 242 RQLAAVPSSADPTRTTVAYVDEAAGNIEIPEKLLNTGSLGGLLTFRSQDLDQTRNTLGQL 301 RQLAAVPSSADP+RTTVAYVD AGNIEIPEKLLNTGSLGG+LTFRSQDLDQTRNTLGQL Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300 Query: 302 ALAFADAFNAQHTKGYDADGNKGKDFFSIGSPVVYSNSNNADKTVSLTAKVVDSTKVQAT 361 ALAFA+AFN QH G+DA+G+ G+DFF+IG P V N+ N V++ A V D++ V AT Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGD-VAIGATVTDASAVLAT 359 Query: 362 DYKIVFDGTDWQVTRTADNTTFTATKDADGKLEIDGLKVTVGTGAQKNDSFLLKPVSNAI 421 DYKI FD WQVTR A NTTFT T DA+GK+ DGL++T NDSF LKPVS+AI Sbjct: 360 DYKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAI 419 Query: 422 VDMNVKVTNEAEIAMASESKLDPDVDTGDSDNRNGQALLDLQ-NSNVVGGNKTFNDAYAT 480 V+M+V +T+EA+IAMASE D GDSDNRNGQALLDLQ NS VGG K+FNDAYA+ Sbjct: 420 VNMDVLITDEAKIAMASEE------DAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYAS 473 Query: 481 LVSDVGNKTSTLKTSSTTQANVVKQLYKQQQSVSGVNLDEEYGNLQRYQQYYLANAQVLQ 540 LVSD+GNKT+TLKTSS TQ NVV QL QQQS+SGVNLDEEYGNLQR+QQYYLANAQVLQ Sbjct: 474 LVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQ 533 Query: 541 TANALFDALLNIR 553 TANA+FDAL+NIR Sbjct: 534 TANAIFDALINIR 546
>FLAGELLIN#Flagellin signature. Length = 507 Score = 41.2 bits (96), Expect = 4e-06 Identities = 30/138 (21%), Positives = 59/138 (42%) Query: 1 MRISTQMMYEQNMSGITNSQAEWMKLGEQMSTGKRVTNPSDDPIAASQAVVLSQAQAQNS 60 I+T + + + SQ+ E++S+G R+ + DD + A + + Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61 Query: 61 QYALARTFATQKVSLEESVLSQVTTAIQTAQEKIVYAGNGTLSDDDRASLATDLQGIRDQ 120 Q + E L+++ +Q +E V A NGT SD D S+ ++Q ++ Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121 Query: 121 LMNLANSTDGNGRYIFAG 138 + ++N T NG + + Sbjct: 122 IDRVSNQTQFNGVKVLSQ 139
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 55.1 bits (132), Expect = 2e-09 Identities = 44/263 (16%), Positives = 84/263 (31%), Gaps = 34/263 (12%) Query: 513 PSEEEYAERKRPEQPALATFAMPDVPPAPTPVEPAVSVATAKKDNVAAAQPAQPGLFSRF 572 P E+ + DVP P+ + A+ D PA Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSN-----NEEIARVDEAPVPPPAPA------ 1031 Query: 573 LNALKQLFSGEETKAVETAAPKAEEKAERQQDRRKPRQNNRRDRNERRDTRDNR----AG 628 + E +K K E+ A QN + + + + N Sbjct: 1032 TPSETTETVAENSKQESKTVEKNEQDATE-----TTAQNREVAKEAKSNVKANTQTNEVA 1086 Query: 629 RDGGESRDDNRRNRRQTQQQNAEAR---DTRQQETAEKVKTGDEQQQTPRRERSRRRNDD 685 + G E+++ ++T E + +T + + KV + Q +P++E+S Sbjct: 1087 QSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS----QVSPKQEQSETVQPQ 1142 Query: 686 KRQAQQEVKALNREEQPVQETEQEERVQQVQPRRKQRQLNQKVRFTNSAVVETVDTPVVV 745 A++ +N +E Q + QP ++ N + T S V T ++ V Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTAD---TEQPAKETSS-NVEQPVTESTTVNTGNSVVEN 1198 Query: 746 DEPRPVENVEQPVPAPRTELAKV 768 E + P +E + Sbjct: 1199 PENTTPATTQ---PTVNSESSNK 1218 Score = 39.3 bits (91), Expect = 1e-04 Identities = 48/331 (14%), Positives = 81/331 (24%), Gaps = 45/331 (13%) Query: 630 DGGESRDDNRRNRRQTQQQNAEARDTRQQETAEKVKTGDEQQQTPRRERSRRRNDDKRQA 689 D G + R + N E Q + T + Q S + Sbjct: 963 DLGAWKYKLRNVNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDE 1022 Query: 690 QQEVKALNREEQPVQETEQEERVQQVQPRRKQRQLNQKVRFTNSAVVETVDTPVVVDEPR 749 ET E Q+ + K Q + N V + V + Sbjct: 1023 APVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKE-AKSNVKANTQ 1081 Query: 750 PVENVEQPVPAPRTELAKVDLPVVADIAPEQDDSVEPRDNTGMPRRSRRSPRHLRVSGQR 809 E + T+ + A + E+ VE +++ P+ + Sbjct: 1082 TNEVAQSGSETKETQTTETKET--ATVEKEEKAKVETE-------KTQEVPKVTSQVSPK 1132 Query: 810 RRRYRDERYPTQSPMPLTVACASPEMASGKVWIRYPIVRPQETQVVDEQREADLALPQPV 869 + + + + P V +E Q AD P Sbjct: 1133 QEQSETVQPQAEPARE-----------------NDPTVNIKEPQS-QTNTTADTEQPAKE 1174 Query: 870 VAEPQVTAATVALEPQASVQAVENVVVEPQTVAEPQTPEVVEVETTHPEVIAAPVDEQPQ 929 Q V + + PE TT P V + ++ Sbjct: 1175 T-------------SSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKN 1221 Query: 930 LIAESDTPVAQEVIADAEPVAETADASITVA 960 S V V EP +++ TVA Sbjct: 1222 RHRRSVRSVPHNV----EPATTSSNDRSTVA 1248
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 32.5 bits (73), Expect = 8e-04 Identities = 28/110 (25%), Positives = 43/110 (39%), Gaps = 10/110 (9%) Query: 53 LTDATAALQQEVTERAKEKRRQHAADEERKRADEELAKIQADADAAERARGGLQQQLAAV 112 T+A ++LQ + AA + A A+ QA A+A +A +QQ A Sbjct: 193 FTEAISSLQIRMN-------TLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIR 245 Query: 113 Q-RQLAGSETGRLSALAAASQ--AKAETGILLAQLLGEADDLAGKFAKEA 159 A G + A AA A+ LAQ + +A + G+ A Sbjct: 246 AANTYAMPANGSVVATAAGRGLIQVAQGAASLAQAISDAIAVLGRVLASA 295
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 37.0 bits (85), Expect = 4e-04 Identities = 40/227 (17%), Positives = 72/227 (31%), Gaps = 25/227 (11%) Query: 516 TEYDQQQLNELQEQKRQKDLLDAKAQAERNYQETQKRRNEQNAALNRDNETESLRHQREV 575 + L+ AK + + R S ++ Sbjct: 189 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKI 248 Query: 576 ARITAMQYADAAVRNAALERENERHKKALSQQAKKPKTYHNDEARRLLLQYSQQQAQTEG 635 + A + A R A LE+ E + EA + L+ + + + Sbjct: 249 KTLEA-EKAALEARQAELEKALEGAMNFSTA---DSAKIKTLEAEKAALEAEKADLEHQS 304 Query: 636 QIAAAKLSTTE----------KMTEAHKQLLSFQQRIADLSGKKLTADEQSVLAHKDEIA 685 Q+ A + K EA Q L Q +I++ S + L D + K ++ Sbjct: 305 QVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLE 364 Query: 686 LALQKL-------DISQQDLQHQ-NALNELKKKTLTLTSQLADEESR 724 QKL + S+Q L+ +A E KK+ + L + S+ Sbjct: 365 AEHQKLEEQNKISEASRQSLRRDLDASREAKKQ---VEKALEEANSK 408
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 35.8 bits (82), Expect = 8e-04 Identities = 58/240 (24%), Positives = 86/240 (35%), Gaps = 18/240 (7%) Query: 123 NATAAGQASEQAQTSAGQASES-----ATAAVNAAGAAEASATQAASSAASAESSAGTA- 176 N T G S G SES ATA + A + A QAA + A+AE+ A Sbjct: 26 NGTPDGSGSGGGGGKGGSKSESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKA 85 Query: 177 -----TTKAGEASASAASADTARTAAAASAAAAKTSEANADASRTA---AGDSAAAAAAS 228 T + + A + +RT +A A A + A+ R A + A A + Sbjct: 86 NRDALTQRLKDIVNEALRHNASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEA 145 Query: 229 ATAAQTSAERAGASETAAKTSETQAASSAGDAGASATAAAASEKAAAASAAAAKTSETNA 288 A A AE+ E + +ET+ +A AA SE+A A A K S + Sbjct: 146 AEKAFQEAEQR-RKEIEREKAETERQLKLAEA-EEKRLAALSEEAKAVEIAQKKLSAAQS 203 Query: 289 ATSASTAAASATAASSSASEASTHAAASDTSASLA--AQSSTAAGAAATRAEDAAKRAED 346 + S+S + A + AQ+S + + RA D Sbjct: 204 EVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRAND 263 Score = 30.8 bits (69), Expect = 0.029 Identities = 47/261 (18%), Positives = 84/261 (32%), Gaps = 7/261 (2%) Query: 97 DVRPEALRSFEAMVEEVARQASEASRNATAAGQASEQAQTSAGQASESATAAVNAAGAAE 156 D+ EALR + A A+ A A + + +A + A AA A AE Sbjct: 96 DIVNEALRHNASRTPSATELA-HANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAE 154 Query: 157 ASATQAASSAASAESSAGTATTKAGEASASAASADTARTAAAASAAAAKTSEANADASRT 216 + A E A E AA ++ A+ A A K S A ++ + Sbjct: 155 QRRKEIEREKAETERQLKLAEA---EEKRLAALSEEAK---AVEIAQKKLSAAQSEVVKM 208 Query: 217 AAGDSAAAAAASATAAQTSAERAGASETAAKTSETQAASSAGDAGASATAAAASEKAAAA 276 + S++ AE + + ++ A D + A++ Sbjct: 209 DGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYKELDELVKKLSPRANDPLQNR 268 Query: 277 SAAAAKTSETNAATSASTAAASATAASSSASEASTHAAASDTSASLAAQSSTAAGAAATR 336 A A TA+ + + + + S + + A A Sbjct: 269 PFFEATRRRVGAGKIREEKQKQVTASETRINRINADITQIQKAISQVSNNRNAGIARVHE 328 Query: 337 AEDAAKRAEDIADVISLEDAS 357 AE+ K+A++ ++DA Sbjct: 329 AEENLKKAQNNLLNSQIKDAV 349
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 197 bits (503), Expect = 2e-67 Identities = 67/187 (35%), Positives = 94/187 (50%), Gaps = 18/187 (9%) Query: 1 MKNIILSTLVITTSVLVVNVAQADTNAFSVGYAQSKVQDFKN-IRGVNVKYRYE-DDSPV 58 MK I + + + A T+ + GYAQS Q N + G N+KYRYE D+SP+ Sbjct: 1 MKKIACLSALAAVLAFTAGTSVAATSTVTGGYAQSDAQGQMNKMGGFNLKYRYEEDNSPL 60 Query: 59 SFISSLSYLYGDRQASGSVEPEGIHYHDKFEVKYGSLMVGPAYRLSDNFSLYALAGVGTV 118 I S +Y R AS D + +Y + GPAYR++D S+Y + GVG Sbjct: 61 GVIGSFTYTEKSRTASSG---------DYNKNQYYGITAGPAYRINDWASIYGVVGVGYG 111 Query: 119 KATFKEHSTQDGDSFSNKISSRKTGFAWGAGVQMNPLENIVVDVGYEGSNISSTKINGFN 178 K E+ T D+ GF++GAG+Q NP+EN+ +D YE S I S + + Sbjct: 112 KFQTTEYPTYKHDT-------SDYGFSYGAGLQFNPMENVALDFSYEQSRIRSVDVGTWI 164 Query: 179 VGVGYRF 185 GVGYRF Sbjct: 165 AGVGYRF 171
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.4 bits (63), Expect = 0.002 Identities = 8/37 (21%), Positives = 17/37 (45%), Gaps = 5/37 (13%) Query: 4 LSWIIFGLIAGILAKWIMPG-----KDGGGFFMTIIL 35 + I+ G I+G++ W+ K ++ I+L Sbjct: 163 AAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILL 199
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 28.5 bits (63), Expect = 0.012 Identities = 17/59 (28%), Positives = 25/59 (42%) Query: 49 QGLTVGIIILTIGVMAPIASGTLPPSTLIHSFVNWKSLVAIAVGVFVSWLGGRGITLMG 107 Q + L IG + + LPPS ++ N ++ A VS LG +TL G Sbjct: 174 QRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDG 232
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 29.0 bits (65), Expect = 0.030 Identities = 16/53 (30%), Positives = 25/53 (47%), Gaps = 7/53 (13%) Query: 234 VVARCQEICGK--DNLGLVIECSGANIALKQAIDMLRPNGEVVRVGMGFKPLD 284 V R EI K ++L ++ C N + A+ NG+ + MGF PL+ Sbjct: 186 VSQRAAEILNKPIESLKIIT-CHLGNGSSIAAVK----NGKSIDTSMGFTPLE 233
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.0 bits (70), Expect = 0.010 Identities = 22/88 (25%), Positives = 34/88 (38%), Gaps = 5/88 (5%) Query: 71 MFLGALVGGIIGDKTGRRNAFILYEAIHIASMVVGAFSPNMMF-LIACRFVMGVGLGALL 129 +G V G + D+ G + + I+ V+G + LI RF+ G G A Sbjct: 62 FSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFP 121 Query: 130 VTLFAGFTEYMPGRNR----GTWSSRVS 153 + Y+P NR G S V+ Sbjct: 122 ALVMVVVARYIPKENRGKAFGLIGSIVA 149
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 39.5 bits (92), Expect = 3e-05 Identities = 24/118 (20%), Positives = 47/118 (39%), Gaps = 1/118 (0%) Query: 65 ALMLGYFIGSLTGGFIGDYLGRRKAFRINLLLVGISATAAAFVPNMY-WLIFFRCLMGTG 123 A ML + IG+ G + D LG ++ +++ + + + LI R + G G Sbjct: 57 AFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG 116 Query: 124 MGALIMVGYASFTEFIPPVVRGKWSARLSFVGNWSPMLSAGIGVVVIAFLSWRMMFLL 181 A + +IP RGK + + + IG ++ ++ W + L+ Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI 174
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 32.3 bits (73), Expect = 0.001 Identities = 38/178 (21%), Positives = 58/178 (32%), Gaps = 32/178 (17%) Query: 3 NRALLLV-DLQNDFCAGGALAVAEGDSTIDIANALIDWCQPRQIPVLASQDWHPAQHGSF 61 NRA+LL+ D+QN F + L + C IPV+ Sbjct: 29 NRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVV------------- 75 Query: 62 ASQHQAEPYSQGELD-GLPQTLW-PDHCVQHTDVAALHPLLNQHAIDACIYKGENPLIDS 119 + A+P SQ D L W P + + L + D + K Sbjct: 76 ---YTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDD-DLVLTKWR------ 125 Query: 120 YSAFFDNEHRQKTTLDTWLREHDVTELIVMGLATDYCVKFTVLDALELGYAVNVITDG 177 YSAF +T L +R+ +LI+ G+ T +A + D Sbjct: 126 YSAFK------RTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDA 177
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.5 bits (79), Expect = 8e-04 Identities = 32/166 (19%), Positives = 71/166 (42%), Gaps = 7/166 (4%) Query: 23 FLHGMSVITLAQNMTSLAQKFSTDSAGIAYLISGIGLGRLVSILFFGVLSDKFGRRAIIL 82 F ++ + L ++ +A F+ A ++ + L + +G LSD+ G + ++L Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 83 LGAVLYML----FFFGIPASPNLMIAFILAVCVGVANSALDTGGYPALMECFPKASGSAV 138 G ++ F G L++A + A AL + G A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARY--IPKENRGKAF 141 Query: 139 ILVKAMVSFGQMIYPLIVSALLVNHIWYGYAVVIPGILFVLITLML 184 L+ ++V+ G+ + P I ++ ++I + Y ++IP I + + ++ Sbjct: 142 GLIGSIVAMGEGVGPAI-GGMIAHYIHWSYLLLIPMITIITVPFLM 186
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 26.8 bits (59), Expect = 0.008 Identities = 14/29 (48%), Positives = 19/29 (65%) Query: 6 QLILGAVVLGSTLLAGCSSNAKIDQLSSD 34 +L L A+ LG+TLL GC+S+ Q SD Sbjct: 2 KLRLSALALGTTLLVGCASSGTDQQGRSD 30
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 27.2 bits (60), Expect = 0.006 Identities = 17/45 (37%), Positives = 26/45 (57%), Gaps = 1/45 (2%) Query: 5 KLVLGAVILGSTLLAGCSSNAKIDQLSSD-VQTLNAKVDQLSNDV 48 KL L A+ LG+TLL GC+S+ Q SD ++ N + + +V Sbjct: 2 KLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNV 46
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 84.5 bits (209), Expect = 2e-21 Identities = 31/127 (24%), Positives = 56/127 (44%) Query: 2 ATIHLLDDDTAVTNACAFLLESLGYDVKCWTQGADFLAQASLYQAGVVLLDMRMPVLDGQ 61 ATI + DDD A+ L GYDV+ + A + +V+ D+ MP + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 GVHDALRQCGSTLAVVFLTGHGDVPMAVEQMKRGAVDFLQKPVSVKPLQAALERALTVSS 121 + +++ L V+ ++ A++ ++GA D+L KP + L + RAL Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 122 AAVARRE 128 ++ E Sbjct: 124 RRPSKLE 130
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 66.0 bits (161), Expect = 7e-15 Identities = 28/119 (23%), Positives = 50/119 (42%), Gaps = 2/119 (1%) Query: 1 MKEYKILLVDDHEIIINGIMNALLPWPHFKIVEHVKNGLEVYNACCAYEPDILILDLSLP 60 M IL+ DD I + AL + V N ++ A + D+++ D+ +P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GINGLDIIPQLHQRWPAMNILVYTAYQQEYMTIKTLAAGANGYVLKSSSQQVLLAALQT 119 N D++P++ + P + +LV +A IK GA Y+ K L+ + Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 69.1 bits (169), Expect = 3e-14 Identities = 31/156 (19%), Positives = 58/156 (37%), Gaps = 13/156 (8%) Query: 691 ILLVDDADINRDIISKMLVSLGQHVTIAASSNEALTLSQQQRFDLVLIDIRMPEIDGIEC 750 IL+ DD R ++++ L G V I +++ DLV+ D+ MP+ + + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 751 VQLWHDEPNNLDPDCMFVALSASVATEDIHRCKKNGIHHYITKPVTLATLARYISIAAEY 810 + PD + +SA + + G + Y+ KP L L Sbjct: 66 LP----RIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG-------- 113 Query: 811 QLLRNIELQEQDPSRCSALLAT-DDMVINSKIFQSL 845 + R + ++ PS+ +V S Q + Sbjct: 114 IIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEI 149
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 581 bits (1498), Expect = 0.0 Identities = 158/500 (31%), Positives = 259/500 (51%), Gaps = 15/500 (3%) Query: 11 LLFILNTVKSDELSWKGNDFTLYARQMPLAEVLHLLSENYDTAITISPLITATFSGKIPP 70 LL + + + EL W + A+ L ++L NYD + +S I SG+ Sbjct: 17 LLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDKVSGQFEH 76 Query: 71 GPPVDILNNLAAQYDLLTWFDGSMLYVYPASLLKHQVITFNILSTGRFIHYLRSQNILSS 130 P D L ++A+ Y+L+ ++DG++LY++ S + ++I L+ I Sbjct: 77 DNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQRSGIWE- 135 Query: 131 PGCEVKEITGTRAVEVSGVPSCLTRISQLASVLDNALIKR--KDSAVSVSIYTLKYATAM 188 P + R V VSG P L + Q A+ L+ R K A+++ I+ LKYA+A Sbjct: 136 PRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIFPLKYASAS 195 Query: 189 DTQYQYRDQSVVVPGVVSVL-REMSKTSVPASSTTN-----GSPATQALPMFAADPRQNA 242 D YRD V PGV ++L R +S ++ + N + A ADP NA Sbjct: 196 DRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQAATRASAQARVEADPSLNA 255 Query: 243 VIVRDYAANMAGYRKLITELDQRQQMIEISVKIIDVNAGDINQLGIDWGTAVSLGG---- 298 +IVRD M Y++LI LD+ IE+++ I+D+NA + +LG+DW + G Sbjct: 256 IIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQV 315 Query: 299 --KKIAFNTGLNDGGASGFSTVISDTSNFMVRLNALEKSSQAYVLSQPSVVTLNNIQAVL 356 K + + GA G + R+N LE A V+S+P+++T N QAV+ Sbjct: 316 VIKTTGDQSNIASNGALGSLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVI 375 Query: 357 DKNITFYTKLQGEKVAKLESITTGSLLRVTPRLLNDNGTQKIMLNLNIQDGQQSDTQSET 416 D + T+Y K+ G++VA+L+ IT G++LR+TPR+L +I LNL+I+DG Q S Sbjct: 376 DHSETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGI 435 Query: 417 DPLPEVQNSEIASQATLLAGQSLLLGGFKQGKQIHSQNKIPLLGDIPVVGHLFRNDTTQV 476 + +P + + + + A + GQSL++GG + + + +K+PLLGDIP +G LFR + Sbjct: 436 EGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELT 495 Query: 477 HSVIRLFLIKASVVNNGISH 496 +RLF+I+ +++ GI+H Sbjct: 496 RRTVRLFIIEPRIIDEGIAH 515
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 87.7 bits (217), Expect = 9e-25 Identities = 40/154 (25%), Positives = 67/154 (43%), Gaps = 7/154 (4%) Query: 6 TLQQAHDTMRFFRRGGSLRMLL---DDDVTQPLNTLYRYATQLMEVKEFAGAARLFQLLT 62 T + F + GG++ ML D L LY A + ++ A ++FQ L Sbjct: 8 TQEYQLAMESFLKGGGTIAMLNEISSDT----LEQLYSLAFNQYQSGKYEDAHKVFQALC 63 Query: 63 IYDAWSFDYWFRLGECCQAQKHWGEAIYAYGRAAQIKIDAPQAPWAAAECYLACDNVCYA 122 + D + ++ LG C QA + AI++Y A + I P+ P+ AAEC L + A Sbjct: 64 VLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEA 123 Query: 123 IKALKAVVRICGEVSEHQILRLRAEKMLQQLSDR 156 L + + +E + L R ML+ + + Sbjct: 124 ESGLFLAQELIADKTEFKELSTRVSSMLEAIKLK 157
>PF05844#YopD protein Length = 295 Score = 28.4 bits (63), Expect = 0.018 Identities = 29/149 (19%), Positives = 49/149 (32%), Gaps = 19/149 (12%) Query: 9 VLPAPSL-LTPSSSQAPSGEGMGTESMLLLFDDIWTKLMELAKKLRDIMRSYNVVKQRLG 67 L AP L P +A E + +LL+ I K EL RD + Q+ Sbjct: 50 ELNAPRQVLDPVRMEAAGSELDSSVELLLILFRIAQKARELGVLQRDNENQAIIHAQK-- 107 Query: 68 WELQVNVLQTQMKTIDEAFRASMITAGGAMLSGVLTIGLGAVGGETGLIAGQAVGHTAGG 127 +DE + + A+++GV + VG L G+A+ Sbjct: 108 ------------AQVDEMRSGATLMIAMAVIAGVGALASAVVGSLGALKNGKAISQEK-- 153 Query: 128 VMGLGAGVAQRQSDQNKAIADLKQNGAQS 156 L + R + + L + + Sbjct: 154 --TLQKNIDGRNELIDAKMQALGKTSDED 180
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 79.2 bits (195), Expect = 1e-21 Identities = 26/127 (20%), Positives = 49/127 (38%) Query: 16 LKQLLSVDPETVYASGYASWQEGDYSRAVIDFSWLVMAQPWSWRAHIALAGTWMMLKEYT 75 L ++ S E +Y+ + +Q G Y A F L + + R + L + +Y Sbjct: 28 LNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYD 87 Query: 76 TAINFYGHALMLDASHPEPVYQTGVCLKMMGEPGLAREAFQTAIKMSYADASWSEIRQNA 135 AI+ Y + ++D P + CL GE A A ++ + E+ Sbjct: 88 LAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKTEFKELSTRV 147 Query: 136 QIMVDTL 142 M++ + Sbjct: 148 SSMLEAI 154
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 53.4 bits (128), Expect = 2e-10 Identities = 29/183 (15%), Positives = 69/183 (37%), Gaps = 15/183 (8%) Query: 23 LYRSLPEDEANQMLALLMQHHIDAEKKQEEDGVTLRVEQSQFINAVELLRLNGYPHRQFT 82 L+ +L + + ++A L Q +I + V + L G P + Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNIPYRFA--NGSGAIEVPADKVHELRLRLAQQGLP-KGGA 109 Query: 83 TADKMFPANQLVVSPQEEQQKINFLK--EQRIEGMLSQMEGVINAKVTIALPTYDEGS-- 138 ++ + +S EQ +N+ + E + + + V +A+V +A+P + S Sbjct: 110 VGFELLDQEKFGISQFSEQ--VNYQRALEGELARTIETLGPVKSARVHLAMP---KPSLF 164 Query: 139 --NASPSSVAVFIKYSPQVNMEAFRVK-IKDLIEMSIPGLQYSKISILMQPAEFRMVPDV 195 S +V + P ++ ++ + L+ ++ GL ++++ Q + Sbjct: 165 VREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNT 224 Query: 196 PAR 198 R Sbjct: 225 SGR 227
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 51.1 bits (122), Expect = 3e-10 Identities = 21/67 (31%), Positives = 38/67 (56%) Query: 247 LEQIPQQVLFEIGRASLEIGQLRQLKTGDVLPVGGCFAPEVTIRVNDRIIGQGELIACGN 306 + IP ++ E+GR + I +L +L G V+ + G + I +N +I QGE++ + Sbjct: 57 IMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVAD 116 Query: 307 EFMVRIT 313 ++ VRIT Sbjct: 117 KYGVRIT 123
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 231 bits (592), Expect = 9e-80 Identities = 79/215 (36%), Positives = 130/215 (60%), Gaps = 8/215 (3%) Query: 8 LQLIGILFLLSILPLIIVMGTSFLKLAVVFSILRNALGIQQVPPNIALYGLALVLSLFIM 67 + LI +L ++LP II GT F+K ++VF ++RNALG+QQ+P N+ L G+AL+LS+F+M Sbjct: 5 ISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMFVM 64 Query: 68 GPTLLAVKERWHPVQVAGAPFWT-SEWDSKALAPYRQFLQKNSEEKEANYFRNLIKRTWP 126 P + + V + S+ + L YR +L K S+ + +F N + Sbjct: 65 WPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKRQY 124 Query: 127 ED-------IKRKIKPDSLLILIPAFTVSQLTQAFRIGLLIYLPFLAIDLLISNILLAMG 179 + K +I+ S+ L+PA+ +S++ AF+IG +YLPF+ +DL++S++LLA+G Sbjct: 125 GEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLALG 184 Query: 180 MMMVSPMTISLPFKLLIFLLAGGWDLTLAQLVQSF 214 MMM+SP+TIS P KL++F+ GW L L+ + Sbjct: 185 MMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 72.5 bits (178), Expect = 9e-21 Identities = 30/85 (35%), Positives = 50/85 (58%) Query: 4 SELTQFVTQLLWIVLFTSMPVVLVASVVGVIVSLVQALTQIQDQTLQFMIKLLAIAITLM 63 +L + L++VL S +VA+++G++V L Q +TQ+Q+QTL F IKLL + + L Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 VSYPWLSGILLNYTRQIMLRIGEHG 88 + W +LL+Y RQ++ G Sbjct: 62 LLSGWYGEVLLSYGRQVIFLALAKG 86
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 163 bits (415), Expect = 7e-52 Identities = 55/229 (24%), Positives = 101/229 (44%), Gaps = 5/229 (2%) Query: 8 WLIALAVAFIRPLSLSLLLPLLKSGSLGAALLRNGVLMSLTFPILPIIYQQKIMMHIGKD 67 WL +R L+L P+L S+ ++ G+ M +TF I P + + + Sbjct: 12 WLNLYFWPLLRVLALISTAPILSERSVPK-RVKLGLAMMITFAIAPSLPANDVPVF---S 67 Query: 68 YSWLGLVTGEVIIGFLIGFCAAVPFWAVDMAGFLLDTLRGATMGTIFNSTIEAETSLFGL 127 + L L +++IG +GF F AV AG ++ G + T + + Sbjct: 68 FFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLAR 127 Query: 128 LFSQFLCVIFFISGGMEFILNILYESYQYLPPGRTLLFDQQFLKYIQAEWRTLYQLCISF 187 + ++F G +++++L +++ LP G L FL +A ++ + Sbjct: 128 IMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSL-IFLNGLML 186 Query: 188 SLPAIICMVLADLALGLLNRSAQQLNVFFLSMPLKSILVLLTLLISFPY 236 +LP I ++ +LALGLLNR A QL++F + PL + + + P Sbjct: 187 ALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPL 235
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 57.2 bits (138), Expect = 4e-11 Identities = 44/192 (22%), Positives = 85/192 (44%), Gaps = 8/192 (4%) Query: 36 LSDIAESFHMQTAQVGIMLTIYAWVVAVMSLPFMLLTSQMERRKLLICLFVLFIASHVLS 95 L DIA F+ A + T + ++ + + L+ Q+ ++LL+ ++ V+ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 96 FLAWN-FTVLVISRIGIAFAHAIFWSITASLAIRLAPAGKRAQALSLIATGTALAMVLGL 154 F+ + F++L+++R A F ++ + R P R +A LI + A+ +G Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 155 PIGRVVGQYFGWRTTFFAIGMGALITLLCLIKLLPKLPSEHSGSLKSLPLLFRRPALMSL 214 IG ++ Y W + I M +IT+ L+KLL K + LMS+ Sbjct: 157 AIGGMIAHYIHW-SYLLLIPMITIITVPFLMKLLKK------EVRIKGHFDIKGIILMSV 209 Query: 215 YVLTVVVVTAHY 226 ++ ++ T Y Sbjct: 210 GIVFFMLFTTSY 221
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 30.5 bits (69), Expect = 0.008 Identities = 12/51 (23%), Positives = 21/51 (41%), Gaps = 1/51 (1%) Query: 22 GQGKVADYIPALASVEGSKLGI-AICTVDGQHYQAGDAHERFSIQSISKVL 71 + + I S ++G+ + G+ A A ERF + S KV+ Sbjct: 21 ASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVV 71
>MPTASEINHBTR#Metalloprotease inhibitor signature. Length = 122 Score = 25.7 bits (56), Expect = 0.015 Identities = 6/43 (13%), Positives = 14/43 (32%) Query: 30 AGRGELSQSEQQRLLQLTDDAQRMRERIQALEDILDAEHPNWR 72 AG+ + + + A + + E L + +W Sbjct: 37 AGQLGIEATGSGVCAGPAEQANALAGDVACAEQWLGDKPVSWS 79
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 28.3 bits (63), Expect = 0.028 Identities = 19/104 (18%), Positives = 43/104 (41%), Gaps = 5/104 (4%) Query: 40 LVEVRSNSARALAEKKQLSRRIEQATTQQTEWQEKAELA-LRKDKDDLARAALIEKQKLT 98 + + R + +L K+ +++ + + EL + + + L K++ Sbjct: 232 VEKSRLDDFSSLLHKQAIAK-HAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ 290 Query: 99 DLIATLEQEVTLVDDTLARMKKEIGELENKLSETRARQQALMLR 142 + + E+ D L + IG L +L++ RQQA ++R Sbjct: 291 LVTQLFKNEIL---DKLRQTTDNIGLLTLELAKNEERQQASVIR 331
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 344 bits (883), Expect = e-118 Identities = 124/341 (36%), Positives = 176/341 (51%), Gaps = 22/341 (6%) Query: 6 DNLLGEANRFLEVLEQVSRLAPLDKPVLIIGERGTGKELIANRLHYLSSRWQGPLISLNC 65 L+G + E+ ++RL D ++I GE GTGKEL+A LH R GP +++N Sbjct: 137 MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINM 196 Query: 66 AALNENLLDSELFGHEAGAFTGAQKRHPGRFERADGGTLFLDELATAPMLVQEKLLRVIE 125 AA+ +L++SELFGHE GAFTGAQ R GRFE+A+GGTLFLDE+ PM Q +LLRV++ Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQ 256 Query: 126 YGELERVGGSQPLQVNVRLVCATNADLPAMVKEGTFRADLLDRLAFDVVQLPPLRERQSD 185 GE VGG P++ +VR+V ATN DL + +G FR DL RL ++LPPLR+R D Sbjct: 257 QGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAED 316 Query: 186 IMLMAEHFAIQMCRELRLPLFPGFTDRAKETLLHYAWPGNVRELKNVVERSVYRHGSSE- 244 I + HF Q +E F A E + + WPGNVREL+N+V R + Sbjct: 317 IPDLVRHFVQQAEKEGLDVK--RFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVI 374 Query: 245 -------HPLDEIVIDPFQRHPAEPPAPALPAA------------SATPDLPLKLREFQL 285 EI P ++ A + ++ A Sbjct: 375 TREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLA 434 Query: 286 QQEKALLQRSLQQAKFNQKRAADLLALTYHQFRALLKKHQL 326 + E L+ +L + NQ +AADLL L + R +++ + Sbjct: 435 EMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.007 Identities = 9/16 (56%), Positives = 14/16 (87%) Query: 38 LVGESGSGKSLIAKAI 53 + GESG+GK L+A+A+ Sbjct: 165 ITGESGTGKELVARAL 180
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 94.3 bits (234), Expect = 3e-25 Identities = 60/242 (24%), Positives = 103/242 (42%), Gaps = 22/242 (9%) Query: 10 LQNRIILVTGASDGIGREAALTYARYGATVILLGRNEEKLRRVAQHIADEQHVQPQWFTL 69 ++ +I +TGA+ GIG A T A GA + + N EKL +V + E F Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA-FPA 64 Query: 70 DLLTCTAEECRQVADRIAAHYPRLDGILHNAGLLGEIGPMSEQDPQIWQDVMQVNVNATF 129 D+ + ++ RI +D +++ AG+L G + + W+ VN F Sbjct: 65 DV--RDSAAIDEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEEWEATFSVNSTGVF 121 Query: 130 MLTQALLPLLLKSDAGSLVFTSSSVGRQGRANWGAYATSKFATEGMMQVLADEYQNRPLR 189 ++++ ++ +GS+V S+ R + AYA+SK A + L E +R Sbjct: 122 NASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181 Query: 190 VNCINPGGTRTSMRASAFPTEDPQ------------------KLKTPADIMPLYLWLMGD 231 N ++PG T T M+ S + E+ KL P+DI L+L+ Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241 Query: 232 DS 233 + Sbjct: 242 QA 243
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.7 bits (72), Expect = 0.004 Identities = 16/112 (14%), Positives = 39/112 (34%), Gaps = 12/112 (10%) Query: 9 LEALHERHEEVQALLGDAGIIADQDRFRALSREYAQLS-DVSRCFTDWQQVQDDIETAQM 67 R ++ +LL I + +Y + ++ + +Q++ +I +A+ Sbjct: 230 SRVEKSRLDDFSSLLHKQAI--AKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKE 287 Query: 68 MLD--DPEMREMAQEELREAKEKSEQLEQQLQVLLLPKDPDDERNAFLEVRA 117 + ++LR+ + L +L +ER +RA Sbjct: 288 EYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKN-------EERQQASVIRA 332
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 31.1 bits (70), Expect = 0.016 Identities = 23/81 (28%), Positives = 37/81 (45%), Gaps = 16/81 (19%) Query: 288 LGAIESLLCAV----VL---DGMTGTKHKANSELIGQGLGNM---VAPFF------GGIT 331 L + +L A+ +L D T TK A EL + LGN+ ++ + G++ Sbjct: 242 LDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGLS 301 Query: 332 ATAAIARSAANVRAGATSPVS 352 +AA A A+ A SP+S Sbjct: 302 TSAAAAGLIASAVTLAISPLS 322
>SOPEPROTEIN#Salmonella type III secretion SopE effector protein signature. Length = 239 Score = 401 bits (1032), Expect = e-146 Identities = 163/237 (68%), Positives = 193/237 (81%) Query: 2 TNITLSTQHYRIHRSDVEPVKEKTTEKDIFAKSITAVRNSFISLSTSLSDRFSLHQQTDI 61 T ITLS Q++RI + + +KEK+TEK+ AKSI AV+N FI L + LS+RF H+ T+ Sbjct: 1 TKITLSPQNFRIQKQETTLLKEKSTEKNSLAKSILAVKNHFIELRSKLSERFISHKNTES 60 Query: 62 PTTHFHRGSASEGRAVLTSKTVKDFMLQKLNSLDIKGNASKDPAYARQTCEAILSAVYSN 121 THFHRGSASEGRAVLT+K VKDFMLQ LN +DI+G+ASKDPAYA QT EAILSAVYS Sbjct: 61 SATHFHRGSASEGRAVLTNKVVKDFMLQTLNDIDIRGSASKDPAYASQTREAILSAVYSK 120 Query: 122 NKDHCCKLLISKGVSITPFLKEIGEAAQNAGLPGEIKNGVFTPGGAGANPFVVPLIAAAS 181 NKD CC LLISKG++I PFL+EIGEAA+NAGLPG KN VFTP GAGANPF+ PLI++A+ Sbjct: 121 NKDQCCNLLISKGINIAPFLQEIGEAAKNAGLPGTTKNDVFTPSGAGANPFITPLISSAN 180 Query: 182 IKYPHMFINHNQQVSFKAYAEKIVMKEVTPLFNKGTMPTPQQFQLTIENIANKHLQN 238 KYP MFIN +QQ SFK YAEKI+M EV PLFN+ MPTPQQFQL +ENIANK++QN Sbjct: 181 SKYPRMFINQHQQASFKIYAEKIIMTEVAPLFNECAMPTPQQFQLILENIANKYIQN 237
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.0 bits (62), Expect = 0.011 Identities = 13/60 (21%), Positives = 27/60 (45%), Gaps = 2/60 (3%) Query: 60 WLCIDYLWVSESARSNGLGSKLMEMAEKEGLRKGCVHGLVDTFSFQ--ALPFYEKQGYIL 117 + I+ + V++ R G+G+ L+ A + +++T A FY K +I+ Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 34.0 bits (78), Expect = 0.001 Identities = 25/83 (30%), Positives = 40/83 (48%), Gaps = 10/83 (12%) Query: 123 QLPFAWPLSVILMLTALAALY--YHLPALLLFIVPLWLT-ALLASVRLNQYVNIRFLLVW 179 Q P +S +++ LAALY + +P ++ +VPL + LLA+ NQ ++ F++ Sbjct: 871 QAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGL 930 Query: 180 LTL------TAILIYGRFILQRW 196 LT AILI F Sbjct: 931 LTTIGLSAKNAILIVE-FAKDLM 952
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.3 bits (185), Expect = 4e-18 Identities = 24/113 (21%), Positives = 46/113 (40%), Gaps = 2/113 (1%) Query: 4 VLLVDDHELVRAGIRRILEDIKGIKVVGEACCGEDAVKWCRTNAVDVVLMDMNMPGIGGL 63 +L+ DD +R + + L G V + +W D+V+ D+ MP Sbjct: 6 ILVADDDAAIRTVLNQALSR-AGYDVRITSN-AATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 64 EATRKIARSTADIKVIMLTVHTENPLPAKVMQAGAAGYLSKGAAPQEVVSAIR 116 + +I ++ D+ V++++ K + GA YL K E++ I Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>FLAGELLIN#Flagellin signature. Length = 507 Score = 267 bits (684), Expect = 8e-86 Identities = 257/510 (50%), Positives = 300/510 (58%), Gaps = 13/510 (2%) Query: 2 AQVINTNSLSLLTQNNLNKSQSALGTAIERLSSGLRINSAKDDAAGQAIANRFTANIKGL 61 AQVINTNSLSLLTQNNLNKSQS+L +AIERLSSGLRINSAKDDAAGQAIANRFT+NIKGL Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60 Query: 62 TQASRNANDGISIAQTTEGALNEINNNLQRVRELAVQSANSTNSQSDLDSIQAEITQRLN 121 TQASRNANDGISIAQTTEGALNEINNNLQRVREL+VQ+ N TNS SDL SIQ EI QRL Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120 Query: 122 EIDRVSGQTQFNGVKVLAQDNTLTIQVGANDGETIDIDLKQINSQTLGLDTLNVQKKYDV 181 EIDRVS QTQFNGVKVL+QDN + IQVGANDGETI IDL++I+ ++LGLD NV + Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180 Query: 182 SDTAVAASYSDSKQNIAVPDKTAITAKIGAATSGGAGIKADISFKDGKYYATVSGYDDAA 241 + + + S SG K Y + Sbjct: 181 TVGDLKS--SFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTT 238 Query: 242 DTDKNGTYEVTVAADTGAVTFATTPTVVDLPTDAKAVSKVQQNDTEIAATNAKAALKAAG 301 D +N T A + K + K G Sbjct: 239 DDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGV-TFTIDTKTGNDGNG 297 Query: 302 VADAEADTATLVKMSYTDNNGKVIDGGFAFKTSGGYYAASV-------DKSGAASLKVTS 354 + + G ++S Y + V DK+ S K++ Sbjct: 298 KVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSD 357 Query: 355 YVD---ATTGTEKTAANKLGGADGKTEVVTIDGKTYNASKAAGHNFKAQPELAEAAATTT 411 ++ T A+ + VT+ GKT K A E A AA +T Sbjct: 358 LEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKST 417 Query: 412 ENPLQKIDAALAQVDALRSDLGAVQNRFNSAITNLGNTVNNLSSARSRIEDSDYATEVSN 471 NPL ID+AL++VDA+RS LGA+QNRF+SAITNLGNTV NL+SARSRIED+DYATEVSN Sbjct: 418 ANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSN 477 Query: 472 MSRAQILQQAGTSVLAQANQVPQNVLSLLR 501 MS+AQILQQAGTSVLAQANQVPQNVLSLLR Sbjct: 478 MSKAQILQQAGTSVLAQANQVPQNVLSLLR 507
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 110 bits (277), Expect = 2e-35 Identities = 90/103 (87%), Positives = 95/103 (92%) Query: 2 AAIQGIEGVISQLQATAMAASGQETHSQSTVSFAGQLHAALDRISDRQTAARVQAEKFTL 61 +AIQGIEGVISQLQATAM+A QE+ Q T+SFAGQLHAALDRISD QTAAR QAEKFTL Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60 Query: 62 GEPGIALNDVMADMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104 GEPG+ALNDVM DMQKASVSMQMGIQVRNKLVAAYQEVMSMQV Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 784 bits (2026), Expect = 0.0 Identities = 557/559 (99%), Positives = 558/559 (99%) Query: 2 SATASTATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQ 61 SATASTATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQ Sbjct: 1 SATASTATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQ 60 Query: 62 DGGAIVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKF 121 DGGAIVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKF Sbjct: 61 DGGAIVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKF 120 Query: 122 GISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLE 181 GISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLE Sbjct: 121 GISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLE 180 Query: 182 PGRALDEGQISAVVHLVSSAVAGLPLGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDV 241 PGRALDEGQISAVVHLVSSAVAGLP GNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDV Sbjct: 181 PGRALDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDV 240 Query: 242 ESRIQRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNIS 301 ESRIQRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNIS Sbjct: 241 ESRIQRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNIS 300 Query: 302 EQVGAGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRNTQRN 361 EQVGAGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPR+TQRN Sbjct: 301 EQVGAGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRN 360 Query: 362 ETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMG 421 ETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMG Sbjct: 361 ETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMG 420 Query: 422 FSDKRGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVR 481 FSDKRGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVR Sbjct: 421 FSDKRGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVR 480 Query: 482 PQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSD 541 PQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSD Sbjct: 481 PQLTRRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSD 540 Query: 542 NDPRVVALVIRQWMSNDHE 560 NDPRVVALVIRQWMSNDHE Sbjct: 541 NDPRVVALVIRQWMSNDHE 559
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 339 bits (870), Expect = e-118 Identities = 114/329 (34%), Positives = 196/329 (59%), Gaps = 2/329 (0%) Query: 1 MSNLSGTDKSVILLMTIGEDRAAEVFKHLSTREVQALSTAMANVRQISNKQLTDVLSEFE 60 +S L+G K+ ILL++IG + +++VFK+LS E+++L+ +A + I+++ +VL EF+ Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71 Query: 61 QEAEQFAALNINANEYLRSVLVKALGEERASSLLEDILETRDTTSGIETLNFMEPQSAAD 120 + + +Y R +L K+LG ++A ++ + L + + E + +P + + Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILN 130 Query: 121 LIRDEHPQIIATILVHLKRSQAADILALFDERLRHDVMLRIATFGGVQPAALAELTEVLN 180 I+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190 Query: 181 GLLDGQ-NLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENL 239 L + + GGV EIIN+ + E+ +I ++ E D ELA++I +MF+FE++ Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250 Query: 240 VDVDDRSIQRLLQEVDSESLLIALKGAEPPLREKFLRNMSQRAADILRDDLANRGPVRLS 299 V +DDRSIQR+L+E+D + L ALK + P++EK +NMS+RAA +L++D+ GP R Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310 Query: 300 QVENEQKAILLIVRRLAETGEMVIGSGED 328 VE Q+ I+ ++R+L E GE+VI G + Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGE 339
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 368 bits (945), Expect = e-133 Identities = 193/235 (82%), Positives = 209/235 (88%), Gaps = 7/235 (2%) Query: 1 MSNELPWQVWTPDDLAPPPETFVPVEADNVTLTEDTPEPELTAEQQLEQELAQLKIQAHE 60 MS+ LPW+ WTPDDLAPP FVP+ T+ E+ AE LEQ+LAQL++QAHE Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEE-------AEPSLEQQLAQLQMQAHE 53 Query: 61 QGYNAGLAEGRQKGHAQGYQEGLAQGLEQGQAQAQTQQAPIHARMQQLVSEFQNTLDALD 120 QGY AG+AEGRQ+GH QGYQEGLAQGLEQG A+A++QQAPIHARMQQLVSEFQ TLDALD Sbjct: 54 QGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALD 113 Query: 121 SVIASRLMQMALEAARQVIGQTPAVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRV 180 SVIASRLMQMALEAARQVIGQTP VDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRV Sbjct: 114 SVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRV 173 Query: 181 EEMLGATLSLHGWRLRGDPTLHHGGCKVSADEGDLDASVATRWQELCRLAAPGVL 235 ++MLGATLSLHGWRLRGDPTLH GGCKVSADEGDLDASVATRWQELCRLAAPGV+ Sbjct: 174 DDMLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228
>FLGFLIJ#Flagellar FliJ protein signature. Length = 147 Score = 206 bits (526), Expect = 4e-72 Identities = 130/147 (88%), Positives = 138/147 (93%) Query: 1 MAQHGALETLKDLAEKEVDDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRSNLNTDMGNG 60 MA+HGAL TLKDLAEKEV+DAARLLGEMRRGCQQAEEQLKMLIDYQNEYR+NLN+DM G Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 Query: 61 IASNRWINYQQFIQTLEKAIEQHRLQLTQWTQKVDLALKSWREKKQRLQAWQTLQDRQTA 120 I SNRWINYQQFIQTLEKAI QHR QL QWTQKVD+AL SWREKKQRLQAWQTLQ+RQ+ Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120 Query: 121 AALLAENRMDQKKMDEFAQRAAMRKPE 147 AALLAENR+DQKKMDEFAQRAAMRKPE Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147
>FLGHOOKFLIK#Flagellar hook-length control protein signature. Length = 375 Score = 408 bits (1049), Expect = e-144 Identities = 191/409 (46%), Positives = 231/409 (56%), Gaps = 38/409 (9%) Query: 1 MITLPQLITTDTDMTAGLTSGKTTGSAEDFLALLAGALGADGAQGKDARITLADLQAAGG 60 MI L LIT D D T L GK + +A+DFLALL+ AL + K A L Sbjct: 1 MIRLAPLITADVDTTT-LPGGKASDAAQDFLALLSEALAGETTTDKAAPQLL-------- 51 Query: 61 KLSKELLTQHGEPGQAVKLADLLAQKAN---ATDETLTDLTQAQHLLSTLTPSLKTSALA 117 ++ + T GEP + ++D AQ+AN DET + Q + LT + + A Sbjct: 52 -VATDKPTTKGEPLISDIVSD--AQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAA 108 Query: 118 ALSKTAQHDEKTPALSDEDLASLSALFAMLPGQPVATPVAGETPAENHIALPSLLRGDMP 177 K DEK L+++ ASLSALFAMLPG V D P Sbjct: 109 VADKNTTKDEKADDLNEDVTASLSALFAMLPGFDNTPKVT-----------------DAP 151 Query: 178 SAPQEETHTLSFSEHEKGKTEASLARASDDRATGPSLTPLVVAAAATSAKVEVDSPSAPV 237 S F++ T L A D A G PL A +K EV S +PV Sbjct: 152 STVLPTEKPTLFTK----LTSEQLTTAQPDDAPGTPAQPLTPLVAEAQSKAEVISTPSPV 207 Query: 238 THGAAMPTLSSATAQPLPVASAPELSAPLGSHEWQQTFSQQVMLFTRQGQQSAQLRLHPE 297 T AA P ++ QPLP +AP LSAPLGSHEWQQ+ SQ + LFTRQGQQSA+LRLHP+ Sbjct: 208 T-AAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQSLSQHISLFTRQGQQSAELRLHPQ 266 Query: 298 ELGQVHISLKLDDNQAQLQMVSPHSHVRAALEAALPMLRTQLAESGIQLGQSSISSESFA 357 +LG+V ISLK+DDNQAQ+QMVSPH HVRAALEAALP+LRTQLAESGIQLGQS+IS ESF+ Sbjct: 267 DLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPVLRTQLAESGIQLGQSNISGESFS 326 Query: 358 GQQQ-SSSQQQSSRAQHTDAFGAEDDIALAAPASLQAAARGNGAVDIFA 405 GQQQ +S QQQS R + + EDD L P SLQ GN VDIFA Sbjct: 327 GQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVSLQGRVTGNSGVDIFA 375
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 384 bits (987), Expect = e-136 Identities = 86/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%) Query: 5 ILSQAEIDALLNGDS--DTKDEPTPGIASDSDIRPYDPNTQRRVVRERLQALEIINERFA 62 +LSQ EID LL S D E I+ I YD + +E+++ L +++E FA Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63 Query: 63 RQFRMGLFNLLRRSPDITVGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 122 R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+ Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123 Query: 123 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 182 F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L + Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181 Query: 183 EMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 240 E +F I P+++VV ++G G N C+P+ IEP+ L + +S R Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240 Query: 241 HEDQNWRDNLVRQVQHSELELVANFADIPLRLSQILKLKPGDVLPIEKP---DRIIAHVD 297 + L ++ ++++VA + L + IL L+ GD++ + D + + Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300 Query: 298 GVPVLTSQYGTVNGQYALRVEHLI 321 Q G V + A ++ I Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 209 bits (534), Expect = 2e-73 Identities = 136/137 (99%), Positives = 136/137 (99%) Query: 1 MSDMNNPSDENTGALDDLWADALNEQKATTNKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60 MSDMNNPSDENTGALDDLWADALNEQKATT KSAADAVFQQLGGGDVSGAMQDIDLIMDI Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60 Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 Query: 121 RITDIITPSERMRRLSR 137 RITDIITPSERMRRLSR Sbjct: 121 RITDIITPSERMRRLSR 137
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 330 bits (847), Expect = e-117 Identities = 225/245 (91%), Positives = 233/245 (95%) Query: 1 MRRLLFLSLAGLWLFSPAAAAQLPGLISQPLAGGGQSWSLSVQTLVFITSLTFLPAILLM 60 MRRLL ++ LWL +P A AQLPG+ SQPL GGGQSWSL VQTLVFITSLTF+PAILLM Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60 Query: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEQK 120 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSE+K Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120 Query: 121 ISMQEALDKGAQPLRAFMLRQTREADLALFARLANSGPLQGPEAVPMRILLPAYVTSELK 180 ISMQEAL+KGAQPLR FMLRQTREADL LFARLAN+GPLQGPEAVPMRILLPAYVTSELK Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180 Query: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 Query: 241 QSFYS 245 QSFYS Sbjct: 241 QSFYS 245
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 67.5 bits (165), Expect = 1e-18 Identities = 23/78 (29%), Positives = 42/78 (53%) Query: 4 ESVMMMGTEAMKVALALAAPLLLVALITGLIISILQAATQINEMTLSFIPKIVAVFIAII 63 + ++ G +A+ + L L+ +VA I GL++ + Q TQ+ E TL F K++ V + + Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 VAGPWMLNLLLDYVRTLF 81 + W +LL Y R + Sbjct: 62 LLSGWYGEVLLSYGRQVI 79
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 566 bits (1459), Expect = 0.0 Identities = 270/397 (68%), Positives = 312/397 (78%), Gaps = 14/397 (3%) Query: 1 MNRKVLALLVPALLVAGAANAAEIYNKNGNKLDLYGKVDGLRYFSDNAGDDGDQSYARIG 60 M RKVLAL++PALL AGAA+AAEIYNK+GNKLDLYGKVDGL YFSD++ DGDQ+Y R+G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 FKGETQINDMLTGYGQWEYNIKVNTTEGEGANSWTRLGFAGLKFGEYGSFDYGRNYGVIY 120 FKGETQIND LTGYGQWEYN++ NTTEGEGANSWTRL FAGLKFG+YGSFDYGRNYGV+Y Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 121 DIEAWTDALPEFGGDTYTQTDVYMLGRTNGVATYRNTDFFGLVEGLNFALQYQGNNEDPG 180 D+E WTD LPEFGGD+YT D YM GR NGVATYRNTDFFGLV+GLNFALQYQG NE Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 181 AGEGTANGSDANSGSRKLARENGDGFGMSASYDFDFGLSLGAAYSSSDRTDNQVARGYGD 240 A + ++ N+G + +NGDGFG+S +YD G S GAAY++SDRT+ QV G Sbjct: 181 ADDVNIGTNNRNNG-DDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAG--- 236 Query: 241 GMNERNNYTGGETAEAWTVGAKYDAYNVYLAAMYAETRNMTYYGGGNGEGNGGIANKTQN 300 GG+ A+AWT G KYDA N+YLA MY+ETRNMT YG + +GG+ANKTQN Sbjct: 237 -----GTIAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQN 291 Query: 301 FEVVAQYQFDFGLRPSIAYLQSKGKDLGGQEVHRGNWHYTDKDLVKYVDVGMTYYFNKNM 360 FEV AQYQFDFGLRP++++L SKGKDL N + DKDLVKY DVG TYYFNKN Sbjct: 292 FEVTAQYQFDFGLRPAVSFLMSKGKDLT-----YNNVNGDDKDLVKYADVGATYYFNKNF 346 Query: 361 STYVDYKINLLDEDDDFYASNGIATDDIVGVGLVYQF 397 STYVDYKINLLD+DD FY GI+TDDIV +G+VYQF Sbjct: 347 STYVDYKINLLDDDDPFYKDAGISTDDIVALGMVYQF 383
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 31.6 bits (71), Expect = 0.005 Identities = 17/84 (20%), Positives = 36/84 (42%), Gaps = 1/84 (1%) Query: 156 STTAEGAQRRLAEYIQQVDEEVAKELEVDLKDNITLQTKTLQESLETQEVVAQEQKDLRI 215 +T R +A+ + + + EV + T +T+T E+ ET V +E+ + Sbjct: 1058 ATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT-TETKETATVEKEEKAKVET 1116 Query: 216 KQIEEALRYADEAKITQPQIQQTQ 239 ++ +E + + Q Q + Q Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQ 1140
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 88.3 bits (219), Expect = 4e-22 Identities = 64/344 (18%), Positives = 128/344 (37%), Gaps = 47/344 (13%) Query: 5 RIFVAGHRGMVGSAIVRQLAQRG-------------DVEL------VLRTRD----ELDL 41 + V G G +G + ++L + G DV L +L ++DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 42 LDGRAVQAFFAGAGIDQVYLAAAKVGGIVANNTYPADFIYENMMIESNIIHAAHLHNVNK 101 D + FA ++V+++ + + + P + N+ NI+ + + Sbjct: 62 ADREGMTDLFASGHFERVFISPHR-LAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 102 LLFLGSSCIYPKLARQPMAESELLQGTLEPTNEPYAIAKIAGIKLCESYNRQYGRDYRSM 161 LL+ SS +Y + P + P + YA K A + +Y+ YG + Sbjct: 121 LLYASSSSVYGLNRKMPFSTD---DSVDHPVS-LYAATKKANELMAHTYSHLYGLPATGL 176 Query: 162 MPTNLYGPHDNFHPDNSHVIPALLRRFHEAAQSHAPEVVVWGSGTPMREFLHVDDMAAAS 221 +YGP PD AL + + + +V + G R+F ++DD+A A Sbjct: 177 RFFTVYGPWGR--PDM-----ALFKFTKAMLEGKSIDV--YNYGKMKRDFTYIDDIAEAI 227 Query: 222 IHVMELA----REVWQENTAPMLSH-----INVGTGVDCTIRELAQTIAKVVGYQGRVVF 272 I + ++ + E P S N+G + + Q + +G + + Sbjct: 228 IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM 287 Query: 273 DAAKPDGTPRKLLDVTRLHQ-LGWYHEISLEAGLAGTYQWFLEN 315 +P D L++ +G+ E +++ G+ W+ + Sbjct: 288 LPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 107 bits (268), Expect = 2e-28 Identities = 81/361 (22%), Positives = 127/361 (35%), Gaps = 58/361 (16%) Query: 6 LITGVTGQDGSYLAEFLLEKGYEVHGIKRRASSFNTERVDHIYQDPH--------SCNPK 57 L+TG G G ++++ LLE G++V GI + N Y D P Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGI----DNLND------YYDVSLKQARLELLAQPG 53 Query: 58 FHLHYGDLTDASNLTRILQEVQPDEVYNLGAMSHVAVSFESPEYTADVDAMGTLRLLEAI 117 F H DL D +T + + V+ V S E+P AD + G L +LE Sbjct: 54 FQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGC 113 Query: 118 RFLGLEKKTRFYQASTSELYGLVQEIPQKETTPF-YPRSPYAVAKLYAYWITVNYRESYG 176 R ++ AS+S +YGL +++P +P S YA K + Y YG Sbjct: 114 RHNKIQ---HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170 Query: 177 IYACNGILFNHESPRRGETFVTRKITRAIANIAQGLESCLYLGNMDSLRDWGHAKDYVRM 236 + A F P K T+A+ G +Y RD+ + D Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLE---GKSIDVY-NYGKMKRDFTYIDD---- 222 Query: 237 QWMMLQQEQPEDFVIATGVQYSVRQFVELAAAQLGIKLRFEGEGINEKGIVVSVTGHDAP 296 IA + +R + A + + V G+ +P Sbjct: 223 --------------IAEAI---IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSP 265 Query: 297 GVKPGDVIVAV--------DPRY--FRPAEVETLLGDPSKAHEKLGWKPEITLSEMVSEM 346 V+ D I A+ +P +V D +E +G+ PE T+ + V Sbjct: 266 -VELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNF 324 Query: 347 V 347 V Sbjct: 325 V 325
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 49.0 bits (117), Expect = 2e-08 Identities = 31/129 (24%), Positives = 56/129 (43%), Gaps = 20/129 (15%) Query: 132 TMMVHIRHTAHSQ-LPEAITQAVIGRPINFQGLGGDDANRQAQGILERAAKRAGFQDVVF 190 M+ H HS + ++ P+ R+A + +A+ AG ++V Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGA-----TQVERRA---IRESAQGAGAREVFL 140 Query: 191 QYEPVAAGLDYEATLREEKRVLVVDIGGGTTDCSMLLMGPQWRQRADRENSLLGHSGCRV 250 EP+AA + + E +VVDIGGGTT+ +++ + ++ S R+ Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189 Query: 251 GGNDLDIAL 259 GG+ D A+ Sbjct: 190 GGDRFDEAI 198 Score = 35.9 bits (83), Expect = 2e-04 Identities = 25/81 (30%), Positives = 39/81 (48%), Gaps = 12/81 (14%) Query: 377 ALDQPLARILEQVRLALDSAQEKPDV--------IYLTGGSARSPLIKKALSEQLPGIPV 428 AL +PL I+ V +AL+ Q P++ + LTGG A + + L E+ GIPV Sbjct: 259 ALQEPLTGIVSAVMVALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIPV 315 Query: 429 AGGDD-FGSVTAGLARWAEVV 448 +D V G + E++ Sbjct: 316 VVAEDPLTCVARGGGKALEMI 336
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 42.1 bits (99), Expect = 3e-06 Identities = 36/172 (20%), Positives = 71/172 (41%), Gaps = 10/172 (5%) Query: 107 KVALAQAQGQLAKDNATLANARRDLARYQQ---LAKTNLVSRQELDAQQAL--VNETQGT 161 K A+ + + + + L + L + + AK +L + L + +T Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310 Query: 162 IKADEANVASAQLQLDWSRITAPVSGRV-GLKQVDVGNQISSSDTAGIVVITQTHPIDLI 220 I +A + + S I APVS +V LK G +++++T +V++ + +++ Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVT 369 Query: 221 FTLPESDIATVVQAQKAGKTLVVEAWDRTNSHKL-SEGVLLSLDNQIDPTTG 271 + DI + Q A + VEA+ T L + ++LD D G Sbjct: 370 ALVQNKDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419 Score = 40.6 bits (95), Expect = 9e-06 Identities = 20/122 (16%), Positives = 46/122 (37%), Gaps = 13/122 (10%) Query: 63 GTVTAA-NTVTVRSRVDGQLIALHFQEGQQVNAGDLLAQIDPSQFKVALAQAQGQLAKDN 121 G +T + + ++ + + + +EG+ V GD+L ++ + + Q Sbjct: 88 GKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQ------- 140 Query: 122 ATLANARRDLARYQQLAKTNLVSRQELDAQQALVNETQGTIKADEANVASAQLQLDWSRI 181 ++L AR + RYQ L+++ EL+ L + + L + Sbjct: 141 SSLLQARLEQTRYQILSRS-----IELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQF 195 Query: 182 TA 183 + Sbjct: 196 ST 197
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 886 bits (2290), Expect = 0.0 Identities = 291/1036 (28%), Positives = 503/1036 (48%), Gaps = 29/1036 (2%) Query: 13 SRLFILRPVATTLLMAAILLAGIIGYRFLPVAALPEVDYPTIQVVTLYPGASPDVMTSAV 72 + FI RP+ +L +++AG + LPVA P + P + V YPGA + V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVVTLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131 T +E+ + L MSS S S G+ +TL FQ D+A+ +VQ + AT LLP + Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 132 LPNPPIYSKVNPADPPIMTLAVTSNAMPMTQVE--DMVETRVAQKISQVSGVGLVTLAGG 189 + I S + +M S+ TQ + D V + V +S+++GVG V L G Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 190 QRPAVRVKLNAQAVAALGLTSETVRTAITGANVNSAKGSLDGP------ERAVTLSANDQ 243 Q A+R+ L+A + LT V + N A G L G + ++ A + Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 244 MQSADEYRRLII-AYQNGAPVRLGDVATVEQGAENSWLGAWANQAPAIVMNVQRQPGANI 302 ++ +E+ ++ + +G+ VRL DVA VE G EN + A N PA + ++ GAN Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 303 IATADSIRQMLPQLTESLPKSVKVTVLSDRTTNIRASVRDTQFELMLAIALVVMIIYLFL 362 + TA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 363 RNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422 +N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAVTLAVAIL 481 + E P A K +I ++ + L AV IP+ F G G ++R+F++T+ A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 482 ISAVVSLTLTPMMCARML---SQQSLRKQNRFSRACERMFDRVIASYGRGLAKVLNHPWL 538 +S +V+L LTP +CA +L S + + F FD + Y + K+L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 539 TLSVAFATLLLSVMLWIVIPKGFFPVQDNGIIQGTLQAPQSSSYASMAQRQRQVAERILQ 598 L + + V+L++ +P F P +D G+ +Q P ++ + QV + L+ Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 599 DPA--VQSLTTFVGVDGANSTLNSTRLQINLKPLDARDDR---VQQVISRLQTAVATIPG 653 + V+S+ T G + N+ ++LKP + R+ + VI R + + I Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 654 VALYLQPTQDLTIDTQVSRTQYQFSLQ---ATTLDALSHWVPKL-QNALQSLPQLSEVSS 709 ++ P I + T + F L DAL+ +L A Q L V Sbjct: 660 G--FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 710 DWQDRGLAAWVNVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTA 769 + + + VD++ A LG+S++D++ + A G ++ + ++ ++ + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 770 STPGLAALETIRLTSRDGGTVPLSAIARIEQRFAPLSINHLDQFPITTFSFNVPEGYSLD 829 ++ + + S +G VP SA + + + P G S Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 830 DAVQAILDTEKTLALPADITTQFQGSTLAFQAALGSTVWLIVAAVVAMYIVLGVLYESFI 889 DA+ + + LPA I + G + + + L+ + V +++ L LYES+ Sbjct: 838 DAMALMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 890 HPITILSTLPTAGVGALLALIIAGSELDIIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949 P++++ +P VG LLA + + D+ ++G++ IG+ KNAI++++FA ++ Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 950 GMSPRDAIFQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIAMVGGLLVSQV 1009 G +A A +R RPILMT+LA +LG LPL +S G G+ + +GI ++GG++ + + Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015 Query: 1010 LTLFTTPVIYLLFDRL 1025 L +F PV +++ R Sbjct: 1016 LAIFFVPVFFVVIRRC 1031
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 880 bits (2275), Expect = 0.0 Identities = 282/1035 (27%), Positives = 503/1035 (48%), Gaps = 36/1035 (3%) Query: 6 LFIYRPVATILIAAAITLCGILGFRLLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65 FI RP+ ++A + + G L LPVA P + P + VSA+ PGA +T+ +V Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLERSLGRIAGVNEMTSSS-SLGSTRIILEFNFDRDINGAARDVQAAINAAQSLLPGGMP 124 +E+++ I + M+S+S S GS I L F D + A VQ + A LLP + Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 125 SRPTYRKANPSDAPIMILTLTSES--WSQGKLYDFASTQLAQTIAQIDGVGDVDVGGSSL 182 + S + +M+ S++ +Q + D+ ++ + T+++++GVGDV + G+ Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182 Query: 183 PAVRVGLNPQALFNQGVSLDEVREAIDSANVRRPQGAIEDSV------HRWQIQTNDELK 236 A+R+ L+ L ++ +V + N + G + + I K Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 237 TAAEYQPLIIHYN-NGAAVRLGDVASVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295 E+ + + N +G+ VRL DVA V ++ N KPA L I+ AN + Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 296 TVDGIRAKLPELRAMIPAAIDLQIAQDRSPTIRASLQEVEETLAISVALVIMVVFLFLRS 355 T I+AKL EL+ P + + D +P ++ S+ EV +TL ++ LV +V++LFL++ Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 356 GRATLIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414 RATLIP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R + Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 415 EAGMKPLQAALQGTREVGFTVISMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474 E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 475 LVVSLTLTPMMCGWMLKSSKPRTQPRKRGVG----RLLVALQQGYGTSLKWVLNHTRLVG 530 ++V+L LTP +C +LK K G Y S+ +L T Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 531 VVFLGTVALNIWLYIAIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586 +++ VA + L++ +P +F PE+D GV + IQ + + + ++K Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 587 RD-DPAVNNVTGFT-GGSRVNSGMMFITLKPRGER---KETAQQIIDRLRVKLAKEPGAR 641 + +V V GF+ G N+GM F++LKP ER + +A+ +I R +++L K Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 642 LFLMAVQDIRVGGRQANASYQYTLLSDSLAALREWEPKIRKALSAL-----PQLADVNSD 696 + + I G ++ L D + + R L + L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 697 QQDNGAEMNLIYDRDTMSRLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRY 756 ++ A+ L D++ LG+ + N ++ A G ++ K+ ++ D ++ Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 757 SQDISALEKMFVINRDGKAIPLSYFAQWRPANAPLSVNHQGLSAASTIAFNLPTGTSLSQ 816 ++K++V + +G+ +P S F + + I GTS Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838 Query: 817 ATEAINRTMTQLGVPPTVRGSFSGTAQVFQQTMNSQLILIVAAIATVYIVLGILYESYVH 876 A + ++L P + ++G + + + N L+ + V++ L LYES+ Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 877 PLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRSGG 936 P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V+FA + G Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 937 LTPAQAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996 +A A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 997 TLYTTPVVYLFFDRL 1011 ++ PV ++ R Sbjct: 1017 AIFFVPVFFVVIRRC 1031
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 31.0 bits (70), Expect = 0.010 Identities = 20/66 (30%), Positives = 26/66 (39%), Gaps = 14/66 (21%) Query: 187 RGLLAPVKRLVEGTHRLAAGDFTTRVTPTSADEL-----------GKLAQDFNQLASTLE 235 L+A V+ V H LA + P S + L G L N+LA E Sbjct: 104 SQLMAAVRSKVMEGHSLAD---AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTE 160 Query: 236 KNQQMR 241 + QQMR Sbjct: 161 QRQQMR 166
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 74.9 bits (184), Expect = 1e-17 Identities = 28/140 (20%), Positives = 65/140 (46%), Gaps = 2/140 (1%) Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLINHGDKLLPYVRQTPPDLILLDLMLPGTDGL 70 IL+ +D+ + +L L A Y + ++ L ++ DL++ D+++P + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 71 TLCREIR-RFSDIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTIL-RRC 128 L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 129 KPQRELQQQDAESPLMIDES 148 + +L+ + ++ S Sbjct: 124 RRPSKLEDDSQDGMPLVGRS 143
>PF05932#Tir chaperone protein (CesT) Length = 127 Score = 86.0 bits (213), Expect = 8e-25 Identities = 30/118 (25%), Positives = 46/118 (38%), Gaps = 3/118 (2%) Query: 6 DRLLRQFSLKLNTDSIVFDENRLCSFIIDNRYRI-LLTSTNSEYIMIYGFCGRPPDNNNL 64 LL FS L +VFD++ C+ IIDN + + L E +++ G P + Sbjct: 7 KTLLDDFSRSLEMQPLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLE--PHKDIP 64 Query: 65 AFEFLNANLWFAENNGPHLCYDNNSQSLLLALNFSLNESSVEKLECEIEVVIRSMENL 122 L L N GP L D S + + SV L+ E+ ++ M Sbjct: 65 QQCLLAGALNPLLNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMRGW 122
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 37.1 bits (86), Expect = 1e-04 Identities = 33/153 (21%), Positives = 52/153 (33%), Gaps = 20/153 (13%) Query: 253 FSEIFFMLALPFFTKRFGIKKVLLLGLITAAIRYGFFVYGGAETYFTYALLFLGILLHGV 312 + L + RFG + VLL+ L AA+ Y +L++G ++ G+ Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-----FLWVLYIGRIVAGI 108 Query: 313 SYDFYYVTAYIYVDKKAPVHMRTAAQGLITLCCQGFGSLLGYRLGGVMMEKMFAYPQPVN 372 + V D R G ++ C GFG + G LGG+M P Sbjct: 109 TGATGAVAGAYIADI-TDGDERARHFGFMS-ACFGFGMVAGPVLGGLMGGFSPHAP---- 162 Query: 373 GLTFNWAGMWTFGAVMIAVIALLFMIFFRESDK 405 + A + + L ES K Sbjct: 163 ---------FFAAAALNGLNFLTGCFLLPESHK 186 Score = 32.5 bits (74), Expect = 0.003 Identities = 55/286 (19%), Positives = 93/286 (32%), Gaps = 17/286 (5%) Query: 29 LNKSGFSAGEIGWSYACTAIAAILSPILVGSVTDRFFSAQKVLAVLMFAGAVLMYFAAQQ 88 L S G A A+ ++G+++DRF ++ + ++ AGA + Y Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAI--- 89 Query: 89 TTFAGFFPLLLAYSLTYMPTIALTNSIAFANVPDVERDFPRIRVMGTIG-WIASGLACGF 147 A F +L + T A T ++A A + D+ R R G + G+ G Sbjct: 90 MATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG- 147 Query: 148 LPQMLGY-NDISPTNTPLLITAASSALLGVFAFCLPDTPPKSTGKMDIKVMLGLDALVLL 206 P + G SP + P AA + L + L K + + L A Sbjct: 148 -PVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRW 205 Query: 207 RDKN------FLVFFFCSFLFAMPLAFYYIFANGYLTEVGMKNATGWMTLGQFSEIFFML 260 VFF + +P A + IF G + + Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM 265 Query: 261 ALPFFTKRFGIKKVLLLGLITAAIRYGFFVYGGAETYFTYALLFLG 306 R G ++ L+LG+I Y + ++ L Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLA 311
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 26.8 bits (59), Expect = 0.019 Identities = 10/43 (23%), Positives = 17/43 (39%), Gaps = 7/43 (16%) Query: 3 SKLLPCALLLATSFAWAAPA-------TTGIDQYELKSFIADF 38 ++L LLL +S++WA L+ + DF Sbjct: 10 KRVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDF 52
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 681 bits (1759), Expect = 0.0 Identities = 246/839 (29%), Positives = 389/839 (46%), Gaps = 26/839 (3%) Query: 2 LRMTPIASLVLLTLFTWQTQAIATETFDTHFMVGGMRDQKITNFHLDENKPIPGQYELDI 61 L + V + A F+ F+ + + + + PG Y +DI Sbjct: 23 LAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDI 82 Query: 62 YVNNQWRGKYDIIVADEPGST----CISTELLKNIGVISDGLQPQ---GATDCIALKDVV 114 Y+NN + D+ C++ L ++G+ + + C+ L ++ Sbjct: 83 YLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMI 142 Query: 115 RSGGYTFNIGVFRLDLSVPQAYVNEVEAGYVLPENWDRGINAFYTSYYASQYYSDYKNSG 174 ++G RL+L++PQA+++ GY+ PE WD GINA +Y S + G Sbjct: 143 HDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGG 202 Query: 175 SSESTYVRFNSGFNLLGWQAHADTTFNKTD-----GSSGEWKSNTLYLERGIAELLGTLR 229 +S Y+ SG N+ W+ +TT++ GS +W+ +LER I L L Sbjct: 203 NSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLT 262 Query: 230 AGDQYTSSEIFDSVRFTGVRLFRDMQMLPNSKQNFTPLVQGIAQTNALVTIEQNGFVVYQ 289 GD YT +IFD + F G +L D MLP+S++ F P++ GIA+ A VTI+QNG+ +Y Sbjct: 263 LGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYN 322 Query: 290 KEVPPGPFSIADLQLAGGGADLDVTVREADGSINTWLVPYASVPNMLQPGVSKYDFSAGR 349 VPPGPF+I D+ AG DL VT++EADGS + VPY+SVP + + G ++Y +AG Sbjct: 323 STVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGE 382 Query: 350 SHIEGADNQAD-FTQISYQYGLNNLLTLYGGTMLSNHYNAFTLGTGWNT-RIGAISLDAT 407 A + F Q + +GL T+YGGT L++ Y AF G G N +GA+S+D T Sbjct: 383 YRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMT 442 Query: 408 RSHSKQDNGDVFDGQSYQIAYNKYLTQTLTRFGLAAYRYSSQDYRTFNDHVWANNKNNYR 467 +++S + DGQS + YNK L ++ T L YRYS+ Y F D ++ Sbjct: 443 QANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNI 502 Query: 468 RDKNDVYDI----ADYYQNDFGRKNTFSANVSQSLPEGWGAVSLSALWRDYWGRSGTSKD 523 ++ V + DYY + ++ V+Q L + LS + YWG S + Sbjct: 503 ETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSNVDEQ 561 Query: 524 YQISYSNTFQKINYTLSASQTYDE-DHNEDKRFNLFISIPFD--WGDGITTPRRHLNVSN 580 +Q + F+ IN+TLS S T + D+ L ++IPF + RH + S Sbjct: 562 FQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASY 621 Query: 581 STTFDDDGFTSNNIGLTGTAGSRDQFNYGVNVSH---QRHDSETTAGTNLTWNTPVATLN 637 S + D +G +N G+ GT + +Y V + +S +T L + N Sbjct: 622 SMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNAN 681 Query: 638 GSYSQSSNYTQTGGSISGGVVAWSGGLNLSSRLSDTFAIMQAPGLEGAYVNGQKYRTTNK 697 YS S + Q +SGGV+A + G+ L L+DT +++APG + A V Q T+ Sbjct: 682 IGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDW 741 Query: 698 KGTVVYDNLTPYRENHLMLDVSQSSSEAELRGNRKVAAPYRGAVVLVNFDTDQRKPWFIK 757 +G V T YREN + LD + + +L P RGA+V F + Sbjct: 742 RGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMT 801 Query: 758 AQRPDGSPLIFGYDVVDHHGHNVGIVGQGSQLFIRTNDIPPEVSVPVDKEQGLSCSITF 816 + PL FG V + GIV Q+++ + +V V +E+ C + Sbjct: 802 L-THNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANY 859
>PF06580#Sensor histidine kinase Length = 349 Score = 220 bits (561), Expect = 5e-69 Identities = 60/216 (27%), Positives = 116/216 (53%), Gaps = 3/216 (1%) Query: 328 LGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQA 387 L G + + + ++ ++++ L AQ+NPHF+FNALN I+A+I D +A Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKA 193 Query: 388 SQLVQYLSTFFRKNLKR-PSEIVTLADEIEHVNAYLQIEKARFQSRLQVQLDVPSTLSRQ 446 +++ LS R +L+ + V+LADE+ V++YLQ+ +F+ RLQ + + + Sbjct: 194 REMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDV 253 Query: 447 KLPAFTLQPIVENAIKHGTSQLLDTGNVAIRARREGQHLMLDIEDNAGLYQPSAG-SSGL 505 ++P +Q +VEN IKHG +QL G + ++ ++ + L++E+ L + S+G Sbjct: 254 QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313 Query: 506 GMSLVDKRLREHFGDDYGISVACEPDCFTRITLRLP 541 G+ V +RL+ +G + I ++ + + +P Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 37.5 bits (87), Expect = 5e-05 Identities = 38/180 (21%), Positives = 68/180 (37%), Gaps = 6/180 (3%) Query: 11 LALMLAVPFAPQAVAKTAATTAASQPEIASGSAMI-VDLNTNKVIYSNHPDLVRPIASIT 69 ++L+ +P A A + S+ +++ MI +DL + + + + D P+ S Sbjct: 9 ISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTF 68 Query: 70 KLMTAMVVLDARLPLDEILKVDISQTPEMKGVYSRV---RLNSEISRKNMLLLALMSSEN 126 K++ VL DE L+ I + YS V L ++ + A+ S+N Sbjct: 69 KVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSDN 128 Query: 127 RAAASLAHYY--PGGYNAFIKAMNAKAKALGMTHTRFVEPTGLSIHNVSTARDLTKLLIA 184 AA L P G AF++ + L T E + +T + L Sbjct: 129 SAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPASMAATLRK 188
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 27.5 bits (61), Expect = 0.031 Identities = 8/39 (20%), Positives = 16/39 (41%), Gaps = 1/39 (2%) Query: 152 WLHDLDQHLRH-GVWLILAIVLVVGVRWWLKRRGKAEAR 189 L + +R G W++LA++ + R+ K Sbjct: 215 VLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQEKRRVS 253
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 112 bits (281), Expect = 4e-32 Identities = 68/253 (26%), Positives = 116/253 (45%), Gaps = 12/253 (4%) Query: 3 KVAIVTASDSGIGKACALLLAQNGFDIGITWHSDERGAQETAKKAAQFGVRAETIHLDLS 62 K+A +T + GIG+A A LA G I ++ E+ + + A+ AE D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE-ARHAEAFPADVR 67 Query: 63 QLPEGAQAIEYLIQRLGRVDVLVNNAGAMTKSAFIDMPFTQWRQIFTVDVDGAFLCAQIA 122 + + + +G +D+LVN AG + + +W F+V+ G F ++ Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 123 ARHMIKQGEGGRIINITSVHEHTPLPQASAYTAAKHALGGLTKSMALELIEHHILVNAVA 182 +++M+ + G I+ + S P +AY ++K A TK + LEL E++I N V+ Sbjct: 128 SKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 183 PGAIATPM-------NDMDDSDIEPGSEP---SIPIARPGSTHEIASLVAWLCSEGASYT 232 PG+ T M + + I+ E IP+ + +IA V +L S A + Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 233 TGQSLIVDGGFML 245 T +L VDGG L Sbjct: 247 TMHNLCVDGGATL 259
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 53.3 bits (128), Expect = 1e-09 Identities = 66/402 (16%), Positives = 142/402 (35%), Gaps = 19/402 (4%) Query: 22 RVIICCFLVVMLDGFDTAAIGFIAPDIRTHWQLSASELAPLFGAGLLGLTAGALLCGPLA 81 +++I ++ + + PDI + + + A +L + G + G L+ Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73 Query: 82 DRFGRKRVIELCVALFGALSLLSAFS-PDIETLVLLRFLTGLGLGGAMPNTIT-MTSEYL 139 D+ G KR++ + + S++ L++ RF+ G G A P + + + Y+ Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG-AAAFPALVMVVVARYI 132 Query: 140 PARRRGALVTLMFCGFTLGSATGGIVSAQLVPLIGWHGILALGGILPLMLFFGLLFALPE 199 P RG L+ +G G + + I W +L + I + + F L+ L + Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPF-LMKLLKK 191 Query: 200 SPRWQVRRQLPQAV---------VARTVSAITGERYLDTQFFLHETAAIAKGSI----RQ 246 R + + + + T S + FL I K + Sbjct: 192 EVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPG 251 Query: 247 LFAGRQLVITLMLWVVFFMSLLIIYLLSSWMPTLLNHRGIDLQQASWVTAAFQVGGTLGA 306 L +I ++ + F ++ + +M ++ + + G Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFG- 310 Query: 307 LLLGVLMDRLNPFRVLAVSYALGAVCIVMIGLSENG-LWLMALAIFGTGIGISGSQVGLN 365 + G+L+DR P VL + +V + W M + I G+S ++ ++ Sbjct: 311 YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIS 370 Query: 366 ALTATLYPTQSRATGVSWSNAIGRCGAIVGSLSGGMMMALNF 407 + ++ Q G+S N G G ++++ Sbjct: 371 TIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPL 412 Score = 41.8 bits (98), Expect = 4e-06 Identities = 40/169 (23%), Positives = 73/169 (43%), Gaps = 1/169 (0%) Query: 251 RQLVITLMLWVVFFMSLLIIYLLSSWMPTLLNHRGIDLQQASWVTAAFQVGGTLGALLLG 310 R I + L ++ F S+L +L+ +P + N +WV AF + ++G + G Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 311 VLMDRLNPFRVLAVSYALGAVCIVMIGLSENGLWLMALAIFGTGIGISGSQVGLNALTAT 370 L D+L R+L + V+ + + L+ +A F G G + + + A Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130 Query: 371 LYPTQSRATGVSWSNAIGRCGAIVGSLSGGMMM-ALNFSFDTLFFVIAI 418 P ++R +I G VG GGM+ +++S+ L +I I Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITI 179
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 67.2 bits (164), Expect = 3e-15 Identities = 23/114 (20%), Positives = 48/114 (42%), Gaps = 2/114 (1%) Query: 9 VLIVDDHPLMRRGIRQLLELDPAFHVVAEAGDGASAIDLANRIEPDLILLDLNMKGLSGL 68 +L+ DD +R + Q L A + V + A+ + DL++ D+ M + Sbjct: 6 ILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 69 DTLNALRRDGVTAQIIILTVSDSASDIYALIDAGADGYLLKDSDPEVLLEAIRK 122 D L +++ +++++ ++ + GA YL K D L+ I + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>PF04335#VirB8 type IV secretion protein Length = 227 Score = 29.0 bits (65), Expect = 0.006 Identities = 10/30 (33%), Positives = 12/30 (40%) Query: 1 MNLRRKNRLWVVCAVLAGLALTTALVLYAL 30 R K WVV V LA + + AL Sbjct: 27 AAERSKKLAWVVAGVAGALATAGVVAVAAL 56
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 30.2 bits (68), Expect = 0.002 Identities = 11/74 (14%), Positives = 27/74 (36%), Gaps = 12/74 (16%) Query: 1 MIDWQDLHHSELTVPQLYALLKLRCAVFV--------VEQRCPYLDVDGDDLVGDNRHIL 52 M++ D++H+ L+ + L LR F + D + + ++ Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNN----NTTYLF 56 Query: 53 GWHQDELVAYARIL 66 G + ++ R + Sbjct: 57 GIKDNTVICSLRFI 70
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 47.1 bits (112), Expect = 5e-08 Identities = 32/135 (23%), Positives = 56/135 (41%), Gaps = 16/135 (11%) Query: 185 PGAVAIVAEDSKVARAMLEKGLNAMGIPHQMHVTGKDAWERIQQLAQEAEAEGKPISEKI 244 GA +VA+D R +L + L+ G ++ W I + Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA-------------AGDG 48 Query: 245 ALVLTDLEMPEMDGFTLTRKIKTDERLKKIPVVIHSSLSGSANEDHIRKVKADGYVAK-F 303 LV+TD+ MP+ + F L +IK + +PV++ S+ + + A Y+ K F Sbjct: 49 DLVVTDVVMPDENAFDLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106 Query: 304 EINELSSVIQEMLER 318 ++ EL +I L Sbjct: 107 DLTELIGIIGRALAE 121
>SECA#SecA protein signature. Length = 901 Score = 32.2 bits (73), Expect = 0.011 Identities = 46/189 (24%), Positives = 70/189 (37%), Gaps = 36/189 (19%) Query: 474 VDGIDSDLQNKIDVIVQALAGAKKPLIISGTNAGSSEVIQAAANVAKALKGRGADVGITM 533 VD +DS L ID A+ PLIISG SSE+ + + L + + T Sbjct: 208 VDEVDSIL---ID-------EARTPLIISGPAEDSSEMYKRVNKIIPHLIRQEKEDSETF 257 Query: 534 IA----------RSVNSMGLGM-------MGGGSLDDALGELETGNADAVVVLENDLHRH 576 R VN G+ + G +D+ N + + L H Sbjct: 258 QGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYSPANIMLMHHVTAALRAH 317 Query: 577 ASATRVNAALAKAPLVMVVDHQRTAIMENAHLV--LSAASFAESDGTVINNEGRA----- 629 A TR + K V++VD M+ L A A+ +G I NE + Sbjct: 318 ALFTRDVDYIVKDGEVIIVDEHTGRTMQGRRWSDGLHQAVEAK-EGVQIQNENQTLASIT 376 Query: 630 -QRFFQVYD 637 Q +F++Y+ Sbjct: 377 FQNYFRLYE 385
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 28.3 bits (63), Expect = 0.019 Identities = 18/56 (32%), Positives = 25/56 (44%), Gaps = 3/56 (5%) Query: 68 MVTSFT---AVHDVARFGAEVLRASPRQADLMVVAGTCFTKMAPVIQRLYDQMLEP 120 M+TSFT V + R A P Q L + F M+PVI ++Y +P Sbjct: 60 MMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQP 115
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.0 bits (75), Expect = 2e-04 Identities = 16/102 (15%), Positives = 39/102 (38%), Gaps = 4/102 (3%) Query: 24 LRPWNDPEMDIERKVNHDVSLFLVAEVSGEVVG--TVMGGYDGHRGSAYYLGVHPEFRGR 81 + + D +MD+ + FL + +G + ++G + V ++R + Sbjct: 47 FKQYEDDDMDVSYVEEEGKAAFL-YYLENNCIGRIKIRSNWNG-YALIEDIAVAKDYRKK 104 Query: 82 GIANALLNRLEKKLIARGCPKIQIMVRDDNDVVLGMYERLGY 123 G+ ALL++ + + + +D N Y + + Sbjct: 105 GVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 48.6 bits (116), Expect = 9e-09 Identities = 32/116 (27%), Positives = 51/116 (43%), Gaps = 9/116 (7%) Query: 64 VRDGIVWDFFGAVTLVRRHLDTLEQQLGCRFT-HAATSFPPGTDP---RISINVLESAGL 119 ++DG++ DFF +++ + + R + P G R + AG Sbjct: 76 MKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGA 135 Query: 120 EVSHVLDEPTAVA---DLLALDNAG--VVDIGGGTTGIAIVKQGKVTYSADEATGG 170 +++EP A A L + G VVDIGGGTT +A++ V YS+ GG Sbjct: 136 REVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGG 191
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1274 bits (3298), Expect = 0.0 Identities = 647/1032 (62%), Positives = 797/1032 (77%), Gaps = 2/1032 (0%) Query: 1 MANFFIDRPIFAWVLAILLCLTGALAIFSLPVEQYPDLAPPNVRITANYPGASAQTLENT 60 MANFFI RPIFAWVLAI+L + GALAI LPV QYP +APP V ++ANYPGA AQT+++T Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMTGLDNLMYMSSQSSGTGQATITLSFIAGTAPDEAVQQVQNQLQSAMRKLPQ 120 VTQVIEQNM G+DNLMYMSS S G TITL+F +GT PD A QVQN+LQ A LPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 AVQDQGVTVRKTGDTNILTIAFVSTDGSMDKQDIADYVASNIQDPLSRVNGVGDIDAYGS 180 VQ QG++V K+ + ++ FVS + + DI+DYVASN++D LSR+NGVGD+ +G+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYSMRIWLDPAKLNSFQMTTKDVTDAIESQNAQIAVGQLGGTPSVDKQALNATINAQSLL 240 QY+MRIWLD LN +++T DV + ++ QN QIA GQLGGTP++ Q LNA+I AQ+ Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 QTPQQFRDITLCVNQDGSEVKLGDVATVELGAEKYDYLSRFNGNPASGLGVKLASGANEM 300 + P++F +TL VN DGS V+L DVA VELG E Y+ ++R NG PA+GLG+KLA+GAN + Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 ATAKLVLDRLNELAQYFPHGLEYKIAYETTSFVKASIIDVVKTLLEAIALVFLVMYLFLQ 360 TAK + +L EL +FP G++ Y+TT FV+ SI +VVKTL EAI LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLMGTFSVLYAFGYSINTLTMFAMVLAIGLLVDDAIVVVENVERIM 420 N RATLIPTIAVPVVL+GTF++L AFGYSINTLTMF MVLAIGLLVDDAIVVVENVER+M Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 SEEGLTPREATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGTTGAIYRQFSITIVSAMVL 480 E+ L P+EAT KSM QIQGALVGIAMVLSAVF+PMAFFGG+TGAIYRQFSITIVSAM L Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVAMILTPALCATLLKPLHKGEQHGQRGFFGWFNRTFNRNAERYEKGVAKILHRSLRW 540 SVLVA+ILTPALCATLLKP+ + GFFGWFN TF+ + Y V KIL + R+ Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 ILIYVLLLGGMVFLFLRLPTSFLPQEDRGMFTTSIQLPSGSTQQQTLKVVEKVENYYFTH 600 +LIY L++ GMV LFLRLP+SFLP+ED+G+F T IQLP+G+TQ++T KV+++V +YY + Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKNNIMSVFSTVGSGPGGNGQNVARMFVRLKDWDARDPTTGSSFAIIERATKAFNQIKEA 660 EK N+ SVF+ G G QN FV LK W+ R+ S+ A+I RA +I++ Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 RVFASSPPAISGLGSSAGFDMELQDHAGAGHDALMAARDQLIELAGKN-SSLTRVRHNGL 719 V + PAI LG++ GFD EL D AG GHDAL AR+QL+ +A ++ +SL VR NGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 720 DDSPQLQIDIDQRKAQALGVSIDDINDTLQTAWGSSYVNDFMDRGRVKKVYVQAAAKYRM 779 +D+ Q ++++DQ KAQALGVS+ DIN T+ TA G +YVNDF+DRGRVKK+YVQA AK+RM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 780 LPDDINLWYVRNKDGGMVPFSAFATSRWETGSPRLERYNGYSAVEIVGEAAPGVSTGTAM 839 LP+D++ YVR+ +G MVPFSAF TS W GSPRLERYNG ++EI GEAAPG S+G AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 840 DVMESLVHQLPGGFGLEWTAMSYQERLSGAQAPALYAISLLVVFLCLAALYESWSVPFSV 899 +ME+L +LP G G +WT MSYQERLSG QAPAL AIS +VVFLCLAALYESWS+P SV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 900 MLVVPLGVIGALLATWMRGLENDVYFQVGLLTVIGLSAKNAILIVEFANE-MNQKGHALL 958 MLVVPLG++G LLA + +NDVYF VGLLT IGLSAKNAILIVEFA + M ++G ++ Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 959 DATLYASRQRPRPILMTSLAFIFGVLPMATSTGAGSGSQHAVGTGVTGGMISATILAIFF 1018 +ATL A R R RPILMTSLAFI GVLP+A S GAGSG+Q+AVG GV GGM+SAT+LAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1019 VPLFFVLIRRRF 1030 VP+FFV+IRR F Sbjct: 1021 VPVFFVVIRRCF 1032
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 32.1 bits (73), Expect = 0.005 Identities = 24/122 (19%), Positives = 50/122 (40%), Gaps = 13/122 (10%) Query: 459 SRIAVHPARQREGIGQQLIVCACMQAAQCDYLSVSFGYT-------PELWRFWQRCGFVL 511 SR V +R ++ +G + + + + + +Y S GY + +R G+ Sbjct: 100 SRFFVDKSRAKDILGNEYPISSMLFLSMINY-SKDKGYDGIYTIVSHPMLTILKRSGWG- 157 Query: 512 VRMGNHREASSGCYTAMALLPLSDAG-KRLAQQEHRRLRRDADILTQWNGEAIPLAALDE 570 +R+ + + LP+ D + LA++ +R ++ L QW + + A Sbjct: 158 IRVVEQGLSEKEERVYLVFLPVDDENQEALARRINRSGTFMSNELKQW---PLRVPAAIA 214 Query: 571 QA 572 QA Sbjct: 215 QA 216
>SECA#SecA protein signature. Length = 901 Score = 53.0 bits (127), Expect = 4e-09 Identities = 18/26 (69%), Positives = 21/26 (80%) Query: 767 RKSKKIGRNVPCPCGSGMKYKRCHGR 792 +K+GRN PCPCGSG KYK+CHGR Sbjct: 874 TGERKVGRNDPCPCGSGKKYKQCHGR 899
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 31.7 bits (72), Expect = 0.007 Identities = 71/362 (19%), Positives = 119/362 (32%), Gaps = 52/362 (14%) Query: 44 SHAINLFSAYA-SLVYVTPILGGWLADRLLGNRVAVITGALLMTLGHVVLGLESDSTLSL 102 +H L + YA P+LG +DR G R ++ + + ++ L + Sbjct: 43 AHYGILLALYALMQFACAPVLGAL-SDRF-GRRPVLLVSLAGAAVDYAIMATAP--FLWV 98 Query: 103 YAALAIIICGYGLFKSNISCLLGELYAPDDNRRDGGFSLLYAAGNIGSIAAPIACGLAAQ 162 I+ G + + ++ D+ R GF + A G +A P+ GL Sbjct: 99 LYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF--MSACFGFGMVAGPVLGGLMGG 156 Query: 163 WYGWHVGFALAGVGMFIGLLIFLSGHRHFQQTRGVNRPALRAVKFALPT-WGWLVLMLCI 221 + H F A + L FL+G ++ R LR + W M + Sbjct: 157 -FSPHAPFFAAA---ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVV 212 Query: 222 APVFFTLLLENNWSGYVLAIVCVFAAQLI----ARIMVKFPEHRRALWQIVLLMITGTLF 277 A + F QL+ A + V F E R W + I+ F Sbjct: 213 AALMAVF----------------FIMQLVGQVPAALWVIFGEDRFH-WDATTIGISLAAF 255 Query: 278 WVLAQQGGSSISLFIDRFVNRHWLNMTVPTALFQSVNAIAVMAAGVVLAWLSSPKESARS 337 +L S I V IA ++LA+ + Sbjct: 256 GIL----HSLAQAMITGPVAARLGERRALMLGM-----IADGTGYILLAFAT-------- 298 Query: 338 VLRVWLKFAVGLVLMGGGFMLLALNAHQARLDGQASMGMMIAGLALMGFAELFIDPVAMA 397 R W+ F + ++L GG + AL A +R + G + LA + + P+ Sbjct: 299 --RGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFT 356 Query: 398 QI 399 I Sbjct: 357 AI 358
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 481 bits (1240), Expect = e-170 Identities = 166/480 (34%), Positives = 247/480 (51%), Gaps = 42/480 (8%) Query: 7 AHLLLVDDDPGLLKLLGMRLTSEGYSVVTAESGQEGLRVLHREKVDLVISDLRMDEMDGM 66 A +L+ DDD + +L L+ GY V + R + DLV++D+ M + + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 67 QLFTEIQKVQPGMPVIILTAHGSIPDAVAATQKGVFSFLTKPIDRDALYKAIDEALE--- 123 L I+K +P +PV++++A + A+ A++KG + +L KP D L I AL Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 124 --QSAPATDDSWRNAIVTRSPLMLRLLEQARMVAQSDVSVLINGQSGTGKEIFAQAIHNA 181 S D +V RS M + + Q+D++++I G+SGTGKE+ A+A+H+ Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183 Query: 182 SPRSNKPFVAINCGALPEQLLESELFGHARGAFTGAVSNREGLFQAAEGGTLFLDEIGDM 241 R N PFVAIN A+P L+ESELFGH +GAFTGA + G F+ AEGGTLFLDEIGDM Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243 Query: 242 PAPLQVKLLRVLQERKVRPLGSNRDIDIDVRIISATHRDLPKAMARGEFREDLYYRLNVV 301 P Q +LLRVLQ+ + +G I DVRI++AT++DL +++ +G FREDLYYRLNVV Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303 Query: 302 SLKIPALAERTEDIPLLANHLLRQSAQRHKPFVRAFSTDAMKRLMTASWPGNVRQLVNVI 361 L++P L +R EDIP L H ++Q+ + V+ F +A++ + WPGNVR+L N++ Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWPGNVRELENLV 362 Query: 362 EQCVALTSSPVISDALVEQALEGENTALPT------------------------------ 391 + AL VI+ ++E L E P Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422 Query: 392 ------FAEARNQFELNYLRKLLQITKGNVTHAARMAGRNRTEFYKLLSRHELDANDFKE 445 + + E + L T+GN AA + G NR K + + Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSSR 482
>PF06580#Sensor histidine kinase Length = 349 Score = 33.3 bits (76), Expect = 0.002 Identities = 20/127 (15%), Positives = 45/127 (35%), Gaps = 26/127 (20%) Query: 354 DVDLEAERCIAEPMLLMSVLDNLYSNAVHYG----AESGNICIRSRSQGSTVYIDVVNSG 409 ++ PML+ + L N + +G + G I ++ TV ++V N+G Sbjct: 245 QINPAIMDVQVPPMLVQT----LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300 Query: 410 EPIPQTEREMIFEPFFQGSHQRKGAVKGSGLGLSIARDCIRRMQGEIQLVDDNAQEVCFR 469 + +E +G GL R+ ++ + G + + ++ Sbjct: 301 SLALKNTKE------------------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN 342 Query: 470 ISLPLPA 476 + +P Sbjct: 343 AMVLIPG 349
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 36.8 bits (85), Expect = 1e-04 Identities = 34/177 (19%), Positives = 70/177 (39%), Gaps = 3/177 (1%) Query: 213 FWLLFMILALGVFSGMVISSSSAQIGMTQYGLLSGAL-VVSLVSIFNSIGRLFWGGLTDK 271 WL + V + MV++ S I + V + + SIG +G L+D+ Sbjct: 17 IWLCILSF-FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75 Query: 272 LGGYNTLVIVYLFTCLCMLLLFFFNGNTSVFYFSALGVGFAYAGILVIFPGLTSQNFGMR 331 LG L+ + C ++ F + S+ + G A + + ++ Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135 Query: 332 NQGLNYGFMYFGFAVGAVIAPYVTSAIAKYTGSYNTVFILTTVLLLIGVVLTLITKK 388 N+G +G + A+G + P + IA Y ++ + ++ + ++ L + KK Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIH-WSYLLLIPMITIITVPFLMKLLKK 191
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 31.0 bits (70), Expect = 0.006 Identities = 30/121 (24%), Positives = 50/121 (41%), Gaps = 15/121 (12%) Query: 7 YCGFIAIVGRPNVGKSTLLNKLLGQKISITSRKAQTTRHRIVGIHTEGPYQAIYVDTPGL 66 G I +G + G + N LL ++ IT + T+ E I +DTPG Sbjct: 26 NSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS------FQWENTKVNI-IDTPG- 77 Query: 67 HMEEKRAINRLMNKAASSSIGDVELVIFVVEGTRWTPDDEMVLNKLRDGKAPVILAVNKV 126 HM+ + R + S + L+I +G + ++ + LR P I +NK+ Sbjct: 78 HMDFLAEVYRSL-----SVLDGAILLISAKDGVQ--AQTRILFHALRKMGIPTIFFINKI 130 Query: 127 D 127 D Sbjct: 131 D 131
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 153 bits (388), Expect = 8e-42 Identities = 95/458 (20%), Positives = 181/458 (39%), Gaps = 91/458 (19%) Query: 3 NIRNFSIIAHIDHGKSTLSDRIIQICGG---LSDREMEAQVLDSMDLERERGITIKAQSV 59 I N ++AH+D GK+TL++ ++ G L + D+ LER+RGITI+ Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 60 TLDFKASDGETYQLNFIDTPGHVDFSYEVSRSLAACEGALLVVDAGQGVEAQTLANCYTA 119 + + E ++N IDTPGH+DF EV RSL+ +GA+L++ A GV+AQT + Sbjct: 62 SFQW-----ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116 Query: 120 MEMDLEVVPVLNKIDLPAADPERVAEEIED-----------------IVGIDATDAVRCS 162 +M + + +NKID D V ++I++ + + T++ + Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176 Query: 163 AKTGVGVTDVLERLVRDIPP---------------------------------------- 182 G D+LE+ + Sbjct: 177 T-VIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVIT 235 Query: 183 -----PQGDPDGPLQALIIDSWFDNYLGVVSLVRIKNGTMRKGDKIKVMSTGQTYNADRL 237 L + + ++ +R+ +G + D +++ + Sbjct: 236 NKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI---KIT 292 Query: 238 GIFTPKQVDRTELKCGEVGWLVCAIKDIL--GAPVGDTLTSARNPAEKALPGFKKVKPQV 295 ++T + ++ G +V + L + +GDT P + + + P + Sbjct: 293 EMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTK---LLPQRERI---ENPLPLL 346 Query: 296 YAGLFPVSSDDYESFRDALGKLSLNDASL-FYEPESSSALGFGFRCGFLGLLHMEIIQER 354 + P E DAL ++S +D L +Y ++ + FLG + ME+ Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIIL----SFLGKVQMEVTCAL 402 Query: 355 LEREYDLDLITTAPTVVYEVET---TAKETIYVDSPSK 389 L+ +Y +++ PTV+Y +E A+ TI+++ P Sbjct: 403 LQEKYHVEIEIKEPTVIY-MERPLKKAEYTIHIEVPPN 439 Score = 35.2 bits (81), Expect = 7e-04 Identities = 18/81 (22%), Positives = 30/81 (37%), Gaps = 1/81 (1%) Query: 398 ELREPIAECHMLLPQAYLGNVITLCIEKRGVQTNMVYHGNQVALTYEIPMAEVVLDFFDR 457 EL EP + PQ YL T + + N+V L+ EIP + ++ Sbjct: 534 ELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARC-IQEYRSD 592 Query: 458 LKSTSRGYASLDYNFKRFQAS 478 L + G + K + + Sbjct: 593 LTFFTNGRSVCLTELKGYHVT 613
>SECA#SecA protein signature. Length = 901 Score = 26.8 bits (59), Expect = 0.041 Identities = 15/49 (30%), Positives = 20/49 (40%), Gaps = 15/49 (30%) Query: 51 HRKGGPVLV-----EHREYTHEELI----------AQAEARKAELLAEA 84 KG PVLV E E EL A+ A +A ++A+A Sbjct: 446 TAKGQPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQA 494
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 27.3 bits (60), Expect = 0.024 Identities = 13/49 (26%), Positives = 18/49 (36%) Query: 83 TRTHQSNCNTRSQTHSSSTSKTRSSSVGFSVGGPVGASIGLIKQMESMS 131 QS NT SQT + S + + S + V G S+S Sbjct: 302 KNEDQSTQNTDSQTRTISKNTSTSRTHTSEVHGNAEVHASFFDIGGSVS 350
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 25.5 bits (55), Expect = 0.033 Identities = 7/24 (29%), Positives = 16/24 (66%) Query: 24 DKFREAEKHIAELEAKLETADRLQ 47 +KF+++ ++ + E ET D++Q Sbjct: 57 EKFKDSINNLVKTEFTNETLDKIQ 80
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 243 bits (623), Expect = 2e-78 Identities = 97/443 (21%), Positives = 181/443 (40%), Gaps = 61/443 (13%) Query: 6 HDAAMDDPDIQRERAFSGAGRIVLICSLLFLILGIWAWFGRLDEVSTGNGKVIPSSREQV 65 H ++ P +R R ++ ++ I + G+++ V+T NGK+ S R + Sbjct: 45 HLELIETPVSRRPRLV---AYFIMGFLVIAFI---LSVLGQVEIVATANGKLTHSGRSKE 98 Query: 66 LQSLDGGILAQLTVREGDRVQANQIVARLDPTRLASNVGESAAKYRASLASSARLTA--- 122 ++ ++ I+ ++ V+EG+ V+ ++ +L ++ ++ + + R Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158 Query: 123 --EVNDLPL----AFPAELNGWPDLIAAETRLYKSR-----------RAQLSDTEAELRD 165 E+N LP P N + + T L K + L AE Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT 218 Query: 166 ALASVNK----------ELAITQRLEKSGAASHVEVLRLQRQKSDLG------------- 202 LA +N+ L L A + VL + + + Sbjct: 219 VLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQI 278 Query: 203 --------LKITDLRSQYYVQAREALSKANAEVDMLSAILKGREDSVTRLTVRSPVRGIV 254 + + + + + L + + +L+ L E+ +R+PV V Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKV 338 Query: 255 KNIQVTTIGGVIPPNGEMMEIVPVDDRLLIETRLSPRDIAFIHPGQRALVKITAYDYAIY 314 + ++V T GGV+ +M IVP DD L + + +DI FI+ GQ A++K+ A+ Y Y Sbjct: 339 QQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRY 398 Query: 315 GGLDGVVETISPDTIQDKVKPEIFYYRVFIRTHQDYLQNKSGRRFSIVPGMIATVDIKTG 374 G L G V+ I+ D I+D+ + V I ++ L + + GM T +IKTG Sbjct: 399 GYLVGKVKNINLDAIEDQRLGL--VFNVIISIEENCLSTG-NKNIPLSSGMAVTAEIKTG 455 Query: 375 EKTIVDYLIKPF-NRAKEALRER 396 ++++ YL+ P E+LRER Sbjct: 456 MRSVISYLLSPLEESVTESLRER 478
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.3 bits (78), Expect = 0.004 Identities = 46/217 (21%), Positives = 66/217 (30%), Gaps = 49/217 (22%) Query: 992 PPG----TVVAVVGRSGAGKSTLIKLLAGLYSPGSGQIRVGER-----------LIDAAS 1036 PG V + G G GKSTLI L GL +G + + Sbjct: 590 EPGCKFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSE 649 Query: 1037 LSDYRRQTGLVTQDVALFSGDIAENI-RYPRPNSSDTEVESAARRAGLFETV---QHL-- 1090 ++ +RR D + RY V+ R+ ++ T Q+L Sbjct: 650 MTAFRR------ADAEAVKAFFSSRKDRYRGA--YGRYVQDHPRQVVIWCTTNKRQYLFD 701 Query: 1091 PLGFRT--PVNNGG----TDLSAGQRQLIALA--------RAHLA--QAHILLLDEATAR 1134 G R PV G L + QL A A R + I E R Sbjct: 702 ITGNRRFWPVLVPGRANLVWLQKFRGQLFAEALHLYLAGERYFPSPEDEEIYFRPEQELR 761 Query: 1135 -IDRSAEERLMTSLTRVTHTEKRIALIVAHRLTTARR 1170 ++ + RL LTR A A + + Sbjct: 762 LVETGVQGRLWALLTREG---APAAEGAAQKGYSVNT 795
>PF06580#Sensor histidine kinase Length = 349 Score = 33.7 bits (77), Expect = 0.001 Identities = 23/101 (22%), Positives = 38/101 (37%), Gaps = 21/101 (20%) Query: 387 LLDNALKY----TPEQGIVTARLERDGDAVTLVVEDSGPGIDDEHIHLALQPFHRLDNVG 442 L++N +K+ P+ G + + +D VTL VE++G L N Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA--------------LKNTK 308 Query: 443 NVAGAGIGLALVND-IARLHRTHPHFSRSEALGGLYVRIRF 482 G GL V + + L+ T SE G + + Sbjct: 309 E--STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 96.8 bits (241), Expect = 2e-25 Identities = 35/122 (28%), Positives = 61/122 (50%), Gaps = 1/122 (0%) Query: 2 RLLLAEDNRELAHWLEKALVQNGFAVDCVFDGLAADHLLHSEMYALAVLDINMPGMDGLE 61 +L+A+D+ + L +AL + G+ V + + + L V D+ MP + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VVQRLRKRGQTLPVLLLTARSAVADRVKGLNVGADDYLPKPFELEE-LDARLRALLRRSA 120 ++ R++K LPVL+++A++ +K GA DYLPKPF+L E + RAL Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 GQ 122 Sbjct: 125 RP 126
>INTIMIN#Intimin signature. Length = 939 Score = 27.3 bits (60), Expect = 0.030 Identities = 20/69 (28%), Positives = 33/69 (47%), Gaps = 6/69 (8%) Query: 82 SVDDQVKTTTPAAESQFYTVKSGDTLSAISKQVYGNANLYNKIFEANKPMLKSPE---KI 138 D ++ T FYT+K+G+T++ +SK N + I+ NK + S K Sbjct: 48 GSDSKLLTHNSYQNRLFYTLKTGETVADLSKSQDINLST---IWSLNKHLYSSESEMMKA 104 Query: 139 YPGQVLRIP 147 PGQ + +P Sbjct: 105 EPGQQIILP 113
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 82.4 bits (203), Expect = 8e-21 Identities = 66/257 (25%), Positives = 120/257 (46%), Gaps = 7/257 (2%) Query: 3 QVAVVIGGGQTLGAFLCRGLAEEGYRVAVVDIQSDKAANVAQEINADFGEGMAYGFGADA 62 ++A + G Q +G + R LA +G +A VD +K V + A+ A F AD Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAE--ARHAEAFPADV 66 Query: 63 TSEQSVLALSRGVDEIFGRVNLLVYSAGIAKAAFISDFQLGDFDRSLQVNLVGYFLCARE 122 ++ ++ ++ G +++LV AG+ + I +++ + VN G F +R Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 123 FSRLMIRDGIQGRIIQINSKSGKVGSKHNSGYSAAKFGGVGLTQSLALDLAEYGITVHSL 182 S+ M D G I+ + S V + Y+++K V T+ L L+LAEY I + + Sbjct: 127 VSKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 183 MLGNLLKSPMFQSL-LPQYATKLGIKPDEVEQYYIDKVPLKRGCDYQDVLNMLLFYASPK 241 G+ ++ M SL + + IK +E + +PLK+ D+ + +LF S + Sbjct: 186 SPGS-TETDMQWSLWADENGAEQVIKGS-LETFKTG-IPLKKLAKPSDIADAVLFLVSGQ 242 Query: 242 ASYCTGQSINVTGGQVM 258 A + T ++ V GG + Sbjct: 243 AGHITMHNLCVDGGATL 259
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 27.1 bits (60), Expect = 0.044 Identities = 10/45 (22%), Positives = 18/45 (40%), Gaps = 5/45 (11%) Query: 1 MKPRQRQAAILEHLQKQGKCSVEEL-----AQYFDTTGTTIRKDL 40 M QR I E + + +EL ++ T T+ +D+ Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDI 45
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 352 bits (905), Expect = e-119 Identities = 114/306 (37%), Positives = 167/306 (54%), Gaps = 21/306 (6%) Query: 187 MIGLSPAMTQLKKEIEIVAGSDLNVLIGGETGTGKELVAKAIHQGSPRAVNPLVYLNCAA 246 ++G S AM ++ + + + +DL ++I GE+GTGKELVA+A+H R P V +N AA Sbjct: 139 LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAA 198 Query: 247 LPESVAESELFGHVKGAFTGAISNRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYG 306 +P + ESELFGH KGAFTGA + +G+FE A+ GTLFLDEIG++ + Q +LLRVLQ G Sbjct: 199 IPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQG 258 Query: 307 DIQRVGDDRSLRVDVRVLAATNRDLREEVLAGRFRADLFHRLSVFPLFVPPLRERGDDVV 366 + VG +R DVR++AATN+DL++ + G FR DL++RL+V PL +PPLR+R +D+ Sbjct: 259 EYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIP 318 Query: 367 LLAGYFCEQCRLRLGLSRVVLSPGARRHLLNYGWPGNVRELEHAIHRAVVLARATRAGDE 426 L +F +Q + GL A + + WPGNVRELE+ + R L E Sbjct: 319 DLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITRE 377 Query: 427 VVL-----EEQHFALS---------------EDVLPAPSAESFLALPACRNLRESTENFQ 466 ++ E + E+ + A ALP + Sbjct: 378 IIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEME 437 Query: 467 REMIRQ 472 +I Sbjct: 438 YPLILA 443
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 27.0 bits (59), Expect = 0.011 Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 8/75 (10%) Query: 12 IDGNQAKVD--VCGIQRDVDLTLVGSCDENGQPRLGQWVLVHVGFAMSVINEAEARDTLD 69 I GNQ + D G+ D L ++NG+P G W+ + + F + ++ ++ D + Sbjct: 171 IIGNQIRTDQKFMGV-FDESLKERQEAEKNGEPTGGDWLDIFLSF---IFDKKQSSDVKE 226 Query: 70 ALQN--MFDVEPDVG 82 A+ + V+PD+ Sbjct: 227 AINQEPVPHVQPDIA 241
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 385 bits (991), Expect = e-129 Identities = 143/373 (38%), Positives = 207/373 (55%), Gaps = 39/373 (10%) Query: 387 YQEIHRLKERLVDENLALTEQLNNVDSEFGEIIGRSEAMYNVLKQVEMVAQSDSTVLILG 446 E+ + R + E +L + + ++GRS AM + + + + Q+D T++I G Sbjct: 108 LTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITG 167 Query: 447 ETGTGKELIARAIHNLSGRSGRRMVKMNCAAMPAGLLESDLFGHERGAFTGASAQRIGRF 506 E+GTGKEL+ARA+H+ R V +N AA+P L+ES+LFGHE+GAFTGA + GRF Sbjct: 168 ESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRF 227 Query: 507 ELADKSSLFLDEVGDMPLELQPKLLRVLQEQEFERLGSNKLIQTDVRLIAATNRDLKKMV 566 E A+ +LFLDE+GDMP++ Q +LLRVLQ+ E+ +G I++DVR++AATN+DLK+ + Sbjct: 228 EQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSI 287 Query: 567 ADREFRNDLYYRLNVFPIQLPPLRERPEDIPLLVKAFTFKIARRMGRNIDSIPAETLRTL 626 FR DLYYRLNV P++LPPLR+R EDIP LV+ F + A + G ++ E L + Sbjct: 288 NQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFV-QQAEKEGLDVKRFDQEALELM 346 Query: 627 SSMEWPGNVRELENVVERAVLLTRGNVLQLS-LPDITAVTPDTSPVATESAKEG------ 679 + WPGNVRELEN+V R L +V+ + + SP+ +A+ G Sbjct: 347 KAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQ 406 Query: 680 ----------------------------EDEYQLIIRVLKETNGVVAGPKGAAQRLGLKR 711 E EY LI+ L T G AA LGL R Sbjct: 407 AVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIK---AADLLGLNR 463 Query: 712 TTLLSRMKRLGID 724 TL +++ LG+ Sbjct: 464 NTLRKKIRELGVS 476
>adhesinb#Adhesin B signature. Length = 310 Score = 322 bits (827), Expect = e-112 Identities = 90/309 (29%), Positives = 164/309 (53%), Gaps = 14/309 (4%) Query: 4 LHRLKTLLLAGIVAILAL-------SPAYAKEKFKVITTFTVIADMAKNVAGDAAEVSSI 56 + + + L+L + + S K V+ T ++IAD+ KN+AGD + SI Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSI 60 Query: 57 TKPGAEIHEYQPTPGDIKRAQGAQLILANGLNLER----WFARFYQHLSGVPE---VVVS 109 G + HEY+P P D+K+ A LI NG+NLE WF + ++ VS Sbjct: 61 VPVGQDPHEYEPLPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVS 120 Query: 110 TGVKPMGITEGPYNGKPNPHAWMSAENALIYVDNIRDALVKYDPDNAQIYKQNAERYKAK 169 GV + + GK +PHAW++ EN +IY NI L + DP N + Y++N + Y K Sbjct: 121 EGVDVIYLEGQSEKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEK 180 Query: 170 IRQMADPLRAELEKIPADQRWLVTSEGAFSYLARDNDMKELYLWPINADQQGTPKQVRKV 229 + + + + IP +++ +VTSEG F Y ++ ++ Y+W IN +++GTP Q++ + Sbjct: 181 LSALDKEAKEKFNNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTL 240 Query: 230 IDTIKKHHIPAIFSESTVSDKPARQVARESGAHYGGVLYVDSLSAADGPVPTYLDLLRVT 289 ++ ++K +P++F ES+V D+P + V++++ ++ DS++ +Y +++ Sbjct: 241 VEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYN 300 Query: 290 TETIVNGIN 298 E I G++ Sbjct: 301 LEKIAEGLS 309
>BORPETOXINA#Bordetella pertussis toxin A subunit signature. Length = 269 Score = 30.5 bits (68), Expect = 0.007 Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 8/57 (14%) Query: 201 IISDLTRKWSQAEVAGKLFMSVSSLKRKLAAEEVSFSKIYLDARMNQAIKLLRMGAG 257 ++ LT + Q + F+S SS +R ++++YL+ RM +A++ R G G Sbjct: 66 VLDHLTGRSCQVGSSNSAFVSTSSSRR--------YTEVYLEHRMQEAVEAERAGRG 114
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 42.6 bits (100), Expect = 7e-07 Identities = 33/167 (19%), Positives = 63/167 (37%), Gaps = 10/167 (5%) Query: 23 LLKGLDQEQANEVIAVLQMHNIEANKIDSGKLGYSITVAEPDFTAAVYWIKTYQLPPRPR 82 L L + ++A L NI + +G +I V + LP Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNI-PYRFANG--SGAIEVPADKVHELRLRLAQQGLPKGGA 109 Query: 83 VEIAQMFPADSLVSSPRAEKARLYSAIEQRLEQSLQTMEGVLSARVHISYDIDA---GEN 139 V + + S +E+ A+E L ++++T+ V SARVH++ + E Sbjct: 110 VGFE-LLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQ 168 Query: 140 GRPPKPVHLSALAVYERGSPLAHQISDIKRFLKNSFADVDYDNISVV 186 P V ++ QIS + + ++ A + N+++V Sbjct: 169 KSPSASVTVTLEPGRALDEG---QISAVVHLVSSAVAGLPPGNVTLV 212
>PF07212#Hyaluronoglucosaminidase Length = 336 Score = 28.1 bits (62), Expect = 0.045 Identities = 12/39 (30%), Positives = 21/39 (53%) Query: 234 MSTSTLKRKLAEEGTSFSDIYLSARMNQAAKLLRIGNHN 272 +S +K++ +GT+ IY+++ KLLRI N Sbjct: 241 LSIDIVKKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLG 279
>BACYPHPHTASE#Salmonella/Yersinia modular tyrosine phosphatase signature. Length = 468 Score = 304 bits (778), Expect = e-100 Identities = 67/212 (31%), Positives = 102/212 (48%), Gaps = 17/212 (8%) Query: 340 GKPVALAGSYPKNTPDALEAHMKMLLEKECSCLVVLTSEDQMQAKQ--LPPYFRGSYTFG 397 G +A YP LE+H +ML E L VL S ++ ++ +P YFR S T+G Sbjct: 252 GNTRTIACQYP--LQSQLESHFRMLAENRTPVLAVLASSSEIANQRFGMPDYFRQSGTYG 309 Query: 398 EVHTNSQKVSSASQGEAI--DQYNMQL-SCGEKRYTIPVLHVKNWPDHQPLPS--TDQLE 452 + S+ G+ I D Y + + G+K ++PV+HV NWPD + S T L Sbjct: 310 SITVESKMTQQVGLGDGIMADMYTLTIREAGQKTISVPVVHVGNWPDQTAVSSEVTKALA 369 Query: 453 YLADRVKNSNQNGAPGRSSS-----DKHLPMIHCLGGVGRTGTMAAALVLKDNPHSNL-- 505 L D+ + +N + SS K P+IHC GVGRT + A+ + D+ +S L Sbjct: 370 SLVDQTAETKRNMYESKGSSAVGDDSKLRPVIHCRAGVGRTAQLIGAMCMNDSRNSQLSV 429 Query: 506 EQVRADFRDSRNNRMLEDASQF-VQLKAMQAQ 536 E + + R RN M++ Q V +K + Q Sbjct: 430 EDMVSQMRVQRNGIMVQKDEQLDVLIKLAEGQ 461
>PF05932#Tir chaperone protein (CesT) Length = 127 Score = 33.2 bits (76), Expect = 7e-05 Identities = 16/111 (14%), Positives = 39/111 (35%), Gaps = 7/111 (6%) Query: 4 PLTFDDNNQCLLLLDSDIFTSIEAK--DDIWLLNGMIIPLSPVCGDSIWRQIMVINGELA 61 PL FDD+ C +++D+ ++ + LL G++ P D + ++ Sbjct: 21 PLVFDDHGTCNMIIDNTFALTLSCDYARERLLLIGLLEPH----KDIPQQCLLAGALNPL 76 Query: 62 ANNEGTLAYIDAAETLLFIHAI-TDLTNTYHIISQLESFVNQQEALKNILQ 111 N L + + +I + + + ++ + + Q Sbjct: 77 LNAGPGLGLDEKSGLYHAYQSIPREKLSVPTLKREMAGLLEWMRGWREASQ 127
>BACINVASINC#Salmonella/Shigella invasin protein C signature. Length = 409 Score = 515 bits (1327), Expect = 0.0 Identities = 407/409 (99%), Positives = 408/409 (99%) Query: 1 MLISNVGINPAAYLNNHSVENSSQTASQSVSAKDILNSIGISSSKVSDLGLSPTLSAPAP 60 MLISNVGINPAAYLNNHSVENSSQTASQSVSAKDILNSIGISSSKVSDLGLSPTLSAPAP Sbjct: 1 MLISNVGINPAAYLNNHSVENSSQTASQSVSAKDILNSIGISSSKVSDLGLSPTLSAPAP 60 Query: 61 GVLTQTPGTITSFLKASIQNTDMNQDLNALANNVTTKANEVVQTQLREQQAEVGKFFDIS 120 GVLTQTPGTITSFLKASIQNTDMNQDLNALANNVTTKANEVVQTQLREQQAEVGKFFDIS Sbjct: 61 GVLTQTPGTITSFLKASIQNTDMNQDLNALANNVTTKANEVVQTQLREQQAEVGKFFDIS 120 Query: 121 GMSSSAVALLAAANTLMLTLNQADSKLSGKLSLVSFDAAKTTASSMMREGMNALSGSISQ 180 GMSSSAVALLAAANTLMLTLNQADSKLSGKLSLVSFDAAKTTASSMMREGMNALSGSISQ Sbjct: 121 GMSSSAVALLAAANTLMLTLNQADSKLSGKLSLVSFDAAKTTASSMMREGMNALSGSISQ 180 Query: 181 SALQLGITGVGAKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQNSVKLGAEGV 240 SALQLGITGVGAKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQNSVKLGAEGV Sbjct: 181 SALQLGITGVGAKLEYKGLQNERGALKHNAAKIDKLTTESHSIKNVLNGQNSVKLGAEGV 240 Query: 241 DSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIKDSNKQISPEHQAILSKRLESV 300 DSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIK+SNKQISPEHQAILSKRLESV Sbjct: 241 DSLKSLNMKKTGTDATKNLNDATLKSNAGTSATESLGIKNSNKQISPEHQAILSKRLESV 300 Query: 301 ESDIRLEQNTMDMTRIDARKMQMTGDLIMKNSVTVGGIAGASGQYAATQERSEQQISQVN 360 ESDIRLEQNTMDMTRIDARKMQMTGDLIMKNSVTVGGIAGAS QYAATQERSEQQISQVN Sbjct: 301 ESDIRLEQNTMDMTRIDARKMQMTGDLIMKNSVTVGGIAGASRQYAATQERSEQQISQVN 360 Query: 361 NRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASALAAIAGNIRA 409 NRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASALAAIAGNIRA Sbjct: 361 NRVASTASDEARESSRKSTSLIQEMLKTMESINQSKASALAAIAGNIRA 409
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 842 bits (2175), Expect = 0.0 Identities = 592/593 (99%), Positives = 592/593 (99%) Query: 1 MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGE 60 MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGE Sbjct: 1 MVNDASSISRSGYTQNPRLAEAAFEGVRKNTDFLKAADKAFKDVVATKAGDLKAGTKSGE 60 Query: 61 SAINTVGLKPPTDAAREKLSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKE 120 SAINTVGLKPPTDAAREKLSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKE Sbjct: 61 SAINTVGLKPPTDAAREKLSSEGQLTLLLGKLMTLLGDVSLSQLESRLAVWQAMIESQKE 120 Query: 121 MGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAATKKLTQAQNKLQSLDPADPG 180 MGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAATKKLTQAQNKLQSLDPADPG Sbjct: 121 MGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAATKKLTQAQNKLQSLDPADPG 180 Query: 181 YAQTEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN 240 YAQ EAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN Sbjct: 181 YAQAEAAVEQAGKEATEAKEALDKATDATVKAGTDAKAKAEKADNILTKFQGTANAASQN 240 Query: 241 QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEF 300 QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEF Sbjct: 241 QVSQGEQDNLSNVARLTMLMAMFIEIVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEF 300 Query: 301 QEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAA 360 QEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAA Sbjct: 301 QEETRKAEETNRIMGCIGKVLGALLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAA 360 Query: 361 TGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMV 420 TGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMV Sbjct: 361 TGVSFIQQALNPIMEHVLKPLMELIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMV 420 Query: 421 AVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG 480 AVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG Sbjct: 421 AVIVVVAVVGKGAAAKLGNALSKMMGETIKKLVPNVLKQLAQNGSKLFTQGMQRITSGLG 480 Query: 481 NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFML 540 NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFML Sbjct: 481 NVGSKMGLQTNALSKELVGNTLNKVALGMEVTNTAAQSAGGVAEGVFIKNASEALADFML 540 Query: 541 ARFAMDQIQQWLKQSVEIFGENQKVTAELQKAMSSAVQQNADASRFILRQSRA 593 ARFAMDQIQQWLKQSVEIFGENQKVTAELQKAMSSAVQQNADASRFILRQSRA Sbjct: 541 ARFAMDQIQQWLKQSVEIFGENQKVTAELQKAMSSAVQQNADASRFILRQSRA 593
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 128 bits (322), Expect = 2e-40 Identities = 39/160 (24%), Positives = 72/160 (45%), Gaps = 4/160 (2%) Query: 4 QNNVSEERVAEMIWDAVSEGATLKDVHGIPQDMMDGLYAHAYEFYNQGRLDEAETFFRFL 63 Q + + + G T+ ++ I D ++ LY+ A+ Y G+ ++A F+ L Sbjct: 3 QETTDTQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQAL 62 Query: 64 CIYDFYNPDYTMGLAAVCQLKKQFQKACDLYAVAFTLLKNDYRPVFFTGQCQLLMRKAAK 123 C+ D Y+ + +GL A Q Q+ A Y+ + + R F +C L + A+ Sbjct: 63 CVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAE 122 Query: 124 ARQCF----ELVNERTEDESLRAKALVYLEALKTAETEQH 159 A EL+ ++TE + L + LEA+K + +H Sbjct: 123 AESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKEMEH 162
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 341 bits (876), Expect = e-118 Identities = 119/360 (33%), Positives = 205/360 (56%), Gaps = 19/360 (5%) Query: 1 MSSNKTEKPTKKRLEDSAKKGQSFKSKDLIIACLTLGGIAYLVSYGSFN-EFMGIIKIII 59 MS KTE+PT K++ D+ KKGQ KSK+++ L + A L+ + E + +I Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60 Query: 60 ADNFDQSMADYSLAVFGIGLKYLIPFMLLCL---VCSALPAL----LQAGFVLATEALKP 112 +QS +S A+ + L+ F LC +AL A+ +Q GF+++ EA+KP Sbjct: 61 ---AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKP 117 Query: 113 NLSALNPVEGAKKLFSMRTVKDTVKTLLYLSSFVVAAIICWKKYKVEIFSQLNGNVVDIA 172 ++ +NP+EGAK++FS++++ + +K++L + V+ +I+ W K + + L I Sbjct: 118 DIKKINPIEGAKRIFSIKSLVEFLKSILKV---VLLSILIWIIIKGNLVTLLQLPTCGIE 174 Query: 173 VIWRELLLALVLTCLACA---LIVLLLDAVAEYFRTMKDMKMDKEEVKREMKEQEGNPEV 229 I L L + C +++ + D EY++ +K++KM K+E+KRE KE EG+PE+ Sbjct: 175 CITPLLGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEI 234 Query: 230 KSKRREVHMEILSEQVKSDIENSRLIVANPTHITIGIYFKPELMPIPMISVYETNQRALA 289 KSKRR+ H EI S ++ +++ S ++VANPTHI IGI +K P+P+++ T+ + Sbjct: 235 KSKRRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQT 294 Query: 290 VRAYAEKVGVPVIVDIKLARSLFKTHRRYDLVSLEEIDEVLRLLVWLE--EVENAGKDVI 347 VR AE+ GVP++ I LAR+L+ + E+I+ +L WLE +E +++ Sbjct: 295 VRKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQHSEML 354
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 184 bits (470), Expect = 5e-60 Identities = 48/237 (20%), Positives = 103/237 (43%), Gaps = 4/237 (1%) Query: 12 LVASAALGFARVAPIFFFLPFLNSGVLSGAPRNAIIILVALGVWPHALNEVPPFLSVAMI 71 + RV + P L+ + + + +++ + P P S + Sbjct: 12 WLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFAL 71 Query: 72 PLVLQEAAVGVMLGCLLSWPFWVMHALGCIIDNQRGATLSSSIDPANGIDTSEMANFLNM 131 L +Q+ +G+ LG + + F + G II Q G + ++ +DPA+ ++ +A ++M Sbjct: 72 WLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDM 131 Query: 132 SAAVVHLQNGGLVTMVDVLTKSYQLCDPMNEC--TPSLPPLLTFINQVAQNALVLASPVV 189 A ++ L G + ++ +L ++ E + + L + + N L+LA P++ Sbjct: 132 LALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLI 191 Query: 190 LVLLLSEVFLGLLSRFAPQMNAFAISLTVKSGIAVLIMLLYFS--PVLPDNVLRLSF 244 +LL + LGLL+R APQ++ F I + + + +M +++ F Sbjct: 192 TLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIF 248
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 88.7 bits (220), Expect = 4e-27 Identities = 86/86 (100%), Positives = 86/86 (100%) Query: 1 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCL 60 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCL Sbjct: 1 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCL 60 Query: 61 FLLSGWYGEVLLSYGRQVIFLALAKG 86 FLLSGWYGEVLLSYGRQVIFLALAKG Sbjct: 61 FLLSGWYGEVLLSYGRQVIFLALAKG 86
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 303 bits (777), Expect = e-107 Identities = 224/224 (100%), Positives = 224/224 (100%) Query: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS Sbjct: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60 Query: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL Sbjct: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120 Query: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL Sbjct: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180 Query: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYMDIAT 224 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYMDIAT Sbjct: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYMDIAT 224
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 538 bits (1387), Expect = 0.0 Identities = 300/303 (99%), Positives = 302/303 (99%) Query: 1 MSLRVRQIDRREWLLAQTATECQRHGQEATLEYPTRQGMWVRLSDAEKRWSAWIQPGDWL 60 MSLRVRQIDRREWLLAQTATECQRHG+EATLEYPTRQGMWVRLSDAEKRWSAWI+PGDWL Sbjct: 1 MSLRVRQIDRREWLLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIKPGDWL 60 Query: 61 EHVSPALAGAAVSAGAEHLVVPWLAATERPFELPVPHLSCRRLCVENPVPGSALPEGKLL 120 EHVSPALAGAAVSAGAEHLVVPWLAATERPFELPVPHLSCRRLCVENPVPGSALPEGKLL Sbjct: 61 EHVSPALAGAAVSAGAEHLVVPWLAATERPFELPVPHLSCRRLCVENPVPGSALPEGKLL 120 Query: 121 HIMSDRGGLWFEHLPELPAVGGGRPKMLRWPLRFVIGSSDTQCSLLGRIGIGDVLLIRTS 180 HIMSDRGGLWFEHLPELPAVGGGRPKMLRWPLRFVIGSSDTQ SLLGRIGIGDVLLIRTS Sbjct: 121 HIMSDRGGLWFEHLPELPAVGGGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTS 180 Query: 181 RAEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYR 240 RAEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYR Sbjct: 181 RAEVYCYAKKLGHFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYR 240 Query: 241 KNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESG 300 KNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESG Sbjct: 241 KNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESG 300 Query: 301 NGE 303 NGE Sbjct: 301 NGE 303
>SSPANPROTEIN#Salmonella invasion protein InvJ signature. Length = 336 Score = 600 bits (1548), Expect = 0.0 Identities = 331/336 (98%), Positives = 334/336 (99%) Query: 1 MGDVSAVSSSGNILLPQQDEVGGLSEALKKAVEKHKTEYSGDKKDRDYGDAFVMHKETAL 60 MGDVSAVSSSGNILLPQQDEVGGLSEALKKAVEKHKTEYSGDKKDRDYGDAFVMHKETAL Sbjct: 1 MGDVSAVSSSGNILLPQQDEVGGLSEALKKAVEKHKTEYSGDKKDRDYGDAFVMHKETAL 60 Query: 61 PVLLAAWRHGAPAKSEHHNGNVSGLHHNGKGDLRIAEKLLKVTAEKSVGLISAEAKVDKS 120 P+LLAAWRHGAPAKSEHHNGNVSGLHHNGK +LRIAEKLLKVTAEKSVGLISAEAKVDKS Sbjct: 61 PLLLAAWRHGAPAKSEHHNGNVSGLHHNGKSELRIAEKLLKVTAEKSVGLISAEAKVDKS 120 Query: 121 AALLSSKNRPLESVSGKKLSADLKAVESVSEVADNATGISDDNIKALPGDNKAIAGEGVR 180 AALLSSKNRPLESVSGKKLSADLKAVESVSEV DNATGISDDNIKALPGDNKAIAGEGVR Sbjct: 121 AALLSSKNRPLESVSGKKLSADLKAVESVSEVTDNATGISDDNIKALPGDNKAIAGEGVR 180 Query: 181 KEGAPLARDVAPARMAAANTGKPDDKDHKKVKDVSQLPLQPTTIADLSQLTGGDEKMPLA 240 KEGAPLARDVAPARMAAANTGKP+DKDHKKVKDVSQLPLQPTTIADLSQLTGGDEKMPLA Sbjct: 181 KEGAPLARDVAPARMAAANTGKPEDKDHKKVKDVSQLPLQPTTIADLSQLTGGDEKMPLA 240 Query: 241 AQSKPMMTIFPTADGVKGEDSSLTYRFQRWGNDYSVNIQARQAGEFSLIPSNTQVEHRLH 300 AQSKPMMTIFPTADGVKGEDSSLTYRFQRWGNDYSVNIQARQAGEFSLIPSNTQVEHRLH Sbjct: 241 AQSKPMMTIFPTADGVKGEDSSLTYRFQRWGNDYSVNIQARQAGEFSLIPSNTQVEHRLH 300 Query: 301 DQWQNGNPQRWHLTRDDQQNPQQQQHRQQSGEEDDA 336 DQWQNGNPQRWHLTRDDQQNPQQQQHRQQSGEEDDA Sbjct: 301 DQWQNGNPQRWHLTRDDQQNPQQQQHRQQSGEEDDA 336
>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type M signature. Length = 147 Score = 169 bits (429), Expect = 3e-57 Identities = 141/147 (95%), Positives = 143/147 (97%) Query: 1 MHSLTRIKVLQRRCTVFHSQCESILLRYQDEDRGLQAEEEAILEQIAGLKLLLDTLRAEN 60 MHSLTRIKVLQRRCTVFHSQCESILLRYQDEDR LQ EEEAI+EQIAGLKLLLDTLRAEN Sbjct: 1 MHSLTRIKVLQRRCTVFHSQCESILLRYQDEDRRLQVEEEAIVEQIAGLKLLLDTLRAEN 60 Query: 61 RQLSREEIYTLLRKQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQKKSKYWLRKEGNY 120 RQLSREEIY LLRKQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQ+KSKYWLRKEGNY Sbjct: 61 RQLSREEIYALLRKQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQEKSKYWLRKEGNY 120 Query: 121 QRWIIRQKRFYIQREIQQEEAESEEII 147 QRWIIRQKR YIQREIQQEEAESEEII Sbjct: 121 QRWIIRQKRLYIQREIQQEEAESEEII 147
>SSPAKPROTEIN#Invasion protein B family signature. Length = 133 Score = 205 bits (522), Expect = 9e-72 Identities = 43/133 (32%), Positives = 75/133 (56%) Query: 1 MQHLDIAELVRSALEVSGCDPSLIGGIDSHSTIVLDLFALPSICISVKEDDVWIWAQLGA 60 M ++++ +LVR +L GC PS+I +DSHS I + L ++P+I I++ + V +WA A Sbjct: 1 MSNINLVQLVRDSLFTIGCPPSIITDLDSHSAITISLDSMPAINIALVNEQVMLWANFDA 60 Query: 61 DSMVVLQQRAYEILMTIMEGCHFARGGQLLLGEQNGELTLKALVHPDFLSDGEKFSTALN 120 S V LQ AY IL ++ ++ + L + L L+ ++ D++ DG F+ L+ Sbjct: 61 PSDVKLQSSAYNILNLMLMNFSYSINELVELHRSDEYLQLRVVIKDDYVHDGIVFAEILH 120 Query: 121 GFYNYLEVFSRSL 133 FY +E+ + L Sbjct: 121 EFYQRMEILNGVL 133
>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature. Length = 372 Score = 604 bits (1558), Expect = 0.0 Identities = 371/372 (99%), Positives = 371/372 (99%) Query: 1 MIPGSTSGISFSRILSRQTSHQDATQHTDAQQAEIQQAAEDSSPGAEVQKFVQSTDEMSA 60 MIPGSTSGISFSRILSRQ SHQDATQHTDAQQAEIQQAAEDSSPGAEVQKFVQSTDEMSA Sbjct: 1 MIPGSTSGISFSRILSRQASHQDATQHTDAQQAEIQQAAEDSSPGAEVQKFVQSTDEMSA 60 Query: 61 ALAQFRNRRDYEKKSSNLSNSFERVLEDEALPKAKQILKLISVHGGALEDFLRQARSLFP 120 ALAQFRNRRDYEKKSSNLSNSFERVLEDEALPKAKQILKLISVHGGALEDFLRQARSLFP Sbjct: 61 ALAQFRNRRDYEKKSSNLSNSFERVLEDEALPKAKQILKLISVHGGALEDFLRQARSLFP 120 Query: 121 DPSDLVLVLRELLRRKDLEEIVRKKLESLLKHVEEQTDPKTLKAGINCALKARLFGKTLS 180 DPSDLVLVLRELLRRKDLEEIVRKKLESLLKHVEEQTDPKTLKAGINCALKARLFGKTLS Sbjct: 121 DPSDLVLVLRELLRRKDLEEIVRKKLESLLKHVEEQTDPKTLKAGINCALKARLFGKTLS 180 Query: 181 LKPGLLRASYRQFIQSESHEVEIYSDWIASYGYQRRLVVLDFIEGSLLTDIDANDASCSR 240 LKPGLLRASYRQFIQSESHEVEIYSDWIASYGYQRRLVVLDFIEGSLLTDIDANDASCSR Sbjct: 181 LKPGLLRASYRQFIQSESHEVEIYSDWIASYGYQRRLVVLDFIEGSLLTDIDANDASCSR 240 Query: 241 LEFGQLLRRLTQLKMLRSADLLFVSTLLSYSFTKAFNAEESSWLLLMLSLLQQPHEVDSL 300 LEFGQLLRRLTQLKMLRSADLLFVSTLLSYSFTKAFNAEESSWLLLMLSLLQQPHEVDSL Sbjct: 241 LEFGQLLRRLTQLKMLRSADLLFVSTLLSYSFTKAFNAEESSWLLLMLSLLQQPHEVDSL 300 Query: 301 LADIIGLNALLLSHKEHASFLQIFYQVCKAIPSSLFYEEYWQEELLMALRSMTDIAYKHE 360 LADIIGLNALLLSHKEHASFLQIFYQVCKAIPSSLFYEEYWQEELLMALRSMTDIAYKHE Sbjct: 301 LADIIGLNALLLSHKEHASFLQIFYQVCKAIPSSLFYEEYWQEELLMALRSMTDIAYKHE 360 Query: 361 MAEQRRTIEKLS 372 MAEQRRTIEKLS Sbjct: 361 MAEQRRTIEKLS 372
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 576 bits (1485), Expect = 0.0 Identities = 169/540 (31%), Positives = 271/540 (50%), Gaps = 57/540 (10%) Query: 4 HILLARVLACAALVLVTPGYSSE----KIPVTGSGFVAKDDSLRTFFDAMALQLKEPVIV 59 H RVL L+L + ++ E IP +VAK +SLR V+V Sbjct: 6 HSFFKRVLTGTLLLLSSYSWAQELDWLPIPYV---YVAKGESLRDLLTDFGANYDATVVV 62 Query: 60 SKMAARKKITGNFEFHDPNALLEKLSLQLGLIWYFDGQAIYIYDASEMRNAVVSLRNVSL 119 S K++G FE +P L+ ++ L+WY+DG +YI+ SE+ + ++ L+ Sbjct: 63 SD-KINDKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEA 121 Query: 120 NEFNNFLKRSGLYNKNYPLRGDNRKGTFYVSGPPVYVDMVVNAATMMDKQND--GIELGR 177 E L+RSG++ + R D YVSGPP Y+++V A +++Q + G Sbjct: 122 AELKQALQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGA 181 Query: 178 QKIGVMRLNNTFVGDRTYNLRDQKMVIPGIATAIERLLQGEEQPLGNIVSSEPPAMPAFS 237 I + L DRT + RD ++ PG+AT ++R+L + + P Sbjct: 182 LAIEIFPLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIP------ 235 Query: 238 ANGEKGKAANYAGGMSLQEALKQNAAAGNIKIVAYPDTNSLLVKGTAEQVHFIEMLVKAL 297 Q A + +A A ++ A P N+++V+ + E++ + L+ AL Sbjct: 236 -----------------QAATRASAQA---RVEADPSLNAIIVRDSPERMPMYQRLIHAL 275 Query: 298 DVAKRHVELSLWIVDLNKSDLERLGTSWSGSI-----------TIGDKLGVSLNQSSIST 346 D +E++L IVD+N L LG W I T GD+ ++ N + S Sbjct: 276 DKPSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQVVIKTTGDQSNIASNGALGSL 335 Query: 347 LDG---SRFIAAVNALEEKKQATVVSRPVLLTQENVPAIFDNNRTFYTKLIGERNVALEH 403 +D +A VN LE + A VVSRP LLTQEN A+ D++ T+Y K+ G+ L+ Sbjct: 336 VDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDHSETYYVKVTGKEVAELKG 395 Query: 404 VTYGTMIRVLPRFSADG---QIEMSLDIEDGNDKTPQSDTTTSVDALPEVGRTLISTIAR 460 +TYGTM+R+ PR G +I ++L IEDGN Q ++ ++ +P + RT++ T+AR Sbjct: 396 ITYGTMLRMTPRVLTQGDKSEISLNLHIEDGN----QKPNSSGIEGIPTISRTVVDTVAR 451 Query: 461 VPHGKSLLVGGYTRDANTDTVQSIPFLGKLPLIGSLFRYSSKNKSNVVRVFMIEPKEIVD 520 V HG+SL++GG RD + + +P LG +P IG+LFR S+ VR+F+IEP+ I + Sbjct: 452 VGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRIIDE 511
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 83.7 bits (207), Expect = 2e-20 Identities = 55/217 (25%), Positives = 92/217 (42%), Gaps = 31/217 (14%) Query: 1 MQIIITGGGGFLGQKLASALLNSSL------AFNELLLVDLKMPARLS--DSPRLRCLEA 52 M+ ++TG GF+G ++ LL + N+ V LK ARL P + + Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQ-ARLELLAQPGFQFHKI 59 Query: 53 DLT-QPGVLESVITANTSVVYHLAA-------IVSSHAEDDFDLGWKVNLDLTRQLLEAC 104 DL + G+ + + + V+ + + HA D NL +LE C Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYAD------SNLTGFLNILEGC 113 Query: 105 RRQPQKIRFVFSSSLAVYGG--TLPECVTDTTALTPRSSYGAQKAACELLVNDYTRKGYV 162 R + +++SS +VYG +P D+ P S Y A K A EL+ + Y+ + Sbjct: 114 RHNKIQ-HLLYASSSSVYGLNRKMPFSTDDSVD-HPVSLYAATKKANELMAHTYSHLYGL 171 Query: 163 DGLALRLPTICVRPGKPNRAASSFVSAIIREPLQGET 199 LR T+ G+P+ A F A+ L+G++ Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAM----LEGKS 204
>BONTOXILYSIN#Bontoxilysin signature. Length = 1196 Score = 29.5 bits (66), Expect = 0.023 Identities = 13/51 (25%), Positives = 24/51 (47%), Gaps = 3/51 (5%) Query: 298 YEPINGTDQLNVAVKRITSLHKNMNKVYGQRTDTASFDVMNQQGSMEDVLD 348 Y + TD +N++ + ++ + +KVY Q+ D V+ E LD Sbjct: 1077 YLSLKNTDGINISSVKFKLINIDESKVYVQKWDECIICVL---DGTEKYLD 1124
>PF07675#Cleaved Adhesin Length = 1358 Score = 30.4 bits (68), Expect = 0.023 Identities = 19/81 (23%), Positives = 34/81 (41%), Gaps = 10/81 (12%) Query: 217 KTTVVIP---PQND--IDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNK-----KT 266 T +P PQN + A+ ++VAI+++G L G + G++ + K Sbjct: 249 TNTYTLPASLPQNQASYSIQASAGSYVAISKDGVLYGTGVANASGVATVNMTKQITENGN 308 Query: 267 YARTASEFGYLPLEHTLAVAE 287 Y + YLP+ + E Sbjct: 309 YDVVITRSNYLPVIKQIQAGE 329
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 29.3 bits (65), Expect = 0.036 Identities = 31/132 (23%), Positives = 51/132 (38%), Gaps = 9/132 (6%) Query: 211 GYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLA-----GEGN 265 P L N + A+ +E K YE+GK I+L + + ++ + + Sbjct: 147 RETPKLIINIKDYAINSEQSKEVYYEIGKGISLDIISKDKSLDPEFLNLIKSLSDDSDSS 206 Query: 266 KAFTSEEFTHFLEELTKQYPIVSIEDGLDESDW---DGFAYQTKVLG-DKIQLVGDDLFV 321 S++F LE K I I++ L E F+Y ++L D+F Sbjct: 207 DLLFSQKFKEKLELNNKSIDINFIKENLTEFQHAFSLAFSYYFAPDHRTVLELYAPDMFE 266 Query: 322 TNTKILKEGIEK 333 K+ K G EK Sbjct: 267 YMNKLEKGGFEK 278
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 703 bits (1816), Expect = 0.0 Identities = 217/875 (24%), Positives = 375/875 (42%), Gaps = 73/875 (8%) Query: 12 PIACGVGMLLSVSPYSASGKDIEFNTDFLDVKNRDNVNIAQFSRKGFILPGVYLLQIKIN 71 +AC + +P S++ ++ FN FL + ++++F + PG Y + I +N Sbjct: 31 FVACA---FAAQAPLSSA--ELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLN 85 Query: 72 GQTLPQEFPVNWVIPEHDPQGSEVCAEPELVTQLGIKPELAEKLVWITHGERQCLAPDSL 131 + V + QG C + +G+ + + + C+ S+ Sbjct: 86 NGYMATR-DVTFN-TGDSEQGIVPCLTRAQLASMGLNTASVSGMNLL--ADDACVPLTSM 141 Query: 132 -KGMDFQADLGHSTLLVNLPQAYMEYSDVDWDPPARWDNGIPGIILDYNINNQLRHDQES 190 Q D+G L + +PQA+M + PP WD GI +L+YN + ++ Sbjct: 142 IHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIG 201 Query: 191 GSEEQSISGNGTLGANLGAWRLRADWQASYDHRDDDENTSTLHDQSWSRYYAYRALPTLG 250 G+ N G N+GAWRLR + SY+ D + + R + L Sbjct: 202 GNS-HYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKN--KWQHINTWLERDIIPLR 258 Query: 251 AKLTLGESYLQSDVFDSFNYIGASVVSDDQMLPPKLRGYAPEIVGIARSNAKVKVSWQGR 310 ++LTLG+ Y Q D+FD N+ GA + SDD MLP RG+AP I GIAR A+V + G Sbjct: 259 SRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGY 318 Query: 311 VLYETQVPAGPFRIQDLNQ-SVSGTLHVTVEEQNGQTQEFDVNTASVPFLTRPGMVRYKM 369 +Y + VP GPF I D+ SG L VT++E +G TQ F V +SVP L R G RY + Sbjct: 319 DIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSI 378 Query: 370 ALGRPQDWDHHPITGTFASAEASWGVTNGWSLYGGAIGESNYQAVALGSGKDLGVVGAVA 429 G + + F + G+ GW++YGG Y+A G GK++G +GA++ Sbjct: 379 TAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALS 438 Query: 430 VDITHSIAHMPQDDGFDGETLQGNSYRISYSRDFDEIDSRLTFAGYRFSEKNFMSMSDYL 489 VD+T + + +P D G S R Y++ +E + + GYR+S + + +D Sbjct: 439 VDMTQANSTLPDD-----SQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTT 493 Query: 490 DAKT--YHHLNA-----------------GHEKERYTVTYNQNFREQGMSAYFSYSRSTF 530 ++ Y+ +++ + +T Q + Y S S T+ Sbjct: 494 YSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTS-TLYLSGSHQTY 552 Query: 531 WDSPDQS-NYNLSLSWYFDLGSIKNLSASLNGYRSEYNGDKDDGVYISLSVPWG------ 583 W + + + L+ F+ + + S + ++ + +D + +++++P+ Sbjct: 553 WGTSNVDEQFQAGLNTAFEDIN---WTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSD 609 Query: 584 ------NDSISYNGT-FNGSQHRNQLGYSGH--SQNGDNWQLHVG-----QDEQGAQADG 629 + S SY+ + + N G G N ++ + G G+ Sbjct: 610 SKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYA 669 Query: 630 YYSHQGALTDIDLSADYEEGSYRSLGMSLRGGMTLTTQGGALHRGSLAGSTRLLVDTDGI 689 +++G + ++ + + + L + GG+ G L + T +LV G Sbjct: 670 TLNYRGGYGNANIGYSHSDD-IKQLYYGVSGGVLAHANGVTLGQPL--NDTVVLVKAPGA 726 Query: 690 ADVPVSGNDSPTSTNIFGKAVIADVGSYSRSLARIDLNKLPEKAEATKSVVQITLTEGAI 749 D V N + T+ G AV+ Y + +D N L + + +V + T GAI Sbjct: 727 KDAKV-ENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAI 785 Query: 750 GYRHFDVVSGEKMMAVFRLADGDFPPFGAEVKNERQQQLGLVANDGNAWLAGVKAGETLK 809 F G K++ + PFGA V +E Q G+VA++G +L+G+ ++ Sbjct: 786 VRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQ 844 Query: 810 VFW--DGAAQCEA--SLPPTFTPELLANALLLPCK 840 V W + A C A LPP +LL L C+ Sbjct: 845 VKWGEEENAHCVANYQLPPESQQQLL-TQLSAECR 878
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 28.4 bits (63), Expect = 0.003 Identities = 16/68 (23%), Positives = 31/68 (45%), Gaps = 2/68 (2%) Query: 20 GQVIALKKMLDEPHECAAVLQQIAAIRGAVNGLMREVIKGHLTEHIVHQSDEVRREEDLD 79 + +LD+ A +Q+ +A G N E I+ + E + +++ +EDL Sbjct: 198 AFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIAELLQETPEDITDQEDL- 256 Query: 80 VILKVLDS 87 + + LDS Sbjct: 257 -LDRGLDS 263
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 626 bits (1617), Expect = 0.0 Identities = 230/856 (26%), Positives = 380/856 (44%), Gaps = 66/856 (7%) Query: 19 SQATEFNASLLDSGNLSNVDLTAFSREGYVAPGNYILDIWLNDQPVREQYPVRVVPVAGL 78 S FN L + DL+ F + PG Y +DI+LN+ + + V Sbjct: 44 SAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYMATRD-VTFNTGDSE 102 Query: 79 DAAVICVTTDMVAMLGLKDKIIHGLKPVTGIPDGQCLELRSA--DSQVRYSAENQRLTFI 136 V C+T +A +GL + + + D C+ L S D+ + QRL Sbjct: 103 QGIVPCLTRAQLASMGLNTA---SVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLT 159 Query: 137 IPQAWMRYQDPDWVPPSRWSDGVTAGLLDYSLMVNRYMPQQGETSTSYSLYGTAGFNLGA 196 IPQA+M + ++PP W G+ AGLL+Y+ N + G S L +G N+GA Sbjct: 160 IPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGA 219 Query: 197 WRLRSDYQYSRFDS-GQGASQSDFYLPQTYLFRALPALRSKLTLGQTYLSSAIFDSFRFA 255 WRLR + +S S S++ + T+L R + LRS+LTLG Y IFD F Sbjct: 220 WRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFR 279 Query: 256 GLTLASDERMLPPSLQGYAPKISGIANSNAQVTVSQNGRILYQTRVSPGPFELPDLSQ-N 314 G LASD+ MLP S +G+AP I GIA AQVT+ QNG +Y + V PGPF + D+ Sbjct: 280 GAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAG 339 Query: 315 ISGNLDVSVRESDGSVRTWQVNTASVPFMARQGQVRYKVAAGRPLYGGTHNNSTVSPDFL 374 SG+L V+++E+DGS + + V +SVP + R+G RY + AG G N P F Sbjct: 340 NSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSG---NAQQEKPRFF 396 Query: 375 LGEATWGAFNNTSLYGGLIASTGDYQSAALGIGQNMGLLGALSADVTRSDARLPHGQKQS 434 G ++YGG + Y++ GIG+NMG LGALS D+T++++ LP + Sbjct: 397 QSTLLHGLPAGWTIYGGTQLA-DRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHD 455 Query: 435 GYSYRINYAKTFDKTGSTLAFVGYRFSDRHFLSMPEYLQRRTTDGGD------------- 481 G S R Y K+ +++G+ + VGYR+S + + + R Sbjct: 456 GQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKF 515 Query: 482 ------AWHEKQSYTVTYSQSVPVLNMSAALSVSRLNYWNAQ-SNNNYMLSLNKVFSLGD 534 A++++ +T +Q + + LS S YW + + LN F Sbjct: 516 TDYYNLAYNKRGKLQLTVTQQLG-RTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAF---- 570 Query: 535 LQGLPASVSFARNQYTGG-GSQNQVYATISIPWGDSR-----------QVSYSVQKDNRG 582 + + ++S++ + G + ++IP+ SYS+ D G Sbjct: 571 -EDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNG 629 Query: 583 GLQQTVNYSD--FHNPDTTWNISAGHNRYDTGSN-SSFSGSVQSRLPWGQAAADATLQPG 639 + + + ++++ G+ G++ S+ ++ R +G A + Sbjct: 630 RMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDD 689 Query: 640 QYRSLGLSWYGSVTATAHGAAFSQSMAGNEPRMMIDTGDVAGVPVNGNSGV-TNRFGVGV 698 + L G V A A+G Q + N+ +++ V +GV T+ G V Sbjct: 690 -IKQLYYGVSGGVLAHANGVTLGQPL--NDTVVLVKAPGAKDAKVENQTGVRTDWRGYAV 746 Query: 699 VSAGSSYRRSDISVDVAALPEDVDVSSSVISQVLTEGAVGYRQIDANQGEQVLGHIRLAD 758 + + YR + +++D L ++VD+ ++V + V T GA+ + A G ++L + + Sbjct: 747 LPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLT-HN 805 Query: 759 GASPPFGALVVSGKTGRTAGMVGDGGLAYLTGLSGEDRRTLNVSW--DGRVQCRLTLPET 816 PFGA+V S +++G+V D G YL+G+ + V W + C Sbjct: 806 NKPLPFGAMVTSES-SQSSGIVADNGQVYLSGM--PLAGKVQVKWGEEENAHCVANYQLP 862 Query: 817 VTLSRGPL---LLPCR 829 + L CR Sbjct: 863 PESQQQLLTQLSAECR 878
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 96.1 bits (239), Expect = 1e-27 Identities = 53/183 (28%), Positives = 77/183 (42%), Gaps = 17/183 (9%) Query: 1 MNKMLLAGSAGIVLLSAAASPVWADDNASTFSLGYAQSH-TNHAGTLRGVRLANNYEMSP 59 M K+ SA +L+ A A ST + GYAQS + G L YE Sbjct: 1 MKKIACL-SALAAVLAFTAGTSVAA--TSTVTGGYAQSDAQGQMNKMGGFNLKYRYEEDN 57 Query: 60 D-WGLTTSFAWLNGSQRYSDESSNGRVTTRYYSLLAGPSWKINNQLSLYSQVGPVLLHQR 118 G+ SF + S SS +YY + AGP+++IN+ S+Y VG + Sbjct: 58 SPLGVIGSFTYTEKS---RTASSGDYNKNQYYGITAGPAYRINDWASIYGVVGVGYGKFQ 114 Query: 119 DH---GINESDSKVGYGYSAGVAYTPVSNVAITLGYEGADFDATHNSGSLNSNGFNLGVG 175 S G+ Y AG+ + P+ NVA+ YE + S++ + GVG Sbjct: 115 TTEYPTYKHDTSDYGFSYGAGLQFNPMENVALDFSYEQS------RIRSVDVGTWIAGVG 168 Query: 176 YRF 178 YRF Sbjct: 169 YRF 171
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.0 bits (67), Expect = 0.030 Identities = 21/108 (19%), Positives = 34/108 (31%), Gaps = 5/108 (4%) Query: 231 DEWISRFKSQYEQGYANVYRQRIARLKKLGFLRDDIPLPGLELDKEWQAMTPEQQKYTAK 290 + + R Q Y N+ L+K R ++P E ++ W M + Sbjct: 580 NPYAFRRIKDGGQLYLNLENYTYYALRKGASTRSELPKNSGESNENWLYMGKTSDEAKRN 639 Query: 291 VM-----QVYAAMIANMDAQIGTVIETLKKTGRDKNTILVFLSDNGVN 333 VM + + G L T + K+ FL G N Sbjct: 640 VMNHINNERMNGFNGYFGEEEGKNNGNLNVTFKGKSEQNRFLLTGGTN 687
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 49.7 bits (118), Expect = 2e-08 Identities = 40/238 (16%), Positives = 76/238 (31%), Gaps = 8/238 (3%) Query: 197 PNNAFDAEGLTKLTQETERRRRERNEVEQDVEVAVREKNRDALERKLEIEQQEAFMTLEQ 256 N A+ + + E R A + E E +QE+ + Sbjct: 999 TPNNIQAD-VPSVPSNNEEIARVDEAPVPPPAPATPSETT---ETVAENSKQESKTVEKN 1054 Query: 257 EQQVKTRTAEQNAKIAAFEAERHREAE-QTRILAERQIQETEIEREQAVRSRKVEAEREV 315 EQ TA+ + A EA+ + +A QT +A+ + E + + + VE E + Sbjct: 1055 EQDATETTAQN--REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKA 1112 Query: 316 RIKEIEQQQVTEIANQTKSIAIAAKSEQQSQAEARANDALADAVRAQ-QNVETTRQTAEA 374 +++ + Q+V ++ +Q +++ Q AR ND + Q Q T A Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPA 1172 Query: 375 DRAKQVALIAAAQDAETKAVELTVRAKAEKEAAELQAAAIIELAEATRKKGLAEAEAQ 432 + V A Q E + + + + Sbjct: 1173 KETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSV 1230
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 29.0 bits (65), Expect = 0.027 Identities = 10/37 (27%), Positives = 20/37 (54%) Query: 347 GVFDILHAGHVSYLANARKLGDRLIVAVNSDASTKRL 383 G FD + GH+ + +L D++ VAV + + + + Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPM 43
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 69.7 bits (170), Expect = 3e-15 Identities = 34/187 (18%), Positives = 65/187 (34%), Gaps = 32/187 (17%) Query: 90 GLGSGVIIDAAKGYVLTNNHVINQAQKISIQL------------NDGREFDAKLIGGDDQ 137 + SGV++ K +LTN HV++ L +G ++ + Sbjct: 102 FIASGVVV--GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 138 SDIALLQIQN-------PSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIISALG 190 D+A+++ + ++++ + +V G P ++ + Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VATMW 212 Query: 191 RSGLNLEGLEN-FIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSIGIGFAIPSN 249 S + L+ +Q D S GNSG + N E+IGI+ I N Sbjct: 213 ESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW---GGVPNEFNGAVFINEN 269 Query: 250 MAQTLAQ 256 + L Q Sbjct: 270 VRNFLKQ 276
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 53.5 bits (128), Expect = 4e-10 Identities = 31/160 (19%), Positives = 59/160 (36%), Gaps = 26/160 (16%) Query: 77 RTLGSGVIMDQRGYIITNKHVINDADQIIVALQ------------DGRVFEALLVGSDSL 124 + SGV++ + ++TNKHV++ AL+ +G + Sbjct: 101 TFIASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 125 TDLAVLKI-------NATGGLPTIPINTKRTPHIGDVVLAIGNPYNLGQTITQGIISATG 177 DLA++K + + ++ + + G P + T + G Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKG 216 Query: 178 RIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINT 217 +I + +Q D S GNSG + N E++GI+ Sbjct: 217 KI---TYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253
>DNABINDNGFIS#DNA-binding protein FIS signature. Length = 98 Score = 157 bits (399), Expect = 3e-54 Identities = 98/98 (100%), Positives = 98/98 (100%) Query: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 Query: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 129 bits (325), Expect = 2e-39 Identities = 83/216 (38%), Positives = 130/216 (60%), Gaps = 3/216 (1%) Query: 1 MAKKTKADALKTRQHLIETAIAQFALRGVANTTLNDIADAADVTRGAIYWHFENKTQLFN 60 MA+KTK +A +TRQH+++ A+ F+ +GV++T+L +IA AA VTRGAIYWHF++K+ LF+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EVW-LQQPPLRELIQDRLTGCWNDNPLQDLREKFIAALQYIAAVPRQQALMQILYHKCEF 119 E+W L + + EL + +PL LRE I L+ R++ LM+I++HKCEF Sbjct: 61 EIWELSESNIGELELEYQAKF-PGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119 Query: 120 HNGM-ISEQAIREKIGFHHQSLLEVLQRCMDKKLISGSLDLDVILIILHGSFSGIVKNWL 178 M + +QA R + + + L+ C++ K++ L II+ G SG+++NWL Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179 Query: 179 MNPTSYDLYKQAPALVDNLLKMLSPDGSVRQLMPNE 214 P S+DL K+A V LL+M ++R NE Sbjct: 180 FAPQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1386 bits (3590), Expect = 0.0 Identities = 914/1032 (88%), Positives = 972/1032 (94%) Query: 1 MANFFIRRPIFAWVLAIILMMAGALAIMQLPVAQYPTIAPPAVSISATYPGADAQTVQDT 60 MANFFIRRPIFAWVLAIILMMAGALAI+QLPVAQYPTIAPPAVS+SA YPGADAQTVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGISVEKSSSSFLMVAGFVSDNPNTTQDDISDYVASNIKDSISRLNGVGDVQLFGA 180 EVQQQGISVEKSSSS+LMVAGFVSDNP TTQDDISDYVASN+KD++SRLNGVGDVQLFGA Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWLDANLLNKYQLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRL 240 QYAMRIWLDA+LLNKY+LTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 KDPEEFGKVTLRVNTDGSVVHLKDVARIELGGENYNVVARINGKPASGLGIKLATGANAL 300 K+PEEFGKVTLRVN+DGSVV LKDVAR+ELGGENYNV+ARINGKPA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTATAIKAKLAELQPFFPQGMKVVYPYDTTPFVKISIHEVVKTLFEAIILVFLVMYLFLQ 360 DTA AIKAKLAELQPFFPQGMKV+YPYDTTPFV++SIHEVVKTLFEAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NIRATLIPTIAVPVVLLGTFAVLAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N+RATLIPTIAVPVVLLGTFA+LAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 MEDNLSPREATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTRAIYRQFSITIVSAMAL 480 MED L P+EATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGST AIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATLLKPVSAEHHEKKSGFFGWFNTRFDHSVNHYTNSVSGIVRNTGRY 540 SVLVALILTPALCATLLKPVSAEHHE K GFFGWFNT FDHSVNHYTNSV I+ +TGRY Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 LIIYLLIVVGMAVLFLRLPTSFLPEEDQGVFLTMIQLPSGATQERTQKVLDQVTHYYLNN 600 L+IY LIV GM VLFLRLP+SFLPEEDQGVFLTMIQLP+GATQERTQKVLDQVT YYL N Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKANVESVFTVNGFSFSGQGQNSGMAFVSLKPWEERNGEENSVEAVIARATRAFSQIRDG 660 EKANVESVFTVNGFSFSGQ QN+GMAFVSLKPWEERNG+ENS EAVI RA +IRDG Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 LVFPFNMPAIVELGTATGFDFELIDQGGLGHDALTKARNQLLGMVAKHPDLLVRVRPNGL 720 V PFNMPAIVELGTATGFDFELIDQ GLGHDALT+ARNQLLGM A+HP LV VRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 721 EDTPQFKLDVDQEKAQALGISLSDINETISAALGGYYVNDFIDRGRVKKVYVQADAQFRM 780 EDT QFKL+VDQEKAQALG+SLSDIN+TIS ALGG YVNDFIDRGRVKK+YVQADA+FRM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 781 LPGDINNLYVRSANGEMVPFSTFSSARWIYGSPRLERYNGMPSMELLGEAAPGRSTGEAM 840 LP D++ LYVRSANGEMVPFS F+++ W+YGSPRLERYNG+PSME+ GEAAPG S+G+AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 841 SLMENLASQLPNGIGYDWTGMSYQERLSGNQEPALYAISLIVVFLCLAALYESWSIPFSV 900 +LMENLAS+LP GIGYDWTGMSYQERLSGNQ PAL AIS +VVFLCLAALYESWSIP SV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 901 MLVVPLGVVGALLAASLRGLNNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMEKEGRGLI 960 MLVVPLG+VG LLAA+L NDVYF VGLLTTIGLSAKNAILIVEFAKDLMEKEG+G++ Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 961 EATLEASRMRLRPILMTSLAFILGVMPLVISRGAGSGAQNAVGTGVMGGMLTATLLAIFF 1020 EATL A RMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGM++ATLLAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1021 VPVFFVVVKRRF 1032 VPVFFVV++R F Sbjct: 1021 VPVFFVVIRRCF 1032
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 42.0 bits (98), Expect = 5e-06 Identities = 30/176 (17%), Positives = 56/176 (31%), Gaps = 5/176 (2%) Query: 146 ANATQPAPGATSAEQTAGNTSQDISLPPISSTPTQGQSPVVADGQQRVEVQGDLNNALTQ 205 N Q + + + +PP + + VA+ ++ + N Sbjct: 1000 PNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDAT 1059 Query: 206 NPEQMNNVAVN---STLPTEPATVAPVRNGSTTRQAAVSEPTERHTTRPERKQAVIKPKK 262 N S + T ++GS T++ +E E T E K V K Sbjct: 1060 ETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKT 1119 Query: 263 PQTTAKTTTAEPKKPVAPVKRTEPAAPAATPKATTTTTAPQATASAAPVQTAKPAQ 318 + T+ PK+ + + +P A A T + + T +PA+ Sbjct: 1120 QEVPKVTSQVSPKQEQS--ETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAK 1173
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 32.5 bits (74), Expect = 6e-04 Identities = 27/91 (29%), Positives = 40/91 (43%), Gaps = 18/91 (19%) Query: 32 FYDSDQEIEKRTGADVGWVFDVEGEDGFRN----------REEKVINELTEKQGIVLATG 81 FYD + KR + GW+ + G+R E + I +L E+ IV+A+G Sbjct: 136 FYDEETA--KRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKLVERGVIVIASG 193 Query: 82 GGSVKSRETRNRLSARGVVVYLETTIEKQLA 112 GG V + +GV E I+K LA Sbjct: 194 GGGVPVILEDGEI--KGV----EAVIDKDLA 218
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 269 bits (688), Expect = 2e-86 Identities = 82/301 (27%), Positives = 134/301 (44%), Gaps = 18/301 (5%) Query: 117 LENRSINLQYADAGELAKAGEKLLSAKGTIMVDKRTNRLLLRDNRAALAELEKWVSQMDL 176 L + +I D + +A SA+ + D N +++RD+ + ++ + +D Sbjct: 219 LSDATIQQVTVDNQRIPQAAT-RASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDK 277 Query: 177 PVAQVELAAHIVTINEKSLRELGVKWTLADATQAGAVGDVATLSSDLSVAAATSRVGFNI 236 P A++E+A IV IN L ELGV W + T + T ++A+ G Sbjct: 278 PSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQVVIKTTGDQSNIASN----GALG 333 Query: 237 GRINGRLLDL---ELSALEQKQQLDIIASPRLLASHLQPASIKQGSEIPYQVSSGESGAT 293 ++ R LD ++ LE + +++ P LL A I SE Y +G+ A Sbjct: 334 SLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDH-SETYYVKVTGKEVA- 391 Query: 294 SVEFKEAVLG--MEVTPTVLQKG---RIRLKLHISQNVPGQVLQQADGEVLAIDKQEIET 348 E K G + +TP VL +G I L LHI +G + I + ++T Sbjct: 392 --ELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIEG-IPTISRTVVDT 448 Query: 349 QVEVKSGETLALGGIFSRKNKSGSDSVPLLGDIPWLGQLFRHDGKEDERRELVVFITPRL 408 V G++L +GGI+ + VPLLGDIP++G LFR + R + I PR+ Sbjct: 449 VARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRI 508 Query: 409 V 409 + Sbjct: 509 I 509 Score = 29.5 bits (66), Expect = 0.032 Identities = 18/98 (18%), Positives = 34/98 (34%), Gaps = 4/98 (4%) Query: 1 MKRWIAIILIALMPAAQAG----KAAKVTLVVDDVPVVQVLQALAEQERQNLVVSPDVSG 56 KR + L+ L + A V + +L +VVS ++ Sbjct: 9 FKRVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKIND 68 Query: 57 TLSLHLTDVPWKQALQTVVNSAGLVLRQEGNILHVHSQ 94 +S + LQ + + LV +GN+L++ Sbjct: 69 KVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKN 106
>PF06580#Sensor histidine kinase Length = 349 Score = 31.8 bits (72), Expect = 0.006 Identities = 27/188 (14%), Positives = 71/188 (37%), Gaps = 45/188 (23%) Query: 270 INKDIEECNAIIEQFIDYLR------TGQEMPM--EMADLNSVL-------GEVIAAESG 314 I +D + ++ + +R +++ + E+ ++S L + + Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQ---- 241 Query: 315 YEREINTALQAGSIQVKMHPLSIKRAVANMVVNA--ARYGNGWIKVSSGTESHRAWFQVE 372 +E +IN A+ V++ P+ ++ V N + + G I + ++ +VE Sbjct: 242 FENQINPAIM----DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVE 297 Query: 373 DDGPGIKLEQRKHLFQPFVRGDSARSTSGTGLGLAIV-QRIIDNH--NGMLEIGTSERGG 429 + G ++ TG GL V +R+ + +++ + ++G Sbjct: 298 NTGSLALKNTKE----------------STGTGLQNVRERLQMLYGTEAQIKL-SEKQGK 340 Query: 430 LSIRAWLP 437 ++ +P Sbjct: 341 VNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 98.8 bits (246), Expect = 6e-26 Identities = 39/136 (28%), Positives = 72/136 (52%), Gaps = 3/136 (2%) Query: 11 KILVVDDDMRLRALLERYLTEQGFQVRSVANAEQMDRLLTRESFHLMVLDLMLPGEDGLS 70 ILV DDD +R +L + L+ G+ VR +NA + R + L+V D+++P E+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 71 ICRRLRSQSNPMPIIMVTAKGEEVDRIVGLEIGADDYIPKPFNPRELLARIRAVL---RR 127 + R++ +P+++++A+ + I E GA DY+PKPF+ EL+ I L +R Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 128 QANELPGAPSQEEAVI 143 + ++L ++ Sbjct: 125 RPSKLEDDSQDGMPLV 140
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 41.8 bits (98), Expect = 9e-06 Identities = 42/142 (29%), Positives = 66/142 (46%), Gaps = 30/142 (21%) Query: 1 MKKLTIGLIGNPNSGKTTLFNQL---TGARQRVGNW-AGVTV------ERKEG---QFAT 47 MK + IG++ + ++GKTTL L +GA +G+ G T ER+ G Q Sbjct: 1 MKIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGI 60 Query: 48 T-----DHQVTLVDLPGTYSLTTISSQTSLDEQIACHYILSGDADLLINVVDASNLE-RN 101 T + +V ++D PG + SL +L G A LLI+ D + R Sbjct: 61 TSFQWENTKVNIIDTPG-HMDFLAEVYRSLS-------VLDG-AILLISAKDGVQAQTRI 111 Query: 102 LYLTLQLLELGIPCIVALNMLD 123 L+ L+ ++GIP I +N +D Sbjct: 112 LFHALR--KMGIPTIFFINKID 131
>PF06580#Sensor histidine kinase Length = 349 Score = 27.5 bits (61), Expect = 0.038 Identities = 12/26 (46%), Positives = 16/26 (61%), Gaps = 5/26 (19%) Query: 96 AKMRKVADDAPLMERVEYALQSQINP 121 KM +A +A LM AL++QINP Sbjct: 152 WKMASMAQEAQLM-----ALKAQINP 172
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.2 bits (68), Expect = 0.044 Identities = 18/115 (15%), Positives = 42/115 (36%), Gaps = 6/115 (5%) Query: 534 QSEIQFAQGFLQAAWETQERAFQLIKEQHLEQLPMHEFLVRIRAQLL------WAWARLD 587 +++ Q L A Q R L + L +LP + Q + + + Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192 Query: 588 EAEASARSGIAVLSTFQPQQQLQCLTLLVQCSLARGDLDNARSQLNRLENLLGNG 642 E ++ ++ +++ + LT+L + + +S+L+ +LL Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQ 247
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.0 bits (67), Expect = 0.024 Identities = 15/114 (13%), Positives = 34/114 (29%), Gaps = 2/114 (1%) Query: 17 DKEQKQEQTEEQQIVEEQRPVEPPVETAADVDAQTPAHSKAETEAFAEEVVDVTEKVQES 76 +++ K E + Q++ + V P E + V Q + + +E T ++ Sbjct: 1109 EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADT 1168 Query: 77 EKP-QPVEPEPAAAIETAAPQIAVEREELPLPEEVKDEAISPEEWQAEAETVEV 129 E+P + + + PE P + + Sbjct: 1169 EQPAKETSSNVEQPVTESTTVNTGNSVV-ENPENTTPATTQPTVNSESSNKPKN 1221
>SHIGARICIN#Ribosome inactivating protein family signature. Length = 289 Score = 26.7 bits (59), Expect = 0.027 Identities = 6/29 (20%), Positives = 16/29 (55%) Query: 7 FFIIIIALIVVAASFRFVQQRREKAANEA 35 +++I AA ++F++Q+ K ++ Sbjct: 173 ALMVLIQSTSEAARYKFIEQQIGKRVDKT 201
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 30.2 bits (68), Expect = 0.039 Identities = 17/78 (21%), Positives = 34/78 (43%), Gaps = 3/78 (3%) Query: 336 AEERRAPIERFIDRFSRIYTPVIMVIALLVTLIPPLMFDGGWQEWIYKGLTLLLIGCPCA 395 E++ P E S+I ++ + +L + P+ F GG IY+ ++ ++ A Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVS---A 477 Query: 396 LVISTPAAITSGLAAAAR 413 + +S A+ A A Sbjct: 478 MALSVLVALILTPALCAT 495
>PF01206#SirA family protein Length = 76 Score = 101 bits (254), Expect = 2e-32 Identities = 28/72 (38%), Positives = 42/72 (58%) Query: 9 DHTLDALGLRCPEPVMMVRKTVRNMQTGETLLIIADDPATTRDIPGFCTFMEHDLLAQET 68 D +LDA GL CP P++ +KT+ M GE L ++A DP + +D F H+LL Q+ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 69 EGLPYRYLLRKA 80 E Y + L++A Sbjct: 65 EDGTYHFRLKRA 76
>PF04183#IucA / IucC family Length = 580 Score = 27.9 bits (62), Expect = 0.035 Identities = 17/91 (18%), Positives = 28/91 (30%), Gaps = 14/91 (15%) Query: 121 LGQILDVHVFNRLRQNRRWWLAPTASTLFGNISDTLAFFFIAFWRSPDAFMAEHWMEIAL 180 LG I + L+ + +TL + + AE W+ Sbjct: 347 LGVIWRENPCRWLKPDES---PVLMATLMECDENNQPL--AGAYIDRSGLDAETWLT--- 398 Query: 181 VDYCFKVLISIIFFLPMYGVLL-----NMLL 206 V++ + L YGV L N+ L Sbjct: 399 -QLFRVVVVPLYHLLCRYGVALIAHGQNITL 428
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 48.3 bits (115), Expect = 3e-08 Identities = 75/403 (18%), Positives = 137/403 (33%), Gaps = 42/403 (10%) Query: 13 LRLNLRIVSIVMFNFASYLTIGLPLAVLPGYVHD--AMGFSAFWAGLIISLQYFATLLSR 70 ++ N ++ I+ + IGL + VLPG + D G++++L Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60 Query: 71 PHAGRYADVLGPKKIVVFGLCGCFLSGFGYLLADIASAWPMISLLLLGLGRVILGI-GQS 129 P G +D G + +++ L G + Y + A L +L +GR++ GI G + Sbjct: 61 PVLGALSDRFGRRPVLLVSLAG---AAVDYAIMATAPF-----LWVLYIGRIVAGITGAT 112 Query: 130 FAGTGSTLWGVGVVGSLHIGRVISWNGIVTYGAMAMGAPLGVLCYAWGGLQGLALTVMGV 189 A G+ + + R + M G LG L G Sbjct: 113 GAVAGAYIADITDGDER--ARHFGFMSACFGFGMVAGPVLGGLM----GGFSPHAPFFAA 166 Query: 190 ALLAILLAL----------PRPSVKANKGKPLPFRAVLGRVWLYGMALALA-----SAGF 234 A L L L + P + + +A +A Sbjct: 167 AALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVG 226 Query: 235 GVIATFITLFYDAK-GWDGAAFALTLFSVAFVGT---RLLFPNGINRLGGLNVAMICFGV 290 V A +F + + WD ++L + + + ++ RLG M+ Sbjct: 227 QVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIA 286 Query: 291 EIIGLLLVGTAAMPWMAKIGVLLTGMGFSLVFPALGVVAVKAVPPQNQGAALATYTVFMD 350 + G +L+ A WMA ++L + PAL + + V + QG + Sbjct: 287 DGTGYILLAFATRGWMAFPIMVLLA-SGGIGMPALQAMLSRQVDEERQGQLQGSLAALTS 345 Query: 351 MSLGVTGPLAGLVMTWAGVPV----IYLAAAGLVAMALLLTWR 389 ++ + GPL + A + ++A A L + L R Sbjct: 346 LT-SIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 34.2 bits (78), Expect = 2e-04 Identities = 30/116 (25%), Positives = 54/116 (46%), Gaps = 9/116 (7%) Query: 30 RRASWLAGRVLLSRALSPL---PEMVYGEQGKPAFSAGTPLWFNLSHSGDTIALLLSDEG 86 R+A LAGR+ AL + G++ +P + G L+ ++SH T ++S + Sbjct: 46 RKAEHLAGRIAAVHALREVGVRTVPGMGDKRQPLWPDG--LFGSISHCATTALAVISRQ- 102 Query: 87 EVGCDIEVIRPRDNWRSLANAVFSLGEHAEMEAERPERQLADFWRI-WTRKEAIVK 141 +G DIE I + LA ++ E ++A LA + ++ KE++ K Sbjct: 103 RIGIDIEKIMSQHTATELAPSIIDSDERQILQASLLPFPLAL--TLAFSAKESVYK 156
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 48.0 bits (114), Expect = 2e-08 Identities = 43/171 (25%), Positives = 73/171 (42%), Gaps = 7/171 (4%) Query: 200 REREHGTVEHLLVMPVTPFEIMMAKV-WSMGLVVLVVSGLSLMLMVKGVLGVPIEGSIPL 258 R T E +L + +I++ ++ W+ L +G + +V LG + L Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAG---IGVVAAALGY-TQWLSLL 148 Query: 259 FMLGV-ALSLFATTSIGIFMGTIARSMPQLGLLMILVLLPLQMLSGGSTPRESMPQAVQD 317 + L V AL+ A S+G+ + +A S LV+ P+ LSG P + +P Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 318 IMLTMPTTHFVSLAQAILYRGAGLSIVWPQFLTLLAIGGVFFL-IALLRFR 367 +P +H + L + I+ + + + I FFL ALLR R Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.1 bits (78), Expect = 8e-04 Identities = 65/374 (17%), Positives = 127/374 (33%), Gaps = 57/374 (15%) Query: 2 SVAQASYLITAYGITVTLAAWVTGVLVQTLGPRKVMFCGLVAFIIGS-IGFIGIGLKNMD 60 A +++ TA+ +T ++ V G L LG ++++ G++ GS IGF+G + Sbjct: 47 PPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVG----HSF 102 Query: 61 LVWMLPFYAIRGIGYPLFAYSFLIWINYSTPVARRSTAVGWFWFTFSLGLSVIGPFFSSI 120 ++ I+G G F ++ + P R A G ++G +GP + Sbjct: 103 FSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEG-VGPAIGGM 161 Query: 121 ALPVLGEIHVLWVGLLFVLIGSILGIWVNRDVVPASEIHP-------------------- 160 + W LL + + +I+ + ++ Sbjct: 162 IAHYIH-----WSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFML 216 Query: 161 --------------FSAGELLKGITILQRPIIAIGL-------VVKSVNGIAQYGLATFL 199 S +K I + P + GL + GI +A F+ Sbjct: 217 FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFV 276 Query: 200 PL--YLISYGYSKTEWLHMWSSVFLVAIFANLFFGFFGDKFGWRKTIMWVGGFGYAVVLL 257 + Y++ + + + S + + + FG+ G R+ ++V G + + Sbjct: 277 SMVPYMMKDVHQLSTAE-IGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSV 335 Query: 258 LVWAVPQLLGHNFYVMAF-VLCLCGVTMAGYVPLSALFPM-LAPDSKGAAMSVLNLGAGL 315 LL + M ++ + G +S + L GA MS+LN + L Sbjct: 336 SFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFL 395 Query: 316 GAFIAPAITALFYS 329 AI S Sbjct: 396 SEGTGIAIVGGLLS 409
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 28.5 bits (63), Expect = 0.049 Identities = 23/71 (32%), Positives = 33/71 (46%), Gaps = 9/71 (12%) Query: 7 INSFRSRPELFNADNKMDTC-----DLITRMGNVDGLTHIELNYPDHF---IGQDKKIIK 58 I F+ R L+N +N+MD C D I G G +N P+++ G+ K I Sbjct: 755 IEQFKERLALYNNNNRMDICVVRNTDDIKACGTAIG-NQSMVNNPENYKYLEGKAWKNIG 813 Query: 59 QCITDNGLKVS 69 T NG K+S Sbjct: 814 ISKTANGSKIS 824
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 28.3 bits (63), Expect = 0.043 Identities = 14/49 (28%), Positives = 19/49 (38%), Gaps = 3/49 (6%) Query: 254 YDAAIAHHADGIIYAGTGAGSVSVRSDAGIKKAEKAGIIVVRASRTGNG 302 AI D II G +KKA + I+V+ A+ GN Sbjct: 133 IYYAIEQKVD-IISMSLGGPEDVPELHEAVKKAVASQILVMCAA--GNE 178
>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature. Length = 591 Score = 29.3 bits (65), Expect = 0.016 Identities = 44/160 (27%), Positives = 63/160 (39%), Gaps = 30/160 (18%) Query: 93 DFFTRHHLLASVNVDGPTLIAMRRQPDILAAMERLPWLRFELV----EHIRLPKDSSFAS 148 DF+ H +++ G T A R D AA WL E V EHI ++ Sbjct: 157 DFWLLHDSNGILHLLGKT--AAARLSDPQAASHTAQWLVEESVTPAGEHI------YYSY 208 Query: 149 MCEFGPLWLDDFGTGMANFSA---LSEVRYDYIKVALELFVMLRQSAEGRNLFTLLLQLM 205 + E G + + SA LS+V+Y A +L++ + + LFTL+ Sbjct: 209 LAENGDNVDLNGNEAGRDRSAMRYLSKVQYGNATPAADLYLWTSATPAVQWLFTLVFDYG 268 Query: 206 NRYCRGVIVEGVETLEEWRDVQRSPAFAAQGYFLSRPVPL 245 R GV D Q PAF AQ +L+R P Sbjct: 269 ER--------GV-------DPQVPPAFTAQNSWLARQDPF 293
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.1 bits (78), Expect = 8e-05 Identities = 20/52 (38%), Positives = 26/52 (50%), Gaps = 5/52 (9%) Query: 76 VAPDALRHGIGKALL----EYVQQR-FPLLSLEVYQKNQSAVNFYHALGFRI 122 VA D + G+G ALL E+ ++ F L LE N SA +FY F I Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 116 bits (293), Expect = 1e-33 Identities = 43/124 (34%), Positives = 64/124 (51%), Gaps = 11/124 (8%) Query: 108 LNMPNNVTFDSSSATLKPAGANTLTGVAMVLKEY--PKTAVNVVGYTDSTGSHDLNMRLS 165 + ++V F+ + ATLKP G L + L +V V+GYTD GS N LS Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274 Query: 166 QQRADSVASSLITQGVDASRIRTSGMGPANPIASNSTAEGK---------AQNRRVEITL 216 ++RA SV LI++G+ A +I GMG +NP+ N+ K A +RRVEI + Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334 Query: 217 SPLQ 220 ++ Sbjct: 335 KGIK 338
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 43.8 bits (103), Expect = 4e-08 Identities = 26/121 (21%), Positives = 50/121 (41%), Gaps = 9/121 (7%) Query: 18 FFSSVHTIASHYYTREQIDAWAPADIDLERWANHIKELQPFVVELDGEIAGYADFQPN-- 75 F + V T +++ + D+D+ K F+ L+ G + N Sbjct: 30 FENGVWTYTEERFSKPYFKQYEDDDMDVSYVEEEGKAA--FLYYLENNCIGRIKIRSNWN 87 Query: 76 --GYIDHFFVSGTYSRQGVGILLMNCIHEEARQRGISEL---TSNVSKAAEVFFLRHGFH 130 I+ V+ Y ++GVG L++ E A++ L T +++ +A F+ +H F Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147 Query: 131 I 131 I Sbjct: 148 I 148
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 42.0 bits (98), Expect = 1e-06 Identities = 30/105 (28%), Positives = 47/105 (44%), Gaps = 17/105 (16%) Query: 134 TRRIPWNTLLERVDIIPTSMVATMAAAESGWGTSKLARSN----NNLFGMKCT---KGRC 186 + L + +P ++ AA ESGWG ++ R N NLFG+K + KG Sbjct: 154 AQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPV 213 Query: 187 T---------NTPGKVKG-YSQFASVEESVSAYVANLNTHPAYSS 221 T KVK + ++S E++S YV L +P Y++ Sbjct: 214 TEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYAA 258
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 26.6 bits (58), Expect = 0.009 Identities = 11/29 (37%), Positives = 18/29 (62%) Query: 7 FGAALAARVGTGVYRDFREAQRDLQHPVR 35 F A+A TG Y + ++AQ+DL+ +R Sbjct: 591 FNKAVADAKNTGNYDEVKKAQKDLEKSLR 619
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 52.9 bits (127), Expect = 2e-09 Identities = 35/106 (33%), Positives = 53/106 (50%), Gaps = 16/106 (15%) Query: 3 IATAGHVDHGKTTLLQAI---TGV------------NADRLPEEKKRGMTIDLGYAYWPQ 47 I HVD GKTTL +++ +G D E++RG+TI G + Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65 Query: 48 PDGRVLGFIDVPGHEKFLSNMLAGVGGIDHALLVVACDDGVMAQTR 93 + +V ID PGH FL+ + + +D A+L+++ DGV AQTR Sbjct: 66 ENTKV-NIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTR 110
>PF03895#Serum resistance protein DsrA. Length = 79 Score = 72.2 bits (177), Expect = 1e-17 Identities = 18/80 (22%), Positives = 38/80 (47%), Gaps = 2/80 (2%) Query: 1368 VENKMSGGIASAMAMAGLPQAYAPGANMTSIAGGTFNGESAVAIGV-SMVSESGGWVYKL 1426 + ++ G+A+ A++ L Q G S A G + ++A+AIGV S +++ + Sbjct: 1 LSKELQTGLANQSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITDRFTAKAGV 60 Query: 1427 QGTSNSQGDYSAAIGAGFQW 1446 + + G S G+++ Sbjct: 61 AFNTYN-GGMSYGASVGYEF 79
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 100 bits (250), Expect = 2e-26 Identities = 75/348 (21%), Positives = 124/348 (35%), Gaps = 67/348 (19%) Query: 2 IIVTGGAGFIGSNIVKALNDKGITDILVVDNLKD--------------GTKFVNLVDLNI 47 +VTG AGFIG ++ K L + G ++ +DNL D +++ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 48 ADYMDKEDFLIQIMSGEELGDIEAIFHEGACSSTTEWDGKYMMDNNYQYSK-------EL 100 AD + + + G E +F + +Y ++N + Y+ + Sbjct: 62 ADR----EGMTDLF---ASGHFERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNI 109 Query: 101 LHYCLERGIP-FLYASSAATYGGRTSD-FIESREYEKPLNVYGYSKFLFDEYVRQILPEA 158 L C I LYASS++ YG F + P+++Y +K + Sbjct: 110 LEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169 Query: 159 NSQIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGESPKLFEGSENFKRDFVYVGDVA 218 G R+F VYGP + MA F + G+S ++ KRDF Y+ D+A Sbjct: 170 GLPATGLRFFTVYGPWG--RPDMA--LFKFTKAMLEGKSIDVY-NYGKMKRDFTYIDDIA 224 Query: 219 AVNL------------WFLESGKSG-------IFNLGTGRAESFQAVADATLAY-HKKGS 258 + W +E+G ++N+G A + Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284 Query: 259 IEYIPFPDKLKGRYQAFTQADLTNLRNA-GYDKPFKTVAEGVTEYMAW 305 +P G T AD L G+ P TV +GV ++ W Sbjct: 285 KNMLPLQ---PGDVL-ETSADTKALYEVIGF-TPETTVKDGVKNFVNW 327
>SECA#SecA protein signature. Length = 901 Score = 41.0 bits (96), Expect = 2e-05 Identities = 27/79 (34%), Positives = 38/79 (48%), Gaps = 7/79 (8%) Query: 291 MRLVQGDV-----GSGKTLVAALAA-LRAIAHGKQVALMAPTELLAEQHANNFRNWFEPL 344 M L + + G GKTL A L A L A+ GK V ++ + LA++ A N R FE L Sbjct: 92 MVLNERCIAEMRTGEGKTLTATLPAYLNALT-GKGVHVVTVNDYLAQRDAENNRPLFEFL 150 Query: 345 GVEVGWLAGKQKGKARQAQ 363 G+ VG A++ Sbjct: 151 GLTVGINLPGMPAPAKREA 169
>PERTACTIN#Pertactin signature. Length = 922 Score = 119 bits (300), Expect = 1e-29 Identities = 162/715 (22%), Positives = 278/715 (38%), Gaps = 94/715 (13%) Query: 235 TGDSSEGLRTGQSGSLIRLGDDATIETSGASSTGIYAASSSRTELGNNATITVNGASAHA 294 TG + G+ G+++ L ATI A + G + + Sbjct: 236 TGGRAAGV-AAMDGAIVHL-QRATIRRGDAPAGGAVPGGAVPGGAVPGG-FGPLLDGWYG 292 Query: 295 VYATNATVNLGENATISVNSASKAASYSKAPAGLYALSRGAINLAGGAAITMAGDNSSES 354 V +++TV+L A V + A+ +S G+++ G I G Sbjct: 293 VDVSDSTVDL---AQSIVEAPQLGAAIRAGRGARVTVSGGSLSAPHGNVIETGGGARRFP 349 Query: 355 YAISTETGGIVDGS--SGGRFVIDGDIRAAGATAASGTLPQ--------------QNSTI 398 S + + G+ G + T A G Q + + Sbjct: 350 PPASPLSITLQAGARAQGRALLYRVLPEPVKLTLAGGAQGQGDIVATELPPIPGASSGPL 409 Query: 399 KLNMTDNSRWDGASYITSATAGTGVISVQMSDATWNMTSSSTLTDLTLNSGATINFSH-- 456 + + +RW GA+ V S+ + +ATW MT +S + L L S +++F Sbjct: 410 DVALASQARWTGATRA--------VDSLSIDNATWVMTDNSNVGALRLASDGSVDFQQPA 461 Query: 457 EDGEPWQTLTINEDYVGNGGKLVFNTVLNDDDSETDRLQVLGNTSGNTFVAVNNIGGAGA 516 E G ++ L ++ G+G +F + D +D+L V+ + SG + V N G A Sbjct: 462 EAGR-FKVLMVDT-LAGSG---LFRMNVFADLGLSDKLVVMRDASGQHRLWVRNSGSEPA 516 Query: 517 QTIEGIEIVNVAGNSNGTFEKASR---IVAGAYDYNVVQKGKNWYLTSYIEPDEPIIPDP 573 + + +V S TF A++ + G Y Y + G + S + P P P Sbjct: 517 -SGNTMLLVQTPRGSAATFTLANKDGKVDIGTYRYRLAANGNGQW--SLVGAKAPPAPKP 573 Query: 574 VDPVIPDPVDPDPVDPVIPDPVIPDPVDPDPVDPEPVDPVIPDPVIPDIGQSDTPPITEH 633 P P P P P P P P P +P P P ++ + + Sbjct: 574 APQPGPQPGPQPPQPPQPPQP----PQPPQPPQRQPEAPAPQPPAGRELSAAANAAVNTG 629 Query: 634 QFRPEVGSYLANNYAANTLFMTRLHDRLGETQYTDMLTGEKKVTSLWMRNVGAHTRFNDG 693 + A + A L RLGE + G W R + ++ Sbjct: 630 GVGLASTLWYAESNA--------LSKRLGELRLNPDAGG------AWGRGFAQRQQLDNR 675 Query: 694 SGQLKTRINSYV--LQLGGDLAQWSTDGLDRWHIGAMAGYANSQNRTQSSVSDYHSRGQV 751 +G+ R + V +LG D A + G RWH+G +AGY + D G Sbjct: 676 AGR---RFDQKVAGFELGADHA-VAVAG-GRWHLGGLAGYTRGD---RGFTGD--GGGHT 725 Query: 752 TGYSVGLYGTWYANNIDRSGAYVDTWMLYNWFDN--KDMGQDQAA--EKYKSKGITASVE 807 VG Y T+ AN+ G Y+D + + +N K G D A KY++ G+ S+E Sbjct: 726 DSVHVGGYATYIANS----GFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGVSLE 781 Query: 808 AGYSFRLGESAHQSYWLQPKAQVVWMGVQADDNREANGTLVKDDTAGNLLTRMGVKAYIN 867 AG F ++L+P+A++ V R ANG V+D+ ++L R+G++ Sbjct: 782 AGRRFAH----ADGWFLEPQAELAVFRVGGGAYRAANGLRVRDEGGSSVLGRLGLEV--- 834 Query: 868 GHNAIDNDKSREFQPFVEANWIHNTQPA-SVKMNDVS--SDMRGTKNIGELKVGI 919 I+ R+ QP+++A+ + A +V+ N ++ +++RGT+ EL +G+ Sbjct: 835 -GKRIELAGGRQVQPYIKASVLQEFDGAGTVRTNGIAHRTELRGTR--AELGLGM 886
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 26.7 bits (58), Expect = 0.043 Identities = 15/42 (35%), Positives = 21/42 (50%) Query: 70 AEAQVIIEQANKRRAQILDEAKTEAEQERTKIVAQAQAEIEA 111 A+A + ANK R Q EAK +AE++ + A A A Sbjct: 210 AKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRAANTYA 251
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 31.6 bits (71), Expect = 0.008 Identities = 14/58 (24%), Positives = 23/58 (39%) Query: 289 ETSTPDLELARRFADAIHAKYPGKLLAYNCSPSFNWQKNLDDKTIASFQQQLSDMGYK 346 ET+ PD+ L A P L Y + N D +T + + QL+++ Sbjct: 544 ETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQLAELNAT 601
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 33.7 bits (77), Expect = 0.005 Identities = 16/71 (22%), Positives = 31/71 (43%), Gaps = 13/71 (18%) Query: 343 SGLEPLNIGDDSLFVNVGERTN---VTGSA----KFKRLIKEEKYSEALDVARQQVEGGA 395 +P+ D ++ + +TN VT + +R+I + LD+ R QV A Sbjct: 298 QAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQ------LDIRRPQVLVEA 351 Query: 396 QIIDINMDEGM 406 I ++ +G+ Sbjct: 352 IIAEVQDADGL 362
>TRNSINTIMINR#Translocated intimin receptor (Tir) signature. Length = 549 Score = 28.9 bits (64), Expect = 0.012 Identities = 14/54 (25%), Positives = 31/54 (57%), Gaps = 2/54 (3%) Query: 13 AGLVTSKKMAKVQRTAKKSRVQAREAREAVEENKKAQLERDKQLSEQQKQAVLA 66 +G + + ++ + AK++ AR+ +AVE N +AQ + Q + +Q++ L+ Sbjct: 310 SGELKDDIVEQIAQQAKEAGEVARQ--QAVESNAQAQQRYEDQHARRQEELQLS 361
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 266 bits (682), Expect = 8e-87 Identities = 87/425 (20%), Positives = 175/425 (41%), Gaps = 25/425 (5%) Query: 9 LMMIIISLTILIIILTYFIEINSVVHGQGVITTKDNAQLISLSKGGTIQDIYVAEGDTVK 68 + I+ ++ IL+ ++ V G +T ++ I + +++I V EG++V+ Sbjct: 60 VAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVR 119 Query: 69 KGELLAKVVNLDLQKEYQRYRTQKGYLDKDVNEI-------SFILDKENESGLITLDGTR 121 KG++L K+ L E +TQ L + + S L+K E L + Sbjct: 120 KGDVLLKLT--ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQ 177 Query: 122 SLSNKEVKANIELVHSQIRA-------KELKKTSLDSEISGLQEKLSSKEKELALLAEEI 174 ++S +EV L+ Q KEL +E + +++ E + + Sbjct: 178 NVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRL 237 Query: 175 NILSPLVKKGISPYTNFLNKKQAYIKVKSEINDIESSITLKKDDIELVVNDIEALNNELR 234 + S L+ K L ++ Y++ +E+ +S + + +I + + + + Sbjct: 238 DDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFK 297 Query: 235 LSLSKIISKNLQELEVVNSTLKVIEKQINEEDIYSPVDGVIYKINKSATTHGGVIQAADL 294 + + + + ++ L E++ I +PV + ++ T GGV+ A+ Sbjct: 298 NEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQL--KVHTEGGVVTTAET 355 Query: 295 LFEIKPKVRTMLADVKILPKYRDQIYVDEAVKLDVQSIIQPKIKSYNATIDNISPDSYEE 354 L I P+ T+ + K I V + + V++ + + NI+ D+ E+ Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIED 415 Query: 355 NTGGTIQRYYKVIIAFDVNE----DDLRWLKPGMTVDASVITGKHSIMEYLLSPLMKGVD 410 G + VII+ + N + L GM V A + TG S++ YLLSPL + V Sbjct: 416 QRLGL---VFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLEESVT 472 Query: 411 KAFSE 415 ++ E Sbjct: 473 ESLRE 477
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 49.3 bits (117), Expect = 3e-07 Identities = 49/190 (25%), Positives = 76/190 (40%), Gaps = 30/190 (15%) Query: 96 DSAQVEKKGNGKRRNKKEEEELKKQLDDAENAKK--EADKAK-EEAEKAKEAAEKALNEA 152 A+ + + + L++ LD + AKK EA+ K EE K EA+ ++L Sbjct: 293 LEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD 352 Query: 153 FEVQNSSK-QIEEMLQNFLADNVAKDNLAQQSDASQQNTQA---KATQASKQNDAEKVLP 208 + +K Q+E Q N + S+AS+Q+ + + +A KQ + Sbjct: 353 LDASREAKKQLEAEHQKLEEQN-------KISEASRQSLRRDLDASREAKKQVEKALEEA 405 Query: 209 QPI-------NKNTSTGK--SNSSKNEEN-KLDAESVKEPLKVTLALAAES----NSGSK 254 NK K + K E KL+AE+ + LK LA AE +G Sbjct: 406 NSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEA--KALKEKLAKQAEELAKLRAGKA 463 Query: 255 DDSITNFTKP 264 DS T KP Sbjct: 464 SDSQTPDAKP 473 Score = 48.1 bits (114), Expect = 9e-07 Identities = 35/136 (25%), Positives = 63/136 (46%), Gaps = 4/136 (2%) Query: 98 AQVEKKGNGKRRNKKEEEELKKQLDDAENAKKEADKAKEEAEKAKEAAEKALNEAFEVQN 157 A+ +K + ++ + L++ LD + AKK+ +KA EEA A EK E E + Sbjct: 365 AEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKK 424 Query: 158 SSKQIEEMLQNFL-ADNVA-KDNLAQQSD--ASQQNTQAKATQASKQNDAEKVLPQPINK 213 +++ + LQ L A+ A K+ LA+Q++ A + +A +Q K +P Sbjct: 425 LTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQA 484 Query: 214 NTSTGKSNSSKNEENK 229 + K N +K + Sbjct: 485 PQAGTKPNQNKAPMKE 500 Score = 43.1 bits (101), Expect = 3e-05 Identities = 17/115 (14%), Positives = 42/115 (36%), Gaps = 19/115 (16%) Query: 101 EKKGNGKRRNKKEEEELKKQLDDAENAKKEAD-------KAKEEAEKAKEAAEKALNEAF 153 ++ ++ + + + E + + + A ++L Sbjct: 260 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQ-SQVLNANRQSLRRDL 318 Query: 154 EVQNSSK-QIEEMLQNFLADNVAKDNLAQQSDASQQNTQAK---ATQASKQNDAE 204 + +K Q+E Q + + N + S+AS+Q+ + + +A KQ +AE Sbjct: 319 DASREAKKQLEAEHQ-----KLEEQN--KISEASRQSLRRDLDASREAKKQLEAE 366
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 31.1 bits (70), Expect = 0.007 Identities = 16/65 (24%), Positives = 26/65 (40%), Gaps = 7/65 (10%) Query: 130 PPPPPPPVVAKRVESAPRPTEPARNPFKSSDDRLTGVTSSNTVTRPAARASAGAGDKVVI 189 P P P P K+VE R +P + S + + RP + + A K V Sbjct: 99 PKPKPKPKPVKKVEQPKRDVKPVESRPASPFE-------NTAPARPTSSTATAATSKPVT 151 Query: 190 AIDAG 194 ++ +G Sbjct: 152 SVASG 156
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 30.1 bits (68), Expect = 0.028 Identities = 26/161 (16%), Positives = 57/161 (35%), Gaps = 18/161 (11%) Query: 31 VENSLDAGATRVDIDIER---GGAKLIR-IRDNGCGIKKEELALALARHATSKIASLDDL 86 ++ SLD A + ++ I R A++ ++ N G E + A+ + +L++ Sbjct: 5 IQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLNLEEA 64 Query: 87 EAIISLGFRGEAL----------ASISSVSRLTLTSRTAEQAEAWQAYAEGRDMDVTVK- 135 + G++G L I RLT + Q +A Q +D+ +K Sbjct: 65 ITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDIYLKV 124 Query: 136 -PAAHPVGTTLEVLDLFYNTPARRKFMRTEK--TEFNHIDE 173 + +G + + + + + F + Sbjct: 125 NSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEH 165
>SECA#SecA protein signature. Length = 901 Score = 33.3 bits (76), Expect = 0.002 Identities = 26/144 (18%), Positives = 55/144 (38%), Gaps = 6/144 (4%) Query: 282 HVVDAADVRVQENIEAVNTVLEEIDAHEIPTLMVMNKIDMLDDFEPRIDRDEENK-PIRV 340 ++D +DV N + IDA+ P + ++ + + R+ D + PI Sbjct: 665 ELLDVSDVSETINSIREDVFKATIDAYIPPQSL--EEMWDIPGLQERLKNDFDLDLPIAE 722 Query: 341 WLSAQSGVGIPQLFQALTERLSGEVAQHTLRLPPQEGRLRSRFYQLQAIEKEWMEEDGSV 400 WL + + L + + + + + + R + LQ ++ W E ++ Sbjct: 723 WLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAM 782 Query: 401 SLQVRMPIVDWRRLCKQEPALIEY 424 +R I R +++P EY Sbjct: 783 D-YLRQGIH-LRGYAQKDP-KQEY 803
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 29.0 bits (64), Expect = 0.030 Identities = 18/65 (27%), Positives = 30/65 (46%), Gaps = 3/65 (4%) Query: 225 NRMRAEREAVARRHRSQGQEEAEKLRAAADYEVTK---TLAEAERQGRIMRGEGDAEAAK 281 N+ R + A A+R + + +RAA Y + +A A +G I +G A A+ Sbjct: 220 NKAREQAAAEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGRGLIQVAQGAASLAQ 279 Query: 282 LFADA 286 +DA Sbjct: 280 AISDA 284
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.9 bits (80), Expect = 6e-05 Identities = 22/106 (20%), Positives = 38/106 (35%), Gaps = 7/106 (6%) Query: 43 KGYTVADPNLDELYQVYSQPGAAYWVVEQNGCVVGGGGVAPLSCSEPDICELQKMYFLPV 102 K Y D ++ V + AA+ +N C+ G + + ++ + Sbjct: 48 KQYEDDD---MDVSYVEEEGKAAFLYYLENNCI----GRIKIRSNWNGYALIEDIAVAKD 100 Query: 103 ISGQGLAKKLALMALEHAREQGFKRCYLETTAFLREAIALYERLGF 148 +G+ L A+E A+E F LET A Y + F Sbjct: 101 YRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 137 bits (347), Expect = 8e-42 Identities = 84/253 (33%), Positives = 131/253 (51%), Gaps = 8/253 (3%) Query: 10 KNILITGAAQGIGYLLATGLGRYGARIIVNDITPERAETAVTKLQQEGIKAIAAPFNVTH 69 K ITGAAQGIG +A L GA I D PE+ E V+ L+ E A A P +V Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 70 KQDIEAAVGHIEKDIGAIDVLINNAGIQRRHPFTEFPEQEWNDVIAVNQTAVFLVSQAVT 129 I+ IE+++G ID+L+N AG+ R ++EW +VN T VF S++V+ Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 130 RRMVARQAGKVINICSMQSELGRDTITPYAASKGAVKMLTRGMCVELARHNIQVNGIAPG 189 + M+ R++G ++ + S + + R ++ YA+SK A M T+ + +ELA +NI+ N ++PG Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188 Query: 190 YFKTEMTKALVEDE--------AFTSWLCKRTPAARWGDPQELIGAAVFLSSKASDFVNG 241 +T+M +L DE P + P ++ A +FL S + + Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248 Query: 242 HLLFVDGGMLVAV 254 H L VDGG + V Sbjct: 249 HNLCVDGGATLGV 261
>TETREPRESSOR#Tetracycline repressor protein signature. Length = 218 Score = 29.1 bits (65), Expect = 0.015 Identities = 20/56 (35%), Positives = 28/56 (50%), Gaps = 5/56 (8%) Query: 120 FALVTKKRKLIDALAQQILEAHFPTSIQEDIADEMGFDIRTSLRQRDPKFRQAVLR 175 + V KR L+DALA +IL H S+ G ++ LR FR+A+LR Sbjct: 42 YWHVKNKRALLDALAVEILARHHDYSLPAA-----GESWQSFLRNNAMSFRRALLR 92
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 37.4 bits (87), Expect = 8e-05 Identities = 32/129 (24%), Positives = 49/129 (37%), Gaps = 33/129 (25%) Query: 26 CDVLLANGKIIAVGADIPSDIVPDCT--------VINLSGRMLCPGFIDQHVHLIGG--- 74 D+ L +G+I A+G D+ P T VI G+++ G +D H+H I Sbjct: 86 ADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFICPQQI 145 Query: 75 ------------GGEAGP------TTRTP-EVSLSRLTEA--GITTVVGLLGTDSVSRHP 113 GG GP TT TP ++R+ EA + G + S P Sbjct: 146 EEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIEAADAFPMNLAFAGKGNAS-LP 204 Query: 114 ASLLAKTRA 122 +L+ Sbjct: 205 GALVEMVLG 213
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 39.8 bits (93), Expect = 1e-05 Identities = 83/396 (20%), Positives = 141/396 (35%), Gaps = 31/396 (7%) Query: 9 PRHPIFTALFGMMVLTLGMGVGRFLYTPMLPVMLAEKQLTFNQLSWIASANYAGYLAGSL 68 P P+ L + + +G+G L P+LP +L + + N ++ A Y Sbjct: 3 PNRPLIVILSTVALDAVGIG----LIMPVLPGLLRDLVHS-NDVTAHYGILLALYALMQF 57 Query: 69 LFSFGLFHLPSRL--RPMLLASAVATGILILSMAIFTQPAVVMLVRFLAGVASAGMMIFG 126 + L L R RP+LL S + MA V+ + R +AG+ A + G Sbjct: 58 ACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG 117 Query: 127 SMI-----VLHHTRHPFVIAALFSGVGAGIALGNEYVIGGLHYALSAHSLWLGAGALAGI 181 + I RH ++A F G G+ G V+GGL S H+ + A AL G+ Sbjct: 118 AYIADITDGDERARHFGFMSACF---GFGMVAGP--VLGGLMGGFSPHAPFFAAAALNGL 172 Query: 182 LLLIVAMLIPPRAHALPPAPLARIENQPMPWWQLA-LLYGFAGFGYIIVATYLPLMAKSA 240 L L+P +H PL R P+ ++ A + A + L +A Sbjct: 173 NFLTGCFLLPE-SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAA 231 Query: 241 GSPLLTAHL--WSLVGLAIIPGCFGWLWA----------AKHWGVLPCLTANLLIQSACV 288 + W + I FG L + A G L ++ Sbjct: 232 LWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGY 291 Query: 289 LLSLASDSLLLLILSSIGFGATFMGTTSLVMPLARQLSAPGNINLLGLVTLTYGIGQILG 348 +L + + + + +G +L L+RQ+ L G + + I+G Sbjct: 292 ILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG 351 Query: 349 PLAASLSGNGASAIINATLCGAAALFFAALISAAQQ 384 PL + + N A A + + A ++ Sbjct: 352 PLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 47.6 bits (113), Expect = 5e-08 Identities = 35/141 (24%), Positives = 64/141 (45%), Gaps = 5/141 (3%) Query: 58 SLYLAGGMALQWLLGPLSDRIGRRPVLIAGALIFTLACAATLLTTSMTQFLV-ARFVQGT 116 L + G A+ G LSD++G + +L+ G +I + S L+ ARF+QG Sbjct: 59 MLTFSIGTAV---YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGA 115 Query: 117 SICFIATVGYVTVQEAFGQTKAIKLMAIITSIVLVAPVIGPLSGAALMHFVHWKVLFGII 176 + V V + K +I SIV + +GP G + H++HW L +I Sbjct: 116 GAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL-LI 174 Query: 177 AVMGLLALCGLLLAMPETVQR 197 ++ ++ + L+ + + V+ Sbjct: 175 PMITIITVPFLMKLLKKEVRI 195
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 31.1 bits (70), Expect = 0.020 Identities = 13/61 (21%), Positives = 28/61 (45%), Gaps = 5/61 (8%) Query: 785 ADEIWGYLPGEREKYVFTGEWYDGLFGLEENEEFNDAFWDDVRYIK---DQVNKELENQK 841 AD+ W + ++EK++ E + EE + N+ + ++ K + K + + K Sbjct: 344 ADKKWSHFGTQKEKWIGVAE--NHFSNTEEQAKINNKIKEAIKMFKELPEDFVKYINSDK 401 Query: 842 A 842 A Sbjct: 402 A 402
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 28.8 bits (64), Expect = 0.007 Identities = 12/32 (37%), Positives = 19/32 (59%) Query: 8 NSAILVHFTLKLDDGSTAESTRNNGKPALFRL 39 + + V +T L DG+ +ST GKPA F++ Sbjct: 144 SDTVTVEYTGTLIDGTVFDSTEKAGKPATFQV 175
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 698 bits (1802), Expect = 0.0 Identities = 241/885 (27%), Positives = 385/885 (43%), Gaps = 67/885 (7%) Query: 3 HYKKFRLSTLAAVVGIVLAVGPENSYAEAPIQFNTRFLDVKDDASLDLSRFSRKGYIMPG 62 H +K RL+ + + A + + A + FN RFL A DLSRF + PG Sbjct: 17 HIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPG 76 Query: 63 SYHLQVLVNQSQIAQDNIITYSVDNNDPDNTYPCLSPELVSLLGLKPEIADKMIWINAGQ 122 +Y + + +N +A ++ + D+ PCL+ ++ +GL M + Sbjct: 77 TYRVDIYLNNGYMATRDVTFNTGDSEQ--GIVPCLTRAQLASMGLNTASVSGMNLLADDA 134 Query: 123 CLQPDQL-EGMETQTDLSQSTLTVIIPQAYLEYSDEEWDPPSRWDEGIPGVLFDYNVNSQ 181 C+ + Q D+ Q L + IPQA++ + PP WD GI L +YN + Sbjct: 135 CVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGN 194 Query: 182 WRHAEHDDGDEYDISGNGTVGANLGAWRLRADWQANYRHENDSEDKDNFGSSSEQNWDWN 241 Y N G N+GAWRLR + +Y + S S S+ W Sbjct: 195 SVQNRIGGNSHY-AYLNLQSGLNIGAWRLRDNTTWSYNSSDSS-------SGSKNKWQHI 246 Query: 242 RYYAWRAIPQLRAQLTLGEGSLESDIFDGFNYVGGSLITDDQMLPPNLRGYAPDISGVAR 301 + R I LR++LTLG+G + DIFDG N+ G L +DD MLP + RG+AP I G+AR Sbjct: 247 NTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIAR 306 Query: 302 TNAKVTVTQRGRVIYESQVPAGPFRIQDINET-VSGDLHVKIEEQSGQVQEYDVSTASIP 360 A+VT+ Q G IY S VP GPF I DI SGDL V I+E G Q + V +S+P Sbjct: 307 GTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVP 366 Query: 361 FLTRPGQVRYKLAAGRPQDWDHNMEGGFFTSAEASWGIANGWSLYGGAIGEQDYQALALG 420 L R G RY + AG + + E F + G+ GW++YGG Y+A G Sbjct: 367 LLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFG 426 Query: 421 LGRDLALLGAFSVDVTHSRATLPEGSAYGDGTIQGNSFRASYAKDFDDIDSRLTFAGYRF 480 +G+++ LGA SVD+T + +TLP D G S R Y K ++ + + GYR+ Sbjct: 427 IGKNMGALGALSVDMTQANSTLP-----DDSQHDGQSVRFLYNKSLNESGTNIQLVGYRY 481 Query: 481 SEENYMTMDEFIDTHNDDNDR-----------------QRTGHDKEMYTLTYSQNFSAIN 523 S Y + + + + + + LT +Q Sbjct: 482 STSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-T 540 Query: 524 VNAYINYTHRTYWNQPNQD-SYNLTLSHYFDVGEVRGISLSVNGFRNEYDNERDDGVYVS 582 Y++ +H+TYW N D + L+ F+ +LS + +N + RD + ++ Sbjct: 541 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINW---TLSYSLTKNAWQKGRDQMLALN 597 Query: 583 LSIPWGN-----------NRTLSYNGSFSDDNN-SNQVGYYERI--DDRNNYQINAGRAD 628 ++IP+ + + + SY+ S + +N G Y + D+ +Y + G A Sbjct: 598 VNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAG 657 Query: 629 -----NGATLDGYYRHQASYADIDVSANYQEGDYTSGGLNIQGGATLTAKGGALHRTSVN 683 +G+T ++ Y + ++ ++ D + GG A G L + Sbjct: 658 GGDGNSGSTGYATLNYRGGYGNANIGYSHS-DDIKQLYYGVSGGVLAHANGVTLGQPL-- 714 Query: 684 GGSRLMVDVGDEANVPISGYSTPVYTNAFGKAVIVDVNDYYRNLVKIDITQLPEDAEATL 743 + ++V + + + V T+ G AV+ +Y N V +D L ++ + Sbjct: 715 NDTVVLVKAPGAKDAKVENQTG-VRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDN 773 Query: 744 SIAQATLTEGAIGYRRMEVLSGKKAMASIRLRDGGTPPFGAEVYNSRQQQLGIVGEDGSV 803 ++A T GAI + G K + ++ + PFGA V + Q GIV ++G V Sbjct: 774 AVANVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQV 832 Query: 804 YLIGINPGERLQVTW--EGKTQCEA--ALPDPLPGDLFSGLLLPC 844 YL G+ ++QV W E C A LP L + L C Sbjct: 833 YLSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 31.2 bits (70), Expect = 0.001 Identities = 39/140 (27%), Positives = 58/140 (41%), Gaps = 17/140 (12%) Query: 43 PPCTVTGGEVEFGNV-LTTKVDGVNYRQAVGYRLSCNGRVSDYLKLQIQGNAVTINGESV 101 P CTV EV +G++ + V ++ ++C + +K+ I N T N V Sbjct: 37 PACTVQNAEVNWGDIEIQNLVQSGGNQKDFTVDMNCPYSLGT-MKVTITSNGQTGNSILV 95 Query: 102 LQTDV---DGLGIRLQTATDGALVSPGNTQWLSFQYS----GGSGPA-----IEAIPVKD 149 T DGL I L + + + GN L Q + G+ PA + K Sbjct: 96 PNTSTASGDGLLIYLYNSNNSGI---GNAVTLGSQVTPGKITGTAPARKITLYAKLGYKG 152 Query: 150 NGVTLTGGAFNAGATLVVDY 169 N +L G F+A ATLV Y Sbjct: 153 NMQSLQAGTFSATATLVASY 172
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 41.2 bits (96), Expect = 3e-07 Identities = 42/171 (24%), Positives = 76/171 (44%), Gaps = 19/171 (11%) Query: 1 MKKMALMV-LISSSFAAQSAENLKFHGTLISPPNCTISHNQTIEVKFGNMLISKIDGTRY 59 M +++L + L+ +S A + + G + PP CTI++ Q I V FGN+ +D +R Sbjct: 1 MIRLSLFISLLLTSVAVLADVQINIRGNVYIPP-CTINNGQNIVVDFGNINPEHVDNSRG 59 Query: 60 AQNVPYEITCDSAVRDDTMTMTLTLSGSVTDFNQ-AAINTSVAGLGIELRQN---DQPFT 115 I+C + ++ + ++G+ Q + T++ GI L Q P T Sbjct: 60 EVTKNISISCPY----KSGSLWIKVTGNTMGVGQNNVLATNITHFGIALYQGKGMSTPLT 115 Query: 116 LGS------TITV---NEQSAPVLKAIPVKKSGASLTEGDFDATATLQVDY 157 LG+ +T +S ++P + L GDF TA++ + Y Sbjct: 116 LGNGSGNGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIY 166
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 33.1 bits (75), Expect = 3e-04 Identities = 46/161 (28%), Positives = 72/161 (44%), Gaps = 18/161 (11%) Query: 23 VSAADNLHFSGSLVASPCTLTMQGADIAEVDFSSLDASDFIPGGQSARKPLVFELTDCDS 82 V AADNL F G L+ CT+ AEV++ ++ + + G +K ++ C Sbjct: 22 VHAADNLTFKGKLIIPACTVQN-----AEVNWGDIEIQNLVQSG-GNQKDFTVDMN-CPY 74 Query: 83 ALSNGVQVIFTGTEATGMRGILAIDSYSGASGIGIGIETLSGVPVGINNES--GAVFT-- 138 +L ++V T TG IL ++ S ASG G+ I + GI N G+ T Sbjct: 75 SLGT-MKVTITSNGQTG-NSILVPNT-STASGDGLLIYLYNSNNSGIGNAVTLGSQVTPG 131 Query: 139 LVTGKN---TLSLNAWV-QRLPGEDLIPGRFSASALATFEY 175 +TG ++L A + + + L G FSA+A Y Sbjct: 132 KITGTAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASY 172
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 340 bits (874), Expect = e-114 Identities = 120/379 (31%), Positives = 190/379 (50%), Gaps = 51/379 (13%) Query: 191 DALDMTRLTRRQRVDYSSG--KGLQTRYELGDIRGQSPQMEQLRQTITLYARSRAAVLIQ 248 D ++ + R + K + + G+S M+++ + + ++ ++I Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT 166 Query: 249 GETGTGKELAAQAIHQTFFHRQPHRQNKPSPPFVAVNCGAITESLLEAELFGYEEGAFTG 308 GE+GTGKEL A+A+H R+N P FVA+N AI L+E+ELFG+E+GAFTG Sbjct: 167 GESGTGKELVARALHD-----YGKRRNGP---FVAINMAAIPRDLIESELFGHEKGAFTG 218 Query: 309 SRRGGRAGLFEIAHGGTLLLDEIGEMPLPLQTRLLRVLEEKAVTRVGGHQPIPVDVRVIS 368 ++ G FE A GGTL LDEIG+MP+ QTRLLRVL++ T VGG PI DVR+++ Sbjct: 219 AQTR-STGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVA 277 Query: 369 ATHCDLDREIMQGRFRPDLFYRLSILRLTLPPLRERQADILPLAESFLKQSLAAMEIPFT 428 AT+ DL + I QG FR DL+YRL+++ L LPPLR+R DI L F++Q + Sbjct: 278 ATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLD-- 334 Query: 429 ESIRHGLTQCQPLLLAWRWPGNIRELRNMMERLALFLS---------------------- 466 ++ + L+ A WPGN+REL N++ RL Sbjct: 335 --VKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPI 392 Query: 467 ----VDPAPTLDRQFMRQLLPELMVNTAELTPST---------VDANALQDVLARFKGDK 513 Q + + + + + + P + ++ + L +G++ Sbjct: 393 EKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQ 452 Query: 514 SAAARYLGISRTTLWRRLK 532 AA LG++R TL ++++ Sbjct: 453 IKAADLLGLNRNTLRKKIR 471
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 32.3 bits (73), Expect = 0.003 Identities = 19/69 (27%), Positives = 29/69 (42%) Query: 254 DVLREIRERTELPLGAYQVSGEYAMIKFAAMAGAIDEEKVVLESLGSIKRAGADLIFSYF 313 + E+ + +L L QV G A F +D E L I+ A +IF+ Sbjct: 466 NQFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQETTARIIFNGK 525 Query: 314 ALDLAEKNI 322 L+L E+ I Sbjct: 526 DLNLVERRI 534
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 122 bits (308), Expect = 1e-30 Identities = 109/492 (22%), Positives = 184/492 (37%), Gaps = 78/492 (15%) Query: 538 TLNADLVNDRTWDTTQANYGYGVVAMNSDGHL-----------------TINGNGDINNG 580 +++ +++ TW N G + + SDG + T+ G+G Sbjct: 429 AVDSLSIDNATW-VMTDNSNVGALRLASDGSVDFQQPAEAGRFKVLTVNTLAGSGLFRMN 487 Query: 581 DEADASSTTDNVVA---ATGNYKVRIDNATGAGSVADYKGNELIYVNDINTDATFSAAN- 636 AD +D +V A+G +++ + N+ GS L+ + + ATF+ AN Sbjct: 488 VFAD-LGLSDKLVVMQDASGQHRLWVRNS---GSEPASANTLLLVQTPLGSAATFTLANK 543 Query: 637 --KADLGAYTYQAKQEGNTV------------------------------------VLEQ 658 K D+G Y Y+ GN Sbjct: 544 DGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAG 603 Query: 659 MELTDYANMALSIP--SANTNIWNLEQDTVGTRLTNARHGLADNGGAWVSYFGGNFNGDN 716 EL+ AN A++ + +W E + + RL R D GGAW F DN Sbjct: 604 RELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRL-NPDAGGAWGRGFAQRQQLDN 662 Query: 717 GTIN-YDQDVNGIMVGVDTKVDGNNAKWIVGAAAGFAKGDLS---DRTGQVDQDSQSAYI 772 +DQ V G +G D V +W +G AG+ +GD D G D S ++ Sbjct: 663 RAGRRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTD----SVHV 718 Query: 773 YSSARFANN--IFVDGNLSYSHFNNDLSANMSDGTYVDGNTSSDAWGFGLKLGYDLKLGD 830 A + + ++D L S ND SDG V G + G L+ G D Sbjct: 719 GGYATYIADSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRFTHAD 778 Query: 831 AGYVTPYGSVSGLFQSGDDYQLSNDMKVDGQSYDSMRYELGVDAGYTFTYSEDQALTPYF 890 ++ P ++ G Y+ +N ++V + S+ LG++ G + + + PY Sbjct: 779 GWFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGRQVQPYI 838 Query: 891 KLAYVYD-DSNNDADVNGDSIDNGVEGSAVRVGLGTQFSFTKNFSAYTDANYLGGGDVDQ 949 K + + + D NG + + G+ +GLG + + S Y Y G + Sbjct: 839 KASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSKGPKLAM 898 Query: 950 DWSANVGVKYTW 961 W+ + G +Y+W Sbjct: 899 PWTFHAGYRYSW 910
>PF06291#Lambda prophage Bor protein Length = 102 Score = 30.0 bits (67), Expect = 0.002 Identities = 21/68 (30%), Positives = 30/68 (44%), Gaps = 11/68 (16%) Query: 28 VNDKEIICSPDESNTHTFVILEGVVSLVRGDKVLIGIVQAPFIFGLADGVAKKEAQYKLI 87 V +K +P E+ TH F VS + K V A I G A+ V K E Q + Sbjct: 29 VGNKPTAVTPKETITHHFF-----VSGIGQKKT----VDAAKICGGAENVVKTETQQTFV 79 Query: 88 AESGCIGY 95 +G +G+ Sbjct: 80 --NGLLGF 85
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 98.0 bits (244), Expect = 7e-26 Identities = 34/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%) Query: 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNKLNEPWPDLILLDWMLPGGSGLQ 63 ILV +D+A IR ++ L + G+ + + + DL++ D ++P + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 FIKHLKREAMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRI 123 + +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I + Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120 Query: 124 SPMAVEEVIEMQGLSLDPGSHRVMTGDSP 152 E L D + G S Sbjct: 121 -----EPKRRPSKLEDDSQDGMPLVGRSA 144
>PF06580#Sensor histidine kinase Length = 349 Score = 36.8 bits (85), Expect = 1e-04 Identities = 17/123 (13%), Positives = 38/123 (30%), Gaps = 28/123 (22%) Query: 300 TFTFEVDDSLSVLGNEEQLRSAISNLVYNAVNH----TPAGTHITVSWRRVAHGAEFCIQ 355 F +++ ++ + + + LV N + H P G I + + ++ Sbjct: 241 QFENQINPAIM---DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVE 297 Query: 356 DNGPGIAAEHIPRLTERFYRVDKARSRQTGGSGLGLAIVKHALNH---HESRLEIDSSPG 412 + G +G GL V+ L E+++++ G Sbjct: 298 NTGSLAL------------------KNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG 339 Query: 413 KGT 415 K Sbjct: 340 KVN 342
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 69.5 bits (170), Expect = 6e-15 Identities = 35/165 (21%), Positives = 79/165 (47%), Gaps = 4/165 (2%) Query: 433 IQIVEERTIGPTLGMQNIKQGLEACLAGLVVSILFMIF-FYKKFGLIATSALVANLVLIV 491 ++I ++GP + + + + + LA VV + ++ F +F L A ALV +++L V Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194 Query: 492 GIMSLLPGATLSMPGIAGIVLTLAVAVDANVLINERIKEEL--SNGRTVQQAINEGYAGA 549 G+ ++L + +A ++ +++ V++ +R++E L ++ +N Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253 Query: 550 FSSIFDANITTLIKVIILYAVGTGAIKGFAITTGIGVATSMFTAI 594 S +TTL+ ++ + G I+GF GV T ++++ Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSV 298
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 352 bits (904), Expect = e-124 Identities = 104/309 (33%), Positives = 176/309 (56%), Gaps = 12/309 (3%) Query: 17 YDFMRWDFWAFGISGLLLIAAIVIMGVRGFNWGLDFTGGTVIEITLEKPAEMDVMREALQ 76 +DF RW + FG + +++IA++++ V G N+G+DF GGT I ++ V R AL+ Sbjct: 14 FDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAALE 73 Query: 77 KAGYEEPQLQNFGS------SHDIMVRMPPTEGETGGQVLGSKVVTIINE------ATNQ 124 + + H M+R+ E G + G++ ++N+ A + Sbjct: 74 PLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAVDP 133 Query: 125 NAAVKRIEFVGPSVGADLAQTGAMALLVALISILVYVGFRFEWRLAAGVVIALAHDVIIT 184 + E VGP V +L T +LL A + I+ Y+ RFEW+ A G V+AL HDV++T Sbjct: 134 ALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLT 193 Query: 185 LGILSLFHIEIDLTIVASLMSVIGYSLNDSIVVSDRIRENFRKIRRGTPYEIFNVSLTQT 244 +G+ ++ ++ DLT VA+L+++ GYS+ND++VV DR+REN K + ++ N+S+ +T Sbjct: 194 VGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253 Query: 245 LHRTLITSGTTLVVILMLYLFGGPVLEGFSLTMLIGVSIGTASSIYVASALALKLGMKRE 304 L RT++T TTL+ ++ + ++GG V+ GF M+ GV GT SS+YVA + L +G+ R Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRN 313 Query: 305 HMLQQKVEK 313 + +K Sbjct: 314 KEKKDPSDK 322
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 32.9 bits (75), Expect = 4e-04 Identities = 14/56 (25%), Positives = 24/56 (42%), Gaps = 5/56 (8%) Query: 3 RRADRLFQIVQILRGRRLTT-----AALLAQRLAVSERTIYRDIRDLSLSGVPVEG 53 + R +I +I+ + T L V++ T+ RDI++L L VP Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNN 57
>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature. Length = 294 Score = 499 bits (1286), Expect = 0.0 Identities = 240/295 (81%), Positives = 254/295 (86%), Gaps = 9/295 (3%) Query: 36 MKKTLLAVSAALALTSSFTANAAENDQPQYLSDWWHQSVNVVGSYHTRFSPKLNNDVYLE 95 MKKTLLA A +AL+++F A AAEND+PQYLSDWWHQSVNVVGSYHTRF P++ ND YLE Sbjct: 1 MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE 60 Query: 96 YEAFAKKDWFDFYGYIDIPKTFDWGNGNDKGIWSDGSPLFMEIEPRFSIDKLTGADLSFG 155 YEAFAKKDWFDFYGYID P F GN KGIW+ GSPLFMEIEPRFSIDKLT DLSFG Sbjct: 61 YEAFAKKDWFDFYGYIDAPVFFG-GNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFG 119 Query: 156 PFKEWYFANNYIYDMGDNKASRQSTWYMGLGTDIDTGLPMGLSLNVYAKYQWQNYGASNE 215 PFKEWYFANNYIYDMG N + QSTWYMGLGTDIDTGLPM LSLNVYAKYQWQNYGASNE Sbjct: 120 PFKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNE 179 Query: 216 NEWDGYRFKVKYFVPITDLWGGKLSYIGFTNFDWGSDLGDDP--------NRTSNSIASS 267 NEWDGYRFKVKYFVP+TDLWGG LSYIGFTNFDWGSDLGDD RTSNSIASS Sbjct: 180 NEWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSIASS 239 Query: 268 HILALNYDHWHYSVVARYFHNGGQWQNGAKLNWGDGDFSAKSTGWGGYLVVGYNF 322 HILALNY HWHYS+VARYFHNGGQW + AKLN+GDG FS +STGWGGY VVGYNF Sbjct: 240 HILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF 294
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1367 bits (3539), Expect = 0.0 Identities = 809/1033 (78%), Positives = 918/1033 (88%), Gaps = 1/1033 (0%) Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISATYPGADAKTVQDT 60 M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA YPGADA+TVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120 VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDPISRTSGVGDVQLFGS 180 EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWMNPTELTKYQLTPVDVINAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240 QYAMRIW++ L KY+LTPVDVIN +K QN Q+AAGQLGGTP + GQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 TSTDEFGKILLKVNQDGSQVRLRDVAKIELGGENYDVIAKFNGQPASGLGIKLATGANAL 300 + +EFGK+ L+VN DGS VRL+DVA++ELGGENY+VIA+ NG+PA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTATAIRAELKKMEPFFPPGMKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360 DTA AI+A+L +++PFFP GMK++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 TEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 E+ LPPKEAT KSM QIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATMLKPVAKGDHGEGKKGFFGWFNRLFDKSTHHYTDSVGNILRSTGR 540 SVLVALILTPALCAT+LKPV+ H E K GFFGWFN FD S +HYT+SVG IL STGR Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 541 YLLLYLIIVVGMAYLFVRLPSSFLPDEDQGVFLTMVQLPAGATQERTQKVLDEVTDYHLN 600 YLL+Y +IV GM LF+RLPSSFLP+EDQGVFLTM+QLPAGATQERTQKVLD+VTDY+L Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 601 KEKANVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEKNKVEAITQRATAAFSQIKD 660 EKANVESVF VNGF F+G+ QN G+AFVSLK W +R G++N EA+ RA +I+D Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLFGEVAKYPDLLVGVRPNG 720 V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQL G A++P LV VRPNG Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719 Query: 721 LEDTPQFKIDIDQEKAQALGVSISDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780 LEDT QFK+++DQEKAQALGVS+SDIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779 Query: 781 MLPDDINDWYVRGSDGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840 MLP+D++ YVR ++G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839 Query: 841 MAMMEELASKLPSGIGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFS 900 MA+ME LASKLP+GIGYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP S Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960 VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+ Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 961 VEATLEAVRMRLRPILMTSLAFMLGVMPLVISSGAGSGAQNAVGTGVLGGMVTATVLAIF 1020 VEATL AVRMRLRPILMTSLAF+LGV+PL IS+GAGSGAQNAVG GV+GGMV+AT+LAIF Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1021 FVPVFFVVVRRRF 1033 FVPVFFVV+RR F Sbjct: 1020 FVPVFFVVIRRCF 1032
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 42.9 bits (101), Expect = 1e-06 Identities = 32/211 (15%), Positives = 75/211 (35%), Gaps = 17/211 (8%) Query: 64 TYQATYDSAKGDLAKAQAAANIAELTVKRYQKLLGTQYISKQEYDQALADAQQATAAVVA 123 + Y A +L + + ++ + Q +++ ++ L +Q T + Sbjct: 256 EQENKYVEAVNELR--VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313 Query: 124 AKAAVETARINLAYTKVTSPISGRIGKSSV-TEGALVQNGQASALATVQQLDPIYVDVTQ 182 + + + +P+S ++ + V TEG +V + + V + D + V Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTALV 372 Query: 183 SSNDFLRLKQEL-------ANGSLKQENGKAKVDLVTSDGIKFPQSGTLEFSDVTVDQTT 235 + D + A + KV + D I+ + G + +++++ Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENC 432 Query: 236 GSITLRAIFPNPDHTLLPGMFVRARLQEGTK 266 S + I L GM V A ++ G + Sbjct: 433 LSTGNKNIP------LSSGMAVTAEIKTGMR 457 Score = 31.0 bits (70), Expect = 0.007 Identities = 24/133 (18%), Positives = 45/133 (33%), Gaps = 10/133 (7%) Query: 13 PLQITTELPGR-TIAYRIAEVRPQVSGIILKRNFV-EGSDIEAGVSLYQIDP-------A 63 ++I G+ T + R E++P + I+ K V EG + G L ++ Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEADTL 137 Query: 64 TYQATYDSAKGDLAKAQAAANIAELTVKRYQKLLGTQYISKQEYDQALADAQQATAAVVA 123 Q++ A+ + + Q + EL KL Y ++ L Sbjct: 138 KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFST 197 Query: 124 AKAAVETARINLA 136 + +NL Sbjct: 198 WQNQKYQKELNLD 210
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 36.2 bits (83), Expect = 7e-04 Identities = 41/219 (18%), Positives = 81/219 (36%), Gaps = 18/219 (8%) Query: 92 RQKVAQAPEKMRQ-ATAALNALSDVDNDDEMRKTLSALSLRQLELRVA--QVLDDLQNSQ 148 R ++A+A EK R+ A AA A + + + + A + RQL+L A + L L Sbjct: 129 RLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEA 188 Query: 149 NDLAAYNSQLVSLQTQPERVQNAMYTASQQI-------QQIRNRLDGNNVGEAALRPSQQ 201 + +L + Q++ ++ + T + ++ L G A + Sbjct: 189 KAVEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHARDAEMKTLAGKRNELAQASAKYK 248 Query: 202 VLLQAQQALLNAQID--------QQRKSLEGNTVLQDTLQKQRDYVTANSNRLEHQLQLL 253 L + + L D + + G +++ QKQ NR+ + + Sbjct: 249 ELDELVKKLSPRANDPLQNRPFFEATRRRVGAGKIREEKQKQVTASETRINRINADITQI 308 Query: 254 QEAVNSKRLTLTEKTAQEAISPDETARIQANPLVKQELD 292 Q+A++ A+ + + + Q N L Q D Sbjct: 309 QKAISQVSNNRNAGIARVHEAEENLKKAQNNLLNSQIKD 347
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 30.9 bits (69), Expect = 0.005 Identities = 28/63 (44%), Positives = 37/63 (58%), Gaps = 6/63 (9%) Query: 222 AEPGALIRQLAQGAPQYKEQLMT--IAEWLEE---KGRTEGLQKGLEQGLAQGREAEARA 276 AEP +L +QLAQ Q EQ IAE ++ +G EGL +GLEQGLA+ + +A Sbjct: 36 AEP-SLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPI 94 Query: 277 IAR 279 AR Sbjct: 95 HAR 97 Score = 29.0 bits (64), Expect = 0.023 Identities = 19/67 (28%), Positives = 31/67 (46%), Gaps = 8/67 (11%) Query: 233 QGAPQYKEQLMTIAEWLEEKGRTEGLQKGLEQGLAQGREAEARAIARKMLANGLEPGLIA 292 + P ++QL + E+G G+ +G +QG QG + + LA GLE GL Sbjct: 35 EAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQ--------EGLAQGLEQGLAE 86 Query: 293 SVTGITP 299 + + P Sbjct: 87 AKSQQAP 93
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 45.1 bits (106), Expect = 8e-07 Identities = 57/278 (20%), Positives = 92/278 (33%), Gaps = 48/278 (17%) Query: 366 PEPETPRQSFAPVAPTAVMTPP--QVQQPSAP-----------APQTSPAPLPASTSQVL 412 PE E Q V T + TP Q PS P AP PAP S + Sbjct: 983 PEVEKRNQ---TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039 Query: 413 AARNQLQRAQGVTKTKK--SEPAAASRARPVNHSALERLASVSERVQARPAPSALETAPV 470 A N Q ++ V K ++ +E A + R + + + E A Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQN-----------REVAKEAKSNVKANTQTNEVAQS 1088 Query: 471 KKEAYRWKATTPVVQTKEVVATPKALKKALEHEKTPELAAKLAAEAIERDPWAAQVSQLS 530 E T +TKE K K +E EKT E K+ ++ + + V + Sbjct: 1089 GSET----KETQTTETKETATVEKEEKAKVETEKTQE-VPKVTSQVSPKQEQSETVQPQA 1143 Query: 531 LPKLVEQVALNAWKEQNGNAVCLHLRSTQRHLNSSGAQQKLAQALSDLTGTTV-ELTIVE 589 P +N + Q+ N++ ++ A+ S V E T V Sbjct: 1144 EPARENDPTVNIKEPQSQT-------------NTTADTEQPAKETSSNVEQPVTESTTVN 1190 Query: 590 DDNPAVRTPLEWRQAIYEEKLAQARESIIADNNIQTLR 627 N V P A + + + + + +++R Sbjct: 1191 TGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVR 1228
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 45.3 bits (107), Expect = 1e-07 Identities = 35/139 (25%), Positives = 60/139 (43%), Gaps = 5/139 (3%) Query: 197 AREREQGTLDQLLVSPLTTWQIFVGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256 R Q T + +L + L I +G+ A A IG+ A + + L+L Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148 Query: 257 YFTMVI--YGLSLVGFGLLISSLCATQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314 Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 315 LTWINPIRHFTDITKQIYL 333 P+ H D+ + I L Sbjct: 209 AARFLPLSHSIDLIRPIML 227
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.009 Identities = 22/89 (24%), Positives = 29/89 (32%), Gaps = 21/89 (23%) Query: 294 PRFEDAFIDLLGGAGTSESPLGSILHTVEGTAGETVIEAQELTKKFGDFAATDHVNFVVQ 353 PR E + +LG P + Q + K HV V++ Sbjct: 548 PRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYI----LMGHVARVME 590 Query: 354 RGEIFG----LLGPNGAGKSTTFKMMCGL 378 G F L G G GKST + GL Sbjct: 591 PGCKFDYSVVLEGTGGIGKSTLINTLVGL 619 Score = 29.7 bits (66), Expect = 0.044 Identities = 11/23 (47%), Positives = 13/23 (56%) Query: 34 YVTGLVGPDGAGKTTLMRMLAGL 56 Y L G G GK+TL+ L GL Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 60.6 bits (147), Expect = 2e-12 Identities = 44/262 (16%), Positives = 97/262 (37%), Gaps = 27/262 (10%) Query: 79 YENALMQAKAGVSVAQAQYDLMLAGYRDEEIAQAAAAVRQAQAAYDYAQNFYNRQQGLWK 138 ++N Q + + +A+ +LA E R + + + L + Sbjct: 198 WQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQ 257 Query: 139 SRTISA--NDLENARSSRDQAQATLKSAQDKLSQYRTGNREQDI----AQAKASLEQAKA 192 N+L +S +Q ++ + SA+++ T + +I Q ++ Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL-VTQLFKNEILDKLRQTTDNIGLLTL 316 Query: 193 QLAQAQLDLQDTTLIAPANGTLLTRAV-EPGSMLNAGSTVLTLSLT-RPVWVRAYVDERN 250 +LA+ + Q + + AP + + V G ++ T++ + + V A V ++ Sbjct: 317 ELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKD 376 Query: 251 LSQTQPGRDILLYTDGRPDKPYH---GKIGFVSPTAEFTPKTVETPDLRTDLVYRLRIIV 307 + G++ ++ + P Y GK+ ++ A D R LV+ + I + Sbjct: 377 IGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISI 428 Query: 308 T-------DADDALRQGMPVTV 322 + + L GM VT Sbjct: 429 EENCLSTGNKNIPLSSGMAVTA 450
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 65.8 bits (160), Expect = 2e-15 Identities = 32/224 (14%), Positives = 72/224 (32%), Gaps = 25/224 (11%) Query: 6 TTTKGEQAKSQLIAAALAQFGEYGLHATT-RDIAALAGQNIAAITYYFGSKEDLYLACAQ 64 T + ++ + ++ AL F + G+ +T+ +IA AG AI ++F K DL+ + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64 Query: 65 WIADFLGEKFRPHAEKAERLFSQPAPD-RDAIRELILLACKNMIMLLTQEDTVNLSKFIS 123 +GE E + P R+ + ++ L E + +F+ Sbjct: 65 LSESNIGELEL---EYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV- 120 Query: 124 REQLSPTSAYQLVHEQVIDPLHTHLTRLVAA---YTGCDANDTRMILHTHALLGEVLAFR 180 E A + + + D + L + A +I+ + ++ Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM--RGYISGLM--- 175 Query: 181 LGKETILLRTGWPQFDEEKAELIYQTVTCHIDLILHGLTQRSLD 224 W + + ++ ++L Sbjct: 176 ---------ENWLFAPQSFDLK--KEARDYVAILLEMYLLCPTL 208
>SECA#SecA protein signature. Length = 901 Score = 29.8 bits (67), Expect = 0.023 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%) Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGDIRVLVATDI 304 Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++ Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506 Query: 305 AARGLDI 311 A RG DI Sbjct: 507 AGRGTDI 513
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 31.3 bits (71), Expect = 0.010 Identities = 21/106 (19%), Positives = 34/106 (32%), Gaps = 6/106 (5%) Query: 394 LMIGMITFQFSNFSFGIGNAAGLLFAGIML-GFLRANHPTFG-YIPQ--GALNMVKEFGL 449 L++ + +L+ G ++ G A G YI + FG Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF 135 Query: 450 MVFMAGVGLSAGSGINNGLGAVGGQM--LIAGLVVSLVPVVICFLF 493 M G G+ AG + +G A + L + CFL Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 385 bits (990), Expect = e-136 Identities = 126/350 (36%), Positives = 204/350 (58%), Gaps = 4/350 (1%) Query: 2 SEKTEQPTEKKLRDGRKEGQVVKSIEITSLFQLIALYLYFHFFTEKMILILIESITFTLQ 61 EKTEQPT KK+RD RK+GQV KS E+ S ++AL ++ + + + Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62 Query: 62 LVNKPFSYALTQL-SHALIESLTSALLFLGAGVIVATVGSVFLQVGVVIASKAIGFKSEH 120 PFS AL+ + + L+E L ++A + S +Q G +I+ +AI + Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMA-IASHVVQYGFLISGEAIKPDIKK 121 Query: 121 INPVSNFKQIFSLHSVVELCKSSLKVIMLSLIFAFFFYYYASTFRALPYCGLACGLLVVS 180 INP+ K+IFS+ S+VE KS LKV++LS++ T LP CG+ C ++ Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181 Query: 181 SLIKWLWVGVMAFYIVVGILDYSFQYYKIRKDLKMSKDDVKQEHKDLEGDPQMKTRRREM 240 +++ L V ++V+ I DY+F+YY+ K+LKMSKD++K+E+K++EG P++K++RR+ Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241 Query: 241 QSEIQSGSLAQSVKQSVAVVRNPTHIAVGLGYHPTDMPIPRVLEKGSDAQANYIVNIAER 300 EIQS ++ ++VK+S VV NPTHIA+G+ Y + P+P V K +DAQ + IAE Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301 Query: 301 NCIPVVENVELARSLFFEVERGDKIPETLFEPVAALLRMVMK--IDYAHS 348 +P+++ + LAR+L+++ IP E A +LR + + I+ HS Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIEKQHS 351
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 76.5 bits (188), Expect = 2e-17 Identities = 48/194 (24%), Positives = 84/194 (43%), Gaps = 3/194 (1%) Query: 8 LVWLAGLSMLGFLATDMYLPAFAAIRADLQTPAAAVSASLSLFLAGFAVAQLLWGPLSDR 67 L+WL LS L + + I D P A+ + + F+ F++ ++G LSD+ Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75 Query: 68 YGRKPILLLGLSIFALGSLGMLWVESAAALLTL-RFVQAVGVCAATVIWQALVTDYYPSQ 126 G K +LL G+ I GS+ S +LL + RF+Q G A + +V Y P + Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135 Query: 127 KINRIFATIMPLVGLSPALAPLLGSWILTHFSWQAIFATLFVITLLLMLPALRLKPSGKA 186 + F I +V + + P +G I + W + L + ++ +P L + Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEV 193 Query: 187 RTEGQDKLTFATLL 200 R +G + L+ Sbjct: 194 RIKGHFDIKGIILM 207
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 510 bits (1314), Expect = 0.0 Identities = 247/385 (64%), Positives = 286/385 (74%), Gaps = 25/385 (6%) Query: 1 MKLKLVAVAVTSLLAAGVVNAAEVYNKDGNKLDLYGKVHAQHYFSDDNGSDGDKTYARLG 60 MK K++A+ + +LLAAG +AAE+YNKDGNKLDLYGKV HYFSDD+ DGD+TY R+G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 FKGETQINDQLTGFGQWEYEFKGNRTESQGADKDKTRLAFAGLKFADYGSFDYGRNYGVA 120 FKGETQINDQLTG+GQWEY + N TE +GA+ TRLAFAGLKF DYGSFDYGRNYGV Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANS-WTRLAFAGLKFGDYGSFDYGRNYGVL 119 Query: 121 YDIGAWTDVLPEFGGDTWTQTDVFMTGRTTGVATYRNTDFFGLVEGLNFAAQYQGKNDRD 180 YD+ WTD+LPEFGGD++T D +MTGR GVATYRNTDFFGLV+GLNFA QYQGKN+ Sbjct: 120 YDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQ 179 Query: 181 GAY----------------ESNGDGFGLSATYEY-EGFGVGAAYAKSDRTNNQVKAASNL 223 A NGDGFG+S TY+ GF GAAY SDRTN QV A Sbjct: 180 SADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAG-GT 238 Query: 224 NAAGKNAEVWAAGLKYDANNIYLATTYSETLNMTTFGE-DAAGDAFIANKTQNFEAVAQY 282 A G A+ W AGLKYDANNIYLAT YSET NMT +G+ D D +ANKTQNFE AQY Sbjct: 239 IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQY 298 Query: 283 QFDFGLRPSIAYLKSKGKNLGT----YGDQDLVEYIDVGATYYFNKNMSTFVDYKINLLD 338 QFDFGLRP++++L SKGK+L D+DLV+Y DVGATYYFNKN ST+VDYKINLLD Sbjct: 299 QFDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLD 358 Query: 339 DSD-FTKAAKVSTDNIVAVGLNYQF 362 D D F K A +STD+IVA+G+ YQF Sbjct: 359 DDDPFYKDAGISTDDIVALGMVYQF 383
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 396 bits (1019), Expect = e-136 Identities = 87/395 (22%), Positives = 171/395 (43%), Gaps = 15/395 (3%) Query: 20 IDATVLHVAAPTLSMTLGASGNELLWIIDIYSLVMAGMVLPMGALGDRIGFKRLLMLGGT 79 ++ VL+V+ P ++ W+ + L + G L D++G KRLL+ G Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87 Query: 80 LFGLASLAAAFSYT-ASWLIATRVLLAIGAAMIVPATLAGIRATFCEEKHRNMALGVWAA 138 + S+ ++ S LI R + GAA PA + + A + +++R A G+ + Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAF-PALVMVVVARYIPKENRGKAFGLIGS 146 Query: 139 VGSGGAAFGPLIGGILLEHFYWGSVFLINVPIVLVVMGLTARYVPRQAGRRDQPLNLGHA 198 + + G GP IGG++ + +W +L+ +P++ ++ + ++ R ++ Sbjct: 147 IVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGI 204 Query: 199 VMLIIAILLLVYSAKTALKGSLSLWAISLTLLTGTLLLGLFIRTQLATSRPMIDMRLFTH 258 +++ + I+ + + L + +S +F++ + P +D L + Sbjct: 205 ILMSVGIVFFMLFTTSYSISFLIVSVLS---------FLIFVKHIRKVTDPFVDPGLGKN 255 Query: 259 RIILSGVVMAMTAMITLVGFELLMAQELQFVHGLSPYEAG-VFMLPVMVASGFSGPIAGA 317 + GV+ T+ GF ++ ++ VH LS E G V + P ++ G I G Sbjct: 256 IPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGI 315 Query: 318 LVSRLGLRLVATGGMALSALSFYGLAMTDFSTQQWQAWGLMALLGFSAASALLASTSAIM 377 LV R G V G+ ++SF + +T + ++ +LG + + + ST Sbjct: 316 LVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSS 375 Query: 378 AAAPAEKAAAAGAIETMAYELGAGLGIAIFGLLLS 412 + E A + ++ L G GIAI G LLS Sbjct: 376 SLKQQEAGAGMSLLNFTSF-LSEGTGIAIVGGLLS 409
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 50.8 bits (121), Expect = 3e-10 Identities = 26/167 (15%), Positives = 54/167 (32%), Gaps = 9/167 (5%) Query: 5 NRDERREVILQAAMRVALAEGFTAMTVRRIASEADVAAGQVHHHFSSAGEL-KALAFVH- 62 E R+ IL A+R+ +G ++ ++ IA A V G ++ HF +L + + Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67 Query: 63 -----LIRTLLDAGQVPPPATWRARLHAMLGS--EDGGFEPYIKLWREAQILADRDPHIK 115 L P + R L +L S + +++ ++ Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127 Query: 116 DAYLLTMQMWHEETVTIIEQGKQAGEFTFTANATDIAWRLIALVCGL 162 A ++ ++ +A A + + GL Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.1 bits (65), Expect = 0.040 Identities = 50/272 (18%), Positives = 95/272 (34%), Gaps = 36/272 (13%) Query: 126 TPFGVFMLIALLCGFAGANF-ASSMGNISFFFPKARQGSALGINGGLGNLGVSVMQLIA- 183 + F + ++ + G A F A M ++ + PK +G A G+ G + +G V I Sbjct: 101 SFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGG 160 Query: 184 --------PLVIFLPIFTFLGV---RSVPQPDGSLLALTNAAWIWVPLLAVATLAAWFGM 232 ++ +P+ T + V + + + + + I + + + + Sbjct: 161 MIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTS 220 Query: 233 NDIGSSKASVASQLPVLKRL--------------HLWLLSLLYLATFGSFIGFSAGFAML 278 I SV S L +K + ++ + + G G AGF + Sbjct: 221 YSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCG--GIIFGTVAGFVSM 278 Query: 279 AKTQFPDVNILQLAFFGP---FIGALARSA----GGVISDKFGGVRVTLINFIFMALFTA 331 DV+ L A G F G ++ GG++ D+ G + V I F+++ Sbjct: 279 VPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFL 338 Query: 332 LLFLTLPGSGAGSFSAFYLVFMGLFLTAGLGS 363 L + V GL T + S Sbjct: 339 TASFLLETTSWFMTIIIVFVLGGLSFTKTVIS 370
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.8 bits (67), Expect = 0.022 Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 1/58 (1%) Query: 140 TPFSTFIIISLLCGFAGANF-ASSMANISFFFPKQKQGGALGLNGGLGNMGVSVMQLV 196 + FS I+ + G A F A M ++ + PK+ +G A GL G + MG V + Sbjct: 101 SFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAI 158
>PF06580#Sensor histidine kinase Length = 349 Score = 51.4 bits (123), Expect = 4e-09 Identities = 30/123 (24%), Positives = 54/123 (43%), Gaps = 17/123 (13%) Query: 473 SARFGFTVKLDYQLPPRL----VPSHQAIHLLQIAREALSNALKH-----SHADDVVVTV 523 S +F ++ + Q+ P + VP L+Q E N +KH +++ Sbjct: 233 SIQFEDRLQFENQINPAIMDVQVPPM----LVQTLVE---NGIKHGIAQLPQGGKILLKG 285 Query: 524 TQCGKQVKLKVQDNGCGVPENAERSNHYGMIIMRDRAQSLRG-DCQVRRRETGGTEVTVT 582 T+ V L+V++ G +N + S G+ +R+R Q L G + Q++ E G + Sbjct: 286 TKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 583 FIP 585 IP Sbjct: 346 LIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 72.2 bits (177), Expect = 6e-17 Identities = 33/117 (28%), Positives = 56/117 (47%), Gaps = 2/117 (1%) Query: 7 ATILLIDDHPMLRTGVKQLVSMAPDISVVGEASNGEQGIDLAESLDPDLILLDLNMPGMN 66 ATIL+ DD +RT + Q +S A V SN + D DL++ D+ MP N Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 67 GLETLDKLREKALSGRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALQQA 123 + L ++++ ++V S N + A ++GA YL K + +L+ + +A Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118
>INTIMIN#Intimin signature. Length = 939 Score = 247 bits (632), Expect = 1e-74 Identities = 125/444 (28%), Positives = 216/444 (48%), Gaps = 24/444 (5%) Query: 57 SFSLSLLLLTASGTIRAQAQDPFDQNHL----PDLGMMPESHEGEKHFAEMAKAFGEASM 112 F S L L S + A N L PD+ + + ++A A + + Sbjct: 118 PFEYSALPLLGSAPLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYAAQQAASLGSQL 177 Query: 113 KNNDLDTGEQARQFAFGQVRDVVSEQVNQQLESWLSAWGSASVDINVDNEGHFNGSRGSW 172 ++ L+ G+ A+ A G + Q + QL++WL +G+A V++ N F+GS + Sbjct: 178 QSRSLN-GDYAKDTALG----IAGNQASSQLQAWLQHYGTAEVNLQSGNN--FDGSSLDF 230 Query: 173 FIPLQDKQRYLTWSQLGLTQQTDGLVSNIGVGQRWAQDGWLLGYNTFYDNLLDENLQRAG 232 +P D ++ L + Q+G +N+G GQR+ +LGYN F D + R G Sbjct: 231 LLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLG 290 Query: 233 FGAEAWGEYLRLSANYYQPFADWQT--HTATLEQRMARGYDINAQMRLPFYQHINTSVSL 290 G E W +Y + S N Y + W + ++R A G+DI LP Y + + Sbjct: 291 IGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMY 350 Query: 291 EQYFGDSVDLFDSGTGYHNPVALKLGLNYTPVPLLTVTAQHKQGESGVSQNNLGLTLNYR 350 EQY+GD+V LF+S NP A +G+NYTP+PL+T+ ++ G + + Y+ Sbjct: 351 EQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQ 410 Query: 351 FGVPLKKQLAASEVAQSQSLRGSRYDTLQRNSLPTMEYRQRKTLTVFLATPPWDLTPGET 410 F P +Q+ V + ++L GSRYD +QRN+ +EY+++ L++ + + T T Sbjct: 411 FDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTERST 469 Query: 411 VALKLQVRSVHGIRHLSWQGDTQALSLTAG----TDTRSTEGWTIIMPAWDHREGAANRW 466 ++L V+S +G+ + W D AL G + ++S + + I+PA+ +G +N + Sbjct: 470 QKIQLIVKSKYGLDRIVW--DDSALRSQGGQIQHSGSQSAQDYQAILPAY--VQGGSNVY 525 Query: 467 RLSVVVEDEKGQRVSSNEITLALT 490 +++ D G SSN + L +T Sbjct: 526 KVTARAYDRNGN--SSNNVLLTIT 547
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 419 bits (1080), Expect = e-149 Identities = 100/351 (28%), Positives = 179/351 (50%), Gaps = 14/351 (3%) Query: 7 DDKTEAPTPHRLEKAREEGQIPRSRELTSLLILLVGVCIIWFGGESLARQLAGMLSAGLH 66 +KTE PTP ++ AR++GQ+ +S+E+ S +++ ++ + + ++ + Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLML--IP 60 Query: 67 FDHRMVNDPNLILGQIILLIKAAMMALLPLIAGVVLVALISPVMLGGLIFSGKSLQPKFS 126 + + + + ++ PL+ L+A+ S V+ G + SG++++P Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120 Query: 127 KLNPLPGIKRMFSAQTGAELLKAVLKSTLVGCVTGFYLWHHWPQMMRLMAESPIVAMGNA 186 K+NP+ G KR+FS ++ E LK++LK L+ + + + +++L P + Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQL----PTCGIECI 176 Query: 187 LDLVGLCALLVVLGVIPMVGF------DVFFQIFSHLKKLRMSRQDIRDEFKESEGDPHV 240 L+G +L L VI VGF D F+ + ++K+L+MS+ +I+ E+KE EG P + Sbjct: 177 TPLLG--QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEI 234 Query: 241 KGKIRQMQRAAAQRRMMEDVPKADVIVTNPTHYSVALQYDENKMSAPKVVAKGAGLIALR 300 K K RQ + R M E+V ++ V+V NPTH ++ + Y + P V K Sbjct: 235 KSKRRQFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQT 294 Query: 301 IRELGAEHRVPTLEAPPLARALYRHAEIGQQIPGQLYAAVAEVLAWVWQLK 351 +R++ E VP L+ PLARALY A + IP + A AEVL W+ + Sbjct: 295 VRKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 88.7 bits (220), Expect = 7e-24 Identities = 29/105 (27%), Positives = 51/105 (48%), Gaps = 3/105 (2%) Query: 7 KFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGVDALNKLQAGGFGFIISDWNMPNMDGL 66 LV DD + +R ++ L G++ V + + AG +++D MP+ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 67 ELLKTIRADSAMSALPVLMVTAEAKKENIIAAAQAGASGYVVKPF 111 +LL I+ LPVL+++A+ I A++ GA Y+ KPF Sbjct: 64 DLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 65.2 bits (159), Expect = 7e-14 Identities = 31/142 (21%), Positives = 62/142 (43%), Gaps = 6/142 (4%) Query: 1 MSKIRVLSVDDSALMRQIMTEIINSHSDMEMVATAPDPLVARDLIKKFNPDVLTLDVEMP 60 M+ +L DD A +R ++ + ++ V + I + D++ DV MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 RMDGLDFLEKLMRLRPMPVVMVSSLTGKGS-EVTLRALELGAIDFVTKPQLGIREGMLAY 119 + D L ++ + RP V+V ++ + + ++A E GA D++ KP + E + Sbjct: 59 DENAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGII 115 Query: 120 SEMIAEKVRTAARARIAAHKPM 141 +AE R ++ + M Sbjct: 116 GRALAEPKRRPSKLEDDSQDGM 137
>PF06580#Sensor histidine kinase Length = 349 Score = 42.2 bits (99), Expect = 4e-06 Identities = 23/151 (15%), Positives = 49/151 (32%), Gaps = 52/151 (34%) Query: 378 ELDKSLIERIIDPLT--HLVRNSLDHGIEMPEKRLEAGKNAVGNLILSAEHQGGNICIEV 435 +++ ++++ + P+ LV N + HGI G ++L G + +EV Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEV 296 Query: 436 TDDGAGLNRERILAKAMSQGMAVNENMTDDEVGMLIFAPGFSTAEQVTDVSGRGVGMDVV 495 + G+ + G G+ V Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318 Query: 496 KRNIQEMGG---HVEIQSKQGSGTTIRILLP 523 + +Q + G +++ KQG +L+P Sbjct: 319 RERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 42.2 bits (99), Expect = 1e-06 Identities = 25/118 (21%), Positives = 46/118 (38%), Gaps = 11/118 (9%) Query: 162 FKTGSAEVEPYMRDILRAIAPVL---NGIPNRISLAGHTDDFPYANGEKGYSNWELSADR 218 F A ++P + L + L + + + G+TD G Y N LS R Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI----GSDAY-NQGLSERR 277 Query: 219 ANASRRELVAGGLDNGKVLRVVGMAATMRLSDRGPDDAINRR--ISLLVLNKQAEQAI 274 A + L++ G+ K+ GM + ++ D+ R I L +++ E + Sbjct: 278 AQSVVDYLISKGIPADKI-SARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334
>PF05844#YopD protein Length = 295 Score = 31.9 bits (72), Expect = 0.002 Identities = 12/28 (42%), Positives = 21/28 (75%), Gaps = 2/28 (7%) Query: 76 MDLLALLYRLMAKSRQQGMFSLERDIEN 103 ++LL +L+R+ K+R+ G+ L+RD EN Sbjct: 74 VELLLILFRIAQKARELGV--LQRDNEN 99
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.4 bits (66), Expect = 0.025 Identities = 10/53 (18%), Positives = 17/53 (32%), Gaps = 2/53 (3%) Query: 184 RFTLLPIFRIPVKMQKVSAASPLTQKPDQARRRF--RLGMLVFIGMIGWALLT 234 R L R + + + A L + P R R M + ++L Sbjct: 26 RKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLG 78
>PF01206#SirA family protein Length = 76 Score = 92.5 bits (230), Expect = 6e-29 Identities = 16/71 (22%), Positives = 37/71 (52%) Query: 7 DYRLDMVGEPCPYPAVATLEAMPQLKKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQ 66 D LD G CP P + + + + GE+L V++ P S+ + ++ G+ +L+ ++ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 67 DGPTIRYLIQK 77 + T + +++ Sbjct: 65 EDGTYHFRLKR 75
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 213 bits (543), Expect = 5e-71 Identities = 231/260 (88%), Positives = 246/260 (94%) Query: 1 MIQVTSEQWLYWLHLYFWPLLRVLALISTAPILSERAIPKRVKLGLGIMITLVIAPSLPA 60 M+QVTSEQWL WL+LYFWPLLRVLALISTAPILSER++PKRVKLGL +MIT IAPSLPA Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 Query: 61 NDTPLFSIAALWLAMQQILIGIALGFTMQFAFAAVRTAGEFIGLQMGLSFATFVDPGSHL 120 ND P+FS ALWLA+QQILIGIALGFTMQFAFAAVRTAGE IGLQMGLSFATFVDP SHL Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120 Query: 121 NMPVLARIMDMLAMLLFLTFNGHLWLISLLVDTFHTLPIGSNPVNSNAFMALARAGGLIF 180 NMPVLARIMDMLA+LLFLTFNGHLWLISLLVDTFHTLPIG P+NSNAF+AL +AG LIF Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180 Query: 181 LNGLMLALPVITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGIMLMAALMPLIAPFC 240 LNGLMLALP+ITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGI LMAALMPLIAPFC Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 Query: 241 EHLFSEIFNLLADIVSEMPI 260 EHLFSEIFNLLADI+SE+P+ Sbjct: 241 EHLFSEIFNLLADIISELPL 260
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 537 bits (1386), Expect = 0.0 Identities = 261/389 (67%), Positives = 298/389 (76%), Gaps = 17/389 (4%) Query: 1 MKVKVLSLLVPALLVAGAANAAEIYNKDGNKLDLFGKVDGLHYFSDDKGSDGDQTYMRIG 60 MK KVL+L++PALL AGAA+AAEIYNKDGNKLDL+GKVDGLHYFSDD DGDQTYMR+G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 FKGETQVNDQLTGYGQWEYQIQGNQTEG-SNDSWTRVAFAGLKFADAGSFDYGRNYGVTY 119 FKGETQ+NDQLTGYGQWEY +Q N TEG +SWTR+AFAGLKF D GSFDYGRNYGV Y Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 120 DVTSWTDVLPEFGGDTYG-ADNFMQQRGNGYATYRNTDFFGLVDGLDFALQYQGKNGSVS 178 DV WTD+LPEFGGD+Y ADN+M R NG ATYRNTDFFGLVDGL+FALQYQGKN S S Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 179 GEN--------TNGRSLLNQNGDGYGGSLTYAIGEGFSVGGAITTSKRTADQNNTADEHL 230 ++ NG + NGDG+G S TY IG GFS G A TTS RT +Q N Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGG--T 238 Query: 231 YGNGDRATVYTGGLKYDANNIYLAAQYSQTYNATRFGTSNGNNKSTSYGFANKAQNFEVV 290 GD+A +T GLKYDANNIYLA YS+T N T +G ++ G ANK QNFEV Sbjct: 239 IAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDK---GYDGGVANKTQNFEVT 295 Query: 291 AQYQFDFGLRPSVAYLQSKGKDISNGYGASYGDQDIVKYVDVGATYYFNKNMSTYVDYKI 350 AQYQFDFGLRP+V++L SKGKD++ + D+D+VKY DVGATYYFNKN STYVDYKI Sbjct: 296 AQYQFDFGLRPAVSFLMSKGKDLTYN-NVNGDDKDLVKYADVGATYYFNKNFSTYVDYKI 354 Query: 351 NLLDKND-FTRDAGINTDDIVALGLVYQF 378 NLLD +D F +DAGI+TDDIVALG+VYQF Sbjct: 355 NLLDDDDPFYKDAGISTDDIVALGMVYQF 383
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 49.1 bits (117), Expect = 5e-09 Identities = 26/145 (17%), Positives = 60/145 (41%), Gaps = 20/145 (13%) Query: 16 MNNMNVIIADDHPIVLFGIRKSLEQIEWVNVVGEFEDSTALINNLPKLDAHVLITDLSMP 75 M +++ADD + + ++L + + + ++ L + D +++TD+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 76 GDKYGDGITLIKYIKRHFPSLSIIVLTMNNNPAILSAVLDLDIEGIVLKQGA------PT 129 + L+ IK+ P L ++V++ N +A+ ++GA P Sbjct: 59 D---ENAFDLLPRIKKARPDLPVLVMSAQNTFM--TAIKA-------SEKGAYDYLPKPF 106 Query: 130 DLPKALAALQKGKKFTPESVSRLLE 154 DL + + + + S+L + Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLED 131
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.9 bits (197), Expect = 1e-17 Identities = 29/104 (27%), Positives = 47/104 (45%) Query: 827 ILVVDDHPINRRLLADQLGSLGYQCKTANDGVDALNVLSKNAIDIVLSDVNMPNMDGYRL 886 ILV DD R +L L GY + ++ ++ D+V++DV MP+ + + L Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 887 TQRIRQLGLTLPVVGVTANALAEEKQRCLESGMDSCLSKPVTLD 930 RI++ LPV+ ++A + E G L KP L Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 27.8 bits (62), Expect = 0.031 Identities = 16/75 (21%), Positives = 29/75 (38%), Gaps = 15/75 (20%) Query: 133 AERDTQAYLKLDHDFHYVFVKYADNKYISQAHLLISARLLAIRYRLDFTAEYITSSNRGH 192 A+R+ L F VF+ + A+RY L+ Y S+ G Sbjct: 62 ADREGMTDLFASGHFERVFISPH------RL---------AVRYSLENPHAYADSNLTGF 106 Query: 193 ATILDMLKNNNVEGV 207 IL+ ++N ++ + Sbjct: 107 LNILEGCRHNKIQHL 121
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 31.7 bits (72), Expect = 0.006 Identities = 23/133 (17%), Positives = 47/133 (35%), Gaps = 20/133 (15%) Query: 87 VLKAIRDAGICAEANSQYEVRKCLEIGFRGDQIVFNGVVKKPADLEYAIANDLYLINVDS 146 + AI A N + E E G++G ++ G DLE + L Sbjct: 46 IWSAIGATDGFALLNLE-EAITLRERGWKGPILMLEGFFH-AQDLEIYDQHRLTT----C 99 Query: 147 LYELEHIDAIS-RKLKKVANVCVRVEPNVPSATHAELVTAFHAKSGLDLEQAEETCRRIL 205 ++ + A+ +LK ++ ++V + + + G ++ +++ Sbjct: 100 VHSNWQLKALQNARLKAPLDIYLKVN------------SGMN-RLGFQPDRVLTVWQQLR 146 Query: 206 AMPYVHLRGLHMH 218 AM V L H Sbjct: 147 AMANVGEMTLMSH 159
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 343 bits (882), Expect = e-116 Identities = 120/371 (32%), Positives = 183/371 (49%), Gaps = 24/371 (6%) Query: 122 NMSGVRRLQEQVVELNQLLYADHHE---KHHAIITENPEMLSNIAKAKRLAASNIPVTIV 178 +++ + + + + + + + ++ + M RL +++ + I Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT 166 Query: 179 GETGTGKELFSRLIHQCSKRANKPFIALNCGALPPTLIESTLFDTVRGAYTGAENS-QGY 237 GE+GTGKEL +R +H KR N PF+A+N A+P LIES LF +GA+TGA+ G Sbjct: 167 GESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGR 226 Query: 238 LELANGGTLFLDELNAMPIEMQSKLLRFLQDKTFWRLGGQQQLHSDVRIVAAMNEAPVKL 297 E A GGTLFLDE+ MP++ Q++LLR LQ + +GG+ + SDVRIVAA N+ + Sbjct: 227 FEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQS 286 Query: 298 IQQERLRADLFYRLSVGMLTLPPLRARPEDIPLLANYFFDKYRNDVPQDIHGLSETARAD 357 I Q R DL+YRL+V L LPPLR R EDIP L +F + + D+ + A Sbjct: 287 INQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALEL 345 Query: 358 LLNHAWPGNVRMLENAIVRSMIMQEKDGLLKHIIF-------------------EQDELN 398 + H WPGNVR LEN + R + +D + + II ++ Sbjct: 346 MKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSIS 405 Query: 399 LGVPETAPENPLPSSPDPQYEGSLEVRVANYERHLIETALDTHQGNIAAAARSLNVSRTT 458 V E + G + +A E LI AL +GN AA L ++R T Sbjct: 406 QAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNT 465 Query: 459 LQYKVQKYAIR 469 L+ K+++ + Sbjct: 466 LRKKIRELGVS 476
>ANTHRAXTOXNA#Anthrax toxin LF subunit signature. Length = 800 Score = 33.6 bits (76), Expect = 0.002 Identities = 13/37 (35%), Positives = 24/37 (64%), Gaps = 2/37 (5%) Query: 469 KDVDQQYLDFLDSLRND-DAKAVLFQNEM-ENLEMHN 503 K +D ++L+ + SL +D D+ +LF + E LE++N Sbjct: 186 KSLDPEFLNLIKSLSDDSDSSDLLFSQKFKEKLELNN 222
>PERTACTIN#Pertactin signature. Length = 922 Score = 28.9 bits (64), Expect = 0.019 Identities = 19/60 (31%), Positives = 22/60 (36%), Gaps = 4/60 (6%) Query: 99 PIPVETPKPKPVEKPKPQPKPQQPVVAASTPTPAPQPATDDKPAPTGKAYVVQLGALKNA 158 P P P+P P P+P PQ P P QP P G+ L A NA Sbjct: 569 PAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRE----LSAAANA 624 Score = 28.5 bits (63), Expect = 0.022 Identities = 16/49 (32%), Positives = 17/49 (34%) Query: 106 KPKPVEKPKPQPKPQQPVVAASTPTPAPQPATDDKPAPTGKAYVVQLGA 154 K P KP PQP PQ P P P P +A Q A Sbjct: 566 KAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPA 614
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 28.7 bits (64), Expect = 0.026 Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 6/63 (9%) Query: 204 VRNIVGS-LLEVGAHNQPESWIAELLAARDRTLAAATAKAEGLYLVAVDYPDRFDLPKPP 262 +NI GS ++ + PES + E AA LAA +K++ V VDY + ++ KP Sbjct: 496 TKNIPGSHVIVKNIMDIPESTLLE--AAN---LAAYYSKSQNSSNVPVDYTEVKNVKKPN 550 Query: 263 MGP 265 Sbjct: 551 GAK 553
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 397 bits (1021), Expect = e-144 Identities = 236/251 (94%), Positives = 248/251 (98%) Query: 1 MKLRLSALALGTTLLVGCASSGTEQQGRSDPFEGFNRTMYNFNFNVLDPYVVRPVAVAWR 60 MKLRLSALALGTTLLVGCASSGT+QQGRSDP EGFNRTMYNFNFNVLDPY+VRPVAVAWR Sbjct: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60 Query: 61 DYVPQPARNGLSNFTGNLEEPAIMVNYFLQGDPYQGMVHFTRFFLNTLLGMGGFIDVAGM 120 DYVPQPARNGLSNFTGNLEEPA+MVNYFLQGDPYQGMVHFTRFFLNT+LGMGGFIDVAGM Sbjct: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120 Query: 121 ANPKLQRVEPHRFGSTLGHYGVGYGPYMQIPFYGSFTLREDGGDMADTLYPVLSWLTWPM 180 ANPKLQR EPHRFGSTLGHYGVGYGPY+Q+PFYGSFTLR+DGGDMAD LYPVLSWLTWPM Sbjct: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTWPM 180 Query: 181 SIGKWTIEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGKLKPQENPNAQA 240 S+GKWT+EGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGG+LKPQENPNAQA Sbjct: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240 Query: 241 IQDELKEIDSE 251 IQD+LK+IDSE Sbjct: 241 IQDDLKDIDSE 251
>PF06580#Sensor histidine kinase Length = 349 Score = 28.3 bits (63), Expect = 0.036 Identities = 22/113 (19%), Positives = 46/113 (40%), Gaps = 12/113 (10%) Query: 199 WIIATMVWMFPAAGGAKIVVIILMTWLIALGDTTHIVVGSVEILYLV-FNGTLPWSDFFW 257 I W+ G I+ ++ +I + V + I L+ F T P + F Sbjct: 61 SFIKRQGWLKLNMG-QIILRVLPACVVIGM----VWFVANTSIWRLLAFINTKPVA-FTL 114 Query: 258 PFALPTLAGNICGGTFIFALMSHAQIRNDMSNKRKEEARLRGERLERERKKAE 310 P AL ++ N+ TF+++L+ K ++A + ++ ++A+ Sbjct: 115 PLAL-SIIFNVVVVTFMWSLLYFGWHFF----KNYKQAEIDQWKMASMAQEAQ 162
>OMPTIN#Omptin serine protease signature. Length = 317 Score = 476 bits (1226), Expect = e-173 Identities = 149/320 (46%), Positives = 213/320 (66%), Gaps = 11/320 (3%) Query: 1 MKKHAIAVMMIAVFSESVYAESTLFIPDVSPDSVTTSLSVGVLNGKSRELVYD-TDTGRK 59 M+ + +++ + S +A + +PD++ +S+G L+GK++E VY + GRK Sbjct: 1 MRAKLLGIVLTTPIAISSFASTET--LSFTPDNINADISLGTLSGKTKERVYLAEEGGRK 58 Query: 60 LSQLDWKIKNVATLQGDLSWEPYSFMTLDARGWTSLASGSGHMVDHDWMSSEQPG-WTDR 118 +SQLDWK N A ++G ++W+ +++ A GWT+L S G+MVD DWM S PG WTD Sbjct: 59 VSQLDWKFNNAAIIKGAINWDLMPQISIGAAGWTTLGSRGGNMVDQDWMDSSNPGTWTDE 118 Query: 119 SIHPDTSVNYANEYDLNVKGWLLQGDNYKAGVTAGYQETRFSWTARGGSYIYDNGR---- 174 S HPDT +NYANE+DLN+KGWLL NY+ G+ AGYQE+R+S+TARGGSYIY + Sbjct: 119 SRHPDTQLNYANEFDLNIKGWLLNEPNYRLGLMAGYQESRYSFTARGGSYIYSSEEGFRD 178 Query: 175 YIGNFPHGVRGIGYSQRFEMPYIGLAGDYRINDFECNVLFKYSDWVNAHDNDEHY--MRK 232 IG+FP+G R IGY QRF+MPYIGL G YR DFE FKYS WV + DNDEHY ++ Sbjct: 179 DIGSFPNGERAIGYKQRFKMPYIGLTGSYRYEDFELGGTFKYSGWVESSDNDEHYDPGKR 238 Query: 233 LTFREKTENSRYYGASIDAGYYITSNAKIFAEFAYSKYEEGKGGTQIIDKTSGDTAYFGG 292 +T+R K ++ YY +++AGYY+T NAK++ E A+++ KG T + D + +T+ + Sbjct: 239 ITYRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRVTNKKGNTSLYDHNN-NTSDYSK 297 Query: 293 DAAGIANNNYTVTAGLQYRF 312 + AGI N N+ TAGL+Y F Sbjct: 298 NGAGIENYNFITTAGLKYTF 317
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 204 bits (520), Expect = 1e-63 Identities = 105/427 (24%), Positives = 169/427 (39%), Gaps = 73/427 (17%) Query: 1 MSDVCMPGCSGIDLMTLFHQDDDQLPILLITGHGDVPMAVDAVKKGAWDFLQKPVDPGKL 60 ++DV MP + DL+ + LP+L+++ A+ A +KGA+D+L KP D +L Sbjct: 52 VTDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111 Query: 61 LILIEDALRQRRSVIARRQYCQQTLQVELIGRSEWMNQFRQRLQQLAETDIAVWFYGEHG 120 + +I AL + + ++ + Q L+GRS M + + L +L +TD+ + GE G Sbjct: 112 IGIIGRALAEPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESG 170 Query: 121 TGRMTGARYLHQLGRNAKGPFVRYELT--PENAGQLETF-----------------IDQA 161 TG+ AR LH G+ GPFV + P + + E F +QA Sbjct: 171 TGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQA 230 Query: 162 QGGTLVLSHPEYLTREQQHHLAR-LQSLEHRP----------FRLVGVGSASLVEQAAAN 210 +GGTL L + + Q L R LQ E+ R+V + L + Sbjct: 231 EGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQG 290 Query: 211 QIAAELYYCFAMTQIACQSLSQRPDDIEPLFRHYLRKACLRLNHPVPEIAGELLKGIMRR 270 +LYY + + L R +DI L RH++++A + V E L+ + Sbjct: 291 LFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAH 349 Query: 271 AWPSNVRELANAAELFAV-----------------------------------GVLPLAE 295 WP NVREL N + E Sbjct: 350 PWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVE 409 Query: 296 TVNPQLL------LQEPTPLDRRVEEYERQIITEALNIHQGRINEVAEYLQIPRKKLYLR 349 Q L DR + E E +I AL +G + A+ L + R L + Sbjct: 410 ENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK 469 Query: 350 MKKYGLS 356 +++ G+S Sbjct: 470 IRELGVS 476
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 28.7 bits (64), Expect = 0.049 Identities = 5/35 (14%), Positives = 15/35 (42%), Gaps = 4/35 (11%) Query: 342 QQLVQRMFDTAISFRLAQLKDAWRALHSAETRLKR 376 +++ + LA ++++W + RL + Sbjct: 150 NSVMEGVIVRI----LANVRESWTQVIDLRPRLGQ 180
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 34.4 bits (79), Expect = 8e-04 Identities = 72/429 (16%), Positives = 140/429 (32%), Gaps = 45/429 (10%) Query: 28 IQALLSVFLGYLAYYIVRNNFTLSTPYLKEQLDLSATQI---GLLSSCMLIAYGISKGVM 84 I L +V L + ++ P L L S G+L + + V+ Sbjct: 8 IVILSTVALDAVGIGLI----MPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVL 63 Query: 85 SSLADKASPKVFMACGLVLCAIVNVGLGFSSAFWIFAALVVFNGLFQGMGVGPSFITIAN 144 +L+D+ + + L A+ + + W+ + G+ G IA+ Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIAD 122 Query: 145 WFPRRERGRVGAFWNISHNVGGGIVA-PIVGAAFAILGSEHWQSASYIVPACVAVIFALI 203 ER R F +S G G+VA P++G A + A + + L Sbjct: 123 ITDGDERAR--HFGFMSACFGFGMVAGPVLGGLMGGFSPH----APFFAAAALNGLNFLT 176 Query: 204 VLVLGKGSPREEGLPSLEQMMPEEKVVLKTKNTAKAPENMSAWQIFCTYVLRNKNAWYIS 263 L +PE + + P A ++ + Sbjct: 177 GCFL----------------LPE------SHKGERRPLRREALNPLASFRWARGMTVVAA 214 Query: 264 LVDVFVYMVRFGMISWLPIYLLTVKHFSKEQMSVAFLFFEWA---AIPSTLLAGWLSDKL 320 L+ VF M G + + F + ++ + ++ ++ G ++ +L Sbjct: 215 LMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARL 274 Query: 321 FKGRRMPLAMICMALIFVCLIGYWKSESLLMVTIFAAIVGCLIYVPQFLASVQTMEIVPS 380 + R + L MI ++ L + + + A G + Q + S Q E Sbjct: 275 GERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQG 334 Query: 381 FAVGSAVGLRGFMSYIFGASLGTSLFGVMVDKLGWYGGFYLLMGGIVCCILFCYLSHRGA 440 GS L ++ I G L T+++ + + G+ + G + + L RG Sbjct: 335 QLQGSLAALTS-LTSIVGPLLFTAIYAA---SITTWNGWAWIAGAALYLLCLPAL-RRGL 389 Query: 441 LELERQRQN 449 QR + Sbjct: 390 WSGAGQRAD 398
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 29.8 bits (67), Expect = 0.014 Identities = 8/55 (14%), Positives = 17/55 (30%) Query: 277 FAIMKLPLADINAQNAMMHAGKSSEADVQGHVDGWINAHQQQFDGWVKEALAAQK 331 F + ++P + S +D + HV + + Q + Q Sbjct: 133 FVLNEMPARYRKNVLGAVLLSPSQSSDFEIHVSEMVTSDNQSARYLTLPEVNKQT 187
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 46.4 bits (110), Expect = 1e-07 Identities = 31/165 (18%), Positives = 66/165 (40%), Gaps = 2/165 (1%) Query: 34 LDTIAHHFSLSASSAGFIVTAAQLGYAAGLLFLVPLGDMFE-RRTLIVSMTLLAAGGMLI 92 L IA+ F+ +S ++ TA L ++ G L D +R L+ + + G ++ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 93 TASSQSLSMMILGTALTGLFSVVAQILVPLA-ATLATPATRGKVVGTIMSGLLLGILLAR 151 S++I+ + G + LV + A RGK G I S + +G + Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 152 TVAGLLANLGGWRTVFWVASALMALMAVALWRGLPKLKSDTHLNY 196 + G++A+ W + + + + + +++ H + Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 75.6 bits (186), Expect = 5e-17 Identities = 63/418 (15%), Positives = 128/418 (30%), Gaps = 101/418 (24%) Query: 28 KRKTALLLLTLLFVIIAVAYGIYWFLVLRHIEETDDA----YVAGNQVQITAQVSGSVTK 83 +R + + F++IA + L +E A +G +I + V + Sbjct: 55 RRPRLVAYFIMGFLVIAFILSV-----LGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109 Query: 84 VWADNTDFVKEGDVLVTLDQT--------------------------------------- 104 + + V++GDVL+ L Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169 Query: 105 -------------DAKQAFEKAKTALASSVRQTHQLMINSKQ-------LQANIDVQKTA 144 + + K ++ Q +Q +N + + A I+ + Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENL 229 Query: 145 LAQAQSDLNRRVPLGNANLIGREELQHARDAVASAQAQLDVAIQQYNANQAMILNSNLED 204 +S L+ L + I + + + A +L V Q ++ IL++ E Sbjct: 230 SRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEY 289 Query: 205 QPAVQQAATEVRN------------------AWLALERTRIVSPMTGYVSRRAVQ-PGAQ 245 Q Q E+ + + + I +P++ V + V G Sbjct: 290 QLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGV 349 Query: 246 ISPTTPLMAVVPATD-LWVDANFKETQLANMRIGQPVTIITDIYGDDVKY---TGKVVGL 301 ++ LM +VP D L V A + + + +GQ I + + +Y GKV + Sbjct: 350 VTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF-PYTRYGYLVGKVKNI 408 Query: 302 DMGTGSAFSLLPAQNATGNWIKVVQRLPVRVELDARQLEQHPLRIGLSTLVTVDTANR 359 + ++ G V+ + + PL G++ + T R Sbjct: 409 -----NLDAIE--DQRLGLVFNVIISIEENCLSTGNK--NIPLSSGMAVTAEIKTGMR 457
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 130 bits (328), Expect = 5e-35 Identities = 94/405 (23%), Positives = 164/405 (40%), Gaps = 23/405 (5%) Query: 17 IALSLATFMQVLDSTIANVAIPTIAGNLGSSLSQGTWVITSFGVANAISIPLTGWLAKRF 76 I L + +F VL+ + NV++P IA + + WV T+F + +I + G L+ + Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76 Query: 77 GEVKLFMWSTVAFAAASWACGVS-SSLNMLIFFRVVQGVVAGPLIPLSQSLLLNNYPPAK 135 G +L ++ + S V S ++LI R +QG A L ++ P Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136 Query: 136 RSIALALWSMTVIVAPICGPILGGYISDNYHWGWIFFINVPIGIAVVLMTLHTLRGRETH 195 R A L V + GP +GG I+ HW ++ I + I I V + L+ Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVRI 195 Query: 196 TERRRIDAVGLALLVIGIGSLQIMLDRGKELDWFSSQEIIILTVVAVIAISFLIVWELTD 255 D G+ L+ +GI + ML F++ I +V+V++ + Sbjct: 196 KG--HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKV 243 Query: 256 DHPIVDLSLFKSRNFTIGCLCISLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGI 315 P VD L K+ F IG LC + + G + ++P ++++V+ + G G Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303 Query: 316 IPVILS-PIIGRFAHKLDMRRLVTFSFIMYAVCFYWRAWTFEPGMDFGASAWPQFIQRF- 373 + VI+ I G + ++ +V F ++ S + I F Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFMTIIIVFV 358 Query: 374 --AVACFFMPLTTITLSGLPPERLAAASSLSNFTRTLAGSIGTSI 416 ++ ++TI S L + A SL NFT L+ G +I Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature. Length = 171 Score = 287 bits (736), Expect = e-103 Identities = 130/170 (76%), Positives = 145/170 (85%) Query: 2 PLLDSFAVDHTRMQAPAVRVAKTMNTPHGDAITVFDLRFCIPNKEVMPEKGIHTLEHLFA 61 PLLDSF VDHTRM APAVRVAKTM TP GD ITVFDLRF PNK+++ EKGIHTLEHL+A Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60 Query: 62 GFMRDHLNGNGVEIIDISPMGCRTGFYMSLIGTPDEQRVADAWKAAMADVLKVQDQNQIP 121 GFMR+HLNG+ VEIIDISPMGCRTGFYMSLIGTP EQ+VADAW AAM DVLKV++QN+IP Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120 Query: 122 ELNVYQCGTYQMHSLSEAQDIARHILERDVRVNSNKELALPKEKLQELHI 171 ELN YQCGT MHSL EA+ IA++ILE V VN N ELALP+ L+EL I Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLRELRI 170
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 35.8 bits (82), Expect = 4e-05 Identities = 37/144 (25%), Positives = 62/144 (43%), Gaps = 26/144 (18%) Query: 57 PPCTIGGAS---VEFGDVLTTKVGDASQTKPVGYSLNCDGRASDYLKLQIQGTTTTISGE 113 PPCTI V+FG++ V ++ S++C + S L +++ G T + Sbjct: 32 PPCTINNGQNIVVDFGNINPEHVDNSRGEVTKNISISCPYK-SGSLWIKVTGNTMGVGQN 90 Query: 114 QVLQTSVQGLGIRIQQ-------------AGNKQLVPVGI-TDWLNFTLSGSNGPELEAV 159 VL T++ GI + Q +GN V G+ T FT + +V Sbjct: 91 NVLATNITHFGIALYQGKGMSTPLTLGNGSGNGYRVTAGLDTARSTFTFT--------SV 142 Query: 160 PVKEPTTQLAGGDFNASATLVVDY 183 P + + L GGDF +A++ + Y Sbjct: 143 PFRNGSGILNGGDFRTTASMSMIY 166
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 43.2 bits (101), Expect = 7e-08 Identities = 44/170 (25%), Positives = 77/170 (45%), Gaps = 18/170 (10%) Query: 5 IVKRVLILTLLITQFAC-AD-NLTFHGKLINPPACTINNGETLEVSFGSVIIDNIDGVNY 62 +++ L ++LL+T A AD + G + PP CTINNG+ + V FG++ +++D N Sbjct: 1 MIRLSLFISLLLTSVAVLADVQINIRGNVYIPP-CTINNGQNIVVDFGNINPEHVD--NS 57 Query: 63 LTEIPWTLTCDSSFRDDALTFTLSYLGTATPYSANALTTNVPELGIELQQNGTVFPPGT- 121 E+ ++ ++ +L ++ T N L TN+ GI L Q + P T Sbjct: 58 RGEVTKNISISCPYKSGSLWIKVTG-NTMGVGQNNVLATNITHFGIALYQGKGMSTPLTL 116 Query: 122 ----------SLTIDES-SLPTLKAVPVKQPGKEPAEGDFEAFATLQVDY 160 + +D + S T +VP + GDF A++ + Y Sbjct: 117 GNGSGNGYRVTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMIY 166
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 38.1 bits (88), Expect = 3e-05 Identities = 26/92 (28%), Positives = 39/92 (42%), Gaps = 2/92 (2%) Query: 156 AQGCEGKNVIIVGAGT-IGLLALQCARELGARSVTAIDINPQKLELAKALGATHTCNSRE 214 A+G EGK I GA IG + GA + A+D NP+KLE + ++ Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAH-IAAVDYNPEKLEKVVSSLKAEARHAEA 61 Query: 215 MTADDIQTALSDIQFDQLVLETAGTPQTVSLA 246 AD +A D ++ E V++A Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVA 93
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 33.5 bits (76), Expect = 0.003 Identities = 23/120 (19%), Positives = 38/120 (31%), Gaps = 8/120 (6%) Query: 289 QAVEMQPAAAPDAPVEPGVEETQPQMTNGVASPSQASVSDLTDDAPAQSATPVSAPQTPP 348 Q+ +QP A P +P V +PQ + A + + PV+ T Sbjct: 1135 QSETVQPQAEPARENDPTVNIKEPQSQTN----TTADTEQPAKETSSNVEQPVTESTTVN 1190 Query: 349 ATASAPADPSAELKIYDTSSQPLD-QVLAQVQQDGASIVVGPLLKNNVEALMKSNTPLNV 407 S +P ++QP + ++ V + N A SN V Sbjct: 1191 TGNSVVENPENTT---PATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTV 1247
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 28.0 bits (62), Expect = 0.028 Identities = 26/111 (23%), Positives = 44/111 (39%), Gaps = 22/111 (19%) Query: 42 NKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIA-------NDRL 94 K+L GN + A T + IA + V AI+ D+ Sbjct: 277 TKVL--GNVGKGISQYIIAQRAAQGLSTSAAAAGLIA----SAVTLAISPLSFLSIADKF 330 Query: 95 HD----EVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVA 141 E Y+++ + LG+ GD LLA + A++A++T T++A Sbjct: 331 KRANKIEEYSQRFKKLGYDGDSLLAAFHKETG-----AIDASLTTISTVLA 376
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.8 bits (67), Expect = 0.007 Identities = 14/55 (25%), Positives = 22/55 (40%), Gaps = 16/55 (29%) Query: 15 VLITGATGLVGGHLLRMLINTPQVSAIAAPTRRPLTDIVGV--YNP-HDPQLTDA 66 L+TGA G +G H+ + L+ +VG+ N +D L A Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGH-------------QVVGIDNLNDYYDVSLKQA 44
>adhesinb#Adhesin B signature. Length = 310 Score = 29.0 bits (65), Expect = 0.001 Identities = 14/68 (20%), Positives = 26/68 (38%), Gaps = 10/68 (14%) Query: 1 MKR---LIPVALLTTLLAGCAHDSPCVPVYDDQGRLVHTNTCMKGTTQDNWETAGAIAGG 57 MK+ L+ + L LA C+ + +V TN+ + T++ IAG Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKN-------IAGD 53 Query: 58 AAAVAGLT 65 + + Sbjct: 54 KINLHSIV 61
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 148 bits (376), Expect = 2e-46 Identities = 59/143 (41%), Positives = 86/143 (60%), Gaps = 2/143 (1%) Query: 46 ALPFLIFYASFSLLLGIYDARTGLLPDRFTCPLLWGGLLYHQICLPERLPDALWGAIAGY 105 L L+ + L D LLPD+ T PLLWGGLL++ + L DA+ GA+AGY Sbjct: 134 TLAALLLT-WVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGY 192 Query: 106 GGFALIYWGYRLRYQKEGLGYGDVKYLAALGAWHCWETLPLLVFLAAMLACGGFGVALLV 165 +YW ++L KEG+GYGD K LAALGAW W+ LP+++ L++++ G+ L++ Sbjct: 193 LVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLV-GAFMGIGLIL 251 Query: 166 RGKSALINPLPFGPWLAVAGFIT 188 P+PFGP+LA+AG+I Sbjct: 252 LRNHHQSKPIPFGPYLAIAGWIA 274
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 36.4 bits (84), Expect = 1e-05 Identities = 18/103 (17%), Positives = 43/103 (41%), Gaps = 10/103 (9%) Query: 44 EYHESIDEMKHADKYIERILFLEGIPN--LQDLGKL------GIGEDVEEMLRSDLRLEL 95 E ++ E D ER+L + G P +++ + G EM+++ + Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109 Query: 96 EGAKDLREAIAYADSVHDYVSRDMMIEILADEEGHIDRLETEL 138 + + + + I A+ D + D+ + ++ + E + L + L Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 79.5 bits (196), Expect = 3e-18 Identities = 57/198 (28%), Positives = 87/198 (43%), Gaps = 13/198 (6%) Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGAARAFDQIDNAPEEKARGITINTS 66 +N+G + HVD GKTTLT ++ T L G R DN E+ RGITI T Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59 Query: 67 HVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126 + +D PGH D++ + + +DGAIL+++A DG QTR R++ Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119 Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQYDFPGDDTPIVRGSALKALEGDAEWE 186 G+P I F+NK D + L V +++E LS + + +W+ Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176 Query: 187 AKIIELAGFLDSYIPEPE 204 I L+ Y+ Sbjct: 177 TVIEGNDDLLEKYMSGKS 194
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 616 bits (1591), Expect = 0.0 Identities = 178/698 (25%), Positives = 305/698 (43%), Gaps = 81/698 (11%) Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68 + NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 69 TAFWSGMAKQYEPHRINIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128 + W ++NIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++ Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114 Query: 129 QANKYKVPRIAFVNKMDRMGANFLKVVGQIKTRLGANPVPLQLAIGAEEGFTGVVDLVKM 188 K +P I F+NK+D+ G + V IK +L A V Q V M Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164 Query: 189 KAINWNDADQGVTFEYEDIPADMQDLANEWHQNLIESAAEASEELMEKYLGGEELTEEEI 248 N+ +++Q ++ E +++L+EKY+ G+ L E+ Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200 Query: 249 KQALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308 +Q R N + V GSA N G+ +++ + + S Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243 Query: 309 RHASDDEPFSALAFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKTARERFGRIVQMHA 368 FKI L + R+YSGV++ D+V S K + + + Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSIN 299 Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCDPENPIILERMEFPEPVISIAVEPKT 424 + +I + +G+I L V GDT P+ ER+E P P++ VEP Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354 Query: 425 KADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVEANVG 484 +E + AL ++ DP R + D +++ I++ +G++ +++ ++ +++VE + Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414 Query: 485 KPQVAYREAIRAKVTDIEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGV 544 +P V Y E K E + + + + + PL GS G ++ + + G Sbjct: 415 EPTVIYMERPLKKA---EYTIHIEVPPNPFWASIGLSVSPLPLGS---GMQYESSVSLGY 468 Query: 545 IPGEYIPAVDKGIQEQLKSGPLAGYPVVDLGVRLHFGSYHDVDSSELAFKLAASIAFKEG 604 + + AV +GI+ + G L G+ V D + +G Y+ S+ F++ A I ++ Sbjct: 469 LNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQV 527 Query: 605 FKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRGMLKGQESEVTGVKIHAEVPLSEMF 664 KKA LLEP + ++ P+E D + + + + V + E+P + Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQ 587 Query: 665 GYATQLRSLTKGRASYTMEFLKYDDAPNNVAQAVIEAR 702 Y + L T GR+ E Y + V + R Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQPR 622
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 28.7 bits (64), Expect = 0.024 Identities = 14/62 (22%), Positives = 29/62 (46%), Gaps = 1/62 (1%) Query: 164 ASSVEDLVTQTLEFTIEEVNADRNV-SNNAKNRQIVLNLYEKGIFDIKDAINQVADRLNI 222 A +V+D VTQ +E + ++ + S + + + L + D A QV ++L + Sbjct: 54 AQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQL 113 Query: 223 SK 224 + Sbjct: 114 AT 115
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 128 bits (323), Expect = 2e-38 Identities = 80/226 (35%), Positives = 121/226 (53%), Gaps = 9/226 (3%) Query: 28 AAKPAATADSKAAFKNDDQKAAYALGASLGRYMENSLKEQEKLGIKLDKDQLIAGVQDAF 87 A A A + D K +Y++GA LG K + GI ++ D L G+QD Sbjct: 14 AMSTAMAATDATSLTTDKDKLSYSIGADLG-------KNFKNQGIDINPDVLAKGMQDGM 66 Query: 88 A-DKSKLSDQEIEQTLQTFEARVKSAAQAKMEKDAADNEAKGKTFRDAFAKEKGVKTSST 146 + + L++++++ L F+ + + A+ K A +N+AKG F A + G+ + Sbjct: 67 SGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPS 126 Query: 147 GLLYKVEKEGTGEAPKDSDTVVVNYKGTLIDGKEFDNSYTRGEPLSFRLDGVIPGWTEGL 206 GL YK+ GTG P SDTV V Y GTLIDG FD++ G+P +F++ VIPGWTE L Sbjct: 127 GLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEAL 186 Query: 207 KNIKKGGKIKLVIPPALAYGKTGVPG-IPANSTLVFDVELLDIKPA 251 + + G ++ +P LAYG V G I N TL+F + L+ +K A Sbjct: 187 QLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVKKA 232
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 27.7 bits (61), Expect = 0.025 Identities = 35/138 (25%), Positives = 52/138 (37%), Gaps = 22/138 (15%) Query: 11 YAHPESQDSVANRVLLKPAIQHNNVTVHDLYARYPDFFID--TPYEQ-----ALLREHDV 63 Y P + D N+V P + +HD+ + D F +P + L+ V Sbjct: 9 YQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCV 68 Query: 64 IVFQH--PLYTYSCPALLKEWLDRVLSRGFASGPGGNQLVGKYWRSVITTGEPESA---- 117 Q P+ + P DR L F GPG N G Y +IT PE Sbjct: 69 ---QLGIPVVYTAQPGSQNP-DDRALLTDFW-GPGLNS--GPYEEKIITELAPEDDDLVL 121 Query: 118 --YRYDALNRYPMSDVLR 133 +RY A R + +++R Sbjct: 122 TKWRYSAFKRTNLLEMMR 139
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 30.9 bits (69), Expect = 0.019 Identities = 21/85 (24%), Positives = 33/85 (38%), Gaps = 7/85 (8%) Query: 522 VQKQENQADDAPKENNANSAQSRKDQKRREAELRTLT---QPLRKEITRLEKEMEKLNAQ 578 + E + A +E N N ++ RE E T + + I+ L+ M L A Sbjct: 151 TRTAEEIGEQAVREGNINGPEAYMRFLDREMEGLTAAYNVKLFTEAISSLQIRMNTLTAA 210 Query: 579 LA----QAEEKLGDSSLYDPSRKAE 599 A A K + + + RKAE Sbjct: 211 KASIEAAAANKAREQAAAEAKRKAE 235
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 25.1 bits (54), Expect = 0.024 Identities = 17/46 (36%), Positives = 23/46 (50%), Gaps = 3/46 (6%) Query: 3 IPWQGLAPDTLDNLIESFV---LREGTDYGEHERSLEQKVADVKRQ 45 +PW+ PD L FV E T E E SLEQ++A ++ Q Sbjct: 5 LPWKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQ 50
>PF07299#Fibronectin-binding protein (FBP) Length = 219 Score = 36.0 bits (83), Expect = 1e-04 Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 2/46 (4%) Query: 71 PEANDFSLLEHTFIEYGQTGKGQSRKYLHTYDEAVPWNQVPGTFTP 116 P+ + + E ++ KG SRK++ ++ + + GTF Sbjct: 112 PDMEELDMKELSY--LSWIDKGSSRKFIIAKNDKNKFVGLQGTFQS 155
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 30.1 bits (67), Expect = 0.015 Identities = 15/50 (30%), Positives = 25/50 (50%), Gaps = 3/50 (6%) Query: 268 FAADESVGVLEYVNDDGVTVKEEVKPETGDYGRVYDALYQTLTVGTPNYV 317 ++AD+ ++Y N DG + K + G+ Y +YQ GT NY+ Sbjct: 1052 YSADDLSNYVDYANADGNKLSNTCK---LNPGKYYLCVYQFENSGTGNYI 1098
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.5 bits (79), Expect = 8e-05 Identities = 18/92 (19%), Positives = 33/92 (35%), Gaps = 16/92 (17%) Query: 55 VACIDDIVVGHLSIQVTQRPRRSHVADFGICVDARWHNRGIASALIRTMID------MCD 108 + +++ +G + I+ + + D + D R G+ +AL+ I+ C Sbjct: 69 LYYLENNCIGRIKIRSNWN-GYALIEDIAVAKDYRKK--GVGTALLHKAIEWAKENHFCG 125 Query: 109 NWLRVDRIELTVFVDNEPAVAVYKKYGFEIEG 140 L I N A Y K+ F I Sbjct: 126 LMLETQDI-------NISACHFYAKHHFIIGA 150
>STREPTOPAIN#Streptopain (C10) cysteine protease family signature. Length = 398 Score = 31.2 bits (70), Expect = 0.012 Identities = 33/176 (18%), Positives = 57/176 (32%), Gaps = 22/176 (12%) Query: 204 NSKAIFWKDGEPLKKGDKLVQKNLAKSLEMIAENGPDAFYKGAIADQIAGEMQ----KNG 259 +SK I + G P +++K + ++ G +A A M+ N Sbjct: 154 DSKGIHYNQGNPYNLLTPVIEKVKPGEQSFVGQHA----ATGCVATATAQIMKYHNYPNK 209 Query: 260 GLMTKEDLASYKAVERTPISGDY-----RGYQVFSMPPPSSGGIHIVQILNILE------ 308 GL S + R Y ++ P SG VQ + I E Sbjct: 210 GLKDYTYTLSSNNPYFNHPKNLFAAISTRQYNWNNILPTYSGRESNVQKMAISELMADVG 269 Query: 309 ---NFDMKKYGFGSADAMQIMAEAEKYAYADRSEYLGDSDFVKVPWQALTNKDYAK 361 + D + + A E + Y + DF K W+A +K+ ++ Sbjct: 270 ISVDMDYGPSSGSAGSSRVQRALKENFGYNQSVHQINRGDFSKQDWEAQIDKELSQ 325
>PF04619#Dr-family adhesin Length = 160 Score = 28.0 bits (62), Expect = 0.027 Identities = 11/60 (18%), Positives = 21/60 (35%), Gaps = 4/60 (6%) Query: 29 VGAQYGHTMIEFDAKLSKDGEIFLLHDDNLERTSNGWGVAGELNWQD----LLRVDAGGW 84 +G ++ D + G+ FL+ D+N ++ W + D G W Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.040 Identities = 10/29 (34%), Positives = 16/29 (55%) Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTSGDI 61 +V+ G G GKSTL+ + GL+ + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHF 627
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 40.1 bits (93), Expect = 1e-05 Identities = 43/176 (24%), Positives = 70/176 (39%), Gaps = 17/176 (9%) Query: 133 SGHLLSQPFNSSTPVLYYNKDAFKKAGLDPEQLPKTWQELADYTAKLRAAGMKCGYASGW 192 +G L++ P L YNKD PKTW+E+ +L+A G + Sbjct: 126 NGKLIAYPIAVEALSLIYNKDLLPNP-------PKTWEEIPALDKELKAKGKSALMFNLQ 178 Query: 193 QGWIQLENFSAWNGLPFASKNNGFDGTDAVLEF--NKPEQVKHIALLEEMNKKGDFSYVG 250 + + +A G F +N +D D ++ K + L++ + D Y Sbjct: 179 EPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY-- 236 Query: 251 RKDESTEKFYNGDCAMTTASSGSLANIRQYAKFNYGVGMMPYDADIKGAPQNAIIG 306 + F G+ AMT + +NI +K NYGV ++P KG P +G Sbjct: 237 --SIAEAAFNKGETAMTINGPWAWSNIDT-SKVNYGVTVLP---TFKGQPSKPFVG 286
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 71.0 bits (174), Expect = 7e-16 Identities = 61/355 (17%), Positives = 115/355 (32%), Gaps = 75/355 (21%) Query: 25 IATKIAGRIDTILVSEGQFVRQGEVLAKMDTRV----------------LQEQRLEAI-- 66 I + I+V EG+ VR+G+VL K+ L++ R + + Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158 Query: 67 ----------------------------------AQIKEAESAVAAARALLEQRQSEMRA 92 Q ++ L+++++E Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT 218 Query: 93 AQSVVKQREAELDSVSKRHVRSRSLSQRGAVSVQQLDDDRAAAESARAALETAKAQVSAA 152 + + + E R SL + A++ + + A L K+Q+ Sbjct: 219 VLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQI 278 Query: 153 KAAIEAARTSIIQ-------------AQTRVEAAQATERRIVADID--DSELKAPRDGRV 197 ++ I +A+ QT T + S ++AP +V Sbjct: 279 ESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKV 338 Query: 198 -QYRVAEPGEVLSAGGRVLNMVDLSDVY-MTFFLPTEQAGLLKIGGEARLVLDAAPDLRI 255 Q +V G V++ ++ +V D +T + + G + +G A + ++A P R Sbjct: 339 QQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRY 398 Query: 256 PATISFVASVAQFTPKTVETHDERLKLMFRVKARIPPELLRQHLEYVKTGLPGMA 310 V V +E D+RL L+F V I L + + GMA Sbjct: 399 G---YLVGKVKNINLDAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLS-SGMA 447
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 30.1 bits (68), Expect = 0.022 Identities = 23/194 (11%), Positives = 57/194 (29%), Gaps = 40/194 (20%) Query: 12 TGLLLLLALAFVLFYEAINGFHDTANAVATVIY------TRAMRSQLAVVMAAVFNFFGV 65 L++AL+ +L + F + + ++A+ + V+ F Sbjct: 30 VSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFP 89 Query: 66 LLGGLSVAYAIVHML-------------------PTDLLLNMGSAHGLAMVFSMLLAAII 106 LL ++ H++ P + + S L +L ++ Sbjct: 90 LLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVL 149 Query: 107 WNLGTWYFGLPASSSHTLIGAIIGIGLTNAMMTGTSVVDALNIPKVINIFGSLIISPIVG 166 ++ W + ++ + T + T ++ + L++ VG Sbjct: 150 LSILIWIIIKG------NLVTLLQLP-TCGIECITPLLGQI--------LRQLMVICTVG 194 Query: 167 LVFAGGLIFLLRRY 180 V + Y Sbjct: 195 FVVISIADYAFEYY 208
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 35.9 bits (82), Expect = 4e-04 Identities = 24/89 (26%), Positives = 41/89 (46%), Gaps = 12/89 (13%) Query: 21 RQASIEILLLLGIHTTEGKEPRWFMEQLEQARLNLGGWGAVAKKLRINDAQLSQFMLQLR 80 R + +++ LG+H+ G++P+ F E + L +G GA K S L + Sbjct: 280 RASGEPVVIDLGLHSRSGEQPKGFTESFKAPELGVGNLGASEK---------SDVFLVVS 330 Query: 81 HLQQHVPQYDSGQEVSENQLLAALRFVTS 109 L + ++ E+ NQ LRF+TS Sbjct: 331 TLLHCIEGFEKNPEIKPNQ---GLRFITS 356
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 32.9 bits (75), Expect = 6e-04 Identities = 23/138 (16%), Positives = 48/138 (34%), Gaps = 26/138 (18%) Query: 11 GISIQSVGQAEELWQKIESAPDALVMLDSGLDAEFCREVLQRIAQQFPEVK-IIITAMDG 69 G ++ A LW+ I + LV+ D + E ++L RI + P++ ++++A Sbjct: 27 GYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLPVLVMSA--- 83 Query: 70 SQKWLHEVMQFNVQAVVPRDSDAETFVLALNAVARGMMFLPGDWLNSTELESRDIKALSA 129 + T + A A + P D + R AL+ Sbjct: 84 -------------------QNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---ALAE 121 Query: 130 RQREILQMLAAGESNKQI 147 +R ++ + + Sbjct: 122 PKRRPSKLEDDSQDGMPL 139
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.5 bits (79), Expect = 8e-04 Identities = 83/367 (22%), Positives = 126/367 (34%), Gaps = 59/367 (16%) Query: 79 IGSALFGHFGDRVGRKVTLVASLLTMGISTVIIGLLPGYATIGIFAPLLLALARFGQGLG 138 IG+A++G D++G K LL GI G + G+ F+ LL +ARF QG G Sbjct: 64 IGTAVYGKLSDQLGIK-----RLLLFGIIINCFGSVIGFVGHSFFS--LLIMARFIQGAG 116 Query: 139 LGGEWGGAALLATENAPPRKR----ALYGSFPQLGAPIGFFFANGTFLLLSW-------- 186 ++ P R L GS +G +G + W Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM 176 Query: 187 -----LLTDEQFMSWGWRV--PF-IFSAVLVIIG-------------LYVRVSLHETPVF 225 + + + R+ F I +L+ +G ++ VS+ +F Sbjct: 177 ITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIF 236 Query: 226 AKVAAAKKQVKIPLGTLLTKHVRVTVLGTFIMLATYTLFYIMTVYSMTYSTAAAPVGLGL 285 K + G + VL I+ T F M Y M + +G Sbjct: 237 VKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIG- 295 Query: 286 PRNEILWMLMMAVIGFGVMVPIAGLLADAFGRRKSMVIITTLIIL-FALFAFTPLLGSGN 344 + I++ M+VI FG I G+L D G + I T + + F +F S Sbjct: 296 --SVIIFPGTMSVIIFG---YIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWF 350 Query: 345 PALVFVFLLLGLSLMGL---TFGPMGALLPELFPTEVRYTGASFS-YNVSSILGASVAPY 400 ++ VF+L GLS T L E GA S N +S L Sbjct: 351 MTIIIVFVLGGLSFTKTVISTIVSSS-----LKQQEA---GAGMSLLNFTSFLSEGTGIA 402 Query: 401 IAAWLQS 407 I L S Sbjct: 403 IVGGLLS 409
>CABNDNGRPT#NodO calcium binding signature. Length = 479 Score = 28.4 bits (63), Expect = 0.030 Identities = 13/69 (18%), Positives = 27/69 (39%), Gaps = 9/69 (13%) Query: 51 TVKKAVDQLVREGVLVQVQGKGTFVKKENVAYPLGEGLLSFAEALASQKINFTTSVITSR 110 ++ +A Q+ RE V G F K N+ + F ++++S T V + Sbjct: 49 SIDQAAAQITREN--VSWNGTNVFGKSANLTF-------KFLQSVSSIPSGDTGFVKFNA 99 Query: 111 LEPANRFVA 119 + ++ Sbjct: 100 EQIEQAKLS 108
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 36.0 bits (83), Expect = 3e-04 Identities = 27/168 (16%), Positives = 64/168 (38%), Gaps = 16/168 (9%) Query: 49 FNIAQNDMISTYGLSMTELGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108 N++ D+ + + + F +T+ +G + +D K+ L F +I++ Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--- 89 Query: 109 MLGFSASMGAGSTSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164 F + +G S F ++ + F Q G + + + ++ P+ RG G Sbjct: 90 --CFGSVIGFVGHSFFSLLIM---ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144 Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRF 212 +G + A+Y+ + + + P +I +I ++ Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP-MITIITVPFLMKL 188
>PF06580#Sensor histidine kinase Length = 349 Score = 39.1 bits (91), Expect = 3e-05 Identities = 30/142 (21%), Positives = 55/142 (38%), Gaps = 11/142 (7%) Query: 378 LRPRQLDDLTLAQAIRSLLREMELESRGIVSHLDWRIDETALSESQRVTLFRVCQEGLNN 437 LR ++LA + + ++L S L + +V + Q + N Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266 Query: 438 IVKHA-----NASAVTLQGWQQDERLMLVIEDDGSGLPPGSHQ-QGFGLTGMRERVSALG 491 +KH + L+G + + + L +E+ GS + + G GL +RER+ L Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326 Query: 492 G---TLTISCTHG-TRVSVSLP 509 G + +S G V +P Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 61.4 bits (149), Expect = 3e-13 Identities = 23/116 (19%), Positives = 45/116 (38%), Gaps = 5/116 (4%) Query: 23 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGRGVQVCICDISMPDIS 82 T+ + DD +R+ Q L V + + + + D+ MPD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 83 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVHT 135 +LL ++ K + +++S ++ +A GA +L K ELI + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>PF06872#EspG protein Length = 398 Score = 28.5 bits (63), Expect = 0.021 Identities = 14/54 (25%), Positives = 27/54 (50%) Query: 111 LLLEAGMEVNDDFKEPADHLAIYLELLSHLHFSLGESFQQRRMNKLRQKTLSSL 164 L+L+A +++N D+K+P + + +LL L L + + Q L+ L Sbjct: 29 LVLDATIKINSDYKKPWNEMTCAEKLLKILTLGLWNPKYSQDERQQFQGLLTVL 82
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 76.4 bits (188), Expect = 3e-18 Identities = 28/115 (24%), Positives = 56/115 (48%), Gaps = 1/115 (0%) Query: 4 HIVIVEDEPVTQARLQAYFEQEGYSVSVTDSGAGLRDIMEHEHVSLILLDINLPDENGLM 63 I++ +D+ + L + GY V +T + A L + L++ D+ +PDEN Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 LTRALRER-STVGIILVTGRCDQIDRIVGLEMGADDYVTKPLELRELVVRVKNLL 117 L +++ + +++++ + + I E GA DY+ KP +L EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 55.2 bits (133), Expect = 7e-10 Identities = 25/137 (18%), Positives = 54/137 (39%), Gaps = 3/137 (2%) Query: 681 RLLLIEDNMLTQRITAEMLTGKGVKVSVAESANDALRCLAEGESFDVALVDFDLPDYDGL 740 +L+ +D+ + + + L+ G V + +A R +A G+ D+ + D +PD + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDENAF 63 Query: 741 TLAQQLMSLYPAMKRIGFSAH-VIDDNLRQRTAGLFCGIIQKPVPREELYRMIAHYLQGK 799 L ++ P + + SA ++ G + + KP EL +I L Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAY-DYLPKPFDLTELIGIIGRALAEP 122 Query: 800 SHNARAMLNEHQLAGDM 816 + ++ Q + Sbjct: 123 KRRPSKLEDDSQDGMPL 139
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 47.1 bits (112), Expect = 8e-08 Identities = 65/384 (16%), Positives = 118/384 (30%), Gaps = 36/384 (9%) Query: 66 AEMGYVFSAFAWLYTLCQIPGGWFLDRIGSRLTYFIAIFGWSVATLLQGFATGLLSLIGL 125 A G + + +A + C G DR G R +++ G +V + A L L Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102 Query: 126 RAITGIFEAPAFPANNRMVTSWFPEHERASAVGFYTSGQFVGLAFLTPLLIWIQEMLSWH 185 R + GI A + ERA GF ++ G+ P+L + S H Sbjct: 103 RIVAGITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMV-AGPVLGGLMGGFSPH 160 Query: 186 WVFIVTGGIGIIWSLVWFKVYQPPRLTKSLSQAELEYIRDGGGLVDGDAPAKKEARQPLT 245 F + + L + + P ++EA PL Sbjct: 161 APFFAAAALNGLNFLTGCFLLPESHKGE-------------------RRPLRREALNPLA 201 Query: 246 KADWKLVFHRKLVGVYLGQFAVNSTLWFFLTWFPNYLTQEKGITALKAGFMTTV-PFLAA 304 W + + F + + + A G L + Sbjct: 202 SFRWARGM-TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHS 260 Query: 305 FFGVLLSGWLADKLVKKGFSLGVARKTPIICGLLISTC--IMGANYTNDPLWIMALMAIA 362 +++G +A +L + ++ G++ I+ A T + ++ +A Sbjct: 261 LAQAMITGPVAARL---------GERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLA 311 Query: 363 FFGNGFASITWSLISSLAPMRLIGLTGGMFNFIGGLGGISVPLVIGYL-AQSYGFAPALV 421 G G ++ +++S G G + L I PL+ + A S Sbjct: 312 SGGIGMPALQ-AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWA 370 Query: 422 YISVVALLGALSYILLVGDVKRVG 445 +I+ AL L G G Sbjct: 371 WIAGAALYLLCLPALRRGLWSGAG 394
>SECA#SecA protein signature. Length = 901 Score = 28.3 bits (63), Expect = 0.017 Identities = 14/74 (18%), Positives = 27/74 (36%) Query: 11 KAFGKQRRKTREELNQEARDRKRLKKHRGHAPGSRAAGGNSASGGGNQNQQKDPRIGSKT 70 K + + EE+ + + R+ + +SA+ Q + ++G Sbjct: 824 STLSKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRND 883 Query: 71 PVPLGVTEKVTQQH 84 P P G +K Q H Sbjct: 884 PCPCGSGKKYKQCH 897
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 597 bits (1540), Expect = 0.0 Identities = 204/478 (42%), Positives = 299/478 (62%), Gaps = 11/478 (2%) Query: 1 MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGNEVLAALASKTPDVLLSDIRMPGM 60 M + V DDD++IR VL +AL+ AG N + +A+ D++++D+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAIS 120 + LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 HYQEQQQPRNIEVNGPTTDMIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180 + + + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A Sbjct: 121 EPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179 Query: 181 LHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE 240 LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239 Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLERRVQEGKFREDLFHR 300 IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L++ + +G FREDL++R Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299 Query: 301 LNVIRIHLPPLRERREDIPRLARHFLQVAARELGVEAKLLHPETETALTRLAWPGNVRQL 360 LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K E + WPGNVR+L Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358 Query: 361 ENTCRWLTVMAAGQEVLIQDLPGELFEASAPDSPSHLPPDSWATLLAQWADRALRS---- 416 EN R LT + + + + EL S + ++Q + +R Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418 Query: 417 -----GHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469 L E+E L+ AL T+G++ +AA LLG RNTL +K++ELG+ Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>PF06580#Sensor histidine kinase Length = 349 Score = 28.7 bits (64), Expect = 0.034 Identities = 33/189 (17%), Positives = 71/189 (37%), Gaps = 39/189 (20%) Query: 171 IIEQADRLRNLVDRL-------LGPQHPGMHIT--ESIHKVAERVVALVSMELPDNVRLI 221 I+E + R ++ L L ++ + + V + + L S++ D ++ Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYS-NARQVSLADELTVV-DSYLQLASIQFEDRLQFE 243 Query: 222 RDYDPSLPELPHDPEQIEQVLL-NIVRNALQALGPEGGEITLRTRTAFQLTLHGERYRLA 280 +P++ ++ P + Q L+ N +++ + L P+GG+I L+ Sbjct: 244 NQINPAIMDVQV-PPMLVQTLVENGIKHGIAQL-PQGGKILLKGT------KDNGTVT-- 293 Query: 281 ARIDVEDNGPGIPPHLQDTLFYPMVSGREGGTGLGLSIARNLIDQHAGK---IEFTSWPG 337 ++VE+ G + ++ TG GL R + G I+ + G Sbjct: 294 --LEVENTGSLALKNTKE------------STGTGLQNVRERLQMLYGTEAQIKLSEKQG 339 Query: 338 HTEFSVYLP 346 V +P Sbjct: 340 KVNAMVLIP 348
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 178 bits (454), Expect = 1e-50 Identities = 100/448 (22%), Positives = 170/448 (37%), Gaps = 87/448 (19%) Query: 4 NLRNIAIIAHVDHGKTTLVDKLLQQSGTFDARAETQE--RVMDSNDLEKERGITILAKNT 61 + NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAHGL 121 + +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + G+ Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121 Query: 122 KPIVVINKVDRPGARPDWVVDQVFD-------------LFVNLDATDEQLD--------- 159 I INK+D+ G V + + L+ N+ T+ Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181 Query: 160 --------------------------------FPIIYASALNGIAGLDHEDMAEDMTPLY 187 FP+ + SA N I G+D+ L Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNI-GIDN---------LI 231 Query: 188 QAIIDHVPAPDVDLDGPLQMQISQLDYNNYVGVIGIGRIKRGKVKPNQQVTIIDSEGKTR 247 + I + + L ++ +++Y+ + R+ G + V I + E Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-- 289 Query: 248 NAKVGKVLTHLGLERIDSDIAEAGDIIAITGLG-ELN--ISDTICDPQNVEALPALSVDE 304 K+ ++ T + E D A +G+I+ + +LN + DT PQ + Sbjct: 290 --KITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQR----ERIENPL 343 Query: 305 PTVSMFFCVNTSPFCGKEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGEL 364 P + + + D L LR +S G++ Sbjct: 344 PLLQTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKV 394 Query: 365 HLSVLIENMRRE-GFELAVSRPKVIFRE 391 + V ++ + E+ + P VI+ E Sbjct: 395 QMEVTCALLQEKYHVEIEIKEPTVIYME 422 Score = 32.5 bits (74), Expect = 0.005 Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 1/75 (1%) Query: 398 EPYENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 457 EPY + + +++ + ++ + V L IP+R + +RS+ Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595 Query: 458 MTSGTGLLYSTFSHY 472 T+G + + Y Sbjct: 596 FTNGRSVCLTELKGY 610
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 28.2 bits (63), Expect = 0.023 Identities = 11/27 (40%), Positives = 13/27 (48%), Gaps = 1/27 (3%) Query: 2 SYTLPSLPYAYDALEPHFDKQTMAIHH 28 S T P+ PY + L H D M HH Sbjct: 298 SSTNPTRPYTVNTLAEHLD-MLMVCHH 323
>PF06580#Sensor histidine kinase Length = 349 Score = 29.1 bits (65), Expect = 0.037 Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 28/108 (25%) Query: 354 LENIVRNALRY------SHTKIEVGFSVDKDGITITVDDDGPGVSPEDREQIFRPFYRTD 407 ++ +V N +++ KI + + D +T+ V++ G +E Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE---------- 309 Query: 408 EARDRESGGTGLGLAIVESAMQQHRGWVKAD---DSPLGGLRLTLWLP 452 TG GL V +Q G +A G + + +P Sbjct: 310 --------STGTGLQNVRERLQMLYG-TEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 94.1 bits (234), Expect = 2e-24 Identities = 36/128 (28%), Positives = 64/128 (50%), Gaps = 2/128 (1%) Query: 3 KILLVDDDRELTSLLKELLEMEGFNVLVAHDGEQALELL-DDSIDLLLLDVMMPKKNGID 61 IL+ DDD + ++L + L G++V + + + DL++ DV+MP +N D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 TLKALRQTH-QTPVIMLTARGSELDRVLGLELGADDYLPKPFNDRELVARIRAILRRSHW 120 L +++ PV++++A+ + + + E GA DYLPKPF+ EL+ I L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 SEQQQSSD 128 + D Sbjct: 125 RPSKLEDD 132
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 28.4 bits (63), Expect = 0.030 Identities = 31/140 (22%), Positives = 58/140 (41%), Gaps = 27/140 (19%) Query: 10 SRASIAATAMASALLLIKIFAWWYTGSVSILAALVD-SLVDIAASLTNLLVVRYSLQPAD 68 ++A++A + + W S+L AL +L +A + ++V +L P+ Sbjct: 123 TKAALAGAGIGVVAAALGYTQWL-----SLLYALPVIALTGLAFASLGMVVT--ALAPSY 175 Query: 69 DEHTFGHGKAESLAALAQSMFISGSAL--------------FLFLTSIQNLIKPTPMNDP 114 D F ++L + +F+SG+ FL L+ +LI+P + P Sbjct: 176 DYFIF----YQTLV-ITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHP 230 Query: 115 GVGIGVTVIALICTIILVTF 134 V + V AL I++ F Sbjct: 231 VVDVCQHVGALCIYIVIPFF 250
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.1 bits (65), Expect = 0.041 Identities = 12/58 (20%), Positives = 24/58 (41%), Gaps = 1/58 (1%) Query: 16 MAINVVIIAMQLLLAYFYTDIYGLSAADVGVLFVVVRMIDAII-DPAMGVLTDKLNTR 72 I + ++ Y D++ LS A++G + + + II G+L D+ Sbjct: 266 GIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPL 323
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 120 bits (303), Expect = 1e-39 Identities = 49/89 (55%), Positives = 66/89 (74%) Query: 2 NKTQLIDVIADKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTFKVNHRAERTGR 61 NK LI +A+ EL+K + AA+++ +A++ L +G+ VQL+GFG F+V RA R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEIKIAAANVPAFVSGKALKDAVK 90 NPQTG+EIKI A+ VPAF +GKALKDAVK Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>PF06580#Sensor histidine kinase Length = 349 Score = 38.7 bits (90), Expect = 5e-05 Identities = 48/268 (17%), Positives = 102/268 (38%), Gaps = 53/268 (19%) Query: 294 IVLSALAAVLLATLLAFFWHQRYQRSHRELLDAMKRKEKLVAMGHLAAGVA----HEIRN 349 I+ + + + +LL F WH + +++++ + + L A A H + N Sbjct: 120 IIFNVVVVTFMWSLLYFGWH--FFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFN 177 Query: 350 PLSSIKGLAKYFAERTPAGGESHELAQVM---TKEADRLNRVVSELLELVKPAHLTLQAV 406 L++I+ L + A L+++M + ++ +++ L +V ++L L ++ Sbjct: 178 ALNNIRALILEDPTK--AREMLTSLSELMRYSLRYSNARQVSLADELTVVD-SYLQLASI 234 Query: 407 NLNDIITHSLNLVSQDAQSREIQLRFTANETLKRIQADPDRLTQVLLNLYLNAI-HAIGR 465 D + + + ++Q+ P L Q L+ N I H I + Sbjct: 235 QFEDRLQFENQI---NPAIMDVQV--------------PPMLVQTLVE---NGIKHGIAQ 274 Query: 466 Q---GTITVEAKESGTDRVIITVTDSGKGIAPDQLEAIFTPYFTTKADGTGLGLAVVQNI 522 G I ++ + V + V ++G + E TG GL V+ Sbjct: 275 LPQGGKILLKGTKDN-GTVTLEVENTGSLALKNTKE------------STGTGLQNVRER 321 Query: 523 IEQHGG---AIKVKSIEGKGAVFTIWLP 547 ++ G IK+ +GK + +P Sbjct: 322 LQMLYGTEAQIKLSEKQGKVNA-MVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 528 bits (1361), Expect = 0.0 Identities = 183/475 (38%), Positives = 256/475 (53%), Gaps = 37/475 (7%) Query: 1 MIRGKIDILVVDDDVSHCTILQALLRGWGYNVALAYSGHDALAQVREKVFDLVLCDVRMA 60 M I LV DDD + T+L L GY+V + + + DLV+ DV M Sbjct: 1 MTGATI--LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 EMDGIATLKEIKALNPAIPILIMTAFSSVETAVEALKAGALDYLIKPLDFDRLQETLEKA 120 + + L IK P +P+L+M+A ++ TA++A + GA DYL KP D L + +A Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 121 LAHTRETGAELPSASAAQFGMIGSSPAMQHLLNEIAMVAPSDATVLIHGDSGTGKELVAR 180 LA + ++L S ++G S AMQ + +A + +D T++I G+SGTGKELVAR Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178 Query: 181 ALHACSARSDKPLVTLNCAALNESLLESELFGHEKGAFTGADKRREGRFVEADGGTLFLD 240 ALH R + P V +N AA+ L+ESELFGHEKGAFTGA R GRF +A+GGTLFLD Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238 Query: 241 EIGDISPLMQVRLLRAIQEREVQRVGSNQTISVDVRLIAATHRDLAEEVSAGRFRQDLYY 300 EIGD+ Q RLLR +Q+ E VG I DVR++AAT++DL + ++ G FR+DLYY Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298 Query: 301 RLNVVAIEMPSLRQRREDIPLLADHFLRRFAERNRKAVKGFTPQAMDLLIHYDWPGNIRE 360 RLNVV + +P LR R EDIP L HF+++ + VK F +A++L+ + WPGN+RE Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWPGNVRE 357 Query: 361 LENAIERAVVLLTGEYISERELPLAIAATPIKTECSGEIQP------------------- 401 LEN + R L + I+ + + + + Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417 Query: 402 ---------------LVEVEKEVILAALEKTGGNKTEAARQLGITRKTLLAKISR 441 L E+E +ILAAL T GN+ +AA LG+ R TL KI Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 38.8 bits (90), Expect = 2e-06 Identities = 16/54 (29%), Positives = 22/54 (40%), Gaps = 5/54 (9%) Query: 78 VDPDVRGQGIGKRLVEHALTLAP-----GLTTNVNEQNTQAVGFYKKMGFKVTG 126 V D R +G+G L+ A+ A GL + N A FY K F + Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.8 bits (77), Expect = 1e-04 Identities = 20/86 (23%), Positives = 33/86 (38%), Gaps = 9/86 (10%) Query: 61 LALRNGEVVGMISLHMQFHLHHANWIG--EIQELVVLPPMRGQKIGSQLLAWAEEEARQA 118 L +G I + +NW G I+++ V R + +G+ LL A E A++ Sbjct: 69 LYYLENNCIGRIKIR-------SNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKEN 121 Query: 119 GAELTELSTNIKRRDAHRFYLREGYK 144 L T A FY + + Sbjct: 122 HFCGLMLETQDINISACHFYAKHHFI 147
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.9 bits (101), Expect = 2e-06 Identities = 53/284 (18%), Positives = 104/284 (36%), Gaps = 40/284 (14%) Query: 85 FFGMLGDKYGRQKILAITIVIMSISTFCIGLIPSYATIGIWAPILLLLCKMAQGFSVGGE 144 G L D++GR+ +L +++ ++ + P +W +L + ++ G + G Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIVAGIT-GAT 112 Query: 145 YTGASIFVAEYSPDRKR----GFMGSWLDFGSIAGFVLGAGVVVLISTIVGEENFLEWGW 200 A ++A+ + +R GFM + FG +AG VLG G++ S Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSP------------ 159 Query: 201 RIPFFIALPLGIIGLYLRHALEETPAFQQHVDKLEQGDREGLQDGPKVSFKEIATKHWRS 260 PFF A L + L K E+ P SF+ + Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESH------KGERRPLRREALNPLASFRWARGMTVVA 213 Query: 261 LLSCIGLVIATNVTYYMLLTYMPSYLSHNLHYS-EDHGVLIIIAIMIGMLFVQPVMGLLS 319 L + ++ + + + H+ G+ + ++ L + G ++ Sbjct: 214 ALMAVFFIM--QLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVA 271 Query: 320 DRFGRRPFVIMGSIA-LFALAIPAFILINSNVIGLIFAGLLMLA 362 R G R +++G IA + AF + F +++LA Sbjct: 272 ARLGERRALMLGMIADGTGYILLAFA----TRGWMAFPIMVLLA 311 Score = 37.9 bits (88), Expect = 7e-05 Identities = 37/164 (22%), Positives = 73/164 (44%), Gaps = 16/164 (9%) Query: 286 LSHNLHYSEDHGVLI-IIAIMIGMLFVQPVMGLLSDRFGRRPFVIMGSIALFALAIPAFI 344 L H+ + +G+L+ + A+M PV+G LSDRFGRRP ++ ++L A+ I Sbjct: 35 LVHSNDVTAHYGILLALYALM--QFACAPVLGALSDRFGRRPVLL---VSLAGAAVDYAI 89 Query: 345 LINSNVIGLIFAGLLMLAVILNCFTGVMASTLPAMFPTHIR---YSALAAAFNISVLIAG 401 + + + +++ G ++A I V + + + R + ++A F ++AG Sbjct: 90 MATAPFLWVLYIG-RIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFG-MVAG 147 Query: 402 LTPTLAAWLVESSQDLMMPAYYLMVIAVIGLVTGI-SMKETANR 444 P L + S P + + + +TG + E+ Sbjct: 148 --PVLGGLMGGFS--PHAPFFAAAALNGLNFLTGCFLLPESHKG 187
>PF06580#Sensor histidine kinase Length = 349 Score = 37.5 bits (87), Expect = 6e-05 Identities = 39/182 (21%), Positives = 78/182 (42%), Gaps = 34/182 (18%) Query: 184 ARLDQMMDSVSQLLQLARVGQSFSSGNYQEVKLLEDV-ILPSYDELNTM-LETR-QQTLL 240 + +M+ S+S+L++ S N ++V L +++ ++ SY +L ++ E R Q Sbjct: 191 TKAREMLTSLSELMR-----YSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245 Query: 241 LPESAADVVVRGDATLLRMLLRNLVENAHRY----SPEGTHITIHISADPDAI-MAVEDE 295 + + DV V ML++ LVEN ++ P+G I + + D + + VE+ Sbjct: 246 INPAIMDVQV------PPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299 Query: 296 GPGIDESKCGKLSEAFVRMDSRYGGIGLGLSIV-SRITQLHQGQFFLQNRTERTGTRAWV 354 G + + G GL V R+ L+ + ++ ++ A V Sbjct: 300 GSLA--------------LKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 355 LL 356 L+ Sbjct: 346 LI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 89.9 bits (223), Expect = 5e-23 Identities = 45/144 (31%), Positives = 68/144 (47%), Gaps = 1/144 (0%) Query: 2 KILIVEDDTLLLQGLILAAQTEGYACDGFSTVRAAEHSLESGHYSLMVLDLGLPDEDGLH 61 IL+ +DD + L A GY S + +G L+V D+ +PDE+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 FLTRIRQKKYTLPVLILTARDTLNDRITGLDVGADDYLVKPFALEELHARI-RALLRRHN 120 L RI++ + LPVL+++A++T I + GA DYL KPF L EL I RAL Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 NQGESELTVGNLTLNIGRHQAWRD 144 + E + +GR A ++ Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQE 148
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 32.1 bits (73), Expect = 0.006 Identities = 39/163 (23%), Positives = 60/163 (36%), Gaps = 13/163 (7%) Query: 80 CVFILVGAAAQYFILTYGIIIDHSMIANMMDTTPAETFALM-TPQMVLTLG---LSGVLA 135 CV +V A +L+ + +M P T LM V T G L +LA Sbjct: 177 CVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLA 236 Query: 136 AVIAFWVKIRPATPRLRSGLYRLASVLISILLVILVAAFFYKDYASLFRNNKQLIKALSP 195 +AF V +R R+ L LI + L A + + + L + L++A+ Sbjct: 237 GFMAFRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRI 296 Query: 196 SNSIVASWSWYSHQRLANLPLVRIGEDAHRN--------PLML 230 S V S + H+ VR G H+ P+M Sbjct: 297 S-GDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMR 338
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 48.4 bits (115), Expect = 8e-10 Identities = 17/59 (28%), Positives = 29/59 (49%) Query: 62 DEATLFNIAVDPDFQRRGLGRMLLEHLIDELEKRGVVTLWLEVRASNAAAIALYESLGF 120 A + +IAV D++++G+G LL I+ ++ L LE + N +A Y F Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 213 bits (545), Expect = 6e-64 Identities = 109/452 (24%), Positives = 209/452 (46%), Gaps = 44/452 (9%) Query: 12 KRRTFAIISHPDAGKTTITEKVLLFGQAIQTAGTVKGRGSSQHAKSDWMEMEKQRGISIT 71 K +++H DAGKTT+TE +L AI G+V ++D +E+QRGI+I Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGT----TRTDNTLLERQRGITIQ 57 Query: 72 TSVMQFPYHDCLVNLLDTPGHEDFSEDTYRTLTAVDCCLMVIDAAKGVEDRTRKLMEVTR 131 T + F + + VN++DTPGH DF + YR+L+ +D +++I A GV+ +TR L R Sbjct: 58 TGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALR 117 Query: 132 LRDTPILTFMNKLDRDIRDPMELLDEVENELKIGCAPITWPIGCGKLFKGVYHLYKDETY 191 P + F+NK+D++ D + +++ +L K + Sbjct: 118 KMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVI------------------KQKVE 159 Query: 192 LYQTGKGHTIQEVRIVKGLNNPDLDAAVGEDLAQQLRDELELVQGASNEFDEELFLAGEI 251 LY E + + D + + ++ + + LEL Q S F + Sbjct: 160 LYPNMCVTNFTESEQWDTVIEGN-DDLLEKYMSGKSLEALELEQEES-----IRFHNCSL 213 Query: 252 TPVFFGTALGNFGVDHMLDGLVAWAPAPMPRQTDTRTVEASEEKFTGFVFKIQANMDPKH 311 PV+ G+A N G+D++++ + + + + G VFKI+ K Sbjct: 214 FPVYHGSAKNNIGIDNLIEVITNKFYSS---------THRGQSELCGKVFKIE--YSEK- 261 Query: 312 RDRVAFMRVVSGKYEKGMKLRQVRTGKDVVISDALTFMAGDRSHVEEAYPGDILGLHNHG 371 R R+A++R+ SG +R K + I++ T + G+ +++AY G+I+ L N Sbjct: 262 RQRLAYIRLYSGVLHLRDSVRISEKEK-IKITEMYTSINGELCKIDKAYSGEIVILQNEF 320 Query: 372 TIQIGDTFTQGEMMKFTGIPNFA-PELFRRIRLKDPLKQKQLLKGLVQLSEEG-AVQVFR 429 +++ +++ P L + P +++ LL L+++S+ ++ + Sbjct: 321 -LKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379 Query: 430 PISNNDLIVGAVGVLQFDVVVARLKSEYNVEA 461 + +++I+ +G +Q +V A L+ +Y+VE Sbjct: 380 DSATHEIILSFLGKVQMEVTCALLQEKYHVEI 411
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 27.3 bits (60), Expect = 0.002 Identities = 16/49 (32%), Positives = 20/49 (40%), Gaps = 8/49 (16%) Query: 4 WGIIFLVIALIA--------AALGFGGLAGTAAGAAKIVFVVGIVLFLV 44 W +FL + A AL F LAGT G I V GI+ + Sbjct: 460 WKPLFLTLEKKAADAGVSYVVALLFSLLAGTTLGIWGIAIVTGILCSYI 508
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 28.6 bits (64), Expect = 0.028 Identities = 32/141 (22%), Positives = 51/141 (36%), Gaps = 37/141 (26%) Query: 6 IDTHCHFDFPPFTGDERASIQRACEAGVEKIIVPATEAA-------------HFPRVLAL 52 +D+H HF P I+ A +G+ ++ T A H R++ Sbjct: 133 MDSHIHFICP-------QQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIEA 185 Query: 53 AARFPSLYAALGLHPIVIERHADDDPDKLQQALAQQQNVVAVGEIGLDLYRDDPQFARQE 112 A FP A G + P AL + V G L L+ D + Sbjct: 186 ADAFPMNLAFAG-------KGNASLPG----ALVEM---VLGGATSLKLHED---WGTTP 228 Query: 113 RFLDAQLQLAKRYDLPVILHS 133 +D L +A YD+ V++H+ Sbjct: 229 AAIDCCLSVADEYDVQVMIHT 249