>PF08280#M protein trans-acting positive regulator Length = 530 Score = 115 bits (290), Expect = 2e-34 Identities = 68/70 (97%), Positives = 70/70 (100%) Query: 6 ILQDNVYQIPDLKPDLVITHSQLIPFVHHELTKGIAVAEISFDESILSIQELMYRVKEEK 65 +LQDNVYQIPDLKPDLVITHSQLIPFVHHELTKGIAVAEISFDESILSIQELMY+VKEEK Sbjct: 461 LLQDNVYQIPDLKPDLVITHSQLIPFVHHELTKGIAVAEISFDESILSIQELMYQVKEEK 520 Query: 66 FQADLTKQLT 75 FQADLTKQLT Sbjct: 521 FQADLTKQLT 530
>PF08280#M protein trans-acting positive regulator Length = 530 Score = 633 bits (1634), Expect = 0.0 Identities = 418/426 (98%), Positives = 421/426 (98%) Query: 1 MIEKYLESSIESKCQLVVLFFKTSYLPITEVAEKTGLTFLQLNHYCEELNAFFPDSLSMT 60 +IEKYLESSIESKCQLVVLFFKTS LPITEVAEKTGLTFLQLNHYCEELNAFFPDSLSMT Sbjct: 34 LIEKYLESSIESKCQLVVLFFKTSSLPITEVAEKTGLTFLQLNHYCEELNAFFPDSLSMT 93 Query: 61 IQKRMISCQFTHPFKETYLYQLYASSNVLQLLAFLIKNGSHSRPLTDFARSHFLSNSSAY 120 IQKRMISCQFTHP KETYLYQLYASSNVLQLLAFLIKNGSHSRPLTDFARSHFLSNSSAY Sbjct: 94 IQKRMISCQFTHPSKETYLYQLYASSNVLQLLAFLIKNGSHSRPLTDFARSHFLSNSSAY 153 Query: 121 RMREALIPLLRNFELKLSKNKIVGEEYRIRYLIALLYSKFGIKVYDLTQQDKNTIHSFLS 180 RMREALIPLLRNFELKLSKNKIVGEEYRIRYLIALLYSKFGIKVYDLTQQDKN IHSFLS Sbjct: 154 RMREALIPLLRNFELKLSKNKIVGEEYRIRYLIALLYSKFGIKVYDLTQQDKNIIHSFLS 213 Query: 181 HSSTHLKTSPWLSESFSFYDILLALSWKRHQFSVTIPQTRIFQQLKKLFIYDSLKKSSRD 240 HSSTHLKTSPWLSESFSFYDILLALSWKRHQFSVTIPQTRIFQQLKKLF+YDSLKKSSRD Sbjct: 214 HSSTHLKTSPWLSESFSFYDILLALSWKRHQFSVTIPQTRIFQQLKKLFVYDSLKKSSRD 273 Query: 241 IIETYCQLNFSAGDLDYLYLIYITANNSFASLQWTPEHIRQCCQLFEENDTFRLLLKPII 300 IIETYCQLNFSAGDLDYLYLIYITANNSFASLQWTPEHIRQCCQLFEENDTFRLLL PII Sbjct: 274 IIETYCQLNFSAGDLDYLYLIYITANNSFASLQWTPEHIRQCCQLFEENDTFRLLLNPII 333 Query: 301 TLLPNLKEQKPSLVKALMFFSKSFLFNLQHFIPETNLFVSPYYKGNQKLYTSLKLIVEEW 360 TLLPNLKEQK SLVKALMFFSKSFLFNLQHFIPETNLFVSPYYKGNQKLYTSLKLIVEEW Sbjct: 334 TLLPNLKEQKASLVKALMFFSKSFLFNLQHFIPETNLFVSPYYKGNQKLYTSLKLIVEEW 393 Query: 361 LAKLPGKRYLNHKHFHLFCHYVEQILRNIQPPLVVVFVASNFINAHLLTDSFPRYFSDKS 420 +AKLPGKRYLNHKHFHLFCHYVEQILRNIQPPLVVVFVASNFINAHLLTDSFPRYFSDKS Sbjct: 394 MAKLPGKRYLNHKHFHLFCHYVEQILRNIQPPLVVVFVASNFINAHLLTDSFPRYFSDKS 453 Query: 421 IDFHSY 426 IDFHSY Sbjct: 454 IDFHSY 459
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 30.3 bits (68), Expect = 0.018 Identities = 16/77 (20%), Positives = 27/77 (35%), Gaps = 5/77 (6%) Query: 274 DPPKPGETSEHNPKTPELDGTPIPEDPKHPDDNLEPTLPPVMLDGEEVPEVPSESLEPAL 333 +PP+ + PE + PIPE PK +E P + P + +E Sbjct: 61 EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPK-----PKPKPKPVKKVEQPK 115 Query: 334 PPLMPELDGQEVPEKPS 350 + P P + + Sbjct: 116 RDVKPVESRPASPFENT 132
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 92.6 bits (230), Expect = 5e-22 Identities = 42/160 (26%), Positives = 64/160 (40%), Gaps = 24/160 (15%) Query: 265 DIDWTQTDDDTKYESHGMHVTGIVAGNSKEAAATGERFLGIAPEAQVMFMRVFANDVMGS 324 + D D HG HV G +A +G+APEA ++ ++V G Sbjct: 74 EGDPEIFKDY---NGHGTHVAGTIAAT-----ENENGVVGVAPEADLLIIKVLNKQGSGQ 125 Query: 325 AESLFIKAIEDAVALGADVINLSLGTANGAQLSGSKPLMEAIEKAKKAGVSVVVAAGNER 384 + + I+ I A+ D+I++SLG L EA++KA + + V+ AAGNE Sbjct: 126 YDWI-IQGIYYAIEQKVDIISMSLGGP-----EDVPELHEAVKKAVASQILVMCAAGNEG 179 Query: 385 VYGSDHDDPLATNPDYGLVGSPSTGRTPTSVAAINSKWVI 424 D+ +G P SV AIN Sbjct: 180 DGDDRTDE----------LGYPGCYNEVISVGAINFDRHA 209 Score = 78.7 bits (194), Expect = 2e-17 Identities = 36/147 (24%), Positives = 58/147 (39%), Gaps = 18/147 (12%) Query: 562 FDSVVSKAPSQKGNEMNHFSNWGLTSDGYLKPDITAPGGDIYSTYNDNHYGSQTGTSMAS 621 ++ V+S + FSN + D+ APG DI ST Y + +GTSMA+ Sbjct: 194 YNEVISVGAINFDRHASEFSNSNN------EVDLVAPGEDILSTVPGGKYATFSGTSMAT 247 Query: 622 PQIAGASLLVKQ-YLEKTQPNLPKEKIADIVKNLLMSNAQIHVNPETKTTTSPRQQGAGL 680 P +AGA L+KQ + +L + L+ SP+ +G GL Sbjct: 248 PHVAGALALIKQLANASFERDL----TEPELYAQLIKRT-------IPLGNSPKMEGNGL 296 Query: 681 LNIDGAVTSGLYVTGKDNYGSISLGNI 707 L + + G +S ++ Sbjct: 297 LYLTAVEELSRIFDTQRVAGILSTASL 323 Score = 40.6 bits (95), Expect = 4e-05 Identities = 11/34 (32%), Positives = 18/34 (52%), Gaps = 1/34 (2%) Query: 128 HDWVKTKGAWDKGYKGQGKVVAVIDTGIDPAHQS 161 + ++ W++ G+G VAV+DTG D H Sbjct: 26 VEMIQAPAVWNQTR-GRGVKVAVLDTGCDADHPD 58
>BINARYTOXINA#Clostridial binary toxin A signature. Length = 454 Score = 36.6 bits (84), Expect = 7e-05 Identities = 42/163 (25%), Positives = 69/163 (42%), Gaps = 13/163 (7%) Query: 93 INTSLDKTKGELSQLTPELRDQVAQLDAATHRLVIPWNIVVYRYV-YETFLRDIGVSHAD 151 IN L + G L+ PEL +V ++ A IP N++VYR + F + D Sbjct: 295 INNYL-ISNGPLNNPNPELDSKVNNIENALKLTPIPSNLIVYRRSGPQEFGLTLTSPEYD 353 Query: 152 LTSYYRNDQFDPHILCKIKL-GTRYTKHSFMSTT--ALKNGAMTHRPVEVRICVKKGAKA 208 D F K K G T +F+ST+ ++ A R + +RI + K + Sbjct: 354 FNKIENIDAF------KEKWEGKVITYPNFISTSIGSVNMSAFAKRKIILRINIPKDSPG 407 Query: 209 AFVEPYSAVPSEVELLFPRGCQLEV--VGAYVSQDHKKLHIEA 249 A++ E E+L G + ++ V +Y KL ++A Sbjct: 408 AYLSAIPGYAGEYEVLLNHGSKFKINKVDSYKDGTVTKLILDA 450
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 30.5 bits (68), Expect = 0.002 Identities = 22/88 (25%), Positives = 36/88 (40%), Gaps = 9/88 (10%) Query: 57 SEKELEQKYGEDRFQGYLDGYKEGLEKSDIPKWSDIKVPDGRDDDYRDGYEQGFLEGRRE 116 +E LEQ+ + + Q + GY+ G+ + G Y++G QG +G E Sbjct: 36 AEPSLEQQLAQLQMQAHEQGYQAGIAEGR---------QQGHKQGYQEGLAQGLEQGLAE 86 Query: 117 ARPIASFFEAVWQVLTDIFGGWFSSNDS 144 A+ + A Q L F + DS Sbjct: 87 AKSQQAPIHARMQQLVSEFQTTLDALDS 114
>PF05844#YopD protein Length = 295 Score = 26.9 bits (59), Expect = 0.005 Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 2/39 (5%) Query: 12 MASISGGNAPGDAVIGGLGGLASG--LKFCKLLHPVLAG 48 MA I+G A AV+G LG L +G + K L + G Sbjct: 123 MAVIAGVGALASAVVGSLGALKNGKAISQEKTLQKNIDG 161
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 38.8 bits (90), Expect = 4e-06 Identities = 14/58 (24%), Positives = 24/58 (41%) Query: 7 TKKKIAKAFKKQLAVKSFDKISVVDIMDQAQIRRQTFYNHFLDKYELLDWIFETELQE 64 T++ I + + + S+ +I A + R Y HF DK +L I+E Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 91.8 bits (228), Expect = 1e-23 Identities = 29/133 (21%), Positives = 65/133 (48%), Gaps = 1/133 (0%) Query: 3 KILIVDDEKPISDIIKFNLTKEGYDIVTAFDGREAVTIFEEEKPDLIILDLMLPELDGLE 62 IL+ DD+ I ++ L++ GYD+ + DL++ D+++P+ + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 63 VAKEIRKT-SHVPIIMLSAKDSEFDKVIGLEIGADDYVTKPFSNRELLARVKAHLRRTET 121 + I+K +P++++SA+++ + E GA DY+ KPF EL+ + L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 IETAVAEENASSG 134 + + +++ Sbjct: 125 RPSKLEDDSQDGM 137
>PF06580#Sensor histidine kinase Length = 349 Score = 44.5 bits (105), Expect = 5e-07 Identities = 30/187 (16%), Positives = 72/187 (38%), Gaps = 34/187 (18%) Query: 253 DETNRMMRMISDLL--NLSRIDNQVTQLAVEMTNFTAFITSILNRFDLVKNQHTGTGKVY 310 + M+ +S+L+ +L + + LA E+T +++ +F +++ ++ Sbjct: 191 TKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQF---EDRLQFENQIN 247 Query: 311 EIVRDYPITSVWLEIDNDKMTQVIENILNNAIKYSPDGGKITVRMKTTDTQLIISISDQG 370 + D + + ++ ++EN + + I P GGKI ++ + + + + + G Sbjct: 248 PAIMDVQVPPMLVQT-------LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTG 300 Query: 371 LGIPKTDLPLIFDRFYRVDKARSRAQGGTGLGLAIAKEIIKQHHGF---IWAKSDYGKGS 427 K + TG GL +E ++ +G I GK Sbjct: 301 SLALKNT------------------KESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKV- 341 Query: 428 TFTIVLP 434 +++P Sbjct: 342 NAMVLIP 348
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 48.5 bits (115), Expect = 1e-07 Identities = 48/313 (15%), Positives = 95/313 (30%), Gaps = 10/313 (3%) Query: 209 AKVAKQFLELDANRKQLQLDILVKDIDIAQERQTKDTEALAALQQDLASYYAKRQSMEED 268 + VA + + Q + D + + + + + + AL+ + + +E Sbjct: 41 SAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEK 100 Query: 269 YQKFKQKKQVLSQESDQTQTTLLELTKLIADLEKQIELVKLESGQ---EAEKKAEAKKHL 325 +K + + + + + +L K + + E A K L Sbjct: 101 LRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL 160 Query: 326 EQLQEQLDGFQAEEKQRTEQLLHIDQQLCDVKQQLNELSNALERFSSDPDQLMETLREEF 385 E+ E F + + + L L + +L + FS+ ++TL E Sbjct: 161 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEK 220 Query: 386 VLLMQKEAALSNQLTALKAHLDKEKQARQHKAQEYQLLVTKLDQLNDESQKAQAHYKAQK 445 L ++A L L + + E L + +L + A A Sbjct: 221 AALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADS 280 Query: 446 EQVEMLLQNYQEGDKRVQELERDYQLNQERLFDLLDQ-------KKGKEARKASLESIQK 498 +++ L + +LE Q+ L KK EA LE K Sbjct: 281 AKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNK 340 Query: 499 SHSQFYAGVRAVL 511 +R L Sbjct: 341 ISEASRQSLRRDL 353 Score = 30.4 bits (68), Expect = 0.045 Identities = 38/243 (15%), Positives = 88/243 (36%), Gaps = 18/243 (7%) Query: 169 KYKTRKKETQIKLNQTQDNLDRLEDIIYELDTQLAPLEKQAKVAKQFLELDANRKQLQLD 228 + + + LE L+ + A LEK + A F ++ Sbjct: 229 DLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTA----DSAKIK 284 Query: 229 ILVKDIDIAQERQTKDTEALAALQ-------QDLASYYAKRQSMEEDYQKFKQKKQVLSQ 281 L + + + L +DL + ++ +E ++QK +++ ++ Sbjct: 285 TLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEA 344 Query: 282 ESDQTQTTLLELTKLIADLEKQIELVKLESGQEAEKKAEAKKHLEQLQEQLDGFQAEEKQ 341 + L + LE + + ++ E+ ++ + L+ LD + +KQ Sbjct: 345 SRQSLRRDLDASREAKKQLEAEHQKLE-------EQNKISEASRQSLRRDLDASREAKKQ 397 Query: 342 RTEQLLHIDQQLCDVKQQLNELSNALERFSSDPDQLMETLREEFVLLMQKEAALSNQLTA 401 + L + +L +++ EL + + + +L L E L +K A + +L Sbjct: 398 VEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAK 457 Query: 402 LKA 404 L+A Sbjct: 458 LRA 460
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 38.3 bits (89), Expect = 4e-05 Identities = 28/141 (19%), Positives = 59/141 (41%), Gaps = 13/141 (9%) Query: 52 SVIGVLFNLFGGVIADSFKR----KKIIITTNILCGTACLVLSFLTKEQWLVYAIVLTNV 107 + G+L +L +I ++ ++ I GT ++L+F T W+ + I+ V Sbjct: 253 AAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT-RGWMAFPIM---V 308 Query: 108 ILAFMSAFSSPSYKAFTKEIVKKDSISQLNSLLETTSTVIKVTVPMVAIFLYKLLGIHGV 167 +LA P+ +A V ++ QL L +++ + P++ +Y + Sbjct: 309 LLAS-GGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA----ASI 363 Query: 168 LLLDGLSFLIAALLISFILPV 188 +G +++ A L LP Sbjct: 364 TTWNGWAWIAGAALYLLCLPA 384
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 31.2 bits (70), Expect = 0.010 Identities = 43/226 (19%), Positives = 88/226 (38%), Gaps = 24/226 (10%) Query: 171 NLYDNIARYKERLKDKSDQLTTFRNARKYAFISNLVGGKKQFEANVSEIKRLEYDLSHLQ 230 ++ + E + S + + A + L + + E + S Sbjct: 225 ARKADLEKALEGAMNFSTADSA-KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKI 283 Query: 231 DTQQDKIDSDDIEKNQQKLQ-------LRNTKLELDNSLRDKQRRLKLLDISIEFGLYPT 283 T + + + + EK + Q ++ + +LD S R+ +++L+ +E + Sbjct: 284 KTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDAS-REAKKQLEAEHQKLEEQNKIS 342 Query: 284 ESDLTELQQYFPDTNLKKLYEVEAYHKKL----------ATILDSEFSTERES---LIAE 330 E+ L++ D + + ++EA H+KL L + RE+ + Sbjct: 343 EASRQSLRRDL-DASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKA 401 Query: 331 IDELESQLTTLSQELQELGNIPNLS-SEYLENYSKLTATINALKEQ 375 ++E S+L L + +EL L+ E E +KL A ALKE+ Sbjct: 402 LEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEK 447
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 26.0 bits (57), Expect = 0.043 Identities = 16/63 (25%), Positives = 25/63 (39%), Gaps = 8/63 (12%) Query: 3 PLIQSLTEGQLR-SDIPNFRPGDTVRVHAKVVE-------GTRERIQIFEGVVISRKGQG 54 ++ + +L DI R GD +R+H V G R++ GVV + Sbjct: 260 DVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIGNRKKFLCQPGVVGKKIAAQ 319 Query: 55 ISE 57 I E Sbjct: 320 ILE 322
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 30.9 bits (70), Expect = 0.004 Identities = 15/76 (19%), Positives = 32/76 (42%), Gaps = 1/76 (1%) Query: 37 SYQDFLDVLLSLFQFVVIILVLFFYSATINLGEVLTFLTQTSWHWQILCYLVLYLMAIIE 96 S + ++ L S+ + V++ ++++ NL +L T L +L + +I Sbjct: 133 SIKSLVEFLKSILKVVLLSILIWII-IKGNLVTLLQLPTCGIECITPLLGQILRQLMVIC 191 Query: 97 MTLLVLILIFDVLLQK 112 V+I I D + Sbjct: 192 TVGFVVISIADYAFEY 207
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 44.4 bits (105), Expect = 6e-07 Identities = 21/112 (18%), Positives = 45/112 (40%), Gaps = 13/112 (11%) Query: 170 QQLQDLNDAYADAQAEVNKAQIALNDTVVISSVSGTVVE-----VNNDIDPSSKNSQTLV 224 +L+ D E+ K + +V+ + VS V + + ++TL+ Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTT----AETLM 357 Query: 225 HVATEGQ-LQVKGTLTEYDLANVKVGQSVKIKSKVYSNQEW---TGKISYVS 272 + E L+V + D+ + VGQ+ IK + + + GK+ ++ Sbjct: 358 VIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN 409 Score = 37.1 bits (86), Expect = 1e-04 Identities = 24/185 (12%), Positives = 53/185 (28%), Gaps = 29/185 (15%) Query: 21 ITLVLIITGVVLWKQQQNTLTADIAKEPYSTVSVTEGSIASSTLLSGTVKALSEEYIYFD 80 ++ + + + + V+ G + S S +K + + Sbjct: 62 YFIMGFLVIAFIL--------SVLG--QVEIVATANGKLTHSGR-SKEIKPIENSIV--- 107 Query: 81 ANKGNDATVTVKVGDQVTQGQQLVQYNTTTA-------QSAYDTAVRSLNKIGRQINHLK 133 + VK G+ V +G L++ A QS+ A + ++ Sbjct: 108 ------KEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIE 161 Query: 134 TYGVPAV--STETNKDEATGEETTTTVQPSAQQNANYKQQLQDLNDAYADAQAEVNKAQI 191 +P + E + EE +Q + ++ Q +AE Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLA 221 Query: 192 ALNDT 196 +N Sbjct: 222 RINRY 226
>BACTRLTOXIN#Bacterial toxin signature. Length = 266 Score = 49.5 bits (118), Expect = 1e-10 Identities = 23/78 (29%), Positives = 33/78 (42%), Gaps = 13/78 (16%) Query: 3 TDKKEVAIQEFDVKSRYYLQKHFNICGFSDVKNFGRSSRFKSGLEEGNIVFHLNSGEKIS 62 TDKK V QE D+K+R +L N+ F+ S E G I F N+G Sbjct: 176 TDKKSVTAQELDIKARNFLINKKNLYEFNS-----------SPYETGYIKFIENNGNTFW 224 Query: 63 YNLFDT--EFGDRESILK 78 Y++ + D+ L Sbjct: 225 YDMMPAPGDKFDQSKYLM 242
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 112 bits (282), Expect = 4e-28 Identities = 51/156 (32%), Positives = 81/156 (51%), Gaps = 8/156 (5%) Query: 12 KIRNFSIIAHIDHGKSTLADRILEK---TETVSSREMQAQLLDSMDLERERGITIKLNAI 68 KI N ++AH+D GK+TL + +L + S + D+ LER+RGITI+ Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 69 ELNYTAKDGETYIFHLIDTPGHVDFTYEVSRSLAACEGAILVVDAAQGIEAQTLANVYLA 128 + E ++IDTPGH+DF EV RSL+ +GAIL++ A G++AQT + Sbjct: 62 SFQW-----ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116 Query: 129 LDNDLEILPVINKIDLPAADPERVRHEVEDVIGLDA 164 + + INKID D V ++++ + + Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEI 152 Score = 93.0 bits (231), Expect = 7e-22 Identities = 44/214 (20%), Positives = 93/214 (43%), Gaps = 16/214 (7%) Query: 171 SAKAGIGIEEILEQIVEKVPAPTGDVDAPLQALIFDSVYDAYRGVILQVRIVNGIVKPGD 230 SAK IGI+ ++E I K + T + L +F Y R + +R+ +G++ D Sbjct: 220 SAKNNIGIDNLIEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRD 279 Query: 231 KIQMMSNGKTFDVTEVGIFTP-KAVGRDFLATGDVGYVAASIKTVADTRVGDTVTLANNP 289 +++ K +TE+ + D +G++ + + +GDT L Sbjct: 280 SVRISEKEKI-KITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQRE 337 Query: 290 AKEALHGYKQMNPMVFAGIYPIESNKYNDLREALEKLQLNDASLQFE--PETSQALGFGF 347 E P++ + P + + L +AL ++ +D L++ T + + Sbjct: 338 RIENPL------PLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEII---- 387 Query: 348 RCGFLGLLHMDVIQERLEREFNIDLIMTAPSVVY 381 FLG + M+V L+ ++++++ + P+V+Y Sbjct: 388 -LSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIY 420 Score = 43.3 bits (102), Expect = 2e-06 Identities = 21/104 (20%), Positives = 41/104 (39%), Gaps = 12/104 (11%) Query: 393 VSNPSEFPAPTRVAFIE----------EPYVKAQIMVPQEFVGAVMELSQRKRGDFVTMD 442 VS P++F + + EPY+ +I PQE++ + + + V Sbjct: 510 VSTPADFRMLAPIVLEQVLKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ 569 Query: 443 YIDDNRVNVIYQIPLAEIVFDFFDKLKSSTRGYASFDYDMSEYR 486 + +N V + +IP I ++ L T G + ++ Y Sbjct: 570 -LKNNEVILSGEIPARCI-QEYRSDLTFFTNGRSVCLTELKGYH 611
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 73.6 bits (180), Expect = 2e-16 Identities = 36/105 (34%), Positives = 48/105 (45%), Gaps = 12/105 (11%) Query: 258 QPGKPAPKTPEVPQKPDTAPDTPKPPQIPGQSKDVTPAPQNPSNRGLNKPQTQGGNQLAK 317 + K A + ++ + TP P + +G NQ Sbjct: 447 KLAKQAEELAKLRAGKASDSQTPDAK----------PGNKAVPGKGQAPQAGTKPNQ--N 494 Query: 318 TPAAHDTHRQLPATGETTNPFFTAAAVAIMTTAGVVAVAKRQENN 362 +T RQLP+TGET NPFFTAAA+ +M TAGV AV KR+E N Sbjct: 495 KAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRKEEN 539
>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature. Length = 483 Score = 36.6 bits (84), Expect = 1e-04 Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 4/82 (4%) Query: 31 SGSQSDKLVIYNWGDYIDPALLKKFTKETGIEVQYETFDSNEAMYTKIKQGGTTYDIAVP 90 S S V+ N+ YI P LL++ + + + T+ SNE + TY +AV Sbjct: 21 SSCGSTTFVLANFESYISPLLLER--VQEKHPLTFLTYPSNEKLINGF--ANNTYSVAVA 76 Query: 91 SDYTIDKMIKENLLNKLDKSKL 112 S Y + ++I+ +LL+ +D S+ Sbjct: 77 STYAVSELIERDLLSPIDWSQF 98
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 70.2 bits (172), Expect = 3e-16 Identities = 23/132 (17%), Positives = 51/132 (38%), Gaps = 2/132 (1%) Query: 3 VLIIEDDPMVDFIHRNYLEKLNLFDRIISSDSMKAVQSILTDYAIDLILLDIHITDGNGI 62 +L+ +DD + + L + + + + + + DL++ D+ + D N Sbjct: 6 ILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 63 QFLEKLRAQHIPCEVIIISAANDGNIIRDGFHLGIIDYLIKPFTFERFQESIQQFVTHRE 122 L +++ V+++SA N G DYL KPF I + + + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 123 HLANQQLEQAQT 134 ++ + +Q Sbjct: 124 RRPSKLEDDSQD 135
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 65.8 bits (160), Expect = 5e-14 Identities = 76/299 (25%), Positives = 120/299 (40%), Gaps = 45/299 (15%) Query: 36 DLKVAMVTDTGGVDDKSFNQSAWEGLQSWGKEMGLQKGTGFDYFQSTSESEYATNLDTAV 95 LK ++TD G +DDKSFNQSA+E L++ + K TG + S + + ++A+ Sbjct: 61 KLKPVLITDEGKIDDKSFNQSAFEALKA------INKQTGIEINNVEPSSNFESAYNSAL 114 Query: 96 SGGYQLIYGIGFALKDAIAKAAGD------NEGVKFVIIDDIIEGKDNV-ASVTFADHEA 148 S G+++ GF + +I + +K + ID IE + S+ F E+ Sbjct: 115 SAGHKIWVLNGFKHQQSIKQYIDAHREELERNQIKIIGIDFDIETEYKWFYSLQFNIKES 174 Query: 149 AYLAGIAAAKTTKTK-----TVGFVGGMEGTVITRFEKGFEAGVKS---------VDDTI 194 A+ G A A + V GG +T F +GF G+ + T Sbjct: 175 AFTTGYAIASWLSEQDESKRVVASFGGGAFPGVTTFNEGFAKGILYYNQKHKSSKIYHTS 234 Query: 195 QVKVDYAGSFGDAAKGKTIAAAQYAAGADVIYQAAGG---TGAGVFNEAKAINEKRSEAD 251 VK+D +G I + ADV Y G F + N+ + Sbjct: 235 PVKLD-SGFTAGEKMNTVINNVLSSTPADVKYNPHVILSVAGPATFETVRLANKGQ---- 289 Query: 252 KVWVIGVDRDQKDEGKYTSKDGKEANFVLASSIKEVGKAVQLINKQVADKKFPGGKTTV 310 +VIGVD DQ +D +L S +K + +AV + +K G K V Sbjct: 290 --YVIGVDSDQG-----MIQDKDR---ILTSVLKHIKQAVYETLLDLILEKEEGYKPYV 338
>PF06580#Sensor histidine kinase Length = 349 Score = 39.1 bits (91), Expect = 2e-05 Identities = 15/75 (20%), Positives = 31/75 (41%), Gaps = 5/75 (6%) Query: 312 YGKIFYFQNQVNRSLRMDKALLKQLITILFDNAIKY----TDKNGIIEIIVKTTDKNLLI 367 + F+NQ+N ++ D + L+ L +N IK+ + G I + + + + Sbjct: 236 FEDRLQFENQINPAIM-DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTL 294 Query: 368 SVIDNGPGITDEEKK 382 V + G K+ Sbjct: 295 EVENTGSLALKNTKE 309
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 30.0 bits (67), Expect = 0.042 Identities = 23/127 (18%), Positives = 40/127 (31%), Gaps = 27/127 (21%) Query: 216 AFSKDYQKRVTQNQAHLDNLLKDNGQ-----KRYDDLQNQYDLALKNGRAALAKETVKLA 270 FS ++ +A L + + + +K A A + A Sbjct: 239 NFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKA 298 Query: 271 ASEENLTFLEVS---------ALQEAKHQIEQGKQALAKEEKQ------------LEQVQ 309 E L + A +EAK Q+E Q L +E+ + L+ + Sbjct: 299 DLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKL-EEQNKISEASRQSLRRDLDASR 357 Query: 310 ATKDKLE 316 K +LE Sbjct: 358 EAKKQLE 364
>PF05272#Virulence-associated E family protein Length = 892 Score = 33.5 bits (76), Expect = 7e-04 Identities = 18/41 (43%), Positives = 23/41 (56%), Gaps = 2/41 (4%) Query: 36 KGELVVIL-GASGAGKSTVLNILGGMD-TVDAGQVIIDGKD 74 K + V+L G G GKST++N L G+D D I GKD Sbjct: 594 KFDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKD 634
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 40.8 bits (95), Expect = 8e-07 Identities = 13/48 (27%), Positives = 26/48 (54%) Query: 4 RHTETKAYVKTALITLLTEQSFETLTVSDLTKKAGINRGTFYLHYTDK 51 ET+ ++ + L ++Q + ++ ++ K AG+ RG Y H+ DK Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDK 55
>BACTRLTOXIN#Bacterial toxin signature. Length = 266 Score = 43.0 bits (101), Expect = 2e-07 Identities = 28/115 (24%), Positives = 50/115 (43%), Gaps = 18/115 (15%) Query: 71 PEEKAIYINIFGEKELRTLTAKDKITFKNNIVTLQEIDVRLRKSLMGDSKIKLYEYD-SL 129 + + + ++ K T ++ VT QE+D++ R L+ +K LYE++ S Sbjct: 153 GNLQNVLVRVYENKRN---TISFEVQTDKKSVTAQELDIKARNFLI--NKKNLYEFNSSP 207 Query: 130 YKKGFWDIHYKDGGIRHTNLFTYPD-----------YTDNETIDMSKVSHFDVHL 173 Y+ G+ +G ++ P Y DN+T+D SK +VHL Sbjct: 208 YETGYIKFIENNGNTFWYDMMPAPGDKFDQSKYLMMYNDNKTVD-SKSVKIEVHL 261
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 94.8 bits (235), Expect = 9e-26 Identities = 45/123 (36%), Positives = 64/123 (52%), Gaps = 8/123 (6%) Query: 23 SLTAAQAILESGWGKHA-------PHNALFGIKADSSWTGKSFDTKTQEEYQPGVVTDIV 75 L AQA LESGWG+ P LFG+KA +W G + T E Y+ G + Sbjct: 172 HLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTE-YENGEAKKVK 230 Query: 76 DRFRAYGSWDESILDHGKFLNDNPRYKAVVGETDYKKACHAIKEAGYATASGYAELLIQI 135 +FR Y S+ E++ D+ L NPRY AV ++ A+++AGYAT YA L + Sbjct: 231 AKFRVYSSYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNM 290 Query: 136 IKE 138 I++ Sbjct: 291 IQQ 293
>PF07212#Hyaluronoglucosaminidase Length = 336 Score = 438 bits (1127), Expect = e-158 Identities = 195/270 (72%), Positives = 226/270 (83%), Gaps = 2/270 (0%) Query: 5 KKKETDNKIAKLESIKADKDTVYLKAESKKELDKKMNLTGGTMTGQLQFKPN-SHIKHSS 63 +K+ET++KI KLES KADK+ VYLKAESK ELDKK+NL GG MTGQLQFKPN S IK SS Sbjct: 65 QKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQLQFKPNKSGIKPSS 124 Query: 64 STGGAINIDMSKSAGAAMVMYTNKDTTDGPLMILRSDKDTFDQSAQFVDYSGKTNAVNIV 123 S GGAINIDMSKS GA +V+Y+N DT+DGPLM LR+ K+TF+QSA FVDYSGKTNAVNI Sbjct: 125 SVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALFVDYSGKTNAVNIA 184 Query: 124 MRQPSTPNFSSALNITSANEGGSAMQIRGIERALGTLKITHENPNVDAKYDENAAALSID 183 MRQP+TPNFSSALNITS NE GSAMQIRG+E+ALGTLKITHENPNV+A YDENAAALSID Sbjct: 185 MRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVEANYDENAAALSID 244 Query: 184 IVGKRGASGNGTAAQGIFINSSAGTTGKMLRIRNKNKDKFYVNPDGGFHSYADSIVDGNL 243 IV K+ G GTAAQGI+INS++GTTGK+LRIRN DKFYV DGGF++ S +DGNL Sbjct: 245 IV-KKQKGGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGGFYAKKTSQIDGNL 303 Query: 244 TVKDPTSGKHAATKDYVDKKFDELKKLIQK 273 +K+PT+ HAATK YVD + +LK L+ Sbjct: 304 KLKNPTADDHAATKAYVDSEVKKLKALLMD 333
>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type M signature. Length = 147 Score = 28.9 bits (64), Expect = 0.041 Identities = 23/65 (35%), Positives = 29/65 (44%), Gaps = 6/65 (9%) Query: 387 ERINALENNQKVITNNQKQFELNLPKYLNDINGKRVWYEKPDDNIEHKIGDYWFEKNGKY 446 E I AL Q ++ K EL + + I KR EK + + K YW K G Y Sbjct: 66 EEIYALLRKQSIVRRQIKDLELQIIQ----IQEKRSELEKKREEFQEK-SKYWLRKEGNY 120 Query: 447 QRTWI 451 QR WI Sbjct: 121 QR-WI 124
>BINARYTOXINA#Clostridial binary toxin A signature. Length = 454 Score = 38.9 bits (90), Expect = 6e-06 Identities = 41/148 (27%), Positives = 65/148 (43%), Gaps = 21/148 (14%) Query: 51 DEYIIASSGPTINGRLRSGSVDEKIENIYQTLKKYSTKADIVVYRGVSMETLEKMVESA- 109 + Y+I S+GP N + +D K+ NI LK ++++VYR + + S Sbjct: 296 NNYLI-SNGPLNN---PNPELDSKVNNIENALKLTPIPSNLIVYRRSGPQEFGLTLTSPE 351 Query: 110 ----QVEGCIDFKEK---------GFLHTSL--VKGFEFRDPYKKLRIKIPKGTNAFYVG 154 ++E FKEK F+ TS+ V F LRI IPK + Y+ Sbjct: 352 YDFNKIENIDAFKEKWEGKVITYPNFISTSIGSVNMSAFAKRKIILRINIPKDSPGAYLS 411 Query: 155 NLNNEETHYYEVIIQKGAKLKVISIDDY 182 + YEV++ G+K K+ +D Y Sbjct: 412 AIPGYAGE-YEVLLNHGSKFKINKVDSY 438
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 52.0 bits (124), Expect = 1e-08 Identities = 44/245 (17%), Positives = 102/245 (41%), Gaps = 3/245 (1%) Query: 15 DTQPLQRALKGINKESAESTKELKQIDKALKFDTGNVTLLTQKQEVLSKQIATTKEKLET 74 + +K + E A +++KAL+ T + K + L + A + Sbjct: 170 FSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKAD 229 Query: 75 LRQAQSQVEAQFQRGDIGAEQYRAFQREVETTQNVLKSYETKLEGVNRALDSHGNTVESN 134 L +A + + + + E + E LEG + +++ Sbjct: 230 LEKALEGAMNFST---ADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTL 286 Query: 135 RSKLNSLEAEQAQLASESEKLNSTFRLQESQLGSNASESEKLALAQRRIASQSELVERQI 194 ++ +LEAE+A L +S+ LN+ + L ++ ++L +++ Q+++ E Sbjct: 287 EAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASR 346 Query: 195 ANLERQLELTKSEYGENSVEANRLEKTLNDTKTAYNNLQQEMEGLSNASQQSAASLEQTN 254 +L R L+ ++ + E +LE+ ++ + +L+++++ A +Q +LE+ N Sbjct: 347 QSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEAN 406 Query: 255 GLLKA 259 L A Sbjct: 407 SKLAA 411 Score = 51.2 bits (122), Expect = 2e-08 Identities = 37/256 (14%), Positives = 87/256 (33%), Gaps = 17/256 (6%) Query: 11 EIGGDTQPLQRALKGINKESAESTKELKQIDKALKFDTGNVTLLTQKQEVLSKQIATTKE 70 ++ + + L+ + +E + + ++L++ DK+L + L ++ L K + Sbjct: 75 DLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMN 134 Query: 71 KLETLRQAQSQVEAQFQRGDIGAEQYRAFQREVETTQNVLKSYETKLEGVNRAL------ 124 +EA+ A + ++ +E N + K++ + Sbjct: 135 FSTADSAKIKTLEAEKA---ALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEAR 191 Query: 125 -DSHGNTVESNRSKLNSLEAEQAQLASESEKLNSTFRLQESQLGSNASESEKLALAQRRI 183 +E + + A+ L +E L + E L + S + + + Sbjct: 192 QAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTL 251 Query: 184 ASQSELVERQIANLERQLELTKSEYGENSVEANRLEKTLNDTKTAYNNLQQEMEGLSNAS 243 ++ +E + A LE+ LE + + + L+ E L + S Sbjct: 252 EAEKAALEARQAELEKALEGAMNFSTA-------DSAKIKTLEAEKAALEAEKADLEHQS 304 Query: 244 QQSAASLEQTNGLLKA 259 Q A+ + L A Sbjct: 305 QVLNANRQSLRRDLDA 320 Score = 51.2 bits (122), Expect = 2e-08 Identities = 25/211 (11%), Positives = 70/211 (33%), Gaps = 6/211 (2%) Query: 54 LTQKQEVLSKQIATTKEKLETLRQAQSQVEAQFQRGDIGAEQYRAFQREVETTQNVLKSY 113 + L + + + L+ ++ + E+ R + + + ++ Sbjct: 62 FEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAK---EKLRKNDKSLSEKASKIQEL 118 Query: 114 ETKLEGVNRALDSHGNTVESNRSKLNSLEAEQAQLASESEKLNSTFRLQESQLGSNASES 173 E + + +AL+ N ++ +K+ +LEAE+A LA+ L + +++++ Sbjct: 119 EARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKI 178 Query: 174 EKLALAQRRIASQSELVERQIANLERQLELTKS---EYGENSVEANRLEKTLNDTKTAYN 230 + L + + ++ +E+ + + + L Sbjct: 179 KTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAM 238 Query: 231 NLQQEMEGLSNASQQSAASLEQTNGLLKADI 261 N + A+LE L+ + Sbjct: 239 NFSTADSAKIKTLEAEKAALEARQAELEKAL 269
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 30.1 bits (67), Expect = 0.020 Identities = 15/41 (36%), Positives = 23/41 (56%), Gaps = 3/41 (7%) Query: 55 LQKFGDRTIR---RWEAEETHPSKLEQSSIQSFFNSLKNPP 92 QKFGD+ R W + + PSK+ SI++F ++ PP Sbjct: 109 FQKFGDQRYRIFTSWVSHQNDPSKINTRSIRNFMENIIQPP 149
>PF05272#Virulence-associated E family protein Length = 892 Score = 33.9 bits (77), Expect = 0.002 Identities = 34/216 (15%), Positives = 63/216 (29%), Gaps = 49/216 (22%) Query: 35 LVGANGAGKSTLFKVLLGELIPPGCKMNHLGELAYIPQLD-EVTLQEEKDFA--LVGKLG 91 L G G GKSTL L+G D + KD + G + Sbjct: 601 LEGTGGIGKSTLINTLVGLDF----------------FSDTHFDIGTGKDSYEQIAGIVA 644 Query: 92 VEQLNIQTMSGGEETRLKIAQALSAQVHGI---LADEPTSHLDREGI--------DFL-- 138 E + + +K S++ H R+ + +L Sbjct: 645 YELSEMTAFRRADAEAVK--AFFSSRKDRYRGAYGRYVQDH-PRQVVIWCTTNKRQYLFD 701 Query: 139 -IGQLKYFTGALLVISH-DRYFLDEIVDKIW-ELK----DGKITEYWGNYSDYLRQKEEE 191 G +++ +LV + +L + +++ E G+ Y+ + D ++ Sbjct: 702 ITGNRRFWP--VLVPGRANLVWLQKFRGQLFAEALHLYLAGE--RYFPSPED---EEIYF 754 Query: 192 RKRQAAEYEQFIAERARLERAAEEKRKQARKIEQKA 227 R Q + + E A QK Sbjct: 755 RPEQELRLVETGVQGRLWALLTREGAPAAEGAAQKG 790
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 45.6 bits (108), Expect = 2e-07 Identities = 54/337 (16%), Positives = 113/337 (33%), Gaps = 16/337 (4%) Query: 59 VFGPAIGVLVDRHDRKKIMIGADLIIAAAGSVLTIVAFYMELPVWMVMIVLFIRSIGTAF 118 P +G L DR R+ + L+++ AG+ + +W++ I + I T Sbjct: 58 ACAPVLGALSDRFGRRPV-----LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGA 111 Query: 119 HTPALNAVTPLLVPEEQLTKCAGYSQSLQSISYIVSPAVAALLYSVWELNAIIAIDVLGA 178 A + ++ + G+ + + P + L+ A L Sbjct: 112 TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNG 171 Query: 179 VIASITVAIVRIPKLGDRVQSLDPNFIREMQEGMAVLRQNKGLFALLLVGTLYMFVYMPI 238 + ++ G+R R + AL+ V + V Sbjct: 172 LNFLTGCFLLPESHKGER--RPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVP 229 Query: 239 NALFPLISMDYFNGTPVHISITEISF-ASGMLIGGLLLGLFGNYQKRILLITASIFMMGI 297 AL+ + D F+ I I+ +F L ++ G + + G Sbjct: 230 AALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGT 289 Query: 298 SLTISGLLPQS-GFFIFVVCCAIMGLSVPFYSGVQTALFQEKIKPEYLGRVFSLTGSIMS 356 + + F +V A G+ +P A+ ++ E G++ ++ S Sbjct: 290 GYILLAFATRGWMAFPIMVLLASGGIGMPAL----QAMLSRQVDEERQGQLQGSLAALTS 345 Query: 357 LAMPIG-LILSALFADRIGV-NHWFLLSGTLIICIAI 391 L +G L+ +A++A I N W ++G + + + Sbjct: 346 LTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCL 382
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 29.8 bits (67), Expect = 0.007 Identities = 12/60 (20%), Positives = 20/60 (33%), Gaps = 12/60 (20%) Query: 10 TIFLTRTSCSNCGKQSTFERFDRVYAAKTPEIISAILDWDFFKFTCHNCNHKVLIDYPTV 69 + + R+ C +C E I +L W + + C C + YP V Sbjct: 66 NLMVPRSCCPHCNHPI------TAL-----ENIP-LLSWLWLRGRCRGCQAPISARYPLV 113
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 43.9 bits (103), Expect = 4e-06 Identities = 46/305 (15%), Positives = 100/305 (32%), Gaps = 23/305 (7%) Query: 303 LENTQKELEAQKQTNSQMITEKGKEVLKLDGEIGGEQGKLEEAKRKILDFNFALKEAQDA 362 ++ T Q + + +E+ ++D ++ + +E++ Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTV 1051 Query: 363 KQRYEQAKEEGTVKPDEDPGFDQIIETIKKDIQSKEQEKAGIGTKITELTGKKEKAQQEK 422 ++ + A E + +K + Q+ E ++G TK T+ T KE A EK Sbjct: 1052 EKNEQDATET---TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEK 1108 Query: 423 AGLESKNRELDKQIQEKKSKVDEIKTKIGPKQQESQEIEKKIQNNIPQDVETRIEKLKEE 482 E K + ++ QE ++ PKQ++S+ ++ + + D I++ + + Sbjct: 1109 ---EEKAKVETEKTQEVPKVTSQVS----PKQEQSETVQPQAEPARENDPTVNIKEPQSQ 1161 Query: 483 IKT--------EENKVKGGEIVLLTQEREKANLEKLIKENQEKLEKLERLLAEKAKLEK- 533 T +E + V + N EN + +E + K Sbjct: 1162 TNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKN 1221 Query: 534 ----EIQGLEGEIEDTNKSKPQFEKQAEEAKKARDTQKELVKKAKKDLSEEEEKLKNIQN 589 ++ + +E S A + +T L K K + Sbjct: 1222 RHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQ 1281 Query: 590 TIKEK 594 I + Sbjct: 1282 HISQL 1286 Score = 33.9 bits (77), Expect = 0.005 Identities = 44/267 (16%), Positives = 85/267 (31%), Gaps = 31/267 (11%) Query: 509 KLIKENQE------KLEKLERLL-AEKAKLEKEIQGLEGEIEDTNKSKPQFEKQ--AEEA 559 KL N ++EK + + IQ + N+ + ++ A Sbjct: 970 KLRNVNGRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPA 1029 Query: 560 KKARDTQKELVKKAKKDLSEEEEKLKNIQNTIKEKQNKLKGLDNKDQAIKDLEEEKAKIQ 619 E V + K S+ EK E+ N++ A +E K+ ++ Sbjct: 1030 PATPSETTETVAENSKQESKTVEK--------NEQDATETTAQNREVA----KEAKSNVK 1077 Query: 620 ENIDANKKEIEELEQEKNASKALSEKTANEIKTLKEKLLKLEEEQKAEDEKVKELKEKIK 679 N N+ E ++ + E E EE+ K E EK +E+ + Sbjct: 1078 ANTQTNEVAQSGSETKETQTTETKETATVE----------KEEKAKVETEKTQEVPKVTS 1127 Query: 680 KIDEKINGLDLEINNLKAEINKKRQMLAALEQKPISEIINPLLPKNKIKVNNLEKLTEKE 739 ++ K + + + Q + + P + N + +TE Sbjct: 1128 QVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTEST 1187 Query: 740 KEEIKNKIKDLNKNNFPKNTQVEVDEK 766 N + + +N P TQ V+ + Sbjct: 1188 TVNTGNSVVENPENTTPATTQPTVNSE 1214 Score = 32.7 bits (74), Expect = 0.011 Identities = 28/262 (10%), Positives = 67/262 (25%), Gaps = 26/262 (9%) Query: 367 EQAKEEGTVKPDEDPGFDQIIETIKKDIQSKEQEKAGIGTKITELTGKKEKAQQEKAGLE 426 E K TV + I + + E+ + E Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043 Query: 427 SKNRELDKQIQEKKSKVDEIKTKIGPKQQESQEIEKKIQNNIPQDVETRIEKLKEEIKTE 486 SK + E+ + Q + ++ N + + E Sbjct: 1044 SKQESKTVEKNEQDAT--------ETTAQNREVAKEAKSNVKANTQTNEVAQSGSE---- 1091 Query: 487 ENKVKGGEIVLLTQEREKANLEKLIKENQEKLEKLERLLAEKAKLEKEIQGLEGEIEDTN 546 +E E EK EK + + ++ K + + E + Sbjct: 1092 --------------TKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSE 1137 Query: 547 KSKPQFEKQAEEAKKARDTQKELVKKAKKDLSEEEEKLKNIQNTIKEKQNKLKGLDNKDQ 606 +PQ E E + + D + ++ + + + ++ + Sbjct: 1138 TVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVE 1197 Query: 607 AIKDLEEEKAKIQENIDANKKE 628 ++ + N +++ K Sbjct: 1198 NPENTTPATTQPTVNSESSNKP 1219 Score = 31.2 bits (70), Expect = 0.032 Identities = 29/172 (16%), Positives = 62/172 (36%), Gaps = 8/172 (4%) Query: 149 LTAEKQKEKESSEKVTELKANLESAKKDLEKKEADYVKENALVERDKKDLEKFEKEIAKA 208 AE K++ + + E A +A+ KEA + V+ + + E + ++ Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEA-----KSNVKANTQTNEVAQSG-SET 1092 Query: 209 REKKQTTEKAIKDINASKHDLIDKDKKLKEKLETNKTSTKTLQ--TAYDKAKKNLEEKRT 266 +E + T K + + ++ +K + T++ S K Q T +A+ E T Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152 Query: 267 ELEKLNKQYPPHGPALDQKLEEIEKEIKALEDEMKGLENTQKELEAQKQTNS 318 K + +Q +E ++ E + +E + T Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTP 1204
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 151 bits (383), Expect = 1e-49 Identities = 49/154 (31%), Positives = 85/154 (55%), Gaps = 4/154 (2%) Query: 19 KKEASKNEKT--KAVLNQAVADLSVAASIVHQVHWYMRGPGFLYLHPKMDELLDSLNANL 76 K E +K +T + LN +++ + S +H+ HWY++GP F LH K +EL D + Sbjct: 2 KTENAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETV 61 Query: 77 DEMSERLITIGGAPYSTLAEFSKHSKLDEAKGTYDKTVAQHLARLVEVYLYLSSLYQVGL 136 D ++ERL+ IGG P +T+ E+++H+ + + + + ++ + LV Y +SS + + Sbjct: 62 DTIAERLLAIGGQPVATVKEYTEHASITDGGN--ETSASEMVQALVNDYKQISSESKFVI 119 Query: 137 DITDEEGDAGTNDLFTAAKTEAEKTIWMLQAERG 170 + +E D T DLF E EK +WML + G Sbjct: 120 GLAEENQDNATADLFVGLIEEVEKQVWMLSSYLG 153
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 29.4 bits (66), Expect = 0.009 Identities = 42/160 (26%), Positives = 59/160 (36%), Gaps = 25/160 (15%) Query: 70 SLIIILWASMVHWVSASYCYLLLFSLLFSLF--DWRSQ------EYPFILWLFSFVSLLL 121 +L+ + A + + LLL +L +L D P + F L Sbjct: 118 ALLSVAVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGG 177 Query: 122 FYSIN---------YLSLILLLLGLLAHLRPFSIGAGDFFYLASLALVLDLTSLIWLIQL 172 F S+ YL L L +G GDF LA+L L +L ++ L Sbjct: 178 FVSLGDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLL 237 Query: 173 ASLAGITACLLL-------GIKRIPFIPYLSFGLFWIVLL 205 +SL G + L K IPF PYL+ WI LL Sbjct: 238 SSLVGAFMGIGLILLRNHHQSKPIPFGPYLAIA-GWIALL 276
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 34.2 bits (78), Expect = 1e-05 Identities = 9/34 (26%), Positives = 19/34 (55%) Query: 6 KLILQGGKAMVTIKQVAEEAGVSRSTVSRYISQK 39 +L Q G + ++ ++A+ AGV+R + + K Sbjct: 22 RLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDK 55
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 153 bits (388), Expect = 2e-50 Identities = 58/157 (36%), Positives = 94/157 (59%), Gaps = 2/157 (1%) Query: 5 IGLYTGSFDPVTNGHLDIVKRASGLFDQIYVGIFDNPTKKSYFKLEVRKAMLTQALADFT 64 +Y GSFDP+T GHLDI++R LFDQ+YV + NP K+ F ++ R + +A+A Sbjct: 2 NAIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLP 61 Query: 65 NVIVVTSHERLAIDVAKELRVTHLIRGLRNATDFEYEENLEYFNHLLAPNIETVYLISRN 124 N V + E L ++ A++ + ++RGLR +DFE E + N LA ++ETV+L + Sbjct: 62 NAQVDSF-EGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTST 120 Query: 125 KWQALSSSRVRELIHFQSSLEGLVPQSVIAQV-EKMN 160 ++ LSSS V+E+ F ++E VP V A + ++ + Sbjct: 121 EYSFLSSSLVKEVARFGGNVEHFVPSHVAAALYDQFH 157
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 404 bits (1039), Expect = e-144 Identities = 140/315 (44%), Positives = 203/315 (64%), Gaps = 6/315 (1%) Query: 3 KQKIVVALGGNAIL--STDASAKAQQEALMSTSKSLVKLIKEGHEVIVTHGNGPQVGNLL 60 +++V+ALGGNA+ S + + + T++ + ++I G+EV++THGNGPQVG+LL Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61 Query: 61 LQQAAADSEKN-PAMPLDTCVAMTEGSIGFWLVNALDNELQAQGIQKEVAAVVTQVIVDA 119 L A + PA P+D AM++G IG+ + AL NEL+ +G++K+V ++TQ IVD Sbjct: 62 LHMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDK 121 Query: 120 KDPAFENPTKPIGPFLTEEDAKKQMAESGASFKEDAGRGWRKVVPSPKPVGIKEANVIRS 179 DPAF+NPTKP+GPF EE AK+ E G KED+GRGWR+VVPSP P G EA I+ Sbjct: 122 NDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKK 181 Query: 180 LVDSGVVVVSAGGGGVPVVEDATSKSLTGVEAVIDKDFASQTLSGLVDADLFIVLTGVDN 239 LV+ GV+V+++GGGGVPV+ + + GVEAVIDKD A + L+ V+AD+F++LT V+ Sbjct: 182 LVERGVIVIASGGGGVPVILED--GEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNG 239 Query: 240 VYINFNKPDQAKLEEVTVSQMKEYITQDQFAPGSMLPKVEAAIAFVENKPNAKAIITSLE 299 + + + L EV V ++++Y + F GSM PKV AAI F+E +AII LE Sbjct: 240 AALYYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEW-GGERAIIAHLE 298 Query: 300 NIDNVLSANAGTQII 314 L GTQ++ Sbjct: 299 KAVEALEGKTGTQVL 313
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 33.3 bits (76), Expect = 0.004 Identities = 11/73 (15%), Positives = 27/73 (36%), Gaps = 6/73 (8%) Query: 805 YLPLADLLNVEEELARLDKELAKWQKELDMVGKKLGNERFVANAKPEVVQKEKDKQADYQ 864 + +L E + EL ++ +L+ + ++ +AK E + + + Sbjct: 248 AIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEI------LSAKEEYQLVTQLFKNEIL 301 Query: 865 AKYDATQERIVEM 877 K T + I + Sbjct: 302 DKLRQTTDNIGLL 314
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 82.6 bits (204), Expect = 5e-19 Identities = 30/127 (23%), Positives = 51/127 (40%), Gaps = 3/127 (2%) Query: 3 KVLLVDDEYMILQGLTMIIDWQALGFEVVQTARSGKEALAYLTQYPVDVMISDVTMPGMT 62 +L+ DD+ I L + G++V + ++ D++++DV MP Sbjct: 5 TILVADDDAAIRTVLNQAL--SRAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 63 GLDLIEAAKTYHPQLQTLILSGYQEFSYVQKAMELETKGYLLKPVDKAELQAKMKQFKDW 122 DL+ K P L L++S F KA E YL KP D EL + + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 123 LDAQQAE 129 + ++ Sbjct: 122 PKRRPSK 128
>PF06580#Sensor histidine kinase Length = 349 Score = 180 bits (457), Expect = 1e-53 Identities = 69/317 (21%), Positives = 130/317 (41%), Gaps = 33/317 (10%) Query: 251 LSKAYRMQYNRSGDLLAYVAVRKSYLLAEAVRTVFVYGLVSLLLAWLLLQLL-FRVFRNY 309 L+ AYR R G L + + A + + V+ W LL + + Sbjct: 55 LTHAYRSFIKRQG-WLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFT 113 Query: 310 IQQVSEITDTVEMVAAGDLSLTIDNSHMELELYHISEAINQMLASIKAYIDEVYVLEVEQ 369 + I V +V + M LY + +A ID+ + Sbjct: 114 LPLALSIIFNVVVV-----------TFMWSLLYF---GWHFFKNYKQAEIDQWK-MASMA 158 Query: 370 RDAQMRALQSQINPHFLYNTLEYIRMYALSCQQEELADVIYAFASLLRNNI--SQDKMTT 427 ++AQ+ AL++QINPHF++N L IR L + +++ + + L+R ++ S + + Sbjct: 159 QEAQLMALKAQINPHFMFNALNNIRALILE-DPTKAREMLTSLSELMRYSLRYSNARQVS 217 Query: 428 LKEELAFCEKYIYLYQMRYPDSFAYHVKIDESIADLAIPKFVIQPLVENYFVHGIDYSRH 487 L +EL + Y+ L +++ D + +I+ +I D+ +P ++Q LVEN HGI Sbjct: 218 LADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQ 277 Query: 488 DNALSIKALDETDHLLIQVLDNGRGISQERLADMKRRLQEHQTTGNSSIGLQNVYLRLFH 547 + +K + + ++V + G L T ++ GLQNV RL Sbjct: 278 GGKILLKGTKDNGTVTLEVENTG-------------SLALKNTKESTGTGLQNVRERLQM 324 Query: 548 HFRDRVSWSMAKEPNDG 564 + ++++ Sbjct: 325 LYGTEAQIKLSEKQGKV 341
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 94.8 bits (235), Expect = 5e-24 Identities = 46/123 (37%), Positives = 65/123 (52%), Gaps = 8/123 (6%) Query: 23 SLTAAQAILESGWGKYA-------PHNALFGIKADSSWTGKSFNTKTQEEYQPGIVTDIV 75 L AQA LESGWG+ P LFG+KA +W G T E Y+ G + Sbjct: 172 HLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTE-YENGEAKKVK 230 Query: 76 DRFRAYDSWEDSIADHGQFLADNPRYKAVIGEADYKKACHAIKDAGYATASGYADLLIQL 135 +FR Y S+ ++++D+ L NPRY AV A ++ A++DAGYAT YA L + Sbjct: 231 AKFRVYSSYLEALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNM 290 Query: 136 IEE 138 I++ Sbjct: 291 IQQ 293
>PF05043#Transcriptional activator Length = 493 Score = 54.9 bits (132), Expect = 4e-10 Identities = 30/162 (18%), Positives = 71/162 (43%), Gaps = 7/162 (4%) Query: 23 IEDLMDKERRAQYRLLVTLYHAKETLRLKDLMRLSNLSKVTLLKYIDNLNHLCREQGLAC 82 + DL+ K+ Q LL L+ K +L L N ++ + + ++ + Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIF-- 58 Query: 83 QLLLEKDSLSLKENGQFHWEDLVALLLKESVAYQILTYMYCHEHFNITNLSVELMVSEAT 142 + + E + K S + IL +++ +E ++ E +S ++ Sbjct: 59 -HSSTNGIRIINTDDS-DIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFYISSSS 116 Query: 143 LNRQLAHLNQLLS---EFDLALSQGRQLGSELQWRYFYFELF 181 L R ++ +N+++ +F+++L+ + +G+E RYF+ + F Sbjct: 117 LYRIISQINKVIKRQFQFEVSLTPVQIIGNERDIRYFFAQYF 158
>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature. Length = 544 Score = 39.2 bits (91), Expect = 2e-06 Identities = 15/78 (19%), Positives = 29/78 (37%), Gaps = 3/78 (3%) Query: 49 NQPKTSQTSKKVKLSEDKAKSIALKDASVTEADAQMLSVTQDNEDGKAVYEIEFQNKDQE 108 + S ++ +D A + + + E L + D E + YE+ + Sbjct: 134 TEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEETPRLAYEVNVRFLTPV 193 Query: 109 ---YSYTIDANSGDIVEK 123 + Y IDA G ++ K Sbjct: 194 PGNWIYMIDAADGKVLNK 211
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 35.6 bits (82), Expect = 6e-04 Identities = 34/195 (17%), Positives = 62/195 (31%), Gaps = 29/195 (14%) Query: 170 LKLDLNKANEQTASLQASINGLRQEYQDAERKLSASYQTGINGLKA-TMANDKY--DLKA 226 LKL A T Q+S L Q + R S +N L + ++ Y ++ Sbjct: 125 LKLTALGAEADTLKTQSS---LLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSE 181 Query: 227 EIQATARGLSQE----YDNKLHQLSAKIKTTSSG------TTEAYENKLAGLRAEFTR-- 274 E L +E + N+ +Q + + YEN ++ Sbjct: 182 EEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS 241 Query: 275 --SNQG-----TRTELESQISGLRAVQQTTASQISQEIRDRTGAVSRVQQDLESYQR--- 324 ++ E E++ + SQ+ Q + A Q + ++ Sbjct: 242 SLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL 301 Query: 325 -RLQDAEDNYSSLTH 338 +L+ DN LT Sbjct: 302 DKLRQTTDNIGLLTL 316
>PF07212#Hyaluronoglucosaminidase Length = 336 Score = 519 bits (1338), Expect = 0.0 Identities = 271/346 (78%), Positives = 297/346 (85%), Gaps = 15/346 (4%) Query: 1 MSENIPLRVQFKRMTASEWARSDVILLESEIGFETDTGFVRAGDGHNRFSELGYISPLDY 60 M+E IPLRVQFKRMTA EW RSDVILLESEIGFETDTG+ + GDG N+FS+L Y+ Sbjct: 1 MTETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYL----- 55 Query: 61 NLLTNKPNIDELATKVETAQKLQQ----KADKETVYTKAESKQELDKKLNLKGGVMTGQL 116 NKP++ A K ET K+ + KADK VY KAESK ELDKKLNLKGGVMTGQL Sbjct: 56 ----NKPDLGAFAQKEETNSKITKLESSKADKNAVYLKAESKIELDKKLNLKGGVMTGQL 111 Query: 117 KFKPAAT-VAYSSSTGGAVNIDLSSSRGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALF 175 +FKP + + SSS GGA+NID+S S GAGVVVYSNNDTSDGPLMSLRTGKETFNQSALF Sbjct: 112 QFKPNKSGIKPSSSVGGAINIDMSKSEGAGVVVYSNNDTSDGPLMSLRTGKETFNQSALF 171 Query: 176 VDYKGTTNAVNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPSIK 235 VDY G TNAVNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENP+++ Sbjct: 172 VDYSGKTNAVNIAMRQPTTPNFSSALNITSGNENGSAMQIRGVEKALGTLKITHENPNVE 231 Query: 236 ADYDKNAAALSIDIVKKQESGGKGTAAQGIYINSTSGTTGKLLRIRNLNDDKFYVKPDGG 295 A+YD+NAAALSIDIVKKQ+ GGKGTAAQGIYINSTSGTTGKLLRIRNL DDKFYVK DGG Sbjct: 232 ANYDENAAALSIDIVKKQK-GGKGTAAQGIYINSTSGTTGKLLRIRNLGDDKFYVKHDGG 290 Query: 296 FYAKETSQIDGNLKLKDPIANDHAATKAYVDGEVEKLKALLTAKQM 341 FYAK+TSQIDGNLKLK+P A+DHAATKAYVD EV+KLKALL KQ+ Sbjct: 291 FYAKKTSQIDGNLKLKNPTADDHAATKAYVDSEVKKLKALLMDKQV 336
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 34.9 bits (80), Expect = 0.002 Identities = 65/330 (19%), Positives = 116/330 (35%), Gaps = 44/330 (13%) Query: 643 ISAVIQSLTGVITAVFNGIATVISSVGSAIKDVLTG--LGTAFEGFGNGVK-SALEGVGA 699 ++ I ++S+ + + L+ + + +G S+ E A Sbjct: 124 AGNILGGGAENIGDNLGKAGGILSTFQNFLGTALSSMKIDELIKKQKSGGNVSSSELAKA 183 Query: 700 VIESFGSAVRNVLDGVANILDSMGTAALNAGRGVKEMARGIKMLVDLSLGDLVATLAAVA 759 IE V V N+ +S G + K L +G+ + L Sbjct: 184 SIELINQLVDTVASLNNNV-NSFSQQLNTLG----SVLSNTKHLN--GVGNKLQNL---- 232 Query: 760 SGLGKIAASAGQMTMLGSAMSKVANGMTHLATSATIAVAGLTVFATTMATIKTAVATLPP 819 L I A ++ + SA+S A + T A AG+ + + + ++ Sbjct: 233 PNLDNIGAGLDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVGKGISQYI- 291 Query: 820 VLTMAASGFTTFTTQAVAAVTGLTAINAPITMFKAQLMTITPALAQAGAGFAAFVAQSST 879 + AA G +T + A A L+ LA + F + +A Sbjct: 292 IAQRAAQGLST--SAAAAG-----------------LIASAVTLAISPLSFLS-IADKFK 331 Query: 880 FSTGLASAGPTIAAFNANLMSLSAT----TGVLVASIAGLSAVLSVVSAGFSQIGASATA 935 + + + SL A TG + AS+ +S VL+ VS+G S A+ T+ Sbjct: 332 RANKIEEYSQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGIS--AAATTS 389 Query: 936 TVGQ-IQAFASSTTVVSSAF--ASMQSMIQ 962 VG + A + T + S AS Q+M + Sbjct: 390 LVGAPVSALVGAVTGIISGILEASKQAMFE 419 Score = 32.6 bits (74), Expect = 0.013 Identities = 63/283 (22%), Positives = 107/283 (37%), Gaps = 47/283 (16%) Query: 423 LDKIGSKFGLFGNKAKEGTDKASNGARRSGGIISQIFSGLGNIVKSAGTAISTAAKGIGA 482 LDK+ K+ GN G + + ++GGI+S + LG + + I K + Sbjct: 114 LDKLLQKYQKAGNILGGGAENIGDNLGKAGGILSTFQNFLGTAL--SSMKIDELIKKQKS 171 Query: 483 G-----IKTALSGIPPI------ISSLGTAISTVAQGIGT-----GLAIAFKGLGAAIAM 526 G + A + I I ++SL +++ +Q + T G+G + Sbjct: 172 GGNVSSSELAKASIELINQLVDTVASLNNNVNSFSQQLNTLGSVLSNTKHLNGVGNKLQN 231 Query: 527 VPPTTWLALGAAVLM-----VGAAFALAGTQADG----------ISQILRTVGDVVVQ-- 569 +P + G + + A+F L+ AD +++L VG + Q Sbjct: 232 LPNLDNIGAGLDTVSGILSAISASFILSNADADTRTKAAAGVELTTKVLGNVGKGISQYI 291 Query: 570 ILQQVTDSLATLLPIIANAIGSMLPIVAGAISQIVGAVAGGLSQLVIAVSTGASLVIGAF 629 I Q+ L+T AG I+ V LS L IA + I + Sbjct: 292 IAQRAAQGLSTSAA------------AAGLIASAVTLAISPLSFLSIADKFKRANKIEEY 339 Query: 630 TGLLGGISGVINSISAVIQSLTGVITAVFNGIATVISSVGSAI 672 + + +S+ A TG I A I+TV++SV S I Sbjct: 340 SQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGI 382
>cloacin#Cloacin signature. Length = 551 Score = 25.8 bits (56), Expect = 0.012 Identities = 7/13 (53%), Positives = 12/13 (92%) Query: 8 NKKQKEWDESHPI 20 N++Q+EWD +HP+ Sbjct: 304 NRRQQEWDATHPV 316
>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature. Length = 522 Score = 32.8 bits (74), Expect = 0.002 Identities = 50/218 (22%), Positives = 87/218 (39%), Gaps = 15/218 (6%) Query: 50 YQRYADKEK--IDLSEARKRASELDISAYQKKAKELVAKAEK----LRKEGRTVTRDDFT 103 YQ + +K +D + ++ + +K+AKE KA+K RKE R R + Sbjct: 122 YQEFLKTKKLIVDAPDPKELEEQKKALEKEKEAKEQAQKAQKDKREKRKEERAKNRANLE 181 Query: 104 HQENADMSIYNLAMKTNALELLRLNIDLE---------MQELANGEHKLTKKFLDEGYRK 154 + NA + NL+ N EL++ + E MQE A + L++ + Sbjct: 182 NLTNAMSNPQNLSNNKNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQAE 241 Query: 155 ETEFQAGLLGLSVASQASVKSLADAVINANFKGAKWSDNIWDRQDKLRSIISQSVQSAIL 214 E Q +S+ + S KS D I + + W N+ R +K + Sbjct: 242 EAVRQRAKDKISIKTDKSQKSPEDNSIELSPSDSAWRTNLVVRTNKALYQFILRIAQKDN 301 Query: 215 RGKNGLTIARDIRREFDVSASYAKRLAITEHARVQMEV 252 LT+ + + +VS+ + L E A+ Q E+ Sbjct: 302 FASAYLTVKLEYPQRHEVSSVIEEELKKREEAKRQREL 339
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 46.0 bits (109), Expect = 3e-08 Identities = 21/118 (17%), Positives = 51/118 (43%), Gaps = 6/118 (5%) Query: 2 KILLIDDHRLFAKSIQLLFQQYD-EVDVIDTITSHFNDVTIDLSKYDIILLDINLTNISK 60 IL+ DD + + +V + + + + D+++ D+ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI--AAGDGDLVVTDVVM---PD 59 Query: 61 ENGLEIAKELIQSTPHLKVVMLTGYVKSIYRERAKKVGAYGFVDKNIDPKQLISILKK 118 EN ++ + ++ P L V++++ + +A + GAY ++ K D +LI I+ + Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>2FE2SRDCTASE#Ferric iron reductase signature. Length = 262 Score = 28.1 bits (62), Expect = 0.014 Identities = 16/62 (25%), Positives = 27/62 (43%), Gaps = 4/62 (6%) Query: 28 DDIRSMPMKFHTPLFRDNPSLSGGQKQRISLARE----LVTTPRILVLDEPTSALDVKTE 83 + + S+ + ++R+ P + K ISL + L+ P +L L ALDV E Sbjct: 64 NVLSSLLAVYSDHIYRNQPMMIRENKPLISLWAQWYIGLMVPPLMLALLTQEKALDVSPE 123 Query: 84 RI 85 Sbjct: 124 HF 125
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 29.8 bits (67), Expect = 0.005 Identities = 21/85 (24%), Positives = 38/85 (44%), Gaps = 11/85 (12%) Query: 1 MKKKERHEKILDILKVDGFIKVKDIIDEM-----NISDMTARRDLDTLADKGLL-IRTHG 54 M K +RH KI +I+ + +++D + N++ T RD+ L L+ + T+ Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL---HLVKVPTNN 57 Query: 55 GAQYLDYSSAKDEGHEKTHTEKKVL 79 G+ YS D+ K+ L Sbjct: 58 GSYK--YSLPADQRFNPLSKLKRSL 80
>PF04605#Virulence-associated protein D (VapD) Length = 125 Score = 29.8 bits (67), Expect = 0.008 Identities = 8/44 (18%), Positives = 17/44 (38%), Gaps = 2/44 (4%) Query: 227 INGYKVTSWNDLTEAV-DLATRD-LGPSQTIKVTYKSHQRLKTV 268 + ++ L E + DL +D + +Q+LK + Sbjct: 80 FDITEIGEQYSLKETIQDLCAKDFHQKLKEFTEKTPKNQKLKDL 123
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.7 bits (79), Expect = 6e-04 Identities = 14/56 (25%), Positives = 20/56 (35%), Gaps = 9/56 (16%) Query: 34 IVFVGPSGCGKSTTLRMIAGLEDISEGELKIGGEVVNDKSPKDRDIAMVFQNYALY 89 +V G G GKST + + GL+ S+ IG +D Y Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG---------TGKDSYEQIAGIVAY 645
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 34.0 bits (78), Expect = 6e-04 Identities = 10/30 (33%), Positives = 19/30 (63%) Query: 243 ALWSEHGNLVQTAQRLYIHRNSLQYKLDKF 272 AL + GN ++ A L ++RN+L+ K+ + Sbjct: 444 ALTATRGNQIKAADLLGLNRNTLRKKIREL 473
>STREPKINASE#Streptococcus streptokinase protein signature. Length = 440 Score = 799 bits (2064), Expect = 0.0 Identities = 389/440 (88%), Positives = 409/440 (92%) Query: 1 MKNYLSIGVIALLFALTFGTVKPVQAIAGYGWLLDRPPVNNSQLVVSMAGIVEGTDKKVF 60 MKNYLS G+ ALLFALTFGTV VQAIAG WLLDRP VNNSQLVVS+AG VEGT++ + Sbjct: 1 MKNYLSFGMFALLFALTFGTVNSVQAIAGPEWLLDRPSVNNSQLVVSVAGTVEGTNQDIS 60 Query: 61 INFFEIDLTSQPAHGGKTEQGLSPKSKPFATNSSAMPHKLEKADLLKAIQERLIANVHSN 120 + FFEIDLTS+PAHGGKTEQGLSPKSKPFAT+S AM HKLEKADLLKAIQE+LIANVHSN Sbjct: 61 LKFFEIDLTSRPAHGGKTEQGLSPKSKPFATDSGAMSHKLEKADLLKAIQEQLIANVHSN 120 Query: 121 DGYFEVIDFASDATITDRNGKVYFADKDDSVTLPTQPVQEFLLRGHVRVRPYKEKPIQTP 180 D YFEVIDFASDATITDRNGKVYFADKD SVTLPTQPVQEFLL GHVRVRPYKEKPIQ Sbjct: 121 DDYFEVIDFASDATITDRNGKVYFADKDGSVTLPTQPVQEFLLSGHVRVRPYKEKPIQNQ 180 Query: 181 AKSVDVRYTVQFTPLNPDDDFRPVLKNTKLLKTLAIGGTVTSQELLAQAQSILNESHPDY 240 AKSVDV YTVQFTPLNPDDDFRP LK+TKLLKTLAIG T+TSQELLAQAQSILN++HP Y Sbjct: 181 AKSVDVEYTVQFTPLNPDDDFRPGLKDTKLLKTLAIGDTITSQELLAQAQSILNKNHPGY 240 Query: 241 TIYERDSSIVTHDNDIFRTILPMDQEFTYHIKDREQAYGINKKSGQEEKTNNTDLISEKY 300 TIYERDSSIVTHDNDIFRTILPMDQEFTY +K+REQAY INKKSG E+ NNTDLISEKY Sbjct: 241 TIYERDSSIVTHDNDIFRTILPMDQEFTYRVKNREQAYRINKKSGLNEEINNTDLISEKY 300 Query: 301 YVLKKGEKPYDPFDRSHLKLFTINYVDVNTNKLLKSEQLLTASERNLDFRDLYDPRDKAK 360 YVLKKGEKPYDPFDRSHLKLFTI YVDV+TN+LLKSEQLLTASERNLDFRDLYDPRDKAK Sbjct: 301 YVLKKGEKPYDPFDRSHLKLFTIKYVDVDTNELLKSEQLLTASERNLDFRDLYDPRDKAK 360 Query: 361 LLYNNLDAFGIMDYTLTGKVEDNHDKNNRVVTVYMGKRPEGENASYHLAYDKDRYTEEER 420 LLYNNLDAFGIMDYTLTGKVEDNHD NR++TVYMGKRPEGENASYHLAYDKDRYTEEER Sbjct: 361 LLYNNLDAFGIMDYTLTGKVEDNHDDTNRIITVYMGKRPEGENASYHLAYDKDRYTEEER 420 Query: 421 EVYSYLRYTGTPIPDNPKDK 440 EVYSYLRYTGTPIPDNP DK Sbjct: 421 EVYSYLRYTGTPIPDNPNDK 440
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 63.9 bits (155), Expect = 7e-14 Identities = 38/87 (43%), Positives = 44/87 (50%), Gaps = 1/87 (1%) Query: 163 EMPEQPGEKAPEKSKEVTPAPEKPADKEANQTPE-RRNGNMAKTPVANNHRRLPSTGEQA 221 E + S+ P A Q P+ N K P+ R+LPSTGE A Sbjct: 453 EELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETA 512 Query: 222 NPFFTAAAVAVMTTAGVLAVTKRKENN 248 NPFFTAAA+ VM TAGV AV KRKE N Sbjct: 513 NPFFTAAALTVMATAGVAAVVKRKEEN 539
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 55.6 bits (134), Expect = 2e-10 Identities = 34/144 (23%), Positives = 55/144 (38%), Gaps = 10/144 (6%) Query: 60 DISLTLAGEVTANNSSKVKIDSSKGEVKDVFVKKGDVVKVGQPLFSYETSQRLTAQSSEF 119 +I T G++T + SK VK++ VK+G+ V+ G L +LTA +E Sbjct: 81 EIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLL------KLTALGAEA 134 Query: 120 DVQTKANQLQVAKTNAALKWETYNRKVNEINTLKSHYNSAPDESLLEQIRSAEDSVSQAL 179 D + Q + A L+ Y I K PDE + + E +L Sbjct: 135 DTL----KTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSL 190 Query: 180 SDAKTADSDVKAAQIELDKANATA 203 + + + Q EL+ A Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRA 214 Score = 39.4 bits (92), Expect = 2e-05 Identities = 27/180 (15%), Positives = 60/180 (33%), Gaps = 16/180 (8%) Query: 120 DVQTKANQLQVAKTNAALKWETYNRKVNEINTLKSHYN---SAPDESLLEQIRSAEDSVS 176 D + ++ +AK + Y VNE+ KS S + E + + Sbjct: 239 DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN 298 Query: 177 QALSDAKTADSDVKAAQIELDKANATAATEKGKLEYDTVKSDTAGTIVSLNTDLPNQSKS 236 + L + ++ +EL K + + +++ + + L Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEE-------RQQASVIRAPVSVKVQQLKVHTEGGVV- 350 Query: 237 KKENETFMEII-DKSKMLVKGNISEFDRDKLKIGQKVEV-IDRKDNSK--KWTGKVTQVG 292 ET M I+ + + V + D + +GQ + ++ ++ GKV + Sbjct: 351 -TTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNIN 409
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 27.2 bits (60), Expect = 0.006 Identities = 6/24 (25%), Positives = 9/24 (37%) Query: 22 YSKKVLADEPTSYQPPAAHSPCDD 45 + + A + T AA S D Sbjct: 27 KNPQPQAQQTTQTTTTAAGSAADQ 50
>STREPTOPAIN#Streptopain (C10) cysteine protease family signature. Length = 398 Score = 59.7 bits (144), Expect = 4e-14 Identities = 39/102 (38%), Positives = 56/102 (54%), Gaps = 7/102 (6%) Query: 2 EMHFVRTEPEARRIAETFCAENTQTKTPMRVQQLSYPSDTDHSGGEL-----YIYALSPA 56 + +F R E EA+ A TF ++ K R + D + GGEL Y+Y +S Sbjct: 28 DQNFARNEKEAKDSAITFIQKSAAIKAGARSAE-DIKLDKVNLGGELSGSNMYVYNISTG 86 Query: 57 GFIIVSGDTRAHTILGYSFDNNLDLN-HDNVRSMIEAYQKQI 97 GF+IVSGD R+ ILGYS + D N +N+ S +E+Y +QI Sbjct: 87 GFVIVSGDKRSPEILGYSTSGSFDANGKENIASFMESYVEQI 128
>STREPTOPAIN#Streptopain (C10) cysteine protease family signature. Length = 398 Score = 710 bits (1833), Expect = 0.0 Identities = 396/398 (99%), Positives = 397/398 (99%) Query: 1 MNKKKLGIRLLSLLALGGFVLANPVFADQNFARNEKEAKDSAITFIQKSAAIKAGARSAE 60 MNKKKLG+RLLSLLALGGFVLANPVFADQNFARNEKEAKDSAITFIQKSAAIKAGARSAE Sbjct: 1 MNKKKLGVRLLSLLALGGFVLANPVFADQNFARNEKEAKDSAITFIQKSAAIKAGARSAE 60 Query: 61 DIKLDKVNLGGELSGSNMYVYNISTGGFVIVSGDKRSPEILGYSTSGSFDANGKENIASF 120 DIKLDKVNLGGELSGSNMYVYNISTGGFVIVSGDKRSPEILGYSTSGSFDANGKENIASF Sbjct: 61 DIKLDKVNLGGELSGSNMYVYNISTGGFVIVSGDKRSPEILGYSTSGSFDANGKENIASF 120 Query: 121 MESYVEQIKENKKLDTTYAGTAEIKQPVVKSLLDSKGIHYNQGNPYNLLTPVIEKVKPGE 180 MESYVEQIKENKKLDTTYAGTAEIKQPVVKSLLDSKGIHYNQGNPYNLLTPVIEKVKPGE Sbjct: 121 MESYVEQIKENKKLDTTYAGTAEIKQPVVKSLLDSKGIHYNQGNPYNLLTPVIEKVKPGE 180 Query: 181 QSFVGQHAATGCVATATAQIMKYHNYPNKGLKDYTYTLSSNNPYFNHPKNLFAAISTRQY 240 QSFVGQHAATGCVATATAQIMKYHNYPNKGLKDYTYTLSSNNPYFNHPKNLFAAISTRQY Sbjct: 181 QSFVGQHAATGCVATATAQIMKYHNYPNKGLKDYTYTLSSNNPYFNHPKNLFAAISTRQY 240 Query: 241 NWNNILPTYSGRESNVQKMAISELMADVGISVDMDYGPSSGSAGSSRVQRALKENFGYNQ 300 NWNNILPTYSGRESNVQKMAISELMADVGISVDMDYGPSSGSAGSSRVQRALKENFGYNQ Sbjct: 241 NWNNILPTYSGRESNVQKMAISELMADVGISVDMDYGPSSGSAGSSRVQRALKENFGYNQ 300 Query: 301 SVHQINRSDFSKQDWEAQIDKELSQNQPVYYQGVGKVGGHAFVIDGADGRNFYHVNWGWG 360 SVHQINR DFSKQDWEAQIDKELSQNQPVYYQGVGKVGGHAFVIDGADGRNFYHVNWGWG Sbjct: 301 SVHQINRGDFSKQDWEAQIDKELSQNQPVYYQGVGKVGGHAFVIDGADGRNFYHVNWGWG 360 Query: 361 GVSDGFFRLDALNPSALGTGGGAGGFNGYQSAVVGIKP 398 GVSDGFFRLDALNPSALGTGGGAGGFNGYQSAVVGIKP Sbjct: 361 GVSDGFFRLDALNPSALGTGGGAGGFNGYQSAVVGIKP 398
>PF07212#Hyaluronoglucosaminidase Length = 336 Score = 30.0 bits (67), Expect = 0.021 Identities = 34/145 (23%), Positives = 58/145 (40%), Gaps = 22/145 (15%) Query: 242 GGQVMETVGIENMIGTLYT--EGPKLMAEVEAHTKSYDVDIIKAQLATSIEKKENIEVTL 299 G M+ G+E +GTL E P + A + + + +DI+K K++ + T Sbjct: 205 NGSAMQIRGVEKALGTLKITHENPNVEANYDENAAALSIDIVK--------KQKGGKGTA 256 Query: 300 ANGAVLQAKTAILALGAKWRNINVPGEDEFRNKGVTYCPHCDGPLFEGKDVAVIGGGNSG 359 A G + + + + RN+ +D+F K DG + K + GN Sbjct: 257 AQGIYINSTSGTTGKLLRIRNLG---DDKFYVKH-------DGGFYAKKTSQI--DGNLK 304 Query: 360 LEAALDLAGLAKHVYVLEFLPELKA 384 L+ A YV + +LKA Sbjct: 305 LKNPTADDHAATKAYVDSEVKKLKA 329
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 47.4 bits (113), Expect = 7e-08 Identities = 22/53 (41%), Positives = 32/53 (60%), Gaps = 6/53 (11%) Query: 46 IAIKDGLIVALG-SGEPDAE-----LVGPQTIMRSYKGKIATPGIIDCHTHLV 92 I +KDG I A+G +G PD + +VGP T + + +GKI T G +D H H + Sbjct: 88 IGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAGGMDSHIHFI 140
>SECA#SecA protein signature. Length = 901 Score = 25.2 bits (55), Expect = 0.021 Identities = 13/45 (28%), Positives = 24/45 (53%), Gaps = 2/45 (4%) Query: 23 AYDLFRKEVNFIEHDKHIEIYDELNKASAVIEDPSFLEAVEQAVE 67 A+ LF ++V++I D + I DE ++ + + + QAVE Sbjct: 316 AHALFTRDVDYIVKDGEVIIVDEHT--GRTMQGRRWSDGLHQAVE 358
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 35.2 bits (81), Expect = 0.001 Identities = 24/161 (14%), Positives = 57/161 (35%), Gaps = 16/161 (9%) Query: 276 GLSQLTQATTLSDEKAKGIQSLIVGLPVLNQGIQQLNTELSTLQPPNLNADELGNSLGAI 335 L +S+E+ + SLI +Q +T + LN D+ + Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIK---------EQFSTWQNQKYQKELNLDKKR-AERLT 218 Query: 336 AQAAKQVIAEETAAQNEELSALQA----TSVYQSLTAEQQGELAAALSQSDKSQTVSAAQ 391 A + + L + ++ + EQ+ + A ++ S + Sbjct: 219 VLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA--VNELRVYKSQLE 276 Query: 392 TILSSVQTLSTSLQSLSQEDQSKQLEQLKEAVAQIANQSNQ 432 I S + + Q ++Q +++ L++L++ I + + Sbjct: 277 QIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE 317
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 47.3 bits (112), Expect = 4e-09 Identities = 20/134 (14%), Positives = 46/134 (34%), Gaps = 11/134 (8%) Query: 4 RKENTKQAILKAMVMLLKTESFDDITTVKLSKRAGISRSSFYTHYKDKYEMID------- 56 + T+Q IL + L + + +++K AG++R + Y H+KDK ++ Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67 Query: 57 -YYQQTFFHKLEYIFEKKYQNKEQAFLEVFEFLQREQLLSSLLSANGTKEIQA---FIIN 112 + + + V E E+ L+ K ++ Sbjct: 68 SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQ 127 Query: 113 KVRLLITTDLQDKF 126 + + + + D+ Sbjct: 128 QAQRNLCLESYDRI 141
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 90.3 bits (224), Expect = 2e-22 Identities = 66/341 (19%), Positives = 135/341 (39%), Gaps = 22/341 (6%) Query: 18 KKLSSKHQHKFIQLLANLLSTGFSFAEVIAFLKRS--QLLQLDYVLKMEESLLKGQGLAD 75 +LS+ + LA L++ E + + + + + + +++G LAD Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122 Query: 76 MLSGLG--FSDAILTQISLADRHGNIETTLVAIQHYLNQMARIRRKTVEVITYPLILLLF 133 + F ++ + G+++ L + Y Q ++R + + + YP +L + Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182 Query: 134 LFVMMLGLRRYLVPQLETQNQ---------------ITYFLNHFPAFFIGFCSGLILLFG 178 ++ L +VP++ Q ++ + F + + + F Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242 Query: 179 MVWLRWRSQSRLKLYSRLSRYPFLGRLLKQYLTSYYAREWGTLIGQGLDLMTILDIMAIE 238 + LR + + R+ + RL P +GR+ + T+ YAR L + L+ + I Sbjct: 243 V-MLR-QEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300 Query: 239 KSSL-MKELAEDIRMSLLEGQAFHIKVATYPFFKKELSLMIEYGEIKSKLGAELEIYAQE 297 S+ + ++ EG + H + F + MI GE +L + LE A Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360 Query: 298 SWEQFFSQLYQVTQLIQPAIFLVVAVTIVMIYAAILLPIYQ 338 +F SQ+ L +P + + +A ++ I AIL PI Q Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQ 401 Score = 34.4 bits (79), Expect = 5e-04 Identities = 32/129 (24%), Positives = 60/129 (46%), Gaps = 6/129 (4%) Query: 216 REWGTLIGQGLDLMTILDIMAIE-KSSLMKELAEDIRMSLLEGQAFHIKVATYP-FFKKE 273 R+ TL+ + L LD +A + + + +L +R ++EG + + +P F++ Sbjct: 75 RQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERL 134 Query: 274 LSLMIEYGEIKSKLGAELEIYA--QESWEQFFSQLYQVTQLIQPAIFLVVAVTIVMIYAA 331 M+ GE L A L A E +Q S++ Q +I P + VVA+ +V I + Sbjct: 135 YCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQA--MIYPCVLTVVAIAVVSILLS 192 Query: 332 ILLPIYQNM 340 +++P Sbjct: 193 VVVPKVVEQ 201
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 52.6 bits (126), Expect = 4e-12 Identities = 28/94 (29%), Positives = 50/94 (53%), Gaps = 4/94 (4%) Query: 9 RHKKLKGFTLLEMLLVILVISVLMLLFVPNLSKQKDRVTETGNAAVVKLVENQAELYELS 68 K +GFTLLE+++VI++I VL L VPNL K++ + + + +EN ++Y+L Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62 Query: 69 QGSKPSLSQ-LKA--DGSITEKQEKAY-QDYYDK 98 P+ +Q L++ + Y ++ Y K Sbjct: 63 NHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIK 96
>OMPTIN#Omptin serine protease signature. Length = 317 Score = 26.5 bits (58), Expect = 0.037 Identities = 17/71 (23%), Positives = 25/71 (35%), Gaps = 9/71 (12%) Query: 37 LLKRSHYLARHDQDNWLLFSHQL--REELSGARFYKVADNK-LYVEKGKKVLAFGQFKSH 93 K S ++ D D ++ R ++ +Y VA N YV KV G + Sbjct: 217 TFKYSGWVESSDNDEHYDPGKRITYRSKVKDQNYYSVAVNAGYYVTPNAKVYVEGAWNRV 276 Query: 94 DFRKSASNGKG 104 N KG Sbjct: 277 T------NKKG 281
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 500 bits (1290), Expect = e-180 Identities = 209/401 (52%), Positives = 281/401 (70%), Gaps = 7/401 (1%) Query: 3 KTIAINAGSSSLKWQLYQMPEEEVLAQGIIERIGLKDSISTVKYDGKKEEQILDIHDHTE 62 K + IN GSSSLK+QL + + VLA+G+ ERIG+ DS+ T +G+K + D+ DH + Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKD 61 Query: 63 AVKILLNDLI--HFGIIAAYDEITGVGHRVVAGGELFKESVVVNDKVLEHIEELSVLAPL 120 A+K++L+ L+ +G+I EI VGHRVV GGE F SV++ D VL+ I + LAPL Sbjct: 62 AIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPL 121 Query: 121 HNPGAAAGIRAFRDILPDITSVCVFDTSFHTSMAKHTYLYPIPQKYYTDYKVRKYGAHGT 180 HNP GI+A I+PD+ V VFDT+FH +M + YLYPIP +YYT YK+RKYG HGT Sbjct: 122 HNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGT 181 Query: 181 SHKYVAQEAAKMLGRPLEELKLITAHIGNGVSITANYHGKSVDTSMGFTPLAGPMMGTRS 240 SHKYV+Q AA++L +P+E LK+IT H+GNG SI A +GKS+DTSMGFTPL G MGTRS Sbjct: 182 SHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRS 241 Query: 241 GDIDPAIIPYLIEQDPELKDAADVVNMLNKKSGLSGVSGISSDMRDI-EAGLQEDNPDAV 299 G IDP+II YL+E+ E A +VVN+LNKKSG+ G+SGISSD RD+ +A + + A Sbjct: 242 GSIDPSIISYLMEK--ENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQ 299 Query: 300 LAYNIFIDRIKKCIGQYFAVLNGADALVFTAGMGENAPLMRQDVIGGLTWFGMDIDPEKN 359 LA N+F R+KK IG Y A + G D +VFTAG+GEN P +R+ ++ GL + G +D EKN Sbjct: 300 LALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKN 359 Query: 360 -VFGYRGDISTPESKVKVLVISTDEELCIARDVERL-KNTK 398 V G IST +SKV V+V+ T+EE IA+D E++ ++ K Sbjct: 360 KVRGEEAIISTADSKVNVMVVPTNEEYMIAKDTEKIVESLK 400
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 28.6 bits (63), Expect = 0.045 Identities = 29/112 (25%), Positives = 47/112 (41%), Gaps = 7/112 (6%) Query: 34 EELQKRLINEIALLEEKAKHQLHEVVV-------KKETAITSLTNQLEQIEKEQSYLRQE 86 EE + L ++A L+ +A Q ++ + K+ L LEQ E + Sbjct: 34 EEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAP 93 Query: 87 ELAKKDQLIASLEAKLDKLASQNALELANQLAEKDKEVVSLTNQLDKLALEK 138 A+ QL++ + LD L S A L E ++V+ T +D AL K Sbjct: 94 IHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIK 145
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 47.4 bits (113), Expect = 5e-08 Identities = 42/191 (21%), Positives = 79/191 (41%), Gaps = 16/191 (8%) Query: 170 RKTVERAGIKVENIIISPLAMAKTILNEGEREFGATVIDMGGGQTTVASMRAQELQYTNI 229 R++ + AG + +I P+A A G+ V+D+GGG T VA + + Y++ Sbjct: 127 RESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSS 186 Query: 230 YAEGGEYITKDISKVLKTSLAI------AEALKFNFGQAEISEASITETVK-VDVV-GSE 281 GG+ + I ++ + AE +K G A + V+ ++ G Sbjct: 187 VRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVP 246 Query: 282 EPVEVTERYLSEIISARIRHILDRVKQDLER------GRLLDLPGGIVLIGGGAIMPGVV 335 + + E + + I+ V LE+ + + G+VL GGGA++ + Sbjct: 247 RGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISE--RGMVLTGGGALLRNLD 304 Query: 336 EIAQEIFGVTV 346 + E G+ V Sbjct: 305 RLLMEETGIPV 315
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 30.4 bits (68), Expect = 0.011 Identities = 20/99 (20%), Positives = 32/99 (32%), Gaps = 10/99 (10%) Query: 154 FEQEDQLSKVKHLGAVTKVFKDANQIPESTQLE-AVNEYFSRDLKTLLFIGGSAGAHVFN 212 FE ++K + + N + S+ E A N S K + G Sbjct: 83 FEALKAINKQTGI--------EINNVEPSSNFESAYNSALSAGHKIWVLNGFKHQQS-IK 133 Query: 213 QFISDHPELKQRYNIINITGDPHLNELSSHLYRVDYVTD 251 Q+I H E +R I I D + Y + + Sbjct: 134 QYIDAHREELERNQIKIIGIDFDIETEYKWFYSLQFNIK 172
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 186 bits (473), Expect = 4e-53 Identities = 102/477 (21%), Positives = 187/477 (39%), Gaps = 97/477 (20%) Query: 8 IRNVAIIAHVDHGKTTLVDELLKQSHTLDERKELQE--RAMDSNDLEKERGITILAKNTA 65 I N+ ++AHVD GKTTL + LL S + E + + D+ LE++RGITI T+ Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62 Query: 66 VAYNDVRINIMDTPGHADFGGEVERIMKMVDGVVLVVDAYEGTMPQTRFVLKKALEQNLI 125 + + ++NI+DTPGH DF EV R + ++DG +L++ A +G QTR + + + Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122 Query: 126 PIVVVNKIDKPSARP-------------------------------------AEVVDEVL 148 I +NKID+ + V E Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGN 182 Query: 149 ELFIELGADDEQLE-----------------FPVVYASAINGTSSLSDDPADQEHTMAPI 191 + +E + LE FPV + SA N + + Sbjct: 183 DDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIG------------IDNL 230 Query: 192 FDTIIDHIPAPVDNSDEPLQFQVSLLDYNDFVGRIGIGRVFRGTVKVGDQVTLSKLDGTT 251 + I + + L +V ++Y++ R+ R++ G + + D V +S Sbjct: 231 IEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRIS----EK 286 Query: 252 KNFRVTKLFGFFGLERREIQEAKAGDLIAVSGMEDIFVGETITPTDCVEALPILRIDEPT 311 + ++T+++ E +I +A +G+++ + E + + + T + + P Sbjct: 287 EKIKITEMYTSINGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPL 345 Query: 312 LQMTFLVNNSPFAGREGKWITSRKVEER--LLAELQT----DVSLRVDPTDSPDKWTVSG 365 LQ T + K ++R LL L D LR + + +S Sbjct: 346 LQTT---------------VEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSF 390 Query: 366 RGELHLSILIETMRRE-GYELQVSRPEVIIKEIDGVKCEPFERVQIDTPEEYQGAII 421 G++ + + ++ + E+++ P VI E K E + I+ P A I Sbjct: 391 LGKVQMEVTCALLQEKYHVEIEIKEPTVIYMERPLKKAE--YTIHIEVPPNPFWASI 445 Score = 42.5 bits (100), Expect = 4e-06 Identities = 18/79 (22%), Positives = 31/79 (39%), Gaps = 1/79 (1%) Query: 403 EPFERVQIDTPEEYQGAIIQSLSERKGDMLDMQMVGNGQTRLIFLIPARGLIGYSTEFLS 462 EP+ +I P+EY + +++D Q + N + L IPAR + Y ++ Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCIQEYRSDLTF 595 Query: 463 MTRGYGIMNHTFDQYLPVV 481 T G + Y Sbjct: 596 FTNGRSVCLTELKGYHVTT 614
>PF03309#Bvg accessory factor Length = 271 Score = 31.3 bits (71), Expect = 0.004 Identities = 29/126 (23%), Positives = 43/126 (34%), Gaps = 14/126 (11%) Query: 5 LLGIDLGGTTIKFGILTAAGEVQE---KWAIETNILEGGKHIVPDIVASIKHRLDLYGLS 61 LL ID+ T G+++ +G+ + +W I T + D +A L G Sbjct: 2 LLAIDVRNTHTVVGLISGSGDHAKVVQQWRIRTE-----PEVTADELALTIDG--LIGDD 54 Query: 62 SADFVGIGMGSPGAVDRDTNTVTGAFNLNWKETQEVGSVVEKELGIPFAIDNDANVAALG 121 + G S V + V W V GIP +DN V A Sbjct: 55 AERLTGASGLS--TVPSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGA-- 110 Query: 122 ERWVGA 127 +R V Sbjct: 111 DRIVNC 116
>ARGDEIMINASE#Bacterial arginine deiminase signature. Length = 409 Score = 578 bits (1492), Expect = 0.0 Identities = 191/410 (46%), Positives = 276/410 (67%), Gaps = 9/410 (2%) Query: 5 TPIHVYSEIGKLKKVLLHRPGKEIENLMPDYLERLLFDDIPFLEDAQKEHDAFAQALRDE 64 PI+++SEIG+LKKVLLHRPG+E+ENL P ++ LFDDIP+LE A++EH+ FA L++ Sbjct: 6 NPINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFASILKNN 65 Query: 65 GIEVLYLETLAAESLVTP-EIREAFIDEYLSEANIRGRATKKAIRELLMAIEDNQELIEK 123 +E+ Y+E L +E LV+ + FI +++ EA I+ T +++ ++ +I K Sbjct: 66 LVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFTINLLKDYFSSL-TIDNMISK 124 Query: 124 TMAGVQKSELPEIPASEKGLTDLVESNYPFAIDPMPNLYFTRDPFATIGTGVSLNHMFSE 183 ++GV EL +S L DLV F IDPMPN+ FTRDPFA+IG GV++N MF++ Sbjct: 125 MISGVVTEELKNYTSS---LDDLVNGANLFIIDPMPNVLFTRDPFASIGNGVTINKMFTK 181 Query: 184 TRNRETLYGKYIFTHHPIYGGGKVPMVYDRNETTRIEGGDELVLSKDVLAVGISQRTDAA 243 R RET++ +YIF +HP+Y VP+ +R E +EGGDELVL+K +L +GIS+RT+A Sbjct: 182 VRQRETIFAEYIFKYHPVYKE-NVPIWLNRWEEASLEGGDELVLNKGLLVIGISERTEAK 240 Query: 244 SIEKLLVNIFKQNLGFKKVLAFEFANNRKFMHLDTVFTMVDYDKFTIHPEIEGDLRVYSV 303 S+EKL +++FK F +LAF+ NR +MHLDTVFT +DY FT + +Y + Sbjct: 241 SVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTSFTSDDMYFSIYVL 300 Query: 304 TYDNE--ELHIVEEKGDLAELLAANLGVEKVDLIRCGGDNLVAAGREQWNDGSNTLTIAP 361 TY+ ++HI +EK + ++L+ LG K+D+I+C G +L+ REQWNDG+N L IAP Sbjct: 301 TYNPSSSKIHIKKEKARIKDVLSFYLG-RKIDIIKCAGGDLIHGAREQWNDGANVLAIAP 359 Query: 362 GVVVVYNRNTITNAILESKGLKLIKIHGSELVRGRGGPRCMSMPFEREDI 411 G ++ Y+RN +TN + E G+K+ +I SEL RGRGGPRCMSMP REDI Sbjct: 360 GEIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIREDI 409
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 123 bits (311), Expect = 4e-39 Identities = 60/146 (41%), Positives = 92/146 (63%), Gaps = 2/146 (1%) Query: 1 MNKKETRHQLIRSLISETTIHTQQELQERLQKNGITITQATLSRDMKELNLVKVTSGNDT 60 MNK + RH IR +I+ I TQ EL + L+K+G +TQAT+SRD+KEL+LVKV + N + Sbjct: 1 MNKGQ-RHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHLVKVPTNNGS 59 Query: 61 HYEALAISQTRWEH-RLRFYMEDALVMLKIVQHQIILKTLPGLAQSFGSILDAMQIPEIV 119 + +L Q +L+ + DA V + H I+LKT+PG AQ+ G+++D + EI+ Sbjct: 60 YKYSLPADQRFNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEEIM 119 Query: 120 ATVCGDDTCLIVCEDNEQAKACYETL 145 T+CGDDT LI+C ++ K + + Sbjct: 120 GTICGDDTILIICRTHDDTKVVQKKI 145
>PF06580#Sensor histidine kinase Length = 349 Score = 181 bits (460), Expect = 4e-54 Identities = 57/203 (28%), Positives = 100/203 (49%), Gaps = 10/203 (4%) Query: 362 EKAIGQYRLQALASQINPHFLYNTLDTIIWMAEFNDSKRVVEVTKSLAKYFRLALNQGN- 420 + +L AL +QINPHF++N L+ I + D + E+ SL++ R +L N Sbjct: 155 ASMAQEAQLMALKAQINPHFMFNALNNIRALIL-EDPTKAREMLTSLSELMRYSLRYSNA 213 Query: 421 EYIRLADELDHVSQYLFIQKQRYGDKLSYEVQGLDVYADFVIPKLILQPLVENAIYHGIK 480 + LADEL V YL + ++ D+L +E Q D +P +++Q LVEN I HGI Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIA 273 Query: 481 EVDRKGMIKVTVSDTAQHLMLTVWDNGKGIEDSSLTNSQSLLARGGVGLKNVDQRLKLHY 540 ++ + G I + + + L V + G ++ ++ G GL+NV +RL++ Y Sbjct: 274 QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKEST-------GTGLQNVRERLQMLY 326 Query: 541 GEGYHMTIHSQSDHFTEIQLSLP 563 G + + + + + +P Sbjct: 327 GTEAQIKLSEKQGKVNAM-VLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 94.9 bits (236), Expect = 1e-24 Identities = 43/165 (26%), Positives = 75/165 (45%), Gaps = 12/165 (7%) Query: 3 SLLIVEDEYLIRQGIRSLVDFSQFKIDRVNEAENGQLAWDLFQKEPYDIVLTDINMPKLN 62 ++L+ +D+ IR + + + + V N W D+V+TD+ MP N Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY---DVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 63 GIQLAELIKQESPQTHLVFLTGYDDFNYALSALKLGADDYLLKPFSKADVEDMLGKLQQK 122 L IK+ P ++ ++ + F A+ A + GA DYL KPF D+ +++G + + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF---DLTELIGIIGRA 118 Query: 123 LDLSKKTETIQELVEQPQKEVSAIAMAIHE------RLADSDLTL 161 L K+ + E Q + + A+ E RL +DLTL Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 35.6 bits (82), Expect = 6e-04 Identities = 37/208 (17%), Positives = 62/208 (29%), Gaps = 29/208 (13%) Query: 171 LKLDLNKANEQTASLQASINGLRQEYQDAERKLSASYQTGINGLKA-TMANDKY--DLKA 227 LKL A T Q+S L Q + R S +N L + ++ Y ++ Sbjct: 125 LKLTALGAEADTLKTQSS---LLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSE 181 Query: 228 EIQATARGLSQE----YDNKLHQLSAKITTTSS--GTTEAYENKLEGLRAEFTRSNQGMR 281 E L +E + N+ +Q + + T A N+ E L Sbjct: 182 EEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS 241 Query: 282 T-------------ELESQISGLRAVQQSTASQISQEIRNREGAVSRVQQNLASYQR--- 325 + E E++ + SQ+ Q A Q ++ Sbjct: 242 SLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL 301 Query: 326 -RLQSAEGNYNSLRETVAGYERRISNQD 352 +L+ N L +A E R Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQASV 329
>PF07212#Hyaluronoglucosaminidase Length = 336 Score = 507 bits (1306), Expect = 0.0 Identities = 232/367 (63%), Positives = 271/367 (73%), Gaps = 35/367 (9%) Query: 4 EVASARIQHRGMTTQGWESSSDILMEREIGIDMTTGYPKVGDGKNKFKDLKDLRGPMGPQ 63 E R+Q + MT + W S IL+E EIG + TGY K GDGKN+F LK L Sbjct: 3 ETIPLRVQFKRMTAEEWTRSDVILLESEIGFETDTGYAKFGDGKNQFSKLKYL------- 55 Query: 64 GPTGERGPIGPTGPIGKTGTTDYNQLQNKPNLDAFAQKKETNSKITKLESSKADKSAVYS 123 NKP+L AFAQK+ETNSKITKLESSKADK+AVY Sbjct: 56 ---------------------------NKPDLGAFAQKEETNSKITKLESSKADKNAVYL 88 Query: 124 KAESKIELDKKLSLTGGIVTGQLQFKPNKSGIKPSSSVGGAINIDMSKSEGAAMVMYTNK 183 KAESKIELDKKL+L GG++TGQLQFKPNKSGIKPSSSVGGAINIDMSKSEGA +V+Y+N Sbjct: 89 KAESKIELDKKLNLKGGVMTGQLQFKPNKSGIKPSSSVGGAINIDMSKSEGAGVVVYSNN 148 Query: 184 DTTDGPLMILRSDKDTFDQSAQFVDYSGKTNAVNIVMRQPSEPNFSSALNITSANEGGSA 243 DT+DGPLM LR+ K+TF+QSA FVDYSGKTNAVNI MRQP+ PNFSSALNITS NE GSA Sbjct: 149 DTSDGPLMSLRTGKETFNQSALFVDYSGKTNAVNIAMRQPTTPNFSSALNITSGNENGSA 208 Query: 244 MQIRGIERKLGTLKITHENPSANAKYDENAAALSIDIVGKRGASGNGTAAQGIFINSSAG 303 MQIRG+E+ LGTLKITHENP+ A YDENAAALSIDIV K+ G GTAAQGI+INS++G Sbjct: 209 MQIRGVEKALGTLKITHENPNVEANYDENAAALSIDIV-KKQKGGKGTAAQGIYINSTSG 267 Query: 304 TTGKMLRIRNKNKDKFYVNPDGGFHSYASSTVAGNLTVNDPISEKHAATKDYVDKAISEL 363 TTGK+LRIRN DKFYV DGGF++ +S + GNL + +P ++ HAATK YVD + +L Sbjct: 268 TTGKLLRIRNLGDDKFYVKHDGGFYAKKTSQIDGNLKLKNPTADDHAATKAYVDSEVKKL 327 Query: 364 KKLIPKK 370 K L+ K Sbjct: 328 KALLMDK 334
>SSPAMPROTEIN#Salmonella surface presentation of antigen gene type M signature. Length = 147 Score = 29.3 bits (65), Expect = 0.032 Identities = 23/65 (35%), Positives = 29/65 (44%), Gaps = 6/65 (9%) Query: 387 ERINALENNQKVITNNQKQFELNLPKYLNDINGKRVWYEKPDDNIEHKIGDYWFEKNGKY 446 E I AL Q ++ K EL + + I KR EK + + K YW K G Y Sbjct: 66 EEIYALLRKQSIVRRQIKDLELQIIQ----IQEKRSELEKKREEFQEK-SKYWLRKEGNY 120 Query: 447 QRTWI 451 QR WI Sbjct: 121 QR-WI 124
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 49.3 bits (117), Expect = 8e-08 Identities = 35/205 (17%), Positives = 70/205 (34%), Gaps = 17/205 (8%) Query: 461 LTKESDETKKLKKEQEGLVESNKQLRDSVREGVQERKKGLESVKESTAAHQKLADEIIKL 520 T +S + K L+ E+ L L + + + +A + L E L Sbjct: 136 STADSAKIKTLEAEKAALAARKADLE-------KALEGAMNFSTADSAKIKTLEAEKAAL 188 Query: 521 AAKENKTAGEKQNLKNKIDQLNGSIDGLNLAYDKNSNSLSHNADQIKSRISAMEAESTWQ 580 A++ + + N + I L + + ++ + S Sbjct: 189 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKA----DLEKALEGAMNFS--- 241 Query: 581 TAQQNLLNIEQKRSEVSKKLAENADLRKKWNEEANVSDSVRKEKIAELTEEEAKLKNMQT 640 + I+ +E + A A+L K E A + KI L E+A L+ + Sbjct: 242 --TADSAKIKTLEAEKAALEARQAELEKA-LEGAMNFSTADSAKIKTLEAEKAALEAEKA 298 Query: 641 QLQEEYNKTSATQQAAADAMAAAEE 665 L+ + +A +Q+ + A+ E Sbjct: 299 DLEHQSQVLNANRQSLRRDLDASRE 323 Score = 30.4 bits (68), Expect = 0.045 Identities = 42/240 (17%), Positives = 77/240 (32%), Gaps = 33/240 (13%) Query: 461 LTKESDETKKLKKEQEGLVESNKQLRDSVREGVQERKKGLESVKESTAAHQKLADEIIKL 520 T +S + K L+ E+ L L ++ + +K A L +L Sbjct: 206 STADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAEL 265 Query: 521 AAKENKTAGEKQNLKNKIDQLNGSIDGL----------NLAYDKNSNSLSHNADQIKSRI 570 KI L L + + N SL + D + Sbjct: 266 EKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAK 325 Query: 571 SAMEAESTWQTAQQNLLNIEQKRSEVSKKLAENADLRK-------KWNEEANVSDSVR-- 621 +EAE Q ++ E R + + L + + +K K E+ +S++ R Sbjct: 326 KQLEAEH--QKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQS 383 Query: 622 -----------KEKI-AELTEEEAKLKNMQTQLQEEYNKTSATQQAAADAMAAAEESGSA 669 K+++ L E +KL ++ +E T++ A+ A E A Sbjct: 384 LRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKA 443
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 28.6 bits (64), Expect = 0.041 Identities = 19/76 (25%), Positives = 32/76 (42%), Gaps = 5/76 (6%) Query: 264 LASVATSIVGVVSFLGL---IVPHMSRLLVGSKHQILIPFSALLGAFVFLLADTLGRSLA 320 + S A + +GL H S+L++ Q +PFS L V + L Sbjct: 29 VVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFF-YLC 87 Query: 321 YPLEISPAIIMSIVGG 336 +PL ++ A +M+I Sbjct: 88 FPL-LTVAALMAIASH 102
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 68.8 bits (168), Expect = 7e-15 Identities = 56/266 (21%), Positives = 102/266 (38%), Gaps = 26/266 (9%) Query: 304 VACVNQHPKTAKETEQQRIVATSVAVVDICDRLNLDLVGVCDSKLYTL----PKRYDAVK 359 + A + RIVA V++ L + GV D+ Y L P D+V Sbjct: 20 PLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVI 79 Query: 360 RVGLPMNPDIELIASLKPTWILSPNSLQEDLEPKYQKLDTEYGFLNLRSVEG------MY 413 VGL P++EL+ +KP++++ P + L +G Sbjct: 80 DVGLRTEPNLELLTEMKPSFMV----WSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMAR 135 Query: 414 QSIDDLGNLFQRQQEAKELRQQYQDYYRAFQAKRKGK-KKPKVLILMGLPGSYLVATNQS 472 +S+ ++ +L Q A+ QY+D+ R+ + + + +P +L + P LV S Sbjct: 136 KSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNS 195 Query: 473 YVGNLLDLAGGENVYQSDEKEFLSANP---EDMLA-KEPDLILRTAHAIPDKVKVMFDKE 528 +LD G N +Q E F + + + A K+ D++ D +M Sbjct: 196 LFQEILDEYGIPNAWQG-ETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALM---- 250 Query: 529 FAENDIWKHFTAVKEGKVYDLDNTLF 554 +W+ V+ G+ + F Sbjct: 251 --ATPLWQAMPFVRAGRFQRVPAVWF 274
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 29.2 bits (65), Expect = 0.030 Identities = 12/36 (33%), Positives = 15/36 (41%) Query: 117 KPTDQPKPTDQPKPSPSKVDTAPASSLSRQLPEVRT 152 KP K +QPK V++ PAS P T Sbjct: 99 KPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLT 134
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 344 bits (883), Expect = e-119 Identities = 122/367 (33%), Positives = 195/367 (53%), Gaps = 21/367 (5%) Query: 7 RPTVARVNLQAIKENVASVQKHIPLGVKTYAVVKADAYGHGAVQVSKALLPQVDGYCVSN 66 RP A ++LQA+K+N++ V++ + ++VVKA+AYGHG ++ A+ DG+ + N Sbjct: 3 RPIQASLDLQALKQNLSIVRQAAT-HARVWSVVKANAYGHGIERIWSAI-GATDGFALLN 60 Query: 67 LDEALQLRQAGIDKEILIL-GVLLPNELKLAITRQVTVTVASLEWLAMAKQEWPDLKG-L 124 L+EA+ LR+ G IL+L G +L++ ++T V S L + LK L Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNA--RLKAPL 118 Query: 125 KVHIKIDSGMGRIGLRSVTEVDNLIAGLKSMGAD-VEGIFTHFATADEADDTKFNQQLQF 183 +++K++SGM R+G + V + L++M + +HFA A+ D + Sbjct: 119 DIYLKVNSGMNRLGFQP-DRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGIS--GAMAR 175 Query: 184 FKKLIAGLEDKPRLVHASNSATSIWHSDTIFNAVRLGIVSYGLNPSGS-DLSLPFPLQEA 242 ++ GL SNSA ++WH + F+ VR GI+ YG +PSG L+ Sbjct: 176 IEQAAEGL---ECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPV 232 Query: 243 LSLESSLVHVKMISAGDTVGYGATYTAKKSEYVGTVPIGYADGWTRNM-QGFSVLVDGQF 301 ++L S ++ V+ + AG+ VGYG YTA+ + +G V GYADG+ R+ G VLVDG Sbjct: 233 MTLSSEIIGVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVR 292 Query: 302 CEIIGRVSMDQLTIRLSKA--YPLGTKVTLIGSNQQKNISTTDIANYRNTINYEVLCLLS 359 +G VSMD L + L+ +GT V L G K I D+A T+ YE++C L+ Sbjct: 293 TMTVGTVSMDMLAVDLTPCPQAGIGTPVELWG----KEIKIDDVAAAAGTVGYELMCALA 348 Query: 360 DRIPRIY 366 R+P + Sbjct: 349 LRVPVVT 355
>SECA#SecA protein signature. Length = 901 Score = 1052 bits (2723), Expect = 0.0 Identities = 394/903 (43%), Positives = 560/903 (62%), Gaps = 73/903 (8%) Query: 1 MANILRKVIENDKG-ELRKLEKIAKKVESYADQMASLSDRDLQGKTLEFKERYQKGETLE 59 + +L KV + LR++ K+ + + +M LSD +L+GKT EF+ R +KGE LE Sbjct: 2 LIKLLTKVFGSRNDRTLRRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLE 61 Query: 60 QLLPEAFAVVREAAKRVLGLFPYRVQIMGGIVLHNGDVPEMRTGEGKTLTATMPVYLNAI 119 L+PEAFAVVREA+KRV G+ + VQ++GG+VL+ + EMRTGEGKTLTAT+P YLNA+ Sbjct: 62 NLIPEAFAVVREASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNAL 121 Query: 120 AGEGVHVITVNEYLSTRDATEMGEVYSWLGLSVGINLAAKSPAEKREAYNCDITYSTNSE 179 G+GVHV+TVN+YL+ RDA ++ +LGL+VGINL KREAY DITY TN+E Sbjct: 122 TGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAADITYGTNNE 181 Query: 180 VGFDYLRDNMVVRQEDMVQRPLNFALVDEVDSVLIDEARTPLIVSGAVSSETNQLYIRAD 239 GFDYLRDNM E+ VQR L++ALVDEVDS+LIDEARTPLI+SG + ++Y R + Sbjct: 182 YGFDYLRDNMAFSPEERVQRKLHYALVDEVDSILIDEARTPLIISGPAEDSS-EMYKRVN 240 Query: 240 MFVKTLT------------SVDYVIDVPTKTIGLSDSGIDKAESYFNLS-------NLYD 280 + L + +D ++ + L++ G+ E +LY Sbjct: 241 KIIPHLIRQEKEDSETFQGEGHFSVDEKSRQVNLTERGLVLIEELLVKEGIMDEGESLYS 300 Query: 281 IENVALTHFIDNALRANYIMLLDIDYVVSEDGEILIVDQFTGRTMEGRRFSDGLHQAIEA 340 N+ L H + ALRA+ + D+DY+V +DGE++IVD+ TGRTM+GRR+SDGLHQA+EA Sbjct: 301 PANIMLMHHVTAALRAHALFTRDVDYIV-KDGEVIIVDEHTGRTMQGRRWSDGLHQAVEA 359 Query: 341 KEGVRIQEESKTSASITYQNMFRMYKKLAGMTGTAKTEEEEFREVYNMRIIPIPTNRPIA 400 KEGV+IQ E++T ASIT+QN FR+Y+KLAGMTGTA TE EF +Y + + +PTNRP+ Sbjct: 360 KEGVQIQNENQTLASITFQNYFRLYEKLAGMTGTADTEAFEFSSIYKLDTVVVPTNRPMI 419 Query: 401 RIDHTDLLYPTLESKFRAVVEDVKTRHAKGQPILVGTVAVETSDLISRKLVEAGIPHEVL 460 R D DL+Y T K +A++ED+K R AKGQP+LVGT+++E S+L+S +L +AGI H VL Sbjct: 420 RKDLPDLVYMTEAEKIQAIIEDIKERTAKGQPVLVGTISIEKSELVSNELTKAGIKHNVL 479 Query: 461 NAKNHFKEAQIIMNAGQRGAVTIATNMAGRGTDIKLG----------------------- 497 NAK H EA I+ AG AVTIATNMAGRGTDI LG Sbjct: 480 NAKFHANEAAIVAQAGYPAAVTIATNMAGRGTDIVLGGSWQAEVAALENPTAEQIEKIKA 539 Query: 498 ------EGVRELGGLCVIGTERHESRRIDNQLRGRSGRQGDPGESQFYLSLEDDLMRRFG 551 + V E GGL +IGTERHESRRIDNQLRGRSGRQGD G S+FYLS+ED LMR F Sbjct: 540 DWQVRHDAVLEAGGLHIIGTERHESRRIDNQLRGRSGRQGDAGSSRFYLSMEDALMRIFA 599 Query: 552 SDRIKAFLDRMKLDEEDTVIKSGMLGRQVESAQKRVEGNNYDTRKQVLQYDDVMREQREI 611 SDR+ + ++ + + I+ + + + +AQ++VE N+D RKQ+L+YDDV +QR Sbjct: 600 SDRVSGMMRKLGM-KPGEAIEHPWVTKAIANAQRKVESRNFDIRKQLLEYDDVANDQRRA 658 Query: 612 IYANRRDVITANRDLGPEIKAMIKRTIDRAVDAHARSNR---KDAIDAIVTFARTSLVPE 668 IY+ R +++ + D+ I ++ + +DA+ I + + + Sbjct: 659 IYSQRNELLDVS-DVSETINSIREDVFKATIDAYIPPQSLEEMWDIPGLQERLKNDFDLD 717 Query: 669 ESIS--AKELRGLKDDQIKEKLYQRALAIYDQQLSKLRDQEAIIEFQKVLILMIVDNKWT 726 I+ + L ++ ++E++ +++ +Y ++ + E + F+K ++L +D+ W Sbjct: 718 LPIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVG-AEMMRHFEKGVMLQTLDSLWK 776 Query: 727 EHIDALDQLRNAVGLRGYAQNNPVVEYQAEGFKMFQDMIGAIEFDVTRTMMKAQIH-EQE 785 EH+ A+D LR + LRGYAQ +P EY+ E F MF M+ +++++V T+ K Q+ +E Sbjct: 777 EHLAAMDYLRQGIHLRGYAQKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEE 836 Query: 786 RERASQRATTAAPQNIQSQQSANTDD-------------LPKVERNEACPCGSGKKFKNC 832 E Q+ A + Q QQ ++ DD KV RN+ CPCGSGKK+K C Sbjct: 837 VEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCGSGKKYKQC 896 Query: 833 HGR 835 HGR Sbjct: 897 HGR 899
>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature. Length = 522 Score = 26.7 bits (58), Expect = 0.015 Identities = 25/84 (29%), Positives = 42/84 (50%), Gaps = 7/84 (8%) Query: 10 QAQKLQKQMEQKQADLAAMQFTGKSAQDLVTA-----TFTGDKKLVGIDFKEAVVDPEDV 64 QAQK QK +K+ + A + ++L A + +K L + ++ + + + Sbjct: 157 QAQKAQKDKREKRKEERAKNRA--NLENLTNAMSNPQNLSNNKNLSELIKQQRENELDQM 214 Query: 65 ETLQDMTTQAINDALTQIDEATKK 88 E L+DM QA +AL QI+E KK Sbjct: 215 ERLEDMQEQAQANALKQIEELNKK 238
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 27.9 bits (62), Expect = 0.022 Identities = 14/57 (24%), Positives = 27/57 (47%), Gaps = 1/57 (1%) Query: 72 KNQKAWKKLQWKMGISIFLAIVSY-VGLILLSNYLQKFWLVYVAMGLFLPGFSWLVI 127 + Q+ ++Q M L +V+ V ILLS + K ++ M LP + +++ Sbjct: 161 QRQQMRSRIQQAMIYPCVLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLM 217
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.8 bits (69), Expect = 0.003 Identities = 24/102 (23%), Positives = 45/102 (44%), Gaps = 10/102 (9%) Query: 84 EETKQRELLEILVDEKNTEITRLYEQLKAKDAQLASKDEQMRVKDVQIAEKDKQLDQQQQ 143 E ++E + +E++ T + AK+A+ K + Q E + + Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKA------NTQTNEVAQS--GSET 1092 Query: 144 LTAKAMADKETLKLELEE-AKAEANQARLQVEEVQAEVGPKK 184 + KET +E EE AK E + + +V +V ++V PK+ Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQ-EVPKVTSQVSPKQ 1133
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 28.9 bits (64), Expect = 0.035 Identities = 15/56 (26%), Positives = 27/56 (48%), Gaps = 1/56 (1%) Query: 7 IIIGGGPAGMMAAISSSYYGYKTLLIEKNRRLGKKLAGTGGGRCNVTNSGNLDVLM 62 + +G PAG+ ++Y K + + LG +LA RCN+ + G+ + M Sbjct: 140 VTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEY-NIRCNIVSPGSTETDM 194
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.022 Identities = 9/16 (56%), Positives = 12/16 (75%) Query: 45 IIGASGSGKSLLAHAI 60 I G SG+GK L+A A+ Sbjct: 165 ITGESGTGKELVARAL 180
>PF05616#Neisseria meningitidis TspB protein Length = 501 Score = 34.3 bits (78), Expect = 0.002 Identities = 24/87 (27%), Positives = 35/87 (40%), Gaps = 2/87 (2%) Query: 226 IPKKDLSPSELAAAQAYWSQKQGRGARPSDY-RPTPAPGRRKAPIPDVTPNPGQGHQPD- 283 IP+ DL+P A A + P++ P PG R P PD NP D Sbjct: 310 IPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDTDG 369 Query: 284 NGGYHPAPPRPNDASQNKHQRDEFKGK 310 G P P D +H+++ +G+ Sbjct: 370 QPGTRPDSPAVPDRPNGRHRKERKEGE 396
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 247 bits (633), Expect = 3e-83 Identities = 82/323 (25%), Positives = 143/323 (44%), Gaps = 34/323 (10%) Query: 1 MKKGFFLMAMAVSLVMIAGCDKSANPKQPTQGMSVVTSFYPMYAMTKEVSGDLNDVR-MI 59 MKK L+ + +S +++ C Q + VV + + +TK ++GD D+ ++ Sbjct: 1 MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIV 60 Query: 60 QSGAGIHSFEPSVNDVAAIYDADLFVYHSHTLE----AWARDLDPNLKKSKVDVFEASKP 115 G H +EP DV +ADL Y+ LE AW L N KK++ + A Sbjct: 61 PIGQDPHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFA--- 117 Query: 116 LTLDRVKGLEDMEVTQGIDPATLY--------DPHTWTDPVLAGEEAVNIAKELGRLDPK 167 V+ G+D L DPH W + A NIAK+L DP Sbjct: 118 -------------VSDGVDVIYLEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPN 164 Query: 168 HKDSYTKKAKAFKKEAEQLTEEYTQKFKKVR--SKTFVTQHTAFSYLAKRFGLKQLGISG 225 +K+ Y K K + + ++L +E KF K+ K VT AF Y +K +G+ I Sbjct: 165 NKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWE 224 Query: 226 ISPEQEPSPRQLKEIQDFVKEYNVKTIFAEDNVNPKIAHAIAKSTGAKVKT---LSPLEA 282 I+ E+E +P Q+K + + +++ V ++F E +V+ + +++ T + + Sbjct: 225 INTEEEGTPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAE 284 Query: 283 APSGNKTYLENLRANLEVLYQQL 305 +Y ++ NL+ + + L Sbjct: 285 QGKEGDSYYSMMKYNLDKIAEGL 307
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 107 bits (269), Expect = 3e-27 Identities = 50/226 (22%), Positives = 85/226 (37%), Gaps = 47/226 (20%) Query: 117 KAGKGAGTVVAVIDAGFDKNHEAWRLTDKTKARYQSKEDLEKAKKEHGITYGEWVNDKIA 176 +G G VAV+D G D +H DL KA+ G + + Sbjct: 36 NQTRGRGVKVAVLDTGCDADHP----------------DL-KARIIGGRNFTDDDEGDPE 78 Query: 177 YYHDYSKDGKTAVDQEHGTHVSGILSGNAPSETKEPYRLEGAMPEAQLLLMRVEIVNGLA 236 + DY+ HGTHV+G ++ + G PEA LL+++V G Sbjct: 79 IFKDYNG---------HGTHVAGTIAATENE-----NGVVGVAPEADLLIIKVLNKQGSG 124 Query: 237 DYARNYAQAIRDAVNLGAKVINMSFGNAALAYANLPDETKKAFDYAKSKGVSIVTSAGND 296 Y Q I A+ +I+MS G E +A A + + ++ +AGN+ Sbjct: 125 QYD-WIIQGIYYAIEQKVDIISMSLGGPED-----VPELHEAVKKAVASQILVMCAAGNE 178 Query: 297 SSFGGKTRLPLADHPDYGVVGTPAAADSTLTVASYSPDKQLTETAT 342 +T +G P + ++V + + D+ +E + Sbjct: 179 GDGDDRT----------DELGYPGCYNEVISVGAINFDRHASEFSN 214 Score = 80.3 bits (198), Expect = 3e-18 Identities = 37/139 (26%), Positives = 58/139 (41%), Gaps = 22/139 (15%) Query: 457 NATPKVLPTASGTK---LSRFSSWGLTADGNIKPDIAAPGQDILSSVANNKYAKLSGTSM 513 +V+ + S FS+ + D+ APG+DILS+V KYA SGTSM Sbjct: 192 GCYNEVISVGAINFDRHASEFSNSNN------EVDLVAPGEDILSTVPGGKYATFSGTSM 245 Query: 514 SAPLVAGIMGL-LQKQYETQYPDMTPSERLDLAKKVLMSSATALYDEDEKAYFSPRQQGA 572 + P VAG + L Q + D+T E L+ L + SP+ +G Sbjct: 246 ATPHVAGALALIKQLANASFERDLTEPE----LYAQLIKRTIPLGN-------SPKMEGN 294 Query: 573 GAVDAKKASA-ATMYVTDK 590 G + + ++ T + Sbjct: 295 GLLYLTAVEELSRIFDTQR 313
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 172 bits (438), Expect = 9e-51 Identities = 223/361 (61%), Positives = 259/361 (71%), Gaps = 15/361 (4%) Query: 55 KARELLNKYDVENSMLQANNDKLTTENKNLTDQNKELKAEENRLTTENKGLTKKLSEAEE 114 + + L ++ A L E L + +L+ + + K+ E Sbjct: 194 ELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEA 253 Query: 115 EAANKEQESKETIGTLKKILDETVKDKIAREQKSKQDIGALKQELAKKDEGNKVSEASRK 174 E A E E + + A+ + + + AL+ E A + ++V A+R+ Sbjct: 254 EKAALEARQAE-LEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQ 312 Query: 175 GLRRDLDASREAKKQVEKDLANLTAELDKVKEEKQISDASRKGLRRDLDASREAKKQVEK 234 LRRDLDASREAKKQ+E AE K++E+ +IS+ASR+ LRRDLDASREAKKQ+E Sbjct: 313 SLRRDLDASREAKKQLE-------AEHQKLEEQNKISEASRQSLRRDLDASREAKKQLE- 364 Query: 235 DLANLTAELDKVKEEKQISDASRQGLRRDLDASREAKKQVEKALEEANSKLAALEKLNKE 294 AE K++E+ +IS+ASRQ LRRDLDASREAKKQVEKALEEANSKLAALEKLNKE Sbjct: 365 ------AEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKE 418 Query: 295 LEESKKLTEKEKAELQAKLEAEAKALKEQLAKQAEELAKLRAGKASDSQTPDAKPGNKVV 354 LEESKKLTEKEKAELQAKLEAEAKALKE+LAKQAEELAKLRAGKASDSQTPDAKPGNK V Sbjct: 419 LEESKKLTEKEKAELQAKLEAEAKALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAV 478 Query: 355 PGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRKEE 414 PGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRKEE Sbjct: 479 PGKGQAPQAGTKPNQNKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRKEE 538 Query: 415 N 415 N Sbjct: 539 N 539 Score = 72.8 bits (178), Expect = 6e-16 Identities = 84/334 (25%), Positives = 133/334 (39%), Gaps = 2/334 (0%) Query: 1 MAKNNTNRHYSLRKLKKGTASVAVALSVIGAGLVVNTNEVSARVFPRGTVENPDKARELL 60 M KNNTNRHYSLRKLK GTASVAVAL+V+GAGL V + V R + +K +E Sbjct: 1 MTKNNTNRHYSLRKLKTGTASVAVALTVLGAGL-VVNTNEVSAVATRSQTDTLEKVQERA 59 Query: 61 NKYDVENSMLQANNDKLTTENKNLTDQNKELKAEENRLTTENKGLTKKLSEAEEEAANKE 120 +K+++EN+ L+ N L+ NK L D N EL E + + + K LSE + E Sbjct: 60 DKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELE 119 Query: 121 QESKETIGTLKKILDETVKDKIAREQKSKQDIGALKQELAKKDEGNKVSEASRKGLRRDL 180 + + + A+ + + + AL A ++ + + + Sbjct: 120 ARKAD-LEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKI 178 Query: 181 DASREAKKQVEKDLANLTAELDKVKEEKQISDASRKGLRRDLDASREAKKQVEKDLANLT 240 K +E A L L+ A K L + A K +EK L Sbjct: 179 KTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAM 238 Query: 241 AELDKVKEEKQISDASRQGLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKK 300 + + +A + L +A + ++K+ LE LE K Sbjct: 239 NFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKA 298 Query: 301 LTEKEKAELQAKLEAEAKALKEQLAKQAEELAKL 334 E + L A ++ + L + + A+ Sbjct: 299 DLEHQSQVLNANRQSLRRDLDASREAKKQLEAEH 332
>PF05043#Transcriptional activator Length = 493 Score = 520 bits (1340), Expect = 0.0 Identities = 106/473 (22%), Positives = 214/473 (45%), Gaps = 20/473 (4%) Query: 34 ELSKALNISMLTLQTCLTNMQ-FMKEVGGITYKNGYITIWYHQHCGLQEVYQKALRHSQS 92 EL++ LN + ++ L++++ ++ + NG I ++ VY +HS Sbjct: 30 ELAELLNCTERAVKDDLSHVKSAFPDLIFHSSTNGIRIINT-DDSDIEMVYHHFFKHSTH 88 Query: 93 FKLLETLFFRDFNSLEELAEELFVSLSTLKRLIKKTNAYLTHTFGITILTSPVQVSGDEH 152 F +LE +FF + E + +E ++S S+L R+I + N + F + +PVQ+ G+E Sbjct: 89 FSILEFIFFNEGCQAESICKEFYISSSSLYRIISQINKVIKRQFQFEVSLTPVQIIGNER 148 Query: 153 QIRLFYLKYFSEAYKISEWPFGEILNLKNCERLLSLMIKEVDVRVNFTLFQHLKILSSVN 212 IR F+ +YFSE Y EWPF + + +LL L+ KE +N + + LK+L N Sbjct: 149 DIRYFFAQYFSEKYYFLEWPFEN-FSSEPLSQLLELVYKETSFPMNLSTHRMLKLLLVTN 207 Query: 213 LIRYYKGYSAVYDNKKTSHRFSQLIQSSLETQDLSRLFYLKFGLYLDETTIAEMFSNHVN 272 L R G+ D + + + + + +++ F ++ + LDE + ++F ++ Sbjct: 208 LYRIKFGHFMEVDKDSFNDQSLDFLMQAEGIEGVAQSFESEYNISLDEEVVCQLFVSYFQ 267 Query: 273 DQLEIGYAF--DSIKQDSPTGCRKVTNWIHLL----DELEINLNLSVTNKYEVAVILHNT 326 I + +K+DS V HLL D++ + + + NK + LHNT Sbjct: 268 KMFFIDESLFMKCVKKDS-----YVEKSYHLLSDFIDQISVKYQIEIENKDNLIWHLHNT 322 Query: 327 TVLKEEDITANYLFFDYKKSYLNFYKQEHPHLYKAFVAGVEKLMRSEKEPISTELTNQLI 386 L +++ ++ FD K + + ++ P + + + + S+ + N L Sbjct: 323 AHLYRQELFTEFILFDQKGNTIRNFQNIFPKFVSDVKKELSHYLETLEVCSSSMMVNHLS 382 Query: 387 YAFFITWENSFLKVNQKDEKIRLLVI----ERSFNSVGNFLKKYIGEFFSITNFNELDAL 442 Y F ++ + + Q K+++LV+ + V L Y F + + EL+ Sbjct: 383 YTFITHTKHLVINLLQNQPKLKVLVMSNFDQYHAKFVAETLSYYCSNNFELEVWTELELS 442 Query: 443 TIDLEEIEKQYDVIVTDVMVGKSDELEIFFFYKMIPEAIIDKLNVFLNISFAD 495 LE + YD+I+++ ++ + + + + ++I LN + I + Sbjct: 443 KESLE--DSPYDIIISNFIIPPIENKRLIYSNNINTVSLIYLLNAMMFIRLDE 493
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 34.6 bits (79), Expect = 6e-04 Identities = 21/90 (23%), Positives = 23/90 (25%), Gaps = 4/90 (4%) Query: 119 PSPKDQSSQKESQNKDGRPTPSPDQQKDQTPDKTPEKGPEKAAEKTPEPNRDAPKPIQPP 178 P+P Q P Q P PE PE E E KP P Sbjct: 44 PAP-AQPISVTMVAPADLEPP-QAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKP 101 Query: 179 LGAAAPVFAPWRESDKDLSKLKPSSRSSAA 208 P E K K S +S Sbjct: 102 --KPKPKPVKKVEQPKRDVKPVESRPASPF 129 Score = 31.5 bits (71), Expect = 0.006 Identities = 12/83 (14%), Positives = 22/83 (26%), Gaps = 1/83 (1%) Query: 104 DKNDTKQPDSSDQSTPSPKDQSSQKESQNKDGRPTPSPDQQKDQTPDKTPEKGPEKAAEK 163 + P T ++ P P+ + + P+ E K Sbjct: 39 QVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPK 98 Query: 164 TPEPNRDAP-KPIQPPLGAAAPV 185 + P K ++ P PV Sbjct: 99 PKPKPKPKPVKKVEQPKRDVKPV 121
>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature. Length = 136 Score = 32.1 bits (73), Expect = 0.002 Identities = 14/62 (22%), Positives = 28/62 (45%), Gaps = 8/62 (12%) Query: 10 VINGLIIVVVTSILLVLYFAMPIYYTKVKDKEVKREFDQTSKQIKGKTVTEIRDILTKKI 69 V + LI+ ++ A+ + + KE +K+ +TEIRD+L ++ Sbjct: 82 VFDFLIVA------FAIFMAIKLINKLNRKKEEPAAAPAPTKEEV--LLTEIRDLLKEQN 133 Query: 70 NK 71 N+ Sbjct: 134 NR 135
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 83.0 bits (205), Expect = 1e-20 Identities = 31/128 (24%), Positives = 55/128 (42%), Gaps = 1/128 (0%) Query: 3 KILVVEDDDTISQVICEFLKANNYDPDCVFDGQAALDKWQTTSYDLIILDIMLPSLSGLE 62 ILV +DD I V+ + L YD + DL++ D+++P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 63 VLKTIRKT-SDVPIIMLTALDDEYTQLVSFNHLISDYVTKPFSPLILIKRIENVLRVSTP 121 +L I+K D+P+++++A + T + + DY+ KPF LI I L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 DEKRQIGD 129 + D Sbjct: 125 RPSKLEDD 132