>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 384 bits (988), Expect = e-134 Identities = 175/406 (43%), Positives = 268/406 (66%), Gaps = 2/406 (0%) Query: 1 MPAFRFEAIDASGRAQKGVIEADSARNARGQLRTQGLTPLVVEPAASAQRGARSQRLALG 60 M + ++A+DA G+ +G EADSAR AR LR +GL PL V+ Q+ + S L+L Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60 Query: 61 R--KLSQREQAILTRQLASLLVAGLPLDEALAVLTEQAERDYIRELMAAIRAEVLGGHSL 118 R +LS + A+LTRQLA+L+ A +PL+EAL + +Q+E+ ++ +LMAA+R++V+ GHSL Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120 Query: 119 ANALTQHPRDFPEIYRALVAAGEHTGKLGIVLSRLADYIEERNALKQKILLAFTYPAIVT 178 A+A+ P F +Y A+VAAGE +G L VL+RLADY E+R ++ +I A YP ++T Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180 Query: 179 VIALGIVTFLLSYVVPQVVNVFASTKQQLPLLTIVMMALSDFVRHWWWAILIGIAAVVYL 238 V+A+ +V+ LLS VVP+VV F KQ LPL T V+M +SD VR + +L+ + A Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240 Query: 239 VKATLSRDGPRLAFDRWLLTAPLAGKLVRGYNTVRFASTLGILSAAGVPILRALQAAGET 298 + L ++ R++F R LL PL G++ RG NT R+A TL IL+A+ VP+L+A++ +G+ Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300 Query: 299 LSNRAMRGNIEDAIVRVREGSALSRALNNVKTFPPVLVHLIRSGEATGDVTTMLDRAAEG 358 +SN R + A VREG +L +AL FPP++ H+I SGE +G++ +ML+RAA+ Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360 Query: 359 ESRELERRTMFLTSLLEPLLILAMGGIVLVIVLAVMLPIIELNNMV 404 + RE + L EPLL+++M +VL IVLA++ PI++LN ++ Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 188 bits (480), Expect = 6e-65 Identities = 67/140 (47%), Positives = 94/140 (67%), Gaps = 3/140 (2%) Query: 10 QAARRQRGFTLIEIMVVVAILGILAALIVPKIMSRPDEARRIAAKQDIGTIMQALKLYRL 69 +A +QRGFTL+EIMVV+ I+G+LA+L+VP +M ++A + A DI + AL +Y+L Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61 Query: 70 DNGRYPTQDQGLNALIQKPTTDPIPNNWKDGGYLERLPNDPWGNSYKYLNPGVHGEIDVF 129 DN YPT +QGL +L++ PT P+ N+ GY++RLP DPWGN Y +NPG HG D+ Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121 Query: 130 SYGADGKEGGESNDSDIGSW 149 S G DG+ G E DI +W Sbjct: 122 SAGPDGEMGTE---DDITNW 138
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 51.5 bits (123), Expect = 1e-10 Identities = 14/72 (19%), Positives = 26/72 (36%) Query: 48 RARGFTLLEMLVVLVIAGILVSVASLTLRRNPRTDLREEAQRVALLFETAGDEAQVRARP 107 R RGFTLLEM+++L++ G+ + L + + R + Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQF 61 Query: 108 IAWRTTDHGFRF 119 ++F Sbjct: 62 FGVSVHPDRWQF 73
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 31.1 bits (70), Expect = 8e-04 Identities = 14/58 (24%), Positives = 26/58 (44%), Gaps = 8/58 (13%) Query: 12 RSRGFTMIEVLVALAIIAIALAASIRAVGSMATSASDLHARLLAGWSADNALAQLRLA 69 R RGFT++E+++ L ++ ++ + + S D A + AQLR Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGM---VLLAFPASRDD-----SAAQTLARFEAQLRFV 51
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 34.9 bits (80), Expect = 9e-05 Identities = 19/77 (24%), Positives = 37/77 (48%), Gaps = 3/77 (3%) Query: 28 ARRGERGFTLIEMMIAITILAVIA-ILSWRGLDQIIRGREKVAAAMEDERVFEQMFDQMR 86 A +RGFTL+E+M+ I I+ V+A ++ + + ++ A+ D E D + Sbjct: 3 ATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQK--AVSDIVALENALDMYK 60 Query: 87 IDARRAATDDEAGQPAV 103 +D T ++ + V Sbjct: 61 LDNHHYPTTNQGLESLV 77
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 121 bits (306), Expect = 4e-32 Identities = 87/402 (21%), Positives = 163/402 (40%), Gaps = 18/402 (4%) Query: 30 LALGTFMEVLDTSIANVAVPTISGSLGVATSEGTWVISSYSVASAIAVPLTGWLARRVGE 89 L + +F VL+ + NV++P I+ + WV +++ + +I + G L+ ++G Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78 Query: 90 VRLFTLSVLAFTIASALCGLAENFES-LIAFRLLQGLVSGPMVPLSQTILMRSYPPAKRG 148 RL ++ S + + +F S LI R +QG + L ++ R P RG Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 149 LALGLWAMTVIVAPIFGPVMGGWITDNYTWPWIFYINLPIGMFSAACAFFLLR-GRETKT 207 A GL V + GP +GG I W ++ +P M + FL++ ++ Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIP--MITIITVPFLMKLLKKEVR 194 Query: 208 TKQRIDAIGLALLVIGVSCLQMMLDLGKDRDWFNSTFITSLALIAVVSLAFMLVWEATEK 267 K D G+ L+ +G+ + F +++ S +++V+S + Sbjct: 195 IKGHFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVT 244 Query: 268 EPVVDLSLFKDRNFALGAMIISFGFMAFFGSVVIFPLWLQTVMGYTAGLAGLATA-PVGF 326 +P VD L K+ F +G + F G V + P ++ V + G P Sbjct: 245 DPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTM 304 Query: 327 LALVLSPLIGRNMHRLDLRMVASFAFVVFAGVSIWNSTFTLDVPFNHVILPRLVQGIGVA 386 ++ + G + R V + V F VS ++F L+ + + + G++ Sbjct: 305 SVIIFGYIGGILVDRRGPLYVLNIG-VTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLS 363 Query: 387 CFFVPMTTITLSSISDERLASASGLSNFLRTLSGAIGTAVSS 428 ++TI SS+ + + L NF LS G A+ Sbjct: 364 FTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVG 405
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 273 bits (699), Expect = 2e-92 Identities = 82/324 (25%), Positives = 160/324 (49%), Gaps = 10/324 (3%) Query: 5 EFMSQEEVDALLKGVTGEDDSADEPAESSG---IRPYNIATQERIVRGRMPGLEIINDRF 61 E +SQ+E+D LL ++ D S ++ S I Y+ ++ + +M L ++++ F Sbjct: 3 EVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETF 62 Query: 62 ARLLRIGIFNFMRRTAEISVSQVKVQKYSEFTRNLPIPTNLNLVHVKPLRGTSLFVFDPN 121 ARL + +R + V+ V Y EF R++P P+ L ++ + PL+G ++ DP+ Sbjct: 63 ARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPS 122 Query: 122 LVFFVVDNLFGGDGRFHTRVEGRDFTATEQRIIGKLLNLVFEHYAVAWKSVRPLQFEFVR 181 + F ++D LFGG G+ RD T E ++ ++ + + +W V L+ + Sbjct: 123 ITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQ 180 Query: 182 SEMHTQFANVATPNEIVIVTQFSIEFGPTGGTLHICMPYSMIEPIRDVLSSPIQGEAL-- 239 E + QFA + P+E+V++ + G G ++ C+PY IEPI LSS ++ Sbjct: 181 IETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240 Query: 240 EVDRRWVRVLSQQVQSAEVELVADLAEVPTTFEKILNLRAGDVLPLD---IADSITAKVD 296 +++ VL ++ + ++++VA++ + + IL LR GD++ L + D + Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300 Query: 297 GVPVMECGYGIFNGQYALRVQKMI 320 C G+ + A ++ + I Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 32.0 bits (73), Expect = 0.003 Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 3/67 (4%) Query: 144 AGQPGDAPFAPPTLVGDLGGGALYLAMGVLAGIVDAR-LRGKGQVVDAAIVDGSANLMNL 202 AG P ++V D+GGG +A+ L G+V + +R G D AI++ Sbjct: 151 AGLPVSEATG--SMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGS 208 Query: 203 LLSIHAA 209 L+ A Sbjct: 209 LIGEATA 215
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 30.1 bits (68), Expect = 0.003 Identities = 16/44 (36%), Positives = 21/44 (47%), Gaps = 5/44 (11%) Query: 106 GGILVYDQFVTP----PTPQPVQQRRLRWGAHGRSNNGDNFYVV 145 GG + P PTPQPV R + +GA+GRS + V Sbjct: 452 GGTIAAAPMGDPNASIPTPQPVHYRPM-FGAYGRSRTNSSVTFV 494
>adhesinb#Adhesin B signature. Length = 310 Score = 29.4 bits (66), Expect = 0.025 Identities = 9/36 (25%), Positives = 13/36 (36%) Query: 5 WKILGLAAAASISLAGCGGGDGGGSAQTGTLHVAMT 40 + L L A + LA C + L+V T Sbjct: 4 CRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVAT 39
>FLGHOOKFLIK#Flagellar hook-length control protein signature. Length = 375 Score = 70.6 bits (172), Expect = 2e-15 Identities = 57/161 (35%), Positives = 83/161 (51%), Gaps = 1/161 (0%) Query: 265 AASGAIAALQDAADSARATLAASSAPAALQQAA-PAALAANANAAAATAAPSLAPPVGTP 323 A L A++ S P+ + AA P AAP L+ P+G+ Sbjct: 179 APGTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSH 238 Query: 324 DWTEALSQKVVFLSNAHQQSAELTLNPPDLGPLQVVLRVAENHAHALFVSQHAQVRDAVE 383 +W ++LSQ + + QQSAEL L+P DLG +Q+ L+V +N A VS H VR A+E Sbjct: 239 EWQQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALE 298 Query: 384 AALPKLREAMEAGGLGLGSASVSDGGFASAQQQQTPQRQSS 424 AALP LR + G+ LG +++S F+ QQ + Q+QS Sbjct: 299 AALPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQ 339
>FLGFLIJ#Flagellar FliJ protein signature. Length = 147 Score = 62.1 bits (150), Expect = 3e-15 Identities = 43/140 (30%), Positives = 74/140 (52%) Query: 1 MAQSFPLQLLLDRAQDDLDTATKQLGHAQRERTDAQAQLDALVRYRDEYRERFAASAQSG 60 MA+ L L D A+ +++ A + LG +R A+ QL L+ Y++EYR + +G Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 Query: 61 MPAGNWRNFQAFLDTLDAAIEQQRRVLAAAQTRIDAARPEWQAKKRTLGSYEILQARGAR 120 + + W N+Q F+ TL+ AI Q R+ L ++D A W+ KK+ L +++ LQ R + Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120 Query: 121 QDAQRAAKREQRDADEHAAK 140 + +Q+ DE A + Sbjct: 121 AALLAENRLDQKKMDEFAQR 140
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 107 bits (267), Expect = 6e-31 Identities = 65/184 (35%), Positives = 106/184 (57%), Gaps = 4/184 (2%) Query: 18 AAAALAAELQRVRDAAHAEGLAAGHVEGQALGYQAGYEQGRAKGFDEGQAEAHAHGAQLA 77 A +L +L +++ AH +G AG EG+ G++ GY++G A+G ++G AEA + A + Sbjct: 36 AEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIH 95 Query: 78 A----LAASFREALAGVERDLADDIATLALEIAQQVVRQHVQHDPAALIAAAREVLAAEP 133 A L + F+ L ++ +A + +ALE A+QV+ Q D +ALI +++L EP Sbjct: 96 ARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEP 155 Query: 134 ALAGAPHLIVNPADLPVVEAYLKDELDTLGWSVRTDAGVERGGCRAHASTGEIDATLATR 193 +G P L V+P DL V+ L L GW +R D + GGC+ A G++DA++ATR Sbjct: 156 LFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATR 215 Query: 194 WERV 197 W+ + Sbjct: 216 WQEL 219
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 299 bits (767), Expect = e-102 Identities = 114/324 (35%), Positives = 190/324 (58%) Query: 5 GLTKSALLLMSIGEEEAAQVFKFLAPREVQKIGAAMASLKNVTREQVEDVLSEFVHEAEK 64 G K+A+LL+SIG E +++VFK+L+ E++ + +A L+ +T E ++VL EF Sbjct: 17 GKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELMMA 76 Query: 65 HTALSLDSSEYIRSVLTKALGEDKAGVLIDRILQGSDTSGIEGLKWMDSAAVAELIKNEH 124 + +Y R +L K+LG KA +I+ + + E ++ D A + I+ EH Sbjct: 77 QEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQEH 136 Query: 125 PQIIATILVHLDRDQASEIASCFTERLRNDVLLRIATLDGIQPTALRELDDVLTGLLSGS 184 PQ IA IL +LD +AS I S ++ +V RIA +D P +RE++ VL L+ Sbjct: 137 PQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLASL 196 Query: 185 DNLKRAPMGGIRTAAEILNFMTSVHEEAVLENVKQYDPDLAQKIIDQMFVFENLLDLEDR 244 + GG+ EI+N E+ ++E++++ DP+LA++I +MFVFE+++ L+DR Sbjct: 197 SSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLDDR 256 Query: 245 AIQLLLKEVESEALIVALKGAPPALRQKFLSNMSQRAAELLAEDLDARGPVRVSEVETQQ 304 +IQ +L+E++ + L ALK +++K NMS+RAA +L ED++ GP R +VE Q Sbjct: 257 SIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEESQ 316 Query: 305 RKILQVVRNLAESGQIVIGGKAED 328 +KI+ ++R L E G+IVI E+ Sbjct: 317 QKIVSLIRKLEEQGEIVISRGGEE 340
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 469 bits (1208), Expect = e-161 Identities = 253/559 (45%), Positives = 364/559 (65%), Gaps = 32/559 (5%) Query: 133 LARMKTNPRLPFLIGAALAIAAIVALVLWSRAPDYRVLYSNLSDRDGGAIIAALQQANVP 192 L R++ NPR+P ++ + A+A +VA+VLW++ PDYR L+SNLSD+DGGAI+A L Q N+P Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIP 75 Query: 193 YKFADAGGAILVPANQVHETRLKLAAMGLPKGGSVGFELMDNQKFGISQFAEQVNYQRAL 252 Y+FA+ GAI VPA++VHE RL+LA GLPKGG+VGFEL+D +KFGISQF+EQVNYQRAL Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRAL 135 Query: 253 EGELQRTVESVNAVRAARVHLAIPKPSVFVRDREAPSASVLVDLYPGRVLDEGQVLAITR 312 EGEL RT+E++ V++ARVHLA+PKPS+FVR++++PSASV V L PGR LDEGQ+ A+ Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVH 195 Query: 313 LVSSSVPDMPAKNVTIVDQDGNLLTQT-ASATGLDASQLKYVQQIERNTQKRIDAILAPI 371 LVSS+V +P NVT+VDQ G+LLTQ+ S L+ +QLK+ +E Q+RI+AIL+PI Sbjct: 196 LVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPI 255 Query: 372 FGAGNARSQVSADVDFSKIEQTSESYAPNGTPQQSAIRSQQTSTSTELAQSGTSGVPGAL 431 G GN +QV+A +DF+ EQT E Y+PNG ++ +RS+Q + S ++ GVPGAL Sbjct: 256 VGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGAL 315 Query: 432 SNTPPQPASAPIVA-------------SNGQPAAPAATPVSDRKDSTTNYELDKTVRHVE 478 SN P P API ++ + +A P S +++ T+NYE+D+T+RH + Sbjct: 316 SNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTK 375 Query: 479 QSMGTIKRLSVAVVVNYQPSTDAKGRVTMQPLTADKLAQVQQLVKDAMGYDEKRGDSVNV 538 ++G I+RLSVAVVVNY+ D K PLTAD++ Q++ L ++AMG+ +KRGD++NV Sbjct: 376 MNVGDIERLSVAVVVNYKTLADGKP----LPLTADQMKQIEDLTREAMGFSDKRGDTLNV 431 Query: 539 VNSAFSAAADPFANLPWWRQPDMIELGKDIAKWLGVAAAAAALYFMFVRPAL-RRAFPPP 597 VNS FSA + LP+W+Q I+ +WL V A L+ VRP L RR Sbjct: 432 VNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAK 491 Query: 598 AEPAAAVPALDGPDDALALDGLPSPDKKQLAEEDEEHPALLAFESEKNRYERNLDYARTI 657 A A + + + E+ ++ +++ E R + Sbjct: 492 AAQEQAQVRQETEEA--------VEVRLSKDEQLQQRR-----ANQRLGAEVMSQRIREM 538 Query: 658 ARQDPKIVATVVKNWVSDE 676 + DP++VA V++ W+S++ Sbjct: 539 SDNDPRVVALVIRQWMSND 557
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 59.3 bits (143), Expect = 6e-15 Identities = 46/112 (41%), Positives = 65/112 (58%), Gaps = 9/112 (8%) Query: 3 APVNGIASALQQMQAMAAQAAGGAASPAASLAGSGAATAGSFASAMKASLEKISGDQQKA 62 + + GI + Q+QA A A + P ++ SFA + A+L++IS Q A Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTI---------SFAGQLHAALDRISDTQTAA 51 Query: 63 LGEAHAFEIGAQNVSLNDVMVDMQKANIGFQFGLQVRNKLVSAYNEIMQMSV 114 +A F +G V+LNDVM DMQKA++ Q G+QVRNKLV+AY E+M M V Sbjct: 52 RTQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 31.3 bits (71), Expect = 0.004 Identities = 54/298 (18%), Positives = 91/298 (30%), Gaps = 76/298 (25%) Query: 10 GGTGFIGSRLVNALVDAGAHVRIG----------ARRRDHARHLATLPVDIVELTAFDVR 59 G GFIG + L++AG V +G + ++ LA ++ D Sbjct: 7 GAAGFIGFHVSKRLLEAGHQV-VGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLADRE 65 Query: 60 ELARFVAGAHAAVNLVGVLHGGRGKRY----GEGFERLHVALPAALAAACIEARVPRMLH 115 + A H V + RY + ++ + C ++ +L+ Sbjct: 66 GMTDLFASGH--FERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHLLY 123 Query: 116 VSA---LGADPNAP-----------SMYLRSKGDGEAALHAQAAAGVLDVTVFRPSIVFG 161 S+ G + P S+Y +K E H + L T R V+G Sbjct: 124 ASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFTVYG 183 Query: 162 PG---DAFLNTFARLQRIFPVLPLAMPDALMQPI-------------YVGDVAQAI---- 201 P D L F + AM + + I Y+ D+A+AI Sbjct: 184 PWGRPDMALFKFTK----------AMLEG--KSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231 Query: 202 -------------ANACARDATRGRTYELGGPRTYRLEEIVRYAGRLVGRPARIVRLP 246 A R Y +G L + ++ +G A+ LP Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLP 289
>cloacin#Cloacin signature. Length = 551 Score = 29.3 bits (65), Expect = 0.016 Identities = 22/64 (34%), Positives = 32/64 (50%), Gaps = 11/64 (17%) Query: 115 SVSAEMHAGFPALRSEMPLNVRESHPGRGATPAALADVARIDELWRTCLAASGGPFLFGE 174 +V+A + GFPAL + + S GA AA+AD+ +AA GPF FG Sbjct: 83 AVAAPVAFGFPALSTPGAGGLAVSISA-GALSAAIADI----------MAALKGPFKFGL 131 Query: 175 FSIA 178 + +A Sbjct: 132 WGVA 135
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 33.9 bits (77), Expect = 0.002 Identities = 29/222 (13%), Positives = 58/222 (26%), Gaps = 17/222 (7%) Query: 123 VIEPRPAESNSRMAAAAPNGWSRPATSAMPRTGPNGNANPAASTAGSYFPASPASARAGW 182 ++ + + + A P+ S A P PA PA+P S Sbjct: 991 TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPV--PPPA--------PATP-SETTET 1039 Query: 183 NAPAGATASVAANPNNPMTPVATGVNPEFRAGAASHAPARAPAWVPARVPADARRVAMVV 242 A S N T N E A S+ A A+ ++ + Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099 Query: 243 AQDAPPPAGQPANTRAAAQSWQAARMQGATTA-QGGVIPVSFRSQPAPRMLPPRPEPIRA 301 ++ + ++ + ++ + Q V +++PA +P Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARE-----NDPTVN 1154 Query: 302 AAATASASGAPPAAATAAATGAAAAPPPPAGQQDGESIRRAA 343 S + A ++ P + Sbjct: 1155 IKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVV 1196
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 26.8 bits (59), Expect = 0.030 Identities = 10/38 (26%), Positives = 17/38 (44%) Query: 102 NVDPVQEMVNMISASRSYQANVETLNTAKQLMLKTLTI 139 V+ +E N+ + Y AN + L TA + + I Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 81.4 bits (201), Expect = 1e-19 Identities = 29/128 (22%), Positives = 52/128 (40%) Query: 49 SRVLTIEDDEITANEIVGELKSRGFTVDWVANGRDGMARAISDDYDVITLDRMLPGVDGL 108 + +L +DD + L G+ V +N + D D++ D ++P + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 109 TILTTMRSIGVRTPVLMLSALGDVDERVRGLRAGGDDYLTKPFDTEEMTARLEVLLRRSQ 168 +L ++ PVL++SA ++ G DYL KPFD E+ + L + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 169 ASPAPFET 176 P+ E Sbjct: 124 RRPSKLED 131
>cloacin#Cloacin signature. Length = 551 Score = 32.0 bits (72), Expect = 0.003 Identities = 17/40 (42%), Positives = 21/40 (52%), Gaps = 3/40 (7%) Query: 242 GGGVGARVGGPFIGGRGGGWGGGSDGFRGGGGGFGGGGAS 281 GGG G+ G GG G GG +G GGG G GG ++ Sbjct: 47 GGGSGS---GIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83 Score = 31.6 bits (71), Expect = 0.003 Identities = 24/53 (45%), Positives = 26/53 (49%), Gaps = 9/53 (16%) Query: 239 TLLGGGVGARVGG-------PFIGGRGGG--WGGGSDGFRGGGGGFGGGGASG 282 T LG G GA G P+ GG G G WGGGS GGG G GGG+ Sbjct: 25 TGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGT 77
>SSBTLNINHBTR#Streptomyces subtilisin inhibitor signature. Length = 144 Score = 29.0 bits (64), Expect = 0.020 Identities = 21/44 (47%), Positives = 23/44 (52%), Gaps = 3/44 (6%) Query: 36 VLHPLAGRPLLSHVIDTARALAPSRLVVVIGHGAERVRAAVAAP 79 V PLAG L S A APS LV+ +GHG AA AAP Sbjct: 18 VCGPLAGASLASPATAPASLYAPSALVLTVGHGES---AATAAP 58
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.4 bits (63), Expect = 0.037 Identities = 17/136 (12%), Positives = 34/136 (25%), Gaps = 8/136 (5%) Query: 2 GTTIRDVAQAANVSIGTVSRALKNQPGLSEATRARIVE-----IAHRMNYDPTQLRPRIK 56 T++ ++A+AA V+ G + K++ L P ++ Sbjct: 31 STSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLR 90 Query: 57 -RLTFLLHRQHNNFATTPFFSHVLHGVEDACRERGIVPSLLTTGPTDDVIRQMRPHAPDA 115 L +L + H E E +V + ++ Sbjct: 91 EILIHVLESTVTEERRRLLMEIIFHKCEFV-GEMAVVQQAQRNLCL-ESYDRIEQTLKHC 148 Query: 116 IAVAGFMEPETLEALA 131 I A Sbjct: 149 IEAKMLPADLMTRRAA 164
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 72.9 bits (179), Expect = 1e-16 Identities = 80/358 (22%), Positives = 128/358 (35%), Gaps = 48/358 (13%) Query: 18 VAALAAGAPSVRAQSSVQLYGQVDEWIGAQKFPGGQRAWGVQGGGMST-----SYWGLRG 72 + AL A V A + V LYG + + + A + S G +G Sbjct: 5 LIALTLAALPVAAMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKG 64 Query: 73 TEDLGGGYQAIFTLEDFFRAQNGHYGRFDGDTFFGRNAYVGLATPYGTVRAGRLTTQLFV 132 EDLG G +AI+ +E Q D R +++GL +G +R GRL + L Sbjct: 65 QEDLGNGLKAIWQVE-----QKASIAGTDSGWG-NRQSFIGLKGGFGKLRVGRLNSVLK- 117 Query: 133 STILFNPFVDSYVFSPMVYHVFLGLGTFPTYTTDQGVVGDSGWNNAIDYTSPSFGGFNAA 192 T NP+ + LG + ++ + Y SP F G + + Sbjct: 118 DTGDINPWDSKSDY----------LGVNKIAEPEARLIS-------VRYDSPEFAGLSGS 160 Query: 193 AMYAFGNTAGDNRSKKWSGQLNYSNGPFAATAVYQYVNFNGGPGDLGALVSGMKSQGVAQ 252 YA + AG + S+ + NY NG F Y + V+ K Q + + Sbjct: 161 VQYALNDNAGRHNSESYHAGFNYKNGGFFVQYGGAYKRH----HQVQENVNIEKYQ-IHR 215 Query: 253 VGLSYDFKLAKIYA-QYMYTNNERNAGNWHVNTVQGGVAVPL----GPGSALASYAYS-- 305 + YD +YA + + + + + Q VA L G + SYA+ Sbjct: 216 LVSGYD--NDALYASVAVQQQDAKLVEENYSHNSQTEVAATLAYRFGNVTPRVSYAHGFK 273 Query: 306 --RDSGGLDQTRRTWALGYDYPLSKRTDLYAAYM---NDRYSGMSSGDTFGAGIRAKF 358 D+ + +G +Y SKRT + + G G+R KF Sbjct: 274 GSFDATNYNNDYDQVVVGAEYDFSKRTSALVSAGWLQEGKGESKFVSTAGGVGLRHKF 331
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 27.9 bits (62), Expect = 0.020 Identities = 9/48 (18%), Positives = 18/48 (37%) Query: 8 PPAAASALDAIDRELLRALADDARQPVSELARRVGLSAPSTADRLRRL 55 L ++ L+ A R + A +GL+ + ++R L Sbjct: 426 SGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL 473
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 33.8 bits (77), Expect = 8e-04 Identities = 25/111 (22%), Positives = 41/111 (36%), Gaps = 5/111 (4%) Query: 44 EPFEPVEPDNVPVQVELLKPQPIARAPAPVKPAAGRPQAAQKRAAPAHAPMPRARAPRAS 103 EP + V+P PV +P+PI P +P+ K P P + Sbjct: 56 EPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK-----PKPVKKVQEQP 110 Query: 104 QPVLSAAESPIESPAAASAAEPASAATAGATSEATGGAAAGAAGAGAAAPP 154 + + ES SP +A +++TA A + + A A + P Sbjct: 111 KRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQP 161
>PF03309#Bvg accessory factor Length = 271 Score = 199 bits (509), Expect = 8e-66 Identities = 57/278 (20%), Positives = 103/278 (37%), Gaps = 47/278 (16%) Query: 5 CLLIDAGNSRIKWALADTGRHFVTSGAFEHADDTPDWSTLPAPR------GAWISNVAGD 58 L ID N+ L G+ +HA W P I + GD Sbjct: 2 LLAIDVRNTHTVVGLIS--------GSGDHAKVVQQWRIRTEPEVTADELALTIDGLIGD 53 Query: 59 AAAA---------------RIDALIDAHWPALPRTVVRACAAQCGVTNGYAEPARLGSDR 103 A + +++ +WP +P ++ + G+ P +G+DR Sbjct: 54 DAERLTGASGLSTVPSVLHEVRVMLEQYWPNVPHVLIEP-GVRTGIPLLVDNPKEVGADR 112 Query: 104 WAGLIGAHAAFPGEHLLIATFGTATTLEALRADGRFTGGLIAPGWALMMRSLGMHTAQLP 163 + A+ + +++ FG++ ++ + A G F GG IAPG + + +A L Sbjct: 113 IVNCLAAYHKYGTAAIVV-DFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAALR 171 Query: 164 TVSIDAATSLLDELAANDAHAPFAIDTPHALSAGCLQAQAGLIE----RAWRDLEKAWKA 219 V + S++ + +T + AG + AGL++ R D++ A Sbjct: 172 RVELTRPRSVIGK------------NTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSGA 219 Query: 220 PVRLVLSGGAADAIVRALTVPHTRHDTLVLTGLALIAH 257 V +V +G A ++ L L L GL L+ Sbjct: 220 DVAVVATGHTAPLVLPDLRTVEHYDRHLTLDGLRLVFE 257
>SECA#SecA protein signature. Length = 901 Score = 30.6 bits (69), Expect = 0.008 Identities = 19/50 (38%), Positives = 23/50 (46%), Gaps = 4/50 (8%) Query: 199 AAAEVDALRARDATLAGGLP----PVALAAVRAGATLTDTLAAALNALAA 244 A+ V +R D L GG+ +A G TLT TL A LNAL Sbjct: 74 ASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNALTG 123
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.006 Identities = 13/47 (27%), Positives = 24/47 (51%), Gaps = 6/47 (12%) Query: 7 TLLVDILDA--GNLSKAAQRLKMSRANVSYRLNQLEKSIGLQLVRRT 51 L++ L A GN KAA L ++R + ++ +L G+ + R + Sbjct: 439 PLILAALTATRGNQIKAADLLGLNRNTLRKKIREL----GVSVYRSS 481
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 515 bits (1329), Expect = e-176 Identities = 197/567 (34%), Positives = 310/567 (54%), Gaps = 7/567 (1%) Query: 302 PNTLAGVCAAPGIAVGALVRWDETDIAPPELASGTPAAESRLLDRALAAVDAELETTVRE 361 + + G+ A+ G+A+ E ++ + + + E L AL EL + Sbjct: 2 HHKITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQ 61 Query: 362 ASQRGAIGEAGIFAVHRVLLEDPSLVDAARDLI-SLGKSAGYAWRETIRAQTAVLAGVDD 420 +A IFA H ++L+DP LVD + I + +A YA +E ++ +D+ Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121 Query: 421 ALLAERAADLRDIDKRVLRAL-GYASATARELPAEAVLAAEEFTPSDLASLDRERVTALV 479 + ERAAD+RD+ KRVL L G + + + E V+ AE+ TPSD A L+++ V Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181 Query: 480 MARGGATSHAAIIARQLGIPSLVAVGDALYAIPQRTQVVVDASAGRLEYAPTALDVERAR 539 GG TSH+AI++R L IP++V + I V+VD G + PT +V+ Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241 Query: 540 HERQRLAGVREANRRMSGEAAVTRDGHKIEVAANIATLDDARVAVDNGADAVGLLRTELM 599 +R ++ ++ GE + T+DG +E+AANI T D + NG + +GL RTE + Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301 Query: 600 FIHRQAAPTTSEHQQSYQSIVDALQGRTAIIRTLDVGADKEVDYLTLPPEPNPALGLRGI 659 ++ R PT E ++Y+ +V + G+ +IRTLD+G DKE+ YL LP E NP LG R I Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361 Query: 660 RLAQVRPDLLDDQLQGLLSVKPYGSVRILLPMVTDVGELVRIRERIDAFARALGR----- 714 RL + D+ QL+ LL YG+++++ PM+ + EL + + + L Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421 Query: 715 ADPIEVGVMIEVPSAALLADQLAKHADFLSIGTNDLTQYTLAMDRCQADLAAQADGLHPA 774 +D IEVG+M+E+PS A+ A+ AK DF SIGTNDL QYT+A DR ++ HPA Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481 Query: 775 VLRLVDATVRGAEKHGKWVGVCGALGGDPVAVPVLVGLGVTELSVDPVSVPGIKAQVRRL 834 +LRLVD ++ A GKWVG+CG + GD VA+P+L+GLG+ E S+ S+ ++Q+ +L Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541 Query: 835 DYQLCRQRAQDLLALESAQAVRAASRE 861 + + AQ L L++A+ V ++ Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKK 568
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 32.8 bits (75), Expect = 3e-04 Identities = 14/42 (33%), Positives = 21/42 (50%), Gaps = 1/42 (2%) Query: 51 KRETKQQFIDAIAAGRRRYRQIEIQSQDVL-PVGDATYVVAG 91 KRE K+ +RR EIQS+++ V ++ VVA Sbjct: 222 KREYKEMEGSPEIKSKRRQFHQEIQSRNMRENVKRSSVVVAN 263
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 95.3 bits (237), Expect = 2e-23 Identities = 76/368 (20%), Positives = 144/368 (39%), Gaps = 31/368 (8%) Query: 100 RATTSLAAIFALRMLGLFMIMPVFSVYAKT-IPGGDNVVLVGIALGAYGVTQSLLYIFYG 158 R + + AL +G+ +IMPV + + D GI L Y + Q G Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64 Query: 159 WASDKFGRKPVIAAGLLIFALGSFVAAFAHDITWIIVGRVIQGM-GAVSSAVLAFIADLT 217 SD+FGR+PV+ L A+ + A A + + +GR++ G+ GA + A+IAD+T Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124 Query: 218 SEHNRTKAMAMVGGSIGVSFAVAIVGAPI--VFHWVGMSGLFAIVGALSVVAIGVVLWVV 275 R + + G + G + + F AL+ + +++ Sbjct: 125 DGDERARHFGFMSACFGFGM---VAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181 Query: 276 PDAPRPVHVPAPFAEVLHNVELLRLNFGVLVLHATQTALFLVVPRLLVDGGLPVA----- 330 P++ + P E L+ + R G+ V+ A F+ + + G +P A Sbjct: 182 PESHKGERRPLR-REALNPLASFRWARGMTVVAALMAVFFI----MQLVGQVPAALWVIF 236 Query: 331 ----SHWQ-----VYLPVMGL--AFVMMVPAIIVAEKQGKMKPVLLGGIAAILIGQLLLG 379 HW + L G+ + + VA + G+ + ++L G+ A G +LL Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALML-GMIADGTGYILLA 295 Query: 380 MATHTILIVAAILFIYFLGFNILEASQPSLVSKLAPGSRKGAATGVYNTTQSIGLALGGV 439 AT + + + I + +++S+ R+G G S+ +G + Sbjct: 296 FATRGWMAF--PIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPL 353 Query: 440 VGGVLLKH 447 + + Sbjct: 354 LFTAIYAA 361
>cloacin#Cloacin signature. Length = 551 Score = 41.6 bits (97), Expect = 7e-07 Identities = 30/71 (42%), Positives = 33/71 (46%), Gaps = 2/71 (2%) Query: 109 GGRGGSGGGGGGDDG-GYGGGGG-YGGGRDMERGGGGGGRASGGGGAGARSGGGGGGGGA 166 GG G G GGG DG G+ +GGG GGG GGG G GG G GG Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81 Query: 167 SRPSAPAGGGF 177 S +AP GF Sbjct: 82 SAVAAPVAFGF 92 Score = 32.0 bits (72), Expect = 0.001 Identities = 27/79 (34%), Positives = 29/79 (36%), Gaps = 16/79 (20%) Query: 114 SGGGGGGD-----------DGGYGGGGGYGGGRD-----MERGGGGGGRASGGGGAGARS 157 SGG G G +GG G G GG D E GGG SG G Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61 Query: 158 GGGGGGGGASRPSAPAGGG 176 G GGG G S + GG Sbjct: 62 HGNGGGNGNSGGGSGTGGN 80 Score = 29.3 bits (65), Expect = 0.011 Identities = 22/59 (37%), Positives = 22/59 (37%) Query: 109 GGRGGSGGGGGGDDGGYGGGGGYGGGRDMERGGGGGGRASGGGGAGARSGGGGGGGGAS 167 GG G GGG G GGG G GG G A G A S G GG S Sbjct: 48 GGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106
>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature. Length = 591 Score = 61.3 bits (148), Expect = 3e-11 Identities = 56/208 (26%), Positives = 81/208 (38%), Gaps = 40/208 (19%) Query: 12 LNLPSGGGSVGGDGGDFSVDLNTGTATLKFDLTVPAGPNGITPPHTLQYSAGAGDGAFGL 71 LP GG ++ G D G A++ L + A G P L YS+G G+G FG+ Sbjct: 18 PFLPKGGKALSQSGPD-------GLASITLPLPISAE-RGFAPALALHYSSGGGNGPFGV 69 Query: 72 GWSLGLMTIRRR-----------------------ITPATGAAEPAPPGACTLVGVGELV 108 GWS M+I R T +TG A P P V Sbjct: 70 GWSCATMSIARSTSHGVPQYNDSDEFLGPDGEVLVQTLSTGDA-PNPVTCFAYGDVSFPQ 128 Query: 109 DMGARRFRPIVDATGLLIEFTGAS------WTATDKTDTQYTLGTSANAQIGDGGALP-- 160 R++P +++ +E+ + W D + LG +A A++ D A Sbjct: 129 SYTVTRYQPRTESSFYRLEYWVGNSNGDDFWLLHDSNGILHLLGKTAAARLSDPQAASHT 188 Query: 161 AAWLVDRCADSAGNAIAYTWLAVGGARV 188 A WLV+ AG I Y++LA G V Sbjct: 189 AQWLVEESVTPAGEHIYYSYLAENGDNV 216
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 39.8 bits (93), Expect = 6e-05 Identities = 27/230 (11%), Positives = 60/230 (26%), Gaps = 27/230 (11%) Query: 653 QANGQIDAAQQQLAVAQAQAHAYQAGVTVAQTRATNAAKNAQEYGALNSQVIVIQATGQQ 712 A Q L A+ + YQ + K E + ++ Sbjct: 131 GAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQ-------NVSEEE 183 Query: 713 VSGGDDGDYNGVSAMANQYLSG-----QRISGDSATVAAATNLAANRLSQQFQIDSMNRT 767 V S NQ ++ + +A ++ ++D + Sbjct: 184 VLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL 243 Query: 768 TAE---MQQALAQAQAQLAAANAQVSAAGANLAVAQLNAQAAAQTLGVFDADTFTPQVWK 824 + + A+ + + + A ++ + L + +A + + Sbjct: 244 LHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL-------- 295 Query: 825 AMGNFIQQIYERYMNMALRAAKLMQQAYNFENDVSVSFIKASYQGVVNGL 874 F +I ++ L + E S I+A V L Sbjct: 296 ----FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQL 341
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 33.9 bits (77), Expect = 0.004 Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 2/42 (4%) Query: 149 QVLDGLAHAHANGVVHRDLKPQNVMVTTLDGEPCAKILDFGI 190 ++LD H GVVH D+KP NV+ GEP ++D G+ Sbjct: 253 RLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPV--VIDLGL 292
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 57.9 bits (140), Expect = 6e-11 Identities = 33/122 (27%), Positives = 51/122 (41%), Gaps = 15/122 (12%) Query: 484 HALVVDDNENARETLGAMLTALGIRADLRGTGKEGLRCFGECQHDIVVLDLELPDISGFE 543 LV DD+ R L L+ G + R D+VV D+ +PD + F+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 544 VAEQIRWATSPDAAKKTTILGVSAYES------AMLKGDHAVFDAFVPKPIHLDTLNGIV 597 + +I+ A +L +SA + A KG +D ++PKP L L GI+ Sbjct: 65 LLPRIK-----KARPDLPVLVMSAQNTFMTAIKASEKG---AYD-YLPKPFDLTELIGII 115 Query: 598 SR 599 R Sbjct: 116 GR 117
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 32.3 bits (73), Expect = 0.005 Identities = 28/144 (19%), Positives = 49/144 (34%), Gaps = 12/144 (8%) Query: 530 HQGLLSSLPSQPLGAPSPRTSHHHPAAIHRNARPPSPPQSSDPSRTRSPRSPEPESLAKP 589 HQ + P+QP+ + P PP P +P P P+ + Sbjct: 38 HQVIELPAPAQPISVTMVAPADLEPP--QAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIE 95 Query: 590 RSRSPFRALGLRCRPLRLRPSPAVRR-GRRGGLRRTSPIEDERPRAPPPIVARNGRAGDT 648 + + + +P ++ +R + R SP E+ P P A + Sbjct: 96 KPKPKPKP-----KPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPV 150 Query: 649 LA----PRQLTCNSPPRPSRPRRA 668 + PR L+ N P P+R + Sbjct: 151 TSVASGPRALSRNQPQYPARAQAL 174
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 34.0 bits (78), Expect = 0.001 Identities = 16/42 (38%), Positives = 24/42 (57%) Query: 287 ILIALALLIGTPFFVFFGSLSDKIGRKPIILAGCLIAALTYF 328 IL+AL L+ G+LSD+ GR+P++L AA+ Y Sbjct: 47 ILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYA 88 Score = 33.6 bits (77), Expect = 0.002 Identities = 46/264 (17%), Positives = 93/264 (35%), Gaps = 28/264 (10%) Query: 77 AIVFGRLGDLVGRKHTFLITIVIMGISTFVVGFLPGYASIGIAAPVIFIAMRLMQGLALG 136 A V G L D GR+ L+++ + ++ P V++I R++ G+ G Sbjct: 60 APVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-------FLWVLYIG-RIVAGIT-G 110 Query: 137 GEYGGAATYVAEHAPAHRRGFYTSWIQTTATLGLFLSLLVILGVRTAIGEEAFGNWGWRV 196 A Y+A+ R + ++ G+ + G +G G + Sbjct: 111 ATGAVAGAYIADITDGDERARHFGFMSACFGFGM------VAG--PVLGGLM-GGFSPHA 161 Query: 197 PFVASILLLAVSVWIRLQLNESPVFLRIKAEGKTSKAPLTEAFGQWKNLKIVILALIGLT 256 PF A+ L ++ L + + + PL + + L Sbjct: 162 PFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASF-----RWARGMTVVAALM 216 Query: 257 AGQAVVWYTGQFYA---LFFLTQTLKVDGGSANILIALALLIGTPF-FVFFGSLSDKIGR 312 A ++ GQ A + F D + I +A ++ + + G ++ ++G Sbjct: 217 AVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGE 276 Query: 313 KPIILAGCLIAALTYFPLFKALTH 336 + ++ G +IA T + L T Sbjct: 277 RRALMLG-MIADGTGYILLAFATR 299
>PF06580#Sensor histidine kinase Length = 349 Score = 43.7 bits (103), Expect = 1e-06 Identities = 24/75 (32%), Positives = 35/75 (46%), Gaps = 21/75 (28%) Query: 408 LIDNAIRY----TPTGGRITVRVRADHAAGVVHLEVEDTGPGIPANERERVVERFYRILG 463 L++N I++ P GG+I ++ D+ G V LEVE+TG N +E Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDN--GTVTLEVENTGSLALKNTKE----------- 309 Query: 464 REGDGSGLGLAIVRE 478 +G GL VRE Sbjct: 310 ----STGTGLQNVRE 320
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 95.7 bits (238), Expect = 2e-24 Identities = 36/118 (30%), Positives = 63/118 (53%), Gaps = 1/118 (0%) Query: 46 RILIAEDDSILADGLTRSLRQSGYAVDHVRNGVEADTALSMQAFDLLILDLGLPRMPGLD 105 IL+A+DD+ + L ++L ++GY V N ++ DL++ D+ +P D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 106 VLRRLRARNSNLPVLILTAADSVDERVKGLDLGADDYMAKPFALNE-LEARVRALTRR 162 +L R++ +LPVL+++A ++ +K + GA DY+ KPF L E + RAL Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 44.5 bits (105), Expect = 3e-08 Identities = 18/59 (30%), Positives = 30/59 (50%), Gaps = 5/59 (8%) Query: 33 RRMMRGRGFTLIELMIVLAIVGVVAAYAIPAYQDYLARSRVGEGLALAASARLAVAENA 91 R + RGFTL+E+M+V+ I+GV+A+ +P + A + + ENA Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLM-----GNKEKADKQKAVSDIVALENA 55
>PF06580#Sensor histidine kinase Length = 349 Score = 29.8 bits (67), Expect = 0.026 Identities = 20/107 (18%), Positives = 42/107 (39%), Gaps = 14/107 (13%) Query: 205 AALSVLLSVGLALTVSRGPWLQVGVM----------VVAGFWMAFAQTR--RDPA--ARR 250 + + ++L+ + R WL++ + VV G A T R A + Sbjct: 49 SLMGLVLTHAYRSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTK 108 Query: 251 ARAWVIPVVLGALFVAVNVAVRWANAHYHLGLAESAAERMRDAGQIA 297 A+ +P+ L +F V V W+ ++ ++ + D ++A Sbjct: 109 PVAFTLPLALSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMA 155
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 43.0 bits (101), Expect = 8e-08 Identities = 25/131 (19%), Positives = 38/131 (29%), Gaps = 2/131 (1%) Query: 6 APVGDSRDGRCRRTTTNMKTYSTYLTLPLAASLLAGCAAFAPSDAAKLECTMPVAAYPEN 65 P D + R + T T A + + S L P YP Sbjct: 113 QPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQ--YPAR 170 Query: 66 AKPLERRATVLVRAMITASGNAENVTVTTSSRNAAADRAAVDAMSRIVCSQTPARGGEPY 125 A+ L V V+ +T G +NV + ++ +R +AM R G Sbjct: 171 AQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVV 230 Query: 126 PFTLTRPFVFE 136 E Sbjct: 231 NILFKINGTTE 241
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.7 bits (66), Expect = 0.023 Identities = 14/35 (40%), Positives = 17/35 (48%) Query: 32 VVFVGPSGCGKSTLMRMIAGLEDISGGELLIDGAK 66 VV G G GKSTL+ + GL+ S I K Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 72.0 bits (176), Expect = 5e-17 Identities = 53/199 (26%), Positives = 76/199 (38%), Gaps = 16/199 (8%) Query: 3 IRDNVFLITGGASGLGAGTARLLTEAGGKVVLADLNQDAGEALARELGGVFVRCDVAREE 62 I + ITG A G+G AR L G + D N + E + L + + Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 63 DAQAAVAA------ATKLGTLRGLVNCAGIAPAAKTVGKDGPHPLELFAKTITVNLIGTF 116 +A ++G + LVN AG+ G E + T +VN G F Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVL----RPGLIHSLSDEEWEATFSVNSTGVF 121 Query: 117 NMIRVAAAAMAANEPAPTGERGVIVSTASVAAFDGQIGQAAYAASKAGVAGMTLPIARDL 176 N R + M G IV+ S A + AAYA+SKA T + +L Sbjct: 122 NASRSVSKYMMDRR------SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL 175 Query: 177 SRNAIRVMTIAPGIFETPM 195 + IR ++PG ET M Sbjct: 176 AEYNIRCNIVSPGSTETDM 194
>SECA#SecA protein signature. Length = 901 Score = 26.0 bits (57), Expect = 0.010 Identities = 14/36 (38%), Positives = 21/36 (58%), Gaps = 5/36 (13%) Query: 33 KLAYPIRDGIPVMLVDEARQTVEGTPVDPAGPAQGR 68 KL Y + D + +L+DEAR TP+ +GPA+ Sbjct: 202 KLHYALVDEVDSILIDEAR-----TPLIISGPAEDS 232
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 29.9 bits (67), Expect = 0.012 Identities = 14/59 (23%), Positives = 26/59 (44%), Gaps = 3/59 (5%) Query: 149 TLMPAAQRLAARLLMIAEGYG---GISTRHRRIRLSQERLGAMLSLSRQTANQLLKELA 204 TL+ A+ AAR +I + G T + +R +++ + S N K++A Sbjct: 118 TLVSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIA 176
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 38.6 bits (90), Expect = 3e-05 Identities = 32/192 (16%), Positives = 56/192 (29%), Gaps = 51/192 (26%) Query: 32 RVLIVG-CGDVGMRCAAQLRARHENLRVIALTS---------RRSRCAELRAAGVVPVVG 81 + L+ G G +G + +L +V+ + + +++R L G Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGH--QVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59 Query: 82 DLDARATLERIARVAHV--VLHLAPPQATGHVDRRTQALVAALASPRRPRQLAPAYGRLR 139 DL R + + H V +A S P AY Sbjct: 60 DLADREGMTDLFASGHFERVFISP-------------HRLAVRYSLENPH----AYADSN 102 Query: 140 -AGW----AAARSARPRFQASAIVPDAPSRPVVVYASTSGVYGDCGGARVDETRPV-RPA 193 G+ R + + ++YAS+S VYG V P Sbjct: 103 LTGFLNILEGCRHNKIQH--------------LLYASSSSVYGLNRKMPFSTDDSVDHPV 148 Query: 194 NPRAQRRVSAER 205 + A + + E Sbjct: 149 SLYAATKKANEL 160
>PF05616#Neisseria meningitidis TspB protein Length = 501 Score = 27.4 bits (60), Expect = 0.036 Identities = 20/61 (32%), Positives = 25/61 (40%), Gaps = 15/61 (24%) Query: 77 WLADVRK--------GNDPSAAKLAARMS--PTVKELCTQFMEEYSRP-----RNKPSTV 121 W D R+ G D S +L + S P VKEL ME +RP RN+P Sbjct: 131 WYEDERRINRTYGCYGVDSSIMRLMSDYSRFPEVKELMESQMERLARPYWEKLRNRPDMY 190 Query: 122 D 122 Sbjct: 191 Y 191
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 43.4 bits (102), Expect = 6e-07 Identities = 38/174 (21%), Positives = 62/174 (35%), Gaps = 15/174 (8%) Query: 50 ACALAPAVAHAELAVTDDAGHTITLAAPARRVVSLAPHVTELIYAAG----GGAKLVGAV 105 A AL+P + A R+V+L EL+ A G G A + Sbjct: 15 AMALSPLLWQMNTAHAAAID--------PNRIVALEWLPVELLLALGIVPYGVADTINYR 66 Query: 106 SYSDYPPAAKAIPRVGSNQALDLERIAALKPDLIVVWRHGNAGRETERLRALGIPLYFSE 165 + PP ++ VG +LE + +KP +V E A G FS+ Sbjct: 67 LWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSD 126 Query: 166 PRH-LDDVAASLDKLGTLLGTREIAAAAANAYRQQIARLRARYAGK--PPVTVF 216 + L SL ++ LL + A Y I ++ R+ + P+ + Sbjct: 127 GKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT 180
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 114 bits (286), Expect = 9e-30 Identities = 73/398 (18%), Positives = 153/398 (38%), Gaps = 16/398 (4%) Query: 25 LAVLDGAIANVALPTIARDLRASDAASIWIVNAYQLAVTISLLPLASLGDRIGYRRVYIA 84 +VL+ + NV+LP IA D A++ W+ A+ L +I L D++G +R+ + Sbjct: 25 FSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLF 84 Query: 85 GLALFTAASLGCALS-STLPALATLRVIQGFGAAGIMSVNTALVRMIYPSSQLGRGVAIN 143 G+ + S+ + S L R IQG GAA ++ +V P G+ + Sbjct: 85 GIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144 Query: 144 AMVVALSSAVGPTIASAVLAVAPWPWLFAINVPIGVAAVYGSLRALPVNPGR-DAPYDFV 202 +VA+ VGP I + W +L +P+ L L R +D Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRIKGHFDIK 202 Query: 203 SAMMNACVFGLLIVSVDGLGHGEDRVSVALTALAAVVIGYF-FVRRQLTQPAPLLPVDLL 261 ++ + ++ S +++ L V+ + FV+ P + L Sbjct: 203 GIILMSVGIVFFMLFTT---------SYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLG 253 Query: 262 RIPIFALSISTSVASFTSQMLAFVALPFWLQNTLGFSQVQTG-LYMTPWPLVIVVAAPLA 320 + F + + F + +P+ +++ S + G + + P + +++ + Sbjct: 254 KNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIG 313 Query: 321 GVLSDRYSAGALGGVGLALFASGLLALATIGAHPTPVDIVWRMALCGAGFGLFQSPNNRA 380 G+L DR + +G+ + L + + T + + G ++ + Sbjct: 314 GILVDRRGPLYVLNIGVTFLSVSFLTASFL-LETTSWFMTIIIVFVLGGLSFTKTVISTI 372 Query: 381 ILSAAPRERAGGASGMLGTARLTGQTFGAALVALIFGV 418 + S+ ++ AG +L + G A+V + + Sbjct: 373 VSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 31.7 bits (71), Expect = 0.007 Identities = 43/225 (19%), Positives = 78/225 (34%), Gaps = 12/225 (5%) Query: 106 LAQARQACLAARKLLAAGTDPTEQKREIKRARAIEASSSFEAVAREWFESQKDGWTEVYA 165 L Q + L A+ L + E + R I ++ E + Sbjct: 136 LNQKKITSLGAKNFLTRTAE--EIGEQAVREGNINGPEAYMRFLDREMEGLTAAYNVKLF 193 Query: 166 NKVINSLEVDAFPRIGSKPLRDIEAPDMLEIVRAIEARGVRETAKRVLQRSRAVFQYGIM 225 + I+SL++ +K + A + A EA+ E R RA Y + Sbjct: 194 TEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRAANTYAMP 253 Query: 226 TGRCSRNPAADIDAETVLKKGQGVKHMARVKPVEIPQLMRDIAAYSGDRVTQLALRFMAL 285 AA +++ QG +A+ I L R +A+ +A+ F +L Sbjct: 254 ANGSVVATAA---GRGLIQVAQGAASLAQAISDAIAVLGRVLASAPS----VMAVGFASL 306 Query: 286 TFTRTTEMINAEWDEFDERAAEWRIPPDRMKMRDPHIVPLSRQAL 330 T++ T +W + + + + D K+ P V L+ A Sbjct: 307 TYSSRT---AEQWQDQTPDSVRYALGMDAAKLGLPPSVNLNAVAK 348
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 35.7 bits (82), Expect = 4e-05 Identities = 19/61 (31%), Positives = 34/61 (55%), Gaps = 5/61 (8%) Query: 95 APNGLAANAGIAAVTQVLTGNI----ASNGLAHGPTAGVASASGIGGMIAGSVTNAVAPL 150 A A AG+ T+VL GN+ + +A G+++++ G+IA +VT A++PL Sbjct: 263 ADTRTKAAAGVELTTKVL-GNVGKGISQYIIAQRAAQGLSTSAAAAGLIASAVTLAISPL 321 Query: 151 T 151 + Sbjct: 322 S 322
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 30.2 bits (68), Expect = 0.023 Identities = 36/244 (14%), Positives = 70/244 (28%), Gaps = 31/244 (12%) Query: 92 FRSVSDHGSASYMAGRSSAFDASYKTAKSSSSTSDSSSWSRSGSQSASSS--AANGSLSV 149 + + +D + D + + + + R Q + +L + Sbjct: 486 YFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYL 545 Query: 150 TTSK----------------IGLASNGGTASVSGGANLSASEKESFSVAKSFVPVPHGF- 192 + S + A ++S +A +K + V +P Sbjct: 546 SGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHW 605 Query: 193 --SANGSENESVTASISGSAHLNWGKQTYSGQYGAYDATKKSSTDSTSSASDSSWSASHS 250 S + S+ +AS S S LN +G YG S + + S S Sbjct: 606 LRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGS 665 Query: 251 DTSASSSSSGAMNKTASASFSKQSKWNYNDTRSSVDVTKTGSVTQYVDTRQAGTLTATTG 310 A+ + G A+ +S ++D + +G V + TL Sbjct: 666 TGYATLNYRGGY-GNANIGYS------HSDDIKQLYYGVSGGV---LAHANGVTLGQPLN 715 Query: 311 DKAA 314 D Sbjct: 716 DTVV 719
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 338 bits (868), Expect = e-114 Identities = 131/343 (38%), Positives = 190/343 (55%), Gaps = 38/343 (11%) Query: 166 ESNEMVGACDAMQQLFRTIRKIALTDATVFISGESGTGKELSALAIHERSARGKAPFVAI 225 + +VG AMQ+++R + ++ TD T+ I+GESGTGKEL A A+H+ R PFVAI Sbjct: 135 DGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194 Query: 226 NCGAIPPNLLQSELFGYERGAFTGANQRKIGRVEAAAGGTLFLDEIGDMPLESQASMLRF 285 N AIP +L++SELFG+E+GAFTGA R GR E A GGTLFLDEIGDMP+++Q +LR Sbjct: 195 NMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRV 254 Query: 286 LQEGKIERLGGREPIPVDVRIVSATHVDIEAAIREGRFREDLYHRLCVLRLDIPALRARG 345 LQ+G+ +GGR PI DVRIV+AT+ D++ +I +G FREDLY+RL V+ L +P LR R Sbjct: 255 LQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRA 314 Query: 346 KDIEILAHRALHKFGGDSARQIRGFTSCAIEAMYRYSWPGNVRELINRIRRAIVLSDSCL 405 +DI L + + + ++ F A+E M + WPGNVREL N +RR L + Sbjct: 315 EDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDV 373 Query: 406 ISAADLD-------------------------------LAQFVTQHA------TTLAQAR 428 I+ ++ + Q+ + Sbjct: 374 ITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVL 433 Query: 429 DIAEHRAIEASLLRHRGHLAEAATELGVSCTALSRLMAKYGLP 471 E+ I A+L RG+ +AA LG++ L + + + G+ Sbjct: 434 AEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 41.6 bits (97), Expect = 3e-06 Identities = 24/148 (16%), Positives = 48/148 (32%), Gaps = 16/148 (10%) Query: 149 QQQADAARARHDARLARQKREREAAEARAAARRAASAAAA-APAPTAAASAAPAADDPEA 207 + A ++ +++ +K E++A E A R A A + A T A + + + Sbjct: 1036 TTETVAENSKQESK-TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094 Query: 208 KKRAIIA----------AALERARKKKEELAAQGAGPKN----TEGVSAAVQAQIDAAEA 253 + A +E + ++ PK T A + D Sbjct: 1095 TQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVN 1154 Query: 254 RRRRLAGQRDREDDARPASDTSPTPKTE 281 + + D +PA +TS + Sbjct: 1155 IKEPQSQTNTTADTEQPAKETSSNVEQP 1182
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 73.1 bits (179), Expect = 4e-18 Identities = 31/189 (16%), Positives = 65/189 (34%), Gaps = 10/189 (5%) Query: 5 KIKRDPEGTRRRILLAAAEEFATGGLFGARVDQIARRAETNERMLYYYFGSKELLFTAVL 64 K K++ + TR+ IL A F+ G+ + +IA+ A +Y++F K LF+ + Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63 Query: 65 EYAFSALMEAERTIDLDGVAPVEAITR---LAHFVWDYYRDHPDLLRLLNNENLHEARYL 121 E + S + E E ++ R + + LL + + Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123 Query: 122 QKSTRIREMI-SPIVKTLDGVLERGQKAGLFRTDIDPLRFYVTLSGL------GYYMVSN 174 + + + ++ L+ +A + D+ R + + G + Sbjct: 124 AVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQ 183 Query: 175 RFTLAAIFG 183 F L Sbjct: 184 SFDLKKEAR 192
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 29.9 bits (67), Expect = 0.018 Identities = 12/62 (19%), Positives = 20/62 (32%) Query: 415 QPKKPAPQAGPTPTSPSTPRQSTSGRETASAAPAKAAALRLTSAKRPAAKTRAAKPAAAK 474 QPK+ P SP + + A + S R ++ + PA A+ Sbjct: 113 QPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQ 172 Query: 475 RA 476 Sbjct: 173 AL 174
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 43.9 bits (103), Expect = 7e-06 Identities = 45/294 (15%), Positives = 83/294 (28%), Gaps = 21/294 (7%) Query: 718 AATPAPTATSETSDATDAKDAIGAADTKPQAVVAQHAPAIAAADRPPSTVHPASAAAVAN 777 TP S ++ + I D A V APA + + + Sbjct: 997 ITTPNNIQADVPSVPSNNE-EIARVDE---APVPPPAPATPSETTETVAENSKQESKTVE 1052 Query: 778 DNARHPVAAPASPSAAAAAIDAAAQAP-KTNAGAIDRQSIGAVSGETAHAVAQPAVAAAS 836 N + A + A +A + T + + +T V Sbjct: 1053 KNEQD--ATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE 1110 Query: 837 HAAARVSPAIADLRHA--LAPWEDARDTAAAAATSA--PAPTESRAQPQSPQGTTQSVAA 892 A + ++P ++ +T A A PT + +PQS TT Sbjct: 1111 KAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQ 1170 Query: 893 PA---PDKTEAAASNGSTVPSASASAVSPAAPATSSAAAAPVAPASSATQTSTGNAAGAA 949 PA E + +TV + ++ +P T+ A P + S+ + + Sbjct: 1171 PAKETSSNVEQPVTESTTVNTGNSVVENPE--NTTPATTQPTVNSESSNKPKNRHRRSVR 1228 Query: 950 GIAGAAFGMLDAARAAAATASAAAASASATTPAVGTPGGDRAASTAAAASSAGA 1003 + ++ + A T+ D A A + G Sbjct: 1229 SVPHN-----VEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGK 1277 Score = 42.7 bits (100), Expect = 1e-05 Identities = 43/298 (14%), Positives = 72/298 (24%), Gaps = 23/298 (7%) Query: 430 AATPQPVARSQTAAPAAEIARKRPAAPARAPLYAWNEKPAERIAPAASVHETLRSIEASA 489 Q P+ + A AP+ APA T E S Sbjct: 994 TTNITTPNNIQADVPSVPSNNEEIARVDEAPV--------PPPAPATPSETTETVAENSK 1045 Query: 490 AQWTALAGATGAAAAPEAACEPALAPAARSGDAAMQAASGMHAPTTVETAAVAIPAGTAT 549 + + A A A + A Q + + + TAT Sbjct: 1046 QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETAT 1105 Query: 550 AVPPVDDRV-------APDIAADVTCAAEDGAAEAVEAVEAVEAVEAVEAATVPATPAVI 602 +V P + + V+ E +A A E + P + Sbjct: 1106 VEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN-DPTVNIKEPQSQTNT 1164 Query: 603 GSSAIANARAAASAVAPASGGVGTRIAHGHETRLSVEAAPTATEDARHADASFALDAAAA 662 + A+ +S V T P T+ ++++S Sbjct: 1165 TADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNK------ 1218 Query: 663 GAAVGNAVPGVDVAATVDESAKQSPLPSAAPASGAAAPLAASATSSGAAATQPVAAAT 720 + V V+ + S S + + S A Q VA Sbjct: 1219 -PKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNV 1275
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 61.0 bits (148), Expect = 3e-12 Identities = 39/148 (26%), Positives = 58/148 (39%), Gaps = 6/148 (4%) Query: 259 VIAACIIVPQAIVAMLSPWVGRSAQRWGRRPILLLGFAALPLRALLFAGVSSPYLLVPVQ 318 ++ A + Q A P +G + R+GRRP+LL+ A + + A ++L + Sbjct: 47 ILLALYALMQFACA---PVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGR 103 Query: 319 MLDGISAAVFGVMLPLIAADVAGGKGRYNLCIGLFGLAAGVGATLSTALAGFAADHFGNA 378 ++ GI+ A V I AD+ G R G G G L G F Sbjct: 104 IVAGITGATGAVAGAYI-ADITDGDERARH-FGFMSACFGFGMVAGPVLGGLMGG-FSPH 160 Query: 379 MSFFGLAAAGALATLLVWFAMPETRDAT 406 FF AA L L F +PE+ Sbjct: 161 APFFAAAALNGLNFLTGCFLLPESHKGE 188
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.8 bits (69), Expect = 0.007 Identities = 17/101 (16%), Positives = 35/101 (34%), Gaps = 7/101 (6%) Query: 3 VEPASEPVAAPEPASAPEPVETTAPKKPHREAAPRRKPARVAPPVPR-------PAPPPA 55 V+P +EP +P + ++ E + + V PV + Sbjct: 1139 VQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVEN 1198 Query: 56 PLVTTRAIERSQVHALLDSEVRRSGKVIGRAVDMTADAAGA 96 P TT A + V++ ++ + + R+V + A Sbjct: 1199 PENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATT 1239
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 32.9 bits (75), Expect = 1e-04 Identities = 18/57 (31%), Positives = 25/57 (43%), Gaps = 2/57 (3%) Query: 16 LEPHCRGMALALNSSGIFAGISLGSALGGRVADT--WGVGLLAPTSAALTVAALIAF 70 + RG A L S + G +G A+GG +A W LL P +TV L+ Sbjct: 132 IPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKL 188
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 89.1 bits (221), Expect = 1e-22 Identities = 39/157 (24%), Positives = 70/157 (44%), Gaps = 3/157 (1%) Query: 9 TVVLIEDEKQIRRFVRSALEEEGIAVFDAETGRQGLIEAATRKPDLAIVDLGLPDGDGLD 68 T+++ +D+ IR + AL G V A DL + D+ +PD + D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 69 VIRELR-GWSEMPVIVLSARTHEEEKVAALDAGADDYLTKPFGVSELLARIRAHL--RRR 125 ++ ++ ++PV+V+SA+ + A + GA DYL KPF ++EL+ I L +R Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 126 NQGGAAESPVVKFGDVSVDLALRRVWRGGEVVHLTPL 162 + V A++ ++R + T L Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDL 161
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.7 bits (72), Expect = 0.015 Identities = 23/116 (19%), Positives = 36/116 (31%), Gaps = 23/116 (19%) Query: 404 LPPLAPLTRVPPARVLRREWGDAGRVAWLGYAVGIALFAALLIAAAGNLTLGAIVAGGFA 463 LP L P +R R + Y + L A +++ G + + A Sbjct: 42 LPAHLELIETPVSRRPR----------LVAYFIMGFLVIAFILSVLGQVEIVATA----N 87 Query: 464 GSLVLFALVARLALFALARV----VRDG-RVAAGLGWRYALASLDRRGAASALQIT 514 G L + + V V++G V G L L GA + T Sbjct: 88 GKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKG----DVLLKLTALGAEADTLKT 139
>SECGEXPORT#Protein-export SecG membrane protein signature. Length = 110 Score = 82.7 bits (204), Expect = 6e-24 Identities = 47/102 (46%), Positives = 69/102 (67%), Gaps = 1/102 (0%) Query: 8 IIVVQLLSALGVIGLVLLQHGKGADMGAAFGSGASGSLFGATGSANFLSRTTAVLATIFF 67 ++VV L+ A+G++GL++LQ GKGADMGA+FG+GAS +LFG++GS NF++R TA+LAT+FF Sbjct: 5 LLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLATLFF 64 Query: 68 VATLALTYLGSYKSAPSVGVLGAAPAPAASAAAASQAPAASA 109 + +L L + S K+ APA + APA Sbjct: 65 IISLVLGNINSNKTNKGSEWEN-LSAPAKTEQTQPAAPAKPT 105
>OUTRMMBRANEA#Outer membrane protein A signature. Length = 346 Score = 29.9 bits (67), Expect = 0.013 Identities = 16/96 (16%), Positives = 28/96 (29%), Gaps = 10/96 (10%) Query: 138 YAVILAGWASNSKYAFLGAMR-------AAAQMVSYEISMGFALVLVLMTAGSLNLSEIV 190 Y GW+ F+ A Y+++ + G + Sbjct: 29 YTGAKLGWSQYHDTGFINNNGPTHENQLGAGAFGGYQVNPYVGFEMGYDWLGRMPY---K 85 Query: 191 GSQQHGFFAGHGVNFLSWNWLPLLPVFVIYFISGIA 226 GS ++G + GV + P+ IY G Sbjct: 86 GSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGM 121
>OMADHESIN#Yersinia outer membrane adhesin signature. Length = 455 Score = 29.1 bits (64), Expect = 0.015 Identities = 15/45 (33%), Positives = 25/45 (55%) Query: 150 AIAVGVVAAAAAGVQIAIAEGTLVVVPSGYALNALLLALGEAWFT 194 +IA+G A AA G +A+ G++ + A+ L ALG++ T Sbjct: 72 SIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVT 116
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 94.1 bits (234), Expect = 2e-23 Identities = 89/396 (22%), Positives = 140/396 (35%), Gaps = 75/396 (18%) Query: 51 MKKSLLALVALSAFAGAAHAQSSVTLYGIIDEGFNINTNAGGKHL-----YNLSSGVLQG 105 MKKSL+AL L+A AA A VTLYG I G + + + V G Sbjct: 1 MKKSLIALT-LAALPVAAMAD--VTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLG 57 Query: 106 SRWGLRGTEDLGGGLKALFVLENGFDVNSGKLNQGGLEFGRQAYVGLSSSFGTVTLGRQY 165 S+ G +G EDLG GLKA++ +E + G RQ+++GL FG + +GR Sbjct: 58 SKIGFKGQEDLGNGLKAIWQVEQKASIAGTDSGWGN----RQSFIGLKGGFGKLRVGRLN 113 Query: 166 DSVVDF--VGPLEA-GDQWGGYIAAHPGDLDNFNNAYRVNNAVKFTSANYGGFTFGGLYS 222 + D + P ++ D G A P + + V++ S + G + Y+ Sbjct: 114 SVLKDTGDINPWDSKSDYLGVNKIAEP---EARLIS------VRYDSPEFAGLSGSVQYA 164 Query: 223 FGGVAGDFSRNQTWSLGAGYTNGPLVLGVGYLNARTPSTAGGLFGNNTTSSSPAAVTTPV 282 AG ++++ G Y NG + G R + Sbjct: 165 LNDNAG-RHNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEK------------- 210 Query: 283 YAGYASAHTYQVIGAGGAYSFGAATVGVTYSNIKFMNFASTVFPNQTATFNNAEINFKY- 341 YQ+ Y A V + + + + T A + +++ Sbjct: 211 ---------YQIHRLVSGYDNDALYASV-AVQQQDAKLVEENYSHNSQTEVAATLAYRFG 260 Query: 342 QLTPTLLAGAAYDYTQGSKIAGA-SAAKYHQGSVGVDYFLSKRTDVYAIGVYQHASGNVI 400 +TP + +Y + Y Q VG +Y SKRT + Sbjct: 261 NVTPRV----SYAHGFKGSFDATNYNNDYDQVVVGAEYDFSKRTSALVSAGWLQ------ 310 Query: 401 EADGNTVGPATAAINGLTPSSNRNQFTARVGIRHKF 436 E G + +TA VG+RHKF Sbjct: 311 EGKGESKFVSTAGG---------------VGLRHKF 331
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 125 bits (314), Expect = 1e-36 Identities = 77/257 (29%), Positives = 131/257 (50%), Gaps = 8/257 (3%) Query: 14 LAGKVALVTGAGRGIGAAIARAFAREGAAVAIAELDAALADETVDAIARDVADARVLAVP 73 + GK+A +TGA +GIG A+AR A +GA +A + + ++ V ++ + A A P Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE--AFP 63 Query: 74 ADVAQAESVAAALACTERAFGPLDVLVNNAGVNVFGDPLALAEEDWRRCFAIDLDGVWHG 133 ADV + ++ A ER GP+D+LVN AGV G +L++E+W F+++ GV++ Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 134 CRAALPGMVERGRGSIVNIASTHAFKIIPGCFPYPVAKHGVLGLTRALGVEYAPRNVRVN 193 R+ M++R GSIV + S A Y +K + T+ LG+E A N+R N Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 194 AIAPGYIETQSTHDWWNAQPDPEAARRETLALQ-----PMKRIGRADEVAMTAVFLASDE 248 ++PG ET W + + + P+K++ + ++A +FL S + Sbjct: 184 IVSPGSTETDMQWSLWADE-NGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 249 APFINASCITIDGGRSV 265 A I + +DGG ++ Sbjct: 243 AGHITMHNLCVDGGATL 259
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.7 bits (66), Expect = 0.037 Identities = 14/38 (36%), Positives = 17/38 (44%), Gaps = 5/38 (13%) Query: 18 RALD-GISFDVQAGQVHGLMGENGAGKSTLLKILGGEY 54 R ++ G FD L G G GKSTL+ L G Sbjct: 587 RVMEPGCKFDY----SVVLEGTGGIGKSTLINTLVGLD 620
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 127 bits (319), Expect = 2e-37 Identities = 78/251 (31%), Positives = 112/251 (44%), Gaps = 8/251 (3%) Query: 27 LAGRAVLITGGATGIGASFVEHFARQGARVAFVDLDEQAARALAARLADAAHEPVFVACD 86 + G+ ITG A GIG + A QGA +A VD + + + + L A D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 87 LTDIAALRGAIEAIRARIGPIAALVNNAANDVRHAIADVTPDSFDACIAVNLRHQFFAAQ 146 + D AA+ I +GPI LVN A I ++ + ++A +VN F A++ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 147 AVIDDMKRLGGGSIVNLGSISWMLKNAGYPVYASAKAAVQGLTRALARELGPFGIRVNTL 206 +V M GSIV +GS + YAS+KAA T+ L EL + IR N + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 207 VPGWVMTDKQRRLWLDDAGRAAIKAGQCIDAEL--------LPGDLARMALFLAADDSRM 258 PG TD Q LW D+ G + G + P D+A LFL + + Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 259 ITAQDVVVDGG 269 IT ++ VDGG Sbjct: 246 ITMHNLCVDGG 256
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 31.3 bits (71), Expect = 0.002 Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 7/75 (9%) Query: 99 VHTIDRLKIAQRLAEQRPAHLPPLNVCVQVNISGEASKSGVAPSDAAELARAIAALPALR 158 VH+ +LK Q + P + + ++ ++ G P + + + A+ + Sbjct: 100 VHSNWQLKALQNARLKAPLD-------IYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVG 152 Query: 159 LRGLMAIPEPAADPE 173 LM+ A P+ Sbjct: 153 EMTLMSHFAEAEHPD 167
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 30.5 bits (68), Expect = 0.021 Identities = 26/132 (19%), Positives = 48/132 (36%), Gaps = 23/132 (17%) Query: 262 DALASLEPAEIQLQEASYSLSHYAQRLDLDPDRLAQVETRLDALHSTARKFRLPPETLHG 321 +A E++ A+Y++ L + ++ ++ R++ L + Sbjct: 171 EAYMRFLDREMEGLTAAYNV-------KLFTEAISSLQIRMNTLTAAKASIEAAAANK-- 221 Query: 322 EHEARRAQLAELDAAADLSALQAIADRAKDAYL----------ADAKKLSKARAQAAKAL 371 AR AE A+ A Q A RA + Y A + L + AQ A +L Sbjct: 222 ---AREQAAAEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGRGLIQV-AQGAASL 277 Query: 372 GAAVTTGMQELS 383 A++ + L Sbjct: 278 AQAISDAIAVLG 289
>SECA#SecA protein signature. Length = 901 Score = 28.7 bits (64), Expect = 0.014 Identities = 17/49 (34%), Positives = 23/49 (46%) Query: 93 IKSEEALADDAIAYVGDDVNDLPVIDLVGVSYAPADAHALVKRRVDYVV 141 + EE L + I G+ + I L+ A AHAL R VDY+V Sbjct: 280 VLIEELLVKEGIMDEGESLYSPANIMLMHHVTAALRAHALFTRDVDYIV 328
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 98.9 bits (246), Expect = 2e-27 Identities = 31/137 (22%), Positives = 54/137 (39%), Gaps = 10/137 (7%) Query: 33 QGGAVSTQPNPENVAQVTVDPLNDPNSPLAKRSVYFDFDSYSVQDQYQPLLQQHAQYLKS 92 QG A A S V F+F+ +++ + Q L Q L + Sbjct: 193 QGEAAPVVAPAPAPAPEVQTKHFTLKS-----DVLFNFNKATLKPEGQAALDQLYSQLSN 247 Query: 93 HPQRH--ILIQGNTDERGTSEYNLALGQKRAEAVRRALSLLGVGDSQMEAVSLGKEKPVA 150 + +++ G TD G+ YN L ++RA++V L G+ ++ A +G+ PV Sbjct: 248 LDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVT 307 Query: 151 LGHDEASWAQNRRADLV 167 +RA L+ Sbjct: 308 ---GNTCDNVKQRAALI 321
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 58.2 bits (140), Expect = 2e-11 Identities = 26/149 (17%), Positives = 55/149 (36%), Gaps = 10/149 (6%) Query: 77 VAPPPPPVKNEEADIALQQKRREQQAAAEREAQLEEQRRQQQLKAQQ-----LAAQQAAQ 131 V PP P +E + + ++E + + E E Q + A++ A Q + Sbjct: 1025 VPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNE 1084 Query: 132 LAAQKAVEREKQKQAEKLKQQQQLAEQQRKLEQQKLEQQKLE-----QQKKQEQLAAQKK 186 +A + +E Q K + E+ + ++ E K+ +Q++ E + Q + Sbjct: 1085 VAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAE 1144 Query: 187 ADAEKAAKAAAAKANAAAKAKLDKERQAR 215 E + + D E+ A+ Sbjct: 1145 PARENDPTVNIKEPQSQTNTTADTEQPAK 1173 Score = 36.2 bits (83), Expect = 2e-04 Identities = 21/148 (14%), Positives = 41/148 (27%), Gaps = 16/148 (10%) Query: 86 NEEADIALQQKRREQQAAAEREAQLEEQRRQQQLKAQQLAAQQAAQLAAQKAVEREKQKQ 145 N E + Q + + +A A E + Sbjct: 982 NPEVEKRNQTVDTTNITTPNNIQA--DVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039 Query: 146 -AEKLKQQQQLAEQQRKLEQQKLEQQKLE------------QQKKQEQLAAQKKADAEKA 192 AE KQ+ + E+ + + Q + Q + Q ++ K Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099 Query: 193 AKAAAAKANAAAKAKLDKERQARLAQLQ 220 K A KAK++ E+ + ++ Sbjct: 1100 TKETATVE-KEEKAKVETEKTQEVPKVT 1126 Score = 30.8 bits (69), Expect = 0.010 Identities = 13/109 (11%), Positives = 39/109 (35%) Query: 87 EEADIALQQKRREQQAAAEREAQLEEQRRQQQLKAQQLAAQQAAQLAAQKAVEREKQKQA 146 +EA ++ + + A E Q + + A ++A + + Q Sbjct: 1070 KEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQV 1129 Query: 147 EKLKQQQQLAEQQRKLEQQKLEQQKLEQQKKQEQLAAQKKADAEKAAKA 195 ++Q + + Q + ++ +++ + Q A + A++ + Sbjct: 1130 SPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSN 1178
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 88.2 bits (218), Expect = 5e-23 Identities = 55/180 (30%), Positives = 85/180 (47%), Gaps = 4/180 (2%) Query: 2 IVFVTGASAGFGAAIARAFVKGGHRVVASARRKERLDALAAELGGALLPIE---LDVRDR 58 I F+TGA+ G G A+AR G + A E+L+ + + L E DVRD Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69 Query: 59 AAVEAVPAALPAEFAAIDVLVNNAGLALGVEPAHRASLDEWQTMIDTNCSGLVTVTRTLL 118 AA++ + A + E ID+LVN AG+ L H S +EW+ N +G+ +R++ Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 119 PGMVERGRGHIFNLGSVAGSYPYPGGNVYGATKAFVRQFSLNLRADLIGTPLRVTDIEPG 178 M++R G I +GS P Y ++KA F+ L +L +R + PG Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 38.4 bits (89), Expect = 5e-06 Identities = 14/53 (26%), Positives = 27/53 (50%), Gaps = 1/53 (1%) Query: 12 GFMLVELMVALVIVALVAVLSVPTFAGARMRDRVDARARVFGASLAYARGEAV 64 GF L+E+M+ L+++ + A + + F +R AR F A L + + + Sbjct: 5 GFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLAR-FEAQLRFVQQRGL 56
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 47.6 bits (113), Expect = 1e-09 Identities = 16/59 (27%), Positives = 35/59 (59%) Query: 4 IERSSRLRGFTLIEVVVALAIVAVLAAFAVPSYRSHVERGNRLTAIAALYRAAQYVDAF 62 + + + RGFTL+E++V + I+ VLA+ VP+ + E+ ++ A++ + +D + Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMY 59
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.5 bits (74), Expect = 0.003 Identities = 74/361 (20%), Positives = 133/361 (36%), Gaps = 32/361 (8%) Query: 18 QIVSVVSFTFVCYLTIGLPLAVLPGFVHDELGFSAIVAGAAISVQYFAT--LASRPLAGR 75 ++ ++S + + IGL + VLPG + D + + + A I + +A A P+ G Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65 Query: 76 CADTLGPKRTVLRGLAACGASGALLLSAFAFARWPAASIVLLVASRLVLGV-GESLVGTG 134 +D G + +L LA GA+ + A A W +L R+V G+ G + G Sbjct: 66 LSDRFGRRPVLLVSLA--GAAVDYAIMATAPFLW------VLYIGRIVAGITGATGAVAG 117 Query: 135 AILWGI----------GRVGAAHNARVISWNGIATY-GALAIGAPVGVAIAHALIPAVLG 183 A + I G + A +++ + G + AP A A + + G Sbjct: 118 AYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTG 177 Query: 184 VLVIALAALGYYLARLIAPVPLVHGERMSYASVFTRVLPHGLGLALGSAGFGSI-ATFIT 242 ++ + G R ++ + V+ + + G + A Sbjct: 178 CFLLPESHKG---ERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWV 234 Query: 243 LYYASR-HWPNA--ALSLTVFGTLFIGARLLFANTIKTHGGFRVAI-VSFAFECAGLLML 298 ++ R HW +SL FG L A+ + + G R A+ + + G ++L Sbjct: 235 IFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILL 294 Query: 299 WLAPVPHVALVGAALTGFGFALIFPALGVEAVALVPPASRGAALSAYSVFLDLSLGITGP 358 A +A L G + PAL V +G + + L+ I GP Sbjct: 295 AFATRGWMAFPIMVLLASG-GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLT-SIVGP 352 Query: 359 L 359 L Sbjct: 353 L 353
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 174 bits (444), Expect = 2e-53 Identities = 90/350 (25%), Positives = 137/350 (39%), Gaps = 45/350 (12%) Query: 46 ILVTGGAGFIGANFVLDWLAQSDEAVLNVDKLT--YAGNLGTLK-SLQGNPKHVFARVDI 102 LVTG AGFIG + L + V+ +D L Y +L + L P F ++D+ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQ-VVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 103 CDRAAIDALLAQYKPRAILHFAAESHVDRSIHGPADFVQTNVVGTFTLLEAARQYWSALD 162 DR + L A + V S+ P + +N+ G +LE R Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRH------ 115 Query: 163 ADAKAAFRFLHVSTDEVFGSLSPADPQFSETTPYA-PNSPYSATKAGSDHLVRAYHHTYG 221 + L+ S+ V+G L+ P FS P S Y+ATK ++ + Y H YG Sbjct: 116 NKIQ---HLLYASSSSVYG-LNRKMP-FSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170 Query: 222 LPTLTTNCSNNYGPYQFPEKLIPLMIANALGGKPLPVYGDGQNVRDWLYVGDHCSAIREV 281 LP YGP+ P+ + L GK + VY G+ RD+ Y+ D AI + Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRL 230 Query: 282 L------------------ARGVPGETYNVGGWNEKKNLDVVHTLCDLLD-EARPKAAGS 322 A P YN+G + + +D + L D L EA+ Sbjct: 231 QDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKN---- 286 Query: 323 YRDQITYVTDRPGHDRRYAIDARKLERELGWKPAETFETGLAKTVRWYLD 372 + +PG + D + L +G+ P T + G+ V WY D Sbjct: 287 ------MLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 58.6 bits (142), Expect = 5e-12 Identities = 34/160 (21%), Positives = 57/160 (35%), Gaps = 27/160 (16%) Query: 1 MKILVTGANGQVGWELARSLAVLGQVV--------------------PLARD-----EAD 35 MK LVTGA G +G+ +++ L G V LA+ + D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 36 LGRPETLARIVEDAKPDVVVNAAAYTAVDAAESDGAAAKVVNGEA-VGVLAAATKRVGGL 94 L E + + + V + AV + + A N + +L Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 95 FVHYSTDYVFDGTKSSPYIETDPT-CPVNAYGASKLLGEL 133 ++ S+ V+ + P+ D PV+ Y A+K EL Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANEL 160
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 31.8 bits (72), Expect = 0.002 Identities = 18/64 (28%), Positives = 28/64 (43%) Query: 195 LFTMVLMFLSPVFYPASALPEKYRFWLELNPLTLFIEQSRGILLEGRVPDFHPLGLALLG 254 L ++FLS +P LP ++ PL+ I+ R I+L V D AL Sbjct: 184 LVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCI 243 Query: 255 GVVV 258 +V+ Sbjct: 244 YIVI 247
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 166 bits (422), Expect = 1e-50 Identities = 82/363 (22%), Positives = 136/363 (37%), Gaps = 58/363 (15%) Query: 13 KILVTGGAGFIGCAISERLAARASRYVVMDNLHPQIHANAVRPVALHEKAE----LVVAD 68 K LVTG AGFIG +S+RL + V +DNL+ + +++ L A+ D Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDY-YDVSLKQARLELLAQPGFQFHKID 60 Query: 69 VTDAGAWDALLSDFQPEIIIHLAAETGTGQSLTEASRHALVNVVGTTRLTDAIVKHGIAV 128 + D L + E + SL +A N+ G + + + + Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK--I 118 Query: 129 EHILLTSSRAVYGEGAWQKADGTIVYPGQRGRAQLEAAQWDFPGMTMLPSRADRTEPRPT 188 +H+L SS +VYG +P D + P Sbjct: 119 QHLLYASSSSVYGLN------------------------------RKMPFSTDDSVDHPV 148 Query: 189 SVYGATKLAQEHVLRAWSLATKTPLSILRLQNVYGPGQSLTNSYTGIVALFSRLAREKKV 248 S+Y ATK A E + +S P + LR VYGP + F++ E K Sbjct: 149 SLYAATKKANELMAHTYSHLYGLPATGLRFFTVYGPWGRPDMALF----KFTKAMLEGKS 204 Query: 249 IPLYEDGNVTRDFVSIDDVADAIVATLAREPEA-----------------LSLFDIGSGQ 291 I +Y G + RDF IDD+A+AI+ P A +++IG+ Sbjct: 205 IDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSS 264 Query: 292 ATSILDMARIIAAHYGAPEPQVNGAFRDGDVRHAACDLSESLANLGWKPQWSLERGIGEL 351 ++D + + G + + GDV + D +G+ P+ +++ G+ Sbjct: 265 PVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNF 324 Query: 352 QTW 354 W Sbjct: 325 VNW 327
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 127 bits (320), Expect = 9e-36 Identities = 91/386 (23%), Positives = 143/386 (37%), Gaps = 62/386 (16%) Query: 1 MKKTLIVAALSGVFATAAHAQSSVTLYGLIDAGITYTNNQGGHSAWSQSSG-----SVNG 55 MKK+LI L+ + A + VTLYG I AG+ + + + A + S G Sbjct: 1 MKKSLIALTLAALPVAAM---ADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLG 57 Query: 56 SRWGLRGAEDLGGGLKAIFVLENGFGINNGTLKQNGREFGRQAFVGLSHDQYGSLTLGRQ 115 S+ G +G EDLG GLKAI+ +E I + RQ+F+GL +G L +GR Sbjct: 58 SKIGFKGQEDLGNGLKAIWQVEQKASIAGT----DSGWGNRQSFIGLKGG-FGKLRVGRL 112 Query: 116 YDSVVDY--IGPLSLTGTQFGGVQFAHPFDNDNLNNSFRINNAVKYTSVNWAGLKFGALY 173 + D I P G + A P + S V+Y S +AGL Y Sbjct: 113 NSVLKDTGDINPWDSKSDYLGVNKIAEP---EARLIS------VRYDSPEFAGLSGSVQY 163 Query: 174 GFSNSNEFANNRAYSAGVSYSYAGFNVGAGYLQLNNDFGPTVSNASGAVALDNTFVGKRQ 233 +++ N+ +Y AG +Y GF V G + ++ Sbjct: 164 ALNDNAGRHNSESYHAGFNYKNGGFFVQYGGAYKRHHQV------------QENVNIEKY 211 Query: 234 RVFGGGLNYTYGPATAGFVFTQSRVNRATAISSGASGVSSGIALDGTFMRFNNYEVNARY 293 ++ Y A + + A + S S RF N Sbjct: 212 QIHRLVSGYD---NDALYASVAVQQQDAKLVEENYSHNSQTEVAATLAYRFG----NVTP 264 Query: 294 AITPAWTVAGSYTYTAGFIENHHPGWNQFNLQTAYALSKRTDVYLQGVYQKVNSDGTGLG 353 ++ A GS+ T N++ ++Q + Y SKRT + + + +G G Sbjct: 265 RVSYAHGFKGSFDAT-----NYNNDYDQVVVGAEYDFSKRTSALVSAGWLQ---EGKGES 316 Query: 354 AYINGVGGMSSSEKQIAVTAGLRHRF 379 ++ A GLRH+F Sbjct: 317 KFV-----------STAGGVGLRHKF 331
>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature. Length = 398 Score = 28.8 bits (64), Expect = 0.035 Identities = 10/50 (20%), Positives = 20/50 (40%), Gaps = 1/50 (2%) Query: 281 DVSAEKTVTLKGFKRDADGDFLVES-VTHEYAGRSWETEVVLNAGNKGKA 329 D +A +GF + + + ++H + + +V L KG A Sbjct: 212 DFNAWSKEYARGFAKTGKSIYYSHASMSHSWDDWDYAAKVTLANSQKGTA 261
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 60.8 bits (147), Expect = 2e-11 Identities = 48/335 (14%), Positives = 114/335 (34%), Gaps = 5/335 (1%) Query: 167 AAGVSKYKERRRETENRLHDTRENLTRVEDIVRELGANLEKLEAQAVVATKYKELVADGE 226 + ++ + + + + D+ A + + + KE + + Sbjct: 46 RSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKND 105 Query: 227 EKQRLLWLLRKNEAAAEQDKQRRAIGEAQIELDAQTAKLREVEAQLETLRVAHYSASDAT 286 + + A + D ++ A+ A A +AK++ +EA+ L A Sbjct: 106 KSLSEKASKIQELEARKADLEK-ALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKAL 164 Query: 287 QGAQGALYEANAEVSRLEAEIKFIVESRNRVQSQIAALVAQQEQWRAQADKAQGDLEEAE 346 +GA +A++ LEAE + + ++ + + A+ + + Sbjct: 165 EGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALA 224 Query: 347 EARAVADEKAAIAEDDAAAKHDALPALEARWRDAQTGLNDERGRIAQTEQALKLEAAHQR 406 +A ++ A + + A + LEA + + + ++A + Sbjct: 225 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIK 284 Query: 407 NADQQLQQLQQRHERLKAEAGGLDAPDEAQLEELRMQLAEHEEILGEAQARLADAQETLP 466 + + L+ L+ ++ L A + LR L E + +A +E Sbjct: 285 TLEAEKAALEAEKADLEHQSQVL----NANRQSLRRDLDASREAKKQLEAEHQKLEEQNK 340 Query: 467 RLDAERRAAHERVQAESAQIHQLEARLAALKQLQE 501 +A R++ + A QLEA L++ + Sbjct: 341 ISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNK 375 Score = 42.0 bits (98), Expect = 1e-05 Identities = 66/337 (19%), Positives = 127/337 (37%), Gaps = 22/337 (6%) Query: 170 VSKYKERRRETENRLHDTRENLTRVEDIVRELGANLEKLEAQAVVATKYKELVADGEEKQ 229 +S KE+ R+ + L + + +E +L LE A + + Sbjct: 94 LSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMN------FSTADSAKIKTLE 147 Query: 230 RLLWLLRKNEAAAEQDKQRRAIGEAQIELDAQTAKLREVEAQLETLRVAHYSASDATQGA 289 K AA + +A+ A A +AK++ +EA+ L A +GA Sbjct: 148 A-----EKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGA 202 Query: 290 QGALYEANAEVSRLEAEIKFIVESRNRVQSQIAALVAQQEQWRAQADKAQGDLEEAEEAR 349 +A++ LEAE + + ++ + + A+ + + E + Sbjct: 203 MNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQ 262 Query: 350 AVADEKAAIAEDDAAAKHDALPALEARWRDAQT---GLNDERGRIAQTEQALKLEAAHQR 406 A ++ A + + A + LEA + L + + Q+L+ + R Sbjct: 263 AELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASR 322 Query: 407 NADQQLQQLQQRHERLK----AEAGGLDAPDEAQLEELRMQLAEHEEILGEAQARLADAQ 462 A +QL+ Q+ E A L +A E + AEH+++ + + A Q Sbjct: 323 EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQ 382 Query: 463 ETLPRLDAERRAAHERVQAESAQIHQLEARLAALKQL 499 LDA R A ++V+ + ++LAAL++L Sbjct: 383 SLRRDLDA-SREAKKQVEKALE---EANSKLAALEKL 415 Score = 40.8 bits (95), Expect = 3e-05 Identities = 40/273 (14%), Positives = 86/273 (31%), Gaps = 6/273 (2%) Query: 741 EELEEIGAQIEEQRALRAESEANFERHDAELAELQARFEDNQLAFESLDETLTNARQEAR 800 E + + + + + + EL+ + + N + + Sbjct: 64 IENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKA 123 Query: 801 ELERAATDARFAARQSANRIDELKRSIQVAHEQAERVAASLEDARAELETINEQTAHTGL 860 +LE+A A + + +I L+ + + +LE A L Sbjct: 124 DLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTAD--SAKIKTL 181 Query: 861 QDALEVRAAKEQALGAARAELDDLTAKLRAADETRLAAERSLQPLRDRITELQLKEQAAR 920 + A++ L A + + A +T E L R +L+ + A Sbjct: 182 EAEKAALEARQAELEKALEGAMNFSTADSAKIKT---LEAEKAALAARKADLEKALEGAM 238 Query: 921 MTGEQFAEQLATAEVDEAALREKLTP-DMKPSYLQGEVTRLNNAINALGPVNMAALEELA 979 + ++ T E ++AAL + + T + I L A E A Sbjct: 239 NFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKA 298 Query: 980 AASERKVFLDAQSADLTNAIETLEDAIRKIDQE 1012 + L+A L ++ +A ++++ E Sbjct: 299 DLEHQSQVLNANRQSLRRDLDASREAKKQLEAE 331 Score = 37.7 bits (87), Expect = 3e-04 Identities = 59/290 (20%), Positives = 108/290 (37%), Gaps = 13/290 (4%) Query: 662 RAQEIENLTRQVRAQALLSDEAKSAAIRAE--AAHTQASQALTEVRAQAERATQRVHALQ 719 + + + L +++A A +AE A A T A+ + AL Sbjct: 165 EGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALA 224 Query: 720 MDVLKLTQAHERYTQRSTQIREELEEIGAQIEEQRALRAESEANFERHDAELAELQARFE 779 L +A E ST +++ + A+ A +AE E E A+ + Sbjct: 225 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIK 284 Query: 780 DNQLAFESLDETLTNARQEARELERAATDARFAARQSANRIDELKRSIQVAHEQAERVAA 839 + +L+ + +++ L R S +L+ Q EQ + A Sbjct: 285 TLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEA 344 Query: 840 SLEDARAELETINEQTAHTGLQDALEVRAAK-EQALGAARAELDDLTAKLRAADETRLAA 898 S + R +L+ E LE K E+ + A L L A+ E + Sbjct: 345 SRQSLRRDLDASREAKK------QLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQV 398 Query: 899 ERSLQPLRDRITELQLK----EQAARMTGEQFAEQLATAEVDEAALREKL 944 E++L+ ++ L+ E++ ++T ++ AE A E + AL+EKL Sbjct: 399 EKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKL 448
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 34.0 bits (78), Expect = 9e-04 Identities = 40/132 (30%), Positives = 57/132 (43%), Gaps = 4/132 (3%) Query: 213 AQTSGNVLAIASLMGIAGAALASYLGGRAARRAMLLAGYAILAASLVALAAAPNAAGFAI 272 G +LA+ +LM A A + L R RR +LL A A +A AP I Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYI 101 Query: 273 A--IFGFKFAWTFVLPFILASVAAVDTTGRLIATLNLVIGSGLAAGPLVAGLMLDGGGTL 330 + G A V +A + D R ++ G G+ AGP++ GLM GG + Sbjct: 102 GRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM--GGFSP 159 Query: 331 RALFSIAAAVSA 342 A F AAA++ Sbjct: 160 HAPFFAAAALNG 171
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 62.1 bits (151), Expect = 8e-13 Identities = 71/324 (21%), Positives = 117/324 (36%), Gaps = 27/324 (8%) Query: 87 TLAALSGPTHAQSTLTLYGVTDAGVQYLSHADGRHDAWRLQNYGI----LPSQIGVKGDE 142 TLAAL P A + +TLYG AGV+ G L S+IG KG E Sbjct: 9 TLAAL--PVAAMADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLGSKIGFKGQE 66 Query: 143 DLGGGWHALFKLEQGVNLNDGAASTPGYAFFRGAYVGVGGPAGAVTLGRQFSTLFDKTLF 202 DLG G A++++EQ + A T R +++G+ G G + +GR S L D Sbjct: 67 DLGNGLKAIWQVEQKAS----IAGTDSGWGNRQSFIGLKGGFGKLRVGRLNSVLKDTGDI 122 Query: 203 YDPLWYASYSGQGVLVPLSANFVDHSIKFQSATFAGFDVEALAATAGIAGNARAGRVLEL 262 +P S + + S+++ S FAG ++ Sbjct: 123 -NPWDSKSDYLGVNKIAEPEARLI-SVRYDSPEFAGL-SGSVQYALNDNAGRHNSESYHA 179 Query: 263 GGQFTSNGLSASVV-LHRSHGAV--DGGVDRAAQRRDLGTVAARYTFASLPLTVYAGVER 319 G + + G ++ H V + +++ R + +AS+ + Sbjct: 180 GFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALYASVAVQQQDAKLV 239 Query: 320 LTGDLDPARTIV-------WGGARYQTSGRFGFAGGIYRTDSPTPQIGHPTLFIASATCS 372 ++T V +G + S GF G T+ + A Sbjct: 240 EENYSHNSQTEVAATLAYRFGNVTPRVSYAHGFKGSFDATNYNN----DYDQVVVGAEYD 295 Query: 373 LSKRTVAYLNLGYAKNSGRSSQTV 396 SKRT A ++ G+ + S+ V Sbjct: 296 FSKRTSALVSAGWLQEGKGESKFV 319
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.009 Identities = 19/112 (16%), Positives = 40/112 (35%), Gaps = 12/112 (10%) Query: 51 EAAAAGVEASLSKSDLPSPQEIRDILDQYVIGQERAKKILAVAVYNHYKRL-------KH 103 +A+ G L K E+ I+ + + +R L + + + Sbjct: 92 KASEKGAYDYLPKP--FDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEI 149 Query: 104 LDKKDDVELSKSNILLIGPTGSGKTLLAQTLARL---LNVPFVIADATTLTE 152 + + +++ G +G+GK L+A+ L N PFV + + Sbjct: 150 YRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPR 201
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 40.0 bits (93), Expect = 3e-05 Identities = 35/192 (18%), Positives = 73/192 (38%), Gaps = 15/192 (7%) Query: 92 KVLVEGLQRAQALSIEEQETQFSCEVMPLEPDHADSAETEALRRAIVSQFDQYVKLNKKI 151 L + L+ A S + + E A A+ E ++ K + Sbjct: 193 AELEKALEGAMNFSTADSAKIKTLEAE-KAALAARKADLEKALEGAMNFSTADSAKIKTL 251 Query: 152 PPEILTSLSGIDEAGRLADTIAAHLPLKLDQKQHILEMFPVIERLEHLLAQLEAEIDILQ 211 E + E + + + + + LE A LE + +L Sbjct: 252 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAE---KAALEAEKADLEHQSQVLN 308 Query: 212 VEKRIRGRVKRQMEKSQREYYLNEQVKAIQKELGEGEEGAD--LEELEKRINAARMPKEA 269 R ++R ++ S+ +Q++A ++L E + ++ + L + ++A+R EA Sbjct: 309 AN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEASRQSLRRDLDASR---EA 359 Query: 270 KKKADAELKKLK 281 KK+ +AE +KL+ Sbjct: 360 KKQLEAEHQKLE 371
>PF07520#Virulence protein SrfB Length = 1041 Score = 31.9 bits (72), Expect = 0.001 Identities = 13/40 (32%), Positives = 17/40 (42%) Query: 95 LLRSREEQARAEHPMQRIMSIDTGGGATVVTTTDIHLARN 134 + R + A E P R+ ID GGG T + T N Sbjct: 580 KGQPRPDPAGGESPSLRLACIDVGGGTTDLMVTTYRGEDN 619
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 334 bits (859), Expect = e-113 Identities = 128/357 (35%), Positives = 183/357 (51%), Gaps = 40/357 (11%) Query: 127 ERLTTVRSASAKPSGEGLVGGSDAFNAALSALQRVAPSTLPVLLLGESGTGKELFARALH 186 + + G LVG S A L R+ + L +++ GESGTGKEL ARALH Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALH 181 Query: 187 EASARAMGPFVVVDCSGIAETLFESELFGYEKGAFTGATARKPGLVETAQGGTLFLDEIG 246 + R GPFV ++ + I L ESELFG+EKGAFTGA R G E A+GGTLFLDEIG Sbjct: 182 DYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIG 241 Query: 247 DVPLSMQVKLLRLIESGTFRRVGGVEVLRADFRLVAATHKPLKAMIGDGRFRPDLYYRIS 306 D+P+ Q +LLR+++ G + VGG +R+D R+VAAT+K LK I G FR DLYYR++ Sbjct: 242 DMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLN 301 Query: 307 AYPIALPAVRERPGDMPLLVDSILRRIAALGPAAGQRFTVAPDALARLEAYAWPGNIREL 366 P+ LP +R+R D+P LV +++ G +AL ++A+ WPGN+REL Sbjct: 302 VVPLRLPPLRDRAEDIPDLVRHFVQQAEKEG---LDVKRFDQEALELMKAHPWPGNVREL 358 Query: 367 RNVLDRACLLTDDGVIRVEHLPDEVARAGDAREEAGASAK-------------------- 406 N++ R L VI E + +E+ A+A+ Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418 Query: 407 --------------LSDDELARIARA---FVGTRRALAGRVGMSERTLYRRLRALGI 446 L++ E I A G + A +G++ TL +++R LG+ Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 435 bits (1120), Expect = e-138 Identities = 227/1063 (21%), Positives = 422/1063 (39%), Gaps = 76/1063 (7%) Query: 13 LSAWALRHQALVIYLIALSTIAGILAYSRLAQSEDPPFTFRVMVIRTFWPGATARQVQEQ 72 ++ + +R L + +AG LA +L ++ P + + +PGA A+ VQ+ Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 73 VTDRIGRKLQEMPAIDYLRSYS-RPGESLLFFAMKDSAPVKDVPETWYQVRKKVGDISMT 131 VT I + + + + Y+ S S G + + D QV+ K+ + Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSG---TDPDIAQVQVQNKLQLATPL 117 Query: 132 LPPGIQGP-FFNDEFGDVYTNIYTLEGDG--FSPAQLHDYAD-QLRVVLLRVPGVAKVDY 187 LP +Q ++ Y + D + + DY ++ L R+ GV V Sbjct: 118 LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL 177 Query: 188 FGDPDQRIFVEIDNTRLARLGISPQQIAQAINAQNDVSSPGVLTAAHD------RVFIRP 241 FG + + +D L + ++P + + QND + G L I Sbjct: 178 FG-AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIA 236 Query: 242 SGQYESVDAIADTLIRVN--GRTFRLGDLATIKRGYDDPPVTQMRTIGRDAKGRAVLGIG 299 ++++ + +RVN G RL D+A ++ G + + G+ G+G Sbjct: 237 QTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELG------GENYNVIARINGKPAAGLG 290 Query: 300 ITMQPGGDVIRLGKALDASAKALQAQLPAGLTLTEVSSMPHAVSRSVDDFLEAVAEAVAI 359 I + G + + KA+ A LQ P G+ + V S+ + ++ + EA+ + Sbjct: 291 IKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIML 350 Query: 360 VLIVSLVSLG-LRTGMVVVISIPVVLAVTALFMYLFDIGLHKVSLGTLVLALGLLVDDAI 418 V +V + L +R ++ I++PVVL T + F ++ +++ +VLA+GLLVDDAI Sbjct: 351 VFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAI 410 Query: 419 IAVEMMA-VKLEQGFSRARAAAFAYTSTAFPMLTGTLVTVSGFLPIALAKSSTGEYTRSI 477 + VE + V +E A + + ++ +V + F+P+A STG R Sbjct: 411 VVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQF 470 Query: 478 FEVSAIALIASWFAAVVLIPLLGYHMLPERKHPPKDAAAGPPHAPDAAHDHEHGHDIYDT 537 A+ S A++L P L +L + G + DH Sbjct: 471 SITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHS-------V 523 Query: 538 RFYTRLRGWIKWCIERRFAVLAITIALFVVALAGFSLVPQQFFPSSDRPELLVDLRLPEG 597 YT G I + L I + + F +P F P D+ L ++LP G Sbjct: 524 NHYTNSVGKI---LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAG 580 Query: 598 ASFDATLKQAERLEKLIAN--RPEIDHAVNFVGSGAPRFYLPLDQQLQLPNFAQFVITAK 655 A+ + T K +++ + ++ G Q N ++ K Sbjct: 581 ATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSG---------QAQNAGMAFVSLK 631 Query: 656 SVEERDKLSAWLEPVLRDQFTAART------------RISRLENGPPVGYPVQFRVSGDS 703 EER+ E V+ I L + + + +G Sbjct: 632 PWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQ-AGLG 690 Query: 704 IATVRAISEKVAATMR---ADTRATNVQFDWDEPAERSVRFELDQHKARELNVSSQDVAS 760 + ++ A + D + E+DQ KA+ L VS D+ Sbjct: 691 HDALTQARNQLLGMAAQHPASLVSVRPNGLEDTA---QFKLEVDQEKAQALGVSLSDINQ 747 Query: 761 FLAMTLSGTTVTQYRERDKLIAVDLRAPRAQRVDPANLANLAMPTPNG-PVPLGSLGRFH 819 ++ L GT V + +R ++ + ++A R+ P ++ L + + NG VP + H Sbjct: 748 TISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSH 807 Query: 820 DTLEYGVVWERDRQPTITVQSDVIAGAQGIDVTHAIDAKLNALRAQLPVGYRIEIGGSVE 879 + + P++ +Q + G + A + L ++LP G + G Sbjct: 808 WVYGSPRLERYNGLPSMEIQGEAAPG----TSSGDAMALMENLASKLPAGIGYDWTGMSY 863 Query: 880 ESAKGQTSINAQMPLMAIAVLTLLMIQLQSFSRVLMVVLTAPLGMIGVVGTLLLFGKPFG 939 + A + + + V L +S+S + V+L PLG++GV+ LF + Sbjct: 864 QERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKND 923 Query: 940 FVAMLGVIAMFGIIMRNSVILVDQIEQDIAA-GHGRFDAIVGATVRRFRPITLTAAAAVL 998 M+G++ G+ +N++++V+ + + G G +A + A R RPI +T+ A +L Sbjct: 924 VYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFIL 983 Query: 999 ALIPLLRSNFFG-----PMATALMGGITSATVLTLFFLPALYA 1036 ++PL SN G + +MGG+ SAT+L +FF+P + Sbjct: 984 GVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFV 1026 Score = 80.3 bits (198), Expect = 2e-17 Identities = 58/332 (17%), Positives = 121/332 (36%), Gaps = 27/332 (8%) Query: 735 AERSVRFELDQHKARELNVSSQDVASFL--------AMTLSGTTVTQYRERDKLIAVDLR 786 A+ ++R LD + ++ DV + L A L GT ++ + I R Sbjct: 180 AQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 787 APRAQRVDPANLANLAMPTPNG-PVPLGSLGRFHDTLE-YGVVWERDRQPTITVQSDVIA 844 + +G V L + R E Y V+ + +P + + Sbjct: 240 FKNPEEFGK----VTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLAT 295 Query: 845 GAQGIDVTHAIDAKLNALRAQLPVGYRIEI----GGSVEESAKGQTSINAQMPLMAIAVL 900 GA +D AI AKL L+ P G ++ V+ S + + + L Sbjct: 296 GANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSI--HEVVKTLFEAIMLVFL 353 Query: 901 TLLMIQLQSFSRVLMVVLTAPLGMIGVVGTLLLFGKPFGFVAMLGVIAMFGIIMRNSVIL 960 + + LQ+ L+ + P+ ++G L FG + M G++ G+++ +++++ Sbjct: 354 VMYLF-LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVV 412 Query: 961 VDQIEQDIAAGHGRF-DAIVGATVRRFRPITLTAAAAVLALIPLL-----RSNFFGPMAT 1014 V+ +E+ + +A + + + A IP+ + + Sbjct: 413 VENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSI 472 Query: 1015 ALMGGITSATVLTLFFLPALYAAWFRVKPDER 1046 ++ + + ++ L PAL A + E Sbjct: 473 TIVSAMALSVLVALILTPALCATLLKPVSAEH 504
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 42.1 bits (99), Expect = 2e-06 Identities = 22/116 (18%), Positives = 39/116 (33%), Gaps = 15/116 (12%) Query: 66 IAGKIVER-KVRLGDAVKKGQVLALLDTSDVAKNAASAQAQLDAATHALTFAQ---QQRE 121 I IV+ V+ G++V+KG VL L + Q+ L A T Q + E Sbjct: 102 IENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIE 161 Query: 122 RDR-----------AQARENLIAPAQLEQTENAYAAARAQRDQAEQQLALAKNQLQ 166 ++ Q + ++ + Q+ Q E L + + Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217 Score = 35.2 bits (81), Expect = 4e-04 Identities = 10/71 (14%), Positives = 28/71 (39%) Query: 100 ASAQAQLDAATHALTFAQQQRERDRAQARENLIAPAQLEQTENAYAAARAQRDQAEQQLA 159 + A+++ + + + + + + IA + + EN Y A + + QL Sbjct: 217 LTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLE 276 Query: 160 LAKNQLQYATL 170 ++++ A Sbjct: 277 QIESEILSAKE 287
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 61.2 bits (148), Expect = 9e-14 Identities = 24/71 (33%), Positives = 43/71 (60%), Gaps = 1/71 (1%) Query: 5 RLTREQSKDLTRERLLSAAHATFTKKGYVATSVEDIASAAGYTRGAFYSNFRSKAELLLE 64 R T++++++ TR+ +L A F+++G +TS+ +IA AAG TRGA Y +F+ K++L E Sbjct: 3 RKTKQEAQE-TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61 Query: 65 LLRRDHEEAEA 75 + Sbjct: 62 IWELSESNIGE 72
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 36.0 bits (83), Expect = 2e-04 Identities = 61/378 (16%), Positives = 128/378 (33%), Gaps = 37/378 (9%) Query: 35 AAAGINQDLGISKGLSSLIGALFFLGYFFFQIPGAIYAERRSVKTLVFWSLVLWGACASL 94 + I D ++ + F L + +++ +K L+ + +++ S+ Sbjct: 36 SLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCF-GSV 94 Query: 95 TGIV--SNIPSLMAIRFLLGVVEAAVMPAMLIFISNWFTKRERSRANTFLILGNPVTVLW 152 G V S L+ RF+ G AA +++ ++ + K R +A + + Sbjct: 95 IGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGV 154 Query: 153 MSVVSGYLVHEFGWRHMFVAEGLPAIVWAVCWWFLVQDKPAQAKWLTESDKRDLDAALAA 212 + G + H W ++ L ++ + FL++ + + D + + Sbjct: 155 GPAIGGMIAHYIHWSYLL----LIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVG 210 Query: 213 EQAALKPVRNYRDAFRSPAVV----------------------------KLCAQYFCWSI 244 + +Y +F +V+ Sbjct: 211 IVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFG 270 Query: 245 GVYGFVLWLPSIVKNGSALGMVETGWLSALP-YLAATIAMLAASWASDRLGSRKGFVWPF 303 V GFV +P ++K+ L E G + P ++ I DR G Sbjct: 271 TVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGV 330 Query: 304 LLIGAAAFAASYALGSTHFWLSYALLVVAGAAMYAPYGPFFAIVPELLPKNVSGGAMALI 363 + + AS+ L +T ++++ ++ V G + IV L + +G M+L+ Sbjct: 331 TFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKT-VISTIVSSSLKQQEAGAGMSLL 389 Query: 364 NSMGALGSFVGSYVVGYL 381 N L G +VG L Sbjct: 390 NFTSFLSEGTGIAIVGGL 407
>TACYTOLYSIN#Bacterial thiol-activated pore-forming cytolysin signature. Length = 574 Score = 29.6 bits (66), Expect = 0.021 Identities = 25/119 (21%), Positives = 52/119 (43%), Gaps = 17/119 (14%) Query: 42 IDDLRVERWSRDRDYFVAAATKLGAKVSVQSADASEERQISQIENLISRGVDVIVIVPFN 101 ID+L V +W + + L A+ + + QI N+ S+ +D + + F Sbjct: 232 IDNL-VNQWHDN----YSGGNTLPARTQYTESMVYSKSQIEAALNVNSKILDGTLGIDFK 286 Query: 102 SKTLGNVVAEAKRAGIKIVSYDRLILDADVDAYIS----FD-NVKVGELQARGVYDAKP 155 S +++ ++ + I +Y ++ + + FD +V + ELQ +GV + P Sbjct: 287 S------ISKGEKK-VMIAAYKQIFYTVSANLPNNPADVFDKSVTLKELQRKGVSNEAP 338
>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature. Length = 483 Score = 28.9 bits (64), Expect = 0.040 Identities = 10/22 (45%), Positives = 14/22 (63%) Query: 293 GDDFRDELNIKNFPNGLRFDIL 314 G D RDEL+ + P+G F I+ Sbjct: 278 GGDLRDELSEEQIPDGNNFHIV 299
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 89.7 bits (222), Expect = 2e-23 Identities = 53/187 (28%), Positives = 82/187 (43%), Gaps = 6/187 (3%) Query: 1 MTGKRILVTGAGSGFGREVALRLAAKGHHVIAGVQIAPQVTELNAEAARRGAALDAVKLD 60 + GK +TGA G G VA LA++G H+ A ++ ++ + +A D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 61 VT-SARDRAQAARWD-----IDVLLNNAGAGEAGALADLPVDIVRELFETNVFGPLELTQ 114 V SA AR + ID+L+N AG G + L + F N G ++ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 115 QVARGMIARGRGRIVFVSSIAGLITGAYTGAYCASKHAVEAIAEAMHAELAVHGIQIAVV 174 V++ M+ R G IV V S + AY +SK A + + ELA + I+ +V Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 175 NPGPYST 181 +PG T Sbjct: 186 SPGSTET 192
>LCRVANTIGEN#Low calcium response V antigen signature. Length = 326 Score = 32.3 bits (73), Expect = 0.002 Identities = 16/58 (27%), Positives = 26/58 (44%), Gaps = 5/58 (8%) Query: 46 RAELVVNTAELDLDAIVALLVQAHGKGQDVARVHSG-----DPSLYGAIGEQIRRLAA 98 R EL TAEL + +++ + H +H D +LYG E+I + +A Sbjct: 154 REELAELTAELKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYTDEEIFKASA 211
>OMADHESIN#Yersinia outer membrane adhesin signature. Length = 455 Score = 29.9 bits (66), Expect = 0.025 Identities = 25/63 (39%), Positives = 28/63 (44%) Query: 249 ADGATPAAIAGALAARGFGPSAMTVFEHLGGPLERRADARADAWGDARAAALNVVAIECR 308 A GAT A GA A G G A V GPL + A +G A A + VAI R Sbjct: 74 AIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGAR 133 Query: 309 ASA 311 AS Sbjct: 134 AST 136
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 43.7 bits (103), Expect = 9e-07 Identities = 40/176 (22%), Positives = 64/176 (36%), Gaps = 24/176 (13%) Query: 3 AAYPFSALIGQ-AALQQALLLVA-VDPGLGGVLVAGPRGTAKSTAARALAELLP--EGRF 58 + L+G+ AA+Q+ ++A + +++ G GT K ARAL + G F Sbjct: 132 DSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPF 191 Query: 59 VTLPLSATDEQVTGSLDLASAL-------ADNAVRFSPGLVARAHLGVLYVDEINLLPDA 111 V + ++A + + S L A S G +A G L++DEI +P Sbjct: 192 VAINMAAIPRDL-----IESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMD 246 Query: 112 LVDALLDAAASGVNTVERDGVSHSHAARFALVGTMNP------EEGELRPQLLDRF 161 LL G G + +V N +G R L R Sbjct: 247 AQTRLLRVLQQG--EYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 117 bits (295), Expect = 3e-32 Identities = 80/264 (30%), Positives = 116/264 (43%), Gaps = 15/264 (5%) Query: 136 PARIVVLEFMFAEDLAALDITPVGMADPAYYPIWIGYEDARLARVPDVGTRQEPSLEAIA 195 P RIV LE++ E L AL I P G+AD Y +W+ E V DVG R EP+LE + Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVS-EPPLPDSVIDVGLRTEPNLELLT 93 Query: 196 ATKPALILGVGLRHAPIFDALSRIAPTVLFKYSPNYVEDGRQVTQYDWARAILRTIGCLT 255 KP+ ++ + P + L+RIAP F +S DG+Q AR L + L Sbjct: 94 EMKPSFMVW-SAGYGPSPEMLARIAPGRGFNFS-----DGKQ--PLAMARKSLTEMADLL 145 Query: 256 GRERAARAVQARVDAGLARDARRIAAAGRAGERVAWLQELGLPDRYWAFTGNSASAGIAR 315 + AA A+ + + R + G R L L P F NS I Sbjct: 146 NLQSAAETHLAQYEDFIRSMKPRFV---KRGARPLLLTTLIDPRHMLVFGPNSLFQEILD 202 Query: 316 ALGLE-PWPGEPTREGTAYVTSEDLLKQPDLAVLFVSATAPGVPLDAKLDSRIWRFVPAR 374 G+ W GE G+ V+ + L D+ VL +DA + + +W+ +P Sbjct: 203 EYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKD-MDALMATPLWQAMPFV 261 Query: 375 RAGRVALVERNIWGFGGPMSALRL 398 RAGR V +W +G +SA+ Sbjct: 262 RAGRFQRVP-AVWFYGATLSAMHF 284
>2FE2SRDCTASE#Ferric iron reductase signature. Length = 262 Score = 60.8 bits (147), Expect = 5e-13 Identities = 49/182 (26%), Positives = 74/182 (40%), Gaps = 16/182 (8%) Query: 74 RALVSQWSKYYFNLAASAGFAAALLLGRPLDMTPSRMRVAL-RAGMPVALIFDADALRPA 132 + L+S W+++Y L A L + LD++P G D + A Sbjct: 89 KPLISLWAQWYIGLMVPPLMLALLTQEKALDVSPEHFHAEFHETGRVACFWVDVCEDKNA 148 Query: 133 -QAEPAPRYAALVDH-LRATIEALAALAKLSPRVLWANAGNLLD-YLFE---QCADAPRA 186 P R L+ L ++AL A +++ +++W+N G L++ YL E +A Sbjct: 149 TPHSPQHRMETLISQALVPVVQALEATGEINGKLIWSNTGYLINWYLTEMKQLLGEATVE 208 Query: 187 AADAAWLFGPVDAHGEANPLRLPVRRVKPCSARLPDPFRARRVCCLRNEIPGEDQLCGSC 246 + A F +GE NPL V L D RR CC R +P Q CG C Sbjct: 209 SLRHALFFEKTLTNGEDNPLWRTV--------VLRDGLLVRRTCCQRYRLPDVQQ-CGDC 259 Query: 247 PL 248 L Sbjct: 260 TL 261
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.041 Identities = 12/23 (52%), Positives = 13/23 (56%) Query: 36 VTALCGPNGCGKSTLLRTLAGLQ 58 L G G GKSTL+ TL GL Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 118 bits (296), Expect = 2e-34 Identities = 79/252 (31%), Positives = 117/252 (46%), Gaps = 15/252 (5%) Query: 9 GRSFLVTGASSGIGRAAVVALRGCGARVVAAARNVRELDRLAGETGC-----EPLELDVG 63 G+ +TGA+ GIG A L GA + A N +L+++ E DV Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 64 RDASVSAAFSG-ERMRDAFDGLVNCAGVTSLAAAIDATADEFDRVMAVNARGAMLVARHV 122 A++ + ER D LVN AGV + +E++ +VN+ G +R V Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 123 ARAMISAGRGGSIVNVSSQAALVALPSHLAYCASKAALDAMTRVLCVELGPHGVRVNSVN 182 ++ M R GSIV V S A V S AY +SKAA T+ L +EL + +R N V+ Sbjct: 128 SKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 183 PTVTLTPMAERAWSDPDASGPMLA--------AIPLGRFASVADVVGPILFLLSDAAAMV 234 P T T M W+D + + ++ IPL + A +D+ +LFL+S A + Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 235 SGVALPVDGGYT 246 + L VDGG T Sbjct: 247 TMHNLCVDGGAT 258
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.3 bits (71), Expect = 0.011 Identities = 18/104 (17%), Positives = 34/104 (32%), Gaps = 2/104 (1%) Query: 409 APRLTLPIFAGGRNRANLDVADARKHIAVAEYEKTIQTAFREV--ADALAARDQIDAQLA 466 P L LP +N + +V I Q +E+ A R + A++ Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARIN 224 Query: 467 AQQAVYGADAERLRLAERRYGSGVASYLELLDAQRSTFESGQEL 510 + + + RL + +L+ + E+ EL Sbjct: 225 RYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNEL 268
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1077 bits (2786), Expect = 0.0 Identities = 518/1030 (50%), Positives = 702/1030 (68%), Gaps = 6/1030 (0%) Query: 1 MARFFIDRPVFAWVISLFIMLGGIFAIRALPVAQYPDIAPPVVSLYATYPGASAQVVEES 60 MA FFI RP+FAWV+++ +M+ G AI LPVAQYP IAPP VS+ A YPGA AQ V+++ Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTAVIEREMNGVPGLLYTSATS-SAGQASLSLTFKQGVSADLAAVDVQNRLKTVEARLPE 119 VT VIE+ MNG+ L+Y S+TS SAG +++LTF+ G D+A V VQN+L+ LP+ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 120 PVRRDGISIEKAADNAQIIVSLTSEDGRLSGVELGEYASANVLQALRRVEGVGKVQFWGA 179 V++ GIS+EK++ + ++ S++ + ++ +Y ++NV L R+ GVG VQ +GA Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 180 EYAMRIWPDPVKMAALGLTASDIASAVRAHNARVTIGDVGRSAVPDSAPIAATVLADAPL 239 +YAMRIW D + LT D+ + ++ N ++ G +G + + A+++A Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 240 TTPDAFGAIALRARADGSTLYLRDVARIEFGGNDYNYPSFVNGKTATGMGIKLAPGSNAV 299 P+ FG + LR +DGS + L+DVAR+E GG +YN + +NGK A G+GIKLA G+NA+ Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 300 ATEKRVRATMDELAKFFPPGVKYQIPYETASFVRVSMSKVVTTLVEAGVLVFAVMFLFMQ 359 T K ++A + EL FFP G+K PY+T FV++S+ +VV TL EA +LVF VM+LF+Q Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 360 NFRATLIPTLVVPVALLGTFGAMLAAGFSINVLTMFGMVLAIGILVDDAIVVVENVERLM 419 N RATLIPT+ VPV LLGTF + A G+SIN LTMFGMVLAIG+LVDDAIVVVENVER+M Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 420 VEEKLPPYEATVKAMKQISGAIVGITVVLTSVFVPMAFFGGAVGNIYRQFAFALAVSIGF 479 +E+KLPP EAT K+M QI GA+VGI +VL++VF+PMAFFGG+ G IYRQF+ + ++ Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 480 SAFLALSLTPALCATLLKPVADDHHE-KDGFFGWFNRFVARSTHRYTQRVGRVLKRPLRW 538 S +AL LTPALCATLLKPV+ +HHE K GFFGWFN S + YT VG++L R+ Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 539 LVVYGALTAAAALLITKLPAAFLPDEDQGNFMVMVIRPQGTPLAETMQSVRRVEEYVRTH 598 L++Y + A +L +LP++FLP+EDQG F+ M+ P G T + + +V +Y + Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 599 SPSAY--TFALGGYNLYGEGPNGGMIFVTMKDWKERKRAQDQVQAIIAGINAHFAGTPNT 656 + F + G++ G+ N GM FV++K W+ER ++ +A+I + Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 657 MVFAINMPALPDLGLTGGFDFRLQDRGGLGYGAFVAAREKLLADGRKDPV-LTDLMFAGT 715 V NMPA+ +LG GFDF L D+ GLG+ A AR +LL + P L + G Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 716 QDAPQLKLDIDRAKASALGVSMEEINATLAVMFGSDYIGDFMHGSQVRRVIVQADGQHRL 775 +D Q KL++D+ KA ALGVS+ +IN T++ G Y+ DF+ +V+++ VQAD + R+ Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 776 DPGDVTKLRVRNAKGEMVPLAAFATLHWTMGPPQLTRYNGFPSFTINGAASAGHSSGEAM 835 P DV KL VR+A GEMVP +AF T HW G P+L RYNG PS I G A+ G SSG+AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 836 AAIERIASALPAGIGYAWSGQSYEERLSGAQAPMLFALSVLVVFLALAALYESWSIPFAV 895 A +E +AS LPAGIGY W+G SY+ERLSG QAP L A+S +VVFL LAALYESWSIP +V Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 896 MLVVPLGVIGAVAGVTLRGMPNDIYFKVGLIATIGLSAKNAILIVEVAKDL-VAQGMSLA 954 MLVVPLG++G + TL ND+YF VGL+ TIGLSAKNAILIVE AKDL +G + Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 955 DAALEAARLRLRPIVMTSLAFGVGVLPLAFATGAASGAQIAIGTGVLGGVISATLFAIFL 1014 +A L A R+RLRPI+MTSLAF +GVLPLA + GA SGAQ A+G GV+GG++SATL AIF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1015 VPLFFVCVGR 1024 VP+FFV + R Sbjct: 1021 VPVFFVVIRR 1030
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 39.0 bits (91), Expect = 2e-05 Identities = 18/133 (13%), Positives = 41/133 (30%), Gaps = 5/133 (3%) Query: 67 EVRARVAGIVTARTYEEGQEVKRGAVLFRIDPAPFKAARDAAAGALEKAQAAHLAALDKR 126 E++ IV +EG+ V++G VL ++ +A +L +A+ Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157 Query: 127 RRYDELVRDRAVSERDHTEALADERQAKAAVASARAELA-----RAQLQLDYATVTSPID 181 R + + E + + + + + + Q +L+ + Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217 Query: 182 GRARRALVTEGAL 194 R E Sbjct: 218 TVLARINRYENLS 230 Score = 34.4 bits (79), Expect = 8e-04 Identities = 17/100 (17%), Positives = 39/100 (39%), Gaps = 10/100 (10%) Query: 102 KAARDAAAGALEKAQAAHLAALDKRRRYDELVRDRAVSERDHTEALADERQAKAAVASAR 161 LE+ ++ L+A ++ + +L + E L RQ + Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFK---------NEILDKLRQTTDNIGLLT 315 Query: 162 AELARAQLQLDYATVTSPIDGR-ARRALVTEGALVGQDQA 200 ELA+ + + + + +P+ + + + TEG +V + Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 118 bits (297), Expect = 2e-35 Identities = 55/211 (26%), Positives = 100/211 (47%), Gaps = 6/211 (2%) Query: 1 MARKTREESLNTKNRILDAAELVLLERGVGQTAMADIAEAAGMSRGAVYGHFKGKIEVCV 60 MARKT++E+ T+ ILD A + ++GV T++ +IA+AAG++RGA+Y HFK K ++ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 AVCDRAFSRAAEGFDLSDERPA---LATLRLAASHYLHQCGEPGSMQRVLEILYMKCEHS 117 + + + S E + L+ LR H L + ++EI++ KCE Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 118 EENAPLMRRRTLYELQTLRIVKALLRRAVAAGELDASLDVHLAGVYLLSLLEGIFGSMMW 177 E A + + + L++ ++ L+ + A L A L A + + + G+ + W Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN--W 178 Query: 178 SARL-RGDRWRDAEAMLDAGVDTLRASPALR 207 D ++A + ++ P LR Sbjct: 179 LFAPQSFDLKKEARDYVAILLEMYLLCPTLR 209
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 29.4 bits (65), Expect = 0.022 Identities = 28/132 (21%), Positives = 47/132 (35%), Gaps = 4/132 (3%) Query: 41 RVATARNELQNAADAAALAGAASLESSPGAPAWAAAASAASAALSLNASDGATLASGVVQ 100 A A+ + + A A AA+ + P + A A+ + + GA + + Sbjct: 226 AAAEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGRGL---IQVAQGAASLAQAIS 282 Query: 101 TGYWNVTGAPAGLEPTTLAPGAYDVPAVQTTVTRATNQNGGPLSLLMGGFLGILGTPAAA 160 V G P+ +A G + T + +Q + +G LG P + Sbjct: 283 DAI-AVLGRVLASAPSVMAVGFASLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSV 341 Query: 161 TAVAVAAAPSTV 172 AVA A TV Sbjct: 342 NLNAVAKASGTV 353
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 28.7 bits (64), Expect = 0.018 Identities = 19/83 (22%), Positives = 32/83 (38%) Query: 42 NVAESALAAGNAELAATLFERALKADPRSLPARVGLGDAMYQTGELARAGVLYAQAAAAA 101 ++A + +G E A +F+ D +GLG G+ A Y+ A Sbjct: 41 SLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMD 100 Query: 102 PDDPRAQLGLARVALRERHLDDA 124 +PR A L++ L +A Sbjct: 101 IKEPRFPFHAAECLLQKGELAEA 123
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 0.027 Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 4/50 (8%) Query: 294 IVISGGTGSGKTTLLNAL---SHFIDSHERIVTIEDAAELQLQQPHVVSL 340 +V+ G G GK+TL+N L F D+H I T +D+ E Q+ L Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYE-QIAGIVAYEL 647
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 36.3 bits (84), Expect = 2e-04 Identities = 29/165 (17%), Positives = 49/165 (29%), Gaps = 20/165 (12%) Query: 16 GARLIAIVADAASDEVIRNLIVDQAMTGAHVARGGIDDAIALMRDLPHGPQHLLVDVSGA 75 GA ++ DAA V+ + + ++ DV Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRIT--SNAATLWRWIAA--GDGDLVVTDVV-- 56 Query: 76 AMP----LSDLARLADVCDPSVNVIVVGEHNDVGLFRSMLRVGVRDYLVKPL----TVEL 127 MP L R+ P + V+V+ N G DYL KP + + Sbjct: 57 -MPDENAFDLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114 Query: 128 VHRALSAADPNAAARTGKAIGFVGARGGVGVTSIAVALARHLADR 172 + RAL+ + + + G S A+ + R Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVG----RSAAMQEIYRVLAR 155
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 145 bits (366), Expect = 2e-39 Identities = 68/283 (24%), Positives = 116/283 (40%), Gaps = 16/283 (5%) Query: 180 VVQTLKPYLRQQESLVNRLTLARPIQVHLRVRITEVDRNITQQLGINWSALGA------- 232 +V + E ++ +L + RP QV + I EV LGI W+ A Sbjct: 322 IVTAAPDVMNDLERVIAQLDIRRP-QVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTN 380 Query: 233 SGNFVGGLFNGRTLFDTASKAFDLSPSGAFSVVGGFHTSHYSIDG--VLDALDQEGLITM 290 SG + G ++ S A S G Y + +L AL + Sbjct: 381 SGLPISTAIAGANQYNKDGTVSSSLAS-ALSSFNGIAAGFYQGNWAMLLTALSSSTKNDI 439 Query: 291 LAEPNLTAISGQTASFLAGGEFPIPVAQDTTGA----ITIQFKPYGVSLDFTPTVLADNR 346 LA P++ + A+F G E P+ TT T++ K G+ L P + + Sbjct: 440 LATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDS 499 Query: 347 ISLKVRPEVSEIDPTNSVTTGSIKVPALTVRRVDTTVELSSGQSFAIGGLLQSKSSDVLA 406 + L++ EVS + S T+ + R V+ V + SG++ +GGLL SD Sbjct: 500 VLLEIEQEVSSVADAASSTSSDLGA-TFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTAD 558 Query: 407 ELPGLARLPVLGKLFSSRNYLNDKTEVVVIVTPYIVQPANPGE 449 ++P L +PV+G LF S + K +++ + P +++ + Sbjct: 559 KVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYR 601
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 30.9 bits (70), Expect = 0.002 Identities = 33/146 (22%), Positives = 58/146 (39%), Gaps = 14/146 (9%) Query: 9 IVASWTLASLALADLRTRRLA---TFAVALVGALYGVQALAGAPGD---GGFAPHAAIGA 62 ++ +W L +L DL L T + G L+ + + GD G A + + + Sbjct: 138 LLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLVLWS 197 Query: 63 IAFAFGAAMFRIGWIAGGDVKLAAVVFLWAGPAHAWPVAFAIGVGGLAVGVVCIAARRAP 122 + +AF + + GD KL A + W G V + G +G+ I R Sbjct: 198 LYWAFKL-LTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRNH- 255 Query: 123 RALAWFAPARGVPYGVALAAGGVLAV 148 ++ +P+G LA G +A+ Sbjct: 256 ------HQSKPIPFGPYLAIAGWIAL 275
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.0 bits (65), Expect = 0.034 Identities = 56/306 (18%), Positives = 102/306 (33%), Gaps = 53/306 (17%) Query: 57 TTQLLNTAGVFAAGF-LMRPIGGWLFGRIADKHGRRAAMMISVLMMCGGSLVIAVLPTYA 115 + + G+ A + LM+ + G ++D+ GRR +++S+ ++A P Sbjct: 38 SNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW 97 Query: 116 QIGAFAPLLLLVARLFQGLSVGGEYGTSATYMSEVALKGRR----GFF-ASFQYVTLIGG 170 +L + R+ G++ G + Y++++ R GF A F + + G Sbjct: 98 --------VLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGP 148 Query: 171 QLCALLVLVILQQTLSTAELKAWGWRIPFVVGAAAALIS-----LYLRKSLDETSTSESR 225 L L+ PF AA ++ L +S R Sbjct: 149 VLGGLM-----------GGFSP---HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRR 194 Query: 226 KAKDAGT-IRGVWQHKG-AFLTVIGFTAGGSLIFYTFTTYMQKYLVNTAGMHAKTASNVM 283 +A + R A L + F L+ + + A T + Sbjct: 195 EALNPLASFRWARGMTVVAALMAVFFIM--QLVGQVPAALWVIFGEDRFHWDATTIGISL 252 Query: 284 TA-----ALFVYMLMQPVFGALSDKIGRRMSMILFGTG----AVIGTVP------LMNAL 328 A +L M+ PV L ++ + MI GTG A ++ A Sbjct: 253 AAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLAS 312 Query: 329 GGVTSP 334 GG+ P Sbjct: 313 GGIGMP 318
>AEROLYSIN#Aerolysin signature. Length = 493 Score = 31.2 bits (70), Expect = 0.015 Identities = 22/76 (28%), Positives = 37/76 (48%), Gaps = 2/76 (2%) Query: 250 GLAKMQASLADTVRSVRVGSESIATAARQIAAGNIDLSSRTEQQAAALEETASSMEELTG 309 GL+ MQ +LA +R VR G +A Q AGNI++ + A + A S++ Sbjct: 404 GLSTMQNNLARVLRPVRAGITGDFSAESQF-AGNIEIGAPVPLAADSKVRRARSVDGAGQ 462 Query: 310 TVQRNAD-NARQASAL 324 ++ +A++ S L Sbjct: 463 GLRLEIPLDAQELSGL 478
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 293 bits (751), Expect = 7e-97 Identities = 128/461 (27%), Positives = 199/461 (43%), Gaps = 53/461 (11%) Query: 4 FDVEVIRADNEELSAERTAMRPSLAIISVSMIE-SGAAFLRTWQA-DIGMPVVWVGA--- 58 +DV + ++ L A L + V M + + L + +PV+ + A Sbjct: 28 YDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNT 86 Query: 59 -----------ARDHDPSLYPPEYSHILPLDFTCAELRGMISKLAVQLRAHAAKTLEPST 107 A D+ P P + + ++ + +++ + + + Sbjct: 87 FMTAIKASEKGAYDYLPK--PFDLTELIGIIGRA------LAEPKRRPSKLEDDSQDGMP 138 Query: 108 LVAHSDCMQALLLEVDTFADCDTNVLLHGETGVGKERIAQLLHEKHSRYSMGEFVPVNCG 167 LV S MQ + + D +++ GE+G GKE +A+ LH+ + + G FV +N Sbjct: 139 LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD-YGKRRNGPFVAINMA 197 Query: 168 AIPDGLFESLFFGHAKGSFTGAVGTHKGYFEQAAGGTLFLDEVGDLPLYQQVKLLRVLED 227 AIP L ES FGH KG+FTGA G FEQA GGTLFLDE+GD+P+ Q +LLRVL+ Sbjct: 198 AIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQ 257 Query: 228 GAVLRIGATAPVKVDFRLVAASNKKLPQLVKDGLFRADLYYRLAVIELSIPSLEERGPVD 287 G +G P++ D R+VAA+NK L Q + GLFR DLYYRL V+ L +P L +R D Sbjct: 258 GEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDR-AED 316 Query: 288 KIALFKSFVASIVGEDRLAALPELPYWLAEAVADSYFPGNVRELRNLAERVGV------- 340 L + FV E + E + +PGNVREL NL R+ Sbjct: 317 IPDLVRHFVQQAEKEGL--DVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVI 374 Query: 341 -----------------TVRQTGGWDTARLQRLVAHARSAAQPVPAESAPDVFVDRSKWD 383 + + + + V ++ P + Sbjct: 375 TREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLA 434 Query: 384 MAERNRVIAALDANGWRRQDTAQHLGISRKVLWEKMRKYQI 424 E ++AAL A + A LG++R L +K+R+ + Sbjct: 435 EMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 38.3 bits (89), Expect = 5e-05 Identities = 11/63 (17%), Positives = 26/63 (41%) Query: 79 AALRVSHPGLPIVALGSLGEPESALAALRAGVRDFIDFSAPAEDALRITRGLLDHVGDQP 138 ++ + P LP++ + + +A+ A G D++ + + I L +P Sbjct: 67 PRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRP 126 Query: 139 SRH 141 S+ Sbjct: 127 SKL 129
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 138 bits (349), Expect = 1e-37 Identities = 58/249 (23%), Positives = 112/249 (44%), Gaps = 11/249 (4%) Query: 160 VQVDVRVVEFSRSVLKQAGLNFFKQSNGFTFGSFAPAGLASVTGGG----TSSMSVSANI 215 V V+ + E + G+ + ++ G T + + +++ G S+ Sbjct: 347 VLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLA 406 Query: 216 PIASAFN-LVVGSATRGLFADLSILEANNLARVLAQPTLVALSGQSASFLAGGEIPVPVP 274 S+FN + G L+ L ++ +LA P++V L A+F G E+PV Sbjct: 407 SALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466 Query: 275 QSLGT-----ISIDWKPYGVGLTLTPTVLSPRRIALKVAPESSQLDFVHSITINGVTVPA 329 + +++ K G+ L + P + + L++ E S + S + + Sbjct: 467 SQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAAS-STSSDLGAT 525 Query: 330 LTTRRADTTVELGDGESFVIGGLIDRETTSNVDKVPFLGDLPIIGTFFKHLSYQQNDKEL 389 TR + V +G GE+ V+GGL+D+ + DKVP LGD+P+IG F+ S + + + L Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585 Query: 390 VIIVTPHLV 398 ++ + P ++ Sbjct: 586 MLFIRPTVI 594
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 54.0 bits (130), Expect = 2e-11 Identities = 32/124 (25%), Positives = 53/124 (42%), Gaps = 10/124 (8%) Query: 4 LFSIGFFFAWAAAVAIADCRDRRIPNELVLAGLAAVIIFTVCRQNPFETTLVGALIGGAV 63 + A+ D +P++L L L ++F + F +L A+IG Sbjct: 134 TLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNL--LGGF-VSLGDAVIGAMA 190 Query: 64 GLVSLFPFFAL-------RLMGAADVKVFAVLGAWCGLPALPRLWIVASVAAGIHALGLL 116 G + L+ + MG D K+ A LGAW G ALP + +++S+ +GL+ Sbjct: 191 GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLI 250 Query: 117 LLTR 120 LL Sbjct: 251 LLRN 254
>PERTACTIN#Pertactin signature. Length = 922 Score = 40.9 bits (95), Expect = 1e-05 Identities = 22/53 (41%), Positives = 32/53 (60%), Gaps = 4/53 (7%) Query: 427 EPPPDVEPPPEVEPPPEVEPPPPDRPPVEPEPPVPPEPEPLVPPEPPEPEPPS 479 + PP +P P+ P P +PP P +PP P+PP PP+ + PE P P+PP+ Sbjct: 566 KAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQ----PEAPAPQPPA 614 Score = 37.4 bits (86), Expect = 2e-04 Identities = 18/48 (37%), Positives = 25/48 (52%) Query: 433 EPPPEVEPPPEVEPPPPDRPPVEPEPPVPPEPEPLVPPEPPEPEPPSP 480 + PP +P P+ P P +PP P+PP PP+P +P P P P Sbjct: 566 KAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613 Score = 36.2 bits (83), Expect = 4e-04 Identities = 20/63 (31%), Positives = 25/63 (39%) Query: 454 VEPEPPVPPEPEPLVPPEPPEPEPPSPVVEIALPPPEPEPSSPLLIVPEPPHAERESMAA 513 V + P P+P P P+P P P PP+P P P+PP S AA Sbjct: 563 VGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSAAA 622 Query: 514 TVA 516 A Sbjct: 623 NAA 625 Score = 34.7 bits (79), Expect = 0.001 Identities = 21/59 (35%), Positives = 25/59 (42%), Gaps = 1/59 (1%) Query: 461 PPEPEPLVPPEPPEPEPPSPVVEIALPPPEPEPSSPLLIVPEP-PHAERESMAATVAAI 518 PP P+P P P P + PP P+P P P P A RE AA AA+ Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSAAANAAV 626 Score = 32.4 bits (73), Expect = 0.006 Identities = 16/41 (39%), Positives = 18/41 (43%) Query: 424 PEVEPPPDVEPPPEVEPPPEVEPPPPDRPPVEPEPPVPPEP 464 P +P P P P P P P PP P +PE P P P Sbjct: 573 PAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPP 613
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.7 bits (66), Expect = 0.013 Identities = 15/42 (35%), Positives = 19/42 (45%), Gaps = 7/42 (16%) Query: 41 LLGDNGAGKSTLIKTLAGVHQPSEGQYLVDGKPVLFDSPKDA 82 L G G GKSTLI TL G+ + D + KD+ Sbjct: 601 LEGTGGIGKSTLINTLVGL------DFFSDT-HFDIGTGKDS 635
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 48.5 bits (115), Expect = 3e-08 Identities = 32/154 (20%), Positives = 53/154 (34%), Gaps = 26/154 (16%) Query: 119 GSGFIVSADGLILTTAYVVGQASEATVRLIDRR-----------EFKA-RVLAVDDQSDV 166 SG +V +LT +VV L F A ++ + D+ Sbjct: 104 ASGVVV-GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDL 162 Query: 167 AVLQIDATK--------LPTVRLGDSSRVRVGEPVLTIGTPDGSANTVTTGIVSATSRTL 218 A+++ + + + +++ +V + + G P G T L Sbjct: 163 AIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYP-GDKPVATMWESKGKITYL 221 Query: 219 PDGSRFPFFQTDVTGNLDNSGGPVFNRAGEVIGI 252 Q D++ NSG PVFN EVIGI Sbjct: 222 KG----EAMQYDLSTTGGNSGSPVFNEKNEVIGI 251
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 54.1 bits (130), Expect = 7e-11 Identities = 32/112 (28%), Positives = 51/112 (45%), Gaps = 7/112 (6%) Query: 5 VLIADDHPLVLLGVRHMLAGVG-DVSIVGEAHDPAGLLALLAATPCDIVITDFAMPEQPA 63 +L+ADD + + L+ G DV I A L +AA D+V+TD MP+ Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAAT---LWRWIAAGDGDLVVTDVVMPD--- 59 Query: 64 ADGLAMLSAIRDGHPSVRVIVLTMLDNPVLMHTMRQAGALAVLSKRGDLDEL 115 + +L I+ P + V+V++ + + + GA L K DL EL Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 64.5 bits (157), Expect = 5e-13 Identities = 29/120 (24%), Positives = 48/120 (40%), Gaps = 10/120 (8%) Query: 398 RVLVVDDQEMNRIVLRYQLDALGHRARLVASGDEALRALVRSAFDVVLTDCRMPGMDGVA 457 +LV DD R VL L G+ R+ ++ R + D+V+TD MP + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 458 LTAAIRAH-PDARVRATPIVGVTALVSDAEHARCVAAGMTSCIGKP----TTLDALERAL 512 L I+ PD P++ ++A + + G + KP + + RAL Sbjct: 65 LLPRIKKARPD-----LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 442 bits (1138), Expect = e-145 Identities = 159/808 (19%), Positives = 266/808 (32%), Gaps = 89/808 (11%) Query: 21 GTLYLELVVN-ALSTGRIVPIRYRDGVYYARA----GDLAQASVRTGAEP-------DAL 68 GT +++ +N R V D LA + T + DA Sbjct: 76 GTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDAC 135 Query: 69 VDL-SKLDGVQVEYESGEQRLKLSVPPDWLPQQTVG--SRRLYDRTPAAVSFGLLFNYDV 125 V L S + + + G+QRL L++P ++ + G L+D A L NY+ Sbjct: 136 VPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINA----GLLNYNF 191 Query: 126 --YTNSPTLGTSYTSAWTEQRLFDKWGAVTNTGVYRRDYGGGVGGAGSNRYLRYDTSWRY 183 + +G + A+ + GA Y +GS ++ +W Sbjct: 192 SGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLE 251 Query: 184 SDQDRML-TYTAGDVITGALPWSSAVRLGGVSVERDFKVRPDIVTYPLPQFSGQAAVPTA 242 D + T GD T + + G + D + PD P G A Sbjct: 252 RDIIPLRSRLTLGDGYTQGDIFDG-INFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQ 310 Query: 243 VDLFINGSKTTTGQVNPGPFTMNNVPFINGAGEASVVTTDALGRQVATTIPFYVANTLLQ 302 V + NG V PGPFT+N++ +G+ V +A G T+P+ L + Sbjct: 311 VTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQR 370 Query: 303 KGLSDYSLSAGAMRRDYGIRSFSYGKFAASGTARYGLADWLTIEGHAEGGERLALGGLGF 362 +G + YS++AG R + T +GL TI G + +R G Sbjct: 371 EGHTRYSITAGEYRSGNAQQE---KPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGI 427 Query: 363 DVGVGMFGVLNVAATQS-----SLAGTSGRQYAF----------------GYGYSSQRF- 400 +G G L+V TQ+ + G+ F GY YS+ + Sbjct: 428 GKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYF 487 Query: 401 SVSLQRIQRTAGFRDLS--------VYDLPADVTYRLVRSSTQATGALNLGAIG----GT 448 + + R G+ + R Q T LG Sbjct: 488 NFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSG 547 Query: 449 LGAGYFDVRGADGTRTRIANLSYTRPLFRRATLYASVNKTIGDHGVAAQLQLIV--PLGD 506 Y+ D N ++ + TL S+ K G L L V P Sbjct: 548 SHQTYWGTSNVDEQFQAGLNTAFEDINW---TLSYSLTKNAWQKGRDQMLALNVNIPFSH 604 Query: 507 K-----------GVVTGSVARDERNSFSERVQYSRSVPSDGGFGWNL--AYAGGGAHYQ- 552 + S++ D + ++ D +++ YAGGG Sbjct: 605 WLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSG 664 Query: 553 ---QADATWRNRYVQVQGGAYGYGAGRGYARWGEVQGSVVVMDGAVLPANRVDDAFVLID 609 A +R Y G + + V G V+ V ++D VL+ Sbjct: 665 STGYATLNYRGGYGNANIGYSHSDDIKQL--YYGVSGGVLAHANGVTLGQPLNDTVVLVK 722 Query: 610 TQGREGVPVRYENQLVGKTDGGGHLLVPWAPSYYAGKYEIDPLDLPSNVRVPIVERRVAV 669 G + V ENQ +TD G+ ++P+A Y + +D L NV + V Sbjct: 723 APGAKDAKV--ENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP 780 Query: 670 RDHRGALVTFPIQKIVSAQIALVDASGRPIGIGSRVLHEESGQAALVGWQGETYLEGLSA 729 F + + + + +P+ G+ V E S + +V G+ YL G+ Sbjct: 781 TRGAIVRAEFKARVG-IKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPL 839 Query: 730 VNHLRVT--TPDGRICHATFAADVDAAQ 755 ++V + C A + ++ Q Sbjct: 840 AGKVQVKWGEEENAHCVANYQLPPESQQ 867
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 41.0 bits (96), Expect = 6e-06 Identities = 29/173 (16%), Positives = 73/173 (42%), Gaps = 5/173 (2%) Query: 27 VDTQMFSLVIPALLTSWGIGKGQAGLIGGATLAAGAIGGLLAGMIADRFGRVRALQITVC 86 ++ + ++ +P + + + A + +IG + G ++D+ G R L + Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87 Query: 87 WFSLFTFLSAFAQNFEQLLVL-KTLQGIGFGGEWTAGAVLLSETVRAQHRGKAMGIVQSA 145 + + +F LL++ + +QG G V+++ + ++RGKA G++ S Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSI 147 Query: 146 WGFGWGGAVLLYTLVFSWLPPEWAWRALFAIGVLPALLVLYIRRAIPEPPRDD 198 G G + ++ ++ W L I ++ + V ++ + + + R Sbjct: 148 VAMGEGVGPAIGGMIAHYI----HWSYLLLIPMITIITVPFLMKLLKKEVRIK 196
>INTIMIN#Intimin signature. Length = 939 Score = 30.8 bits (69), Expect = 0.041 Identities = 37/181 (20%), Positives = 62/181 (34%), Gaps = 34/181 (18%) Query: 790 NRITLKGGDITVETPGQFLVKSGAHPFPGP--AAQSVSLPPLPVPAPLALFDEQIRFVNE 847 N + L ITV + GQ + + G F +A++ + A V + Sbjct: 540 NNVLLT---ITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTAT----------VKK 586 Query: 848 NGEPLGNVAYKLKLADGSTVSGVTDDNGRTERVSTDGPTAIQSATLTPTQVV---DCCGR 904 NG NV + VSG + + + G + + P QVV Sbjct: 587 NGVAQANVP-----VSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEM 641 Query: 905 TSDVPPPAV------KVDIKGIGTNDTLVGSSEKS-----VTVKGESRPLTEGEIEMAKT 953 TS + AV K I I + T ++ + V V +P++ E+ T Sbjct: 642 TSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTT 701 Query: 954 V 954 + Sbjct: 702 L 702
>PF07675#Cleaved Adhesin Length = 1358 Score = 32.0 bits (72), Expect = 0.006 Identities = 33/99 (33%), Positives = 43/99 (43%), Gaps = 13/99 (13%) Query: 376 SYNVYRNGDKVGAS-TSTAYIDSGLIASTTYSYTVTEVDPSAGESAQ-------SSPVSA 427 +Y +YRN ++ + T T Y D L A+ Y+Y V +V GESA +S Sbjct: 1260 TYTIYRNNTQIASGVTETTYRDPDL-ATGFYTYGV-KVVYPNGESAIETATLNITSLADV 1317 Query: 428 TTQSSFACTETTATNYAHVQAGRA--YDSFGIAYAAGSN 464 T Q + T T Q G A YD G AAG N Sbjct: 1318 TAQKPYTLTVVGKTITVTCQ-GEAMIYDMNGRRLAAGRN 1355
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 49.8 bits (119), Expect = 1e-08 Identities = 42/282 (14%), Positives = 93/282 (32%), Gaps = 25/282 (8%) Query: 81 VLGGMADKIGRRATLVITITLMTIGTSLIAFAPTYKDAGIFAPLMIVCARLLQGFSAGGE 140 VLG ++D+ GRR L++++ + +++A AP ++ R++ G + G Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW--------VLYIGRIVAGIT-GAT 112 Query: 141 MGGATGFLRDNVPAERLGYYTSWIQASIGFAIILASVLAVVLVKVLSAEQVESWGWRIPF 200 A ++ D + + ++ A GF ++ VL ++ PF Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGF---------SPHAPF 163 Query: 201 ----LIGLCLGPLGIYIRNQVHEPAEENVQIRERTPVLEIVRRWKSETLIGFGLVIF--W 254 + G ++ + H+ ++ P+ + V F Sbjct: 164 FAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQ 223 Query: 255 TVCSYVLLFYIPTYASKVLHLPASTGFIAVLVGASIVLFVTPVFGYLSDRYGRRRFLMGA 314 V ++ + + G G L + G ++ R G RR LM Sbjct: 224 LVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLG 283 Query: 315 LAVAVMVAYPMFRLLNVSPGLHSLLLFQVVFGLVIACYEGPI 356 + A Y + +++ G+ + + + Sbjct: 284 MI-ADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAML 324
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 39.2 bits (91), Expect = 2e-06 Identities = 17/79 (21%), Positives = 36/79 (45%), Gaps = 7/79 (8%) Query: 58 FVIEADGALIGYADLQEN----GYIDHFFVSGDHPRQGVGRLLMETIHDYA-QRQSMKVL 112 F+ + IG ++ N I+ V+ D+ ++GVG L+ ++A + ++ Sbjct: 68 FLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLM 127 Query: 113 --TSDVSRTAQPFFEHFGF 129 T D++ +A F+ F Sbjct: 128 LETQDINISACHFYAKHHF 146
>PF05860#haemagglutination activity domain. Length = 117 Score = 63.7 bits (155), Expect = 8e-14 Identities = 25/138 (18%), Positives = 48/138 (34%), Gaps = 23/138 (16%) Query: 72 AQIVGAGP-NAPSVIQTPNGLPQVNINKPGGAGVSLNTYNQFDVSHAGAILNNSPTIVNT 130 AQI S I T + G+ + + + +F V +G N+PT Sbjct: 1 AQITPDTTLPINSNITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTAFFNNPT---- 55 Query: 131 QQAGYINGNPNLSAGQAARIIVNQVNSTAASQIKGYVEIAGSRAEIVLANPAGIVVDGGG 190 + I+++V + S I G + A + A + L NP GI+ Sbjct: 56 ----------------NIQNIISRVTGGSVSNIDGLIR-ANATANLFLINPNGIIFGQNA 98 Query: 191 FINTSRAVLTTGVPQFGA 208 ++ + + + + Sbjct: 99 RLDIGGSFVGSTANRLKF 116
>PF07132#Harpin protein (HrpN) Length = 356 Score = 30.4 bits (68), Expect = 0.006 Identities = 19/63 (30%), Positives = 29/63 (46%) Query: 177 SVAGATGGMAAALAGAETGAVVGSIAGPLGTVFGGLAGAVIAGLVGSAAGCAAGSAVGAA 236 S T ++ G G +G + LG + GGL G + G +GS+ G GSA+G Sbjct: 52 SDIMTTMMFMGSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLGSGLGSALGGG 111 Query: 237 IDD 239 + Sbjct: 112 LGG 114
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 30.6 bits (69), Expect = 0.006 Identities = 20/137 (14%), Positives = 43/137 (31%), Gaps = 24/137 (17%) Query: 110 LRVLRNDGGPLSHGKDGFIQVLTVYH------------RRAAVLAADAVVAFLHQSYLNA 157 ++ +G + ++++ + RR L V+ Sbjct: 325 VQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVIT--------R 376 Query: 158 QLDPLSSR-EPWERFAADNALIDEHIGLAVDAEDDDSPTLRFLLPAGDDLPLKIEVSRLL 216 ++ R E + A + ++ E++ GD LP R+L Sbjct: 377 EIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA---SFGDALPPSGLYDRVL 433 Query: 217 YQLDRDAYVEALNAARG 233 +++ + AL A RG Sbjct: 434 AEMEYPLILAALTATRG 450
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 52.9 bits (127), Expect = 2e-09 Identities = 84/376 (22%), Positives = 143/376 (38%), Gaps = 47/376 (12%) Query: 203 PSAQLLATFGTFAAAF-LVRPLGGMVFGPLGDRIGRQRVLAMTMIMMAVGTFAIGLIPSY 261 S + A +G A + L++ V G L DR GR+ VL +++ AV + P Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96 Query: 262 DSIGLLAPALLLVARLVQGFSTGGEYGGAATFIAEFSTDKRR----GFMGSFLEFGTLIG 317 +L + R+V G TG A +IA+ + R GFM + FG + G Sbjct: 97 --------WVLYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG 147 Query: 318 YVMGAGVVALLTASLSHDALLSWGWRVPFLIAGPLGLIG-LYIRMKLEETPAFKRQAEAR 376 V+G L S PF A L + L L E+ +R+ R Sbjct: 148 PVLGG-----LMGGFSP--------HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRR 194 Query: 377 EAQDKAVPKAHFRRQLARHWRALLLCVGLVLIFNVTDYMALSYLPSYLSSTLHFDEAH-G 435 EA + P A FR A L+ V ++ AL + + H+D G Sbjct: 195 EALN---PLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI--FGEDRFHWDATTIG 249 Query: 436 LVLILIVMVLMMPMTLATGRLSDAVGRKPVMLAGCIGLFALAIPALLLIRTGETSLVFGG 495 + L ++ + + TG ++ +G + ++ +G+ A +LL + F Sbjct: 250 ISLAAFGILHSLAQAMITGPVAARLGERRALM---LGMIADGTGYILLAFATRGWMAFPI 306 Query: 496 LLILGALLSCFTGVMPSALPALFPTEI---RYGALAIGFNVSVSLFGGTT-PLAAAWLVD 551 +++L G+ AL A+ ++ R G L G +++ PL + Sbjct: 307 MVLLA-----SGGIGMPALQAMLSRQVDEERQGQLQ-GSLAALTSLTSIVGPLLFTAIYA 360 Query: 552 ATGNLMMPAYYLMGAA 567 A+ ++ GAA Sbjct: 361 ASITTWNGWAWIAGAA 376 Score = 31.7 bits (72), Expect = 0.008 Identities = 24/99 (24%), Positives = 44/99 (44%), Gaps = 7/99 (7%) Query: 420 LPSYLSSTLHFDEAHGLVLILIVMVLMMPMTLA--TGRLSDAVGRKPVMLAGCIGLFALA 477 LP L +H ++ IL+ + +M A G LSD GR+PV+ + L A Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVL---LVSLAGAA 84 Query: 478 IPALLLIRTGETSLVFGGLLILGALLSCFTGVMPSALPA 516 + ++ +++ G ++ G ++ TG + A A Sbjct: 85 VDYAIMATAPFLWVLYIGRIVAG--ITGATGAVAGAYIA 121
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 31.8 bits (72), Expect = 0.006 Identities = 27/122 (22%), Positives = 40/122 (32%), Gaps = 16/122 (13%) Query: 393 GEEADPATPAALRRGRKLVVQIGE----------TFGEKNAPMFVEKLDALRLADKLALD 442 + DP L G I + F EKN + +KLD L K + Sbjct: 133 KGKEDPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKLDKLDKESKDKFN 192 Query: 443 LAPVMVYGDDVTHVVTEEGIANLLMCRDADEREHAIRGVAGYTEIGRGRDRRLVERLRER 502 P + +VT EG + I + E + + LVE+LR+ Sbjct: 193 KIP-----AEKKLIVTSEGAFKYF-SKAYGVPSAYIWEINTEEEGTPEQIKTLVEKLRQT 246 Query: 503 GV 504 V Sbjct: 247 KV 248
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 34.3 bits (78), Expect = 0.001 Identities = 37/248 (14%), Positives = 65/248 (26%), Gaps = 25/248 (10%) Query: 183 PTRRDKAAVKAAEKERVAPLPEPAETAEGAPMKLRTPAAPTPPAEPAPASAAAPEAASAG 242 P+ A E P P PA +E + E A A + Sbjct: 1008 PSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNR- 1066 Query: 243 TAAPASAAASAAASAPAASPSAASTPAAAAAPVTHAAPASAPATASAPTAASVPTPASAP 302 A A ++ A + + + + T AT A V T + Sbjct: 1067 -----EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQE 1121 Query: 303 MPAPASELAPAPSTSATSSIAPPAAPVAS----QTQPARANTSVSTSAAAMSASTSTPAP 358 +P S+++P S Q +PAR N S + +T Sbjct: 1122 VPKVTSQVSPKQEQS-------------ETVQPQAEPARENDPTVNIKEPQSQTNTTADT 1168 Query: 359 APASTPVATAAPSPISPDAPFPA--DAAQTPPPAATPAAAPAAGPAPASANATATADAAP 416 + ++ P++ + P P ++ + Sbjct: 1169 EQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVR 1228 Query: 417 SATHDVNA 424 S H+V Sbjct: 1229 SVPHNVEP 1236 Score = 32.7 bits (74), Expect = 0.003 Identities = 21/163 (12%), Positives = 45/163 (27%), Gaps = 3/163 (1%) Query: 185 RRDKAAVKAAEKERVAPLPEPAETAEGAPMKLRTPAAPTPPAEPAPASAAAPEAASAGTA 244 + + AE R +P + + T A PA+ ++ P S Sbjct: 1134 EQSETVQPQAEPARE---NDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVN 1190 Query: 245 APASAAASAAASAPAASPSAASTPAAAAAPVTHAAPASAPATASAPTAASVPTPASAPMP 304 S + + PA + ++ ++ H + P S ++ + Sbjct: 1191 TGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALC 1250 Query: 305 APASELAPAPSTSATSSIAPPAAPVASQTQPARANTSVSTSAA 347 S A + A + A V + ++ Sbjct: 1251 DLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293 Score = 30.4 bits (68), Expect = 0.015 Identities = 28/184 (15%), Positives = 49/184 (26%), Gaps = 11/184 (5%) Query: 236 PEAASAGT---AAPASAAASAAASAPAASPSAASTPAAAAAPVTHAAPASAPATASAPTA 292 PE + + A P+ + APV APA+ P+ Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPAT-------PSE 1035 Query: 293 ASVPTPASAPMPAPASELAPAPSTSATSSIAPPAAPVASQTQPARANTSVSTSAAAMSAS 352 + ++ + E +T T+ A S + V+ S + + Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKET 1095 Query: 353 TSTPAPAPASTPVATAAPSPISPDAPFPADAAQTPPPAATPAAA-PAAGPAPASANATAT 411 +T A+ A P +Q P P A PA + Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155 Query: 412 ADAA 415 + Sbjct: 1156 KEPQ 1159
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.030 Identities = 14/37 (37%), Positives = 20/37 (54%), Gaps = 3/37 (8%) Query: 21 RVLEPLDLAIGAGETLVLLGPSGCGKTTTLRLIAGLD 57 RV+EP ++VL G G GK+T + + GLD Sbjct: 587 RVMEP---GCKFDYSVVLEGTGGIGKSTLINTLVGLD 620
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 46.7 bits (111), Expect = 8e-08 Identities = 58/249 (23%), Positives = 91/249 (36%), Gaps = 11/249 (4%) Query: 66 YATGMFVLAPLG----DRFDRRTLILLQIAGLSAALIAAAAAPTLAVLAAASLAIGILAT 121 YA F AP+ DRF RR ++L+ +AG + A AP L VL + GI Sbjct: 52 YALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA 111 Query: 122 IAQQAVPFAAEIAPPAERGQAVGTVMSGLLIGILLARTAAGFVAEYFGWRAVFGASVAAL 181 A + A+I ER + G + + G++ G + + A F A+ AAL Sbjct: 112 TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGF-SPHAPFFAA-AAL 169 Query: 182 AALAAVIVA-RLPRSSPTSTLPYGQLLASMWHLARKLRGLREASMTGAAIFAA--FSAFW 238 L + LP S P + + R RG+ + A F Sbjct: 170 NGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVP 229 Query: 239 PVLTLLLAGAPFHLGPQAAGL-FGIVGAAGALAAPY-AGRFADKRGPRAIISLAIALIAL 296 L ++ FH G+ G +LA G A + G R + L + Sbjct: 230 AALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGT 289 Query: 297 SFVIFALSG 305 +++ A + Sbjct: 290 GYILLAFAT 298
>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature. Length = 547 Score = 30.4 bits (68), Expect = 0.025 Identities = 15/61 (24%), Positives = 24/61 (39%), Gaps = 3/61 (4%) Query: 567 FNLG-LDPDKAREFHDETLPKDSAKVAHFC--SMCGPHFCSMKITQDVREFAAQQGMSED 623 F LG + P + R E P + + S CG H + +T + E Q ++ Sbjct: 265 FTLGDMQPGEHRTITVEFCPLKRGRATNIATVSYCGGHKNTASVTTVINEPCVQVSIAGA 324 Query: 624 D 624 D Sbjct: 325 D 325
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 34.3 bits (78), Expect = 4e-04 Identities = 34/197 (17%), Positives = 65/197 (32%), Gaps = 23/197 (11%) Query: 28 PPSAEVFNKSLADADAVAKAGDQERAIGLYQELAKSDPTREEPWSRIAQIQFQQGHYGQA 87 + N AD +V ++ I E P P + A Sbjct: 994 TTNITTPNNIQADVPSVPSNNEE---IARVDEAPVPPPAPATPSETTETV---------A 1041 Query: 88 IVAAQEALQRDKTDRQAKSVLAVAGLRIATESLGELRQDSSLAGDAKSDAQALAKQLRDT 147 + QE+ +K ++ A A +A E+ ++ ++ A+S ++ Q +T Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNR-EVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100 Query: 148 LGEAALFPPEQQATKPVVKKRRFVRRAKHVREVA----RAAESETAAAPAPPPAPPATPA 203 A ++ K V+ + K +V+ ++ + A PA P Sbjct: 1101 KETAT----VEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIK 1156 Query: 204 APTAAPS--AAPSAPAK 218 P + + A PAK Sbjct: 1157 EPQSQTNTTADTEQPAK 1173
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 37.2 bits (86), Expect = 6e-06 Identities = 19/63 (30%), Positives = 29/63 (46%), Gaps = 1/63 (1%) Query: 66 ISALFVKPIFHGMGVGRELLERAVKWLRDNGVDRVTLGT-DPGSRADGFYQHLGWQRGAL 124 I + V + GVG LL +A++W ++N + L T D A FY + GA+ Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAV 151 Query: 125 DEY 127 D Sbjct: 152 DTM 154
>ICENUCLEATIN#Ice nucleation protein signature. Length = 1258 Score = 32.0 bits (72), Expect = 0.014 Identities = 39/146 (26%), Positives = 52/146 (35%), Gaps = 6/146 (4%) Query: 675 GGQAADTAGQHAVAAAFRNWTPGAGGADAPPDGGDRALMAFGAQAGSVNVTPKTHVTYAG 734 G +++ AG + + AG G D +L+A GS + AG Sbjct: 219 GEESSQMAGYGSTQTGMKGSDLTAGYGSTGTAGDDSSLIA---GYGSTQTAGEDSSLTAG 275 Query: 735 ENIDQVAQQHLQLMSGQRLNATAGQGMQLFARGAGVQAVAGEGPMLLQAQADTLTANAQK 794 Q AQ+ L +G TAG L A G G AGE T T AQK Sbjct: 276 YGSTQTAQKGSDLTAGYGSTGTAGADSSLIA-GYGSTQTAGEESTQTAGYGSTQT--AQK 332 Query: 795 GVKITTNEHEVFVSAPRIRLVAEDGS 820 G +T + L+A GS Sbjct: 333 GSDLTAGYGSTGTAGDDSSLIAGYGS 358 Score = 31.3 bits (70), Expect = 0.020 Identities = 46/156 (29%), Positives = 58/156 (37%), Gaps = 6/156 (3%) Query: 675 GGQAADTAGQHAVAAAFRNWTPGAGGADAPPDGGDRALMAFGAQAGSVNVTPKTHVTYAG 734 G ++ TAG + A + AG G D +L+A GS + AG Sbjct: 363 GEDSSLTAGYGSTQTAQKGSDLTAGYGSTGTAGADSSLIA---GYGSTQTAGEESTQTAG 419 Query: 735 ENIDQVAQQHLQLMSGQRLNATAGQGMQLFARGAGVQAVAGEGPMLLQAQADTLTANAQK 794 Q AQ+ L +G TAG L A G G AGE L T T AQK Sbjct: 420 YGSTQTAQKGSDLTAGYGSTGTAGDDSSLIA-GYGSTQTAGEDSSLTAGYGSTQT--AQK 476 Query: 795 GVKITTNEHEVFVSAPRIRLVAEDGSYLEIGNGITL 830 G +T + L+A GS G G TL Sbjct: 477 GSDLTAGYGSTSTAGYESSLIAGYGSTQTAGYGSTL 512
>PF05272#Virulence-associated E family protein Length = 892 Score = 36.6 bits (84), Expect = 3e-04 Identities = 24/123 (19%), Positives = 40/123 (32%), Gaps = 9/123 (7%) Query: 191 IDFLEAADARGKLAHIR--ERLAHVLGDARQGALLREGLSV----VLAGQPNVGKSSLLN 244 + L K +R + + + ++ G VL G +GKS+L+N Sbjct: 555 VHVLGKTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLIN 614 Query: 245 ALAGAELAIVTPI-AGTTRDKVAQTIQVEGIPLHIIDTAGLRETEDEVEKIGIARTWGEI 303 L G + T GT +D Q + L + R + E K + Sbjct: 615 TLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMT--AFRRADAEAVKAFFSSRKDRY 672 Query: 304 ERA 306 A Sbjct: 673 RGA 675
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 495 bits (1276), Expect = e-173 Identities = 205/575 (35%), Positives = 320/575 (55%), Gaps = 45/575 (7%) Query: 1 MDIKRTVLWVIFFMSAVMLFDNWQRSHGRPSMFFPNVTQTNTASNATNGNGASGASAAAA 60 MD +R +L + + M++ W++ Q T + T Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQ-----PQAQQTTQTTTT------------- 42 Query: 61 NALPAAATGAAPATTAPAAQAQLVRFSTDVYNGEIDTRGGTLAKLTLTK---AGDGKQPD 117 AA AA + Q +L+ TDV + I+TRGG + + L + QP Sbjct: 43 -----AAGSAADQGVPASGQGKLISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQP- 96 Query: 118 LSVTLFDNAANHTYLARTGLLGGDFPN-----HNDVYTQAAGPTSLAAGQNTLKLAFESP 172 L + + Y A++GL G D P+ +Y LA GQN L++ Sbjct: 97 --FQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYT 154 Query: 173 VKGGVKVVKTYTFTRGSYVIGVDTKIENVGTAPVTPSVYMELVRD-----NTSVETPMFS 227 G KT+ RG Y + V+ ++N G P+ S + +L + + + F+ Sbjct: 155 DAAGNTFTKTFVLKRGDYAVNVNYNVQNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFA 214 Query: 228 -HTFLGPAVYTDQKHFQKITFSDIDKNKADYVTSADNGWIAMVQHYFASAWIPQQGAKRD 286 HTF G A T + ++K F I N+ ++S GW+AM+Q YFA+AWIP + Sbjct: 215 LHTFRGAAYSTPDEKYEKYKFDTIADNENLNISS-KGGWVAMLQQYFATAWIPHNDGTNN 273 Query: 287 IYVEKIDPTLYRVGVKQPVAAIAPGQSADVSARLFAGPEEERMLEGIAPGLELVKDYGWV 346 Y + + +G K + PGQ+ +++ L+ GPE + + +AP L+L DYGW+ Sbjct: 274 FYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWL 333 Query: 347 TIIAKPLFWLLEKIHGFVGNWGWAIVLLTLLIKAVFFPLSAASYKSMARMKEITPRMQAL 406 I++PLF LL+ IH FVGNWG++I+++T +++ + +PL+ A Y SMA+M+ + P++QA+ Sbjct: 334 WFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAM 393 Query: 407 RERFKSDPQKMNAALMELYKTEKVNPFGGCLPVVIQIPVFISLYWVLLASVEMRGAPWIL 466 RER D Q+++ +M LYK EKVNP GGC P++IQ+P+F++LY++L+ SVE+R AP+ L Sbjct: 394 RERLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAPFAL 453 Query: 467 WIHDLSQRDPYFILPVLMAVSMFVQTKLNPTP-PDPVQAKMMMFMPIAFSVMFFFFPAGL 525 WIHDLS +DPY+ILP+LM V+MF K++PT DP+Q K+M FMP+ F+V F +FP+GL Sbjct: 454 WIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGL 513 Query: 526 VLYYVVNNVLSIAQQYYITRTL---GAAAAKKKAS 557 VLYY+V+N+++I QQ I R L G + +KK S Sbjct: 514 VLYYIVSNLVTIIQQQLIYRGLEKRGLHSREKKKS 548
>PF06776#Invasion associated locus B Length = 214 Score = 33.0 bits (75), Expect = 0.002 Identities = 13/60 (21%), Positives = 17/60 (28%), Gaps = 12/60 (20%) Query: 107 PVTAGPAPSGAADANAPA------------PAGMNAATAAAVAAVAAAQAAQAAQANAAA 154 PVT P+ A PA A A A A + +A+A Sbjct: 15 PVTNHAVPALKAIQMGPAELSPMLASCRRLARRNGARLMLAGAMAIALSFGWSDRADAQG 74
>PF05616#Neisseria meningitidis TspB protein Length = 501 Score = 78.6 bits (193), Expect = 1e-17 Identities = 75/243 (30%), Positives = 106/243 (43%), Gaps = 27/243 (11%) Query: 294 QASNQPGY-QGVPFDPSYPISEDDVSSWNAQNPQWVPNVGEFTSVSPGTRAGGGSMGFPM 352 +A+ PGY + V P ++ V+ N NP V V F S G + P Sbjct: 257 KATGYPGYSEKVEVAPGTKVNMGPVTDRNG-NPVQV--VATFGRDSQGNTTVDVQV-IPR 312 Query: 353 PNGNPGASPGTDPGTNPGTNPGTNPGTNPGANPGTNPGTNPGTNPGTNPGTDPGTNPGTN 412 P+ PG++ + P +P NP NP P NPGT P NP DP NP N Sbjct: 313 PDLTPGSAEAPNAQPLPEVSPAENPANNP------APNENPGTRP--NPEPDPDLNPDAN 364 Query: 413 PGTD--PGTNPGTNPGTNPGTGPGDTGKPQPPWPD---VCVLHPDASGCAPLGSAN---D 464 P TD PGT P + P P G K + D +C PD C L N D Sbjct: 365 PDTDGQPGTRPDS-PAV-PDRPNGRHRKERKEGEDGGLLCKFFPDILACDRLPEPNPAED 422 Query: 465 VDVKRESKGVSLSPISIGLNNGVCPRP--YEVVVFDA--RLSFSYQPICDLAVRLRPLVL 520 +++ E+ V I ++ CP P + V V D+ + +FS++ C +A RLR ++L Sbjct: 423 LNLPSETVNVEFQKSGIFQDSAQCPAPVTFTVTVLDSSRQFAFSFENACTIAERLRYMLL 482 Query: 521 MLS 523 L+ Sbjct: 483 ALA 485
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 110 bits (277), Expect = 5e-28 Identities = 73/340 (21%), Positives = 129/340 (37%), Gaps = 57/340 (16%) Query: 150 VYKPRYRKASDLRELVEPLIGSHSMLPPVSVGPVAGESAGAVQVPGAPAAVP-TNAPVAQ 208 V +Y KASDL E++ + S +M A + A+ A TNA Sbjct: 271 VIYLKYAKASDLVEVLTGI--SSTMQSEKQ----AAKPVAALDKNIIIKAHGQTNA---- 320 Query: 209 PVASGVQARGGELVIVGSRDEVAMLRKLVPELDTAPGEVVVRGWAYEVTNTDS------- 261 L++ + D + L +++ +LD +V+V EV + D Sbjct: 321 ------------LIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQW 368 Query: 262 ----------TNSAWSIAAKVLGGQLRISSGDTSSDKS---------AVRFTGPGIDAAI 302 TNS I+ + G G SS + A F + Sbjct: 369 ANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLL 428 Query: 303 SALNADSRFKVISSPHVRIASGERVRLNVGQQVPTQSSVSYQGSSGTPVQSITYQDAGLI 362 +AL++ ++ ++++P + NVGQ+VP + S S ++ + G+ Sbjct: 429 TALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG-SQTTSGDNIFNTVERKTVGIK 487 Query: 363 FDVEPTVMRDA-IELRVHEEISDFVPTKTGVDTS--PTKNTRQLQTVTRLTDGEVVVLGG 419 V+P + + L + +E+S + + T NTR + + GE VV+GG Sbjct: 488 LKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGG 547 Query: 420 LIQDRNSTARSGYAWLPSF-LDG---RSSSKQRTEVLLVL 455 L+ S L + G RS+SK+ ++ L+L Sbjct: 548 LLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLML 587 Score = 30.3 bits (68), Expect = 0.021 Identities = 39/204 (19%), Positives = 66/204 (32%), Gaps = 24/204 (11%) Query: 83 TPYVLGPDVLTDTRLVSFRLDDQSRDVRDVMVDFLDSLGFQVVTK-NGVDYVMRKPGAVL 141 ++ P V + S+ + ++ + LD GF V+ NGV V+R A Sbjct: 52 KTVIIDPSVRGTITVRSYDMLNEE-QYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKT 110 Query: 142 A------------KADRDVYVYKPRYRKASDLRELVEPLIGSHSMLPPVSVGPVAGESAG 189 A + V A DL L+ L + + V E + Sbjct: 111 AAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHY-----EPSN 165 Query: 190 AVQVPGAPAAVPTNAPVAQPVASGVQARGGELVIVGSR-DEVAMLRKLVPELDTAPGEVV 248 + + G A + + + V A +V V A + KLV EL+ + Sbjct: 166 VLLMTGRAAVIKRLLTIVERVD---NAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSA 222 Query: 249 VRGW-AYEVTNTDSTNSAWSIAAK 271 + G V + TN+ Sbjct: 223 LPGSMVANVVADERTNAVLVSGEP 246
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 27.2 bits (60), Expect = 0.022 Identities = 24/85 (28%), Positives = 43/85 (50%), Gaps = 5/85 (5%) Query: 5 ATIARPYAEALFRVAEGGDISAWSTLVQELAQVAQLPEVLSVASSPKVSRAQVAELLLAT 64 AT + A+A+F+ GGD+S +Q++ + +P L+V +R + ELL T Sbjct: 28 ATTTKSAADAVFQQLGGGDVSG---AMQDIDLIMDIPVKLTVELGR--TRMTIKELLRLT 82 Query: 65 LKSPLASGAQAKNFVQMLVDNHRIA 89 S +A A + +L++ + IA Sbjct: 83 QGSVVALDGLAGEPLDILINGYLIA 107
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 115 bits (289), Expect = 7e-37 Identities = 45/88 (51%), Positives = 58/88 (65%) Query: 36 NKQELIDAVAAQTGASKAQTGETLDTLLEVIKKAVSKGDSVQLIGFGSFGSGKRAARTGR 95 NKQ+LI VA T +K + +D + + ++KG+ VQLIGFG+F +RAAR GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 96 NPKTGETIKIPAAKTVKFTAGKAFKDAV 123 NP+TGE IKI A+K F AGKA KDAV Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 32.5 bits (74), Expect = 0.004 Identities = 20/104 (19%), Positives = 45/104 (43%), Gaps = 21/104 (20%) Query: 205 EDVARLDTMVTVVDAFNFLRDYARDDALAEHGLAATEEDDRTLVELLIEQI-EFCDVLVI 263 ED + VV+ N D + + + + EE+D L E + +++ F D++++ Sbjct: 199 EDYTSAGGVDNVVEIINMA-DRKTEKFI----IESLEEEDPELAEEIKKKMFVFEDIVLL 253 Query: 264 NKADL------VDADSLARL---------QRILANLNPRARQIV 292 + + +D LA+ ++I N++ RA ++ Sbjct: 254 DDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASML 297
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 398 bits (1025), Expect = e-131 Identities = 212/684 (30%), Positives = 320/684 (46%), Gaps = 80/684 (11%) Query: 6 TALVVAGIVAAQAAHAQVTLNFVNADIDQVAKAIGAATGKTIIVDPRVKGQLNLVAERAV 65 T L+ A ++ AA + + +F DI + + KT+I+DP V+G + + + + Sbjct: 13 TLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 72 Query: 66 PEDQALKTLQSALRMQGFALV-QDHGVLKVVPEADAKLQGVPTYIGNTPQAHGDQVVTQV 124 E+Q + S L + GFA++ ++GVLKVV DAK VP P GD+VVT+V Sbjct: 73 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGI-GDEVVTRV 131 Query: 125 FELRNESANNLLPVLRPLI--SPNNTITAYPANNTIVVTDYADNVRRIARIIAGVDNAAG 182 L N +A +L P+LR L + ++ Y +N +++T A ++R+ I+ VDNA Sbjct: 132 VPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGD 191 Query: 183 AQVAVVPLKNANAIDIAAQLTKLLDPGAIGNTDATLKVTVQADPRTNALLLRASNTQRLA 242 V VPL A+A D+ +T+L + ++ V AD RTNA+L+ R Sbjct: 192 RSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSR-Q 250 Query: 243 AAKKIAQQLDAPSGVPGNMHVVPLRNADAVKLAKTLRGMLGKGGGESGSSASSNDANAFN 302 + +QLD GN V+ L+ A A L + L G+ Sbjct: 251 RIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGIS-------------------- 290 Query: 303 QGGSQSGSNFSTGTSGTPPLPSGLSSGSSGGMGGTMGGGGLGTAGLLGGDKDKSDENQPG 362 S + S + Sbjct: 291 ---------------------STMQSEKQAAKPVAALDKNI------------------- 310 Query: 363 GMIQADSATNSLIITASDPVYRNLRAVIDQLDARRAQVYIEALVVELNSTTNANLGIQWQ 422 +I+A TN+LI+TA+ V +L VI QLD RR QV +EA++ E+ NLGIQW Sbjct: 311 -IIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWA 369 Query: 423 VANNALYAGTN--LPTGGVGGGNSIVDLTTRAATSAVGAISTLTPGLNIGWLHNMFGIQG 480 N + TN LP G + + ++S A+S + + F Sbjct: 370 NKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALS------SFNGIAAGFYQGN 423 Query: 481 LGGLLQYFSGVSDANVLSTPNLVTLDNEEAKIVVGQNVPIPTGSYSNLTSGNTNNAFNTY 540 LL S + ++L+TP++VTLDN EA VGQ VP+ TGS + + +N FNT Sbjct: 424 WAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGS----QTTSGDNIFNTV 479 Query: 541 DRRDVGLTLHVKPQITEGGILKLQLYTEDSSVVNTTVNNQS--GPTFNKRSIQSTVLADN 598 +R+ VG+ L VKPQI EG + L++ E SSV + + S G TFN R++ + VL + Sbjct: 480 ERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGS 539 Query: 599 GEIIVLGGLMQDNYQVSNSKVPLLGDIPWIGQLFRSEGKTRQKTNLMVFLRPVIITDRET 658 GE +V+GGL+ + + KVPLLGDIP IG LFRS K K NLM+F+RP +I DR+ Sbjct: 540 GETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDE 599 Query: 659 AQAVTSNRYDYIQGVTGAYKSDNN 682 + +S +Y + N Sbjct: 600 YRQASSGQYTAFNDAQSKQRGKEN 623
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 136 bits (343), Expect = 2e-44 Identities = 77/126 (61%), Positives = 97/126 (76%), Gaps = 3/126 (2%) Query: 19 AMDD-WAAALAEQNQQPIETGATGAGVFRPLSKAAASSTHNDIDMILDIPVKMTVELGRT 77 A+DD WA AL EQ ++ A VF+ L S DID+I+DIPVK+TVELGRT Sbjct: 14 ALDDLWADALNEQKATTTKSAADA--VFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRT 71 Query: 78 KIAIRNLLQLAQGSVVELDGLAGEPMDVLVNGCLIAQGEVVVVNDKFGIRLTDIITPSER 137 ++ I+ LL+L QGSVV LDGLAGEP+D+L+NG LIAQGEVVVV DK+G+R+TDIITPSER Sbjct: 72 RMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGVRITDIITPSER 131 Query: 138 IRKLNR 143 +R+L+R Sbjct: 132 MRRLSR 137
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 295 bits (757), Expect = e-103 Identities = 155/242 (64%), Positives = 192/242 (79%), Gaps = 1/242 (0%) Query: 34 RWLPAILIGLAPALACAQAAGLPAFNSAPGPNGGTTYSLSVQTMLLLTMLSFLPAMLLMM 93 R L + L + A LP S P P GG ++SL VQT++ +T L+F+PA+LLMM Sbjct: 3 RLLSVAPVLLW-LITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMM 61 Query: 94 TSFTRIIIVLSLLRQAIGTASTPPNQVLVGLALFLTLFVMSPVLDKAYTDAYKPFSEGTL 153 TSFTRIIIV LLR A+GT S PPNQVL+GLALFLT F+MSPV+DK Y DAY+PFSE + Sbjct: 62 TSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKI 121 Query: 154 QMDQAVQRGTAPFKTFMLKQTRETDLALFAKISKAAPMQGPEDVPLSLLVPAFVTSELKT 213 M +A+++G P + FML+QTRE DL LFA+++ P+QGPE VP+ +L+PA+VTSELKT Sbjct: 122 SMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKT 181 Query: 214 GFQIGFTIFIPFLIIDMVVASVLMSMGMMMVSPATVSLPFKLMLFVLVDGWQLLIGSLAQ 273 FQIGFTIFIPFLIID+V+ASVLM++GMMMV PAT++LPFKLMLFVLVDGWQLL+GSLAQ Sbjct: 182 AFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQ 241 Query: 274 SF 275 SF Sbjct: 242 SF 243
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 68.6 bits (168), Expect = 3e-19 Identities = 27/85 (31%), Positives = 47/85 (55%) Query: 4 ENVMTLAHQAMYIGLLLAAPLLLVALAVGLVVSLFQAATQINEATLSFIPKLLAVAATMV 63 ++++ ++A+Y+ L+L+ +VA +GL+V LFQ TQ+ E TL F KLL V + Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 IAGPWMLSTMLDYLREILLRVATLG 88 + W +L Y R+++ G Sbjct: 62 LLSGWYGEVLLSYGRQVIFLALAKG 86
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 161 bits (409), Expect = 7e-51 Identities = 118/250 (47%), Positives = 159/250 (63%), Gaps = 1/250 (0%) Query: 1 MFSVTYAQLNGWLTAFLWPFVRMLALVALAPVTGHRATPVRVKIGLAGFMALVVAPTLPP 60 M VT Q WL + WP +R+LAL++ AP+ R+ P RVK+GLA + +AP+LP Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 Query: 61 MPAATVFSAQGVWIIVNQFLIGAALGFTMQIVFAAVEAAGDIIGLSMGLGFATFFDPHSS 120 VFS +W+ V Q LIG ALGFTMQ FAAV AG+IIGL MGL FATF DP S Sbjct: 61 NDVP-VFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119 Query: 121 GATPVMGRFLNAVAILAFLAFDGHLQVFAVLVDSFRLVPISADLLRAAGWQTLVAFGSAI 180 PV+ R ++ +A+L FL F+GHL + ++LVD+F +PI + L + + L GS I Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179 Query: 181 FEMGLLLALPVVAALLIANLALGILNRAAPQIGIFQVGFPVTMLVGLLLVQLMAPNLIPF 240 F GL+LALP++ LL NLALG+LNR APQ+ IF +GFP+T+ VG+ L+ + P + PF Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPF 239 Query: 241 VGRLFDTGVD 250 LF + Sbjct: 240 CEHLFSEIFN 249
>PF06580#Sensor histidine kinase Length = 349 Score = 48.7 bits (116), Expect = 2e-08 Identities = 23/128 (17%), Positives = 45/128 (35%), Gaps = 22/128 (17%) Query: 334 RIDLGAELDDDLQVAGSESLLSALLMNLVDNAVRY----THEGGCVTVSARRDGEAVVLD 389 R+ +++ + + L+ LV+N +++ +GG + + +D V L+ Sbjct: 239 RLQFENQINPAIM---DVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLE 295 Query: 390 VVDDGPGIPAEARPHVFKRFYRVAKDEEGTGLGLAIVEE-IAQSHGGTVTLGTGPGNRGV 448 V + G +E TG GL V E + +G + V Sbjct: 296 VENTGSLALKN--------------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKV 341 Query: 449 RMTVRLPA 456 V +P Sbjct: 342 NAMVLIPG 349
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 96.4 bits (240), Expect = 2e-25 Identities = 30/119 (25%), Positives = 60/119 (50%), Gaps = 1/119 (0%) Query: 2 KLLLVEDNAELAHWIVDLLRGEGFGVDSAPDGESADTVLKAQRYDALLLDMRLPGMSGKE 61 +L+ +D+A + + L G+ V + + + A D ++ D+ +P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 LLARLRRRGDNVPVLMLTAHGSVDDKVDCFSAGADDYVVKPFESRELVARI-RALIRRQ 119 LL R+++ ++PVL+++A + + GA DY+ KPF+ EL+ I RAL + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 62.1 bits (151), Expect = 6e-13 Identities = 55/228 (24%), Positives = 94/228 (41%), Gaps = 28/228 (12%) Query: 1 MKRQYLALSIATAACAAPQAHAQSSVQLYGLIDLSFPTYQSHANAKGDHVIGMGLGGEPW 60 MK+ +AL++A AA + V LYG I T +S A+ G + G Sbjct: 1 MKKSLIALTLAALPVAA-----MADVTLYGTIKAGVETSRSVAH-NGAQAASVETGTGIV 54 Query: 61 FSGSRWGLKGAEDIGGGTKVIFRLESEYTVADGNMEDPGQIFDRDAWVGVENDTFGKLTA 120 GS+ G KG ED+G G K I+++E + ++A + +R +++G++ FGKL Sbjct: 55 DLGSKIGFKGQEDLGNGLKAIWQVEQKASIAGTD----SGWGNRQSFIGLKGG-FGKLRV 109 Query: 121 GFQNTIARDAGAIYGDPYGSAKLTTEEGGWTNANNFKQMIFYAAGATGTRYNNGLAWKKL 180 G N++ +D G I +P+ S RY++ + Sbjct: 110 GRLNSVLKDTGDI--NPWDSKSDYLGVNKIAEPEARL---------ISVRYDS----PEF 154 Query: 181 FGNGIFASAGYAFSNSTSFGQNSTYQVALGYNGGPFNVSGFFSHVNHA 228 G+ S YA +++ + +Y Y G F V ++ H Sbjct: 155 A--GLSGSVQYALNDNAGRHNSESYHAGFNYKNGGFFVQYGGAYKRHH 200
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 72.9 bits (179), Expect = 2e-16 Identities = 73/354 (20%), Positives = 111/354 (31%), Gaps = 71/354 (20%) Query: 1 MKK--FAVAAAGLAVATGAHASDGSVTLFGLVDAGVSYVSNEGGRHNVYFDDGIAVPNLW 58 MKK A+ A L VA A VTL+G + AGV + + Sbjct: 1 MKKSLIALTLAALPVAAMA-----DVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVD 55 Query: 59 -----GLRGTEDLGGGAKAIFELTSQYALGNGAALPTPGSMFSRTALVGLWSERLGSVTL 113 G +G EDLG G KAI+++ + T +R + +GL G + + Sbjct: 56 LGSKIGFKGQEDLGNGLKAIWQVEQ-----KASIAGTDSGWGNRQSFIGLKGG-FGKLRV 109 Query: 114 GQQYDFMTDSLTFGSFDGAFRYGGLYNFRQGPFSKLGIPDNPTGSFDFDRMAGSSRVPNA 173 G+ + D+ +D Y G +++A + Sbjct: 110 GRLNSVLKDTGDINPWDSKSDYLG-----------------------VNKIAEPEARLIS 146 Query: 174 VKYTSANLNGLVFGLMYGFGNQAGGGLSANSTVSAGLKYEAGSFALGAAYVEVKYPQMNN 233 V+Y S GL + Y + AG S + K G AY Q N Sbjct: 147 VRYDSPEFAGLSGSVQYALNDNAGRHNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENV 206 Query: 234 GHDGLRNWGLGARYALSAFDLNL-LYTNTRNT--LTGAAIDVIQAGARYVGAPWTIGANY 290 + + L + Y L + ++ + Q + A Sbjct: 207 NIEKYQIHRLVSGY--DNDALYASVAVQQQDAKLVEENYSHNSQTE---------VAATL 255 Query: 291 EYMKGNAQLDRNYAH----------------QVTAAVQYALSKRTSAYVETVYQ 328 Y GN +YAH QV +Y SKRTSA V + Sbjct: 256 AYRFGNVTPRVSYAHGFKGSFDATNYNNDYDQVVVGAEYDFSKRTSALVSAGWL 309
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 56.6 bits (136), Expect = 5e-12 Identities = 28/183 (15%), Positives = 61/183 (33%), Gaps = 12/183 (6%) Query: 22 ASRARPKPGERRVHILQTLASMLESPKREKITTAALAARLDVSEAALYRHFSSKAQMFEG 81 A + + + E R HIL + + +A V+ A+Y HF K+ +F Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61 Query: 82 LIEFIEETFFGLVNQIAANEPNGVLQA-RSIALMLLNFSARNPGMTRVLTGEALVGEHER 140 + E E L + A P L R I + +L + ++ + H+ Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME----IIFHKC 117 Query: 141 LAERVDQMLERVEASIKQCLRVALLDANARADGAGGGAPPPVPLPDDYDPALRASLVVSY 200 ++++ + ++ ++ + P + A ++ Y Sbjct: 118 EFVGEMAVVQQAQRNLCLESY-DRIEQTLKHCIEAKMLPADL------MTRRAAIIMRGY 170 Query: 201 VLG 203 + G Sbjct: 171 ISG 173
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 44.0 bits (104), Expect = 4e-07 Identities = 27/99 (27%), Positives = 48/99 (48%), Gaps = 6/99 (6%) Query: 236 IPVISPIGFGEDGLSYNINADLVAGKLATVLNAEKLVMMTNIPGVMDKEG----NLLTDL 291 +PVI G G+ I+ DL KLA +NA+ +++T++ G G L ++ Sbjct: 197 VPVILEDG-EIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQWLREV 255 Query: 292 SAREIDALFEDGT-ISGGMLPKISSALDAAKSGVKSVHI 329 E+ +E+G +G M PK+ +A+ + G + I Sbjct: 256 KVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAII 294 Score = 37.1 bits (86), Expect = 7e-05 Identities = 21/60 (35%), Positives = 27/60 (45%), Gaps = 10/60 (16%) Query: 87 GKTVVIKYGGNAMTEERLKQGF----------ARDVILLKLVGINPVIVHGGGPQIDQAL 136 GK VVI GGNA+ + K + AR + + G VI HG GPQ+ L Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 87.2 bits (216), Expect = 1e-22 Identities = 30/127 (23%), Positives = 59/127 (46%) Query: 1 MSDKNFLVIDDNEVFAGTLARGLERRGYAVRQAHNKDEALKLAGAEKFEFITVDLHLGND 60 M+ LV DD+ L + L R GY VR N + A + + D+ + ++ Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 SGLSLIAPLCDLQPDARILVLTGYASIATAVQAVKDGADNYLAKPANVESILAALQTNAT 120 + L+ + +PD +LV++ + TA++A + GA +YL KP ++ ++ + Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 EVQAEEA 127 E + + Sbjct: 121 EPKRRPS 127 Score = 45.2 bits (107), Expect = 4e-08 Identities = 16/101 (15%), Positives = 32/101 (31%), Gaps = 3/101 (2%) Query: 75 DARILVLTGYASIATAVQAVKDGADNYLAKPANVESILAALQTNATEVQAEEALENPVVL 134 I+ + I + L+ VE + + + L Sbjct: 375 TREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGL---YDR 431 Query: 135 SVDRLEWEHIQRVLAENNNNISATARALNMHRRTLQRKLAK 175 + +E+ I L N A L ++R TL++K+ + Sbjct: 432 VLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 30.6 bits (69), Expect = 0.015 Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 3/36 (8%) Query: 49 TPKNILMIGPTGVGKTEIAR---RLAKLADAPFIKI 81 T +++ G +G GK +AR K + PF+ I Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 76.4 bits (188), Expect = 5e-18 Identities = 42/177 (23%), Positives = 74/177 (41%), Gaps = 20/177 (11%) Query: 50 RILLTGAAGSLGRVLRGRL-RRYADVVRVSDIAP-----LDGAR------DGEECVRCDL 97 + L+TGAAG +G + RL VV + ++ L AR G + + DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 98 ADAAAVDALVRD--VDVIVHFG---GV--SVERPFDTVLPANITGAYHVYEAARRHGVRR 150 AD + L + + V S+E P +N+TG ++ E R + ++ Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYA-DSNLTGFLNILEGCRHNKIQH 120 Query: 151 IVFASSNHVTGFYEQGERIDTAAPPRPDGYYGLSKAFGEQLARFHHDRYGIESVCIR 207 +++ASS+ V G + + P Y +K E +A + YG+ + +R Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLR 177
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 63.2 bits (154), Expect = 3e-15 Identities = 17/78 (21%), Positives = 31/78 (39%), Gaps = 1/78 (1%) Query: 10 AVLAYDAKGGDSAPRVVAKGYGLVAERIIERARDAGLYVHTAPEMV-SLLMQVDLDARIP 68 A+ +G P V K + + + A + G+ + + +L +D IP Sbjct: 268 AIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYIP 327 Query: 69 PQLYQAVAELLAWLYALE 86 + +A AE+L WL Sbjct: 328 AEQIEATAEVLRWLERQN 345
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 33.8 bits (77), Expect = 0.001 Identities = 17/58 (29%), Positives = 24/58 (41%) Query: 356 ISAPGSTNHGTLQGSALENSNVDLTSQLVKLITAQRNYQANAQTIKTQQTVDQTLINL 413 SA L S V+L + L Q+ Y ANAQ ++T + LIN+ Sbjct: 488 SSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545 Score = 29.9 bits (67), Expect = 0.019 Identities = 11/31 (35%), Positives = 17/31 (54%) Query: 6 GLSGLAGASSDLDVIGNNIANANTVGFKGST 36 +SGL A + L+ NNI++ N G+ T Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQT 37
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 29.2 bits (65), Expect = 0.020 Identities = 9/34 (26%), Positives = 18/34 (52%) Query: 4 LIYTAMTGATQSLEQQSVVANNLANASTTGFRAQ 37 LI AM+G + + +NN+++ + G+ Q Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQ 36
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 43.0 bits (101), Expect = 7e-07 Identities = 10/48 (20%), Positives = 23/48 (47%) Query: 213 TLKQGYVEASNVNVVQELVNMIQTQRAYEINSKAVTTSDQMLQTVTQM 260 L S VN+ +E N+ + Q+ Y N++ + T++ + + + Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545 Score = 40.7 bits (95), Expect = 4e-06 Identities = 19/80 (23%), Positives = 34/80 (42%), Gaps = 14/80 (17%) Query: 4 SLYIAATGMNAQQAQMDVISNNLANVSTNGFKGSRAVFEDLLYQTVRQPGANSTQQTELP 63 + A +G+NA QA ++ SNN+++ + G+ RQ + + L Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYT--------------RQTTIMAQANSTLG 48 Query: 64 SGLQLGTGVQQVATERLYTQ 83 +G +G GV +R Y Sbjct: 49 AGGWVGNGVYVSGVQREYDA 68
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 204 bits (520), Expect = 2e-68 Identities = 128/222 (57%), Positives = 156/222 (70%), Gaps = 7/222 (3%) Query: 25 AALAAAALALAGCAQIPREPITQQPMSAMPPMPPAMQAPGSIY---NPGYAG-RPLFEDQ 80 A + L+L GCA IP P+ Q SA P P A GSI+ P G +PLFED+ Sbjct: 10 AISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINYGYQPLFEDR 69 Query: 81 RPRNVGDILTIVIAENINATKSSGANTNRQGNTSFDVPTAG-FLGGLF--NKANLSAQGA 137 RPRN+GD LTIV+ EN++A+KSS AN +R G T+F T +L GLF +A++ A G Sbjct: 70 RPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADVEASGG 129 Query: 138 NKFAATGGASAANTFNGTITVTVTNVLPNGNLVVSGQKQMLINQGNEFVRFSGVVNPNTI 197 N F GGA+A+NTF+GT+TVTV VL NGNL V G+KQ+ INQG EF+RFSGVVNP TI Sbjct: 130 NTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTI 189 Query: 198 SGQNSVYSTQVADARIEYSAKGYINEAETMGWLQRFFLNIAP 239 SG N+V STQVADARIEY GYINEA+ MGWLQRFFLN++P Sbjct: 190 SGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSP 231
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 368 bits (947), Expect = e-128 Identities = 158/367 (43%), Positives = 216/367 (58%), Gaps = 19/367 (5%) Query: 36 LAFAPAAARAERLKDLAQIQGVRDNPLIGYGLVVGLDGTGDQTMQTPFTTQTLANMLANL 95 L+ PA A R+KD+A +Q RDN LIGYGLVVGL GTGD +PFT Q++ ML NL Sbjct: 19 LSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMRAMLQNL 78 Query: 96 GISINNGSANGAGSSAMTNMQLKNVAAVMVTATLPPFARPGEAIDVTVSSLGNAKSLRGG 155 GI+ G +N KN+AAVMVTA LPPFA PG +DVTVSSLG+A SLRGG Sbjct: 79 GITTQGGQSN-----------AKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGG 127 Query: 156 TLLLTPLKGADGQVYALAQGNMAVGGAGASANGSRVQVNQLAAGRIAGGAIVERSVPNAV 215 L++T L GADGQ+YA+AQG + V G A + + + + R+ GAI+ER +P+ Sbjct: 128 NLIMTSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKF 187 Query: 216 AQMNGVLQLQLNDMDYGTAQRIVSAVN----ASFGAGTAMALDGRTIQLTAPADSAQQVA 271 L LQL + D+ TA R+ VN A +G A D + I + P + Sbjct: 188 KDSV-NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRVA-DLTR 245 Query: 272 FMARLQNLEVSPEKAAAKVILNARTGSIVMNQMVTLQNCAIAHGNLSVVVNTQPVVSQPG 331 MA ++NL V + AKV++N RTG+IV+ V + A+++G L+V V P V QP Sbjct: 246 LMAEIENLTVETD-TPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPA 304 Query: 332 PFSNGQTVVAQQSQIQLKQDNGSLRMVTAGANLAEVVKALNSLGATPADLMSILQAMKAA 391 PFS GQT V Q+ I Q+ + + G +L +V LNS+G +++ILQ +K+A Sbjct: 305 PFSRGQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSA 363 Query: 392 GALRADL 398 GAL+A+L Sbjct: 364 GALQAEL 370
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 222 bits (566), Expect = 4e-73 Identities = 124/296 (41%), Positives = 173/296 (58%), Gaps = 14/296 (4%) Query: 15 ALDVQGFDALRSKAAAVPPREGVKMVAGQFDAMFTQMMLKSMRDATPSDGLLDSNSSKMY 74 A D Q + L++KA P ++ VA Q + MF QMMLKSMRDA P DGL S +++Y Sbjct: 12 AWDAQSLNELKAKAGEDPAAN-IRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLY 70 Query: 75 TSMLDQQLAQQMSS-KGIGVADALTKQLLRNANVAPDAQSEGGLAAMNALAKAYANSNAS 133 TSM DQQ+AQQM++ KG+G+A+ + KQ+ + ++ + Y N S Sbjct: 71 TSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQALS 130 Query: 134 SGNGALAGTHGYSAASALTPPLKGNGSPQAEAFVEKMAGAAQAASAATGIPARFIVGQAA 193 + + ++AF+ +++ AQ AS +G+P I+ QAA Sbjct: 131 QLVQKAVPRNYDDSLPG-----------DSKAFLAQLSLPAQLASQQSGVPHHLILAQAA 179 Query: 194 LESGWGKREIRGANGESSYNVFGIKATKGWTGRTVSAVTTEYVNGKPHRVVARFRAYDSY 253 LESGWG+R+IR NGE SYN+FG+KA+ W G TTEY NG+ +V A+FR Y SY Sbjct: 180 LESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSY 239 Query: 254 EHAMTDYASLLRNNPRYASVLNAGHSAEGFANGMQKAGYATDPHYAKKLISIMQQI 309 A++DY LL NPRYA+V A SAE A +Q AGYATDPHYA+KL +++QQ+ Sbjct: 240 LEALSDYVGLLTRNPRYAAVTTAA-SAEQGAQALQDAGYATDPHYARKLTNMIQQM 294
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 225 bits (576), Expect = 5e-68 Identities = 158/443 (35%), Positives = 248/443 (55%), Gaps = 10/443 (2%) Query: 3 NTLMNLGVSGLNAALWGLTTTGQNISNAATPGYSVERPVYAEASGQYTSSGYLPQGVSTV 62 ++L+N +SGLNAA L T NIS+ GY+ + + A+A+ + G++ GV Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60 Query: 63 TVERQYNQYLSNQLNAAQTQGSSLSTYYTLVAQLNNYVGSPTAGIATAITNYFTGLQTVA 122 V+R+Y+ +++NQL AAQTQ S L+ Y +++++N + + T+ +AT + ++FT LQT+ Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120 Query: 123 NNAADPSARQTAMSNAQTLASQLVAAGQQYSQLRQSVNSQLTDTVTQINSYTSQIAQLNE 182 +NA DP+ARQ + ++ L +Q Q + VN + +V QIN+Y QIA LN+ Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180 Query: 183 QIA--SASSQGQPPNQLLDQRDLAVSKLSQFAGVQVVPT-NGSYSVFLAGGQPLVVGNAS 239 QI+ + G PN LLDQRD VS+L+Q GV+V G+Y++ +A G LV G+ + Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240 Query: 240 YQLAAVASKSDPSELTIVSNGVAGANPQGSPQYLPDASLTGGTLGGLLAFRSQTLDPAQA 299 QLAAV S +DPS VA + +P+ L G+LGG+L FRSQ LD + Sbjct: 241 RQLAAVPSSADPSR-----TTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRN 295 Query: 300 QLGALAVSFASQVNAQNALGVDMSGNPGGNLFTAGSPIVYANQGNTSSSTLSASIANGAQ 359 LG LA++FA N Q+ G D +G+ G + F G P V N N + A++ + + Sbjct: 296 TLGQLALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASA 355 Query: 360 PPSSDYALSYDGSKYTLTDRATGSVVGTATPATNPPTMTIGGLNLSLSATPNAGDSFTVL 419 ++DY +S+D +++ +T R + T TP N + GL L+ + TP DSFT+ Sbjct: 356 VLATDYKISFDNNQWQVT-RLASNTTFTVTPDAN-GKVAFDGLELTFTGTPAVNDSFTLK 413 Query: 420 PTRGALEGFSLATANGSAIAAAS 442 P A+ + + + IA AS Sbjct: 414 PVSDAIVNMDVLITDEAKIAMAS 436 Score = 82.7 bits (204), Expect = 1e-18 Identities = 46/105 (43%), Positives = 66/105 (62%) Query: 561 GTNDGRNALALSQLVNSKTMNNGTTTLTGAYAGYVNAIGNAASQLKASSAAQTALVGQIT 620 G +D RN AL L ++ G + AYA V+ IGN + LK SSA Q +V Q++ Sbjct: 441 GDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLS 500 Query: 621 QAQQSVSGVNQNEEAANLMQYQQLYQANAKVIQTANSVFQTVLGL 665 QQS+SGVN +EE NL ++QQ Y ANA+V+QTAN++F ++ + Sbjct: 501 NQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>FLAGELLIN#Flagellin signature. Length = 507 Score = 43.5 bits (102), Expect = 1e-06 Identities = 59/386 (15%), Positives = 121/386 (31%), Gaps = 10/386 (2%) Query: 16 MNDQQAQIAQLYQQVSSGISLTTPADNPLAAAQAVQLSATSATLSQYTQNQTIVQTALQT 75 +N Q+ ++ +++SSG+ + + D+ A A + ++ L+Q ++N + QT Sbjct: 17 LNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQT 76 Query: 76 EDTTLTSVNDVLNAAYQSIMHAGDGGLSDSDRAALAAQIQGSRDHLLTLANTADGAGNYL 135 + L +N+ L + + A +G SDSD ++ +IQ + + ++N G + Sbjct: 77 TEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKV 136 Query: 136 FAGFQPTTQPFSNKPGGGVTY------AGDYGARTVQIADTRTVSQGDNGASVFMSVPFL 189 + G +T G + + + GD +S + Sbjct: 137 LSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYD 196 Query: 190 GSLPVPAAGASNTGTGTIGAVSITNPSDPTNTHQFTITFGGTAAAPTYTVTDNTVTPPTT 249 + +G + + T A T D T +T Sbjct: 197 TYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKST 256 Query: 250 TAAQPYSSGQGINLGGQTVAVSGTPAVGDTFTVTPAPQAGADVFATLDTVIAALKTPVGN 309 + G GG+ V T G D + T I K + Sbjct: 257 AGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTK----TGNDGNGKVSTTINGEKVTLTV 312 Query: 310 SPSASTALTNTLATTSTKLMNTMTNVLTVQASVGGRLQEVKAMQSVTSTNSLQTTNSLSN 369 + + A AT + + V E + + + N+++ + ++ Sbjct: 313 ADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITV 372 Query: 370 LTATNLPAAISQFLQLQNSLSAAQKA 395 A A + L K Sbjct: 373 NGAEYTANAAGDKVTLAGKTMFIDKT 398
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 29.4 bits (66), Expect = 0.008 Identities = 18/63 (28%), Positives = 30/63 (47%), Gaps = 2/63 (3%) Query: 110 YVQQGMMPVTAGLVVASAVLISEASNRSALQWGITAAVAAL-AYRTRLHPLWLLAGGALA 168 Y G++ T GL +A+LI E + + G A L A R RL P+ + + + Sbjct: 925 YFMVGLL-TTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFIL 983 Query: 169 GLV 171 G++ Sbjct: 984 GVL 986
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 28.3 bits (63), Expect = 0.038 Identities = 12/59 (20%), Positives = 16/59 (27%) Query: 176 GQSLTVHALVYGVHDGRMRNLGMAVSHAEQLDATYRRAVGALSANGAHSADNDVVAADA 234 G L + L Y V G YR G + +HS D + Sbjct: 639 GTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGV 697
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 31.7 bits (71), Expect = 0.007 Identities = 33/164 (20%), Positives = 50/164 (30%), Gaps = 21/164 (12%) Query: 235 RAQRDAEAQTAQAALDHAHAERDRERRRLA--REHDTIQRHAAATRRYAETANLPSGKRV 292 Q TA A A A + A + Q A R A T +P+ V Sbjct: 199 SLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQAAIRAANTYAMPANGSV 258 Query: 293 SLKNSAREIMGL------VRRDHRDAKAALGDAVREAAQRVEPDAPVLVS-------LPG 339 + R ++ + + + DA A LG + A + L Sbjct: 259 VATAAGRGLIQVAQGAASLAQAISDAIAVLGRVLASAPSVMAVGFASLTYSSRTAEQWQD 318 Query: 340 TEVGARRRLFTLDAARLPWLPAHARAATVTWSAHGPARIAVTGP 383 + R +DAA+L P+ V +A A V P Sbjct: 319 QTPDSVRYALGMDAAKLGLPPS------VNLNAVAKASGTVDLP 356
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 55.8 bits (134), Expect = 3e-11 Identities = 50/188 (26%), Positives = 82/188 (43%), Gaps = 17/188 (9%) Query: 102 VVLVTGANRGLGRAFVEGLKAAGAKRIYAAARDPARVATPGVQPVRLDVTRAD----DVA 157 + +TGA +G+G A L + GA AA V ++ + A+ DV Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAH--IAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 158 AAA----------RDLRDVNLLVNNAGIYRTGSLVADADGGGLQAQLDTNFFGLLAMARA 207 +A R++ +++LVN AG+ R G + + +D +A N G+ +R+ Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSD-EEWEATFSVNSTGVFNASRS 126 Query: 208 FAPVLRDNGGGAIVNVLSVLSWLGVPNAGAYGISKAAAWAATNAIRNELREQRTRVLALH 267 + + D G+IV V S + + + AY SKAAA T + EL E R + Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 268 SAYIDTDM 275 +TDM Sbjct: 187 PGSTETDM 194
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 73.9 bits (181), Expect = 1e-17 Identities = 50/188 (26%), Positives = 82/188 (43%), Gaps = 10/188 (5%) Query: 9 VFITGASSGLGLALAAEYARRGATLGLVARRADALAEFAQ------RFPKATISIHPADV 62 FITGA+ G+G A+A A +GA + V + L + R +A PADV Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA----FPADV 66 Query: 63 RDADALALAASRFVAAHGCPDVVIANAGISKGAITGEGDLDAFREIMDVNYYGMIATFEP 122 RD+ A+ +R G D+++ AG+ + + + + VN G+ Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 123 FIAPMTAARRGTLVGIASVAGVRGLPGSGAYSASKAAAIKYLEALRVELRPAQVAVVTIA 182 M R G++V + S AY++SKAAA+ + + L +EL + ++ Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 183 PGYIRTPM 190 PG T M Sbjct: 187 PGSTETDM 194
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.8 bits (69), Expect = 0.007 Identities = 21/136 (15%), Positives = 41/136 (30%), Gaps = 5/136 (3%) Query: 58 ASQPQQFDPNRALQGKTPGQPVTPQAAQPAPPNTAPGQAANPSQPPLLPEPQIVEVPSSN 117 A ++ DP ++ T QPA ++ + + +VE P + Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENT 1202 Query: 118 NNGN-----GSSSASNNAADNGVAVAPKPADLTPPPAKKPQTAANGSSAPHAANNNAQAS 172 S S++ + +V P ++ P + + N NA S Sbjct: 1203 TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLS 1262 Query: 173 AAATPPKTAQAPKGAS 188 A + G + Sbjct: 1263 DARAKAQFVALNVGKA 1278
>PF04335#VirB8 type IV secretion protein Length = 227 Score = 29.4 bits (66), Expect = 0.032 Identities = 12/77 (15%), Positives = 20/77 (25%), Gaps = 8/77 (10%) Query: 278 FVSLDADDVVYQDAAAFVAGPNPLVPAASTGNETIAPGTSLYVRGYSHGE----QTRWLE 333 V + + + F NP P N T + ++ S Q Sbjct: 120 AVMVMSARPEQDRWSRFYKTDNPQSPQNILANRTD---VFVEIKRVSFLGGNVAQVY-FT 175 Query: 334 QTLRRASNDRDIDWIVV 350 + SN D + Sbjct: 176 KESVTGSNSTKTDAVAT 192
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 37.0 bits (85), Expect = 2e-04 Identities = 18/75 (24%), Positives = 32/75 (42%), Gaps = 10/75 (13%) Query: 249 DALEELVAK------RTSELEGALRQYERTTHVLQRTRRKMEQEIDERKAAQARLEHEKE 302 ++ L A+ R +ELE AL + + +E E +A +A LEH+ + Sbjct: 246 AKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQ 305 Query: 303 ----EQRRLIRRLEE 313 ++ L R L+ Sbjct: 306 VLNANRQSLRRDLDA 320
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 114 bits (286), Expect = 2e-29 Identities = 36/133 (27%), Positives = 64/133 (48%), Gaps = 1/133 (0%) Query: 47 ILIVDDEPSILSALKRLLRTARYQVVTAESGAAALDVLAAGEADLIISDMRMPGMTGAEF 106 IL+ DD+ +I + L + L A Y V + A +AAG+ DL+++D+ MP + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 107 LARAQALHPDTMRILLTGYSEIDAVVSAINEGGVYRYLNKPWDDHDLLLTVKQALEQRRL 166 L R + PD ++++ + + A E G Y YL KP+D +L+ + +AL + + Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKAS-EKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 167 RQQTARLFALTQQ 179 R + Sbjct: 125 RPSKLEDDSQDGM 137
>OMS28PORIN#OMS28 porin signature. Length = 257 Score = 31.7 bits (71), Expect = 0.007 Identities = 32/129 (24%), Positives = 55/129 (42%), Gaps = 22/129 (17%) Query: 214 AQITGPLQRVLKQSLAVAAGQAGDNVHLNRVDEIGMIMRAVNQAGLNLRSLVDDVSEQLS 273 A I P VL+ S DN L++ D+ VNQA + + +DVS +L Sbjct: 29 ANILKPQSNVLEHS------DQKDNKKLDQKDQ-------VNQALDTINKVTEDVSSKLE 75 Query: 274 GLQSASGRITAGNDDLSGRSEQAAASLEETAASMEQMTATVRNNADTATQASQLAGSTSE 333 G++ +S + ND A +++ SM M+ + + +A+ +A + Sbjct: 76 GVRESSLELVESND---------AGVVKKFVGSMSLMSDVAKGTVVASQEATIVAKCSGM 126 Query: 334 AAEKGDAVV 342 AE + VV Sbjct: 127 VAEGANKVV 135
>PF06776#Invasion associated locus B Length = 214 Score = 29.1 bits (65), Expect = 0.044 Identities = 31/143 (21%), Positives = 43/143 (30%), Gaps = 21/143 (14%) Query: 96 RPIPAHAQPAGAAPPNFPADIPL-----HKQAFRNWSGEIAVADLWTAVPATPADVVAIV 150 RP+ HA PA A PA++ + A RN A L A A Sbjct: 14 RPVTNHAVPALKAIQMGPAELSPMLASCRRLARRNG------ARLMLAGAMAIALSFGWS 67 Query: 151 NWAASNGYRARPLGHMHNWSPLTVAGNGASER-----TILVDTTTHLTAVSVDASATPAR 205 + A + G G +W GA +V ++V T + Sbjct: 68 DRADAQGAVRSVHG---DWQIRCDTPPGAKAEQCALIQSVVAEDRSNAGLTVIILKTADQ 124 Query: 206 VVAQAGVSLDTLLATLEQHGLGM 228 V L L GLG+ Sbjct: 125 KSKLMRVVAP--LGVLLPSGLGL 145
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 94.1 bits (234), Expect = 2e-24 Identities = 36/126 (28%), Positives = 60/126 (47%), Gaps = 1/126 (0%) Query: 21 RMRILLVEDDRMIADGVRKALKADGCAVDWVQDGDAALTALGGEAYDLLLLDLGLPKRDG 80 IL+ +DD I + +AL G V + + DL++ D+ +P + Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 81 IDVLRTLRARGLALPVLILTARDAVADRVKGLDAGADDYLVKPFDLDE-LAARMRALIRR 139 D+L ++ LPVL+++A++ +K + GA DYL KPFDL E + RAL Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122 Query: 140 QSGRSE 145 + S+ Sbjct: 123 KRRPSK 128
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 77.4 bits (190), Expect = 1e-17 Identities = 34/160 (21%), Positives = 61/160 (38%), Gaps = 26/160 (16%) Query: 123 STSLGSGFIISADGYILTNAHVIDGANVVTVKLTDKR-----------EYKA-KVVGTDK 170 T + SG ++ +LTN HV+D + L + A ++ Sbjct: 100 GTFIASGVVV-GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSG 158 Query: 171 QSDVAVLKIDA--------SGLPTVKIGDPAQSKVGQWVVAIGSPYGFDNTVTSGIISAK 222 + D+A++K + + + A+++V Q + G P +K Sbjct: 159 EGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMW---ESK 215 Query: 223 SRALPDENYTPFIQTDVPVNPGNSGGPLFNLNGEVIGINS 262 + + +Q D+ GNSG P+FN EVIGI+ Sbjct: 216 GKITYLKGE--AMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 125 bits (315), Expect = 4e-38 Identities = 80/208 (38%), Positives = 114/208 (54%), Gaps = 1/208 (0%) Query: 1 MARRTKEEALATRDRILDAAEHVFFEKGVSHTSLADIAQHAGVTRGAIYWHFASKSELFD 60 MAR+TK+EA TR ILD A +F ++GVS TSL +IA+ AGVTRGAIYWHF KS+LF Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 AMFDRVLLPIDELKAGT-GEPHADPLGRVREILIWCLLGAARDPQLRRVFSILFMKCEYV 119 +++ I EL+ + DPL +REILI L + + R + I+F KCE+V Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 120 ADMGPLLQRNREGMRDALRNIEADLAQGVANGQLPADLDTWRATLMLHTLVSGFVRDMLM 179 +M + Q R ++ IE L + LPADL T RA +++ +SG + + L Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 180 LPGEIDAERHAEKLVDGCFDMLRTSPAM 207 P D ++ A V +M P + Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTL 208
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 46.0 bits (109), Expect = 2e-07 Identities = 42/216 (19%), Positives = 71/216 (32%), Gaps = 36/216 (16%) Query: 100 AQLNSAKATLAKAQANLATQNALVARYKVLVAANAVSKQDYDNAVATQ-GQAAADVAAGK 158 + A L ++ L + + K + Q + N + + Q ++ Sbjct: 259 NKYVEAVNELRVYKSQLEQIESEILSAK---EEYQLVTQLFKNEILDKLRQTTDNIGLLT 315 Query: 159 AAVDTAQINLGYTDVVSPITGRV-GISQVTPGAYVQASQATLMSTVQQLDPVYVDLTQSS 217 + + + + +P++ +V + T G V ++ TLM V + D + V + Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQN 374 Query: 218 LDGLKLRQDIQSGRIK-------TEGPGAAKVTLILEDGKAYSEPGKLQFSDVTVDQTTG 270 D + Q+ IK G KV I D DQ G Sbjct: 375 KDIGFINVG-QNAIIKVEAFPYTRYGYLVGKVKNI--------------NLDAIEDQRLG 419 Query: 271 SVT--IRAI------FPNKQRVLLPGMFVRARIEEG 298 V I +I NK L GM V A I+ G Sbjct: 420 LVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455 Score = 31.0 bits (70), Expect = 0.010 Identities = 15/101 (14%), Positives = 32/101 (31%) Query: 65 VRARVDGIVLRREFTEGSDVKAGQRLYKIDPAPYIAQLNSAKATLAKAQANLATQNALVA 124 ++ + IV EG V+ G L K+ A +++L +A+ L Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158 Query: 125 RYKVLVAANAVSKQDYDNAVATQGQAAADVAAGKAAVDTAQ 165 ++ + ++ + + K T Q Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQ 199
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1269 bits (3286), Expect = 0.0 Identities = 674/1035 (65%), Positives = 820/1035 (79%), Gaps = 2/1035 (0%) Query: 1 MAKFFIDRPIFAWVIAIILMLAGVAAIFTLPIAQYPTIAPPSIQITANYPGASAKTVEDT 60 MA FFI RPIFAWV+AIILM+AG AI LP+AQYPTIAPP++ ++ANYPGA A+TV+DT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQQMSGLDNFLYMSSTSDDSGNATITITFAPGTNPDIAQVQVQNKLSLATPILPQ 120 VTQVIEQ M+G+DN +YMSSTSD +G+ TIT+TF GT+PDIAQVQVQNKL LATP+LPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 VVQQLGLSVTKSSSSFLLVLAFNSEDGSMNKYDLANYVASHVKDPISRINGVGTVTLFGS 180 VQQ G+SV KSSSS+L+V F S++ + D+++YVAS+VKD +SR+NGVG V LFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWLDPTKLTNYGLTPVDVTSAISAQNVQIAGGQLGGTPAVPGTVLQATITEATLL 240 QYAMRIWLD L Y LTPVDV + + QN QIA GQLGGTPA+PG L A+I T Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 QTPEQFGDILLKVNQDGSRVRLKDVAQIGLGGETYNFDTKYNGQPTAALGIQLATNANAL 300 + PE+FG + L+VN DGS VRLKDVA++ LGGE YN + NG+P A LGI+LAT ANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 ATAKAVRAKIDEMSAYFPHGLVVKYPYDTTPFVRLSIEEVVKTLLEGIVLVFLVMYLFLQ 360 TAKA++AK+ E+ +FP G+ V YPYDTTPFV+LSI EVVKTL E I+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NLRATIIPTIAVPVVLLGTFAIMSMVGFSINVLSMFGLVLAIGLLVDDAIVVVENVERVM 420 N+RAT+IPTIAVPVVLLGTFAI++ G+SIN L+MFG+VLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 AEEGLSPKEATRKAMGQITGALVGVALVLSAVFVPVAFSGGSVGAIYRQFSLTIVSAMVL 480 E+ L PKEAT K+M QI GALVG+A+VLSAVF+P+AF GGS GAIYRQFS+TIVSAM L Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATILKPIPQGHHEEKKGFFGWFNRTFNASRDKYHVGVHHVIKRSGRW 540 SVLVALILTPALCAT+LKP+ HHE K GFFGWFN TF+ S + Y V ++ +GR+ Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 LIIYLAVIVAVGLLFVRLPKSFLPDEDQGLMFVIVQTPSGSTQETTARTLANISDYLLTQ 600 L+IY ++ + +LF+RLP SFLP+EDQG+ ++Q P+G+TQE T + L ++DY L Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKDIVESAFTVNGFSFAGRGQNSGLVFVKLKDYSQRQSSSQKVQALIGRMFGRYAGYKDA 660 EK VES FTVNGFSF+G+ QN+G+ FV LK + +R +A+I R +D Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 LVIPFNPPSIPELGTAAGFDFELTDNAGLGHDALMAARNQLLGMAAKDP-TLRGVRPNGL 719 VIPFN P+I ELGTA GFDFEL D AGLGHDAL ARNQLLGMAA+ P +L VRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 720 NDTPQYKVDIDREKANALGVTADAIDQTFSIAWASKYVNNFLDTDGRIKKVYVQSDAPFR 779 DT Q+K+++D+EKA ALGV+ I+QT S A YVN+F+D GR+KK+YVQ+DA FR Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFID-RGRVKKLYVQADAKFR 779 Query: 780 MTPEDMNIWYVRNGSGGMVPFSAFATGHWTYGSPKLERYNGISAMEIQGQAAPGKSTGQA 839 M PED++ YVR+ +G MVPFSAF T HW YGSP+LERYNG+ +MEIQG+AAPG S+G A Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839 Query: 840 MTAMETLAKKLPTGIGYSWTGLSFQEIQSGSQAPILYAISILVVFLCLAALYESWSIPFS 899 M ME LA KLP GIGY WTG+S+QE SG+QAP L AIS +VVFLCLAALYESWSIP S Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 900 VIMVVPLGVIGALLAATLRGLENDVFFQVGLLTTVGLSAKNAILIVEFARELQQTEKMGP 959 V++VVPLG++G LLAATL +NDV+F VGLLTT+GLSAKNAILIVEFA++L + E G Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 960 IEAALEAARLRLRPILMTSLAFILGVLPLAISNGAGSASQHAIGTGVIGGMITATFLAIF 1019 +EA L A R+RLRPILMTSLAFILGVLPLAISNGAGS +Q+A+G GV+GGM++AT LAIF Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1020 MIPMFFVKVRAVFSG 1034 +P+FFV +R F G Sbjct: 1020 FVPVFFVVIRRCFKG 1034
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.1 bits (65), Expect = 0.039 Identities = 19/101 (18%), Positives = 37/101 (36%), Gaps = 4/101 (3%) Query: 76 LGAIVLGAYADRHGRKAALTLSILLMMAGTLVIAVLPTYATIGIAAPLML-VGARLMQGF 134 +G V G +D+ G K L I++ G+++ V ++ ++ I A + GA Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL 123 Query: 135 SAGGEFGSATAFLAEHVPGRRGFFSSWQVASQGLTTLLAAI 175 A E+ G S +G+ + + Sbjct: 124 VMV---VVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGM 161
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 317 bits (813), Expect = e-110 Identities = 93/320 (29%), Positives = 168/320 (52%), Gaps = 17/320 (5%) Query: 1 MEFFRIRKDIPFMRHALVFNVVSLVTFLAAVFFLFHRGLHLSVEFTGGTVIEVQYQQTAQ 60 ++ + + F R ++V +A+V GL+ ++F GGT I + Sbjct: 5 LKLVPEKTNFDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAID 64 Query: 61 LEPVRATLGTLGYADAQVQNFGTSR------NVLIRLPLK--------QGLTSAQQSDQV 106 + RA L L D + +IR+ ++ QG + ++V Sbjct: 65 VGVYRAALEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKV 124 Query: 107 MAALKAQNADVALQRVEFVGPQVGKELATDGLLALACVVIGIVIYLSFRFEWKYAVAGII 166 AL A + + + E VGP+V EL + +L + I+ Y+ RFEW++A+ ++ Sbjct: 125 ETALTAVDPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVV 184 Query: 167 ANLHDVVIILGFFAFFQWEFSLSVLAAVLAVLGYSVNESVVIFDRIRETFRRERKMTVQE 226 A +HDV++ +G FA Q +F L+ +AA+L + GYS+N++VV+FDR+RE + + M +++ Sbjct: 185 ALVHDVLLTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRD 244 Query: 227 VINHAITSTMSRTIITHTSTEMMVLSMFFFGGPTLHYFALALTVGIMFGIYSSVFVAGSL 286 V+N ++ T+SRT++T +T + ++ M +GG + F A+ G+ G YSSV+VA ++ Sbjct: 245 VMNLSVNETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNI 304 Query: 287 AMWLGIKREDLVKEKKSAHD 306 +++G+ R KEKK D Sbjct: 305 VLFIGLDRN---KEKKDPSD 321
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 83.4 bits (206), Expect = 2e-19 Identities = 47/245 (19%), Positives = 103/245 (42%), Gaps = 5/245 (2%) Query: 382 KGKGEVLTVATIQSELGDRFQITGQPTPQAAADLALLLRAGSLAAPMDIIEERTIGPSLG 441 + V + E G + G + + L A A + E ++GP + Sbjct: 91 REDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAVDPALKITSFE--SVGPKVS 148 Query: 442 ADNIRMGFHSVIWGFVAIAVFM-IAYYMLFGVVSVLGLSVNLLLLVAVLSLMQATLTLPG 500 + + S++ V I ++ + + F + +V+ L ++LL V + +++Q L Sbjct: 149 GELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTVGLFAVLQLKFDLTT 208 Query: 501 IAAIALALGMAIDSNVLINERIREELR--RGASPQIAIQEGYAHAWATILDSNVTTLIAG 558 +AA+ G +I+ V++ +R+RE L + + + + + + +TTL+A Sbjct: 209 VAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETLSRTVMTGMTTLLAL 268 Query: 559 LALLAFGSGPVRAFAIVHCLGILTSMFSAVFFSRGLVNLWYGGRKKLQSLAIGQVWRPAE 618 + +L +G +R F G+ T +S+V+ ++ +V R K + + + Sbjct: 269 VPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRNKEKKDPSDKFFSNGA 328 Query: 619 AGAAP 623 AP Sbjct: 329 QDGAP 333
>SECA#SecA protein signature. Length = 901 Score = 36.4 bits (84), Expect = 6e-04 Identities = 29/120 (24%), Positives = 44/120 (36%), Gaps = 7/120 (5%) Query: 414 DDGASLSARLYAALPFTLTAAQERVVAEIAHDLTQPHPMQRLLQGDV-----GSGKTVVA 468 + G L + A A+ +RV D Q L + + G GKT+ A Sbjct: 55 EKGEVLENLIPEAFAVVREAS-KRVFGMRHFD-VQLLGGMVLNERCIAEMRTGEGKTLTA 112 Query: 469 ALAAAQAIDAGYQAALMAPTEILAEQHARKLRGWLEPLGVSVAWLAGSLKTKDKRAALEA 528 L A G ++ + LA++ A R E LG++V + KR A A Sbjct: 113 TLPAYLNALTGKGVHVVTVNDYLAQRDAENNRPLFEFLGLTVGINLPGMPAPAKREAYAA 172
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 90.2 bits (224), Expect = 1e-22 Identities = 75/342 (21%), Positives = 127/342 (37%), Gaps = 33/342 (9%) Query: 6 IITGITGQDGAYLAQLLLDKGYVVHG-----TYRRTSSVNFWRIEELGIGAHPNLHLVEY 60 ++TG G G ++++ LL+ G+ V G Y S + R+E L P + Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVS-LKQARLELLA---QPGFQFHKI 59 Query: 61 DLTDLSASIRLLRTTGATEVYNLAAQSFVGVSFDQPVTTAEITGIGPLNLLEAIRIVNPA 120 DL D L + V+ + V S + P A+ G LN+LE R Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119 Query: 121 IRFYQASTSEMFGKVQAIPQTETTPF-YPRSPYGVAKLYAHWITVNYRESYNIFGCSGIL 179 AS+S ++G + +P + +P S Y K + Y Y + Sbjct: 120 -HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRF 178 Query: 180 FNHESPLRGR-EFVTRKITDSMAKIRLGK-LDVLELGNLDAKRDWGFAKEYVEGMWRMLQ 237 F P GR + K T +M + GK +DV G + KRD+ + + E + R+ Sbjct: 179 FTVYGP-WGRPDMALFKFTKAMLE---GKSIDVYNYGKM--KRDFTYIDDIAEAIIRLQD 232 Query: 238 ADKPDTYVLATNRTEKVRDFVGMAARAAGFKLAWEGREENEVGIDLGS------GKTIVR 291 V+ T+ + AA A +++ G +D G + Sbjct: 233 -------VIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKK 285 Query: 292 INPKFYRPAEVDLLIGDPKKARDELGWAPATTLEQLCQMMVE 333 +P +V D K + +G+ P TT++ + V Sbjct: 286 NMLPL-QPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVN 326
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 99.1 bits (247), Expect = 4e-26 Identities = 67/335 (20%), Positives = 126/335 (37%), Gaps = 49/335 (14%) Query: 3 KVLITGIGGFTGRYLARRLTQSGHDVCGI------------VHRTGV--ELEWRAHVADL 48 K L+TG GF G ++++RL ++GH V GI R + + ++ H DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 49 LDRGQLAEVFERERPDALVHLAAIAFV--AHDDASAIYQTNVVGTRNLLDALASSSHAPR 106 DR + ++F + + V + ++ A +N+ G N+L+ + + Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILE--GCRHNKIQ 119 Query: 107 SVLLASSANVYG-NTDREWIDESVPPAPANDYAVSKLSMEFVAKLWCD--RLPIVVARPF 163 +L ASS++VYG N + + P + YA +K + E +A + LP R F Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179 Query: 164 NYTGVGQAANFLLPKIVSHFRSRAPVLELGNLDVIRDFSDVRAVAAAYEKLIG------- 216 G + L K + + RDF+ + +A A +L Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADT 239 Query: 217 -----------GAFAGETFNVCSGVGYSLQDVLAMAEELTGYRPEIRVNPNFV--RANEV 263 +N+ + L D + E+ G I N + + +V Sbjct: 240 QWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG----IEAKKNMLPLQPGDV 295 Query: 264 RKLIGNGAKLRDAIG-EP---LAIPLRDTLAWMLE 294 + + L + IG P + +++ + W + Sbjct: 296 LETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 37.6 bits (87), Expect = 2e-05 Identities = 29/157 (18%), Positives = 57/157 (36%), Gaps = 6/157 (3%) Query: 103 HRNVRVIDFFFARLLLEISGATMSFTFLTIFFIIAGMMHPPENMMMILGAWLHLAVFGSG 162 + +R+ D + + A ++ + + G +++ L + + Sbjct: 105 YTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYT-QWLSLLYALPVIALTGLAFAS 163 Query: 163 LALIIGALSERSEAVERIWHTVAYL-MFPLSGSIFMVSWLPEKFQKAVLLLPMVHGTEML 221 L +++ AL+ S + T+ + LSG++F V LP FQ A LP+ H +++ Sbjct: 164 LGMVVTALA-PSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLI 222 Query: 222 RGGYF---GSLVTPHYSIRYMVFSDLILLLIGLYCVR 255 R V H + L L R Sbjct: 223 RPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 34.8 bits (80), Expect = 6e-04 Identities = 26/188 (13%), Positives = 62/188 (32%), Gaps = 16/188 (8%) Query: 172 DAQKINTELLDLGEQLVNRMNERAAKDTVSFAQRQVDAAAAKAKEAAVALAAYRNSNAVF 231 A + L L+ E+ +S + K + L V Sbjct: 128 TALGAEADTLKTQSSLLQARLEQTRYQILSRSIE-------LNKLPELKLPDEPYFQNVS 180 Query: 232 DPEKQSALQLQQVTSLQSQLFSAQTQLRQLQL-ISPQNPQISVLKNSISELEKQIKEATG 290 + E L ++ Q + Q Q Q +L + + + + I+ E + Sbjct: 181 EEEVLRLTSL-----IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235 Query: 291 GVAGGKNSLSNKAASYTR-LQLDSQFAD--KQLASALAAMETARAEAQRQQLYLERLVQP 347 + + L +A + L+ ++++ + +L + +E +E + + + Q Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295 Query: 348 NKPDIAIE 355 K +I + Sbjct: 296 FKNEILDK 303
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 58.1 bits (140), Expect = 4e-12 Identities = 67/251 (26%), Positives = 100/251 (39%), Gaps = 26/251 (10%) Query: 9 VVVTGASAGLGGALALAYAAPGVVLGLVGRDAARLDACAQACRARGAEVVVGQFDVRDAE 68 +TGA+ G+G A+A A+ G + V + +L+ + +A DVRD+ Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70 Query: 69 --RAQAWLWAFDDAHPIDLLIANAGV--ASTLASASDWEELERTASVVDTNFYGALHAVL 124 + PID+L+ AGV + S SD EE E T SV N G +A Sbjct: 71 AIDEITARIE-REMGPIDILVNVAGVLRPGLIHSLSD-EEWEATFSV---NSTGVFNASR 125 Query: 125 PAVARMRPRGRGRIAMVSSLAALRGMAISPAYCASKAAIKAYADSVRPLLARDGVGMSVI 184 M R G I V S A AY +SKAA + + LA + +++ Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 185 LPGFVKTAMSDVFPGDKPFLWSADRAAAH-IRAKLAAGRAEIAFPGLLALGMRVLAFLPA 243 PG +T M LW+ + A I+ L + I L P+ Sbjct: 186 SPGSTETDMQWS-------LWADENGAEQVIKGSLETFKTGIPLKKLAK---------PS 229 Query: 244 ALADAILGRLS 254 +ADA+L +S Sbjct: 230 DIADAVLFLVS 240
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 37.3 bits (86), Expect = 6e-04 Identities = 33/162 (20%), Positives = 59/162 (36%), Gaps = 9/162 (5%) Query: 2138 LVVGGTGGLGFASARWMVSRGARHLTLASRGGALAEPLCDEVERWRSELGVATHVVACDA 2197 + G G+G A AR + S+GA H+ E +V D Sbjct: 12 FITGAAQGIGEAVARTLASQGA-HIAAVDYNPEKLE----KVVSSLKAEARHAEAFPADV 66 Query: 2198 TDAAALARTMGEIDARGTPLKGVLHSAMHIDDGLVRNLDDERFAAVLAPKVAGAWNLHRA 2257 D+AA+ I+ P+ +++ A + GL+ +L DE + A + G +N R+ Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 2258 T----RERALDFFVVYSSATTYLGNPGQASYVAANSFVEALV 2295 +R V S + A+Y ++ + Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFT 168
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 58.3 bits (141), Expect = 3e-11 Identities = 78/413 (18%), Positives = 127/413 (30%), Gaps = 60/413 (14%) Query: 165 LVIDGFDAQAMGY---VAPSVIAEWGVKKQA---LGPVFSASLFGMLLGALGLSVLADRI 218 L DA +G V P ++ + G + + A L L+DR Sbjct: 11 LSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF 70 Query: 219 GRRPVLIGATLFFALTMLATPFATSIPTLIALRFVTGLGLGCIMPNAMALVGECSPGAHR 278 GRRPVL+ + A+ A + L R V G+ G A A + + + G R Sbjct: 71 GRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDER 129 Query: 279 VKRM----MIVSCGFTAGAALGGFVSAALIPAFGWRAVFFVGGAVPLALAAAMAARLPES 334 + G AG LGG + F A FF A+ L Sbjct: 130 ARHFGFMSACFGFGMVAGPVLGGLMG-----GFSPHAPFFAAAALNG-LNFLTG------ 177 Query: 335 PQLLVLRGRHDAARAWLAKFAPQLAVSPDTRLVVREAGPQGAPVAELFRSGRASVTLLLW 394 +L H R + + A++P + G V L+ Sbjct: 178 --CFLLPESHKGER----RPLRREALNPLASFR--------------WARGMTVVAALMA 217 Query: 395 AINFMNLIDLYFLSNWLPTVMRDAGYASGTAVIVGTVLQTGGVIGTLS----LGRFIERY 450 M L+ + W+ + A +G L G++ +L+ G R Sbjct: 218 VFFIMQLVGQVPAALWVIFGEDRFHW---DATTIGISLAAFGILHSLAQAMITGPVAARL 274 Query: 451 GFVRVLFACFACAAIAVGLIGSVAHAFYWLLAAVFVGGFCVVGGQPAVNALAGHYYPTSL 510 G R L L+ + V + + G PA+ A+ Sbjct: 275 GERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGI--GMPALQAMLSRQVDEER 332 Query: 511 RSTGIGWSLGVGRVGSVLGPLVGGQLIA--------LGWSNDALFHAAAVPVL 555 + G + + S++GPL+ + A W A + +P L Sbjct: 333 QGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPAL 385
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.9 bits (101), Expect = 1e-06 Identities = 84/356 (23%), Positives = 139/356 (39%), Gaps = 25/356 (7%) Query: 27 LILSVAVVGLGTGATLPLTALALTEAGHGTRVV---GMLTAAQAGGGLVVVPFVAAITKR 83 ++ +VA+ +G G +P+ L + H V G+L A A P + A++ R Sbjct: 10 ILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDR 69 Query: 84 LGGRQVIVASVIALAAATALMQFTSSLVVWGVLRVVCGAALMLLFTIGEAWVNQLADDAT 143 G R V++ S+ A A+M L V + R+V G G A++ + D Sbjct: 70 FGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDE 128 Query: 144 RGRVVAIYATNFTLFQMAGPVLVSQIAGMT-HARFALCGALFLLAL--------PSLASI 194 R R + F +AGPVL + G + HA F AL L S Sbjct: 129 RARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGE 188 Query: 195 RKTPIADEPHHDAHDLWTRVMPKMPALVVGTAFFALFDTLALSLLPLFAMAR--GVASEA 252 R+ + + A W R M + AL+ L + +L +F R A+ Sbjct: 189 RRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTI 248 Query: 253 AVLLAAILLFGDTAMQFPIGWLADKLGRERVHIGAGCVVLALLPLMPVVVATPWLCWPLL 312 + LAA + A G +A +LG ER + G + ++ W+ +P++ Sbjct: 249 GISLAAFGILHSLAQAMITGPVAARLG-ERRALMLGMIADGTGYILLAFATRGWMAFPIM 307 Query: 313 FVLGAAAGSVYTL----SLVACGERFRGSALVTASSLVSASWSAASFGGPLVAGAL 364 +L + + L S ER +G + ++L S S GPL+ A+ Sbjct: 308 VLLASGGIGMPALQAMLSRQVDEER-QGQLQGSLAALT----SLTSIVGPLLFTAI 358
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 52.1 bits (125), Expect = 1e-09 Identities = 98/399 (24%), Positives = 150/399 (37%), Gaps = 37/399 (9%) Query: 5 LFALAVAAFGIGTTEFVIMGLLPNVARDLGVSIPAA---GMLVSGYALGVTIGAPILAVV 61 L +A+ A GIG +IM +LP + RDL S G+L++ YAL AP+L + Sbjct: 11 LSTVALDAVGIG----LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 62 TARMPRKAALLALIGVFIVGNLFCAIAPGYATLMIARVVTAFCHGAFFGIGSVVASSLVA 121 + R R+ LL + V A AP L I R+V G+ +A + Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITD 125 Query: 122 PNKRAQAIALMFTGLTLANVLGVPLGTALGQAFGWRATFWAVTAIGALAAAALAFCVPKR 181 ++RA+ M V G LG +G F A F+A A+ L F +P+ Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 182 LEMPAAGIAREFGVLRNPQVLMVLGISVLASASLFTVFTYITPI-----------LEDVT 230 + + RE NP + A+L VF + + ED Sbjct: 185 HKGERRPLRRE---ALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241 Query: 231 GFTPRQVTLVLLLFG-LGLTVGGTVGGRLADW---RRMPSLVATLASIGIVLAAFAGTMR 286 + + + L FG L + G +A RR L G +L AFA Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 287 APLPALVTIFAWGVLAFAIVPPLQILIVDRAS-HAPNLASTLNQGAFNLGNALGAWLGGT 345 P +V + + G+ +P LQ ++ + +L + +G L Sbjct: 302 MAFPIMVLLASGGI----GMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357 Query: 346 AIHAGVPLAK-LPW-AGAAL---AMAALALTLWSASLER 379 A + W AGAAL + AL LWS + +R Sbjct: 358 IYAASITTWNGWAWIAGAALYLLCLPALRRGLWSGAGQR 396
>SECA#SecA protein signature. Length = 901 Score = 33.7 bits (77), Expect = 0.001 Identities = 27/85 (31%), Positives = 41/85 (48%), Gaps = 5/85 (5%) Query: 116 LNRRLPRAVARTREGDFSLNGLLGFDLFGKTIGVIGTGLIGSVFARIMTGFGMRVLAHSL 175 L +P A A RE + G+ FD+ + +G G L A + TG G + L +L Sbjct: 60 LENLIPEAFAVVREASKRVFGMRHFDV--QLLG--GMVLNERCIAEMRTGEG-KTLTATL 114 Query: 176 PPHDDALIALGVRYVPLDALLAESD 200 P + +AL GV V ++ LA+ D Sbjct: 115 PAYLNALTGKGVHVVTVNDYLAQRD 139
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 76.5 bits (188), Expect = 3e-17 Identities = 71/357 (19%), Positives = 132/357 (36%), Gaps = 46/357 (12%) Query: 12 RLILLLGALAACGPIATDMYLPSLPAIADGFGVTAAAAQRTLTSFMAGFSIGMLLYGPLS 71 ++++ L L+ + + SLP IA+ F A+ T+FM FSIG +YG LS Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73 Query: 72 DTYGRRPVLLGGIALFTLASIGCFVATS-IDMLIVVRFLQAFGAGAASVLARAIARDAHE 130 D G + +LL GI + S+ FV S +LI+ RF+Q GA A L + Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133 Query: 131 PSDAAKVLSMVAIVTAIGPLLAPLIGGQVLRFSGWRGVFVVLTLFGAVCATAAFLRVPET 190 + K ++ + A+G + P IGG + + W + ++ + L E Sbjct: 134 KENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEV 193 Query: 191 WPREK--RASSAVLNSFAAYGRILADPVAWGHM--------------------------- 221 + +++ + + + + Sbjct: 194 RIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLG 253 Query: 222 ---------LCGGMAFASMFAYITATPFVYIDYFHVSPQHYGLLFGLNV-VGIMIGNFLN 271 LCGG+ F ++ +++ P++ D +S G + + ++I ++ Sbjct: 254 KNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIG 313 Query: 272 TRLVGRVGSLKIIAGASLLSGAASFCVAFFALTGLGGLWSIVASLFFVVSVVGILSA 328 LV R G L ++ + +F T + +V V+G LS Sbjct: 314 GILVDRRGPLYVLNIGVTFLSVSFLTASFLLET------TSWFMTIIIVFVLGGLSF 364
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.1 bits (62), Expect = 0.039 Identities = 14/32 (43%), Positives = 18/32 (56%) Query: 29 VVVVCGPSGSGKSTLIKTINGLEPFQKGSITV 60 VV+ G G GKSTLI T+ GL+ F + Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDI 629
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 29.0 bits (65), Expect = 0.017 Identities = 17/70 (24%), Positives = 26/70 (37%), Gaps = 17/70 (24%) Query: 64 TPLLVQMFVVYYGLPDIGISLDPTSAGIFTLTLNAGAYLSESMRGAILGIGR--GQWAAS 121 P Q + +GLP T+ G L++ R GIG+ G A Sbjct: 392 KPRFFQ-STLLHGLPA-------------GWTIYGGTQLADRYRAFNFGIGKNMGALGAL 437 Query: 122 HSLGLTHAQT 131 S+ +T A + Sbjct: 438 -SVDMTQANS 446
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 29.7 bits (66), Expect = 0.025 Identities = 27/97 (27%), Positives = 41/97 (42%), Gaps = 6/97 (6%) Query: 80 SGDAPSAAQIKGPLIQEWADQGVLVNIDAAAGDWKQNLPPEIDKIIKYKGNTVAAPFSVH 139 +GD P +A G+L I ++ L P ++Y G +A P +V Sbjct: 79 TGDGPDIIFWAHDRFGGYAQSGLLAEITPDKA-FQDKLYPFTWDAVRYNGKLIAYPIAVE 137 Query: 140 RVNWLYINKAALDKIGAKPPATWPEFFQIADKLKAAG 176 ++ +Y NK L PP TW E + +LKA G Sbjct: 138 ALSLIY-NKDLL----PNPPKTWEEIPALDKELKAKG 169
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 356 bits (915), Expect = e-119 Identities = 146/453 (32%), Positives = 220/453 (48%), Gaps = 46/453 (10%) Query: 196 VHVARSAHEAARRVKPDQPQAGIADL---DGFAPRELPTLEAVLRQQQVGWIALAGDARI 252 V + +A R + + D+ D A LP ++ V + ++ Sbjct: 30 VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLPV--LVMSAQNTF 87 Query: 253 NDPDVRRLIRQYCFDYMPGLPPHETIDYLVGHAYGMVALCDLDLMAGATETGDEMVGACD 312 + + +DY+P + ++G A + ++ G +VG Sbjct: 88 MTA--IKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR-RPSKLEDDSQDGMPLVGRSA 144 Query: 313 AMQQLFRMIRKVAATDATVFISGESGTGKELTALAIHERSERRKAPFVAINCGAIPNHLL 372 AMQ+++R++ ++ TD T+ I+GESGTGKEL A A+H+ +RR PFVAIN AIP L+ Sbjct: 145 AMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLI 204 Query: 373 QSELFGYERGAFTGASQRKIGRVESADGGTLFLDEIGDMPLESQASMLRFLQEGKIERLG 432 +SELFG+E+GAFTGA R GR E A+GGTLFLDEIGDMP+++Q +LR LQ+G+ +G Sbjct: 205 ESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVG 264 Query: 433 GHESIPVDVRIISATHVDLDAAMREGRFREDLYHRLCVLKLEEPPLRARDKDIEILAHHI 492 G I DVRI++AT+ DL ++ +G FREDLY+RL V+ L PPLR R +DI L H Sbjct: 265 GRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHF 324 Query: 493 LHRFRSDGARRIHGFTSCAIEAMYNYQWPGNVRELINRIRRAIVMSDSRHLSAADLDL-- 550 + + +G + F A+E M + WPGNVREL N +RR + ++ ++ Sbjct: 325 VQQAEKEG-LDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENEL 383 Query: 551 -----------------------------------APFAARQATTLAEARERAERRTIEA 575 A + E I A Sbjct: 384 RSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILA 443 Query: 576 SLLRHRNRLTEAAAELGVSRATLYRLMVSHGLR 608 +L R +AA LG++R TL + + G+ Sbjct: 444 ALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 375 bits (964), Expect = e-128 Identities = 151/474 (31%), Positives = 233/474 (49%), Gaps = 46/474 (9%) Query: 17 ADLQRCFDRHGWQVDIVDSPREMRRSAARGVIAGGLLDFSCGVGAAELRELEASLKT--P 74 L + R G+ V I + + R A G + D A +L +K P Sbjct: 17 TVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF--DLLPRIKKARP 74 Query: 75 NVGWIAMTRRGQMGDDAVRRLVRDYCFDYVTVPYECERIVESVGHAYGMVTLSEGLAPAA 134 ++ + M+ + A++ +DY+ P++ ++ +G A + + Sbjct: 75 DLPVLVMSAQNTF-MTAIKAS-EKGAYDYLPKPFDLTELIGIIGRA--LAEPKRRPSKLE 130 Query: 135 ATVRNEGEMVGTCDAMLALFKMIRKVAATDAPVFISGESGTGKELTAVAIHERSARANAP 194 ++ +VG AM +++++ ++ TD + I+GESGTGKEL A A+H+ R N P Sbjct: 131 DDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGP 190 Query: 195 FVAINCGAIPPTLLQAELFGYERGAFTGANQRKIGRIEAANGGTLFLDEIGDLPFESQAS 254 FVAIN AIP L+++ELFG+E+GAFTGA R GR E A GGTLFLDEIGD+P ++Q Sbjct: 191 FVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTR 250 Query: 255 LLRFLQEHKVERVGGHQSISVDVRIVSATHVDMQVALRNGRFREDLYHRLCVLKLEEPPL 314 LLR LQ+ + VGG I DVRIV+AT+ D++ ++ G FREDLY+RL V+ L PPL Sbjct: 251 LLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPL 310 Query: 315 RERGKDIEILARHMLERFKGDAHRRLRGFTPDAIAALHNYAWPGNVRELINRVRRAIVMS 374 R+R +DI L RH +++ + + ++ F +A+ + + WPGNVREL N VRR + Sbjct: 311 RDRAEDIPDLVRHFVQQAEKEG-LDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALY 369 Query: 375 EGRMISAADLELSGYAEVA-------------------------------------PMSL 397 +I+ +E +E+ Sbjct: 370 PQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLY 429 Query: 398 EEARESAERHAIEVALLRHRGRLADAARELGVSRVTLYRLLCAYGMRDDGSTRA 451 + E I AL RG AA LG++R TL + + G+ S+R+ Sbjct: 430 DRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSSRS 483
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 39.2 bits (91), Expect = 5e-05 Identities = 24/119 (20%), Positives = 39/119 (32%), Gaps = 3/119 (2%) Query: 780 PPIRSTPTPTHSAQPAPQPAGRAQPQPAWQTPRNEMRAPEAPRSVPRQEVAPPPAPRNEY 839 P + T A P A + P+P + PE P+ P P P P+ + Sbjct: 46 PAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP 105 Query: 840 RAPAPAPRPQVEAPRTEAPRMPAPRMEAPRMEPRPAAPPPAAPRNPPPAPRQEPPRQVR 898 + +P+ + E+ AP RP + A + P PR + Sbjct: 106 KPVKKVEQPKRDVKPVESRPASPFENTAPA---RPTSSTATAATSKPVTSVASGPRALS 161
>adhesinmafb#Neisseria meningitidis: adhesin MafB signature. Length = 467 Score = 32.0 bits (72), Expect = 0.002 Identities = 19/78 (24%), Positives = 31/78 (39%), Gaps = 2/78 (2%) Query: 78 GRAAMLWALMDGSARPAGELTM--IAGLSPSAASAHLARLADGGLLALDVRGRHRYYRIA 135 G A + ++ G+ + M IA L A + L + R + Sbjct: 243 GEALGIGDILYGTRYAIDKAAMRNIAPLPAEGKFAVIGGLGSVAGFEKNTREAVDRWIQE 302 Query: 136 SPDVAAAIEALANVAQAA 153 +P+ A +EA+ NVA AA Sbjct: 303 NPNAAETVEAVFNVAAAA 320
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 114 bits (287), Expect = 5e-32 Identities = 39/153 (25%), Positives = 67/153 (43%), Gaps = 4/153 (2%) Query: 11 TVFVVDDDEAVRDSLRWLLEANGYRVQCFSSAEQFLEAYQPAQQAGQIACLILDVRMSGM 70 T+ V DDD A+R L L GY V+ S+A AG ++ DV M Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA----AGDGDLVVTDVVMPDE 60 Query: 71 SGLELQERLIAENAALPIIFVTGHGDVPMAVSTMKKGAMDFIEKPFDEAELRKLVERMLD 130 + +L R+ LP++ ++ A+ +KGA D++ KPFD EL ++ R L Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 131 KARSESKSVQEQRAASERLSKLTAREQQVLERI 163 + + +++ L +A Q++ + Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVL 153
>PF06580#Sensor histidine kinase Length = 349 Score = 31.8 bits (72), Expect = 0.012 Identities = 18/85 (21%), Positives = 32/85 (37%), Gaps = 18/85 (21%) Query: 711 PVLIEQVLV-NLMKNAAEAMQEARPQAENGVIRVVADLESGFVDIRVIDQGPGVDEATAE 769 P ++ Q LV N +K+ + G I + ++G V + V + G + T E Sbjct: 256 PPMLVQTLVENGIKHGIA------QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE 309 Query: 770 RLFEPFYSTKSDGMGMGLNICRSII 794 S G G+ N+ + Sbjct: 310 ----------STGTGL-QNVRERLQ 323
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 37.5 bits (87), Expect = 1e-04 Identities = 17/83 (20%), Positives = 32/83 (38%), Gaps = 2/83 (2%) Query: 162 VPSPAAGVVKDIKVKVGDAVSEGSLIVVLEASGAAAA--SAPQAAAPAQAAPAPAAAPAP 219 + +VK+I VK G++V +G +++ L A GA A + A+ + Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158 Query: 220 APQAAPAPQAAPAAAPAPAASGE 242 + + P+ P E Sbjct: 159 SIELNKLPELKLPDEPYFQNVSE 181 Score = 34.8 bits (80), Expect = 0.001 Identities = 15/52 (28%), Positives = 26/52 (50%), Gaps = 2/52 (3%) Query: 49 VPSPSAGTVKEVKVKVGDAVSQGSLIVLLD--GAQAAGRPAQANGAAASAAQ 98 + VKE+ VK G++V +G +++ L GA+A Q++ A Q Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQ 150
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.0 bits (70), Expect = 0.015 Identities = 15/44 (34%), Positives = 20/44 (45%) Query: 45 SMEVPSDVAGTVKEIKVKAGDKVSQGTVIALVEASAGAAAPAKA 88 S E+ VKEI VK G+ V +G V+ + A A K Sbjct: 96 SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKT 139
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 48.1 bits (114), Expect = 1e-07 Identities = 36/156 (23%), Positives = 58/156 (37%), Gaps = 14/156 (8%) Query: 765 LVTGGNSGMGRAIGRHLVERGARVVA--------LSRRGGQSIPGLTA--IAVDVSDLDA 814 +TG G+G A+ R L +GA + A A DV D A Sbjct: 12 FITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAA 71 Query: 815 LRRAVAQIRAEHGPINGVVHSAGMPPHALLRTAADTAMRDVLAGKFLGARNLRQVLCADS 874 + A+I E GPI+ +V+ AG+ L+ + +D + G N + + Sbjct: 72 IDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYM 131 Query: 875 LD----FVVLCSSLRAHVPAAGASDYMAANLALEAL 906 +D +V S A VP + Y ++ A Sbjct: 132 MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMF 167
>PF04183#IucA / IucC family Length = 580 Score = 32.2 bits (73), Expect = 0.044 Identities = 27/187 (14%), Positives = 50/187 (26%), Gaps = 28/187 (14%) Query: 1140 EIDAVVDTVPGGAAN----VQDIYPLAPLQEGILFHHLQQT----QGDAYLLRSLLAFDT 1191 IDA + + + + + + H+Q GD LL++ Sbjct: 59 WIDAQTLRCADEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSA 118 Query: 1192 RARLDAFLAALQQVIDRH------------DILRTAACWKELSQPVQVVWRQAALHAEIF 1239 ++ LQ ++ H E + ++ W I+ Sbjct: 119 SDLINLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIW 178 Query: 1240 SPAEEGDVPAQLLKHTDPRERRLDLSRAPLFALDIARDPERDEWLLALTFHHLIADHLTL 1299 E D+ L DP+E F+ + WL L H Sbjct: 179 RCDNEMDIHQLLTAAMDPQEFA-------RFSQVWQENGLDHNWLP-LPVHPWQWQQKIA 230 Query: 1300 ELVVAEI 1306 +A+ Sbjct: 231 TDFIADF 237
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 33.5 bits (76), Expect = 0.004 Identities = 35/161 (21%), Positives = 62/161 (38%), Gaps = 11/161 (6%) Query: 828 VTGGTGALGLATARWLAGRGARHLLLISRRGEVGDGVRATCERLRGDGVDVRVVASDVAD 887 +TG +G A AR LA +GA H+ + E + V ++ L+ + +DV D Sbjct: 13 ITGAAQGIGEAVARTLASQGA-HIAAVDYNPEKLEKVVSS---LKAEARHAEAFPADVRD 68 Query: 888 EASLR---GALAAAARPIRGVVHCAGIVQDAPLATLDAAAFANVLRAKVGGAALLDRLTD 944 A++ + PI +V+ AG+++ + +L + G R Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 945 AQPLD----FFLLYSSISVAVGRHGQAAYAAANAYLDALAQ 981 +D + S V R AAYA++ A + Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTK 169
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 94.5 bits (235), Expect = 1e-23 Identities = 82/380 (21%), Positives = 131/380 (34%), Gaps = 74/380 (19%) Query: 49 AQSSVVLYGLIDTSITYASNQRTHGAGSPGSGGLAVTSGALNASRWGLRGREELGGGRSA 108 A + V LYG I + + + +GA + T S+ G +G+E+LG G A Sbjct: 17 AMADVTLYGTIKAGVETSRSVAHNGAQAASVE--TGTGIVDLGSKIGFKGQEDLGNGLKA 74 Query: 109 IFALENGFSASNGALSQKGVAMFGRQAWLGLKSKEGGALTFGRQYDLILDF--VTPLGAS 166 I+ +E S + RQ+++GLK G L GR ++ D + P + Sbjct: 75 IWQVEQKASIAGT-----DSGWGNRQSFIGLK-GGFGKLRVGRLNSVLKDTGDINPWDSK 128 Query: 167 GPGWGGNLAVHPYDNDDSNRNIRINHAVKYKSPTYRGWTFGAMYGFSNTAGQFGNNAAWS 226 G N P R I +V+Y SP + G + Y ++ AG N+ ++ Sbjct: 129 SDYLGVNKIAEP-----EARLI----SVRYDSPEFAGLSGSVQYALNDNAG-RHNSESYH 178 Query: 227 AGLSYANGPLKLGAGYLGINRNPNAANANGAVSTADGSATITGGSQQIWAIAGRY-AFGP 285 AG +Y NG + G + QI + Y Sbjct: 179 AGFNYKNGGFFVQYGGAYKRH-------------HQVQENVNIEKYQIHRLVSGYDNDAL 225 Query: 286 HSAGAAWSHSATDRVSGVLQGGGIVKLDGNALVFDNFSVDGHY-----VVTPRLSLSAAY 340 +++ A A ++ N V VTPR+S Y Sbjct: 226 YASVAVQQQDAK-------------LVEENYSHNSQTEVAATLAYRFGNVTPRVS----Y 268 Query: 341 TYTMGR-FDSRSGETRPKWNHVVAQADYAFSKRTDAYLEGVYQHVSGGNGNPAFNATIWT 399 + FD+ + ++ VV A+Y FSKRT A + + G Sbjct: 269 AHGFKGSFDATNYNND--YDQVVVGAEYDFSKRTSALVSAGWLQEGKGESK--------- 317 Query: 400 LTPSASGNQVVVALGLRHRF 419 +GLRH+F Sbjct: 318 ------FVSTAGGVGLRHKF 331
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 62.9 bits (153), Expect = 6e-13 Identities = 55/242 (22%), Positives = 101/242 (41%), Gaps = 5/242 (2%) Query: 61 IGALIFGRLADHFGRRPTLMINIACYSLLELASGFAPSLAALLVLRTLFGIAMGGEWGVG 120 A + G L+D FGRRP L++++A ++ AP L L + R + GI G V Sbjct: 58 ACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVA 116 Query: 121 SALTMETIPPRARGAVSGLLQAGYPSGYLLASVVFGLFYQYIGWRGMFMIGVLPALLVLY 180 A + R G + A + G + V+ GL + F L L L Sbjct: 117 GAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLT 176 Query: 181 VRAKVPES-PAWRQMEKRARPSLVSTLKQNWKLSIYAVVLMTAF--NFFSHGTQDLYPTF 237 +PES R+ +R + +++ + +++ A ++ F L+ F Sbjct: 177 GCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIF 236 Query: 238 LREQHHFDPHTVSW-ITIVLNVGAIAGGLTFGWLSERIGRRRAIFIAALIALPVLPLWAF 296 ++ H+D T+ + + ++A + G ++ R+G RRA+ + + L AF Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF 296 Query: 297 ST 298 +T Sbjct: 297 AT 298
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 65.8 bits (160), Expect = 7e-15 Identities = 21/83 (25%), Positives = 35/83 (42%) Query: 4 RQASRQSGGTKARILDAAEDLFIEHGFEAMSMRQITSRAAVNLAAVNYHFGSKEALIHAM 63 R+ +++ T+ ILD A LF + G + S+ +I A V A+ +HF K L + Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62 Query: 64 LSRRLDQLNEERLRILDRFDAQL 86 + E L +F Sbjct: 63 WELSESNIGELELEYQAKFPGDP 85
>cloacin#Cloacin signature. Length = 551 Score = 28.1 bits (62), Expect = 0.031 Identities = 30/83 (36%), Positives = 33/83 (39%), Gaps = 9/83 (10%) Query: 81 TGGGGRPGGREGGGHGPYGS-HGGPRESRGEGGGYGARESRGDGGYGSREPRGDG-GYGS 138 +GG GR G G H G+ +GGP G G G S G G P G G G G Sbjct: 2 SGGDGR--GHNTGAHSTSGNINGGP-----TGLGVGGGASDGSGWSSENNPWGGGSGSGI 54 Query: 139 REPRGDGGGYGSRESRGDGGYGT 161 G G G G GG GT Sbjct: 55 HWGGGSGHGNGGGNGNSGGGSGT 77
>cloacin#Cloacin signature. Length = 551 Score = 43.5 bits (102), Expect = 2e-06 Identities = 40/125 (32%), Positives = 54/125 (43%), Gaps = 11/125 (8%) Query: 38 GLDGSGSGGGNAISTTGD--GGSGSGGSGGTSGSGSGGT-------GGSGSTGGLSGGGG 88 G DG G G A ST+G+ GG G GG + GSG + GGSGS GG G Sbjct: 3 GGDGRGHNTG-AHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61 Query: 89 ST-SGGGSTSGGGSTSGGGSTSGGTSTTSSINALGTVAGNTGGIISGAGSTVSGLGTVVG 147 GG SGGGS +GG ++ AL T + AG+ + + ++ Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSISAGALSAAIADIMA 121 Query: 148 SQTLP 152 + P Sbjct: 122 ALKGP 126 Score = 35.8 bits (82), Expect = 4e-04 Identities = 40/135 (29%), Positives = 56/135 (41%), Gaps = 19/135 (14%) Query: 54 GDGGSGSGGSGGTSGSGSGGTGGSGSTGGLSGGGGSTSGGGSTS---GGGSTSGGGSTSG 110 GDG + G+ TSG+ +GG G G+ GG SG S + GGGS SG G Sbjct: 4 GDGRGHNTGAHSTSGNINGGPTG----LGVGGGASDGSGWSSENNPWGGGSGSGIHWGGG 59 Query: 111 GTSTTSSINALGTVAGNTGGIISGAGSTVSGLGTVVGSQTLPGVNPQTTQAIGGVVQSL- 169 G G SG GS G + V + G +T GG+ S+ Sbjct: 60 SGHGNG---------GGNGN--SGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVSIS 108 Query: 170 GGAVSALGSGVTSGI 184 GA+SA + + + + Sbjct: 109 AGALSAAIADIMAAL 123 Score = 35.8 bits (82), Expect = 4e-04 Identities = 34/98 (34%), Positives = 43/98 (43%), Gaps = 11/98 (11%) Query: 44 SGGGNAISTTGDGGSGSGGSGGTSGSGSGGTGGSGS--------TGGLSGGG---GSTSG 92 SGG TG + +GG +G G GG GS GG SG G G SG Sbjct: 2 SGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSG 61 Query: 93 GGSTSGGGSTSGGGSTSGGTSTTSSINALGTVAGNTGG 130 G+ G G++ GG T G S ++ A G A +T G Sbjct: 62 HGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPG 99
>PF05616#Neisseria meningitidis TspB protein Length = 501 Score = 32.4 bits (73), Expect = 0.004 Identities = 19/50 (38%), Positives = 22/50 (44%) Query: 63 NDGDTVVADQVIATIDTEAKAGAAAAAAGAAEVQPAAAPAAAPAPAAQPA 112 + G+T V QVI D + A A EV PA PA PAP P Sbjct: 299 SQGNTTVDVQVIPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPG 348
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 170 bits (432), Expect = 1e-47 Identities = 101/435 (23%), Positives = 173/435 (39%), Gaps = 62/435 (14%) Query: 5 LRNIAIIAHVDHGKTTLVDQLLRQSGTFRENQQIAE--RVMDSNDIEKERGITILAKNCA 62 + NI ++AHVD GKTTL + LL SG E + + D+ +E++RGITI + Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62 Query: 63 VEYEGTHINIVDTPGHADFGGEVERVLSMVDSVLLLVDAVEGPMPQTRFVTKKALALGLK 122 ++E T +NI+DTPGH DF EV R LS++D +LL+ A +G QTR + +G+ Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122 Query: 123 PIVVINKIDRPGARIDWV-------------INQTFDLFDKLGATE----EQLDFPIV-- 163 I INKID+ G + V I Q +L+ + T EQ D I Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGN 182 Query: 164 -----------------YASGLNGY---ASLDP-----AARDGDMRPLFEAILEHVPVRP 198 + SL P A + + L E I Sbjct: 183 DDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSST 242 Query: 199 ADPEAPLQLQITSLDYSTYVGRIGVGRITRGRIKPGQPVAMRFGPEGEVLNRKINQVLSF 258 ++ L ++ ++YS R+ R+ G + V + E E KI ++ + Sbjct: 243 HRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRI---SEKE--KIKITEMYTS 297 Query: 259 KGLERVQVESAEAGDIVLINGIEDVGIGATICAVDVPEALPMITVDEPTLTMNFLVNSSP 318 E +++ A +G+IV++ E + + + + + I P L + Sbjct: 298 INGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQ 356 Query: 319 LAGREGKFVTSRQIRDRLMKELNHNVALRVKDTGDETVFEVSGRGELHLTILVENMRRE- 377 + D L++ V E + +S G++ + + ++ + Sbjct: 357 QREMLLDALLEISDSDPLLR-------YYVDSATHEII--LSFLGKVQMEVTCALLQEKY 407 Query: 378 GYELAVSRPRVVMQE 392 E+ + P V+ E Sbjct: 408 HVEIEIKEPTVIYME 422 Score = 32.1 bits (73), Expect = 0.007 Identities = 16/100 (16%), Positives = 31/100 (31%), Gaps = 1/100 (1%) Query: 387 RVVMQEIDGVKHEPYELLTVDLEDEHQGGVMEELGRRKGEMLDMVSDGRGRTRLEYRIPA 446 V+++ EPY + E+ + + ++D L IPA Sbjct: 525 EQVLKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQL-KNNEVILSGEIPA 583 Query: 447 RGLIGFQGEFLTLTRGTGLMSHIFDSYAPVKEGSVGERRN 486 R + ++ + T G + Y V + R Sbjct: 584 RCIQEYRSDLTFFTNGRSVCLTELKGYHVTTGEPVCQPRR 623
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 84.1 bits (208), Expect = 8e-20 Identities = 54/406 (13%), Positives = 116/406 (28%), Gaps = 98/406 (24%) Query: 29 VIAIAAIAYGLYYLLVARFHETTDDAYVNGNVV------QITPQVTGTVIAVKADDTQTV 82 ++A + + + +++ + A NG + +I P V + + ++V Sbjct: 59 LVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESV 118 Query: 83 KAGDPLVVLDPADSQVALQQAEANLAQT-------------------------------- 110 + GD L+ L ++ + +++L Q Sbjct: 119 RKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178 Query: 111 ------VRQVRGLYVNDDQYRAQVALRQSDLSKAQDDL----RRRLAVAQTGAVSQEEIS 160 +R + ++ Q ++ +L K + + R V + + Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD 238 Query: 161 H----------ARDAVKAAQASLDAANQQLASNRA---------LTANTTIANHPN---- 197 A+ AV + A +L ++ L+A Sbjct: 239 DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN 298 Query: 198 -VLAAAAKVRD-----------AYLNNARNTLPAPVAGYVAKRSVQ-VGQRVSPGTPLMS 244 +L + D + + APV+ V + V G V+ LM Sbjct: 299 EILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMV 358 Query: 245 VVPLNAV-WIDANFKEVQLKHMRIGQPVEL--TADIYGSSVKYHGKVVGFSAGTGAAFSL 301 +VP + + A + + + +GQ + A Y GKV + Sbjct: 359 IVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA------ 412 Query: 302 LPAQNATGNWIKVVQRLPVRIELDPKELKDHPLRIGLSMQVDVDIK 347 G V+ + + + + M V +IK Sbjct: 413 -IEDQRLGLVFNVIISIEENCLST----GNKNIPLSSGMAVTAEIK 453
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 135 bits (342), Expect = 5e-37 Identities = 84/396 (21%), Positives = 159/396 (40%), Gaps = 16/396 (4%) Query: 27 VFMNVLDTSIANVAIPTISGDLGVSSDQGTWVITSFAVANAISVPLTGWLTDRIGQVRLF 86 F +VL+ + NV++P I+ D WV T+F + +I + G L+D++G RL Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82 Query: 87 LASIILFVISSWMCGLAPT-LPFLLASRVLQGAVAGPMIPLSQALLLSSYPRAKAPMALA 145 L II+ S + + + L+ +R +QGA A L ++ P+ A Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142 Query: 146 LWSMTTLIAPVAGPILGGWISDNYSWPWIFYVNIPVGIAAAAVTWMIYRSRESAVRRAPI 205 L + GP +GG I+ W ++ IP+ I V +++ ++ + Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPM-ITIITVPFLMKLLKKEVRIKGHF 199 Query: 206 DGVGLALLVIWVGSLQIMLDKGKDLDWFASTTIVVLALTALIAFAFFVVWELTAEHPVVD 265 D G+ L+ + G + ML F ++ + + ++++F FV P VD Sbjct: 200 DIKGIILMSV--GIVFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249 Query: 266 LSLFRMRNFSGGTIALSVGYGLYFGNLVLLPLWLQTQIGYTATDAG-LVMAPVGFFAILL 324 L + F G + + +G G + ++P ++ + + G +++ P I+ Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309 Query: 325 SPLTGKFLSRTDPRYIATAAFLTFALCFWMRSRYTTGVDEWSLMAPTFVQGIAMAGFFIP 384 + G + R P Y+ ++ F S + + FV G ++ Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTV 368 Query: 385 LVSITLSGLPGHRIPAASGLSNFVRIMCGGIGTSIF 420 + +I S L A L NF + G G +I Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIV 404
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 73.4 bits (180), Expect = 2e-15 Identities = 68/277 (24%), Positives = 101/277 (36%), Gaps = 76/277 (27%) Query: 483 VMGHVDHGKTSLLDYIRRAKVAAGEAG------------------GITQHIGAYHVETPR 524 V+ HVD GKT+L + + A E G GIT G + Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67 Query: 525 GVVTFLDTPGHEAFTAMRARGAKATDIVILVVAADDGVMPQTKEAISHAKAGGVPIVVAI 584 V +DTPGH F A R D IL+++A DGV QT+ + G+P + I Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127 Query: 585 NKIDKPEANPDRVKQE----LVAEGVV-----------------PEEYG----------- 612 NKID+ + V Q+ L AE V+ E++ Sbjct: 128 NKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLE 187 Query: 613 ----GDSP-----------------FVPV---SAKTGVGIDDLLENVLLQAEVLELKAPV 648 G S PV SAK +GID+L+E + + Sbjct: 188 KYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVIT--NKFYSSTHRG 245 Query: 649 ESPAKGIVIEAKLDKGKGPVATVLVQSGTLNRGDVVL 685 +S G V + + + + +A + + SG L+ D V Sbjct: 246 QSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVR 282
>SHIGARICIN#Ribosome inactivating protein family signature. Length = 289 Score = 29.8 bits (67), Expect = 0.014 Identities = 10/29 (34%), Positives = 16/29 (55%) Query: 227 EALAAGIREGMGIGVLPLYSAIAGLRHGD 255 + A IRE + +G+ L SAI L + + Sbjct: 138 QIAAGKIRENIPLGLPALDSAITTLFYYN 166
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 55.2 bits (133), Expect = 8e-11 Identities = 26/176 (14%), Positives = 66/176 (37%), Gaps = 14/176 (7%) Query: 58 PVRDNQFVKKGDLIMQIDPSHYQIAVEQAQAAVAARRAELQMRRADAARRADLDALVVSK 117 + Q + K ++ Q + +A + +++L+ ++ A + +V++ Sbjct: 242 SLLHKQAIAKHAVLEQ------ENKYVEAVNELRVYKSQLEQIESEILS-AKEEYQLVTQ 294 Query: 118 ESRENSMQTASSADAQYQQALAALDAAKLNLERTRVVAPVDGYVTNLQVF-KGDYATAGQ 176 + + L + + + + APV V L+V +G T + Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354 Query: 177 AKLAIV-DSHSFWVYGYFEETKLPRVKIGAKAEMRLMS-----GGVLKGHVESISR 226 + IV + + V + + + +G A +++ + G L G V++I+ Sbjct: 355 TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINL 410 Score = 45.6 bits (108), Expect = 1e-07 Identities = 21/115 (18%), Positives = 48/115 (41%), Gaps = 8/115 (6%) Query: 46 VAPDVSGAVVDLPVRDNQFVKKGDLIMQIDPSHYQIAVEQAQAAVAARRAELQMRRA--D 103 + P + V ++ V++ + V+KGD+++++ + + Q+++ R E + Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158 Query: 104 AARRADLDALVVSKE------SRENSMQTASSADAQYQQALAALDAAKLNLERTR 152 + L L + E S E ++ S Q+ +LNL++ R Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKR 213
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.3 bits (97), Expect = 5e-06 Identities = 34/150 (22%), Positives = 51/150 (34%), Gaps = 5/150 (3%) Query: 261 LIDRFHLSVQAAQIHLFVFLAAVAAGTIIGGPVG----DRIGRKYVIWTSILGVAPFTLM 316 L+ S H + LA A PV DR GR+ V+ S+ G A + Sbjct: 31 LLRDLVHSNDVTA-HYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAI 89 Query: 317 LPYANLFWTTVLTIVIGVVLASAFAAIIVYGQELIPGKVGTVAGLFFGLSFGLGGVGAAV 376 + A W + ++ + + A Y ++ G F FG G V V Sbjct: 90 MATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPV 149 Query: 377 LGQLADATSIAFVYKVCSFLPLIGVLTVFL 406 LG L S + + L + LT Sbjct: 150 LGGLMGGFSPHAPFFAAAALNGLNFLTGCF 179 Score = 37.1 bits (86), Expect = 1e-04 Identities = 63/294 (21%), Positives = 110/294 (37%), Gaps = 19/294 (6%) Query: 51 LILAIYPMLKSEFSLS---FAQIGLITLTYQITASLLQPVIGLYTDKRPQPFSLPVGMGF 107 LI+ + P L + S A G++ Y + PV+G +D+ + L V + Sbjct: 23 LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAG 82 Query: 108 TLTGLLLMAFAPTFPFLLVAAALVGCGSSVFHPESSRVARMASGGRH----GLAQSLFQV 163 +MA AP L + + G + + +A + G G + F Sbjct: 83 AAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGF 142 Query: 164 GGNAGSSLGPLLAALIVIPHGQRSIAWFSAAALVAIFVLVQIGRWYQRHPAARKKAAHAA 223 G AG LG L+ PH +F+AAAL + L + H R+ A Sbjct: 143 GMVAGPVLGGLMGG--FSPH----APFFAAAALNGLNFLTGCFLLPESHKGERRPLRREA 196 Query: 224 HPTLSRRQIGLALGVLVMLVFSKYFYLASINSY----FTFYLIDRFHLSVQAAQIHLFVF 279 L+ + + V+ L+ +F + + + + DRFH I L F Sbjct: 197 LNPLASFRWARGMTVVAALMAV-FFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAF 255 Query: 280 -LAAVAAGTIIGGPVGDRIGRKYVIWTSILGVAPFTLMLPYANLFWTTVLTIVI 332 + A +I GPV R+G + + ++ ++L +A W +V+ Sbjct: 256 GILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVL 309 Score = 31.7 bits (72), Expect = 0.006 Identities = 42/173 (24%), Positives = 65/173 (37%), Gaps = 5/173 (2%) Query: 30 TVYPVLGAISFSHLLNDMIQSLILAIYPMLKSEFSLSFAQIGLITLTYQITASLLQPVI- 88 TV L A+ F L + + + I+ + F IG+ + I SL Q +I Sbjct: 210 TVVAALMAVFFIMQLVGQVPAALWVIFG--EDRFHWDATTIGISLAAFGILHSLAQAMIT 267 Query: 89 GLYTDKRPQPFSLPVGMGFTLTGLLLMAFAPTFPFLLVAAALVGCGSSVFHPESSRVARM 148 G + + +L +GM TG +L+AFA L+ G + ++R Sbjct: 268 GPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQ 327 Query: 149 ASGGRHGLAQSLFQVGGNAGSSLGPLLAALIVIPHGQ--RSIAWFSAAALVAI 199 R G Q + S +GPLL I AW + AAL + Sbjct: 328 VDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLL 380
>PF07824#Type III secretion chaperone Length = 120 Score = 27.2 bits (60), Expect = 0.035 Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 7/67 (10%) Query: 135 ALAAQLKAACTRFVDEEGETLNARFRLRGARVHEGLIVSGDRFVSSEREVRALRDALPDA 194 AL+ D+EG +L AR L G E + V+ + ++S R L D Sbjct: 60 ALSLNYSEKICLATDDEGGSLIARLDLTGINEFEDIYVNTEYYISRVRW-------LKDE 112 Query: 195 LAVEMEG 201 A M+G Sbjct: 113 FARRMKG 119
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 33.2 bits (76), Expect = 5e-04 Identities = 22/123 (17%), Positives = 38/123 (30%), Gaps = 24/123 (19%) Query: 6 LKIALFGATGMIGSRIAAEAARRGHQVTAL-------------SRNPAASAANVSAKAAD 52 +K + GA G IG ++ GHQV + +R + D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 53 LFDPASIAAALDG---QDVVASA------YGPKQEDASKVVAVAKAL--VEGARKAGVKR 101 L D + + V S Y + A + L +EG R ++ Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 102 VVV 104 ++ Sbjct: 121 LLY 123
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 48.7 bits (116), Expect = 4e-08 Identities = 26/149 (17%), Positives = 57/149 (38%), Gaps = 16/149 (10%) Query: 212 AARGEMPVVLNALGTVTPLANV-TVRTQLSGYLQSVSFQEGQIVKQGDVLAQIDPRP--- 267 + G++ +V A G +T ++ + ++ + +EG+ V++GDVL ++ Sbjct: 75 SVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA 134 Query: 268 ----YQISLANAQGALARDEALLATARLDLKRYQTLLAQ---DSIAKQTADTQASLVKQY 320 Q SL A+ R + L + L+ L + +++++ SL+K+ Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKE- 193 Query: 321 EGTVQIDRAAVDSAKLNLAYARITAPVSG 349 Q + L + A Sbjct: 194 ----QFSTWQNQKYQKELNLDKKRAERLT 218 Score = 37.1 bits (86), Expect = 2e-04 Identities = 31/182 (17%), Positives = 59/182 (32%), Gaps = 26/182 (14%) Query: 269 QISLANAQGALARDEALLAT--ARLDLKRYQTLLAQDSIAKQTADTQASLVKQY-EGTVQ 325 + ++ + L ++L+ + L A++ T + ++ + + T Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310 Query: 326 ID--RAAVDSAKLNLAYARITAPVSGRV-GLRQVDPGNYVTPSDA--------NGIVVIT 374 I + + + I APVS +V L+ G VT ++ + + V Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA 370 Query: 375 QLQPMSVIFTTSEDNLPAILKQVNAGGKLSVTAYNRNNTVPLE-TGALNTLDNQIDTATG 433 +Q + F AI+K V A+ L LD D G Sbjct: 371 LVQNKDIGFINVG--QNAIIK---------VEAFPYTRYGYLVGKVKNINLDAIEDQRLG 419 Query: 434 TV 435 V Sbjct: 420 LV 421
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 799 bits (2064), Expect = 0.0 Identities = 287/1034 (27%), Positives = 498/1034 (48%), Gaps = 29/1034 (2%) Query: 4 SRIFILRPVGTALLMAAIMLAGLVALRFLPLAALPEVDYPTIQVQTFYPGASPEVMTSSV 63 + FI RP+ +L +M+AG +A+ LP+A P + P + V YPGA + + +V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 64 TAPLERQFGQMPSLNQMSSQS-SAGASVITLQFSLDLPLDIAEQEVQAAINAAGNLLPSD 122 T +E+ + +L MSS S SAG+ ITL F DIA+ +VQ + A LLP + Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 123 LPAPPIYAKVNPADAPVITLAVTSKTLPLTQ--VQDLADTRLAMKISQVSGVGLVSLSGG 180 + I + + ++ S TQ + D + + +S+++GVG V L G Sbjct: 122 VQQQGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 NRPAVRIQANPLALASYGLNLDDLRTTISNLNVNTPKGNFDGP------TRAYTINANDQ 234 A+RI + L Y L D+ + N G G +I A + Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 235 LTSADQYNDAVV-AYKNGRPVMLTDVAKIVAGSENTKLGAWVDAEPAIILNVQRQPGANV 293 + +++ + +G V L DVA++ G EN + A ++ +PA L ++ GAN Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 294 IQTVDNIKAILPKLQESLPAALDVQIVTDRTTMIRAAVRDVQFELGLAVALVVLVMYLFL 353 + T IKA L +LQ P + V D T ++ ++ +V L A+ LV LVMYLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 354 ANVYATIIPSLSVPLSLIGTLAVMYLSGFSLNNLSLMALTIATGFVVDDAIVMIENIARY 413 N+ AT+IP+++VP+ L+GT A++ G+S+N L++ + +A G +VDDAIV++EN+ R Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 414 -VEEGDSALEAALKGSKQIGFTIISLTVSLIAVLIPLLFMGDVVGRLFHEFAITLAVTIV 472 +E+ EA K QI ++ + + L AV IP+ F G G ++ +F+IT+ + Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 473 ISAIVSLTLVPMMCAKLLRHTPPPESH---RFEAKVHGLIDRVIARYGVALEWVLDRQRS 529 +S +V+L L P +CA LL+ F + D + Y ++ +L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 530 TLVVAVLTLALTALLYVVIPKGFFPTQDTGVIQAITQAPQSVSYGAMAERQQALAAEILK 589 L++ L +A +L++ +P F P +D GV + Q P + + + LK Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 590 N--PDVVSLTSFIGVDGSNITLNSGRMLINLKPRDDRS---ESASDVIRSLQQQVAAVTG 644 N +V S+ + G S N+G ++LKP ++R+ SA VI + ++ + Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 645 ISLYMQPVQDLTIDSTVSPTQYQFMLTS---PNPDEFATWVPKLVDRLKQE-RSLADVAT 700 + P I + T + F L D +L+ Q SL V Sbjct: 660 GFVI--PFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 701 DLQNNGKSVYIEIDRASAARFGITPATVDNALYDAYGQRIVSTIFTQSNQYRVILESEPQ 760 + + +E+D+ A G++ + ++ + A G V+ + ++ ++++ + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 761 MQHYTDSLNGIYLPSAGGGQVPLSAIATFHERPAPLLVSHLSQFPAATISFNLAPGASLG 820 + + ++ +Y+ SA G VP SA T H + + P+ I APG S G Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 821 EAVKAIDAAERELGLPASFQTRFQGAALAFQASLSNQLFLILAAIVTMYIVLGVLYESYI 880 +A+ ++ +L PA + G + + S + L+ + V +++ L LYES+ Sbjct: 838 DAMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 881 HPITILSTLPSAGVGALLALMITGHDLDIIGIIGIVLLIGIVKKNAIMMIDFALEAERVE 940 P++++ +P VG LLA + D+ ++G++ IG+ KNAI++++FA + E Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 941 GKPPREAIYQACLLRFRPILMTTLAALLGAVPLIAGSGAGSELRQPLGIAIAGGLIVSQV 1000 GK EA A +R RPILMT+LA +LG +PL +GAGS + +GI + GG++ + + Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015 Query: 1001 LTLFTTPVIYLGFD 1014 L +F PV ++ Sbjct: 1016 LAIFFVPVFFVVIR 1029
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 754 bits (1948), Expect = 0.0 Identities = 278/1104 (25%), Positives = 507/1104 (45%), Gaps = 100/1104 (9%) Query: 3 LSRPFITRPVATTLLALGIALAGLFAFIKLPVSPLPQVDFPTILVQASLPGASPETVATS 62 ++ FI RP+ +LA+ + +AG A ++LPV+ P + P + V A+ PGA +TV + Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 63 VTSPLERHLGSIADVAEMTSMS-SVGNARIVLQFNLNRDIDGAARDVQAAINAARADLPA 121 VT +E+++ I ++ M+S S S G+ I L F D D A VQ + A LP Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 122 SLKSNPTYRKVNPADSPIMVVSLTS--KTASPAKLYDAASTVLQQSLSQIDGIGQVSLSG 179 ++ + S +MV S + + D ++ ++ +LS+++G+G V L G Sbjct: 121 EVQ-QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179 Query: 180 SANPAVRVELEPQALFHYGIGLEDVRAALASANANSPKGAIETGP------HHYQLYTND 233 + A+R+ L+ L Y + DV L N G + P + + Sbjct: 180 AQY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238 Query: 234 QASKAAQYKDLVI-AYRNNAAVSLSDVSSVVDSVEDLRNLGLMNGERAVLVILYRSPGAN 292 + ++ + + + + V L DV+ V E+ + +NG+ A + + + GAN Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298 Query: 293 IIDTIERVKAALPQLTAALPADIQVTPVLDRSRTIRASLADTEHTLLIAVSLVVMVVFLF 352 +DT + +KA L +L P ++V D + ++ S+ + TL A+ LV +V++LF Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358 Query: 353 LRNWRATLIPSVAVPISIVGTFGAMYLLGFSLNNLSLMALIVATGFVVDDAIVVLENIAR 412 L+N RATLIP++AVP+ ++GTF + G+S+N L++ +++A G +VDDAIVV+EN+ R Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418 Query: 413 HI-ENGTPRLQAAFDGAREVGFTVLSISLSLVAVFLPILLMGGIVGRLFREFALTLSLAI 471 + E+ P +A ++ ++ I++ L AVF+P+ GG G ++R+F++T+ A+ Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478 Query: 472 AVSLVVSLTLTPMMCARLL--PEAHDPREEGHVARWLERGFEWMQRGYERTLSWALRHPF 529 A+S++V+L LTP +CA LL A +G W F+ Y ++ L Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538 Query: 530 TVLMTLVATIALNIALYIVVPKGFFPQQDTGLMIGGIQADQTTSFQAMKLKFTEMMRIVR 589 L+ +A + L++ +P F P++D G+ + IQ + + + ++ Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598 Query: 590 ENP-----NVANVAGFT-GGAQTNSGFMFVALKDKPQR---KLSADQVIQQLRPRLAEVA 640 +N +V V GF+ G N+G FV+LK +R + SA+ VI + + L ++ Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658 Query: 641 GARTFLQAAQDIRAGGRQSNAQYQFT-LLGDSTAELYKWAP-ILTEALQKRPELADVNSD 698 I G + ++ G L + +L A Q L V + Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 699 QQQGGLEAMVTIDRATAARLGIKPAQIDNTLYDAFGQRQVSTIYNPLNQYHVVMEVAPQY 758 + + + +D+ A LG+ + I+ T+ A G V+ + + ++ ++ Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 759 WQSPEMLKQIYISTSGGSASGAQTTNAAAGTYVAATARASTAGAAAQSAAAIAADSARNQ 818 PE + ++Y+ ++ G + +V + R R Sbjct: 779 RMLPEDVDKLYVRSANGEM--VPFSAFTTSHWVYGSPRLE-----------------RYN 819 Query: 819 ALNSIASSG--KSGASSGAAVSTSKSTMVPLSAIASFGPSTTPLAVNHQGLFVATTISFN 876 L S+ G G SSG A++ ++ Sbjct: 820 GLPSMEIQGEAAPGTSSGDAMALMENLAS------------------------------K 849 Query: 877 LPPGVSLSKATQAIYQTMAEAGVPPTIQGSFQGTAQAFQQSLKDQPILILAALAAVYIVL 936 LP G+ + G + + S P L+ + V++ L Sbjct: 850 LPAGIGY----------------------DWTGMSYQERLSGNQAPALVAISFVVVFLCL 887 Query: 937 GILYESYIHPVTILSTLPSAGVGALLGLLLFKTEFSIIALIGVILLIGIVKKNAIMMVDF 996 LYES+ PV+++ +P VG LL LF + + ++G++ IG+ KNAI++V+F Sbjct: 888 AALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEF 947 Query: 997 AIDA-SRQGKSSFDAIHEACLLRFRPIMMTTMAALLGALPLAFGSGDGAEMRAPLGIAIA 1055 A D ++GK +A A +R RPI+MT++A +LG LPLA +G G+ + +GI + Sbjct: 948 AKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVM 1007 Query: 1056 GGLIVSQMLTLYTTPVVYLYMDRL 1079 GG++ + +L ++ PV ++ + R Sbjct: 1008 GGMVSATLLAIFFVPVFFVVIRRC 1031 Score = 98.0 bits (244), Expect = 1e-22 Identities = 83/503 (16%), Positives = 167/503 (33%), Gaps = 25/503 (4%) Query: 2 NLSRPFITRPVATTLLALGIALAGLFAFIKLPVSPLPQVDFPTILVQASLP-GASPETVA 60 N + L+ I + F++LP S LP+ D L LP GA+ E Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587 Query: 61 TSVTSPLERHLGSIAD----VAEMTSMSSVGNAR----IVLQFNLNRDIDGAARDVQAAI 112 + + +L + V + S G A+ + + +G +A I Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647 Query: 113 NAARADLPASLKSNPTYRKVNPADSPIMVVSLTSKT-----ASPAKLYDAASTVLQQSLS 167 + A+ +L + + L A + +L + Sbjct: 648 HRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQ 707 Query: 168 QIDGIGQVSLSGSAN-PAVRVELEPQALFHYGIGLEDVRAALASANANSPKGAIETGPHH 226 + V +G + ++E++ + G+ L D+ +++A + Sbjct: 708 HPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRV 767 Query: 227 YQLYT---NDQASKAAQYKDLVIAYRNNAAVSLSDVSSVVDSVEDLRNLGLMNGERAVLV 283 +LY L + N V S ++ V L NG ++ + Sbjct: 768 KKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHW-VYGSPRLERYNGLPSMEI 826 Query: 284 ILYRSPGANIIDTIERVKAALPQLTAALPADIQVTPVLDRSRTIRASLADTEHTLLIAVS 343 +PG + D A + L + LPA I S R S + I+ Sbjct: 827 QGEAAPGTSSGD----AMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAPALVAISFV 881 Query: 344 LVVMVVFLFLRNWRATLIPSVAVPISIVGTFGAMYLLGFSLNNLSLMALIVATGFVVDDA 403 +V + + +W + + VP+ IVG A L + ++ L+ G +A Sbjct: 882 VVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNA 941 Query: 404 IVVLENI-ARHIENGTPRLQAAFDGAREVGFTVLSISLSLVAVFLPILLMGGIVGRLFRE 462 I+++E + G ++A R +L SL+ + LP+ + G Sbjct: 942 ILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001 Query: 463 FALTLSLAIAVSLVVSLTLTPMM 485 + + + + ++++ P+ Sbjct: 1002 VGIGVMGGMVSATLLAIFFVPVF 1024 Score = 62.5 bits (152), Expect = 8e-12 Identities = 38/225 (16%), Positives = 84/225 (37%), Gaps = 4/225 (1%) Query: 870 ATTISFNLPPGVSLSKATQAIYQTMAE--AGVPPTIQGS-FQGTAQAFQQSLKDQPILIL 926 A + L G + +AI +AE P ++ T Q S+ + + Sbjct: 286 AAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLF 345 Query: 927 AALAAVYIVLGILYESYIHPVTILSTLPSAGVGALLGLLLFKTEFSIIALIGVILLIGIV 986 A+ V++V+ + ++ + +P +G L F + + + G++L IG++ Sbjct: 346 EAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLL 405 Query: 987 KKNAIMMVDFAIDASRQGKSSF-DAIHEACLLRFRPIMMTTMAALLGALPLAFGSGDGAE 1045 +AI++V+ + K +A ++ ++ M +P+AF G Sbjct: 406 VDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGA 465 Query: 1046 MRAPLGIAIAGGLIVSQMLTLYTTPVVYLYMDRLRVWAEKRRHRR 1090 + I I + +S ++ L TP + + + Sbjct: 466 IYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGG 510
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.7 bits (79), Expect = 6e-04 Identities = 18/67 (26%), Positives = 25/67 (37%), Gaps = 13/67 (19%) Query: 22 GIDVDIADGEFVVLVGPSGCGKSTLLRMIAGLETVTEGEIAIGGRVVNTLEPKDRDIAMV 81 G D + VVL G G GKSTL+ + GL+ ++ IG +D Sbjct: 592 GCKFDYS----VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG---------TGKDSYEQ 638 Query: 82 FQNYALY 88 Y Sbjct: 639 IAGIVAY 645
>PF08280#M protein trans-acting positive regulator Length = 530 Score = 28.3 bits (63), Expect = 0.044 Identities = 12/44 (27%), Positives = 20/44 (45%) Query: 187 FFWDVVLPLSKTSIAALFVITFIYGWNQYLWPILITTDASLSTA 230 F +D + S+ I + F G YL+ I IT + S ++ Sbjct: 262 FVYDSLKKSSRDIIETYCQLNFSAGDLDYLYLIYITANNSFASL 305
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 46.3 bits (109), Expect = 1e-07 Identities = 50/192 (26%), Positives = 80/192 (41%), Gaps = 15/192 (7%) Query: 121 EKAFVPTIASYYSDA--KTGRLVSMPFNSSTPVLYYNKDAFKKAGLDPNQPPKTWADVKA 178 +KAF + + DA G+L++ P L YNKD L PN PPKTW ++ A Sbjct: 108 DKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKD------LLPN-PPKTWEEIPA 160 Query: 179 DAEKLKKAGYACGYTTGWQGWIQLENYSAWHGLPFATRNNGFDGADAVLEFNKPQQIAHI 238 ++LK G + + + +A G F N +D D ++ + A + Sbjct: 161 LDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAK--AGL 218 Query: 239 QFLQDMAKGGTFTYVGRKDEATAKFYSGDCAIMTTSSGALATIHKYAKFDFGTGMMPYDA 298 FL D+ K A A F G+ A+ A + I +K ++G ++P Sbjct: 219 TFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDT-SKVNYGVTVLP--- 274 Query: 299 SVKGAPQNAIIG 310 + KG P +G Sbjct: 275 TFKGQPSKPFVG 286
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 201 bits (513), Expect = 6e-59 Identities = 105/411 (25%), Positives = 182/411 (44%), Gaps = 54/411 (13%) Query: 211 GTVSLRLNNVRWRSAFDALLDAHGLAMARRGSVIWVAPVAELAERERRRF-------DAH 263 G S+ + W SA D + L S + + VA + ER ++ Sbjct: 190 GDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSR 249 Query: 264 ARAAQL-EPL--------ASRSFVLRYARAADVQRLLSG---------SAAQRILSKRGS 305 R + + L ++ L+YA+A+D+ +L+G AA+ + + + Sbjct: 250 QRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKN 309 Query: 306 VL--ADPRTNLLFVTDLSGRLAQIADLIGKLDTPSRQVLIEARIVEGDRGFSRNLGARLA 363 ++ A +TN L VT + + +I +LD QVL+EA I E NLG + A Sbjct: 310 IIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWA 369 Query: 364 LR-----------APDAGDRAAGVVAGRNGTLADLTARPISGFDAATAGLTLFAARASRL 412 + P + A ++GT++ A +S F+ AG Sbjct: 370 NKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGF------YQGN 423 Query: 413 LDIELSALEAQGRGQIVSSPRVVTADRTKAVVEQGAELPYQ-----AKVGNGVSGVQFRR 467 + L+AL + + I+++P +VT D +A G E+P N + V+ + Sbjct: 424 WAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKT 483 Query: 468 ATLKLEVEPQITPDGRVILDLDVAKDSVGE-----ETASGPAIHTKHVQTRVEVENGGTV 522 +KL+V+PQI V+L+++ SV + + G +T+ V V V +G TV Sbjct: 484 VGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETV 543 Query: 523 SIGGIFESDDRDDVTRVPLLGKIPVLGALFRHRAQRAQRSELVVYITPTVV 573 +GG+ + D +VPLLG IPV+GALFR +++ + L+++I PTV+ Sbjct: 544 VVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVI 594 Score = 56.9 bits (137), Expect = 1e-10 Identities = 30/175 (17%), Positives = 75/175 (42%), Gaps = 13/175 (7%) Query: 180 SLNLQQASLAAVFDAFARFTGLNIVVSERVRGTVSLR----LNNVRWRSAFDALLDAHGL 235 S + + + + ++ +++ VRGT+++R LN ++ F ++LD +G Sbjct: 31 SASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLSVLDVYGF 90 Query: 236 AMARRGSVIWVAPVAELAERERRRFDAHARAAQLEPLASRSFVLRYARAADVQRLLSGSA 295 A+ + + ++ A+ + A + + +R L A D+ LL Sbjct: 91 AVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLL---- 146 Query: 296 AQRIL---SKRGSVLADPRTNLLFVTDLSGRLAQIADLIGKLDTPSRQVLIEARI 347 R L + GSV+ +N+L +T + + ++ ++ ++D + ++ + Sbjct: 147 --RQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVPL 199
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 357 bits (918), Expect = e-124 Identities = 108/344 (31%), Positives = 181/344 (52%), Gaps = 2/344 (0%) Query: 12 ERTEAATPKRREKAREEGQVARSRELASFALLSAGFYGAWMLSGPIGEHLRTMLHTAFSF 71 E+TE TPK+ AR++GQVA+S+E+ S AL+ A LS EH ++ Sbjct: 4 EKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLM--LIPA 61 Query: 72 DRATAFDTNRMLSHAGILSLEGLYALVPVLALTGVAALAAPMALGGWLVSTKTFELKFER 131 +++ + + + LE Y P+L + + A+A+ + G+L+S + + ++ Sbjct: 62 EQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKK 121 Query: 132 LNPVAGLGRIFSIQGPIQLGMSIAKTLVVGGIGGIAIWRSKDELLGLATQPLHAALADAL 191 +NP+ G RIFSI+ ++ SI K +++ + I I + LL L T + Sbjct: 122 INPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLG 181 Query: 192 HLIAVCCGMTVAGMLVVAGLDVPYQLWQYNKKLRMTKEEVKREHRENEGDPHVKGRIRQQ 251 ++ + G +V++ D ++ +QY K+L+M+K+E+KRE++E EG P +K + RQ Sbjct: 182 QILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQF 241 Query: 252 QRAMARRRMMANVPTADVVVTNPTHFAVALKYTDGEMRAPKVVAKGVNLVAARIRELAAE 311 + + R M NV + VVV NPTH A+ + Y GE P V K + +R++A E Sbjct: 242 HQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEE 301 Query: 312 HHVPLLEAPPLARALYHNVDLEREIPGTLYSAVAEVLAWVYQLK 355 VP+L+ PLARALY + ++ IP A AEVL W+ + Sbjct: 302 EGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQN 345
>cloacin#Cloacin signature. Length = 551 Score = 34.7 bits (79), Expect = 7e-04 Identities = 23/61 (37%), Positives = 28/61 (45%), Gaps = 4/61 (6%) Query: 28 GGGGDGGSNASVNTGSGGGNTSA----GGGSTSGSGGSGGSGGSGGTPLASNQAAITVST 83 GG DG +S N GGG+ S GG GG+G SGG GT + A V+ Sbjct: 31 GGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAF 90 Query: 84 G 84 G Sbjct: 91 G 91 Score = 33.1 bits (75), Expect = 0.002 Identities = 18/44 (40%), Positives = 19/44 (43%), Gaps = 2/44 (4%) Query: 28 GGGGDGGSNASVNTGSGGG--NTSAGGGSTSGSGGSGGSGGSGG 69 GG G + GSG N GGGS SG GGSG G Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65 Score = 28.9 bits (64), Expect = 0.040 Identities = 18/40 (45%), Positives = 20/40 (50%), Gaps = 5/40 (12%) Query: 30 GGDG-GSNASVNTGSGGGNTSAGGGSTSGSGGSGGSGGSG 68 GGDG G N ++ SG N GG T G G S GSG Sbjct: 3 GGDGRGHNTGAHSTSGNIN----GGPTGLGVGGGASDGSG 38
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 86.4 bits (214), Expect = 5e-23 Identities = 32/110 (29%), Positives = 52/110 (47%), Gaps = 4/110 (3%) Query: 1 MDKSMKILVVDDFPTMRRIVRNLLKELGYSNVDEAEDGLAGLARLRGGGYDFVISDWNMP 60 M + ILV DD +R ++ L GY V + + G D V++D MP Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 NLDGLAMLKEIRADASLTHLPVLMVTAESKKENIIAAAQAGASGYVVKPF 110 + + +L I+ LPVL+++A++ I A++ GA Y+ KPF Sbjct: 59 DENAFDLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 64.5 bits (157), Expect = 2e-13 Identities = 30/143 (20%), Positives = 60/143 (41%), Gaps = 13/143 (9%) Query: 4 KIKVLCVDDSALIRSLMTEIINSQPDMEVCATAPDPLVARELIKQHNPDVLTLDVEMPRM 63 +L DD A IR+++ + ++ + I + D++ DV MP Sbjct: 3 GATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 64 DGLDFLEKLMRLRP-MPVVMVSSLTERGSEITLRALELGAVDFVTKPRVGIRDGMLEYAE 122 + D L ++ + RP +PV+++S+ ++A E GA D++ KP + E Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNT--FMTAIKASEKGAYDYLPKP--------FDLTE 110 Query: 123 KLADKVRAASRARVRQNPQPHAA 145 + RA + + R + + Sbjct: 111 LIGIIGRALAEPKRRPSKLEDDS 133
>PF06580#Sensor histidine kinase Length = 349 Score = 46.4 bits (110), Expect = 2e-07 Identities = 21/151 (13%), Positives = 51/151 (33%), Gaps = 52/151 (34%) Query: 451 ELDKSLIERIIDPLT--HLVRNSLDHGIETVEARRAAGKDAVGQLVLSAAHHGGNIVIEV 508 +++ ++++ + P+ LV N + HGI G+++L G + +EV Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEV 296 Query: 509 SDDGAGLNRDKILAKAAKQGMQISENISDDEVWNLIFAPGFSTAEVVTDVSGRGVGMDVV 568 + G+ ++ G G+ V Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318 Query: 569 KRNIQSMGG---HVEISSQAGRGTTTRIVLP 596 + +Q + G +++S + G +++P Sbjct: 319 RERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 77.9 bits (192), Expect = 4e-20 Identities = 39/114 (34%), Positives = 58/114 (50%), Gaps = 2/114 (1%) Query: 4 TILAIDDSATMRTLLSATLGEAGYDVTVASDGEVGLDVAMATPFDLVLTDHYMPKKNGLE 63 TIL DD A +RT+L+ L AGYDV + S+ A DLV+TD MP +N + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 LIAALRAQSAYEATPILVLTTENGDAFKDAARAAGATGWIEKPLDPDALIELVA 117 L+ ++ A P+LV++ +N A GA ++ KP D LI ++ Sbjct: 65 LLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 39.9 bits (93), Expect = 9e-06 Identities = 25/117 (21%), Positives = 51/117 (43%), Gaps = 9/117 (7%) Query: 182 FAMSSDAVEPYMRDILREIGKTLNDV---PNRIIVQGHTDAVPYAGGEKGYSNWELSADR 238 F + ++P + L ++ L+++ ++V G+TD + G Y N LS R Sbjct: 223 FNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI----GSDAY-NQGLSERR 277 Query: 239 ANASRRELIAGGMDEAKVLRV-LGLASTQNLNKADPLDPENRRISIIVLNRKSELAL 294 A + LI+ G+ K+ +G ++ N D + I + +R+ E+ + Sbjct: 278 AQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 27.6 bits (61), Expect = 0.004 Identities = 11/26 (42%), Positives = 14/26 (53%) Query: 13 VAARLSSRGATAAGIAPSGRIASSPL 38 A LS+ A A IA + +A SPL Sbjct: 296 AAQGLSTSAAAAGLIASAVTLAISPL 321
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 27.3 bits (60), Expect = 0.035 Identities = 14/96 (14%), Positives = 26/96 (27%) Query: 69 QTSPEDQIDALEKALQQIRAKGNRPPPGFEAHLGMLYASVGKEQQAEQAFQAEKASFPES 128 + + I A ++ + R S E AE + Q K Sbjct: 996 NITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNE 1055 Query: 129 SPFMDFLLKKKSAATQAKPQAPAQPTAQTQTQAQQQ 164 + + + A +AK A Q+ + Sbjct: 1056 QDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSE 1091
>BACSURFANTGN#Yersinia/Haemophilus virulence surface antigen signature. Length = 322 Score = 30.5 bits (68), Expect = 0.001 Identities = 16/74 (21%), Positives = 30/74 (40%), Gaps = 8/74 (10%) Query: 17 YEKNEVAADQQYKGKSLL---VSGTVQSIDKDAFDNIVIQLRTSNE---FMPVHAYLASG 70 ++KN ++ + L V+GT +S D N + L T + +H Sbjct: 199 FKKNGISERMIERHCLLRPVDVTGTTESEGLDQLLNAI--LDTHGIGYGYKKIHLSGQMS 256 Query: 71 NEAVAASLDKGQKV 84 A+AA +++ V Sbjct: 257 AHAIAAYVNEKSGV 270
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 29.9 bits (67), Expect = 0.026 Identities = 15/84 (17%), Positives = 30/84 (35%) Query: 262 KRLPEAETQSRRLIEMKPDNAEAHRMLGLVLHAQRRYEEAVAACRRAVELAPNAAPANGT 321 + +A + L + ++ LG A +Y+ A+ + + Sbjct: 50 GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFH 109 Query: 322 LGVVLLEQGNVHEAIGRLRRAVEI 345 LL++G + EA L A E+ Sbjct: 110 AAECLLQKGELAEAESGLFLAQEL 133