>cdtoxina#Cytolethal distending toxin A signature. Length = 258 Score = 30.4 bits (68), Expect = 0.019 Identities = 23/82 (28%), Positives = 35/82 (42%), Gaps = 2/82 (2%) Query: 567 VAILVANGVDGKAVDAMKAALEAKGAHAKVLGPTSAPVKTADGKSLPVDASAEGLPSVAF 626 IL+ ++G + KA L+ K +V G + P G LP A LP+ Sbjct: 11 AGILIPILLNGCSSGKNKAYLDPKVFPPQVEGGPTVPSPDEPGLPLPGPGPA--LPTNGA 68 Query: 627 DAVFVPGGADSVKALSTDGVAL 648 + PG A +V ++ DG L Sbjct: 69 IPIPEPGTAPAVSLMNMDGSVL 90
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 29.3 bits (65), Expect = 0.035 Identities = 50/211 (23%), Positives = 80/211 (37%), Gaps = 37/211 (17%) Query: 97 AEGRWSSAQRHLHRAAEADAHPLLYYIGAARAANEQGRYEDCDNLLERAL----IRQPQA 152 A +WS+AQ +A +A A AN + +++ AL R P A Sbjct: 53 ATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIVNEALRHNASRTPSA 112 Query: 153 -ELAIALNHAQLQQDRGDTDGALTTLQAMHERHPHNPQVLRQLQRLYQQRGDWSALIRLM 211 ELA A N A +QA ER + + ++ ++ + Sbjct: 113 TELAHANNAA---------------MQAEDERLR----LAKAEEKARKEAEAAEKAFQ-E 152 Query: 212 PELRKDKVLPPRELAELERR---AWGENLTLAAYREEGEGSLTGLPSLEKAWQGLSSAQR 268 E R+ ++ RE AE ER+ A E LAA EE + ++E A + LS+AQ Sbjct: 153 AEQRRKEI--EREKAETERQLKLAEAEEKRLAALSEEAK-------AVEIAQKKLSAAQS 203 Query: 269 QEPQLILAYADQLRRLGAEAQAEEVLRSALK 299 + ++ RL + A + L Sbjct: 204 EVVKMDGEIKTLNSRLSSSIHARDAEMKTLA 234
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 130 bits (329), Expect = 7e-40 Identities = 71/213 (33%), Positives = 109/213 (51%), Gaps = 3/213 (1%) Query: 15 LAQATETPPNTDSHDLAYSLGASLGERLHQEVPDLDLKALVDGLKQAYQGKPLALKQERI 74 +A T TD L+YS+GA LG+ + D++ L G++ G L L +E++ Sbjct: 19 MAATDATSLTTDKDKLSYSIGADLGKNFKNQGIDINPDVLAKGMQDGMSGAQLILTEEQM 78 Query: 75 DQILREHDAAMAQAETTGTDAPTEAALGAEKRFMESEKAKPGVKVLADGILMTELTPGTG 134 +L + + + + E F+ + K+KPG+ VL G+ + GTG Sbjct: 79 KDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPSGLQYKIIDAGTG 138 Query: 135 PKPDVNGRVEVRYVGRLPDGTIFD---QSTQPQWFRLDSVISGWTSALQGMPTGAKWRLV 191 KP + V V Y G L DGT+FD ++ +P F++ VI GWT ALQ MP G+ W + Sbjct: 139 AKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVF 198 Query: 192 IPSDQAYGAEGAGDLIDPFTPLVFEIELIAVSQ 224 +P+D AYG G I P L+F+I LI+V + Sbjct: 199 VPADLAYGPRSVGGPIGPNETLIFKIHLISVKK 231
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 47.8 bits (113), Expect = 4e-08 Identities = 33/203 (16%), Positives = 50/203 (24%), Gaps = 16/203 (7%) Query: 131 TTREAKPAAPAKAAAAKPSAKTVAKAPVAKAPAAKAAAAKAPVAKAPAKATARPAAKTAA 190 T P A PS + A +APV P A A P+ T Sbjct: 991 TVDTTNITTPNNIQADVPSVPSNN--------EEIARVDEAPV---PPPAPATPSETTET 1039 Query: 191 KTVAAKAPVKAAVKPAAKPAAAAKPVAAKTAAAKPAPAKAAAKPAAAKAPAKPATAKPAA 250 +K K K AK A++ ++ + Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099 Query: 251 AKPAASKAPAAAKPAAVKAPAKAPAKAPGKAAAKPAAAKPAAKPAAAKPAASTTPAV--K 308 K A+ + + P + P + A+PA P V K Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVT---SQVSPKQEQSETVQPQAEPARENDPTVNIK 1156 Query: 309 PAAAPAPAPAAAPAPAAANGATP 331 + A PA + Sbjct: 1157 EPQSQTNTTADTEQPAKETSSNV 1179
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 72.3 bits (177), Expect = 3e-17 Identities = 46/180 (25%), Positives = 68/180 (37%), Gaps = 2/180 (1%) Query: 83 SPTPPTPEPPPPPEPPPPPPPPPPPPEPEQPVEDPDAVEPPPKPIEKPKVEKPKPVKKPE 142 P T P EPP PPP P +P +P P P+ K + K Sbjct: 48 QPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKP 107 Query: 143 PVKKPTPPAPPKPVAAPAPAAPPTPTPAPPAPAAPAAPVKESAAV--SGLASLGNPPPEY 200 K P KPV + + PA P + A + SG +L P+Y Sbjct: 108 VKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQY 167 Query: 201 PGLALRRSWEGRVVLRIKVLPNGRAGTVEVTKSSGKPVLDEAAVEAVRNWKFIPAKRGDT 260 P A EG+V ++ V P+GR V++ + + + A+R W++ P K G Sbjct: 168 PARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSG 227
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 27.8 bits (62), Expect = 0.016 Identities = 10/30 (33%), Positives = 14/30 (46%), Gaps = 2/30 (6%) Query: 74 ISIVLGFLDGFLGTDQLIST--LYSIAVFL 101 SIV + LG Q+ S L +A+ L Sbjct: 30 FSIVFVMVRNALGLQQIPSNMTLNGVALLL 59
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 48.5 bits (115), Expect = 7e-09 Identities = 36/203 (17%), Positives = 80/203 (39%), Gaps = 20/203 (9%) Query: 12 NVLICGASRGIGLALCAALLARDDVAQVWAVAREASSSTGLAKLAEQYGQRLQRVDCDAR 71 I GA++GIG A+ L ++ A + AV + + + + D R Sbjct: 10 IAFITGAAQGIGEAVARTLASQG--AHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 72 NEQALEALASETLEGCEHLHLVISALGILHQDGAKPEKGLAQLTLASMQASFATNTFAPI 131 + A++ + + + ++++ G+L + L+ +A+F+ N+ Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRP------GLIHSLSDEEWEATFSVNSTGVF 121 Query: 132 LLLKHLLPLLRKQPATFAALSARVGSIGDNRLG----GWYSYRASKAALNQLLHTASIEL 187 + + + + S + ++G N G +Y +SKAA +EL Sbjct: 122 NASRSVSKYMMDRR------SGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL 175 Query: 188 KRLNQASTVLAIHPGTTDTELSQ 210 N +++ PG+T+T++ Sbjct: 176 AEYNIRCNIVS--PGSTETDMQW 196
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 113 bits (285), Expect = 2e-32 Identities = 79/259 (30%), Positives = 117/259 (45%), Gaps = 15/259 (5%) Query: 9 IAIITGAAQGIGAAIAQRFVQEGCFVYVTDVND---VLGRATVKALGDRACYLDLDVRSE 65 IA ITGAAQGIG A+A+ +G + D N +++KA A DVR Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69 Query: 66 KDWQRVTTHVLEAHGRLDVVVNNAGITGFEEGAVQHDPEHASLEDWQAVHRTNLDGVFLG 125 +T + G +D++VN AG+ G + S E+W+A N GVF Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV--LRPGLI----HSLSDEEWEATFSVNSTGVFNA 123 Query: 126 CKYAIRAIRHTGAGSIINISSRSGLVGIPGAAAYASSKAAVRNHTKTVALYCAEQGLKVR 185 + + + +GSI+ + S V AAYASSKAA TK + L AE +R Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN--IR 181 Query: 186 CNSIHPAAILTPMWEPMLGADAGREERMAALVRD----TPLRRFGLPEEVAAVALLLASD 241 CN + P + T M + + G E+ + + PL++ P ++A L L S Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241 Query: 242 EATYITGSEFNIDGGLLAG 260 +A +IT +DGG G Sbjct: 242 QAGHITMHNLCVDGGATLG 260
>PF05272#Virulence-associated E family protein Length = 892 Score = 91.3 bits (226), Expect = 8e-21 Identities = 86/296 (29%), Positives = 124/296 (41%), Gaps = 52/296 (17%) Query: 11 SFAQVKSAALRNIDKVLAHWLPNGKRVDGGKEYTAPNPTRTDKRAGSLKISVSKGTWSDF 70 +F + A L +L WLP G V G EY + + S K++V+ G W DF Sbjct: 10 NFTSLADALLTRAKDLLPEWLPGGVLV--GHEYECG--SLAGGKGDSCKVNVTTGKWCDF 65 Query: 71 ATGDKGGDLIDLVRYIDGGTDVEACNKLAD----------LLGVTADS---EPAKPAPPK 117 +TG+ G DL+DL I G +A ++A ++G A + +P +P PP Sbjct: 66 STGESGRDLLDLYAEIHGLKVSKAAAQVAREEGLESVAGIVMGAPAGAPAPKPPRPEPPP 125 Query: 118 SKAPE---WIAIAPIPAEAMNKCPVKHRQHGAPSKIWIYRDDKGQP--LMALYRFDLGP- 171 E W I P+P +H P W +P + R+ +GP Sbjct: 126 RPVVEKECWETIQPVP------------EHAVPPSFWHPAPKGREPDKIEHTARYQVGPV 173 Query: 172 ---------DEDGKPKKVFAPLTWCKRSDGETTQWRWQGLPEPRPLLRLDELALRADAPV 222 DG K+ P + + + W+W+G +PRPL A + V Sbjct: 174 LWGYVVRFIKSDG--DKLTLPYVYSRSQRDGSEAWKWRGWDDPRPLYFPSHRAPESRT-V 230 Query: 223 VLCEGEKAADAAADLMPN-----HVATCWPNGSNSWHKADLTPLKGRDVLLWPDND 273 VL EGE+ AD L+ + WP GSN W KAD + L G V+LWPD D Sbjct: 231 VLVEGERKADCLQQLLDAGAPGVYCVASWPGGSNGWPKADWSWLAGCTVVLWPDCD 286
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 29.3 bits (66), Expect = 0.032 Identities = 16/22 (72%), Positives = 17/22 (77%), Gaps = 1/22 (4%) Query: 370 AQALGQEIGALEVGKRADWLVL 391 A L EIG+LEVGKRAD LVL Sbjct: 416 AHGLSHEIGSLEVGKRAD-LVL 436
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 58.4 bits (141), Expect = 2e-11 Identities = 69/372 (18%), Positives = 135/372 (36%), Gaps = 50/372 (13%) Query: 60 MPMLSQEFSITAAQSSLILSVATAMLAIGLLITGPVSDRLGRKSVMVMALFCASLFTIAS 119 +P ++ +F+ A ++ + + +IG + G +SD+LG K +++ + ++ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 120 ALMPSWEGVLV-TRALVGLSLSGLAAVAMTYLSEEIHPTHLGLAMGLYIGGSAVGGMSGR 178 + S+ +L+ R + G + A+ M ++ I + G A GL A+G G Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 179 LIVGVMIDYVSWHAAMLV---------------------------VGGLALIAAAVFWRI 211 I G++ Y+ W +L+ G + + VF+ + Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFML 216 Query: 212 LPESRNFRARSL----------HPRSLLDGFVVQ--FRDKGLPLLFLTAFLLMGAFVTLF 259 S + + H R + D FV ++ + L ++ G Sbjct: 217 FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFV 276 Query: 260 NYIAYRLLSEPYHLSQAVVG--VFSVVYLSGIYSSAKVGSLADRLGRRRVLWAVIVMMLF 317 + + Y ++ + + LS A +G + +S I G L DR G VL + + Sbjct: 277 SMVPY-MMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSV 335 Query: 318 GLSLTLFTPL--PVVITGVLIFTFGFFGA-HSVASSWVGRRATVAR-GQATSLYLFCYYA 373 F +T +++F G +V S+ V G SL F + Sbjct: 336 SFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFL 395 Query: 374 GSSVAGTGGGVF 385 GTG + Sbjct: 396 S---EGTGIAIV 404 Score = 30.6 bits (69), Expect = 0.011 Identities = 37/168 (22%), Positives = 65/168 (38%), Gaps = 9/168 (5%) Query: 22 PLGDTYIEKNTPLFKRTALALFAGGFSTFTLLYCVQPMMPMLSQEFS--ITAAQSSLILS 79 P D + KN P + G F + M+P + ++ TA S+I+ Sbjct: 246 PFVDPGLGKNIPF-----MIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIF 300 Query: 80 VATAMLAIGLLITGPVSDRLGRKSVMVMALFCASLFTIASALMPSWEGVLVTRALVGL-- 137 T + I I G + DR G V+ + + S+ + ++ + +T +V + Sbjct: 301 PGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG 360 Query: 138 SLSGLAAVAMTYLSEEIHPTHLGLAMGLYIGGSAVGGMSGRLIVGVMI 185 LS V T +S + G M L S + +G IVG ++ Sbjct: 361 GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 110 bits (275), Expect = 2e-31 Identities = 79/248 (31%), Positives = 117/248 (47%), Gaps = 14/248 (5%) Query: 5 ILVTGSSRGIGRAIALRLAQAGYDLILHCRTGRSEAEAVQAEIIALGRQARVLQFDVSDR 64 +TG+++GIG A+A LA G I + E V + + A R A DV D Sbjct: 11 AFITGAAQGIGEAVARTLASQGAH-IAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69 Query: 65 AACKEILEQDVETHGAYYGVVLNAGLTRDGAFPALTDDDWDQVLRTNLDGFYNVLHPLTM 124 AA EI + G +V AG+ R G +L+D++W+ N G +N ++ Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129 Query: 125 PMIRRRSAGRIVCITSVSGLIGNRGQVNYSASKAGLIGAAKALAIELGKRKITVNCVAPG 184 M+ RRS G IV + S + Y++SKA + K L +EL + I N V+PG Sbjct: 130 YMMDRRS-GSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188 Query: 185 LIDTAM-----LDENVPVD------ELMKM-IPAQRMGTPEEVAGAVNFLMSAEAAYITR 232 +T M DEN E K IP +++ P ++A AV FL+S +A +IT Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248 Query: 233 QVLAVNGG 240 L V+GG Sbjct: 249 HNLCVDGG 256
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 42.5 bits (100), Expect = 7e-06 Identities = 35/181 (19%), Positives = 63/181 (34%), Gaps = 33/181 (18%) Query: 629 VFASTQVSAAELKLASCVLIVLLLIVPFGFNGALRIV---ALPLLAALCSLASLGWLGQP 685 F + L +++V L++ F N ++ A+P+ L + A L G Sbjct: 331 PFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPV-VLLGTFAILAAFGYS 389 Query: 686 LTLFSLFGLLLVTAISVDYAILMRE----------------------QVGGAAVSLLGTL 723 + ++FG++L + VD AI++ E Q+ GA V + L Sbjct: 390 INTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVL 449 Query: 724 LAAVTTWLSFGLLAISGTPAISNFGLSVSLGLAFSFMLA----PWASPRQKKSAGSPEPR 779 A FG S F +++ +A S ++A P K + Sbjct: 450 SAVFIPMAFFG---GSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHE 506 Query: 780 P 780 Sbjct: 507 N 507 Score = 32.5 bits (74), Expect = 0.008 Identities = 32/146 (21%), Positives = 58/146 (39%), Gaps = 14/146 (9%) Query: 264 ILLLLLLAFRRWSVLLAFVPVIVGMLFGAVACVAIFG-SMHVMTLVLGSSLIGVAVDYP- 321 ++ L L R + VPV+ L G A +A FG S++ +T+ IG+ VD Sbjct: 354 VMYLFLQNMRATLIPTIAVPVV---LLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAI 410 Query: 322 -----LHYLSKSWSLKPW----RSWPALRLTLPGLSLSLVTSCIGYLALAWTPFPALTQI 372 + + L P +S ++ L G+++ L I + Q Sbjct: 411 VVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQF 470 Query: 373 AVFSAAGLVGAYLTAVCLLPALLGRI 398 ++ + + + L A+ L PAL + Sbjct: 471 SITIVSAMALSVLVALILTPALCATL 496 Score = 32.5 bits (74), Expect = 0.009 Identities = 15/63 (23%), Positives = 25/63 (39%), Gaps = 14/63 (22%) Query: 648 IVLLLIVPFGFNGALRIVALPLLAALCSLASLGWLGQPLTLFSLFGLLLVTAISVDYAIL 707 + ++L+VP G G L + Q ++ + GLL +S AIL Sbjct: 898 VSVMLVVPLGIVGVL--------------LAATLFNQKNDVYFMVGLLTTIGLSAKNAIL 943 Query: 708 MRE 710 + E Sbjct: 944 IVE 946
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 25.2 bits (55), Expect = 0.042 Identities = 11/43 (25%), Positives = 18/43 (41%) Query: 27 GNDQTLFGEGLGLDSVDALELGLAIQKRYGIKIDADAKDTRNH 69 G D +F G+G + + E L + G K+D + R Sbjct: 322 GVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKVRGE 364
>PF06580#Sensor histidine kinase Length = 349 Score = 36.0 bits (83), Expect = 4e-04 Identities = 19/137 (13%), Positives = 40/137 (29%), Gaps = 51/137 (37%) Query: 407 LMHLLRNSMDHGIESAEARRASGKSAKGHLSLNAYHDSGSIVIEIADDGAGLNRERILEK 466 + L+ N + HGI G + L D+G++ +E+ + G+ + Sbjct: 260 VQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK--- 308 Query: 467 AQERGLVASGAVLTDQEIYNLIFEPGFSTAEAVTNLSGRGVGMDVVKRNITLLRG---TV 523 G G+ V+ + +L G + Sbjct: 309 ------------------------------------ESTGTGLQNVRERLQMLYGTEAQI 332 Query: 524 DLDSQPGEGTIVRIRLP 540 L + G+ + +P Sbjct: 333 KLSEKQGKVN-AMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 90.3 bits (224), Expect = 1e-24 Identities = 26/117 (22%), Positives = 58/117 (49%), Gaps = 2/117 (1%) Query: 4 SVLVVDDSSSVRQVVGIALKSAGYDVIEACDGKDALGKLSGQKVHLIISDVNMPNMDGIT 63 ++LV DD +++R V+ AL AGYDV + ++ L+++DV MP+ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 FVKEVKKLASYKFTPIIMLTTESQESKKAEGQAAGAKAWVVKPFQPAQMLAAVSKLI 120 + +KK P+++++ ++ + GA ++ KPF +++ + + + Sbjct: 65 LLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.3 bits (71), Expect = 0.006 Identities = 32/208 (15%), Positives = 67/208 (32%), Gaps = 24/208 (11%) Query: 170 QVIDSLKATQASRDETLTQVRSLTAYTGELRTMAADVAAIAAQTNLLALNA--AIEAARA 227 V+ L A A D TQ L A + R + + L L + Sbjct: 122 DVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSE 181 Query: 228 GEAGRGFAVVADAVRSLSSKSSE---TGQQMSAKVDIINNAITQLVQAASSGADQDS--- 281 E R +++ + + ++ + + A+ + I + + + Sbjct: 182 EEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFS 241 Query: 282 ----------HSVAESEQSIQHVLQRFQSITGRLAESADLLKQESYGIRDEMTEVLVSLQ 331 H+V E E + + +L + ++ E ++E LV+ Sbjct: 242 SLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQ----IESEILSAKEEYQ--LVTQL 295 Query: 332 FQDRVSQILTHVRDNIDSLHTHLQQSSQ 359 F++ + L DNI L L ++ + Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEE 323
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 35.8 bits (82), Expect = 9e-05 Identities = 14/56 (25%), Positives = 25/56 (44%) Query: 90 NAWDNEDFVKAIKATGRKQLIIAGVVTDVCVAFPTLSALAEGFDVFVVTDSSGTFN 145 +A+ + ++ ++ GR QLII G+ + A E F V D+ F+ Sbjct: 127 SAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFS 182
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 29.1 bits (65), Expect = 0.032 Identities = 18/66 (27%), Positives = 28/66 (42%), Gaps = 5/66 (7%) Query: 136 AGAFGTTLSKDGSLLYV--NNEAAS---TLSVIDLDHQRPVAVVPGFSQPRQGIRVSPDG 190 A + DG++LY+ N+E AS L + + G +PR G R Sbjct: 87 ASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQRSGIWEPRFGWRPDASN 146 Query: 191 KTVYVT 196 + VYV+ Sbjct: 147 RLVYVS 152
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 29.3 bits (65), Expect = 0.038 Identities = 41/186 (22%), Positives = 57/186 (30%), Gaps = 12/186 (6%) Query: 245 AGIISSGGQTSGI---GSFGAGAAIGAATMAASAAASAGSAALAGANEIAGGTSALTAAF 301 AG GG G G FG G S S LA + A A Sbjct: 265 AGGAVPGGAVPGGAVPGGFGPGGFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIRVG 324 Query: 302 KAAEAHLDSGS--TDTGNFEYGSGSEQHTASGSGQSAFGQAMGNGQNTGYASRVAQTG-R 358 + A + GS GN G+ + + S QA + Q RV + Sbjct: 325 RGARVTVSGGSLSAPHGNVIETGGARRFAPQAAPLSITLQAGAHAQGKALLYRVLPEPVK 384 Query: 359 LAASAGAL----IAEQVGQSI--SSRASAAVADTAGGRVAASINENSKASLSDKTEKFDG 412 L + GA I SI +S VA + R + S+ + T Sbjct: 385 LTLTGGADAQGDIVATELPSIPGTSIGPLDVALASQARWTGATRAVDSLSIDNATWVMTD 444 Query: 413 DSVSGS 418 +S G+ Sbjct: 445 NSNVGA 450
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.8 bits (69), Expect = 0.018 Identities = 36/224 (16%), Positives = 68/224 (30%), Gaps = 9/224 (4%) Query: 18 GREQKYLTYAEVNDHL--PEDISDPE--QVEDIIRMINDMGIPVHESAPDADALMLADAD 73 GR Y E + +I+ P Q + N+ I + AP ++ Sbjct: 976 GRYDLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSE 1035 Query: 74 TDEAAAEEAAAALAAVETDIGRTTDPVRMYMREMGTVELLTREGEIEIAK--RIEEGIRE 131 T E AE + VE + T+ RE+ + + + + +E Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATETTAQ-NREVAKEAKSNVKANTQTNEVAQSGSETKE 1094 Query: 132 VMGAIAHFPGTVD--HILSEYTRVTSEGGRLSDVLSGYIDPDDGIAPPAEVPPPVDPKAA 189 TV+ T T E +++ +S + + + P AE DP Sbjct: 1095 TQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVN 1154 Query: 190 KAEGADDDEEESADASDEEDEVESGPDPVIAAQRFGAVSDQMEI 233 E + ++ + PV + + +E Sbjct: 1155 IKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVEN 1198
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.3 bits (71), Expect = 0.001 Identities = 8/55 (14%), Positives = 19/55 (34%), Gaps = 3/55 (5%) Query: 97 AFVEVGKTVKVGDTICIVEAMKMMNHITAEKAGVIESILVENGQPVEFDQPLFTI 151 +V + K + I + +++ I+V+ G+ V L + Sbjct: 76 VLGQVEIVATANGKLTHSGRSKEIKPI---ENSIVKEIIVKEGESVRKGDVLLKL 127
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.009 Identities = 20/106 (18%), Positives = 37/106 (34%), Gaps = 2/106 (1%) Query: 22 LQGALGSLGQVVSAGTGSLDDLLALVDVTFASVVFVGLDREHLMNQSALIEGALEAKPML 81 L AL G V T + L + +V + N L+ +A+P L Sbjct: 19 LNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDV-VMPDENAFDLLPRIKKARPDL 76 Query: 82 AIVALGDGMDNQLVLNAMRAGARDFVAYGSRSSEVAGLVRRLSKRL 127 ++ + + A GA D++ +E+ G++ R Sbjct: 77 PVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 139 bits (351), Expect = 4e-38 Identities = 70/300 (23%), Positives = 123/300 (41%), Gaps = 13/300 (4%) Query: 84 GVAPGTTSLMVWTACSKAPRQSMVFVRGRATASMVDVQPLPSADAQLPSQVQTDIRFIEV 143 A +L + + + V M D++ + + QV + EV Sbjct: 298 QAAKPVAALDKNIIIKAHGQTNALIVTAAP-DVMNDLERVIAQLDIRRPQVLVEAIIAEV 356 Query: 144 SRRKLKEASTSIFGKGSNNFLFGAPGTVPGVNVTPGTVSGTRP-----SIPLNNDTFNIV 198 K + F G +P G + S+ +FN + Sbjct: 357 QDADGLNLGIQWANKNAGMTQFTNSG-LPISTAIAGANQYNKDGTVSSSLASALSSFNGI 415 Query: 199 WGGGSSKVLGM-INAMENSGFAYTLARPSLVALNGQSASFLAGGEFPVPVPNGEGNG--- 254 G M + A+ +S LA PS+V L+ A+F G E PV + +G Sbjct: 416 AAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNI 475 Query: 255 -ISIEYKEFGVRLTLTPTVVGRDRILLKVAPEVSELDFTAGITIAGTSVPALNIRRTDTS 313 ++E K G++L + P + D +LL++ EVS + A + + N R + + Sbjct: 476 FNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAAS-STSSDLGATFNTRTVNNA 534 Query: 314 ISLADGESFVISGLISSSNVSSVDKFPGLGDIPILGAFFRSSQIQRDERELLMIVTPHLV 373 + + GE+ V+ GL+ S + DK P LGDIP++GA FRS+ + +R L++ + P ++ Sbjct: 535 VLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVI 594
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 83.7 bits (207), Expect = 5e-22 Identities = 25/107 (23%), Positives = 42/107 (39%), Gaps = 3/107 (2%) Query: 6 TRQQLLLVDDEEDANEELAELLEGEGFCCFTASSVKMALQQLTLHPDIALVITDLRMPEE 65 T +L+ DD+ L + L G+ S+ + + LV+TD+ MP+E Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDE 60 Query: 66 SGLQLIKHLREHTSRQHLPVIVTSGHADMDDVSDMLRLHVLDLFRKP 112 + L+ +++ R LPV+V S D KP Sbjct: 61 NAFDLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 72.3 bits (177), Expect = 1e-16 Identities = 64/287 (22%), Positives = 108/287 (37%), Gaps = 47/287 (16%) Query: 5 RLTSLLAGSLLAAMACATQAAPIDIDDGQHKVHLPDAPKRVVVLEFSFLDSLASVGVTPV 64 RL + +A S L AA ID P R+V LE+ ++ L ++G+ P Sbjct: 11 RLLTAMALSPLLWQMNTAHAAAID-------------PNRIVALEWLPVELLLALGIVPY 57 Query: 65 GAADDGDANR--VLPKARKAVGEWQSVGLRSQPNIEVIARLKPDLIIADLGRHQALYNDL 122 G AD + P +V + VGLR++PN+E++ +KP ++ G + L Sbjct: 58 GVADTINYRLWVSEPPLPDSVID---VGLRTEPNLELLTEMKPSFMVWSAG-YGPSPEML 113 Query: 123 KSLAPTLMLPSRGEDYEGSLKSAEL------IGTALGKGPQMQARIAENREHLKVVAAQI 176 +AP +G A + L + +A+ + ++ + + Sbjct: 114 ARIAPGRGFNFS----DGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRF 169 Query: 177 PADTK---VLFGVAREDSFSVHGPHSYAGSVLKAIGLQVPEVRKNAAPTEF-------VS 226 +L + V GP+S +L G+ NA E VS Sbjct: 170 VKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGI------PNAWQGETNFWGSTAVS 223 Query: 227 LEQLLAL-DPGWLLVGHYRRPSLVDSWSKQPLWQVLSAVRNKQVAEV 272 +++L A D L H + D+ PLWQ + VR + V Sbjct: 224 IDRLAAYKDVDVLCFDHDNSKDM-DALMATPLWQAMPFVRAGRFQRV 269
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 53.9 bits (129), Expect = 6e-11 Identities = 44/194 (22%), Positives = 80/194 (41%), Gaps = 19/194 (9%) Query: 4 KTALIIGASRGLGLGLVQRLTEQGWKVTATVRDPQNADNLKAIEGVRIEA-------VDI 56 K A I GA++G+G + + L QG + A D K + ++ EA D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAV--DYNPEKLEKVVSSLKAEARHAEAFPADV 66 Query: 57 DDTASLEVLVQKLKGEV--FDVLFVNAGI--MGPKHQSAAQATAAELGQLFLTNAIAPIR 112 D+A+++ + +++ E+ D+L AG+ G H + + E F N+ Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDE----EWEATFSVNSTGVFN 122 Query: 113 LAERFVDHIRPETGVLAFMSSVLGSVACPEGETMTLYKASKAALNSMTNSFVVQLPEPRP 172 + ++ + +V + A +M Y +SKAA T ++L E Sbjct: 123 ASRSVSKYMMDRRS--GSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180 Query: 173 TVLSLHPGWVKTDM 186 + PG +TDM Sbjct: 181 RCNIVSPGSTETDM 194
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 28.7 bits (64), Expect = 0.038 Identities = 9/15 (60%), Positives = 11/15 (73%), Gaps = 1/15 (6%) Query: 125 IPFLSDIPLIGRMLF 139 +P L DIP+IG LF Sbjct: 560 VPLLGDIPVIGA-LF 573
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 95.1 bits (236), Expect = 1e-25 Identities = 46/208 (22%), Positives = 85/208 (40%), Gaps = 17/208 (8%) Query: 5 LVQWSINPRRTAVIVVDMQKVFCEPTGALYVKNTAYIVQPIQRLLEAARAGGVMVVYLRH 64 V W +P R +++ DMQ F + A + I++L G+ VVY Sbjct: 21 KVSWVPDPNRAVLLIHDMQNYFVDAFTA-GASPVTELSANIRKLKNQCVQLGIPVVYTAQ 79 Query: 65 IVRGDGSDTGRMRDLY-PNVDQILARHDPDVEVIEALAPQSGDVIIDKLFYSGFHNTDLD 123 + D + D + P L + ++I LAP+ D+++ K YS F T+L Sbjct: 80 PGSQNPDDRALLTDFWGPG----LNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLL 135 Query: 124 TVLRARDVDTLIVCGTVTNVCCETTIRDGVHREYKVIALSDANAAMDYPDVGFGAVSAEE 183 ++R D LI+ G ++ C T + + K + DA A D+ + E Sbjct: 136 EMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVA--DF---------SLE 184 Query: 184 VQRISLTTIAYEFGEVTTTADVIQRIES 211 +++L A T ++ ++++ Sbjct: 185 KHQMALEYAAGRCAFTVMTDSLLDQLQN 212
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 97.2 bits (242), Expect = 3e-25 Identities = 41/120 (34%), Positives = 67/120 (55%), Gaps = 1/120 (0%) Query: 9 PAPRVLVVDDHRKIRDPLAVYLRRHLFEVRTAEDAAGMWQLLKQQSFDVVVLDVMLPDGD 68 +LV DD IR L L R ++VR +AA +W+ + D+VV DV++PD + Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 69 GFDLCNRLH-RRENIPVILLTARDTPADRVRGLDIGADDYITKPFEPRELVARINSVLRR 127 FDL R+ R ++PV++++A++T ++ + GA DY+ KPF+ EL+ I L Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 342 bits (879), Expect = e-121 Identities = 159/283 (56%), Positives = 200/283 (70%), Gaps = 1/283 (0%) Query: 3 LLDFLASSTLAFVIFIGVLGLLIGSFLNVVVYRLPKMMENDWKAQSREMLGLPAE-PEQP 61 LL+ + + + L+IGSFLNVV++RLP M+E +W+A+ R E ++P Sbjct: 4 LLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEP 63 Query: 62 TFNLILPHSRCPHCAHQIRPWENLPVVSYLMLGGKCSQCKAPISKRYPLVELVCALLSAY 121 +NL++P S CPHC H I EN+P++S+L L G+C C+APIS RYPLVEL+ ALLS Sbjct: 64 PYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVA 123 Query: 122 VAWHFGFGWQTAAMLVLSWGLLAMSLIDADTQLLPDSLVLPLMWLGLIVNAFGLFTSLND 181 VA GW T A L+L+W L+A++ ID D LLPD L LPL+W GL+ N G F SL D Sbjct: 124 VAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGD 183 Query: 182 ALWGAVAGYLSLWSVFWLFKLITGKEGMGYGDFKLLAMLGAWGGWQILPLTILLSSLVGA 241 A+ GA+AGYL LWS++W FKL+TGKEGMGYGDFKLLA LGAW GWQ LP+ +LLSSLVGA Sbjct: 184 AVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGA 243 Query: 242 ILGVIMMRLRNVESGTPIPFGPYLAIAGWIALLWGGQITDSYL 284 +G+ ++ LRN PIPFGPYLAIAGWIALLWG IT YL Sbjct: 244 FMGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYL 286
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 425 bits (1093), Expect = e-150 Identities = 122/403 (30%), Positives = 220/403 (54%), Gaps = 14/403 (3%) Query: 11 FTWEGVDKKGSKISGELSGHNPALIKAQLRKQGVNPTKVRKKTVSI---------FGKGK 61 + ++ +D +G K G + + LR++G+ P V + + Sbjct: 4 YHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRKI 63 Query: 62 KIKPLDIAFFARQMATMMKAGVPLLQSFDIISEGAENPNMRSLVDSLKQEVSAGNSFATA 121 ++ D+A RQ+AT++ A +PL ++ D +++ +E P++ L+ +++ +V G+S A A Sbjct: 64 RLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLADA 123 Query: 122 LRQKPDQFDNLFCNLVDAGEQAGALESLLDRVATYKEKTEKLKAKIKKAMTYPAAVLVVA 181 ++ P F+ L+C +V AGE +G L+++L+R+A Y E+ ++++++I++AM YP + VVA Sbjct: 124 MKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVVA 183 Query: 182 FIVSGILLIKVVPQFQAVFAGFGAELPAFTRLVIGLSEVVQTW--WLAIIGIFVGSFFIF 239 V ILL VVP+ F LP TR+++G+S+ V+T+ W+ + + + F+ Sbjct: 184 IAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLA---LLAGFMA 240 Query: 240 KRSYKQSQKFRDSVDRFLLKIPLIGPLIFKSSVARYARTLATTFAAGVPLVEALDSVAGA 299 R + +K R S R LL +PLIG + + ARYARTL+ A+ VPL++A+ Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300 Query: 300 TGNVVFRNAVMKIKQDVSTGMQLNFSMRSTGVFPSLAIQMTAIGEESGALDSMLDKVATY 359 N R+ + V G+ L+ ++ T +FP + M A GE SG LDSML++ A Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360 Query: 360 YEDEVDNMVDSLTSLMEPMIMALLGVIVGGLVIAMYLPIFQLG 402 + E + + L EP+++ + +V +V+A+ PI QL Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLN 403
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 43.3 bits (102), Expect = 3e-08 Identities = 20/69 (28%), Positives = 39/69 (56%), Gaps = 10/69 (14%) Query: 8 QKGFTLIELMIVVAIVGILAAVAIPAYQDYTIRAQ----VAELATLADGAKVAVSETYQ- 62 Q+GFTL+E+M+V+ I+G+LA++ +P +A V+++ L + + Y+ Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENAL-----DMYKL 61 Query: 63 TTGAFPTSN 71 +PT+N Sbjct: 62 DNHHYPTTN 70
>OUTRMMBRANEA#Outer membrane protein A signature. Length = 346 Score = 29.1 bits (65), Expect = 0.029 Identities = 12/27 (44%), Positives = 13/27 (48%), Gaps = 1/27 (3%) Query: 172 QPAPAPAPAPAPAPAAPPPVKSTVSLS 198 Q AP APAPAP AP +L Sbjct: 193 QGEAAPVVAPAPAP-APEVQTKHFTLK 218
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 476 bits (1228), Expect = e-173 Identities = 187/334 (55%), Positives = 238/334 (71%), Gaps = 12/334 (3%) Query: 1 MTVLVTGAAGFIGFHVAKHLCEQGIEVVGIDNLNDYYSVELKHSRLAILERMPGFVFKRL 60 M LVTGAAGFIGFHV+K L E G +VVGIDNLNDYY V LK +RL +L + PGF F ++ Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQ-PGFQFHKI 59 Query: 61 DITDATGLSTLFEHHTFEQVIHLAAQAGVRYSMEQPDAYIQSNLVGFSNVLEACRQHRPS 120 D+ D G++ LF FE+V + VRYS+E P AY SNL GF N+LE CR ++ Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119 Query: 121 HLIYASSSSVYGANTRLPFRVEDAVDRPLSLYAATKRANELAAYSYCHLYGLRATGLRFF 180 HL+YASSSSVYG N ++PF +D+VD P+SLYAATK+ANEL A++Y HLYGL ATGLRFF Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179 Query: 181 TVYGPWGRPDMALFKFTQAMLREEPVDIYNHGEMARDFTYIDDIVESILRLRLRPPEPT- 239 TVYGPWGRPDMALFKFT+AML + +D+YN+G+M RDFTYIDDI E+I+RL+ P Sbjct: 180 TVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADT 239 Query: 240 -----NGEPA-----HQLFNIGRGQPVKLLEFVDCLEKALGLKAQRRYLPLQAGDVLQTW 289 G PA ++++NIG PV+L++++ LE ALG++A++ LPLQ GDVL+T Sbjct: 240 QWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETS 299 Query: 290 ADVTALTRWIDFQPHVSVDSGVSAFVEWYREHYQ 323 AD AL I F P +V GV FV WYR+ Y+ Sbjct: 300 ADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 65.6 bits (160), Expect = 1e-14 Identities = 25/121 (20%), Positives = 55/121 (45%), Gaps = 1/121 (0%) Query: 3 ILIIEDHQDIHDNLVEYFELRGHNVQSALDGLSGLHLAATQKFDAIILDIMLPGIDGNQI 62 IL+ +D I L + G++V+ + + A D ++ D+++P + + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 63 CRSLRQYSKTEVAIVMLSARDELEDRLVGFSVGTDDYITKPFAMSEVLARVEAVVARSQR 122 +++ +VM SA++ + G DY+ KPF ++E++ + +A +R Sbjct: 66 LPRIKKARPDLPVLVM-SAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 123 R 123 R Sbjct: 125 R 125
>PF06580#Sensor histidine kinase Length = 349 Score = 30.6 bits (69), Expect = 0.015 Identities = 20/99 (20%), Positives = 36/99 (36%), Gaps = 21/99 (21%) Query: 38 RAAEQNIEQNSLPSIQVIDDIQIALLHAR---------LESIRMLASTDPDVKKASEAKV 88 + I+Q + S+ + Q+ L A+ L +IR L DP Sbjct: 143 NYKQAEIDQWKMASMA--QEAQLMALKAQINPHFMFNALNNIRALILEDPT--------- 191 Query: 89 RQAMDTLQSRSDFYQKNLISGEQDRSQFDDARNKMSNYL 127 +A + L S S+ + +L + D + +YL Sbjct: 192 -KAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYL 229
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 74.9 bits (184), Expect = 4e-16 Identities = 54/150 (36%), Positives = 70/150 (46%), Gaps = 17/150 (11%) Query: 33 VDDGKSTLIGRLLHDSKMIYEDHLEAITRDSKKSGTTGDDVDLALLVDGLQAEREQGITI 92 VD GK+TL LL++S I E K T D+ L ER++GITI Sbjct: 12 VDAGKTTLTESLLYNSGAITE------LGSVDKGTTRTDNTLL---------ERQRGITI 56 Query: 93 DVAYRYFSTAKRKFIIADTPGHEQYTRNMATGASTCDLAIILVDARYGVQTQTRRHSYIA 152 F K I DTPGH + + S D AI+L+ A+ GVQ QTR + Sbjct: 57 QTGITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116 Query: 153 SLLGIKHIVVAINKMDLNGFD-EGVFESIK 181 +GI I INK+D NG D V++ IK Sbjct: 117 RKMGIPTIFF-INKIDQNGIDLSTVYQDIK 145
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 26.7 bits (59), Expect = 0.023 Identities = 11/41 (26%), Positives = 23/41 (56%), Gaps = 3/41 (7%) Query: 1 MSNRRAFILRRPFTSLLLLLLAALAVLIFQYRVALQAFPTI 41 M+N F +RRP + +L ++ +A + ++ + +PTI Sbjct: 1 MAN---FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTI 38
>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature. Length = 522 Score = 30.9 bits (69), Expect = 0.032 Identities = 16/41 (39%), Positives = 23/41 (56%), Gaps = 3/41 (7%) Query: 592 KEDEEIILDYQNYHVTADGQLTYNFLTKMPPPNNYNYFIAP 632 +E ++IILD + Q +N L + P P NYNY+ AP Sbjct: 370 EEKQKIILDQAK---ALETQYVHNALKRNPVPRNYNYYQAP 407
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.3 bits (84), Expect = 2e-04 Identities = 34/160 (21%), Positives = 61/160 (38%), Gaps = 13/160 (8%) Query: 47 LPEIGRHFSWSEVEQAEIATWV---AVGTAVVALAIGPLVDRLGRRVGIMFTVSGSAICS 103 LP + R S A + A+ A +G L DR GRR ++ +++G+A+ Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87 Query: 104 ALTAIGGSWGKSPLILIRSLGGLGYAEETVNATYLSEIYAASDDPRLAKRRGFIYSLVQG 163 A+ A L + R + G+ A V Y+++I + A+ GF+ + Sbjct: 88 AIMATAPFL--WVLYIGRIVAGITGATGAVAGAYIADITDGDER---ARHFGFMSACFGF 142 Query: 164 GWPVGALIAAGLTAVLLPIIGWQGCFVFAAIPAIIIAIMA 203 G G ++ L+ F AA + + Sbjct: 143 GMVAGPVLGG-----LMGGFSPHAPFFAAAALNGLNFLTG 177
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 132 bits (334), Expect = 6e-40 Identities = 77/254 (30%), Positives = 125/254 (49%), Gaps = 11/254 (4%) Query: 4 KVALVTGAASGIGQALAVAFARQGVAVAGGFYPADPHDPDETRRLVEEAGGECLMLPLDV 63 K+A +TGAA GIG+A+A A QG +A Y + + + E E P DV Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE--AFPADV 66 Query: 64 ASTESVDNLASQALQAFGRIDYAVANAGLLRRAPLLEMTDARWNEMLDVDLTGVMRTFRA 123 + ++D + ++ + G ID V AG+LR + ++D W V+ TGV R+ Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 124 AARHM--GKGGALVAISSIAGGVYGWQDHSHYAAAKAGVPGLCRSLAVELAPKGIRCNAV 181 +++M + G++V + S GV + YA++KA + L +ELA IRCN V Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGV-PRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 182 IPGLIETP--QSL----DSKNSLGPEGLKQAAKAIPLGRVGRADEVASLVRFLCSDEASY 235 PG ET SL + + L+ IPL ++ + ++A V FL S +A + Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 236 LTGQSIVIDGGLTV 249 +T ++ +DGG T+ Sbjct: 246 ITMHNLCVDGGATL 259
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 97.0 bits (241), Expect = 2e-26 Identities = 74/245 (30%), Positives = 109/245 (44%), Gaps = 18/245 (7%) Query: 4 LKDKRAVITGAGSGIGAAIARAYAVEGAQLVLGDRDPTNLTKVAEECRQLGAQVYACVAD 63 ++ K A ITGA GIG A+AR A +GA + D +P L KV + A AD Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 64 VGTVEGAQAGVDACVEQFGGIDILVNNAGMLTQARCVDLSIEMWNDMLRVDLTSVFVASQ 123 V + G IDILVN AG+L LS E W V+ T VF AS+ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 124 RALPHMLAQSWGRIINVASQLGIKGGAELTHYSAAKAGVIGFTKSLALEVAKDNVLVNAI 183 +M+ + G I+ V S + Y+++KA + FTK L LE+A+ N+ N + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 184 APGPIETPL--------------VAGISSAWKTAKAAELPLGRFGLAEEVAPVAVLLGSE 229 +PG ET + + G +KT +PL + ++A + L S Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTG----IPLKKLAKPSDIADAVLFLVSG 241 Query: 230 PGGNL 234 G++ Sbjct: 242 QAGHI 246
>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature. Length = 234 Score = 30.4 bits (68), Expect = 0.006 Identities = 11/32 (34%), Positives = 16/32 (50%) Query: 228 LSRKLRRVLLEEFGLDEAFVKAAGYWKLDGED 259 L ++R L + GL + K GYWK+ D Sbjct: 169 LDFEIRHQLTQIHGLYRSSDKTGGYWKITMND 200
>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature. Length = 544 Score = 28.4 bits (63), Expect = 0.004 Identities = 13/88 (14%), Positives = 33/88 (37%), Gaps = 5/88 (5%) Query: 14 TVAAGAAQADVRPDQIAGLQKSGAIGDLEQFNKQAQAKHPGFEIHDTELDKDVGGN---Y 70 T+ + ++ + +Q++ I + ++ + + E T L Sbjct: 122 TLIPNLDKRTLKTEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEETPRL 181 Query: 71 IYQIELKDAKGVE--WNYDVNAKTGAVV 96 Y++ ++ V W Y ++A G V+ Sbjct: 182 AYEVNVRFLTPVPGNWIYMIDAADGKVL 209
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 32.9 bits (75), Expect = 0.002 Identities = 24/110 (21%), Positives = 47/110 (42%), Gaps = 8/110 (7%) Query: 67 VTGY-LARPLGGILMAHFADRLGRKRVFSLSILMMALPCLLIGIMPTYAQIGYWAPLVLL 125 T + L +G + +D+LG KR+ I++ ++ + ++ + L+ Sbjct: 55 NTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSL-------LI 107 Query: 126 ALRILQGAAVGGEVPSAWVFVAEHAPNGHRGYALGVLQAGLTFGYLLGAL 175 R +QGA V VA + P +RG A G++ + + G +G Sbjct: 108 MARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPA 157
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 55.2 bits (133), Expect = 2e-10 Identities = 67/354 (18%), Positives = 125/354 (35%), Gaps = 26/354 (7%) Query: 29 GFVIVTTEFLIIGL----LPALARDLGIS---ISNAGLLVTLFAFTVMLFGPPLTAMLSH 81 V + + IGL LP L RDL S ++ G+L+ L+A P L A+ Sbjct: 10 ILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDR 69 Query: 82 LDRKRTFIVILLIFAASNALAAVSSNIWVLALARFIPALALPVFWGTASETAGLMAGPKQ 141 R+ +V L A A+ A + +WVL + R + + + A + G + Sbjct: 70 FGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDG-DE 128 Query: 142 AGKAVAQVYLGISAAMLFGIPLGTVFADAVGWRGAFWALTALSVLMAVLLAFSMPKMAPT 201 + + M+ G LG + F+A AL+ L + F +P+ Sbjct: 129 RARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKG 187 Query: 202 EKVGLAQQARILRDPHFIANLLLSILLFTAMF---------GAYTYLADTLERIAGIESA 252 E+ L ++A A + + A+F A ++ +R ++ Sbjct: 188 ERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF-HWDAT 246 Query: 253 QVGWWLMGFGAVGLIGNA-LGGRFVDRSPLGATIAFALLLALGMTASVPAAGSLP---LL 308 +G L FG + + A + G R + ++ + A + Sbjct: 247 TIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPI 306 Query: 309 AVVLAVWGIAHTALFPICQIRVMKAAPQAQALAGTLNVSAANAGIGLGSIIGGV 362 V+LA GI AL + +V + + Q + + +G ++ Sbjct: 307 MVLLASGGIGMPALQAMLSRQVDE---ERQGQLQGSLAALTSLTSIVGPLLFTA 357
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 30.3 bits (68), Expect = 0.007 Identities = 15/45 (33%), Positives = 19/45 (42%), Gaps = 4/45 (8%) Query: 25 MPVSQQEQAQQAPRYQNTVTAQSAARRADAAALQDEMATEDELAQ 69 MP Q+Q Q + Q TAQ A A L D++AQ Sbjct: 336 MPPQAQQQQGQGQQQQAQATAQEAVAAAAVRLLNG----SDQIAQ 376
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 46.4 bits (110), Expect = 1e-07 Identities = 66/341 (19%), Positives = 116/341 (34%), Gaps = 19/341 (5%) Query: 51 IALQNLMWGLAQPFAGALADRFGAAKVVFVGGVLYAVGLLCMSMADSPLSLSLSAGLLIG 110 +AL LM P GAL+DRFG V+ V AV MA +P L G ++ Sbjct: 49 LALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAI--MATAPFLWVLYIGRIVA 106 Query: 111 IGLSGTSFSVILGVVGRALPAEKRSMGMGIASAAGSFGQFAMLPGTLGLIGWLGWSGALV 170 G++G + +V + ++R+ G SA FG P GL+G Sbjct: 107 -GITGATGAVAGAYIADITDGDERARHFGFMSACFGFG-MVAGPVLGGLMGGFSPHAPFF 164 Query: 171 VLGVM--VALILPLVGMLKDKPTESVGIQQT---LGEALREACSHSGF-WLLALGFFVCG 224 + + + + + E +++ + R A + L+A+ F + Sbjct: 165 AAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQL 224 Query: 225 FQVVFIGVHLPAYLVDQHLPAKVGTTVLALIGLFN-IFGTYTAGWLGGRMSKPRLLTALY 283 V + + H A LA G+ + + G + R+ + R L Sbjct: 225 VGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGM 284 Query: 284 LLRAVVIVLFLWIPLSQTTAYLFGVAMGLLWLSTV--PLTNGTVATLFGVRNLSMLGGIV 341 + +L + T ++ M LL + P ++ L G + Sbjct: 285 IADGTGYILLAFA----TRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSL 340 Query: 342 FLFHQLGAFLGGWLGGLVYDHTGSY--DLIWQVSILLSLLA 380 L + +G L +Y + + W L LL Sbjct: 341 AALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLC 381
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 33.5 bits (76), Expect = 0.002 Identities = 31/183 (16%), Positives = 58/183 (31%), Gaps = 11/183 (6%) Query: 25 QLQRRLTRRDAETALLDERLSMAQMAQDGLNAQLDASRDEISDLSQANAAKQADLAALRR 84 L R + + L A+ A ++L +A A Sbjct: 222 ALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSA 281 Query: 85 EVELLRQESDNARDVAQGLNQERAIKEAELRRLDAQCAALGAELREQQDSHQQRLNDLQG 144 +++ L E L + + A + L A ++ + HQ+ + Sbjct: 282 KIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKI 341 Query: 145 SR----------DELRAQFAELAGKIFD-EREQRFAETSQQQLGQLLTPLKERIQSFEKR 193 S D R +L + E + + +E S+Q L + L +E + EK Sbjct: 342 SEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKA 401 Query: 194 VEE 196 +EE Sbjct: 402 LEE 404
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 56.4 bits (136), Expect = 5e-11 Identities = 21/123 (17%), Positives = 51/123 (41%), Gaps = 14/123 (11%) Query: 180 RVLTVDDSSVARKQVSRCLETVGVEVVALNDGRQALDYLRKMVEEGKKPHEEFLMMISDI 239 +L DD + R +++ L G +V ++ ++ + ++++D+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA---------GDGDLVVTDV 55 Query: 240 EMPEMDGYTLTAAIRN-DPRMQKMHITLHTSLSGVFNQAMVKKVGADDFLAK-FRPDDLA 297 MP+ + + L I+ P + + ++ + +A + GA D+L K F +L Sbjct: 56 VMPDENAFDLLPRIKKARPDLPVLVMSAQNTFM-TAIKAS--EKGAYDYLPKPFDLTELI 112 Query: 298 ARV 300 + Sbjct: 113 GII 115
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 34.5 bits (79), Expect = 1e-04 Identities = 8/38 (21%), Positives = 21/38 (55%) Query: 107 NVNVVEEMADMISASRSFQTNAEIMNTAKSMMQKVLTL 144 VN+ EE ++ + + NA+++ TA ++ ++ + Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545 Score = 28.0 bits (62), Expect = 0.014 Identities = 18/72 (25%), Positives = 29/72 (40%), Gaps = 14/72 (19%) Query: 8 NIAGSAMSAQTTRLNTTASNIANAETVSSSADATYRARHPVFATVMQGQQSTGGSLFQDQ 67 N A S ++A LNT ++NI++ + T + Q + S Sbjct: 5 NNAMSGLNAAQAALNTASNNISSYNVAGYTRQ-----------TTIMAQAN---STLGAG 50 Query: 68 GEAGQGVQVNGI 79 G G GV V+G+ Sbjct: 51 GWVGNGVYVSGV 62
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 41.5 bits (97), Expect = 6e-06 Identities = 17/70 (24%), Positives = 29/70 (41%), Gaps = 4/70 (5%) Query: 2 SFNIGLSGLYAANKSLDVTGNNIANVATTGFKSSRAEFADQYAQSIRGTSGQTNVGSGVS 61 N +SGL AA +L+ NNI++ G+ A + VG+GV Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANST----LGAGGWVGNGVY 58 Query: 62 TAAVSQQFSQ 71 + V +++ Sbjct: 59 VSGVQREYDA 68 Score = 36.9 bits (85), Expect = 1e-04 Identities = 15/47 (31%), Positives = 23/47 (48%) Query: 395 ITGQALEESNVDLTMELVNLIKAQSNYQANAKTISTQSTIMQTTIQM 441 ++ Q S V+L E NL + Q Y ANA+ + T + I I + Sbjct: 499 LSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 29.5 bits (66), Expect = 0.012 Identities = 11/59 (18%), Positives = 23/59 (38%), Gaps = 2/59 (3%) Query: 178 GLIHTKSGRPADVDANV--QVESGFLQASNVNAVEEMTSVLALARQFELHVKMMKTAEE 234 G + NV Q+ + S VN EE ++ + + + ++++TA Sbjct: 479 GNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANA 537
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 42.6 bits (100), Expect = 9e-07 Identities = 12/41 (29%), Positives = 20/41 (48%) Query: 220 LENSNVSTVEELVNMITTQRAYEMNSKVISTADQMLQNLTQ 260 S V+ EE N+ Q+ Y N++V+ TA+ + L Sbjct: 504 QSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544 Score = 39.6 bits (92), Expect = 1e-05 Identities = 20/75 (26%), Positives = 33/75 (44%), Gaps = 14/75 (18%) Query: 5 LYVAKTGLAAQDTNLTTISNNLANVSTTGFKSDRAEFQDLLYQIKRQPGAQSTQDSELPS 64 + A +GL A L T SNN+++ + G+ RQ + +S L + Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGY--------------TRQTTIMAQANSTLGA 49 Query: 65 GLQLGTGVRIVGTQK 79 G +G GV + G Q+ Sbjct: 50 GGWVGNGVYVSGVQR 64
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 171 bits (435), Expect = 1e-55 Identities = 74/223 (33%), Positives = 113/223 (50%), Gaps = 13/223 (5%) Query: 20 IALLSGCVAPSAKPNDPYYAPVLPRTPMSAAANNGAIYQAGF-----EQNLYGDRKAFRV 74 + L+GC + P P + AN G+I+Q+ Q L+ DR+ + Sbjct: 16 VLSLTGCAWIPSTPLVQGATSAQPVPGPTPVAN-GSIFQSAQPINYGYQPLFEDRRPRNI 74 Query: 75 GDIITITLSERMAASKAASSALKKDSTNSIGLTSLFGSGLTTNNPIGSNDLSLNAGYNGK 134 GD +TI L E ++ASK++S+ +D + G + G+ + +G Sbjct: 75 GDTLTIVLQENVSASKSSSANASRDGKTNFGFDT---VPRYLQGLFGNARADV--EASGG 129 Query: 135 RATDGSGQAAQSNSLTGSVTVTVADVLPNGILAVRGEKWMTLNTGDELVRIAGLIRADDI 194 +G G A SN+ +G++TVTV VL NG L V GEK + +N G E +R +G++ I Sbjct: 130 NTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTI 189 Query: 195 ATDNTVSSTRIADARITYSGTGAFADSSQPGWFDRFF--LSPL 235 + NTV ST++ADARI Y G G ++ GW RFF LSP+ Sbjct: 190 SGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 435 bits (1119), Expect = e-155 Identities = 164/366 (44%), Positives = 218/366 (59%), Gaps = 10/366 (2%) Query: 7 LIAATLLLTTAFGAHAERLKDIASISGVRANQLIGYGLVVGLNGTGDQTTQTPFTLQTFN 66 A L T A R+KDIAS+ R NQLIGYGLVVGL GTGD +PFT Q+ Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72 Query: 67 NMLSQFGIKVPAGSGTVQLKNVAAVAVYADLPAFAKPGQTVDITVSSIGNSKSLRGGALL 126 ML GI G KN+AAV V A+LP FA PG VD+TVSS+G++ SLRGG L+ Sbjct: 73 AMLQNLGITTQGGQS--NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLI 130 Query: 127 MTPMKGVDGNVYAIAQGNLVVGGFDAEGRDGSKITVNVPSSGRIPGGASVERSVPSGFNQ 186 MT + G DG +YA+AQG L+V GF A+G D + +T V +S R+P GA +ER +PS F Sbjct: 131 MTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFKD 189 Query: 187 GNTLTLNLNRSDFTTAKRIVDKINDL----LGPGVAQALDGGSVRVTAPLDPGQRVDYLS 242 L L L DF+TA R+ D +N G +A+ D + V P ++ Sbjct: 190 SVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKP-RVADLTRLMA 248 Query: 243 ILENLEVDPGQTAAKVIINSRTGTIVIGQNVKVSPAAVTHGSLTVTITEDPIVSQPGALS 302 +ENL V+ T AKV+IN RTGTIVIG +V++S AV++G+LTV +TE P V QP S Sbjct: 249 EIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFS 307 Query: 303 GGQTAVVPRSRVNAQQELHPMFKFGPGTTLDEIVRAVNQVGAAPGDLMAILEALKQAGAL 362 GQTAV P++ + A QE + G L +V +N +G ++AIL+ +K AGAL Sbjct: 308 RGQTAVQPQTDIMAMQEGSKVA-IVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGAL 366 Query: 363 QADLIV 368 QA+L++ Sbjct: 367 QAELVL 372
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 132 bits (332), Expect = 2e-37 Identities = 67/161 (41%), Positives = 101/161 (62%), Gaps = 1/161 (0%) Query: 250 NADQFVETMLPLAKEAAARIGVDPVMLVAQAALETGWGKSIMRQQDGSSSHNLFGIKAAG 309 ++ F+ + A+ A+ + GV +++AQAALE+GWG+ +R+++G S+NLFG+KA+G Sbjct: 148 DSKAFLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASG 207 Query: 310 SWKGPEARAITSEFRDGKMVKETADFRSYTSYADSFHDLVSLLQNNNRYKEVVNSADKPE 369 +WKGP T+E+ +G+ K A FR Y+SY ++ D V LL N RY V +A E Sbjct: 208 NWKGPVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRYAAVTTAASA-E 266 Query: 370 QFVKELQKAGYATDPDYASKISQIAKQMKSYQTYAAATGSS 410 Q + LQ AGYATDP YA K++ + +QMKS + T S Sbjct: 267 QGAQALQDAGYATDPHYARKLTNMIQQMKSISDKVSKTYSM 307 Score = 61.7 bits (149), Expect = 9e-13 Identities = 54/177 (30%), Positives = 84/177 (47%), Gaps = 20/177 (11%) Query: 13 SGAYTDVNRLASLKH-GDKDSVANQKKVAQEFESLFVSQMLKAMRSANEVLAKDNPMNTA 71 + A D L LK +D AN + VA++ E +FV MLK+MR A KD ++ Sbjct: 9 ASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDAL---PKDGLFSSE 65 Query: 72 ATRQYQDMYDQQLAVTLSTRGNGIGLQDVLMRQLSKDKGIKHAAPTDQAATTADPAAPAK 131 TR Y MYDQQ+A ++ G G+GL +++++Q++ ++ P + PAAP K Sbjct: 66 HTRLYTSMYDQQIAQQMTA-GKGLGLAEMMVKQMTPEQ----PLPEEST-----PAAPMK 115 Query: 132 TGLANSV-YQRPLWATRSVAADQAAAAASASGEGRNDMALLNARRLSLPTKLTDRLL 187 L V YQ + A S G+ + +A +LSLP +L + Sbjct: 116 FPLETVVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLA-----QLSLPAQLASQQS 167
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 189 bits (480), Expect = 2e-54 Identities = 137/447 (30%), Positives = 227/447 (50%), Gaps = 17/447 (3%) Query: 2 SLISIGLSGINASSAAINTIGNNTANVDTAGYSRQQVMTTASAQINIGLGVGYIGTGTTL 61 SLI+ +SG+NA+ AA+NT NN ++ + AGY+RQ + + G++G G + Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTT--IMAQANSTLGAGGWVGNGVYV 59 Query: 62 SDVRRIYNSYLDSQLQSSTALKADATAYSGQATKTDQLLSDSTTGVAAQMTDFFTKLQSV 121 S V+R Y++++ +QL+++ + TA Q +K D +LS ST+ +A QM DFFT LQ++ Sbjct: 60 SGVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTL 119 Query: 122 ASSATQASSRSAFLTQATSVSGRFNSVAAQLTSQNDNVNAQLNTFTLQANELTKQIAGLN 181 S+A ++R A + ++ + +F + L Q+ VN + Q N KQIA LN Sbjct: 120 VSNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLN 179 Query: 182 KQI--TQASAGNTTPNSLLDSRNEAVRKLNELVGVKV-VENNGNYDVYTGTGQSLVSGAN 238 QI +PN+LLD R++ V +LN++VGV+V V++ G Y++ G SLV G+ Sbjct: 180 DQISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGST 239 Query: 239 AYTMSASPSAADPLQYNLQITYGQTKTDVT--SVVSGGSIGGLLRYRADILVPAANELGR 296 A ++A PS+ADP + + G +++ GS+GG+L +R+ L N LG+ Sbjct: 240 ARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQ 299 Query: 297 VAMVLADQMNSQMSQGIDSKGNFGSGLYTSINSADAILQRSTGNVNNSTGSGNLGVTIKD 356 +A+ A+ N+Q G D+ G+ G + A+LQ + + G +G T+ D Sbjct: 300 LALAFAEAFNTQHKAGFDANGDAGEDFFAIGKP--AVLQNT-----KNKGDVAIGATVTD 352 Query: 357 TSKLTADDYEVTFSDTNNYTIRRLPNGESVGTGALSDNPPKQFEGFSMSLSGNAVAAGDI 416 S + A DY+++F + R + T D K A D Sbjct: 353 ASAVLATDYKISFDNNQWQVTR--LASNTTFT-VTPDANGKVAFDGLELTFTGTPAVNDS 409 Query: 417 FKVTPTRNGASGIAVALTDPKDIAAAA 443 F + P + + V +TD IA A+ Sbjct: 410 FTLKPVSDAIVNMDVLITDEAKIAMAS 436 Score = 75.8 bits (186), Expect = 2e-16 Identities = 51/148 (34%), Positives = 79/148 (53%), Gaps = 11/148 (7%) Query: 544 TTTPNTRTAFEVEMTLSGTPIVN----DTFSIGLTG---AGSSDNRNALAMINLQISKSV 596 T TP +F ++ ++ D I + AG SDNRN A+++LQ + Sbjct: 401 TGTPAVNDSFTLKPVSDAIVNMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKT 460 Query: 597 GVTGGSVGTSLSGAYADIVSVVGTRTAQAKSDVTANESVLATAKAARDSVSGVSLDEEAA 656 GG+ S + AYA +VS +G +TA K+ +V+ + S+SGV+LDEE Sbjct: 461 --VGGA--KSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYG 516 Query: 657 NLIKYQQYYTASSQIIKAAQTIFSTLIN 684 NL ++QQYY A++Q+++ A IF LIN Sbjct: 517 NLQRFQQYYLANAQVLQTANAIFDALIN 544
>FLAGELLIN#Flagellin signature. Length = 507 Score = 62.0 bits (150), Expect = 2e-12 Identities = 77/461 (16%), Positives = 151/461 (32%), Gaps = 1/461 (0%) Query: 1 MRISTTQIYESTTANYQRNYSNVIKTGEEVSSGIKLNTASDDPVGAARVLQLTQQNAMLT 60 I+T + T N ++ S++ E +SSG+++N+A DD G A + T LT Sbjct: 2 QVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLT 61 Query: 61 QYESNIATISTNVDNSETAMSNITGTMQLAREAIVKAGNGTYTDASRVAIANELKQYQSQ 120 Q N + +E A++ I +Q RE V+A NGT +D+ +I +E++Q + Sbjct: 62 QASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEE 121 Query: 121 LLGLMNSQDSNGQYIFSGSKSSTPAYTESADGTY-VYNGDQTSMNLSVGDGLVLASNTTG 179 + + N NG + S + T + +L + V Sbjct: 122 IDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEAT 181 Query: 180 YEAFELSINSTRTSATRLSPATEDGKVVLSGGLVTSTSVYNSAYQGGEPYTLTFSSSTQF 239 + S + T A + V SG +VT T+ + ++ Sbjct: 182 VGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDA 241 Query: 240 RITDGTGKDVTTDASSAGNYTSGGIGAQTFTFRGVEMNLNVNLSAAEKATTATADAAMTN 299 TT +++ GA G + + T + ++ Sbjct: 242 ENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVST 301 Query: 300 RSYSLASTPDNVNATRSPGNASSATVSSSAVGTSAADLTAYNNTFPTGGAILRFTSATDY 359 T + T N +AT+ SS ++ + T + + Sbjct: 302 TINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEAN 361 Query: 360 ELYASPITGSSTPVSSGTMAGGNAKASGVNFAINGTPAAGDQFVVQSGTRQTENVLNTLT 419 + A G+ A+G ++ + Sbjct: 362 NAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPL 421 Query: 420 AAIKALSTPADGDLVATQKLNASLTSALGNLSSSIEQVSTA 460 A+I + + D + + SA+ NL +++ +++A Sbjct: 422 ASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSA 462
>FLAGELLIN#Flagellin signature. Length = 507 Score = 102 bits (256), Expect = 6e-27 Identities = 75/228 (32%), Positives = 115/228 (50%), Gaps = 3/228 (1%) Query: 2 ALTVNTNVTSLAVQKNLNRASDALSTSMSRLSSGLKVQNARDNVGVLSTIASINSQVRGQ 61 A +NTN SL Q NLN++ +LS+++ RLSSGL++ +A+D+ + S ++G Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60 Query: 62 TVAIQNANDGMSLAQTAEGALQESVSILQRMRELAVQSRNDSNSAVDRTALNKEFTAMSS 121 T A +NANDG+S+AQT EGAL E + LQR+REL+VQ+ N +NS D ++ E Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120 Query: 122 ELTRISASTNLNGKNLLDGSASTMTFQVGANTGTSNQITLTLSASFDAETLGVGSAISIV 181 E+ R+S T NG +L M QVGAN G IT+ L G ++ Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQ-MKIQVGANDGE--TITIDLQKIDVKSLGLDGFNVNGP 177 Query: 182 GSDSAASEAAFSAAITAIDSALQTISSSRADLGAAQNRLTTTISNLQN 229 + + +T D+ + R D+ + TT + + Sbjct: 178 KEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPD 225 Score = 76.6 bits (188), Expect = 6e-18 Identities = 63/281 (22%), Positives = 110/281 (39%), Gaps = 8/281 (2%) Query: 5 VNTNVTSLAVQKNLNRASDALSTSMSRLSSGLKVQNARDNVGVLSTIASINSQVRGQTVA 64 N +T+ + N + S + + + A T T Sbjct: 232 ANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKT 291 Query: 65 IQNANDGMSLAQTAEGALQESVSI---LQRMRELAVQSRNDSNSAVDRTALNKEFTAMSS 121 + N +S E I + +QS + ++V + + Sbjct: 292 GNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNE 351 Query: 122 ELTRISASTNLNGKNLLDGSASTMTFQVGANTGTSNQITLTLSASFDAETLGVGSAISIV 181 N K + + + A T+ A + ++ Sbjct: 352 SAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVST-----LI 406 Query: 182 GSDSAASEAAFSAAITAIDSALQTISSSRADLGAAQNRLTTTISNLQNINENASAALGRL 241 D+AA++ + + + +IDSAL + + R+ LGA QNR + I+NL N N ++A R+ Sbjct: 407 NEDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRI 466 Query: 242 QDTDFAAETAQLTKQQTLQQASTSILSQANQLPSAVLKLLQ 282 +D D+A E + ++K Q LQQA TS+L+QANQ+P VL LL+ Sbjct: 467 EDADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507
>INTIMIN#Intimin signature. Length = 939 Score = 53.1 bits (127), Expect = 3e-09 Identities = 49/231 (21%), Positives = 88/231 (38%), Gaps = 26/231 (11%) Query: 376 TVRVSYPGM-SGEDSVVLNWRGLSSHDTPAKTATGNELLFNVPKAWIIASQGGSASVTYT 434 TV+V V ++ KT T K + ++ G + V+ Sbjct: 681 TVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNG-----YAKVTLTSTTPGKSLVSAR 735 Query: 435 VTRDSVSKGSVPLWLTVEKELVFDTSPVTLAGKVYLIPSVPDLLPSLPAGT----SVRRQ 490 V+ +V E+ F T+ G + ++ + LP V + Sbjct: 736 VSDVAVD--------VKAPEVEFFTTLTIDDGNIEIVGTGVK--GKLPTVWLQYGQVNLK 785 Query: 491 ASGGQAPYRYTSSNPLVAKVDGN-GLTTVRGKGTATISVTDASGASKSYQVTVTKVIHCL 549 ASGG Y + S+NP +A VD + G T++ KGT TISV + + +Y + + Sbjct: 786 ASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIATPNSLIVP 845 Query: 550 GLGSGSL--SQMSSAASAKGGRIPSINELKEIYATYGNRWPLGKGNYWSST 598 + +++ + G S NEL+ ++ +G K Y+ S+ Sbjct: 846 NMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWG---AANKYEYYKSS 893
>cloacin#Cloacin signature. Length = 551 Score = 28.5 bits (63), Expect = 0.001 Identities = 14/31 (45%), Positives = 15/31 (48%) Query: 19 SGCWPFWPGPGGHGGGGHHQGPGGGGGPGPG 49 SG W G GHG GG + GGG G G Sbjct: 50 SGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80 Score = 26.2 bits (57), Expect = 0.006 Identities = 17/40 (42%), Positives = 18/40 (45%) Query: 16 SSMSGCWPFWPGPGGHGGGGHHQGPGGGGGPGPGFGPDGG 55 SS + W G G H GGG G GGG G G GG Sbjct: 40 SSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGG 79
>PF07472#Fucose-binding lectin II Length = 245 Score = 32.3 bits (73), Expect = 0.009 Identities = 17/69 (24%), Positives = 25/69 (36%) Query: 894 SWEQAVRSGNDGAQAGATMSMAGSGGLLASNAYGLGSTARATYTVIAAEQGAVRAAAWAA 953 SW+ V++ G T++ AG+ G+L A G A Y A Q Sbjct: 68 SWQNKVKADAAGQVIACTVTWAGAPGVLPGAAAKFGVGAVVNYFSKATPQPEPTQPGTTT 127 Query: 954 SGARLSSVF 962 G +F Sbjct: 128 GGGERDGIF 136
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 229 bits (586), Expect = 3e-78 Identities = 66/209 (31%), Positives = 102/209 (48%), Gaps = 7/209 (3%) Query: 29 QAAEDDPWEGVNRAIFRFN-DVVDTYTLKPLAKGYQYVAPQFVEDGVHNFFNNIGDVGNL 87 Q DP EG NR ++ FN +V+D Y ++P+A ++ PQ +G+ NF N+ + + Sbjct: 25 QQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWRDYVPQPARNGLSNFTGNLEEPAVM 84 Query: 88 ANDVLQAKPAAAGVDTARLIFNTTFGLLGFIDVGTHMGLQ---RNDEDFGQTLGHWGVGS 144 N LQ P V R NT G+ GFIDV + FG TLGH+GVG Sbjct: 85 VNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGMANPKLQRTEPHRFGSTLGHYGVGY 144 Query: 145 GPFVVIPLLGPSTVRDAFAKIPDTYTTPYRYIDHVPTRNTALGVNLVDTRASLLSAERMI 204 GP+V +P G T+RD + D ++ + + ++TRA LL ++ ++ Sbjct: 145 GPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTWPMSVGKW-TLEGIETRAQLLDSDGLL 203 Query: 205 --SGDRYTFIRNAYLQNREFKVKDGQVED 231 S D Y +R AY Q +F G+++ Sbjct: 204 RQSSDPYIMVREAYFQRHDFIANGGELKP 232
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 27.2 bits (60), Expect = 0.011 Identities = 13/55 (23%), Positives = 23/55 (41%), Gaps = 4/55 (7%) Query: 18 RVDADVNLIHAGQVIPAVCIDLSSSGMQVQAPRSFSVGDKL----NVSIDSDHPA 68 RV VN + + S + VQ PR + + N+++++D PA Sbjct: 207 RVADVVNAFARARYGDPIAEPRDSQEIAVQKPRVADLTRLMAEIENLTVETDTPA 261
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 116 bits (292), Expect = 8e-31 Identities = 42/129 (32%), Positives = 59/129 (45%), Gaps = 1/129 (0%) Query: 4 TSATLLIIDDDEVVRASLAAYLEDSGFSVLQASNGLQGIQIFEQKTPDLVVCDLRMPQMG 63 T AT+L+ DDD +R L L +G+ V SN + DLVV D+ MP Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 64 GLELIRQVTSIAPQTPVIVVSGAGVMSDAVEALRLGAADYLIKPLEDLAVLEHSVRRALD 123 +L+ ++ P PV+V+S A++A GA DYL KP DL L + RAL Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGRALA 120 Query: 124 RARLLKENQ 132 + Sbjct: 121 EPKRRPSKL 129
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 27.4 bits (61), Expect = 0.018 Identities = 10/35 (28%), Positives = 15/35 (42%), Gaps = 6/35 (17%) Query: 19 ISMSSVGSQSPVIEKHV----SIAELCE--VREPD 47 + SPV EKH+ ++ ELC + D Sbjct: 93 YRQQDLVDYSPVSEKHLADGMTVGELCAAAITMSD 127
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.6 bits (186), Expect = 6e-19 Identities = 35/123 (28%), Positives = 60/123 (48%), Gaps = 6/123 (4%) Query: 6 RILIIDDQRPNLELMEQLLAREGLTNVL-SSTEPLRTLDLFNSFEPDLVVLDLHMPEFDG 64 IL+ DD ++ Q L+R G + S+ L + + DLVV D+ MP+ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATL--WRWIAAGDGDLVVTDVVMPDENA 62 Query: 65 FAVLEQLNRRIPTNDYVPIMVLTADATRDTRLRALALGARDFISKPLDALETMLRIWNLL 124 F +L ++ + P +P++V++A T T ++A GA D++ KP D E + I L Sbjct: 63 FDLLPRIKKARP---DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119 Query: 125 ETR 127 Sbjct: 120 AEP 122
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 57.9 bits (140), Expect = 8e-11 Identities = 21/114 (18%), Positives = 47/114 (41%), Gaps = 5/114 (4%) Query: 653 GKVLCIEDNLSSMALIETLLQRRPGIRLLSSMQGQLGLDLARQHAPQLILLDLNLPDLQG 712 +L +D+ + ++ L R G + + L++ D+ +PD Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 713 LEVLQRLRRLPATAHTPILMITADAS-DTVQRTLQAAGATAILTKPIQVPAFLA 765 ++L R+++ A P+L+++A + T + + GA L KP + + Sbjct: 63 FDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASE-KGAYDYLPKPFDLTELIG 113
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 59.1 bits (143), Expect = 2e-12 Identities = 28/144 (19%), Positives = 46/144 (31%), Gaps = 5/144 (3%) Query: 6 RLVLADDHEVTRTGFVSLLAGHPEFEVVGQAANGQQAIELCEELQPDIAILDIRMPVLNG 65 +++ADD RT L+ V +N D+ + D+ MP N Sbjct: 5 TILVADDDAAIRTVLNQALSR--AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 66 LGAARLLQQRMPKLKVVIFTMDDSTDHLEAAISAGAVGYLLKDASRDEVIASLQRVARGE 125 +++ P L V++ + ++ A GA YL K E+I + R Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA---L 119 Query: 126 EALNSAVSARLLRRMTERNTSGAS 149 S G S Sbjct: 120 AEPKRRPSKLEDDSQDGMPLVGRS 143
>adhesinb#Adhesin B signature. Length = 310 Score = 57.5 bits (139), Expect = 1e-11 Identities = 32/152 (21%), Positives = 55/152 (36%), Gaps = 12/152 (7%) Query: 132 IAVQPGQGVDGLNSQP---------WLASNNMGRMADVMAADLVRLAPAAKPKIEGNLAA 182 AV G V L Q WL N A +A L PA K E NL A Sbjct: 117 YAVSEGVDVIYLEGQSEKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKA 176 Query: 183 LKQQLLKLSASSEASLAS--ADNLSVVSLSDRFGYLVSGLNLELIDSQAL-TDEQWTPEA 239 ++L L ++ + + +V+ F Y N+ + T+E+ TP+ Sbjct: 177 YVEKLSALDKEAKEKFNNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQ 236 Query: 240 VQKLAKTLKDNDVALVLDHRQPPEPVKAAIAQ 271 ++ L + L+ V + + +++ Sbjct: 237 IKTLVEKLRKTKVPSLFVESSVDDRPMKTVSK 268
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 162 bits (411), Expect = 5e-50 Identities = 73/305 (23%), Positives = 134/305 (43%), Gaps = 11/305 (3%) Query: 9 TLLRVLLIGLCATLMAPLSHAADPAKRLRIGITLHPYYSYVANIVGDKAEVVPLIPAGFN 68 TLL + L + A ++L++ T NI GDK ++ ++P G + Sbjct: 6 TLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIVPIGQD 65 Query: 69 PHAYEPRAEDIKRIGSLDVVVLNGV-----GHDDFADRMIAASEKPDIKTIEANADVPLL 123 PH YEP ED+K+ D++ NG+ G+ F + A + + + V ++ Sbjct: 66 PHEYEPLPEDVKKTSEADLIFYNGINLETGGNAWFTKLVENAKKTENKDYFAVSDGVDVI 125 Query: 124 AATGVAARGAGKVVNPHTFLSISASIAQVNNIARELGKLDPDNAKTYTANARAYGKRLRQ 183 G +G +PH +L++ I NIA++L DP+N + Y N + Y +L + Sbjct: 126 YLEGQNEKGKE---DPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKLDK 182 Query: 184 MRADALAKLTKAPNADLRVATVHAAYDYLLREFGLEVTAVVEPAHGIEPSPSQLKKTIDQ 243 + ++ K K P + T A+ Y + +G+ + E E +P Q+K +++ Sbjct: 183 LDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEGTPEQIKTLVEK 242 Query: 244 LRELDVKVIFSEMDFPSTYVDTIQRESGVKLY-PLSHISYGEY--SADKYEKEMAGNLDT 300 LR+ V +F E + T+ +++ + +Y + S E D Y M NLD Sbjct: 243 LRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMMKYNLDK 302 Query: 301 VVRAI 305 + + Sbjct: 303 IAEGL 307
>PF06580#Sensor histidine kinase Length = 349 Score = 36.4 bits (84), Expect = 1e-04 Identities = 23/131 (17%), Positives = 41/131 (31%), Gaps = 29/131 (22%) Query: 229 GDDVQYEGQCKPLKTQPMALRSCLQNLVDNALRYA-------GSAKIVIEDGADRVKISV 281 D +Q+E Q P +Q LV+N +++ G + V + V Sbjct: 237 EDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296 Query: 282 VDHGPGIAPELHESVFEPFYRLEGSRNRNSGGVGMGMTIAREAARRIGGE---LSLEQTP 338 + G E G G+ RE + + G + L + Sbjct: 297 ENTGSLALKNTKE------------------STGTGLQNVRERLQMLYGTEAQIKLSEKQ 338 Query: 339 GGGLTAVLYLP 349 G A++ +P Sbjct: 339 GKV-NAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 88.0 bits (218), Expect = 2e-22 Identities = 37/130 (28%), Positives = 63/130 (48%), Gaps = 1/130 (0%) Query: 2 RALIVDDDVAIRELLCDYLTRFNIQARGVTDGAQMRLALSEESFDVVVLDLMLPGEDGLS 61 L+ DDD AIR +L L+R R ++ A + ++ D+VV D+++P E+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 LCRWLRST-SDIPILMLTARCEPTDRIIGLELGADDYMAKPFEPRELVARIQTVLRRVRD 120 L ++ D+P+L+++A+ I E GA DY+ KPF+ EL+ I L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 ERSDQRSTIR 130 S + Sbjct: 125 RPSKLEDDSQ 134
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 288 bits (738), Expect = 4e-87 Identities = 200/702 (28%), Positives = 307/702 (43%), Gaps = 87/702 (12%) Query: 142 GSTVTLTNS-TSTGVTAGASVTHFSLLNLQNSTLTGNGTSGLGLRLIAGAAEASGSSITG 200 S +TL + G AG + ++++LQ +T+ AG A G+ G Sbjct: 225 ASELTLDGGHITGGRAAGVAAMQGAVVHLQRATIRRGDAP-------AGGAVPGGAVPGG 277 Query: 201 TKQGVLVVAEQGYREGSLS--LDASQVTGQTGAAIRVAQSN---PTSALPIAVIN----V 251 G G+ G LD +G+++ +AQS P I V Sbjct: 278 AVPG-------GFGPGGFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIRVGRGARVT 330 Query: 252 NNGSTLTGGNGNILETADG-----SHATLNV---NDSRLNGNVQVDASSTATVTLNQSS- 302 +G +L+ +GN++ET A L++ + G + V L + Sbjct: 331 VSGGSLSAPHGNVIETGGARRFAPQAAPLSITLQAGAHAQGKALLYRVLPEPVKLTLTGG 390 Query: 303 --LTGDIVAE--------SGGTANVRLDNGSLLTGRLENTRSVAVGNGSQWTMVDNGNVE 352 GDIVA S G +V L + + TG S+++ N + W M DN NV Sbjct: 391 ADAQGDIVATELPSIPGTSIGPLDVALASQARWTGATRAVDSLSIDNAT-WVMTDNSNVG 449 Query: 353 NLVMNG-GAV---QLGEAAAFYTLSVANLSGSGTFRMDVDFGGAQTDFIDITGSATGSHQ 408 L + G+V Q EA F L+V L+GSG FRM+V +D + + A+G H+ Sbjct: 450 ALRLASDGSVDFQQPAEAGRFKVLTVNTLAGSGLFRMNVFADLGLSDKLVVMQDASGQHR 509 Query: 409 LLVGSTGSDPTTDTSLHVVHAQAGDAS---FALVGGRVDLGTWSYDLIKQGDNDWYLDAT 465 L V ++GS+P + +L +V G A+ A G+VD+GT+ Y L G+ W L Sbjct: 510 LWVRNSGSEPASANTLLLVQTPLGSAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGA 569 Query: 466 TRTIGPAPQ------------------------------TVLALFNA-----APTVWYGE 490 P P A N A T+WY E Sbjct: 570 KAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWYAE 629 Query: 491 LSSLRTRMGELRANGGRSGVWMRSYGNKFNVANASGFGYKQVQHGTALGADGSIPTSNGQ 550 ++L R+GELR N G W R + + + N +G + Q G LGAD ++ + G+ Sbjct: 630 SNALSKRLGELRLNPDAGGAWGRGFAQRQQLDNRAGRRFDQKVAGFELGADHAVAVAGGR 689 Query: 551 WLAGVMAGQSTSDLDLDLGANGKVDSYYVGAYSTWLDSQSGYYLDGVIKLNRFNNKARVN 610 W G +AG + D G DS +VG Y+T++ + SG+YLD ++ +R N +V Sbjct: 690 WHLGGLAGYTRGDRGFTGDGGGHTDSVHVGGYATYI-ADSGFYLDATLRASRLENDFKVA 748 Query: 611 LSDGTRTKGDYSNSGVGASVEFGRHIKLDGSYYVEPYTQLIGALIESKDYELDNGLRAEG 670 SDG KG Y GVGAS+E GR +++EP +L Y NGLR Sbjct: 749 GSDGYAVKGKYRTHGVGASLEAGRRFTHADGWFLEPQAELAVFRAGGGAYRAANGLRVRD 808 Query: 671 DSTRSLLGKVGVTTGRNFDMGQGRIVQPYLRVALAHEFVKSNEVKVNENRFDNDISGSRG 730 + S+LG++G+ G+ ++ GR VQPY++ ++ EF + V N ++ G+R Sbjct: 809 EGGSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRA 868 Query: 731 ELGAGVAVAFSERLEAHMDFEYSNGSSIEQPWGANVGLRYNW 772 ELG G+A A + +EYS G + PW + G RY+W Sbjct: 869 ELGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW 910
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 759 bits (1960), Expect = 0.0 Identities = 269/871 (30%), Positives = 427/871 (49%), Gaps = 56/871 (6%) Query: 9 LIPVRLRFMRLLLVCGSGALVLKPSSSAAATLQFQSGFLRQGPGYSSDAGVQALDSLTDT 68 + F+RL + C A + ++A L F FL P +D L + Sbjct: 20 KHRLAGFFVRLFVACAFAA----QAPLSSAELYFNPRFLADDPQAVAD-----LSRFENG 70 Query: 69 QDLVPGNYWIEIYVNTRYFGQRQIRFIQRPTDEGLVPCFSSPMLEQMGLRVESLAEPALL 128 Q+L PG Y ++IY+N Y R + F +++G+VPC + L MGL S++ LL Sbjct: 71 QELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLL 130 Query: 129 Q-EQCVDLLRLVPGSQIEFDGGRLQLSLSVPQVAMRRDMIGQVDPALWDHGINAAFFSYQ 187 + CV L ++ + + D G+ +L+L++PQ M G + P LWD GINA +Y Sbjct: 131 ADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYN 190 Query: 188 ASAQQSTATHTGRRNSADLYLNSGINLGAWRLRSNQSIR-----HDEEGGRQWKRAYAYA 242 S G + A L L SG+N+GAWRLR N + +W+ + Sbjct: 191 FSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWL 250 Query: 243 QRDLPGTHANLTLGETYTAGDVFASVPIEGALIRTDQEMLPDALQGYAPVIRGVAQSRAK 302 +RD+ + LTLG+ YT GD+F + GA + +D MLPD+ +G+APVI G+A+ A+ Sbjct: 251 ERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQ 310 Query: 303 LEVLQNGYPIYSTYVSAGPYVIEDLT-TAGSGELEVVLTEADGQVRRFIQPYATISNLLR 361 + + QNGY IY++ V GP+ I D+ SG+L+V + EADG + F PY+++ L R Sbjct: 311 VTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQR 370 Query: 362 EGVWRYSAALGRY-NGARDSEQPWLWQGTLAMGIGWNSTLYGGLMTSDIYHAGALGISRD 420 EG RYS G Y +G E+P +Q TL G+ T+YGG +D Y A GI ++ Sbjct: 371 EGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKN 430 Query: 421 MGQLGALAFDLTHSRADTDRLDENSVQGMSYAIKYGKAF-ATDTSLRFAGYRYSTEGYRD 479 MG LGAL+ D+T + + D++ G S Y K+ + T+++ GYRYST GY + Sbjct: 431 MGALGALSVDMTQANSTLP--DDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFN 488 Query: 480 FDEAVRQRDQ-------------------SNTFSGSRRSRLEASIHQRIGSRSSLGMTLS 520 F + R + ++R +L+ ++ Q++G S+L ++ S Sbjct: 489 FADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGS 548 Query: 521 QQDYWGTRSEQRQYQFNFNTRYAGITYNLYASQSLSEGRNRNSDRQIGLSLSMPLDIGHS 580 Q YWGT + Q+Q NT + I + L S + + + D+ + L++++P Sbjct: 549 HQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGR-DQMLALNVNIPFSHWLR 607 Query: 581 SNVTFD----------TQSSGSRHSQRASLSGSL-DDNRLSYRTSLSSDDG----HQRSV 625 S+ + R + A + G+L +DN LSY G + Sbjct: 608 SDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTG 667 Query: 626 GLSAGYQAAFGSVGAGVTQGTGYRSTSINANGAVLLHADGIELGPNLGDTIALVQVPGTP 685 + Y+ +G+ G + + +G VL HA+G+ LG L DT+ LV+ PG Sbjct: 668 YATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAK 727 Query: 686 GVGILNATGVETNRQGYALVPYLRPYRYNQIALQTDQLGPEVEIENGSAQVVPTRGAVIK 745 + N TGV T+ +GYA++PY YR N++AL T+ L V+++N A VVPTRGA+++ Sbjct: 728 DAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVR 787 Query: 746 TTFAARTVTRLIITARTAGGQPLPFGARISDATGKPLGIAGQGGQVLIATDARPQTLDVR 805 F AR +L++T +PLPFGA ++ + + GI GQV ++ + V+ Sbjct: 788 AEFKARVGIKLLMTLT-HNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVK 846 Query: 806 WGEQGEPQCQLHIDPASMPQTDGYRLQELTC 836 WGE+ C + Q C Sbjct: 847 WGEEENAHCVANYQLPPESQQQLLTQLSAEC 877
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 94.1 bits (234), Expect = 2e-24 Identities = 41/159 (25%), Positives = 68/159 (42%), Gaps = 4/159 (2%) Query: 3 QTATILVIDDEPQIRKFLRISLVSQGYKVLEAATGAEGLTQAALNKPDLLVLDLGLPDMD 62 ATILV DD+ IR L +L GY V + A A DL+V D+ +PD + Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 63 GQQVLSEFREWSA-VPVLVLSVRASEAQKVQALDAGANDYVTKPFGIQEFLARVRSLLRQ 121 +L ++ +PVLV+S + + ++A + GA DY+ KPF + E + + L + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 122 SSGIEKP---DAALSFGPLTVDLAYRRVLLDGNEVALTR 157 D+ + A + + + T Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTD 160
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 30.2 bits (68), Expect = 0.009 Identities = 10/43 (23%), Positives = 20/43 (46%) Query: 103 DEINRATPKSQSALLEAMEEGQVSIEGATRLLPDPFFVIATQN 145 DEI +Q+ LL +++G+ + G + ++A N Sbjct: 238 DEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATN 280
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 31.4 bits (71), Expect = 0.004 Identities = 17/70 (24%), Positives = 28/70 (40%), Gaps = 8/70 (11%) Query: 165 DRHDLRLLIKRVRYAAEAYPELSHQPKNMQARLKSAQSE-LGDWHDHLQWLAQAGEQPDL 223 + DLR I V E+S+Q + L +Q + L + +WL+Q + L Sbjct: 521 NGQDLRTGILTVD-------EISNQSTTLNKLLGGSQCQPLNKAQEVQKWLSQNNKSSYL 573 Query: 224 APCIAGWQIG 233 C +G Sbjct: 574 TQCKMDKSLG 583
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 70.6 bits (173), Expect = 9e-15 Identities = 30/120 (25%), Positives = 51/120 (42%), Gaps = 4/120 (3%) Query: 608 ILIVDDETGVREIAADLLSDQGYDVFEAADCISALEQARTLDRLDLLITDIGLPGPMNGI 667 IL+ DD+ +R + LS GYDV ++ + DL++TD+ +P N Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA-GDGDLVVTDVVMPD-ENAF 63 Query: 668 MLAQELTASRPTLKVLFITGYTKAEGITEGQSLGKMLF--KPFSLIEFSDSVKSILSKNE 725 L + +RP L VL ++ + G + KPF L E + L++ + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 33.8 bits (77), Expect = 2e-04 Identities = 12/19 (63%), Positives = 16/19 (84%) Query: 5 QRGFTLLEVMVAILLMSIV 23 QRGFTLLE+M+ +LLM + Sbjct: 3 QRGFTLLEMMLILLLMGVS 21
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 50.7 bits (121), Expect = 1e-10 Identities = 17/61 (27%), Positives = 32/61 (52%) Query: 5 RQQGFTLIELMVVLVIIGIASAAVSLSIKPDADALLRKDSQRLAQLLQIAQAEARADGRP 64 RQ+GFTL+E+M++L+++G+++ V L+ D + R L+ Q G+ Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQF 61 Query: 65 I 65 Sbjct: 62 F 62
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 26.5 bits (58), Expect = 0.046 Identities = 24/124 (19%), Positives = 48/124 (38%), Gaps = 23/124 (18%) Query: 18 LLAALAGVVVWSSLL-MTSAQSSAPVQTSVTQE-----------GGSASPARQWFANQ-- 63 L ++ W L + SS + + ++ G S + + Sbjct: 25 LFCQQLAMIFWRIGLPDNAPVSSVQITPAQARQQPVTLNDFTLFGVSPEKNKAGALDASQ 84 Query: 64 -----PSQVQISVSGVMAG--ARGAVAVVRLNDGPARSVMAGERL-ARDVRLVAIEADGV 115 PS + +S++GVMAG ++A++ D S E + + ++V+I D V Sbjct: 85 MSNLPPSTLNLSLTGVMAGDDDSRSIAIIS-KDNEQFSRGVNEEVPGYNAKIVSIRPDRV 143 Query: 116 VIER 119 V++ Sbjct: 144 VLQY 147
>PilS_PF08805#PilS N terminal Length = 185 Score = 32.2 bits (73), Expect = 3e-04 Identities = 9/35 (25%), Positives = 21/35 (60%) Query: 7 ERGFTLVEVLVALAIIAVSMSAAVRVAGGMTQSNG 41 ++G TL+EVL+ + +I V ++A ++ + + Sbjct: 25 DKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQ 59
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 163 bits (414), Expect = 6e-55 Identities = 62/139 (44%), Positives = 86/139 (61%), Gaps = 6/139 (4%) Query: 14 RAQAGFTLIEIMVVVVILGILAAIVVPKVLDRPDQARATAARQDIGGLMQALKLYRLDHG 73 Q GFTL+EIMVV+VI+G+LA++VVP ++ ++A A DI L AL +Y+LD+ Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNH 64 Query: 74 SYPTQNQGLKVLVERP-ANVSKSNWRS--YLERLPNDPWGRPYNYLNPGVNGEVDIFSLG 130 YPT NQGL+ LVE P +N+ Y++RLP DPWG Y +NPG +G D+ S G Sbjct: 65 HYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLSAG 124 Query: 131 ADGQPDGDGVNADIGSWQL 149 DG+ + DI +W L Sbjct: 125 PDGEMGTED---DITNWGL 140
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 142 bits (359), Expect = 6e-39 Identities = 68/253 (26%), Positives = 110/253 (43%), Gaps = 24/253 (9%) Query: 171 AQVNIRVRFAEVSRSELLRYGVNW-------NALFNNGTFSFGLLTG-------GGLASG 216 QV + AEV ++ L G+ W N+G + G G ++S Sbjct: 345 PQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSS 404 Query: 217 AAGGASNVISAGLASGNVNIDAMLEALQSNGVLEVLAEPNITAMTGQTASFLAGGEVAVP 276 A S+ N +L AL S+ ++LA P+I + A+F G EV P Sbjct: 405 LASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEV--P 462 Query: 277 VPVNREVVG-------IEYKPYGVSLLFSPTLLPNGRIALQVRPEVSSLMSTTTLDVNGY 329 V + +E K G+ L P + + L++ EVSS+ + + Sbjct: 463 VLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDL 522 Query: 330 QVPSFRVRRADTRVEVGSGQTFAIAGLFQRESSQDMDKVPMLGDMPILGNLFRSKRFQRN 389 +F R + V VGSG+T + GL + S DKVP+LGD+P++G LFRS + + Sbjct: 523 GA-TFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVS 581 Query: 390 ETELVILITPYLV 402 + L++ I P ++ Sbjct: 582 KRNLMLFIRPTVI 594
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 35.3 bits (81), Expect = 8e-05 Identities = 21/113 (18%), Positives = 42/113 (37%), Gaps = 7/113 (6%) Query: 83 AERAFQRALELKANDPDALLGLGTAQLRQGKLERAVTALTQAADAS-QQPTAWNRLGIAH 141 A + FQ L D LGLG + G+ + A+ + + A ++P Sbjct: 55 AHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIKEPRFPFHAAECL 114 Query: 142 ILLGQAKPAQTAFNTSLRLAPND-----LDTRCNLALAYALGDDSQKALQTIE 189 + G+ A++ + L + L TR + L A+ + + ++ Sbjct: 115 LQKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLE-AIKLKKEMEHECVD 166 Score = 31.1 bits (70), Expect = 0.003 Identities = 17/114 (14%), Positives = 34/114 (29%), Gaps = 7/114 (6%) Query: 88 QRALELKANDPDALLGLGTAQLRQGKLERAVTALTQ--AADASQQPTAWNRLGIAHILLG 145 E+ ++ + L L Q + GK E A D + LG +G Sbjct: 26 AMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDH-YDSRFFLGLGACRQAMG 84 Query: 146 QAKPAQTAFNTSLRLAPNDLDTRCNLALAYALGDDSQKALQ----TIETVSQSP 195 Q A +++ + + + A + +A E ++ Sbjct: 85 QYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADKT 138
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 98.4 bits (245), Expect = 3e-26 Identities = 58/175 (33%), Positives = 82/175 (46%), Gaps = 11/175 (6%) Query: 4 FRLERFLVRSLM--LLALAGFSCVLNAAPDHEPDWFSKPYAYVLVDQDIRGALTEFGQNL 61 F L F R L LL L+ +S E DW PY YV + +R LT+FG N Sbjct: 3 FPLHSFFKRVLTGTLLLLSSYSWA------QELDWLPIPYVYVAKGESLRDLLTDFGANY 56 Query: 62 DLIVVFSDKVRGSARGTVRGASAGEFLSRLCDANQLSWYFDGNVLHIAQSDEVGTRVFDL 121 D VV SDK+ G + +FL + L WY+DGNVL+I ++ EV +R+ L Sbjct: 57 DATVVVSDKINDKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRL 116 Query: 122 PGPKLDELQHYLAQLEVSGQPMSSRASPDHDSLFVSGPPAYL---AQIQQHLDRQ 173 + EL+ L + + R + ++VSGPP YL Q L++Q Sbjct: 117 QESEAAELKQALQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQ 171
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 75.0 bits (184), Expect = 2e-17 Identities = 42/164 (25%), Positives = 74/164 (45%), Gaps = 7/164 (4%) Query: 27 LYTNLGEREANAMLAVLLRDGIPASRKVQDNGQLKVMVDEKRFAQAMAVLDDAGLPGQSF 86 L++NL +++ A++A L + IP + + + V + + L GLP Sbjct: 53 LFSNLSDQDGGAIVAQLTQMNIPY--RFANGSG-AIEVPADKVHELRLRLAQQGLP--KG 107 Query: 87 SNMG-EVFKGNGLVSSPVQERAQMVYALSEELSHTVSQIDGILSARVHVVLPDNDLLKRV 145 +G E+ S E+ AL EL+ T+ + + SARVH+ +P L R Sbjct: 108 GAVGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVRE 167 Query: 146 ISPSSASVLVRFDPKTDIN-VLIPQIKTLVANGISGLGYDGVSV 188 SASV V +P ++ I + LV++ ++GL V++ Sbjct: 168 QKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLPPGNVTL 211
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 68.9 bits (168), Expect = 2e-15 Identities = 65/262 (24%), Positives = 103/262 (39%), Gaps = 40/262 (15%) Query: 104 EQAWLGWIEP---LEAI----------LGEPLQVVPWDADP-----------TARCLGVS 139 E+ W WI+P LE + G VVPW A + R L V Sbjct: 47 EKRWSAWIKPGDWLEHVSPALAGAAVSAGAEHLVVPWLAATERPFELPVPHLSCRRLCVE 106 Query: 140 LEVHTADFPAARVELRMNSAAADHVAALLERHAMPDQGALQALRLVMSAEAGHAPLRVDE 199 V + P ++ M+ L E A+ G + LR + G + + Sbjct: 107 NPVPGSALPEGKLLHIMSDRGGLWFEHLPELPAV-GGGRPKMLRWPLRFVIGSSDTQRSL 165 Query: 200 LRSLAPGDVVMLDTLPDDQVRLRIGQHLQAYARRSGRSLEWCGPWRGSDPDLSAVTHLNR 259 L + GDV+++ T +V + L + R G + + + H+ Sbjct: 166 LGRIGIGDVLLIRTSRA-EVYCYAKK-LGHFNRVEGGII----------VETLDIQHIEE 213 Query: 260 NDAMNEPTVTPDLDVSLDALPLTLVCQLGSVELTLEQLRAMAPGTLLPLASSGQDEVDLM 319 N T T + L+ LP+ L L +TL +L AM LL L ++ + V++M Sbjct: 214 E---NNTTETAETLPGLNQLPVKLEFVLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIM 270 Query: 320 VNGRRIGRGELVRIGDGLGVRL 341 NG +G GELV++ D LGV + Sbjct: 271 ANGVLLGNGELVQMNDTLGVEI 292
>PF04183#IucA / IucC family Length = 580 Score = 161 bits (408), Expect = 1e-44 Identities = 109/476 (22%), Positives = 169/476 (35%), Gaps = 55/476 (11%) Query: 170 LRDRPYHPLAKAKQGLDEQQYRAYQAEFAKPVVLNWVAVDKTLLQCGEGVADLKASFPAR 229 L P K ++G ++ Y E+A L+W+AV + + Sbjct: 133 LSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEMDIHQLLTA 192 Query: 230 YLLPTDLQARLDQEMQVRGIAHSHVALPVHPWQFDHVLEAQVGDALAKGDCLRLDFQEAS 289 + P + AR Q Q G+ H+ + LPVHPWQ+ + A+G + L Sbjct: 193 AMDPQEF-ARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADFAEGRMVSLGEFGDQ 251 Query: 290 VFATSSLRSMTPCFDSAD--YLKLPMAIYSLGASRYLPAVKMINGNLSEALLRQVVEKDE 347 A SLR++T +KLP+ IY+ R +P + G L+ L+QV D Sbjct: 252 WLAQQSLRTLT-NASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQVFATDA 310 Query: 348 TLGRS-LHLCDERTWWAF-MPTGASLFDEGPRH---LSAMLRRYPAALLDDPECRLLPMA 402 TL +S + E A+L R+ L + R P L P+ + MA Sbjct: 311 TLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEMLGVIWRENPCRWL-KPDESPVLMA 369 Query: 403 ALGTPLPGSNRHFFDEWMAYRELPRNQASVLTLFRELSHSFFDINLRML-RLGMLGEVHG 461 L N+ ++ L T +L +L R G+ HG Sbjct: 370 TLMECDEN-NQPLAGAYIDRSGLD-----AETWLTQLFRVVVVPLYHLLCRYGVALIAHG 423 Query: 462 QNAVLVWKAGQAQGLLLRD-HDSLRIFVPWL-ERNGMQDPVYRMKKGHANTLYHERPEDL 519 QN L K G Q +LL+D +R+ E + + + + + L Sbjct: 424 QNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPEMDSLP-------QEVRDVTSRLSADYL 476 Query: 520 LFWLQTLGIQVNVRAIMDTLAQVYDIPVTALWTVLRDVL-DYLITTIEFDEEARNMLRHQ 578 + LQT G V V + L +P + +L VL DY+ + E R Sbjct: 477 IHDLQT-GHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMKKHPQMSE------RFA 529 Query: 579 LFEVPNWPQKLLLTPMIARA-------------GGPGSMPFGKGQVVNPFHRLRRE 621 LF L P I R GG +P + NP + +E Sbjct: 530 LFS--------LFRPQIIRVVLNPVKLTWPDLDGGSRMLPNYLEDLQNPLWLVTQE 577
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 83.5 bits (206), Expect = 1e-20 Identities = 74/297 (24%), Positives = 116/297 (39%), Gaps = 40/297 (13%) Query: 10 SRRKVLRLSLGLLVLPGLTLPGIARAAPLRVVTLFQGASDTAVALGVTPCGVVDS----- 64 SRR++L +L + A P R+V L + +ALG+ P GV D+ Sbjct: 8 SRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINYRL 67 Query: 65 WSEKPMYRYLRPALAAVPHVGLETQPSLEDIVLLKPDLIVASRFRHQRIAPLLEQIAPLV 124 W +P P +V VGL T+P+LE + +KP +V S P E +A + Sbjct: 68 WVSEP------PLPDSVIDVGLRTEPNLELLTEMKPSFMVWS----AGYGPSPEMLARIA 117 Query: 125 MLEEVFEF----------KRTLAMMGAALNRQQQAMALLGQWQQRVTTLREQLKARFAGR 174 F F +++L M LN Q A L Q++ + +++ + R R Sbjct: 118 PG-RGFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKR-GAR 175 Query: 175 WPITVSVLDVREDHIRSYLPASFAGSVLSELGFD--WTPAAREAQGVSLKLSSKESLPVV 232 + +++D R H+ + P S +L E G W E S + L Sbjct: 176 PLLLTTLIDPR--HMLVFGPNSLFQEILDEYGIPNAWQ---GETNFWGSTAVSIDRLAAY 230 Query: 233 DADLFFIFQRGDSKAAQNTYEKLVQHPFWKQLRAPQDGQVWRVDAVAWSLSGGILGA 289 F +SK L+ P W+ + + G+ RV AV W G L A Sbjct: 231 KDVDVLCFDHDNSKDMD----ALMATPLWQAMPFVRAGRFQRVPAV-W-FYGATLSA 281
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.1 bits (195), Expect = 3e-20 Identities = 26/113 (23%), Positives = 49/113 (43%), Gaps = 7/113 (6%) Query: 15 GLVLVVEDEQTIRDFVCEILETDVGLRTKAVENADEAMKYLQQNINKVALLLTDVRMPGS 74 +LV +D+ IR + + L G + NA +++ L++TDV MP Sbjct: 4 ATILVADDDAAIRTVLNQALS-RAGYDVRITSNAATLWRWIAAG--DGDLVVTDVVMPD- 59 Query: 75 MDGIALANVVGSQWSHIPVVVMSGHGTPGS--DQLKDDVL-FIAKPWTITQLV 124 + L + +PV+VMS T + + ++ KP+ +T+L+ Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI 112
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 113 bits (283), Expect = 3e-32 Identities = 72/263 (27%), Positives = 116/263 (44%), Gaps = 7/263 (2%) Query: 1 MNSKRFHAATVVITGACRGIGEGIAERFAREGANLVMVSNADRINETARRIVELTGAQVL 60 MN+K ITGA +GIGE +A A +GA++ V E ++ Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60 Query: 61 PVVADVTNEQEVIDLYAQAQARFGRVDVSVQNAGIITIDHFDRMPRADFDRVLQVNTTGV 120 ADV + + ++ A+ + G +D+ V AG++ + +++ VN+TGV Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120 Query: 121 WLCCREAAKHMINSGRGGRLINTSSGQGRQGFIYTPHYAASKMGVIGITHSLAHELAPHG 180 + R +K+M++ R G ++ S YA+SK + T L ELA + Sbjct: 121 FNASRSVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179 Query: 181 ITVNAFCPGIIESEMWEYNDRVWGQILSTPDKTYGTGELMAEWVAGIPMKRAGTARDVAG 240 I N PG E++M +W G+ E + GIP+K+ D+A Sbjct: 180 IRCNIVSPGSTETDM---QWSLWADENGAEQVIKGSLE---TFKTGIPLKKLAKPSDIAD 233 Query: 241 LVTFLASADAAYITGQSINVDGG 263 V FL S A +IT ++ VDGG Sbjct: 234 AVLFLVSGQAGHITMHNLCVDGG 256
>SALSPVAPROT#Salmonella virulence plasmid 28.1kDa A protein signature. Length = 255 Score = 35.6 bits (81), Expect = 0.002 Identities = 24/82 (29%), Positives = 39/82 (47%), Gaps = 1/82 (1%) Query: 121 LPFHRPFEQIKAVLEEKGVHSLDLLQKTSYYYPNFCYQNFRSTSLRSAMLSASGIDPETT 180 LP+H P +Q++ L V D++ ++S +P F + + AM AS + PE Sbjct: 133 LPYHFPHDQVELSLLNTDVSLEDIISESSIDWPWFLSNSLTGDNSNYAMELASRLSPEQQ 192 Query: 181 TLLLDQQATSKPDFFSITYGTN 202 TL + ++ D S Y TN Sbjct: 193 TLPTEPDNSTATDLTSF-YQTN 213
>cloacin#Cloacin signature. Length = 551 Score = 29.7 bits (66), Expect = 0.023 Identities = 38/148 (25%), Positives = 62/148 (41%), Gaps = 23/148 (15%) Query: 17 RDLQSVREALEFSCHAQIAAVEHQCEDTRNE-SQCSENLLESAIQQEQAAHQALESAQQA 75 R + R E+ + A E E R E +Q +E++ A QE+ A A Q Sbjct: 299 RQDEENRRQQEWDATHPVEAAERNYERARAELNQANEDV---ARNQERQA-----KAVQV 350 Query: 76 LDSSQSWIGSAESSLAACLAQPD-----ANDDGAGPDCSWEYACVD--EAQADTDQAQSM 128 +S +S + +A +LA +A+ A+D AG W+ A + AQ D + Q+ Sbjct: 351 YNSRKSELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRAQTDVNNKQAA 410 Query: 129 LELA-------QADFERATENRQAMERR 149 + A A A E+R+ E + Sbjct: 411 FDAAAKEKSDADAALSSAMESRKKKEDK 438
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 30.5 bits (69), Expect = 0.034 Identities = 11/73 (15%), Positives = 26/73 (35%), Gaps = 1/73 (1%) Query: 4 SLNTPDHTMTRITEALADYREGLARINREFDAAALKKDRAILDLQKDMAQHLTPLAEETV 63 S+ + ++T AL +E L I + +A+ I + L + Sbjct: 33 SITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAAHLLVLDD-PELVDGIK 91 Query: 64 HRLMAERKQEDAS 76 ++ E+ + + Sbjct: 92 GKIENEQMNAEYA 104
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 42.0 bits (98), Expect = 1e-05 Identities = 58/370 (15%), Positives = 110/370 (29%), Gaps = 22/370 (5%) Query: 68 ANERAALATELLDKRAAFEVELYDKRAGLEVELHNKRVGLEVELRNKRTGLSDELRTLRT 127 NE +A+AT E DK L K L + + + L Sbjct: 37 TNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSN 96 Query: 128 DAERKIAENREQQTSSLEEEIAKLRAKRLSEVGDAENLERDRIRIDISKEREAWAKHHED 187 E+ ++ + + + + R L + + + K EA Sbjct: 97 AKEKLRKNDKSLSEKASKIQELEARKADLEKALEGA-MNFSTADSAKIKTLEAEKAALAA 155 Query: 188 ARALLDREYSELAKQKAALSALQGDIHGRKTELEISERNLERREQRQEQQWN--RRNDQL 245 +A L++ A SA + K LE + LE+ + + Sbjct: 156 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKT 215 Query: 246 AEDLAANLEEAHKSLNRHKESYVEDNQRLRDSLATQTDLIGVFEQLKRQLGGKDPAEVLR 305 E A L L + E + + + T E + +L E Sbjct: 216 LEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAEL------EKAL 269 Query: 306 ELNSQTDELKRLREDLATRPTEDMRLRTQAFESEYKTQKARADELSRQIESNSADVAEVG 365 E + + E + + A L R ++++ Sbjct: 270 EGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDAS-------- 321 Query: 366 ELRRKNAEFHAQNVSLSHRASIFEGSANEAQAELNRLRTAYERPAEVEARHKEIEIPHIA 425 R + A++ L + I E S + +L+ R A ++EA H+++E + Sbjct: 322 --REAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAK---KQLEAEHQKLEEQNKI 376 Query: 426 AEKVVQPAQR 435 +E Q +R Sbjct: 377 SEASRQSLRR 386 Score = 33.9 bits (77), Expect = 0.003 Identities = 50/298 (16%), Positives = 90/298 (30%), Gaps = 20/298 (6%) Query: 110 ELRNKRTGLSDELRTLRTDAERKIAENREQQTSSLEEEIAKLRAKRLSEVGDAENLERDR 169 L ++ L L + A+ + + E + ++ E + Sbjct: 152 ALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSA 211 Query: 170 IRIDISKEREAWAKHHEDARALLDREYSELAKQKAALSALQGDIHGRKTELEISERNLER 229 + E+ A A D L+ + A + L+ + LE + LE+ Sbjct: 212 KIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKA----ALEARQAELEK 267 Query: 230 REQRQEQQWNRRNDQLAEDLA--ANLEEAHKSLNRHKESYVEDNQRLRDSLATQTDLIGV 287 + + ++ A A LE L + + Q LR L + Sbjct: 268 ALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAK-- 325 Query: 288 FEQLKRQLGGKDPAEVLRELNSQTDELKRLREDLATRPTEDMRLRT--QAFESEYKTQKA 345 +QL+ + ++ + + LR DL +L Q E + K +A Sbjct: 326 -KQLEAEH-----QKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEA 379 Query: 346 RADELSRQ----IESNSADVAEVGELRRKNAEFHAQNVSLSHRASIFEGSANEAQAEL 399 L R E+ + E K A N L + E E QA+L Sbjct: 380 SRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKL 437
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 35.6 bits (82), Expect = 3e-04 Identities = 37/149 (24%), Positives = 56/149 (37%), Gaps = 24/149 (16%) Query: 41 VLIEGPRGMAKSTLARGLADL--LASGQFVTLPLGATEERLVGTLDLDAAL--SESRA-- 94 ++I G G K +AR L D +G FV + + A L +++ L E A Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDL-----IESELFGHEKGAFT 217 Query: 95 ---RFSPGVLAKADGGVLYVDEVNLLADHLVDLLLDVAASGVNLVERDGISHRHAARFVL 151 S G +A+GG L++DE+ + LL V G G + + Sbjct: 218 GAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGE--YTTVGGRTPIRSDVRI 275 Query: 152 IGTMNP------EEGELRPQLLDRFGLNV 174 + N +G R L R LNV Sbjct: 276 VAATNKDLKQSINQGLFREDLYYR--LNV 302
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 395 bits (1016), Expect = e-135 Identities = 132/371 (35%), Positives = 191/371 (51%), Gaps = 41/371 (11%) Query: 168 QVLLERRHALTEMPRLEPE-----SSSYGLISKSEPMRQTCQLVGKVLHSAYTVLLTGET 222 +++ AL E R + L+ +S M++ +++ +++ + T+++TGE+ Sbjct: 110 ELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGES 169 Query: 223 GTGKEVVARAIHTCGPRRKKAFVVQNCAAFPENLLESELFGYRKGAFTGADRDRRGLFDI 282 GTGKE+VARA+H G RR FV N AA P +L+ESELFG+ KGAFTGA G F+ Sbjct: 170 GTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQ 229 Query: 283 ADGGTLLLDEIGDMPLGLQAKLLRVLQEGEIRPLGSDTVRNVDVRIIAATHRDLPALISQ 342 A+GGTL LDEIGDMP+ Q +LLRVLQ+GE +G T DVRI+AAT++DL I+Q Sbjct: 230 AEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQ 289 Query: 343 GRFREDLYYRLAQFPVSLPPLRQRVEDIEPLARQFASDACSSLRREPVRWSESALSFLCD 402 G FREDLYYRL P+ LPPLR R EDI L R F A + R+ + AL + Sbjct: 290 GLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKA 348 Query: 403 YSFPGNVRQLKGFVERAVLLSDDGHLLPEHFP---------------------------- 434 + +PGNVR+L+ V R L + E Sbjct: 349 HPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAV 408 Query: 435 -------VATGAERSSHGVTLRERMEHFERDVLLESLRKSNGNRTQTARKLGVSRRTLLY 487 A+ + + E ++L +L + GN+ + A LG++R TL Sbjct: 409 EENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRK 468 Query: 488 RMMRLDINSVR 498 ++ L ++ R Sbjct: 469 KIRELGVSVYR 479
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 35.6 bits (82), Expect = 8e-04 Identities = 18/134 (13%), Positives = 37/134 (27%), Gaps = 21/134 (15%) Query: 404 ARVRISLAAAPQRLERLRTRYAEDQRQLDAMRRDAQAGLGVDERVLLTLEEGLQELQRQI 463 + R R+ R ++ +LD VL E + + Sbjct: 210 DKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVL--------EQENKY 261 Query: 464 SEVEAVWNQQRTDVEQLLRLRGQLSALRAQHDLAANTDTERYTELFALIESLETELADVH 523 E N+ R QL ++ ++ + + ++ L + +L Sbjct: 262 VE---AVNELRVYKSQLEQIESEILSAKEEYQLVTQL----------FKNEILDKLRQTT 308 Query: 524 QTLTEATERLVSFE 537 + T L E Sbjct: 309 DNIGLLTLELAKNE 322
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.0 bits (62), Expect = 0.018 Identities = 9/46 (19%), Positives = 18/46 (39%), Gaps = 1/46 (2%) Query: 89 RGRGVARLMCEHSQQLARDSGFLAMQFNSVVATNEVAVALWHKLGF 134 R +GV + + + A+++ F + N A + K F Sbjct: 102 RKKGVGTALLHKAIEWAKENHFCGLML-ETQDINISACHFYAKHHF 146
>PF05272#Virulence-associated E family protein Length = 892 Score = 33.5 bits (76), Expect = 7e-04 Identities = 14/37 (37%), Positives = 18/37 (48%) Query: 26 VLRGLNFSVRSGECLVLGGASGTGKSTLLRTLYGNYL 62 V R + + +VL G G GKSTL+ TL G Sbjct: 585 VARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGLDF 621
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 27.5 bits (61), Expect = 0.039 Identities = 40/146 (27%), Positives = 55/146 (37%), Gaps = 31/146 (21%) Query: 1 MLKFFAALLFVCSGLVQAQDTLH----TDLPLDYLAQ--ATTDKPDKPLVIFIHGYGSNA 54 ++K + LL + A + T LP++ Q A + PLVIF+ G G Sbjct: 5 LIKILSVLLLCSTANAFADEFADNLGLTLLPVEPSTQVNAASSHTKPPLVIFLSGDGG-W 63 Query: 55 ADLFSLKDRLPADY---NYLSVQAPVELQSDSYKWFTRKPGSAEYDGVTEELKSSTERLT 111 A L D+ V V S Y W + P K T+ Sbjct: 64 ATL----DKAVGGILQQQGWPV---VGWSSLKYYWKQKDP------------KDVTQDTL 104 Query: 112 AFIRQATATYKTQPDKVFLIGFSQGA 137 A I + A + TQ KV LIG+S GA Sbjct: 105 AIIDKYQAEFGTQ--KVILIGYSFGA 128
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 233 bits (596), Expect = 5e-69 Identities = 113/512 (22%), Positives = 212/512 (41%), Gaps = 39/512 (7%) Query: 266 GMSVGVFGLQRASVGELMPELQKMFGPDSGMPLAGMVRFLPIERTNSVVAISSQPEYLRE 325 + V L + +L P L+++ AG+ + E +N ++ ++ + ++ Sbjct: 126 EVVTRVVPLTNVAARDLAPLLRQL------NDNAGVGSVVHYEPSNVLL-MTGRAAVIKR 178 Query: 326 VGEWIHTIDEGGGNEPQMYVYDVRNMKATDLAKYLRQIYG---TGAIKDDSAAKVAPGLR 382 + + +D G + + A D+ K + ++ A+ A V R Sbjct: 179 LLTIVERVDNAGDRS--VVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADER 236 Query: 383 TTSLSSLNGTGSNGMSSSNGMGSGGISSGGGMGNGMNGSGGGFGNSQGMNSQNGTVSESG 442 T ++ + S I + M ++ GN++ + + S+ Sbjct: 237 TNAV----------LVSGEPNSRQRIIA---MIKQLDRQQATQGNTKVIYLKYAKASDLV 283 Query: 443 EEQGGAESDSAGEEGGGSAGNSKSLDASTRITAQKSSNQLLVRTRPAQWKEIESAIKRLD 502 E G S E+ +LD + I A +N L+V P ++E I +LD Sbjct: 284 EVLTGISSTMQSEKQAA--KPVAALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLD 341 Query: 503 NPPLQVQIETRILEVKLTGDLDMGVQWYLGRLAGNAGTSGNVTNTAGSQGA--------- 553 QV +E I EV+ L++G+QW T+ + + GA Sbjct: 342 IRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTV 401 Query: 554 LGAGGAVLAGTDSLFYSFVSNNLQIALRALETNGRTQVLSAPSLVVMNNQQAQIQVGDNI 613 + + L+ + + F N + L AL ++ + +L+ PS+V ++N +A VG + Sbjct: 402 SSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEV 461 Query: 614 PISQTTVNTNASATTLSSVEYVQTGVILDVVPRINPGGLVYMDIQQQVSDADTGSTDLNG 673 P+ T T + ++VE G+ L V P+IN G V ++I+Q+VS ++ + Sbjct: 462 PV-LTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSS 520 Query: 674 --NPRISTRSVATQVAAQSGQTVLLGGLIKQDNAESVSSVPYLGRIPGLKWLFGRTSRAK 731 +TR+V V SG+TV++GGL+ + +++ VP LG IP + LF TS+ Sbjct: 521 DLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKV 580 Query: 732 DRTELIVLITPRVITSSSQARQVTDDYRQQMQ 763 + L++ I P VI + RQ + Sbjct: 581 SKRNLMLFIRPTVIRDRDEYRQASSGQYTAFN 612 Score = 99 bits (249), Expect = 8e-24 Identities = 58/282 (20%), Positives = 109/282 (38%), Gaps = 10/282 (3%) Query: 93 AAAPAAKAGETGDIVFNFTNQPIQAVINSIMGDLLHENYSIAQGVKGEVSFSTSKPVNKQ 152 AA + + +F IQ IN++ +L ++ I V+G ++ + +N++ Sbjct: 17 FAALLFRPAAAEEFSASFKGTDIQEFINTVSKNL-NKTVIIDPSVRGTITVRSYDMLNEE 75 Query: 153 QALSILETLLSWTDNAMIKQGNR--YVILPSNQAVAGKLVPEMRVAQPSAGMSARLFPLR 210 Q ++L A+I N V+ + A V + R+ PL Sbjct: 76 QYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLT 135 Query: 211 YISANEMQKLLKPFARENAFLLV--DPARNVLSMAGTPEELANYQDTIDTFDVDWLKGMS 268 ++A ++ LL+ V NVL M G + + VD S Sbjct: 136 NVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIV--ERVDNAGDRS 193 Query: 269 VGVFGLQRASVGELMPELQKMFGPDSG--MPLAGMVRFLPIERTNSVVAISSQPEYLREV 326 V L AS +++ + ++ S +P + + + ERTN+V+ +S +P + + Sbjct: 194 VVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVL-VSGEPNSRQRI 252 Query: 327 GEWIHTIDEGGGNEPQMYVYDVRNMKATDLAKYLRQIYGTGA 368 I +D + V ++ KA+DL + L I T Sbjct: 253 IAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQ 294
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 94.7 bits (235), Expect = 3e-26 Identities = 38/204 (18%), Positives = 77/204 (37%), Gaps = 5/204 (2%) Query: 16 QRRAPKGEKRREELLDAALQVFSLEGYTGASVAKVAAIVGISVAGLLHHFPSKISLLMGV 75 ++ + ++ R+ +LD AL++FS +G + S+ ++A G++ + HF K L + Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62 Query: 76 LERRDEVNGRIAAQV---RTDDSLTGLLGGLRAINQSNSTAPGVVRAFSILNAESLL--D 130 E + G + + D L+ L L + +S T I+ + + Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122 Query: 131 NQPAYEWFQTRYARIHAHLLAQFTALVERGEVRADVDLDMLIQQILSMMDGLQIQWLRFP 190 + + + + +E + AD+ + + GL WL P Sbjct: 123 MAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAP 182 Query: 191 ERVDLVKTFDAYIAQVDAAVRARP 214 + DL K Y+A + P Sbjct: 183 QSFDLKKEARDYVAILLEMYLLCP 206
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 30.3 bits (68), Expect = 0.015 Identities = 15/64 (23%), Positives = 27/64 (42%), Gaps = 1/64 (1%) Query: 43 SGQRVFSGLSVALLVMGFVSPAVSWLILRLGARQVLQLGSVLAAAGCCVLALCETVPVWF 102 + + +G+ + V+G V +S I+ A Q L + A + L + P+ F Sbjct: 265 TRTKAAAGVELTTKVLGNVGKGISQYIIAQRAAQGLSTSAAAAGLIASAVTLAIS-PLSF 323 Query: 103 LGWA 106 L A Sbjct: 324 LSIA 327
>PF07201#Hypersensitivity response secretion protein HrpJ Length = 293 Score = 198 bits (506), Expect = 1e-63 Identities = 48/250 (19%), Positives = 84/250 (33%), Gaps = 24/250 (9%) Query: 29 PKNPLQDSMEEVAMKFSESVERHSKGLDERHVRESTS--SQRVERVEKLAELYRLLDNAD 86 + D EEV FSE R LD+R + +S + S E+V + L+ Sbjct: 45 TLQSIADMAEEVTFVFSE---RKELSLDKRKLSDSQARVSDVEEQVNQYLSKVPELEQ-- 99 Query: 87 QPSLEQQARRLQGQLQQQGS-----LKDVLAQAGGDPTRADLLLQQVVRMSATEGKEDTH 141 +Q L L + LK L +P+ +L + + Sbjct: 100 ----KQNVSELLSLLSNSPNISLSQLKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHL 155 Query: 142 ----DQAMALIDELRLSHGDKIRAGLN-TASAIALFSSDPQQRSAMRLLYYKAIVGQQPL 196 +QA + + G+ I G T A S +R Y A++G Q + Sbjct: 156 SHLVEQA---LVSMAEEQGETIVLGARITPEAYRESQSGVNPLQPLRDTYRDAVMGYQGI 212 Query: 197 ASLLESLLERFNEDQFARGLRTLQRALADDIAALAPSIPGAALRAMLRGLGASGQLNNLI 256 ++ L +RF + LQ+AL+ D+ + L ++ L + ++ Sbjct: 213 YAIWSDLQKRFPNGDIDSVILFLQKALSADLQSQQSGSGREKLGIVISDLQKLKEFGSVS 272 Query: 257 KTCLALLQRL 266 Q Sbjct: 273 DQVKGFWQFF 282
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 27.7 bits (61), Expect = 0.028 Identities = 12/47 (25%), Positives = 18/47 (38%), Gaps = 1/47 (2%) Query: 3 AKPALHKPVPPRPPEPKPRPTGSSGNETA-QPTTRFERREHEPSETR 48 AK K + P+ KP G A Q T+ + + ET+ Sbjct: 456 AKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETK 502
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 52.7 bits (126), Expect = 7e-10 Identities = 33/181 (18%), Positives = 65/181 (35%), Gaps = 36/181 (19%) Query: 168 QWPISVPLLLGHLNLSPSQLASLRPGDVLLPDHSLFTPDGQGTLQLGGCRLSLAQTSADA 227 +WP+ ++G + S L + GDVLL S A+ Sbjct: 149 RWPL--RFVIGSSDTQRSLLGRIGIGDVLLIRTS----------------------RAEV 184 Query: 228 LCFTLTELEQIPMNATIDHFSAADDHPLHLDDIDEHEHHPEADSTDANEDGLQRFNDLSM 287 C+ + HF + + + +H E ++T + L N L + Sbjct: 185 YCYA----------KKLGHF--NRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPV 232 Query: 288 ALTVRAGNLSLSLGQLRSLAVGSVLTFNGCTPGHAMLHHGERVLAHGELVDVEGRLGLQI 347 L +++L +L ++ +L+ + + +L +GELV + LG++I Sbjct: 233 KLEFVLYRKNVTLAELEAMGQQQLLSLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEI 292 Query: 348 T 348 Sbjct: 293 H 293
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 231 bits (590), Expect = 2e-79 Identities = 71/218 (32%), Positives = 126/218 (57%), Gaps = 7/218 (3%) Query: 7 NPLTLALFLGALSLAPLLMIICTAFLKIAMVLLITRNAIGVQQAPPNMALYGIALAATLF 66 N ++L L +L P ++ T F+K ++V ++ RNA+G+QQ P NM L G+AL ++F Sbjct: 3 NDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMF 62 Query: 67 IMAPVFSEMGDRVKKLPEHLDTFAAMESAGKHVVEPLRTFMTRNLDPDIQTHLLENTQRM 126 +M P+ + + + +++ ++ R ++ + D ++ + Sbjct: 63 VMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKR 122 Query: 127 WPKEMA-------DKASRDDLLLVVPAFVLSELQAGFQIGFLIYIPFIVIDLIVSNILLA 179 E D+ + + ++PA+ LSE+++ F+IGF +Y+PF+V+DL+VS++LLA Sbjct: 123 QYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLA 182 Query: 180 LGMQMVAPMTISLPLKILLFVLVDGWTRLLDGLFYSYM 217 LGM M++P+TIS P+K++LFV +DGWT L GL YM Sbjct: 183 LGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYM 220
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 58.6 bits (142), Expect = 3e-15 Identities = 31/83 (37%), Positives = 45/83 (54%) Query: 2 ETLTLFKQAMMLVVVLSAPPLIVAVVVGVITSLLQAVMQLQDQTLPFAIKLVAVGLALAL 61 + + +A+ LV++LS P IVA ++G++ L Q V QLQ+QTLPF IKL+ V L L L Sbjct: 3 DLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFL 62 Query: 62 TGRWIGIELMQLAYLSFSMISQT 84 W G L+ + Sbjct: 63 LSGWYGEVLLSYGRQVIFLALAK 85
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 149 bits (377), Expect = 5e-46 Identities = 41/244 (16%), Positives = 97/244 (39%), Gaps = 5/244 (2%) Query: 19 GMARLYPCLFLIPAFAFTELKGMLRHAIVLALALIPMPAIRMGLTGHELDWLDLCALLLK 78 + R+ + P + + ++ + + + P++ + L ++ Sbjct: 19 PLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDV--PVFSFFALWLAVQ 76 Query: 79 ESVIGLLLGLLLAMPFWLFESIGCLFDNQRGALVGGQINPALGDNTSELGHMLKQVLILL 138 + +IG+ LG + F + G + Q G ++PA N L ++ + +LL Sbjct: 77 QILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALLL 136 Query: 139 MILGGGYASLTQIMWDSYLVWPATQWVPVTGAAGFEVYLKLVASTFRFMVLYAAPLVGLL 198 + G+ L ++ D++ P + F K + F ++ A PL+ LL Sbjct: 137 FLTFNGHLWLISLLVDTFHTLPIGG--EPLNSNAFLALTKAGSLIFLNGLMLALPLITLL 194 Query: 199 LMIEFGMAILSLYSPQLQVSTLAMPAKSLAGLFFLVLYMPMLTLLGEGRLADLSD-LRHL 257 L + + +L+ +PQL + + P G+ + MP++ E +++ + L + Sbjct: 195 LTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLADI 254 Query: 258 LPLM 261 + + Sbjct: 255 ISEL 258
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 375 bits (965), Expect = e-132 Identities = 113/350 (32%), Positives = 196/350 (56%), Gaps = 6/350 (1%) Query: 2 SEKTEEPTQKKLDDARKKGQVGQSQDVPKLFIFAALMEMILGLVDGGMSRLKALIALPLT 61 EKTE+PT KK+ DARKKGQV +S++V + AL M++GL D L+ +P Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62 Query: 62 ELDRPFNAALGEVLTKAGWELLLFMLPVLGIAAAMRLAGGWVQFGPLFATDSLKLDFERL 121 + PF+ AL V+ E P+L +AA M +A VQ+G L + +++K D +++ Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKI 122 Query: 122 NPINQFKQMFSSRQLFNLFNSLCKAVMITCVLYVLLPPALGDLIGLARTDLDSYWMALVE 181 NPI K++FS + L S+ K V+++ ++++++ L L+ L ++ L + Sbjct: 123 NPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQ 182 Query: 182 LFTHLSRTCLGLLLVLAGLDFALQKYFFVKGQRMSHEDIRKEYKESEGDPHMKSHRKALA 241 + L C +V++ D+A + Y ++K +MS ++I++EYKE EG P +KS R+ Sbjct: 183 ILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQFH 242 Query: 242 REITDQPGSAAPARAPVEDADMLLVNPTHFAVALFYRPEQTPLPRIICKGRDAEARELIE 301 +EI + R V+ + +++ NPTH A+ + Y+ +TPLP + K DA+ + + + Sbjct: 243 QEIQSR-----NMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRK 297 Query: 302 RAREAGVPVVRFVWLARTLYRE-NVGQFIPRATLQAVAQVYRLLREMDEQ 350 A E GVP+++ + LAR LY + V +IP ++A A+V R L + + Sbjct: 298 IAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNIE 347
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 31.1 bits (70), Expect = 0.035 Identities = 29/155 (18%), Positives = 48/155 (30%), Gaps = 21/155 (13%) Query: 13 VHGATSQGHNPRGLEQRPEPPTQRASVSVVQLGKQPVQVPVTQQPDIPPRTFGPTPGALT 72 +HGA G + Q E P QP+ V + D+ P P Sbjct: 24 IHGAVVAGLLYTSVHQVIELPA----------PAQPISVTMVAPADLEPPQAVQPPPEPV 73 Query: 73 PTAAPE-QTAPQLDADDIAHISSARRPPVTRSSSTGSERPTTALQRELSFKDWLPSQESS 131 PE + P+ + I + P + +R+ + ES Sbjct: 74 VEPEPEPEPIPEPPKEAPVVIEKPKPKPKP---KPKPVKKVEQPKRD------VKPVESR 124 Query: 132 PARSDHQPGPSRSGGNTP-AQSHASGSTQDASPRP 165 PA P+R +T A + ++ + PR Sbjct: 125 PASPFENTAPARPTSSTATAATSKPVTSVASGPRA 159
>cloacin#Cloacin signature. Length = 551 Score = 46.6 bits (110), Expect = 2e-07 Identities = 35/95 (36%), Positives = 40/95 (42%), Gaps = 13/95 (13%) Query: 245 GASKGGGGGGGGGGGGGVAPTGTGGGGGAPSVGGGGGGGGGSPSVGGGGGGGGGGGGGTP 304 GA G GG G GV + G G + GGG G GGG G G GGG G Sbjct: 12 GAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNG-- 69 Query: 305 SLGGGGGTPSIGGGGSTPAP---------TPGAGG 330 GGG+ + G + AP TPGAGG Sbjct: 70 --NSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGG 102 Score = 35.1 bits (80), Expect = 7e-04 Identities = 29/91 (31%), Positives = 34/91 (37%), Gaps = 4/91 (4%) Query: 272 GAPSVGGGGGGGGGSPSVGGGGGGGGGGGGGTPSLG-GGGGTPSIGGGGSTPAPTPGAGG 330 G G G S ++ GG G G GGG + G P GG GS G+G Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62 Query: 331 GTPTPTGPTGTPSPTGPTGTGTSGSATPVSF 361 G G G TG S A PV+F Sbjct: 63 G---NGGGNGNSGGGSGTGGNLSAVAAPVAF 90
>PF06704#DspF/AvrF protein Length = 129 Score = 175 bits (444), Expect = 6e-60 Identities = 69/127 (54%), Positives = 83/127 (65%) Query: 1 MANSQRDMQRFIARLSATLGTPLTLQNGVCALYDGQQRQAAVIEVAAHSDHVVIHSRLGQ 60 M NS D R I L A LGT LT QNGVCALYD Q +AAVIE+ HS+ V+ H R+G+ Sbjct: 1 MNNSPTDFSRLIKSLGAQLGTSLTAQNGVCALYDSQDNEAAVIEMPDHSEMVIFHCRVGR 60 Query: 61 LRKSPENLQRLLSANFDTAKLRGCWLALDQQDVRLCTQRELAGLDEGTFCDLVNGFIAQT 120 +LQ+LLS NFD A++ G W A+DQ DVRLC QRELA LDE FCD GFI Q Sbjct: 61 SPDRAADLQKLLSLNFDVARMHGSWFAVDQGDVRLCAQRELAVLDEAQFCDTARGFIVQA 120 Query: 121 QQTRTAV 127 ++ R + Sbjct: 121 REARALL 127
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 494 bits (1273), Expect = e-170 Identities = 158/532 (29%), Positives = 245/532 (46%), Gaps = 51/532 (9%) Query: 12 VPEEWRQSAYAYEASQTPLTKVLSDFASSYGVGLD-SRGITGVVDAKIRAGNAQEFLDRL 70 +W Y Y A L +L+DF ++Y + S I V + N Q+FL + Sbjct: 27 QELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDKVSGQFEHDNPQDFLQHI 86 Query: 71 ALEHQFQWFLYNGKLYVSPQSGQVSQRLEVSADAAPDLKQALTDIGLLDKRFGWGELPDE 130 A + W+ LY+ S S+ + + A +LKQAL G+ + RFGW Sbjct: 87 ASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQALQRSGIWEPRFGWRPDASN 146 Query: 131 GVVLVSGPARYVELIRGFSK-------EKVKAQDKHQVMMFSLRYAAVADREIQYREQSI 183 +V VSGP RY+EL+ + + + + +F L+YA+ +DR I YR+ + Sbjct: 147 RLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIFPLKYASASDRTIHYRDDEV 206 Query: 184 TIPGVATLLDGLLESQHRPPLPQDPAANIRAMQDMADMGQSKIMNLASNRKATPARSGES 243 PGVAT+L +L + + ++ Sbjct: 207 AAPGVATILQRVLSDATIQQV--------------------------TVDNQRIPQAATR 240 Query: 244 KSNSNRRVVADVRNNAVLIYDDPEKRETYQQLVQQLDQPSNLVEIDAVILDIDRSQLSSL 303 S R V AD NA+++ D PE+ YQ+L+ LD+PS +E+ I+DI+ QL+ L Sbjct: 241 ASAQAR-VEADPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQLTEL 299 Query: 304 ESRWSARAGSVN----------FGSSLLTGGS--STLFINDFDRFFADIQALEGQGVASV 351 W + N S++ + G+ S + D A + LE +G A V Sbjct: 300 GVDWRVGIRTGNNHQVVIKTTGDQSNIASNGALGSLVDARGLDYLLARVNLLENEGSAQV 359 Query: 352 IARPSVLTLENQPAVIDFSRTAYITTTGERVANVQPVTAGTSLRVIPRTIAGEQPNRFQL 411 ++RP++LT EN AVID S T Y+ TG+ VA ++ +T GT LR+ PR + + L Sbjct: 360 VSRPTLLTQENAQAVIDHSETYYVKVTGKEVAELKGITYGTMLRMTPRVLTQGDKSEISL 419 Query: 412 IVDIEDGQLERTRDN--DTPDVKRGTVSTQAVIGENRSLVIGGFHVDESGERQDKVPILG 469 + IEDG + P + R V T A +G +SL+IGG + DE KVP+LG Sbjct: 420 NLHIEDGNQKPNSSGIEGIPTISRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLLG 479 Query: 470 SLPVIGALFTSKRHEVSRRERLFILTPRLVGDQLDPSRYIARENRPQLDRAL 521 +P IGALF K R RLFI+ PR++ + + + ++A N L + Sbjct: 480 DIPYIGALFRRKSELTRRTVRLFIIEPRIIDEGI--AHHLALGNGQDLRTGI 529
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 71.2 bits (174), Expect = 2e-17 Identities = 32/173 (18%), Positives = 62/173 (35%), Gaps = 13/173 (7%) Query: 1 MKVRTEARREAIIDAAASVFLEMGYERTSMNEVTKRMGGSKATIYSYFPSKEDLFIAVVN 60 K + R+ I+D A +F + G TS+ E+ K G ++ IY +F K DLF + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64 Query: 61 RHATAHLAEAVSELATYSEKALDLRGLLSRFGERMLAMLINDNTALDVYRMVVA------ 114 +++ E E A LS E ++ +L + T ++ Sbjct: 65 LS-ESNIGELELEYQ-----AKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCE 118 Query: 115 ESGRSEIGMMFYESGPRQCMQTISTLMAQAMQNGQLRK-IDPDLAALQLTSLL 166 G + + + I + ++ L + AA+ + + Sbjct: 119 FVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYI 171
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 30.4 bits (68), Expect = 0.024 Identities = 26/76 (34%), Positives = 33/76 (43%), Gaps = 15/76 (19%) Query: 313 LPDNSTYNAAAASNTLARVMPNAIRNALGTLGLVAAR-----TQPSVFPLPSRS------ 361 LP N+T AS L + P A +A T LVA T SVF L +RS Sbjct: 843 LPTNTTNKVRFASYALIKNAPFARYSA--TPNLVAINQHDFGTIESVFELANRSNDIDTL 900 Query: 362 --VSGGEKEEDLEILL 375 SG + + L+ LL Sbjct: 901 YANSGAQGRDLLQTLL 916
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 65.9 bits (161), Expect = 3e-16 Identities = 18/73 (24%), Positives = 29/73 (39%), Gaps = 3/73 (4%) Query: 11 AIALSYDGQ--SAPTLSAKGDDQLAEAILDIAREYEVPIYENAELVK-LLARLELGDSIP 67 AI + Y P ++ K D + + IA E VPI + L + L + IP Sbjct: 268 AIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYIP 327 Query: 68 EPLYRTIAEIIAF 80 AE++ + Sbjct: 328 AEQIEATAEVLRW 340
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 29.9 bits (67), Expect = 0.002 Identities = 30/154 (19%), Positives = 52/154 (33%), Gaps = 38/154 (24%) Query: 9 ITQLPSQIHMLEMQAAEEGFRFLTRLIVE-----WGSGANRFDAP--------------- 48 I ++ + ++M + E F R+I W RF P Sbjct: 2 IMKM-THLNMKDFNKPNEPFVVFGRMIPAFENGVWTYTEERFSKPYFKQYEDDDMDVSYV 60 Query: 49 ---GECLMAASLDGCLIGIGGVSVDPYMQNGVGRLRRLYVSPVARRQNVGRVLVERLVE- 104 G+ L+ IG + + NG + + V+ R++ VG L+ + +E Sbjct: 61 EEEGKAAFLYYLENNCIGRIKIRSN---WNGYALIEDIAVAKDYRKKGVGTALLHKAIEW 117 Query: 105 ----HAAGYFRIVRLYTDTTDGDA--FYLQCGFR 132 H G + L T + A FY + F Sbjct: 118 AKENHFCG----LMLETQDINISACHFYAKHHFI 147
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 27.8 bits (62), Expect = 0.014 Identities = 11/31 (35%), Positives = 14/31 (45%), Gaps = 1/31 (3%) Query: 113 ATLLTGLFAIVFTAGGGYHSAAWIRRRSTAR 143 A + G+ IVFTAG G + IR Sbjct: 317 AAAMGGVDVIVFTAGIGENGPE-IREFILDG 346
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 27.0 bits (60), Expect = 0.012 Identities = 12/45 (26%), Positives = 18/45 (40%), Gaps = 4/45 (8%) Query: 44 FNAWVTSRSFK-SGTEMAEAREEIVKYFCEQYRMMLEDNLDEHIQ 87 N V S S + G EA I+ Y Y ++ + E I+ Sbjct: 178 LNGVVYSSSVRIGGDRFDEA---IINYVRRNYGSLIGEATAERIK 219
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 35.6 bits (82), Expect = 3e-04 Identities = 31/170 (18%), Positives = 71/170 (41%), Gaps = 7/170 (4%) Query: 4 FICIVTETLPAGLLPEIGSGLGVSPSFAGQMVTVYALGSLLAAIPLTIATQSWRRRTVLL 63 F ++ E + LP+I + P+ + T + L + + + +LL Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 64 LPILGFLIFNSVTALSSNYW-LTLVARFFAGASAGLAWSLIAGYARRMVVPQLQGRAMAI 122 I+ + + + +++ L ++ARF GA A +L+ R + + +G A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG--KAF 141 Query: 123 AMVGTPIALSLGV--PLGTWLGGFMGWRMAFGLMSGMTLVLIAWVLIKVP 170 ++G+ +A+ GV +G + ++ W + M ++ L+K+ Sbjct: 142 GLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIP--MITIITVPFLMKLL 189
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 64.6 bits (157), Expect = 4e-15 Identities = 31/176 (17%), Positives = 63/176 (35%), Gaps = 4/176 (2%) Query: 1 MAQMGRPRTFDRDAAITQ-AMHLFWEHGYDATSLSQLKASIGGGITAPSFYAAFGSKQAL 59 MA+ + + I A+ LF + G +TSL ++ + G +T + Y F K L Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAG--VTRGAIYWHFKDKSDL 58 Query: 60 FTEVMERYLTTHGRVTDSLFDQTLP-PREAIEFTLRRSAKMQCEPDHPKGCLVSLGLMSA 118 F+E+ E + G + + P + L + + + + + Sbjct: 59 FSEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCE 118 Query: 119 CSEESKTISAPLARARDMNRAALVACVERAIQAGELPRTVMPETLAAVFDSFMLGL 174 E + + + ++ I+A LP +M A + ++ GL Sbjct: 119 FVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL 174
>PF04183#IucA / IucC family Length = 580 Score = 27.9 bits (62), Expect = 0.044 Identities = 22/72 (30%), Positives = 31/72 (43%), Gaps = 7/72 (9%) Query: 18 RRTVSGASLAQELGVS--LRTIRRDVATLQGMGADIEGEPGLGYILKPGFL-LPPLSFTE 74 R + G +A S L+ + ATL GA I GEP GY+ G+ L + Sbjct: 284 YRGIPGRYIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRY 343 Query: 75 EEIQALMIGAQW 86 +E M+G W Sbjct: 344 QE----MLGVIW 351
>OMADHESIN#Yersinia outer membrane adhesin signature. Length = 455 Score = 31.0 bits (69), Expect = 0.004 Identities = 22/73 (30%), Positives = 30/73 (41%), Gaps = 8/73 (10%) Query: 19 THTLTAANDSTVKTVPIATKKAIVFFIGGAADQEKYYFQGAFHNIDGARNILDQRISANS 78 +HTL AN T TV +TKKAI + Y F +D + LD R+ Sbjct: 335 SHTLKTANSYTDVTVSNSTKKAI--------RESNQYTDHKFRQLDNRLDKLDTRVDKGL 386 Query: 79 KLSSKYTSWLRSY 91 S+ S + Y Sbjct: 387 ASSAALNSLFQPY 399
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 48.9 bits (116), Expect = 1e-09 Identities = 22/140 (15%), Positives = 47/140 (33%), Gaps = 4/140 (2%) Query: 2 SENARESILAAAKAAAQVHGYSGINFRSIADTVGIKNASIYYHFPSKADLGAAVARRYWQ 61 ++ R+ IL A G S + IA G+ +IY+HF K+DL + + Sbjct: 9 AQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSES 68 Query: 62 DTAAVLEAI--RDENTDPTRCLQLYPSIFRMSLENGNR--LCLSSFMAAEYEDLPEEVKS 117 + + + + ++ + ++ R L F E+ V+ Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128 Query: 118 EVKAFADANVAWLARVLADA 137 + + + + L Sbjct: 129 AQRNLCLESYDRIEQTLKHC 148
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 41.9 bits (98), Expect = 3e-07 Identities = 13/49 (26%), Positives = 21/49 (42%) Query: 97 PEHQGQGYGTESWHAVIDYAAAIGLDSLEATVTDGNIASCKLQEKCGFT 145 +++ +G GT H I++A L D NI++C K F Sbjct: 99 KDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 34.4 bits (79), Expect = 0.008 Identities = 28/157 (17%), Positives = 53/157 (33%), Gaps = 25/157 (15%) Query: 2467 FLVIGGSGGIGRTLCEHLLRNNGQRRVV---------LLSRHGECPEALQAYRSRIDPVQ 2517 +LV G +G IG + + LL Q + L + + + Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQA----RLELLAQPGFQFHK 58 Query: 2518 ADIADRTVWPQVLEQLERRYGHFDGVIH-AAGVGAGSLIRHRDARTLSEAMAAKTLGMLA 2576 D+ADR + GHF+ V + + + A S G L Sbjct: 59 IDLADREGMTDLFAS-----GHFERVFISPHRLAVRYSLENPHAYADSNLT-----GFLN 108 Query: 2577 VEELIQQMTPKFVLYCSSMAALFGGAGHLDYAAASGT 2613 + E + + +LY SS ++++G + ++ Sbjct: 109 ILEGCRHNKIQHLLYASS-SSVYGLNRKMPFSTDDSV 144
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 30.6 bits (69), Expect = 0.010 Identities = 15/74 (20%), Positives = 26/74 (35%), Gaps = 10/74 (13%) Query: 169 IAGKNTAIGDAIGLALKRLRLRPANSRVLVLVTDGANNGGQIDPITAA-RLAANEGVRIY 227 IA G +G+A + +L++ GQ D I A + V I Sbjct: 94 IAATENENG-VVGVA--------PEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDII 144 Query: 228 TIGIGSDPDKSGIQ 241 ++ +G D + Sbjct: 145 SMSLGGPEDVPELH 158
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 34.3 bits (78), Expect = 0.002 Identities = 29/191 (15%), Positives = 58/191 (30%), Gaps = 27/191 (14%) Query: 387 EAGDYASAAQRFAEGNTAADHYNRGNALARSGELEAALDAYEQALERQPDFPAAVNNRAL 446 + + A A+ + NT N +A+SG + + + Sbjct: 1064 QNREVAKEAKSNVKANTQT------NEVAQSGSETK--ETQTTETKETATVEKEEKAKVE 1115 Query: 447 ---VQNLLDQANAAQPEQDKPE--KPEQDEAGQNGTQ---DQTSNHSPSEQDTARPSEDN 498 Q + + P+Q++ E +P+ + A +N + + + + DT +P+++ Sbjct: 1116 TEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKET 1175 Query: 499 SSESLPPDTSGLQSSGPSTDDEQTTRPPLQPADRPVTSERRQELEQWLRQIPDDPGELLR 558 SS P + S P E + P R Sbjct: 1176 SSNVEQP----VTESTTVNTGNSVVENPENTTPATTQPTVNSESS-------NKPKNRHR 1224 Query: 559 RKFRYEQQHQE 569 R R + E Sbjct: 1225 RSVRSVPHNVE 1235
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 49.3 bits (117), Expect = 8e-08 Identities = 63/402 (15%), Positives = 143/402 (35%), Gaps = 12/402 (2%) Query: 626 TQHDEDEQASAQKAVDTLTEQRNQLREQVGGIIARQKELLRQHDQLTQRHQTLAPDLEAH 685 T+ D Q+ D + N L+ + + K L +D+LT+ L + Sbjct: 45 TRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKN 104 Query: 686 P---LGAQLLDRDPAKRDAWLSQQLSHLNEIILRDEQRQQALLNLQKDAARLQQSVQTAQ 742 ++ R A L + L D + + L + A + ++ A Sbjct: 105 DKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKAL 164 Query: 743 EASQAAAHHVTEQLQQLSADQQRLDEELAALAPLVSSQTLDGLRSDASTTVMQLEQQVVQ 802 E + + + +++ L A++ L+ A L + L+G + ++ +++ + Sbjct: 165 EGAMNFSTADSAKIKTLEAEKAALEARQAEL-----EKALEGAMNFSTADSAKIKTLEAE 219 Query: 803 RLDQLEQQGEEQQEQRERQQRIDSEQVEQKNRLQRVTEQQQAVAALSEQQQASQQRLQDL 862 + ++ + ++ ++ + K + A L + + + Sbjct: 220 KAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTAD 279 Query: 863 LGDHTSAEQWQQTLEHAVEQARQAESSAAQSLQDIQSQLIQLAAELKSGEQQQQALQQEL 922 + E + LE + Q ++ L K E + Q L+++ Sbjct: 280 SAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQN 339 Query: 923 TELDATLTDWRTQHAELDDAALDTLLTYDDEQVEQLRQQSQAAEKALEQARILLNEREQR 982 +A+ R +A + ++E+ + S+A+ ++L R L RE + Sbjct: 340 KISEASRQSLRRDLDASREAKKQLEAEHQ--KLEEQNKISEASRQSLR--RDLDASREAK 395 Query: 983 VQQHQAQHAGLTDSDALNVALLQAQEQTALSEQHCAELRAQL 1024 Q +A + AL + +E L+E+ AEL+A+L Sbjct: 396 KQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKL 437
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 70.9 bits (173), Expect = 2e-14 Identities = 64/274 (23%), Positives = 86/274 (31%), Gaps = 19/274 (6%) Query: 839 VAGTVMSAPAEAQAHEQAERANSTVEAPVADAAEPAPAVETTIAETTTVETTAVEAPTEQ 898 V T ++ P QA + +N+ A V +A P PA T + T ET A + E Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPAT---PSETTETVAENSKQES 1048 Query: 899 APVAAEPTVEQPAVEAPVADETPKAEAAPEVEVQPTVTEVPAIAAQTELFEAPHAERVVP 958 V EQ A E + EA V+ EV AQ+ Sbjct: 1049 KTVE---KNEQDATETTAQNREVAKEAKSNVKANTQTNEV----AQSGSETKETQTTETK 1101 Query: 959 FTPTPEPTPEPQAPVEAKAQEEVPATESSELPTPAPAPVAEPAFVKEEPAPYFAPQAPAV 1018 T T E E +A VE + +EVP S P + +P EPA P V Sbjct: 1102 ETATVEK--EEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQ---AEPAR---ENDPTV 1153 Query: 1019 EEAPAVQEAQEPAAVEAPALPVSSTGRAP-NDPREVRRRKREEEARRQKEAEQAASAAPV 1077 + A E PA SS P + V E Sbjct: 1154 NIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNS 1213 Query: 1078 ASEPAPVVAEAESVQPALNTEEHAEQQHAEKETE 1111 S P SV+ + E A ++ T Sbjct: 1214 ESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTV 1247 Score = 63.9 bits (155), Expect = 3e-12 Identities = 58/303 (19%), Positives = 95/303 (31%), Gaps = 23/303 (7%) Query: 564 PAPALPEPSLFKGLVKSLVSLFATKEEPAAPVVVEKPAATERPARNEERRNGRQQSRGRN 623 + P+ + V S+ S V AT N +Q+S+ Sbjct: 993 DTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVE 1052 Query: 624 NRRDEERKPREERAPREERAERAPREERAPRE--ERAPREERAPREERAPREERAPREAR 681 E+ E A E A+ A +A + E A + +E A E Sbjct: 1053 KN---EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE 1109 Query: 682 DDAAPTTTAREERPARTSRERKPREGREERPVRELREPLDAAPAVNIAREERPERAPREE 741 + A T +E P TS + P++ + E + + P VNI + + Sbjct: 1110 EKAKVETEKTQEVPKVTS-QVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADT 1168 Query: 742 RQP--RAPREERQPRTEQAVVEASEEEVLLNEEQAHDDSQDSNEGERPRRRSRGQRRRSN 799 QP QP TE V V E +Q P S + N Sbjct: 1169 EQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQ-------PTVNSESSNKPKN 1221 Query: 800 RRERQRDANGNVIEGSEENGNEEEQGSDAAADLAVTAAAVAGTVMSAPAEAQAHEQAERA 859 R R + + +E + + N+ + A DL + + ++A+A Q Sbjct: 1222 RHRRSVRSVPHNVEPATTSSNDRS--TVALCDL------TSTNTNAVLSDARAKAQFVAL 1273 Query: 860 NST 862 N Sbjct: 1274 NVG 1276 Score = 58.9 bits (142), Expect = 1e-10 Identities = 52/285 (18%), Positives = 83/285 (29%), Gaps = 12/285 (4%) Query: 704 PREGREERPVRELREPLDAAPAVNIAREERPERAPREE--RQPRAPREERQPRTEQAVVE 761 P + + V + NI + + EE R AP P T E Sbjct: 983 PEVEKRNQTVD----TTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038 Query: 762 ASEEEVLLNEEQAHDDSQDSNEGERPRRRSRGQRRRSNRRERQRDANGNVIEGSEENGNE 821 E + + QD+ E R + + + + Q N GSE + Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQ--TNEVAQSGSETKETQ 1096 Query: 822 EEQGSDAAADLAVTAAAVAGTVMSAPAEAQAHEQAERANSTVEAPVADAAEPAPAVETTI 881 + + A A V + + ++ S P AEPA + T+ Sbjct: 1097 TTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP---QAEPARENDPTV 1153 Query: 882 AETTTVETTAVEAPTEQAPVAAEPTVEQPAVEAPVADETPKAEAAPEVEVQPTVTEVPAI 941 T A TEQ VEQP E+ + PE P T+ Sbjct: 1154 NIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENT-TPATTQPTVN 1212 Query: 942 AAQTELFEAPHAERVVPFTPTPEPTPEPQAPVEAKAQEEVPATES 986 + + + H V EP A ++ +T + Sbjct: 1213 SESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNT 1257 Score = 56.6 bits (136), Expect = 6e-10 Identities = 68/333 (20%), Positives = 105/333 (31%), Gaps = 29/333 (8%) Query: 789 RRSRGQRRRSNRRERQRDANGNVIEGSEENGNEEEQGSDAAADLAVTAAAVAGTVMSAPA 848 R G+ N +R N V + N + + A V + PA Sbjct: 972 RNVNGRYDLYNPEVEKR--NQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPA 1029 Query: 849 EAQAHEQAE-------RANSTVEAPVADAAEPAPAVETTIAETTTVETTAVEAPTEQAPV 901 A E E + + TVE DA E E + V+A T+ V Sbjct: 1030 PATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKE----AKSNVKANTQTNEV 1085 Query: 902 AAEPTVEQPAVEAPVADETPKAEAAPEVEVQPTVT-EVPAIAAQTELFEAPHAERVVPFT 960 A+ E + ET E + +V+ T EVP + +Q +P E+ Sbjct: 1086 -AQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQV----SPKQEQSETVQ 1140 Query: 961 PTPEPTPEPQAPVEAKAQEEVPATESSELPTPAPAPVAEPAFVKEEPAPYFAPQAPAVEE 1020 P EP E V K E + ++ T PA + +V E Sbjct: 1141 PQAEPARENDPTVNIK---EPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVE 1197 Query: 1021 APAVQEAQEPAAVEAPALPVSSTGRAPNDPREVRRRKREEEAR--RQKEAEQAASAAPVA 1078 P E PA + SS R VR E + A + Sbjct: 1198 NP---ENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTS 1254 Query: 1079 SEPAPVVAE--AESVQPALNTEEHAEQQHAEKE 1109 + V+++ A++ ALN + Q ++ E Sbjct: 1255 TNTNAVLSDARAKAQFVALNVGKAVSQHISQLE 1287 Score = 47.8 bits (113), Expect = 3e-07 Identities = 36/186 (19%), Positives = 67/186 (36%), Gaps = 23/186 (12%) Query: 949 EAPHAERVVPFTPTPEPTP-EPQAPVEAKAQEEVPATESSELPTPAPAPVAEPAFVKEEP 1007 E + V T P + P EE+ + + +P PAPA +E E Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAEN 1043 Query: 1008 APYFAPQAPAVEEAPAVQEAQEPAAVEAPALPV------SSTGRAPNDPREVRRRKREEE 1061 + + E+ AQ + V + ++ ++ +E + + +E Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKET 1103 Query: 1062 ARRQK------EAEQAASAAPVASEPAPVVAEAESVQ----------PALNTEEHAEQQH 1105 A +K E E+ V S+ +P ++E+VQ P +N +E Q + Sbjct: 1104 ATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTN 1163 Query: 1106 AEKETE 1111 +TE Sbjct: 1164 TTADTE 1169
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 162 bits (411), Expect = 1e-51 Identities = 89/236 (37%), Positives = 127/236 (53%), Gaps = 7/236 (2%) Query: 1 MKQHRLAAAIALVGLVLAGCDKQASTVELKTPAQKASYGIGLNMGKSLAQEGMDDLDSKA 60 MK + AAI +GL ++ L T K SY IG ++GK+ +G+D ++ Sbjct: 1 MKMKLVTAAI--MGLAMSTAMAATDATSLTTDKDKLSYSIGADLGKNFKNQGID-INPDV 57 Query: 61 VALGIEDAVGKKDQKLKDEELVEAFAALQK----RAEERMAKMSEESAAAGKKFLEENGK 116 +A G++D + L +E++ + + QK + K +EE+ A G FL N Sbjct: 58 LAKGMQDGMSGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKS 117 Query: 117 KEGVVTTASGLQYQIIKKGDGAQPKPTDVVTVHYEGKLTDGKVFDSSVERGSPIDLPVGG 176 K G+V SGLQY+II G GA+P +D VTV Y G L DG VFDS+ + G P V Sbjct: 118 KPGIVVLPSGLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQ 177 Query: 177 VIPGWVEGLQLMHVGEKIKLFIPSDLAYGAQSPSPLIPANSVLVFDLELLGIKDPA 232 VIPGW E LQLM G ++F+P+DLAYG +S I N L+F + L+ +K A Sbjct: 178 VIPGWTEALQLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVKKAA 233
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 59.8 bits (145), Expect = 1e-12 Identities = 50/231 (21%), Positives = 81/231 (35%), Gaps = 28/231 (12%) Query: 30 EVAHRFGHIPFVAGSTRKTSILMAVLREVHRGHLDLNEPIRYEERLREGVMSGTFKYLTP 89 + F ST K + AVL V G L I Y ++ + K+L Sbjct: 52 TLTAWRADERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLAD 111 Query: 90 GFSIS-LRDALVQMIIVSDNVCTRMVLERIS-LARINDFCQSLDMGNTSHRNTIPRPDL- 146 G ++ L A + M SDN ++L + A + F + + G+ R +L Sbjct: 112 GMTVGELCAAAITM---SDNSAANLLLATVGGPAGLTAFLRQI--GDNVTRLDRWETELN 166 Query: 147 -AIDHKLEEVTTTSAFDQGLLYDLILQGSVNPATATLLGCSSEQCAFALDVLSWQKLRT- 204 A+ + TT ++ L L+ ++ + L L W Sbjct: 167 EALPGDARDTTTPASMAA-TLRKLLTSQRLSARSQRQL-------------LQWMVDDRV 212 Query: 205 ---KMASLLPADTKIAHKGGTGKRG-RMDGGIVFRDGAPLFIFTGYTDQVP 251 + S+LPA IA K G G+RG R ++ + I Y P Sbjct: 213 AGPLIRSVLPAGWFIADKTGAGERGARGIVALLGPNNKAERIVVIYLRDTP 263
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 28.8 bits (64), Expect = 0.029 Identities = 13/63 (20%), Positives = 24/63 (38%), Gaps = 1/63 (1%) Query: 167 ATVRLVPANMHERNK-LRDLRDRLAQCLGIRSADHDNYGFHITLGYLVQWMDARQTQDYA 225 A++ + + + E L +LRD + + + F + Q +Q Q A Sbjct: 295 ASIEQIQSKIQELGDTLEELRDSFDGYINNAFVNQIHLNFVMPPQAQQQQGQGQQQQAQA 354 Query: 226 TVQ 228 T Q Sbjct: 355 TAQ 357
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 84.3 bits (208), Expect = 2e-21 Identities = 56/230 (24%), Positives = 88/230 (38%), Gaps = 14/230 (6%) Query: 46 REILLCLLLAMAGHG-LVGWFLFQSPADSEVIPAPL-PVVMQLVAPPIAPPINASAPTEP 103 R LL++ HG +V L+ S +PAP P+ + +VAP P A P Sbjct: 12 RRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPE 71 Query: 104 AAAPPPPEATPAVSTPAPQP---AKPTPKPAAKKPAAANKAPPQSQHTEQPGKEVAAPQQ 160 P PE P P P KP PKP K P+ + + + Sbjct: 72 PVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFEN 131 Query: 161 TAIAKPPAPAPE----QALVGPYGRAGYLNNPPPTYPPIAARLHQQGVVVLRVHVRADGH 216 TA A+P + + + L+ P YP A L +G V ++ V DG Sbjct: 132 TAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGR 191 Query: 217 PEQVQVFTSSGFDSLDQAAIKAVNQWTFMPAKRGEVATDGWVNVPLAFKL 266 + VQ+ ++ + ++ A+ +W + P K G V + FK+ Sbjct: 192 VDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIV-----VNILFKI 236
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 114 bits (286), Expect = 3e-33 Identities = 34/112 (30%), Positives = 51/112 (45%), Gaps = 12/112 (10%) Query: 65 YFEYDSSDLKPEAMRSLDVHA---KDLKSNGARVVLEGNTDERGTREYNMALGERRAKAV 121 F ++ + LKPE +LD +L VV+ G TD G+ YN L ERRA++V Sbjct: 222 LFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSV 281 Query: 122 QRYLVLQGVSPAQLELVSYGEERPVATGNDEQS---------WAQNRRVELR 164 YL+ +G+ ++ GE PV + A +RRVE+ Sbjct: 282 VDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIE 333
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 60.8 bits (147), Expect = 3e-12 Identities = 45/260 (17%), Positives = 96/260 (36%), Gaps = 11/260 (4%) Query: 78 ARQTEVEQLEQKKIEQLKQEAVKAAEQKKEESAQKAEEQKAADEAKK----AEQKAEEAK 133 A T E E E KQE+ + +++ + A+ ++ A EAK Q E A+ Sbjct: 1029 APATPSETTETVA-ENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQ 1087 Query: 134 KADDAKKADEAKKVADAKKVEEKQLADIAKKKAEDEAKKKAEEDAKKAAAEEAKKQAADE 193 + K+ + + VE+++ A + +K ++ K ++ K+ +E + QA Sbjct: 1088 SGSETKETQTTET-KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146 Query: 194 AKKKAAEDAKKKAAEDAKKKAAADSAKKAQEAARKSAEDKKAQALADLLSDKPERQQALA 253 + + K+ ++ AK+ + + + + + PE Sbjct: 1147 RENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206 Query: 254 DERGDETAGSFDDLIR----VRASEGWSRPPS-ARNNMSVTLQIGMLPDGTIASVSIAKS 308 + + S R VR+ P + + N+ S + T A +S A++ Sbjct: 1207 TQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARA 1266 Query: 309 SGDGPFDSSAVAAVKNIGRL 328 + A ++I +L Sbjct: 1267 KAQFVALNVGKAVSQHISQL 1286 Score = 57.0 bits (137), Expect = 6e-11 Identities = 34/205 (16%), Positives = 68/205 (33%), Gaps = 14/205 (6%) Query: 61 ATTQTNQKIAGEAKKTAARQTEVEQLEQKKIEQLKQEAVKAAEQKKEESAQKAEEQKAAD 120 Q + + AR E + A K+E + EQ A + Sbjct: 1001 NNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATE 1060 Query: 121 EAKKAEQKAEEAKKADDA-----KKADEAKKVADAKKVEEKQLADIAKKKAEDEAKKKAE 175 + + A+EAK A + A + + + E K+ A + E E K K E Sbjct: 1061 TTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATV-----EKEEKAKVE 1115 Query: 176 EDAKKAAAEEAKKQAADEAKKKAAEDAKKKAAEDAKKKAAADSAKKAQEAARKSAEDKKA 235 + +E K + + K+ + + AE A++ + K+ Q +A+ ++ Sbjct: 1116 TEKT----QEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP 1171 Query: 236 QALADLLSDKPERQQALADERGDET 260 ++P + + Sbjct: 1172 AKETSSNVEQPVTESTTVNTGNSVV 1196
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 159 bits (403), Expect = 5e-53 Identities = 51/147 (34%), Positives = 81/147 (55%) Query: 8 SEEDRKSIVDGLSHLLSDTYVLYLKTHNFHWNVSGPMFRTLHLMFEEQYNELALAVDSIA 67 ++ ++ + + L+ LS+ ++LY K H FHW V GP F TLH FEE Y+ A VD+IA Sbjct: 6 AKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDTIA 65 Query: 68 ERIRALGFPAPGTYSTYARLSTIKEEEGVPSAEDMIKSLVQGQEAVVRTARSIFPLLDKV 127 ER+ A+G T Y ++I + SA +M+++LV + + ++ + L ++ Sbjct: 66 ERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLAEEN 125 Query: 128 SDEPTADLLTQRMQVHEKTAWMLRSML 154 D TADL ++ EK WML S L Sbjct: 126 QDNATADLFVGLIEEVEKQVWMLSSYL 152
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 119 bits (300), Expect = 1e-31 Identities = 79/371 (21%), Positives = 142/371 (38%), Gaps = 48/371 (12%) Query: 22 VGIDLGTTNSLVAAVRSGLSEPLADAEGQVILPSAVRYHADRVEVGQSAKVAASQDPFNT 81 + IDLGT N+L+ G+ + PS V + G VAA Sbjct: 13 LSIDLGTANTLIYVKGQGIV---------LNEPSVVA--IRQDRAGSPKSVAAVGHD--- 58 Query: 82 VLSVKRLMGRGLTDVKQLGEQLPYRFVDGESHMPFIETVQGPKSPVEVSADILK-VLRQR 140 K+++GR ++ + P + G + V+ +L+ ++Q Sbjct: 59 ---AKQMLGRTPGNIAAIR---PMK--------------DGVIADFFVTEKMLQHFIKQV 98 Query: 141 AEEALGGELVGAVITVPAYFDDAQRQATKDAAKLAGLNVLRLLNEPTAAAVAYGLDQKAE 200 + ++ VP +R+A +++A+ AG + L+ EP AAA+ GL Sbjct: 99 HSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEA 158 Query: 201 GVVAIYDLGGGTFDISILRLTGGVFEVLATGGDTALGGDDFDHAIASWIVAEAGL--SAD 258 + D+GGGT +++++ L G V +GGD FD AI +++ G Sbjct: 159 TGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSLIGEA 213 Query: 259 LAPSAQRSLLQAACAAKEALTDADAVDVAYGDWKAVL--TREALNAMIEPMVARSLKACR 316 A + + A + + ++A G + + E L A+ EP + + A Sbjct: 214 TAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEP-LTGIVSAVM 272 Query: 317 RAVRDTGIELEE--VEA-VVMVGGSTRVPRVREAVAELFGRQPLTEIDPDQVVAIGAAIQ 373 A+ EL E +V+ GG + + + E G + DP VA G Sbjct: 273 VALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARGGGKA 332 Query: 374 ADTLAGNKRDG 384 + + + D Sbjct: 333 LEMIDMHGGDL 343
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 33.6 bits (77), Expect = 0.003 Identities = 20/105 (19%), Positives = 34/105 (32%), Gaps = 3/105 (2%) Query: 554 GSFGSVQYSQMPNRVTGGEVKPEKARTWELGTRYDNGNLRAEIGAFLINFDNQYD--SNQ 611 G F + + V EK + L + YDN L A + + + S+ Sbjct: 187 GFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALYASVAVQQQDAKLVEENYSHN 246 Query: 612 TNDTVIARGETRHQGIETSINYALEGLSPALAGYDVYATYAFVDA 656 + V A R + ++YA G + + Y V Sbjct: 247 SQTEVAATLAYRFGNVTPRVSYAH-GFKGSFDATNYNNDYDQVVV 290
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 33.0 bits (75), Expect = 0.001 Identities = 19/114 (16%), Positives = 36/114 (31%), Gaps = 1/114 (0%) Query: 413 LSQALKQYPDDINLLYTRAMLAEKRNDLAQMEKDLRSIIKREPENAMALNALGYTLSDRT 472 ++ + D + LY+ A + K +++ + ++ LG Sbjct: 25 IAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAM- 83 Query: 473 TRYAEARALIEKAHSINPDDPAVLDSLGWVNYRMGNLDEAERLLRKALERFPDH 526 +Y A ++ +P + G L EAE L A E D Sbjct: 84 GQYDLAIHSYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQELIADK 137
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 184 bits (468), Expect = 1e-57 Identities = 83/353 (23%), Positives = 143/353 (40%), Gaps = 44/353 (12%) Query: 1 MKILVTGGAGFIGSAVIRHIISNTNDSVINVDKLT--YAGNL-ESLQSVEDSERYAFAHV 57 MK LVTG AGFIG V + ++ + V+ +D L Y +L ++ + + F + Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQ-VVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59 Query: 58 DICDREAIDKVFQEHQPDAIMHLAAESHVDRSITGPSEFIQTNIIGTYTLLEAARAYWNQ 117 D+ DRE + +F + + V S+ P + +N+ G +LE R Q Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119 Query: 118 LDEARKSNFRFHHISTDEVYGDLEGPEDLFTETTPY-QPSSPYSASKASSDHLVRAWSRT 176 + S+ VYG + F+ P S Y+A+K +++ + +S Sbjct: 120 ---------HLLYASSSSVYGL--NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHL 168 Query: 177 YGLPTLVTNCSNNYGPCHFPEKLIPLIILNALEGKPLPIYGKGDQVRDWLYVEDHARALY 236 YGLP YGP P+ + LEGK + +Y G RD+ Y++D A A+ Sbjct: 169 YGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAII 228 Query: 237 KVV------------------TEGEIGETYNIGGHNEKQNIEVVHTVCALLDQLRPDSAH 278 ++ YNIG +E++ + AL D L + Sbjct: 229 RLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNS---SPVELMDYIQALEDALGIE--- 282 Query: 279 LPHASLITYVQDRPGHDLRYAIDASKIQRELGWVPEESFESGIRKTVEWYLNN 331 + + +PG L + D + +G+ PE + + G++ V WY + Sbjct: 283 ----AKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 53.6 bits (129), Expect = 2e-10 Identities = 32/162 (19%), Positives = 59/162 (36%), Gaps = 20/162 (12%) Query: 1 MKILLLGKNGQVGWELQRSLAVLG-EVIALD---------------RQVASTAYGEISGD 44 MK L+ G G +G+ + + L G +V+ +D +A + D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 45 LSNLDELRKTIRQVQPQVIVNAAAYTAVDKA-ETEQALARTVNALASQVLAEEALQLD-A 102 L++ + + + + + AV + E A A N + E Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYA-DSNLTGFLNILEGCRHNKIQ 119 Query: 103 LLVHYSTDYVFNGTGSQAWKETDAVS-PVNYYGATKLEGEQL 143 L++ S+ V+ + D+V PV+ Y ATK E + Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELM 161
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 31.1 bits (70), Expect = 0.004 Identities = 19/73 (26%), Positives = 37/73 (50%), Gaps = 5/73 (6%) Query: 192 TVLTTVLLFLSPVLYPIAALPEVYRPWLQMNPLTYVIEESRSVLLFGHLPQWDSLGIAIV 251 T++ T +LFLS ++P+ LP V++ + PL++ I+ R ++L + + Sbjct: 183 TLVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVD-----VCQH 237 Query: 252 IGSLMAVAGFWFF 264 +G+L FF Sbjct: 238 VGALCIYIVIPFF 250
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 50.1 bits (119), Expect = 6e-08 Identities = 41/198 (20%), Positives = 72/198 (36%), Gaps = 20/198 (10%) Query: 695 LLTEPQVAERLLQQEEELKQALETTTDQSIREHSALEAIEAANLAEQESHRLTLANIEVE 754 L + A + + LE + LE + + + +E E Sbjct: 195 LEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAE 254 Query: 755 NLSIQESHRLEVAELEAASLVIHENHRLTMAEMEAANLELQESHRLQRMEFEAANAALRE 814 +++ AELE A + A + +++ E A A + Sbjct: 255 KAALEA----RQAELEKA---LEGAMN----FSTADSAKIKTLEA----EKAALEAEKAD 299 Query: 815 HHERELQNLEAEKQAV---LDAHKKQCMAVEAEHLANQEYYQLILLQMEAEQARTLEQHR 871 E + Q L A +Q++ LDA ++ +EAEH +E + I R L+ R Sbjct: 300 L-EHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNK-ISEASRQSLRRDLDASR 357 Query: 872 LALAKLETENQLLHENHR 889 A +LE E+Q L E ++ Sbjct: 358 EAKKQLEAEHQKLEEQNK 375 Score = 48.9 bits (116), Expect = 1e-07 Identities = 31/208 (14%), Positives = 57/208 (27%), Gaps = 21/208 (10%) Query: 700 QVAERLLQQEEELKQALETTTDQSIREHSALEAIEAANLAEQESHRLTLANIEVENLSIQ 759 + A + + LE + LE + + + +E E +++ Sbjct: 130 EGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALE 189 Query: 760 ESHR---------------LEVAELEAASLVIHENHRLTMAEMEAANLELQESHRLQRME 804 + R E + +++ Sbjct: 190 ARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIK 249 Query: 805 FEAANAALREHHERELQNLEAEKQAVLDAHKKQCMAVEAEHLANQEYYQLILLQMEAEQA 864 A A E + EL+ A + +EAE A + + Q + A Sbjct: 250 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNA 309 Query: 865 ------RTLEQHRLALAKLETENQLLHE 886 R L+ R A +LE E+Q L E Sbjct: 310 NRQSLRRDLDASREAKKQLEAEHQKLEE 337 Score = 32.3 bits (73), Expect = 0.017 Identities = 32/230 (13%), Positives = 67/230 (29%), Gaps = 16/230 (6%) Query: 717 ETTTDQSIREHSALEAIEAANLAEQESHRLTLANIEVENLSIQESHRLEVAELEAASLVI 776 T ++ E + +L +++ N ++++ + E+ E + + Sbjct: 42 AVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHND-ELTEELSNAKEK 100 Query: 777 HENHRLTMAEMEAANLELQESHRLQRMEFEAANAALREHHERELQNLEAEKQ------AV 830 + +++E + EL+ E A +++ LEAEK A Sbjct: 101 LRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTAD-SAKIKTLEAEKAALAARKAD 159 Query: 831 LDAHKKQCMAVEAEHLANQEYYQLILLQMEAEQARTLEQHRLALAKLET--------ENQ 882 L+ + M A + + +EA QA + A+ E + Sbjct: 160 LEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAE 219 Query: 883 LLHENHRLTLAGIDSDAMTLRRNQRLELREIESKTMTMLENHRLELEARD 932 R + + LE + ELE Sbjct: 220 KAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKAL 269
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 98.3 bits (245), Expect = 1e-25 Identities = 69/337 (20%), Positives = 116/337 (34%), Gaps = 31/337 (9%) Query: 1 MKAIITGITGQDGAYLAQLLLEKGYTVYG-----TYRRTSSVNFWRIEELGIQHDANLHL 55 MK ++TG G G ++++ LLE G+ V G Y S + R+E L Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVS-LKQARLELLA---QPGFQF 56 Query: 56 VEYDLTDLSASIRLLQNTEATEIYNLAAQSFVGVSFEQPLTTAQITGIGAVNLLEAIRIV 115 + DL D L + ++ + V S E P A G +N+LE R Sbjct: 57 HKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN 116 Query: 116 NPKIRFYQASTSEMFGKVQSIPQIESTPF-YPRSPYGVAKLYAHWMTINYRESYGIFGAS 174 + AS+S ++G + +P +P S Y K M Y YG+ Sbjct: 117 KIQ-HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATG 175 Query: 175 GILFNHESPLRGRE-----FVTRKITDSVAKINMGLMDSFELGNMDAKRDWGFAKEYVEG 229 F P GR T+ + + + +D + G M KRD+ + + E Sbjct: 176 LRFFTVYGP-WGRPDMALFKFTKAMLEGKS------IDVYNYGKM--KRDFTYIDDIAEA 226 Query: 230 MWRMLQAETPDSFVLATNRTETVRSFVSMAFKATGVTVQWEGEAESERGIDAATGKVLVS 289 + R+ S G + E I A + + Sbjct: 227 IIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNS----SPVELMDYIQALEDALGIE 282 Query: 290 VNPKF--YRPTEVELLIGNPAKALEVLGWEPKTHLEE 324 +P +V + EV+G+ P+T +++ Sbjct: 283 AKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKD 319
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 100 bits (250), Expect = 2e-26 Identities = 61/240 (25%), Positives = 97/240 (40%), Gaps = 31/240 (12%) Query: 7 RALITGIHGFTGSFMARELAAQGCEVVGM----------------GSQPSDSDNYHQVDL 50 + L+TG GF G +++ L G +VVG+ +H++DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 51 LDMNGLTALLAGIQPDIVVHLAALAFVGH--GSPEAFYQVNLIGTRNLLEAIEASGKTPD 108 D G+T L A + V V + +P A+ NL G N+LE + Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK--IQ 119 Query: 109 CVLLASSANVYG-NASSGMLDETTIPAPANDYAVSKLAMEYMASLWHA--RLPLVITRPF 165 +L ASS++VYG N + ++ P + YA +K A E MA + LP R F Sbjct: 120 HLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFF 179 Query: 166 NYTGVGQAENFLLPKIVSHFTRR---ESTIEL-GNLDVWRDFSDVRAVTSAYRGLLEARP 221 G + L K FT+ +I++ + RDF+ + + A L + P Sbjct: 180 TVYGPWGRPDMALFK----FTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIP 235
>PF06917#Periplasmic pectate lyase Length = 555 Score = 32.2 bits (73), Expect = 0.003 Identities = 21/97 (21%), Positives = 37/97 (38%), Gaps = 9/97 (9%) Query: 4 GLGGLNKSPNGVVIGLAQLALPDPHTREAL--WAQTEKVVSMVAKARRSNPGMDLIVFPE 61 L LNK+ + AQ + P+ AL A+ + ++ A + + +F Sbjct: 424 QLAELNKTQRRATLMAAQRPIASPYLLLALVELAEHCQCPTLFTLAWQ----IGDDLFKR 479 Query: 62 YSLHGLSMSTAPEIMCSLDGPEVAAL---RQACRDHR 95 + GL + +A +D P AL A +D Sbjct: 480 HYHRGLFVESAQHRYFRIDNPIALALLTLIAAKQDKL 516
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 47.3 bits (112), Expect = 1e-08 Identities = 43/203 (21%), Positives = 73/203 (35%), Gaps = 29/203 (14%) Query: 10 PYPWPWNGKL---------NARNTALIVIDMQTDFCGVGGYVDSMGYDLALTRAPIEPIK 60 PY P + + L++ DMQ F VD+ + I+ Sbjct: 8 PYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYF------VDAFTAGASPVTELSANIR 61 Query: 61 GLLALMRPLGFTIIHTREGHRPDLSDLPANKRWRSQRIGAGIGDPGPCGKILVRGEPGWE 120 L LG +++T + P ++ + + PG L G + Sbjct: 62 KLKNQCVQLGIPVVYTAQ---------PGSQNPDDRALLTDFWGPG-----LNSGPYEEK 107 Query: 121 LIDELAPLPGEIVIDKPGKGSFYATDLELVLRNRGIENLILTGITTDVCVHTTMRDANDR 180 +I ELAP ++V+ K +F T+L ++R G + LI+TGI + T +A Sbjct: 108 IITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFME 167 Query: 181 GFECILLEDCCGATDPANHAAAL 203 + + D H AL Sbjct: 168 DIKAFFVGDAVADFSLEKHQMAL 190
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 63.5 bits (154), Expect = 5e-14 Identities = 43/218 (19%), Positives = 75/218 (34%), Gaps = 21/218 (9%) Query: 3 DVSARPTRFAFEPASTALVIIDMQRDFLEPGGFGAALGNDVLPLQAIIPTVQQLLALARD 62 D+ + +P L+I DMQ F++ P+ + +++L Sbjct: 16 DMPQNKVSWVPDPNRAVLLIHDMQNYFVDA------FTAGASPVTELSANIRKLKNQCVQ 69 Query: 63 QHMTVIHTRESHVEDLADCPPAKLEHGLPGLRIGDAGPMGRILVRGEPGNQIINALAPIA 122 + V++T + ++ D G PGL +GP +II LAP Sbjct: 70 LGIPVVYTAQPGSQNPDDRALLTDFWG-PGLN---SGPYEE---------KIITELAPED 116 Query: 123 GEWVIDKPGKGMFFGTGLHGRLNTAGITHLIFAGVTTEVCVQSSMREANDRGYRCLLIED 182 + V+ K F T L + G LI G+ + + EA + + D Sbjct: 117 DDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGD 176 Query: 183 ATESYFPAFKQATLDMITAQGGIVGRVTSLSALEQALQ 220 A + Q L+ + V + S L+Q Sbjct: 177 AVADFSLEKHQMALEYAAGRCAFT--VMTDSLLDQLQN 212
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.011 Identities = 12/40 (30%), Positives = 20/40 (50%), Gaps = 3/40 (7%) Query: 44 RVEVTPKNILMIGPTGVGKTEIAR---RLAKLANAPFIKV 80 R+ T +++ G +G GK +AR K N PF+ + Sbjct: 155 RLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAI 194
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 62.3 bits (151), Expect = 3e-14 Identities = 35/147 (23%), Positives = 54/147 (36%), Gaps = 10/147 (6%) Query: 1 MKTRDRILECALTLFNQQGEPNVSTLEIANEMGISPGNLYYHFHGKEPLILGLFERFQTE 60 +TR IL+ AL LF+QQG + S EIA G++ G +Y+HF K L ++E ++ Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69 Query: 61 LAPLL---------DPPADARLNAEDYWMFLHLIVERLSHYRFLFQDLSNLAGRLPKLAR 111 + L DP + R R +F G + + + Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHK-CEFVGEMAVVQQ 128 Query: 112 GIRNLLNSLKRTLASLLARLKSQGQLV 138 RNL + L L Sbjct: 129 AQRNLCLESYDRIEQTLKHCIEAKMLP 155
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 45.8 bits (108), Expect = 1e-07 Identities = 33/156 (21%), Positives = 60/156 (38%), Gaps = 8/156 (5%) Query: 111 VPSRNEVQALHSKVDQLTQQIEQLTGAKARPVAPRAAAAPKPAPKTTAKPLKAAAKTVAR 170 + + N +QA V ++I ++ A PV P A A P +T A+ K +KTV + Sbjct: 997 ITTPNNIQADVPSVPSNNEEIARVDEA---PVPPPAPATPSETTETVAENSKQESKTVEK 1053 Query: 171 TADKAADKAAAAKPAARKAAAKPLDAAE-----KAASKTASKAKDAAKPAAKPAAPRKPA 225 A + A + A++A + + ++ S+T K A K Sbjct: 1054 NEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAK 1113 Query: 226 AKKAAAPKPAASATADSPKPAAAPTPPPEAPANQPS 261 + + + SPK + T P+A + + Sbjct: 1114 VETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN 1149 Score = 39.3 bits (91), Expect = 1e-05 Identities = 29/205 (14%), Positives = 57/205 (27%), Gaps = 6/205 (2%) Query: 63 QKQIDEVKDTTKAAKSRVGDVKDMALGKWNELEGAFDKRLNSAISRLGVPSRNEVQALHS 122 ++ ++ DTT ++ NE D+ + E A +S Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENS 1044 Query: 123 KVDQLTQQIEQLTGAKARPVAPRAAAAPKPAPKTTAKPLKAA-AKTVARTADKAADKAAA 181 K + T + + + A K K + + A + + + K A Sbjct: 1045 KQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETA 1104 Query: 182 AKPAARKAAAKPLDAAEKAASKTASKAK----DAAKPAAKPAAPRKPA-AKKAAAPKPAA 236 KA + E + K + +P A+PA P K + Sbjct: 1105 TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNT 1164 Query: 237 SATADSPKPAAAPTPPPEAPANQPS 261 +A + P + + Sbjct: 1165 TADTEQPAKETSSNVEQPVTESTTV 1189
>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature. Length = 171 Score = 103 bits (258), Expect = 6e-31 Identities = 40/138 (28%), Positives = 60/138 (43%) Query: 1 MFGISFSELLLIGLVALLVLGPERLPGAARTAGLWIGRLKRSFNAIKQEVEREIGADEIR 60 MF I FSELLL+ ++ L+VLGP+RLP A +T WI L+ ++ E+ +E+ E + Sbjct: 1 MFDIGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQNELTQELKLQEFQ 60 Query: 61 RQLHNEHILSLEDEARKMFAQQQHPEVAYEPIVPPTAPQAAQPASHHEIGPAEPADKAPL 120 L SL + ++ A A E + + AS P K Sbjct: 61 DSLKKVEKASLTNLTPELKASMDELRQAAESMKRSYVANDPEKASDEAHTIHNPVVKDNE 120 Query: 121 TLEKTAKPAADTTPDVTP 138 + PAA T +P Sbjct: 121 AAHEGVTPAAAQTQASSP 138
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 26.4 bits (58), Expect = 0.034 Identities = 14/36 (38%), Positives = 20/36 (55%) Query: 1 MKLSGILAASILLVGCTNSSTDLLTDSRSFDGEIRT 36 ++LS + + LLVGC +S TD S +G RT Sbjct: 3 LRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRT 38
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 27.9 bits (62), Expect = 0.026 Identities = 20/73 (27%), Positives = 30/73 (41%), Gaps = 3/73 (4%) Query: 46 AYKDAADGLVEAMANRQVPLDSGIYPL-LFLYRHSLELQFKLMLKSARALTGKEPKNYDK 104 Y DA E + Q L+ G PL F+ R + E L + A + P+ Sbjct: 108 IYVDAYQPFSEEKISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPM 167 Query: 105 HPLMPLW--SELR 115 L+P + SEL+ Sbjct: 168 RILLPAYVTSELK 180
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 30.2 bits (68), Expect = 0.030 Identities = 44/221 (19%), Positives = 85/221 (38%), Gaps = 18/221 (8%) Query: 98 GAAIAIFCAALGL-NWTAALLVGLT--LSLSSTAIAMQAMTERNMNSTAVGRSSFAVLLL 154 + L L N A L+ + + L T + A ++N+ + A+ LL Sbjct: 347 AIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGY-SINTLTMFGMVLAIGLL 405 Query: 155 QDIAAIPLVAMIPLLAANGGTPSGAELALSIAKIVGAIVAVVLLGQYVSRPVLRFVARSG 214 D AI +V + + P S+++I GA+V + ++ V P+ F +G Sbjct: 406 VD-DAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 215 LREIFSAVALFLVFGFGLLLEEAGLSMAMGAFLAGVLLASSEYRHALESDIEPFKGLLLG 274 I+ ++ +V L + +++ + L LL H KG G Sbjct: 465 --AIYRQFSITIVSAMALSVL---VALILTPALCATLLKPVSAEH------HENKGGFFG 513 Query: 275 LFFIGVGMSIDFGTLIDSPLKVITLTLGFILIKLLVIKLLG 315 F S++ +S K++ T ++LI L++ + Sbjct: 514 WFNTTFDHSVNH--YTNSVGKILGSTGRYLLIYALIVAGMV 552
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 58.1 bits (140), Expect = 9e-13 Identities = 30/170 (17%), Positives = 56/170 (32%), Gaps = 6/170 (3%) Query: 5 TKAALLSYAETQMRSKGYSAFSYADLAAKVGIRKASIHHHFPTKECLGAELINDYIARFN 64 T+ +L A +G S+ S ++A G+ + +I+ HF K L +E+ + Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71 Query: 65 ETLV-SIEIRHPDPLQRLQD----FSRLFVISANEGLLPLCGALAAEMAALPLSLQGLTR 119 E + DPL L++ V LL E +Q R Sbjct: 72 ELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQR 131 Query: 120 DFFNSQLAWLQSTLSDAVRQHNWSLGTPAENFAFMLLSMLEGASLIDWTL 169 + ++ TL + A ++ + G + +W Sbjct: 132 NLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL-MENWLF 180
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 103 bits (258), Expect = 9e-29 Identities = 66/252 (26%), Positives = 106/252 (42%), Gaps = 8/252 (3%) Query: 6 KGKKLLVVGGTSGMGLETARQFLKAGGSVVLTGSKQDKADAVRAELSPLG-NVSVIVANL 64 +GK + G G+G AR G + +K + V + L + A++ Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66 Query: 65 MTEEGMNHVRNEINANHSDIGFMVNSAGIFIPKPFIEHDEADYDMYLDLNRATFFITQAV 124 ++ + I I +VN AG+ P + +++ +N F Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 125 VKNMLAAKREGSIVNVGSIGAQAALAGSPATAYSMAKAGLHAVTRNLAIELAHSGIRVNA 184 V + +R GSIV VGS A AY+ +KA T+ L +ELA IR N Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTS--MAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184 Query: 185 VSPGIVHTSIYEG-FMDKDAIPEAMK-SLNNFH---PLGRVGVPEDVANTILFLLSDKTS 239 VSPG T + + D++ + +K SL F PL ++ P D+A+ +LFL+S + Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244 Query: 240 WVTGAIWDVDAG 251 +T VD G Sbjct: 245 HITMHNLCVDGG 256
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 65.0 bits (158), Expect = 2e-15 Identities = 31/177 (17%), Positives = 56/177 (31%), Gaps = 10/177 (5%) Query: 1 MSTRSDLLTSAEVLLRTKGYAAFSYADLADDIGIKKASIHHHFPTKEGLAIAIVESYLFR 60 TR +L A L +G ++ S ++A G+ + +I+ HF K L I E Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESN 69 Query: 61 FKKQLDA-INDEHVSFLDRLNAFALMFAHSSQNGMLPLCGALAAELLALPESLKEMTK-- 117 + L L + S+ L + E + EM Sbjct: 70 IGELELEYQAKFPGDPLSVLREILIHVLESTVTE--ERRRLLMEIIFHKCEFVGEMAVVQ 127 Query: 118 ----DFFEIHLTWLQANIKLGQDRGELKADLDVIRVSRFILNTLEGASFVSWAMSDD 170 + ++ +K + L ADL R + + + G +W + Sbjct: 128 QAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGL-MENWLFAPQ 183
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 35.0 bits (80), Expect = 9e-04 Identities = 17/88 (19%), Positives = 22/88 (25%), Gaps = 4/88 (4%) Query: 571 DLPEPPKVPDLPGQVGAPVPGPQLPAVVTTPPAGTVPGAKVASAAPAPASQPPKPLALVP 630 DL P V P PV P+ P P P P Sbjct: 59 DLEPPQAVQPPP----EPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQP 114 Query: 631 AAVARSAPAQGAAARVQVKPAPPISLPQ 658 + ++ A+ PA P S Sbjct: 115 KRDVKPVESRPASPFENTAPARPTSSTA 142 Score = 30.3 bits (68), Expect = 0.023 Identities = 15/94 (15%), Positives = 23/94 (24%) Query: 594 LPAVVTTPPAGTVPGAKVASAAPAPASQPPKPLALVPAAVARSAPAQGAAARVQVKPAPP 653 L V P ++ APA P P + K AP Sbjct: 33 LYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPV 92 Query: 654 ISLPQPNVLPFKPLQMPAPQISQADPIMLPPASA 687 + KP + + + D + A Sbjct: 93 VIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPA 126 Score = 29.9 bits (67), Expect = 0.028 Identities = 22/137 (16%), Positives = 41/137 (29%), Gaps = 5/137 (3%) Query: 589 VPGPQLPAVVTTPPAGTVPGAKVASAAPAPASQPPKPLALVPAAVARSAPAQGAAA---R 645 +P P P VT + + P P +P +P + + Sbjct: 43 LPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPK 102 Query: 646 VQVKPAPPISLPQPNVLPFKPLQMPAPQISQ-ADPIMLPPASADLAFSMPTKTALPERVE 704 + KP + P+ +V P + + + A P +A + Sbjct: 103 PKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSR 162 Query: 705 KVIELPARS-DKGIEAR 720 + PAR+ IE + Sbjct: 163 NQPQYPARAQALRIEGQ 179
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 24.1 bits (52), Expect = 0.036 Identities = 9/49 (18%), Positives = 18/49 (36%), Gaps = 6/49 (12%) Query: 12 LLGKKRIITNRLNTLR------DSTTKAERSDLIDEIDTLITEMYNLTK 54 L+GK + N+ T D +D+I+ ++ +L Sbjct: 132 LIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 28.0 bits (62), Expect = 0.032 Identities = 10/51 (19%), Positives = 14/51 (27%) Query: 75 PAIEQPAVEAQTPETDSEPALPASTPSATLRQEPYVVPTPAPATTAAQNAP 125 +E P PE EP ++ P V+ P P Sbjct: 58 ADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPV 108
>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature. Length = 372 Score = 27.0 bits (59), Expect = 0.017 Identities = 11/45 (24%), Positives = 22/45 (48%) Query: 67 NDIEMIFPQEKIPPEKRIFVVNASEGSVSKDFIKEWKLYLPCLTD 111 N E + E +P K+I + + G +DF+++ + P +D Sbjct: 80 NSFERVLEDEALPKAKQILKLISVHGGALEDFLRQARSLFPDPSD 124
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 34.7 bits (79), Expect = 5e-04 Identities = 33/109 (30%), Positives = 49/109 (44%), Gaps = 12/109 (11%) Query: 156 WKELRDIALPLLDALAYAHARGVLHGDMKPSNVMLSEDGVRLFDFGLGQAEEGVMPGLPH 215 W ++ IA LLD + GV+H D+KP NV +FD G+ + GL Sbjct: 244 WGTIKFIAHRLLDVTNHLAKAGVVHNDIKPGNV--------VFDRASGEPVV-IDLGLHS 294 Query: 216 LSRDRFNAWTPGYAAPELLEGQT-LSASADVYGVACVIFELAGG--KHP 261 S ++ +T + APEL G S +DV+ V + G K+P Sbjct: 295 RSGEQPKGFTESFKAPELGVGNLGASEKSDVFLVVSTLLHCIEGFEKNP 343
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 402 bits (1034), Expect = e-138 Identities = 143/377 (37%), Positives = 202/377 (53%), Gaps = 36/377 (9%) Query: 162 SFALGQLNLLQRLHQPVDEVRPAVVSTPSISGYGLIGKSASMRQTYSMISKVLHSPYTVL 221 F L +L + + RP+ + S G L+G+SA+M++ Y ++++++ + T++ Sbjct: 105 PFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLM 164 Query: 222 LRGETGTGKEVVARAIHDFGPRRSQAFIVQNCAAFPENLLESELFGYCKGAFTGADRDRT 281 + GE+GTGKE+VARA+HD+G RR+ F+ N AA P +L+ESELFG+ KGAFTGA T Sbjct: 165 ITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRST 224 Query: 282 GLFEAANGGTLLLDEIGDMPLSLQAKLLRVLQEGEIRPLGSNDTRKIDVRILAATHRDLA 341 G FE A GGTL LDEIGDMP+ Q +LLRVLQ+GE +G + DVRI+AAT++DL Sbjct: 225 GRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLK 284 Query: 342 VMVSEGKFREDLYYRLAQFPIELPALRHREGDILDLARHFADKTCAFLQRGALRWSDAAL 401 +++G FREDLYYRL P+ LP LR R DI DL RHF + R+ AL Sbjct: 285 QSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEK-EGLDVKRFDQEAL 343 Query: 402 DHLSGYAFPGNVRELKGLVERAVLLCEGNELLAEHFSLR--------------------- 440 + + + +PGNVREL+ LV R L + + E Sbjct: 344 ELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLS 403 Query: 441 -PDAVPE-------------DSSLNLRERLEQVERSLLLDCLRKNDGNQTLSARELGLPR 486 AV E S L ++E L+L L GNQ +A LGL R Sbjct: 404 ISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNR 463 Query: 487 RTLLYRLGRLNINLGDF 503 TL ++ L +++ Sbjct: 464 NTLRKKIRELGVSVYRS 480
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 35.0 bits (80), Expect = 0.002 Identities = 18/84 (21%), Positives = 28/84 (33%), Gaps = 8/84 (9%) Query: 376 VAIAPPKPAVTATAPKKKPPISGTVEPDAQVQARGKKKNATKVEQKEHVDDAPAQSKNPA 435 + P + V PK KP Q Q K++ VE + +P ++ PA Sbjct: 78 IPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQ---PKRDVKPVESRP---ASPFENTAPA 131 Query: 436 DEPAEPAKKTCTNGDPVSMVTGEE 459 + A PV+ V Sbjct: 132 RLTSSTATA--ATSKPVTSVASGP 153
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 30.6 bits (69), Expect = 0.010 Identities = 20/67 (29%), Positives = 28/67 (41%), Gaps = 11/67 (16%) Query: 15 RVAATP--------ETIKKLISQGHSVTVQSGAGIHASVPDSAYEAAGAAISGADDTFAS 66 RV +P ETIKKL+ +G V G G+ + D + A I D A Sbjct: 163 RVVPSPDPKGHVEAETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVI---DKDLAG 219 Query: 67 ELILKVV 73 E + + V Sbjct: 220 EKLAEEV 226
>PF05946#Toxin-coregulated pilus subunit TcpA Length = 199 Score = 25.3 bits (55), Expect = 0.050 Identities = 16/45 (35%), Positives = 21/45 (46%), Gaps = 2/45 (4%) Query: 18 TGSAHAFDSTTQGLVKTGYATSQVSSSPF--DNKQIMAAQDDAAA 60 T A A T GLV G +S + +PF N I + +AAA Sbjct: 60 TADATAASKLTSGLVSLGKISSDEAKNPFIGTNMNIFSFPRNAAA 104
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 32.0 bits (72), Expect = 0.008 Identities = 27/104 (25%), Positives = 42/104 (40%), Gaps = 5/104 (4%) Query: 535 NADKTDKKAQRQQAAALRQQLAPHKREADKLERDLGTLHEKLAKVEEALA----DSANYD 590 +A + KK + L +Q + L RDL E ++E + + Sbjct: 319 DASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISE 378 Query: 591 AANKDKLRDLLAEQAKLKVRESELEDAWMQALELLESMQAELEA 634 A+ + RDL A + K E LE+A + L LE + ELE Sbjct: 379 ASRQSLRRDLDASREAKKQVEKALEEANSK-LAALEKLNKELEE 421
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 802 bits (2073), Expect = 0.0 Identities = 239/1064 (22%), Positives = 446/1064 (41%), Gaps = 59/1064 (5%) Query: 5 LIKFAIEQRIVVMLAVLLMAGLGIASYQKLPIDAVPDITNVQVQINTSAPGFSPLETEQR 64 + F I + I + +++ G + +LP+ P I V ++ + PG + Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 65 ITFAIETNMAGLPGLQQTRSLSRS-GLSQVTVIFEDGTDLFFARQQVNERLQIAKDQLPE 123 +T IE NM G+ L S S S G +T+ F+ GTD A+ QV +LQ+A LP+ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 124 GVETMMGPVSTGLGEIFLWTVEAREGARKEDGTPYTPTDLRVIQDWIIKPQLRNVPGVAE 183 V+ V +L D T D+ +K L + GV + Sbjct: 121 EVQQQGISVEKSSS-SYLMVA-----GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGD 174 Query: 184 INTIGGFAKQYEIAPDPKKLAAYKLTLNDLVAALERNNANVGAGYIERGGE------QLL 237 + G I D L YKLT D++ L+ N + AG + Sbjct: 175 VQLFGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233 Query: 238 IRAPGQLENIDDIANIVI-ANVQGTPIRVSSVAEVGIGKEMRSGAATENGREVVLGTVFM 296 I A + +N ++ + + N G+ +R+ VA V +G E + A NG+ + + Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293 Query: 297 LIGENSRTVSQAVAAKLADINRTLPEGIEAVTVYDRTNLVEKAIATVKKNLVEGAILVIA 356 G N+ ++A+ AKLA++ P+G++ + YD T V+ +I V K L E +LV Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353 Query: 357 ILFLFLGNIRAALITAMVIPLAMLFTFTGMFANKVSANLMSLG--ALDFGIIVDGAVVIV 414 +++LFL N+RA LI + +P+ +L TF + A S N +++ L G++VD A+V+V Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413 Query: 415 ENSIRRLAHAQQKHGRMLTRAERFHEVFAAAKEARRPLIFGQLIIMVVYLPIFALTGVEG 474 EN R + + + + + L+ +++ V++P+ G G Sbjct: 414 ENVERVMMEDK---------LPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 475 KMFHPMAFTVVIALLGAMILSVTFVPAAIAMFVTGKVKEEE----GFVMRTAR------Q 524 ++ + T+V A+ ++++++ PA A + E GF Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVN 524 Query: 525 RYAPVLGWVLGHRWIAFTLAFVVMVLSGFTASRMGSEFIPSLSEGDFALQALRVPGTSL- 583 Y +G +LG + +++ R+ S F+P +G F G + Sbjct: 525 HYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQE 584 Query: 584 -TQSVDMQQRLEKAVIEKMPEVERMFARTGTAEIAADPMPPNISDSYVMLKPQSEWPDLD 642 TQ V + Q + + + VE +F G + N ++V LKP E + Sbjct: 585 RTQKV-LDQVTDYYLKNEKANVESVFTVNG---FSFSGQAQNAGMAFVSLKPWEERNGDE 640 Query: 643 KSRETLIAELQKAAASVPGSNYELSQPIQLRFNELVSGVRSDVA-VKVFGDDMTVLNQTA 701 S E +I + + EL + D + G L Q Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNM--PAIVELGTATGFDFELIDQAGLGHDALTQAR 698 Query: 702 AKIAAAMQKVPGA-SEVKVEQTTGLPVLTINIDRDKAARYGLNVADVQDAIATALGGRQA 760 ++ + P + V+ + +D++KA G++++D+ I+TALGG Sbjct: 699 NQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYV 758 Query: 761 GTLYEGDRRFDMVVRLSEQLRTDVAGLSSLLIPVPASAGSINQQISFISLSQVASLDLVL 820 + R + V+ + R + L V ++ G + S + V Sbjct: 759 NDFIDRGRVKKLYVQADAKFRMLPEDVDKL--YVRSANG------EMVPFSAFTTSHWVY 810 Query: 821 GPNQISRENGKRLVIVSANVRGRDLGSFVEEAGQTIDS-SVQIPAGYWTNWGGQFEQLQS 879 G ++ R NG + + G+ +A +++ + ++PAG +W G Q + Sbjct: 811 GSPRLERYNGLPSMEIQGEAAP---GTSSGDAMALMENLASKLPAGIGYDWTGMSYQERL 867 Query: 880 AAKRLQIVVPVALLLVLALLFLMFNNLKDGLLVFTGIPFALTGGVMALWLRDIPLSISAG 939 + + +V ++ ++V L ++ + + V +P + G ++A L + + Sbjct: 868 SGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFM 927 Query: 940 VGFIALSGVAVLNGLVMISFIRNLRE-EGRSLHDAITEGALTRLRPVLMTALVASLGFIP 998 VG + G++ N ++++ F ++L E EG+ + +A RLRP+LMT+L LG +P Sbjct: 928 VGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLP 987 Query: 999 MALATGTGAEVQRPLATVVIGGILSSTALTLLVLPALYQWAHRR 1042 +A++ G G+ Q + V+GG++S+T L + +P + R Sbjct: 988 LAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 44.0 bits (104), Expect = 7e-07 Identities = 24/120 (20%), Positives = 43/120 (35%), Gaps = 13/120 (10%) Query: 108 PMSTSVTFPGEIRFDEDRTAHVVPRVGGVVESVKVELGQSVKKGQVLAVIASQQISDQRS 167 + T G++ + P +V+ + V+ G+SV+KG VL + + + Sbjct: 79 QVEIVATANGKLTHSGRSKE-IKPIENSIVKEIIVKEGESVRKGDVLLKLTALG---AEA 134 Query: 168 ELNAAQRRQELARLTLQR---------EKKLWEDRISAEQDYQQARQAFQEADISLSNAR 218 + Q ARL R KL E ++ E +Q + SL + Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194 Score = 39.0 bits (91), Expect = 3e-05 Identities = 29/203 (14%), Positives = 67/203 (33%), Gaps = 30/203 (14%) Query: 158 ASQQISDQRSELNAAQRRQELARLTLQREKKLWEDRISAEQDYQQARQAFQEADISLSNA 217 A ++ +S+L + A+ Q +L+++ I +Q + L+ Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEIL--DKLRQTTDNIGLLTLELAKN 321 Query: 218 RQKLSAIGASVSPTAGNRYELIAPFDAMVVE-KHLAIGEVVSDASNAFTLS-DLSRVWAT 275 ++ + AP V + K G VV+ A + + + T Sbjct: 322 EERQQ------------ASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVT 369 Query: 276 FGVAPKDLDKVIVGRPVSVSAPDLN----ARVEGRIGYVG--SLLGEQT------RAATV 323 V KD+ + VG+ + + G++ + ++ ++ + Sbjct: 370 ALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIE 429 Query: 324 RVTL--ANPNGAWRPGLFVSVDV 344 L N N G+ V+ ++ Sbjct: 430 ENCLSTGNKNIPLSSGMAVTAEI 452
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 83.0 bits (205), Expect = 1e-20 Identities = 31/117 (26%), Positives = 60/117 (51%), Gaps = 1/117 (0%) Query: 2 RILVVEDEPKTAEYMHQGLTESGYVVDIAATGLDGLYLAQHQAYDIVILDVNLPEMDGWE 61 ILV +D+ ++Q L+ +GY V I + D+V+ DV +P+ + ++ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VLARLRKT-VNTRIMMVTARGRLEEKVKGLEMGADDYLVKPFEFPELLARVRTLMRR 117 +L R++K + +++++A+ +K E GA DYL KPF+ EL+ + + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>PF06580#Sensor histidine kinase Length = 349 Score = 36.8 bits (85), Expect = 1e-04 Identities = 36/155 (23%), Positives = 53/155 (34%), Gaps = 32/155 (20%) Query: 315 EPIDLREESEKVA---ELFSASAEDR-DITLQIEGNGKAMGDRLMIQRAISNLLSNAIRH 370 + L +E V +L S EDR QI A+ D + + L+ N I+H Sbjct: 214 RQVSLADELTVVDSYLQLASIQFEDRLQFENQIN---PAIMDVQVPPMLVQTLVENGIKH 270 Query: 371 G----ASGTAITIRIVTHVEDITLAVRNAGEGIDAEHLPRLFDRFYRVHVSRARQQGGTG 426 G G I ++ +TL V N G + TG Sbjct: 271 GIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------------TKESTG 312 Query: 427 LGLAIVRSIMSL---HEGQVKAESEPGRFTTFSLI 458 GL VR + + E Q+K + G+ LI Sbjct: 313 TGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 38.3 bits (89), Expect = 3e-06 Identities = 19/101 (18%), Positives = 40/101 (39%), Gaps = 9/101 (8%) Query: 37 FAKLYERINHEMEEEAQHADALMRRILMLEGTP---------RMRPDDLDIGTTVPEMLA 87 F L+E+ + A+ D + R+L + G P D T+ EM+ Sbjct: 43 FFTLHEKFEELYDHAAETVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQ 102 Query: 88 SDLRLEYKVRAALCKGIALCELHKDYISRDILRVQLADTEE 128 + + ++ + I L E ++D + D+ + + E+ Sbjct: 103 ALVNDYKQISSESKFVIGLAEENQDNATADLFVGLIEEVEK 143
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 74.9 bits (184), Expect = 1e-16 Identities = 79/358 (22%), Positives = 139/358 (38%), Gaps = 33/358 (9%) Query: 29 LGMFMVLPVLATYGMDL--AGASPALIGLAIGAYGLTQAVLQIPFGIISDRIGRRPVIYL 86 +G+ +++PVL DL + A G+ + Y L Q G +SDR GRRPV+ + Sbjct: 19 VGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLV 78 Query: 87 GLIIFAIGSVVAANADSIWGIIAGRILQG-AGAISAAVMALLSDLTREQHRTKAMAMIGM 145 L A+ + A A +W + GRI+ G GA A A ++D+T R + + Sbjct: 79 SLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSA 138 Query: 146 TIGLSFAVAMVVGPVITGVFGLSGL---FLATGGMALLGILIIAFIVPKANGPLLHRESG 202 G MV GPV+ G+ G F A + L L F++P+++ Sbjct: 139 CFG----FGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRR 194 Query: 203 VAKQALGQTLRHPDLLRLDLGIFVLHAMLMSSFVA-----LPLALVEKAGLPKEQHW--- 254 A L R G+ V+ A++ F+ +P AL G + HW Sbjct: 195 EALNPLAS-------FRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDR-FHWDAT 246 Query: 255 ----WVYLTALLISFFAMIPFIIYGEKKRQMKRVLLGAVTVLMVSELYFWAFGNTLRTLV 310 + +L S + + + + ++LG + L +A T + Sbjct: 247 TIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA---TRGWMA 303 Query: 311 IGTVVFFTAFNLLEASLPSLISKVSPAGGKGTAMGVYSTSQFLGSAAGGILGGWLFQH 368 +V + + +L +++S+ +G G + L S G +L ++ Sbjct: 304 FPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361
>PERTACTIN#Pertactin signature. Length = 922 Score = 30.5 bits (68), Expect = 0.004 Identities = 17/41 (41%), Positives = 21/41 (51%) Query: 135 QSAPRPQQSRPQQSAPPQQNYNQQPPQQRESRPAPQQQAPQ 175 Q P+P PQ PPQ QPPQ++ PAPQ A + Sbjct: 576 QPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGR 616 Score = 28.1 bits (62), Expect = 0.027 Identities = 18/48 (37%), Positives = 21/48 (43%), Gaps = 1/48 (2%) Query: 132 APNQSAPRPQQSRPQQSAPPQQNYNQQPPQQRE-SRPAPQQQAPQPAA 178 AP P PQ PPQ QPPQ + + P+ APQP A Sbjct: 567 APPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPA 614
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.028 Identities = 11/32 (34%), Positives = 17/32 (53%) Query: 47 VILGPSGCGKSTLLRMIAGLEDVTQGQILMGE 78 V+ G G GKSTL+ + GL+ + +G Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 27.5 bits (61), Expect = 0.032 Identities = 13/54 (24%), Positives = 26/54 (48%), Gaps = 1/54 (1%) Query: 13 LMRQWRDDDLPAFAAMCADPQVMRYFPEPLSRLESAAMIGRMRGHFAELGFGLW 66 ++RQ R+ DL FA + + P+ L A + ++ F ++GF ++ Sbjct: 138 MLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAF-QIGFTIF 190
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.1 bits (73), Expect = 0.004 Identities = 34/136 (25%), Positives = 54/136 (39%), Gaps = 10/136 (7%) Query: 60 LAQFIPMLLLLMP-AGDLIDRYNRKVILMISWGVQAVCGLILLVFSAMNLQDLRLIYGAL 118 LA + M P G L DR+ R+ +L++S AV I+ L ++Y Sbjct: 49 LALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LWVLYIGR 103 Query: 119 MLYGCARAFTGPALQSLLPQIVPREQLASAIATNSVIMRCSTVGGPLIGGYLYWLGGAEL 178 ++ G A TG + + I ++ A S V GP++GG +GG Sbjct: 104 IVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGL---MGGFSP 159 Query: 179 TYSVCVAAFIAGILLL 194 AA + G+ L Sbjct: 160 HAPFFAAAALNGLNFL 175 Score = 32.1 bits (73), Expect = 0.004 Identities = 30/133 (22%), Positives = 54/133 (40%), Gaps = 18/133 (13%) Query: 70 LMPAGDLIDRYNRKVILMISWGVQAVCGLILLVFSAMNLQDLRLIYGALMLYGCARAFTG 129 M G + R + LM+ G ILL F+ + + ++L Sbjct: 264 AMITGPVAARLGERRALMLGMIAD-GTGYILLAFAT----RGWMAFPIMVLLASG-GIGM 317 Query: 130 PALQSLLPQIVPREQLASAIATNSVIMRCSTVGGPLIGGYLY-----------WLGGAEL 178 PALQ++L + V E+ + + + +++ GPL+ +Y W+ GA L Sbjct: 318 PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAAL 377 Query: 179 TYSVCVAAFIAGI 191 Y +C+ A G+ Sbjct: 378 -YLLCLPALRRGL 389
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 29.9 bits (67), Expect = 0.017 Identities = 10/44 (22%), Positives = 26/44 (59%), Gaps = 3/44 (6%) Query: 242 AALSNLEDNAVDNVTLVRLSAEELTQALNEVRPFRRLQGVDLKS 285 AALSN + + V++ ++++ Q ++++PF + G+++ Sbjct: 248 AALSNANK---PSASPVKVLSDKIIQIYSDIKPFADIAGINVPD 288
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 29.5 bits (66), Expect = 0.032 Identities = 35/143 (24%), Positives = 55/143 (38%), Gaps = 25/143 (17%) Query: 188 LKVKGAVLIGILAVTIAS-IALGFSEFGGVVSMPPSLAPTFMQLDIMGALDVGLVSIIFA 246 KV G V GI IA A G S + I A+ + + + F Sbjct: 277 TKVLGNVGKGISQYIIAQRAAQGLSTSAAAAGL------------IASAVTLAISPLSFL 324 Query: 247 FLFVDIFDNSGTLIGVAKRAGLMGKDGHMPKMGRALIAD---STAAMAGSLLGTSTTTSY 303 + D F + + ++R +G DG +L+A T A+ SL ST + Sbjct: 325 SI-ADKFKRANKIEEYSQRFKKLGYDGD------SLLAAFHKETGAIDASLTTISTVLAS 377 Query: 304 IESAAGVSAGGRTGLTAIVVAVL 326 + ++G+SA T L V+ L Sbjct: 378 V--SSGISAAATTSLVGAPVSAL 398
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 41.4 bits (97), Expect = 3e-07 Identities = 15/46 (32%), Positives = 31/46 (67%) Query: 2 KQTGFTLIELLVVVALVAILANVAMPSLTGVIDSNRRLAAAQELAS 47 KQ GFTL+E++VV+ ++ +LA++ +P+L G + + A ++ + Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVA 51
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 41.4 bits (97), Expect = 2e-07 Identities = 16/55 (29%), Positives = 35/55 (63%), Gaps = 3/55 (5%) Query: 6 KGFSLIELLVTVSLVGILAAIAIPNFTSTL---QSNKADTELNDLQRALNYARLE 57 +GF+L+E++V + ++G+LA++ +PN KA +++ L+ AL+ +L+ Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLD 62
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 28.7 bits (64), Expect = 0.008 Identities = 9/28 (32%), Positives = 19/28 (67%), Gaps = 2/28 (7%) Query: 4 KPRHRQSGMTLIEVLVSVLILAIGLLGA 31 + +Q G TL+E++V ++I+ G+L + Sbjct: 2 RATDKQRGFTLLEIMVVIVII--GVLAS 27
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 31.8 bits (72), Expect = 0.001 Identities = 21/63 (33%), Positives = 34/63 (53%), Gaps = 1/63 (1%) Query: 6 RGFGLVEIMVALVLGLVVSLGIVQIFTASRATYQSQNASARMQEDARFVLSKMIQEIRMT 65 RGF L+E+M+ L+L V + ++ F ASR +Q AR + RFV + +Q + Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTL-ARFEAQLRFVQQRGLQTGQFF 62 Query: 66 GMY 68 G+ Sbjct: 63 GVS 65
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 47.6 bits (113), Expect = 7e-10 Identities = 21/66 (31%), Positives = 37/66 (56%), Gaps = 2/66 (3%) Query: 1 MRATS--RGFTLIELMIVVAIVGILAAVAYPSYTEYVRRTHRAEIASLLSEQTQALERFY 58 MRAT RGFTL+E+M+V+ I+G+LA++ P+ + + + S + AL+ + Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60 Query: 59 SRSGTY 64 + Y Sbjct: 61 LDNHHY 66
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 495 bits (1275), Expect = e-175 Identities = 175/476 (36%), Positives = 253/476 (53%), Gaps = 33/476 (6%) Query: 3 PRQKILIVDDEPDIRELLEITLGRMKLDTRSACNVAEARQCLAREAFDLCLTDMRLPDGN 62 IL+ DD+ IR +L L R D R N A + +A DL +TD+ +PD N Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 63 GLELVQHIQQNFAHVPVAMITAHGSLDTAIHALKAGAFDFLTKPVDLGRLRELVNSALRL 122 +L+ I++ +PV +++A + TAI A + GA+D+L KP DL L ++ AL Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 123 TPVVQPIRALDNR----LLGDSPPMRILRGQIAKLARSQAPVYISGESGSGKELVARLIH 178 D++ L+G S M+ + +A+L ++ + I+GESG+GKELVAR +H Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALH 181 Query: 179 EQGPRGEKPFVPVNCGAIPSDLMESEFFGHRKGSFTGAHEDKPGLFQAAQNGTLFLDEVA 238 + G R PFV +N AIP DL+ESE FGH KG+FTGA G F+ A+ GTLFLDE+ Sbjct: 182 DYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIG 241 Query: 239 DLPLAMQVKLLRAIQEKSIRSVGGQQEQVVDVRILCATHKDLNVEVAAGRFRQDLYYRLN 298 D+P+ Q +LLR +Q+ +VGG+ DVRI+ AT+KDL + G FR+DLYYRLN Sbjct: 242 DMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLN 301 Query: 299 VIELRVPSLRERREDIDQLAAIVLQRLATNSGLPAARLDAQALDTLKNYRFPGNVRELEN 358 V+ LR+P LR+R EDI L +Q+ GL R D +AL+ +K + +PGNVRELEN Sbjct: 302 VVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVRELEN 360 Query: 359 MLERAYTLCENDEIHASDLRL-TESARPQESDGPNLADIDN------------------- 398 ++ R L D I + S P A + Sbjct: 361 LVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFG 420 Query: 399 --------LEDYLEGIERKLILQALEETRWNRTAAAQRLSLSFRSMRYRLKKLGLD 446 + L +E LIL AL TR N+ AA L L+ ++R ++++LG+ Sbjct: 421 DALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 30.5 bits (68), Expect = 0.015 Identities = 16/45 (35%), Positives = 24/45 (53%), Gaps = 1/45 (2%) Query: 465 PEHGLVPPDVFIPLAEQNGTIIALGEWVLDQACRQLR-EWHDQGF 508 P+ P F+P+ E TII E L+Q QL+ + H+QG+ Sbjct: 12 PDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGY 56
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 116 bits (292), Expect = 3e-32 Identities = 81/363 (22%), Positives = 131/363 (36%), Gaps = 70/363 (19%) Query: 1 MKILVTGASGFIGGRFARFALEQGMSVR----IN-----GRRAEGVEHLVRRGAEFVQGD 51 MK LVTGA+GFIG ++ LE G V +N + +E L + G +F + D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 52 LTDPQLVRALCDD--VDAVVHCAGSVGV---WGRRQDFMLGNVQVTENIVEGCLKQRVPR 106 L D + + L + V + V + N+ NI+EGC ++ Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 107 LVHLSSPSIYFDGHSRQ-GIKEEQVSKRFHNHYAASKYLAEQKVFGAQE-FGLEVIALRP 164 L++ SS S+Y G +R+ + + YAA+K E +GL L Sbjct: 121 LLYASSSSVY--GLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGL-- 176 Query: 165 RFVT-----GAGDNSIFPRLLRMQQKKRLSIVGNGLNKVDFTSMQNLNEAMLSSL----- 214 RF T G D ++F M + K + + G K DFT + ++ EA++ Sbjct: 177 RFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPH 236 Query: 215 -----------LATGSALGKAYNISNGAPVPLWDAINYVMRQMQLPQVTRYRSYGLAYTA 263 A A + YNI N +PV L D I + + + Sbjct: 237 ADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLP------- 289 Query: 264 AAINEGACMLWPGRPEPTLSRLGMQVMNKDFTLDISRAMHYLDYQPRVSLWAALDEFCGW 323 L PG T + D + + P ++ + F W Sbjct: 290 ---------LQPGDVLETSA-------------DTKALYEVIGFTPETTVKDGVKNFVNW 327 Query: 324 WKA 326 ++ Sbjct: 328 YRD 330
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.3 bits (71), Expect = 0.004 Identities = 24/211 (11%), Positives = 63/211 (29%), Gaps = 20/211 (9%) Query: 75 QVSLMEQQLVATQESFAR--ISEEAAGRLQDISGKVVATESLSSDGEALKQR-IKLLEAQ 131 + L+ + R I + + K+ + E R L++ Q Sbjct: 135 DTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194 Query: 132 LEDQDKQREGVEGQQSSLDKRLEQMAAQTTQQQTENAQLQEQLKGVVTELTALKAALPDL 191 Q+ E + A+ + + + + +L + L A + Sbjct: 195 FSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAV 254 Query: 192 KTAQADQGKLDTQLKSLAADVATLKKQGNPSAAVERLEQDLIVLKSEQENRPAPAAAGNT 251 + + +L+ + + ++ + + +++ ++ +N Sbjct: 255 LEQENKYVEAVNELRVYKSQLEQIESE------ILSAKEEYQLVTQLFKN---------- 298 Query: 252 AEFDAFRAQVTRNINTLTSQIQNLSQQLNAR 282 E Q T NI LT ++ ++ A Sbjct: 299 -EILDKLRQTTDNIGLLTLELAKNEERQQAS 328
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 35.7 bits (82), Expect = 9e-05 Identities = 24/86 (27%), Positives = 27/86 (31%) Query: 48 PPVKPPVKPPVKPPVKPPVKPPVEPPVKPPVKPPVKPPVKPPVKAPIKPPVKPTEKPPVE 107 P P PV P P P P P V KP KP K K +P Sbjct: 58 PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPV 117 Query: 108 MKMPADPAPPSRLEKLLSIVNTATGV 133 PA P + +L S TA Sbjct: 118 ESRPASPFENTAPARLTSSTATAATS 143 Score = 34.6 bits (79), Expect = 2e-04 Identities = 29/111 (26%), Positives = 38/111 (34%), Gaps = 3/111 (2%) Query: 33 PAVTPGEPMVKPPAKPPVKPPVKPPVKPPVKPPVKPPVEPPVKPPVKPPVKPPVKPPVKA 92 PA V+PP +P V+P +P P +E P P KP KP K + Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKP-KPKPKPVKKVQEQP 110 Query: 93 PIKPPVKPTEKPPVEMKMPADPAPPSRLEKLLSIVNTATGVLQLVHPLNSV 143 K VKP E P PA + + T V L+ Sbjct: 111 --KRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRN 159
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 56.8 bits (137), Expect = 6e-11 Identities = 50/218 (22%), Positives = 90/218 (41%), Gaps = 5/218 (2%) Query: 5 LFILALSAFAIGTTEFVIMGLLPDVAADLGVSIPGAGWLVTGYALGVAIGAPFMAMATAR 64 L L + +F E V+ LPD+A D W+ T + L +IG + + Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75 Query: 65 LPRKAALVTLMGIFIVGNLLCALA-SDYDVLMFARVVTALCHGAFFGIGSVVAAGLVPAN 123 L K L+ + I G+++ + S + +L+ AR + AF + VV A +P Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135 Query: 124 RRASAVALMFTGLTLANVLGVPLGTALGQYAGWRSTFWAVTVIGVIALIGLIRFLPTN-R 182 R A L+ + + + +G +G + Y W S + +I +I + L++ L R Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW-SYLLLIPMITIITVPFLMKLLKKEVR 194 Query: 183 DEEKLDMRAELAALKGAGIWLSLTMTALFSASMFALFT 220 + D++ L GI + T +S S + Sbjct: 195 IKGHFDIKG--IILMSVGIVFFMLFTTSYSISFLIVSV 230
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 37.2 bits (86), Expect = 1e-04 Identities = 72/391 (18%), Positives = 133/391 (34%), Gaps = 58/391 (14%) Query: 76 IGGWLFGRVADKHGRKNSMLISVTMMCAGSLIIACLPTYASIGAWAPALLLMARLLQGLS 135 IG ++G+++D+ G K +L + + C GS+I ++ S+ L+MAR +QG Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSL-------LIMARFIQGAG 116 Query: 136 VGG----EYGTTATYMSEVALRGQRGFYASFQYVT-----LIGGQLL------------- 173 A Y+ + G S + IGG + Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM 176 Query: 174 -AVLTVVILQQFLTTEELRDYGWRIPFVIGAAAAVIALLLRRTLNETT------------ 220 ++TV L + L E + I +I + ++ +L T + Sbjct: 177 ITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIF 236 Query: 221 TAESRKDKDAGSITALFKHHKAAFITVLGYTAGGSLI-FYTFTTYMQKYLVNTGGMEAKT 279 RK D L K+ + G G++ F + YM K + A+ Sbjct: 237 VKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDV--HQLSTAEI 294 Query: 280 ASYIMTGALFLYMCMQPFFGMLADRIGRRNSMLWFGALGTLCTVPILMTLKTNTNPFMAF 339 S I+ + G+L DR G +L G + L T+ FM Sbjct: 295 GSVIIFPGTMSVIIFGYIGGILVDRRGPLY-VLNIGVTFLSVSFLTASFLLETTSWFMTI 353 Query: 340 VLITLALAIVSFYTSISGLVKAEMFPPQVRA----------LGVGLAYAVANAAFGGSAE 389 +++ + + T IS +V + + + A L G A+ S Sbjct: 354 IIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL--SIP 411 Query: 390 FVALKLKSAGMENSFYWYVTAMMAIAFLFSL 420 + +L ++ S Y Y ++ + + + Sbjct: 412 LLDQRLLPMEVDQSTYLYSNLLLLFSGIIVI 442
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 30.2 bits (68), Expect = 0.023 Identities = 15/92 (16%), Positives = 29/92 (31%), Gaps = 8/92 (8%) Query: 103 AVSANGSGSRQRVPGDQTQTGQSAITSSYSATLGVSAYELDLFG------RVRSLSQQAL 156 A+S + + + +P D GQS + Y+ +L S + L G + + Sbjct: 436 ALSVDMTQANSTLPDDSQHDGQS-VRFLYNKSLNESGTNIQLVGYRYSTSGYFNFA-DTT 493 Query: 157 ETYFASEEARRSTQISLVANVANAYLTWQADK 188 + + V Y +K Sbjct: 494 YSRMNGYNIETQDGVIQVKPKFTDYYNLAYNK 525
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1289 bits (3337), Expect = 0.0 Identities = 667/1038 (64%), Positives = 830/1038 (79%), Gaps = 7/1038 (0%) Query: 1 MSRFFIDRPIFAWVLALVIMLVGTLSIMKLPINQYPAIAPTAIDIQVTYPGASAQTVQDT 60 M+ FFI RPIFAWVLA+++M+ G L+I++LP+ QYP IAP A+ + YPGA AQTVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VVQIIEQQLNGIDNLRYVSSDSNSDGSMTITVTFNQGTNPDTAQVQVQNKLNLATPLLPQ 120 V Q+IEQ +NGIDNL Y+SS S+S GS+TIT+TF GT+PD AQVQVQNKL LATPLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGLRVTKSVKNFLLVIGLVAEDGSLTREDLSNYIVSNIQDPISRTSGVGDFQVFGS 180 EVQQQG+ V KS ++L+V G V+++ T++D+S+Y+ SN++D +SR +GVGD Q+FG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWLDPAKLNNFQLTPVDVTTAVSAQNVQIATGQLGGLPALPGTQLNATIIGKTRL 240 QYAMRIWLD LN ++LTPVDV + QN QIA GQLGG PALPG QLNA+II +TR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 QTAEQFGNIFLKVNADGSQVRLKDVARIELGGQNYSIDAQFNGKPASGMAIKLASGANAL 300 + E+FG + L+VN+DGS VRLKDVAR+ELGG+NY++ A+ NGKPA+G+ IKLA+GANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTAKAIRATISELEPFFPPGMKVVYPYDTTPTVTESISGVVHTLIEAIVLVFLVMYLFLQ 360 DTAKAI+A ++EL+PFFP GMKV+YPYDTTP V SI VV TL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NLRATIITTMTVPVVLLGTFGILAAFGFTINTLTMFGMILAIGLLVDDAIVVVENVERVM 420 N+RAT+I T+ VPVVLLGTF ILAAFG++INTLTMFGM+LAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 AEEHLSPKEATQKSMDQIQGALVGIAMVLSAVLLPMAFFGGSTGVIYKQFSITIVSAMAL 480 E+ L PKEAT+KSM QIQGALVGIAMVLSAV +PMAFFGGSTG IY+QFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALIFTPALCATMLKPIDPEKHGQPKRGFFGWFNRTFDRSVVSYENGVKRMVTHKLP 540 SVLVALI TPALCAT+LKP+ +H + K GFFGWFN TFD SV Y N V +++ Sbjct: 481 SVLVALILTPALCATLLKPV-SAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 541 AFVVYLIIVAGMIWLFTRIPAAFLPEEDQGVIFAQVQTPAGSSAERTQKVIDDMRDFLLD 600 ++Y +IVAGM+ LF R+P++FLPEEDQGV +Q PAG++ ERTQKV+D + D+ L Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL- 598 Query: 601 KENGEGKGVNSVFSVNGFNFAGRGQSSGLAFVMLKPWDERD-AETTVFKIAERAQAHFAS 659 E V SVF+VNGF+F+G+ Q++G+AFV LKPW+ER+ E + + RA+ Sbjct: 599 --KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGK 656 Query: 660 FRDAMVFAVVPPSVLELGNATGFDVYLQDQGGVGHQKLLDARNQFLGMAAQSKI-LAGVR 718 RD V P+++ELG ATGFD L DQ G+GH L ARNQ LGMAAQ L VR Sbjct: 657 IRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716 Query: 719 PNGLNDEPQYQLTVDDEKASALGITLSNINQTLSIALGGSYVNDFIDRGRVKKVYVQGEA 778 PNGL D Q++L VD EKA ALG++LS+INQT+S ALGG+YVNDFIDRGRVKK+YVQ +A Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776 Query: 779 FSRMTPEDLQKWFVRNDSGTMVPLSAIASGEWIYGSPKLSRYNGVAAMEVLGTPAPGYSS 838 RM PED+ K +VR+ +G MVP SA + W+YGSP+L RYNG+ +ME+ G APG SS Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSS 836 Query: 839 GQAMAEVEAIAKKLPAGIGYSFTGLSFEERLSGSQAPALYALSMLVVFLCLAALYESWSI 898 G AMA +E +A KLPAGIGY +TG+S++ERLSG+QAPAL A+S +VVFLCLAALYESWSI Sbjct: 837 GDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 899 PIAVMLVVPLGVIGALMATSLRGLSNDVFFQVGLLVTVGLAAKNAILIVEFAKELHE-QG 957 P++VMLVVPLG++G L+A +L NDV+F VGLL T+GL+AKNAILIVEFAK+L E +G Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 958 KSLVESAMEACRMRLRPIIMTSMAFILGVVPLAISSGAGSGSQHAIGTGVIGGMITATIL 1017 K +VE+ + A RMRLRPI+MTS+AFILGV+PLAIS+GAGSG+Q+A+G GV+GGM++AT+L Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 1018 AIFWVPMFYVAVSSVFKG 1035 AIF+VP+F+V + FKG Sbjct: 1017 AIFFVPVFFVVIRRCFKG 1034
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 46.0 bits (109), Expect = 1e-07 Identities = 38/204 (18%), Positives = 76/204 (37%), Gaps = 26/204 (12%) Query: 97 SVYEASANSAKATLQSAKSMSDRYKQLVNEQAVSRQEYDTALASTQEAQAALQSAQINLR 156 VY++ ++ + SAK QL + + + L + + Sbjct: 269 RVYKSQLEQIESEILSAKEEYQLVTQLFKNEI--LDKLRQTTDNIGLLTLELAKNEERQQ 326 Query: 157 FTKVLAPISGRIGRSAV-TEGALVSNGQTNAMATIQQLDPIYVDVNQSSADMLKLRADLA 215 + + AP+S ++ + V TEG +V+ +T M + + D + V + D+ + Sbjct: 327 ASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTALVQNKDIGFINVGQ- 384 Query: 216 SGRLQKSGDNSASVKLTLEDGSEYPQ-EGKLE--FSEVSVDQATGSVTLRAVFPNPDHM- 271 +A +K+ + Y GK++ + DQ G V + + + Sbjct: 385 ----------NAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLS 434 Query: 272 -------LLPGMFVHAQLKAGVNS 288 L GM V A++K G+ S Sbjct: 435 TGNKNIPLSSGMAVTAEIKTGMRS 458 Score = 41.0 bits (96), Expect = 6e-06 Identities = 32/159 (20%), Positives = 56/159 (35%), Gaps = 27/159 (16%) Query: 55 PGRTTAF-RVAEVRPQVNGIILKRLFTEGGDVKAGQQLYQIDPSVYEASANSAKATLQSA 113 G+ T R E++P N I+ + + EG V+ G L ++ EA +++L A Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQA 146 Query: 114 KSMSDRYK----------------------QLVNEQAVSRQEYDTALASTQEAQAALQSA 151 + RY+ Q V+E+ V R T+L Q + Q Sbjct: 147 RLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRL---TSLIKEQFSTWQNQKY 203 Query: 152 QINLRFTKVLAPISG-RIGRSAVTEGALVSNGQTNAMAT 189 Q L K A + + V + + ++ Sbjct: 204 QKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS 242
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 153 bits (388), Expect = 6e-49 Identities = 78/211 (36%), Positives = 126/211 (59%) Query: 1 MARRTKEEAQITRSQILEAAEQAFYERGVARTTLADIATLAGVTRGAIYWHFNNKADLVQ 60 MAR+TK+EAQ TR IL+ A + F ++GV+ T+L +IA AGVTRGAIYWHF +K+DL Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 AMLDSLQEPLDEMSQASQDEDEEDPLGCMKNLLVHLFHELALDPKTRRINEILFHKCEFT 120 + + + + E+ Q + DPL ++ +L+H+ + + R + EI+FHKCEF Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 121 DEMCDFRRQRQENAIGCHERIQLGLSNAVRQGQLPEDLDTARAAVALFSYVNGIIYQWLL 180 EM ++ ++ + ++RI+ L + + LP DL T RAA+ + Y++G++ WL Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 181 VPDSYSLPSESEQLVEVCMDMLRFSPSLRVP 211 P S+ L E+ V + ++M P+LR P Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNP 211
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 140 bits (353), Expect = 1e-38 Identities = 99/415 (23%), Positives = 186/415 (44%), Gaps = 31/415 (7%) Query: 14 ILFALMMAVFLSALDQTIVAVSMPAISAQF-KDIDLLAWVISAYMVSLTVAVPIYGKLGD 72 IL L + F S L++ ++ VS+P I+ F K WV +A+M++ ++ +YGKL D Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 73 LYGRRKLMLFGLGLFTLASLFCGLAQSM-EQLVLARVLQGIGAGGMVSVSQAIIADIVPP 131 G ++L+LFG+ + S+ + S L++AR +QG GA ++ ++A +P Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 132 RERGRYQGYFSSMYAVASVAGPVLGGLMTEYLSWRWVFLINLPLGIFALVVAWRTLKGLP 191 RG+ G S+ A+ GP +GG++ Y+ W +L+ +P+ ++ L L Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPM---ITIITVPFLMKLL 189 Query: 192 IPQ--RKPIIDYLGTILMIIGLTALLLGITEIGQGHGLDDMQVQALLGVALLTLALFVWY 249 + K D G ILM +G+ +L T L V++L+ +FV + Sbjct: 190 KKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISF----------LIVSVLSFLIFVKH 239 Query: 250 ERRAREPLLPMHLFANR---SAVLCWCTVFFTSFQAISLIVLMPLRYQTVTG-GGADSAA 305 R+ +P + L N VLC +F T + ++P + V A+ + Sbjct: 240 IRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGT---VAGFVSMVPYMMKDVHQLSTAEIGS 296 Query: 306 LHLLPLAMGMPMGAYFAGRRTALTGRYKPLIATGAVLMPIAILGMAFTPPQSIVLMSLFM 365 + + P M + + Y G G ++ G + ++ L +F + M++ + Sbjct: 297 VIIFPGTMSVIIFGYIGGILVDRRGPLY-VLNIGVTFLSVSFLTASFLLETTSWFMTIII 355 Query: 366 ILTGIASGMQFPTSLVGT--QNSVDIRDMGVATSTTNLFRSLGGAVGVALMSALL 418 + G+ F +++ T +S+ ++ G S N L G+A++ LL Sbjct: 356 VFV--LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLL 408
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 35.2 bits (81), Expect = 4e-04 Identities = 22/82 (26%), Positives = 36/82 (43%), Gaps = 7/82 (8%) Query: 82 IGGWLMGLYADYKGRKAALMASVLLMCFGSLIIALTPGYESIGVGAPILLVFARLLQGLS 141 IG + G +D G K L+ +++ CFGS+I + + S+ L+ AR +QG Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSL-------LIMARFIQGAG 116 Query: 142 VGGEYGTSATYLSEMATKERRG 163 ++ KE RG Sbjct: 117 AAAFPALVMVVVARYIPKENRG 138
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 123 bits (311), Expect = 3e-33 Identities = 83/354 (23%), Positives = 149/354 (42%), Gaps = 52/354 (14%) Query: 3 VGIDLGTTNSLVAVWRDGKSELVTNALGDTLTPSVVGLDDEGQ------ILVGKAARERL 56 + IDLGT N+L+ V G +V N PSVV + + VG A++ L Sbjct: 13 LSIDLGTANTLIYVKGQG---IVLN------EPSVVAIRQDRAGSPKSVAAVGHDAKQML 63 Query: 57 QTHPDKTTALFKRYMGSAQQVRLGADTYRPEELSSLVLKSLKADVERAYGEPVTEAVISV 116 P A+ R M + + AD + E++ +K + ++ P ++ V Sbjct: 64 GRTPGNIAAI--RPM----KDGVIADFFVTEKMLQHFIKQVH---SNSFMRPSPRVLVCV 114 Query: 117 PAYFSDAQRKATRIAGELAGLKVEKLINEPTAAALAYGLHQKEGETSFLIFDLGGGTFDI 176 P + +R+A R + + AG + LI EP AAA+ GL E S ++ D+GGGT ++ Sbjct: 115 PVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGS-MVVDIGGGTTEV 173 Query: 177 SILELFDGVMEVRASAGDNFLGGEDFDRALLDHFVSAHQGDSNFPARALIEPSLRREAER 236 +++ L V + +GG+ FD A++++ + +LI AER Sbjct: 174 AVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNY--------GSLIG---EATAER 217 Query: 237 VRKALG----QDEFADFVLRHADREW----RRTITQEQVAELYAPLLARLRAPIERALRD 288 ++ +G DE + +R + T+ ++ E L + + + AL Sbjct: 218 IKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQ 277 Query: 289 AKIR-VADLDE--ILLVGGTTRMPLIRKLAAGMFGRFPSITLNPDEVVAQGAAI 339 +D+ E ++L GG + + +L G + +P VA+G Sbjct: 278 CPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVARGGGK 331
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 83.7 bits (207), Expect = 7e-19 Identities = 26/121 (21%), Positives = 55/121 (45%), Gaps = 2/121 (1%) Query: 670 VLMVEDNQDIGTYTRPMLEQLGFQVVWVSSGSEALQELSGNPESFQVVFSDIAMPGMSGL 729 +L+ +D+ I T L + G+ V S+ + + ++ +V +D+ MP + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA--GDGDLVVTDVVMPDENAF 63 Query: 730 ELYAEIETRYPWMPVVLTTGYSTEFAQFAQDESHRFDLLQKPYALEDLATLLHKAASRRT 789 +L I+ P +PV++ + +T E +D L KP+ L +L ++ +A + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 790 E 790 Sbjct: 124 R 124
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 432 bits (1112), Expect = e-151 Identities = 172/478 (35%), Positives = 244/478 (51%), Gaps = 51/478 (10%) Query: 4 SVIVVDDEAPIRQAVEQWLTLSGFTVQVFARAEECLAELPEHFPGVVLTDVRMPGISGLE 63 +++V DD+A IR + Q L+ +G+ V++ + A + +V+TDV MP + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 LLARLQVIDKDLPVILLTGHGDVPMAVEAMREGAYDFLEKPFSPETLISNLRRALEKRQL 123 LL R++ DLPV++++ A++A +GAYD+L KPF LI + RAL + + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK- 123 Query: 124 ILENRRLHEQADARTRLDATLLGMSPSLQTLRHHVLELSQLSVNVIIRGETGSGKELVAR 183 R + + ++ L+G S ++Q + + L Q + ++I GE+G+GKELVAR Sbjct: 124 -----RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178 Query: 184 CLHDFGPRASKPFVALNCAAIPEHLFEAELFGHESGAFTGAQGKRIGRLEYADGGTVFLD 243 LHD+G R + PFVA+N AAIP L E+ELFGHE GAFTGAQ + GR E A+GGT+FLD Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238 Query: 244 EIESMPMAQQVKLLRVLQDKRLERLGSNQSIDVDLRIIAATKPDLLEEARAGRFREDLAY 303 EI MPM Q +LLRVLQ +G I D+RI+AAT DL + G FREDL Y Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298 Query: 304 RLNVAELHLPALRERREDIPLLFNHFARAAAERMGREAPVVSAARLSQLLSHDWPGNVRE 363 RLNV L LP LR+R EDIP L HF + A + G + L + +H WPGNVRE Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRE 357 Query: 364 LANAAERQAL-----GLTRPDVETH----------------------------------- 383 L N R +TR +E Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417 Query: 384 ----AEPTGQSLAAQQEAFEAQCLRASLSRHKGDIKAVLHELQLPRRTLNEKMQRHGL 437 A P E + A+L+ +G+ L L R TL +K++ G+ Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>BCTLIPOCALIN#Bacterial lipocalin signature. Length = 171 Score = 33.8 bits (77), Expect = 5e-04 Identities = 15/43 (34%), Positives = 24/43 (55%), Gaps = 3/43 (6%) Query: 248 TRYNNLLAESQTAQKEAKEVTRKLEELATLAGLDNNRMIWVQQ 290 T Y LL+ + T ++ + K E++ G D NR+I+VQQ Sbjct: 131 TEYLWLLSRTPTVERGILD---KFIEMSKERGFDTNRLIYVQQ 170
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 41.4 bits (97), Expect = 4e-06 Identities = 36/187 (19%), Positives = 71/187 (37%), Gaps = 9/187 (4%) Query: 44 LTPIAQDLGISQGQAGQAISISGFFAVLTSLLNTPLTGRFDRKKVLLSFSFLLLLSGMTV 103 L IA D + + + + L+ + K++LL F ++ G + Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL-FGIIINCFGSVI 95 Query: 104 TFAPNGVVFMT--GRALLGVSIGGFWSMSTATVMRLVPKDSVAKGLALINGGNALAATVA 161 F + + R + G F ++ V R +PK++ K LI A+ V Sbjct: 96 GFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVG 155 Query: 162 APLGSFLGQYIGWRGAFFLVIPLAVLAFAWQWLSLPAMSSPQNTRATNPFKLLRNSQVAI 221 +G + YI W ++ L+IP+ + + L + R F + +++ Sbjct: 156 PAIGGMIAHYIHW--SYLLLIPMITIITVPFLMKLL----KKEVRIKGHFDIKGIILMSV 209 Query: 222 GMLAIML 228 G++ ML Sbjct: 210 GIVFFML 216
>SALSPVAPROT#Salmonella virulence plasmid 28.1kDa A protein signature. Length = 255 Score = 28.6 bits (63), Expect = 0.010 Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 2/44 (4%) Query: 63 DNQTSRDFVALLPLDLALE--DYASTEKISTLSRKLIIEGAPSG 104 DN T+ D + +L L+ DY E ++T +R+L I P G Sbjct: 199 DNSTATDLTSFYQTNLGLKTADYTPFEALNTFARQLAITVPPGG 242
>PF06580#Sensor histidine kinase Length = 349 Score = 30.2 bits (68), Expect = 0.016 Identities = 14/70 (20%), Positives = 28/70 (40%) Query: 334 LELSFDCSEAAREVNVDFSALDIALHNLITNAVNFSPAGGQITVGLSFTAHHFELTVDDQ 393 L+ + A +V V + + N I + + P GG+I + + L V++ Sbjct: 240 LQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENT 299 Query: 394 GPGIDEQERE 403 G + +E Sbjct: 300 GSLALKNTKE 309
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 89.5 bits (222), Expect = 5e-23 Identities = 40/156 (25%), Positives = 78/156 (50%), Gaps = 5/156 (3%) Query: 2 RLLLIEDDAALGEGIHQALSREGYTVDWIRDGSSALHALLSETFDLAILDLGLPRLDGFE 61 +L+ +DDAA+ ++QALSR GY V + ++ + + DL + D+ +P + F+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VLRRLRHSGSAVPVMILTARDSTEDRITGLDTGADDYLVKPFDVSELKARLRALLRRSAG 121 +L R++ + +PV++++A+++ I + GA DYL KPFD++EL + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 RAKVLIEHAG-----ISLDPGTQQVSYHHEPVALTP 152 R L + + + Q++ + T Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTD 160
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 32.2 bits (73), Expect = 7e-04 Identities = 14/49 (28%), Positives = 22/49 (44%) Query: 134 LAFGDSDKAGELLQKALKINPDGIDPLYFWGDHQYRQGKYAEARDALNK 182 LA K G + +I+ D ++ LY +QY+ GKY +A Sbjct: 13 LAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQA 61
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 87.0 bits (215), Expect = 2e-22 Identities = 49/187 (26%), Positives = 84/187 (44%), Gaps = 10/187 (5%) Query: 8 VVLTGASGGIGLAIAEALCSHGAQVLAVSRNGQPL------RSLLAAYPDNLHWVEADLC 61 +TGA+ GIG A+A L S GA + AV N + L A + + AD+ Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF---PADVR 67 Query: 62 SEEGRKQVVAR-AQATTGVNLLINAAGANHFAMLEQLSTDDINAMLMINLHAPILLTRAM 120 ++ AR + +++L+N AG ++ LS ++ A +N +R++ Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 121 LPLLRNTEQAMVVNVGSTYGSIGHAGYATYCASKFALRGFSEALRRELADTHVGVLYVAP 180 + + +V VGS + A Y +SK A F++ L ELA+ ++ V+P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 181 RATRTTM 187 +T T M Sbjct: 188 GSTETDM 194
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 77.6 bits (191), Expect = 2e-18 Identities = 32/145 (22%), Positives = 63/145 (43%), Gaps = 1/145 (0%) Query: 25 VLIVEDDQRLAQLTCDYLQNNGLSVRIEGNGALAAARIIQEQPDLVILDLMLPGEDGFSI 84 +L+ +DD + + L G VRI N A I DLV+ D+++P E+ F + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 85 CRKVRDRYDG-PILMLTARTDDTDHIQGLDTGADDFVCKPVHPRVLLARIHALLRRSEAP 143 +++ P+L+++A+ I+ + GA D++ KP L+ I L + Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125 Query: 144 QVPAAELRRLVFGPLVVDNALREAW 168 + + + A++E + Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIY 150
>PF06580#Sensor histidine kinase Length = 349 Score = 36.0 bits (83), Expect = 3e-04 Identities = 20/107 (18%), Positives = 35/107 (32%), Gaps = 25/107 (23%) Query: 431 LQNLVSNAMRHA------ETQVSISYRLGAQRCRIDVDDDGPGVPEDAWEQIFTPFMRID 484 +Q LV N ++H ++ + ++V++ G ++ E Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE---------- 309 Query: 485 DSRTRASGGHGLGLSIVR-RIINWHEGRALIGRSESLGGACFSLSWP 530 G GL VR R+ + A I SE G + P Sbjct: 310 --------STGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>2FE2SRDCTASE#Ferric iron reductase signature. Length = 262 Score = 66.2 bits (161), Expect = 5e-15 Identities = 48/222 (21%), Positives = 84/222 (37%), Gaps = 18/222 (8%) Query: 25 TPASEVVALPDLLHPERLDALLL----DLYG-TELMLSHLPVLVSQWAKYYFMQIIPAVL 79 + L P L +LL +Y +M+ L+S WA++Y ++P ++ Sbjct: 49 PAPLNAMTLAQWSSPNVLSSLLAVYSDHIYRNQPMMIRENKPLISLWAQWYIGLMVPPLM 108 Query: 80 SASLLEGRHYALHLDQVSLVLDKRKLPVGIRFVEEGSALAQAELDPFQRFAGLLDDNLQP 139 A L + + + + + +V+ P R L+ L P Sbjct: 109 LALLTQEKALDVSPEHFHAEFHETGRVACF-WVDVCEDKNATPHSPQHRMETLISQALVP 167 Query: 140 FITTLSRYGGLASSVLWSSAGDALETCLTE----LAAGSHASLAAGFALLAERKRPDGRL 195 + L G + ++WS+ G + LTE L + SL L E+ +G Sbjct: 168 VVQALEATGEINGKLIWSNTGYLINWYLTEMKQLLGEATVESLRHA--LFFEKTLTNGED 225 Query: 196 NPLYQTVTFIKQAEDAESRKQRKACCLSYQVEWVGRCEHCPL 237 NPL++TV + R+ CC Y++ V +C C L Sbjct: 226 NPLWRTVVL------RDGLLVRRTCCQRYRLPDVQQCGDCTL 261
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 96.6 bits (240), Expect = 9e-27 Identities = 50/186 (26%), Positives = 96/186 (51%), Gaps = 9/186 (4%) Query: 26 ASIQRSVAKRQTEFLAGRLCARDALRRLDGRQYIPGIGEDRAPIWPGEICGSITHSTGWA 85 ++ + KR+ E LAGR+ A ALR + G + +PG+G+ R P+WP + GSI+H A Sbjct: 37 DRLRSAGRKRKAEHLAGRIAAVHALREV-GVRTVPGMGDKRQPLWPDGLFGSISHCATTA 95 Query: 86 AAIVAHQQQWRGLGLDTEHLLSHDRASRLAGEILTANELADMANGPDDQVAQRVTLTFSI 145 A+++ Q +G+D E ++S A+ LA I+ ++E + +TL FS Sbjct: 96 LAVISRQ----RIGIDIEKIMSQHTATELAPSIIDSDERQILQASLLPFPLA-LTLAFSA 150 Query: 146 KEALFKALYPIVQQRFYFEHAELLEWSQDGSARLRLLIDLSSEWHHGKELEGQFSVQDDH 205 KE+++KA + F A++ + L LL ++ + + ++ +D+ Sbjct: 151 KESVYKA-FSDRVTLPGFNSAKVTSLTA-THISLHLLPAFAAT-MAERTVRTEWFQRDNS 207 Query: 206 LLSLIA 211 +++L++ Sbjct: 208 VITLVS 213
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 84.1 bits (208), Expect = 4e-21 Identities = 32/120 (26%), Positives = 57/120 (47%), Gaps = 1/120 (0%) Query: 2 KLLVVEDEALLRHHLRTRLTEAGHVVEAVANAEEALYQVGQFNHDLAVIDLGLPGIGGLD 61 +LV +D+A +R L L+ AG+ V +NA + + DL V D+ +P D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 LIRQLRTLGKSFPILILTARGNWQDKVEGLAAGADDYVVKPFQFEE-LDARLNALLRRSS 120 L+ +++ P+L+++A+ + ++ GA DY+ KPF E + AL Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124
>PF06872#EspG protein Length = 398 Score = 27.4 bits (60), Expect = 0.019 Identities = 13/28 (46%), Positives = 15/28 (53%) Query: 69 GLLTRTVFAEVPPRVEYEITEKARGLGP 96 G+LT EVPP V+ E E AR L Sbjct: 359 GMLTNRTSYEVPPGVKCEPNEMARMLKA 386
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 76.2 bits (187), Expect = 3e-19 Identities = 37/202 (18%), Positives = 77/202 (38%), Gaps = 14/202 (6%) Query: 16 RRRIPKGDLRKVDIIKAALVIFARDGFAGASLSNIAKVAGISQVGLLHHFPNKLALLQAV 75 R+ + + I+ AL +F++ G + SL IAK AG+++ + HF +K L + Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62 Query: 76 LDHRDQYISTRLQDAEQ---VATLEGFVAFLRFIMRFSIEDASVSQALMIINTESLSVT- 131 + + I + + L L ++ ++ + + II + V Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGE 122 Query: 132 ----HPAHRWFCERSHIVHSHLQAQLKLLVQAGEVREDIDVKQVSIELASMMDGMQIQWL 187 A R C + ++ LK ++A + D+ ++ +I + + G+ WL Sbjct: 123 MAVVQQAQRNLCLE---SYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179 Query: 188 RSRADVD---IEGAFNRFLDRM 206 + D + L M Sbjct: 180 FAPQSFDLKKEARDYVAILLEM 201
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 42.0 bits (98), Expect = 9e-06 Identities = 34/163 (20%), Positives = 67/163 (41%), Gaps = 28/163 (17%) Query: 399 LASNTTNVDYISQMSLNPKSSVWYQPGSDKQAISNTGVKAEYYGNTNLSGDPVATRIEPG 458 L S+T N++ + Q + ++ + ++ S+ G+ Y+ + N V T G Sbjct: 17 LVSSTGNLE-VIQAEVKQENRL-----LNESESSSQGLLGYYFSDLNFQAPMVVTSSTTG 70 Query: 459 VNLDWITSSNATDNGTSTVSGFNPAAGAFSARFTGKIKPTITGPHVFKVRADGAYKLWIN 518 D S+ +N S F SA ++G IK + + F AD +W++ Sbjct: 71 ---DLSIPSSELENIPSENQYFQ------SAIWSGFIKVKKSDEYTFATSADNHVTMWVD 121 Query: 519 DELVAEDEGGQVSFDLIPVVPRTVKTASLKAGSEYNVRLEYRR 561 D+ ++I + K L+ G Y ++++Y+R Sbjct: 122 DQ------------EVINKASNSNKI-RLEKGRLYQIKIQYQR 151
>2FE2SRDCTASE#Ferric iron reductase signature. Length = 262 Score = 25.0 bits (54), Expect = 0.035 Identities = 9/22 (40%), Positives = 13/22 (59%) Query: 27 QPPSPATSDALRAQVDAKRQHL 48 QP P + A+RA + R+HL Sbjct: 19 QPQDPTLAQAVRATIAKHREHL 40
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 78.2 bits (192), Expect = 4e-19 Identities = 45/180 (25%), Positives = 79/180 (43%), Gaps = 10/180 (5%) Query: 3 KTVLITGASSGFGLLLATRLHDQGFEVIGTSRHPQNHA---------GRFPFKLLRLDVT 53 K ITGA+ G G +A L QG + +P+ R + DV Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFPADVR 67 Query: 54 DDASIQFFLEQLFTNIPRVDVLINNAGYMLTGIAEETPVEAAREQFETNFWGTVKVTNAL 113 D A+I ++ + +D+L+N AG + G+ E F N G + ++ Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 114 LPFMREKKGSQIITVSSIVGLIGPPNLSYYSASKHAVEGYFKSLRFELDPFDIRVSMVEP 173 +M +++ I+TV S + +++ Y++SK A + K L EL ++IR ++V P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 69.9 bits (171), Expect = 1e-14 Identities = 30/129 (23%), Positives = 55/129 (42%), Gaps = 2/129 (1%) Query: 565 TVLVVDDEPSVRMLVVEVLSTEGYHALEAADAQAGLEILQSDIHIDLLISDVGLPGGMNG 624 T+LV DD+ ++R ++ + LS GY ++A + + DL+++DV +P N Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG-DGDLVVTDVVMP-DENA 62 Query: 625 REMADAARTKRPALPTLFITGYAETSALDGCHLQPKTQILTKPFGLEVLASRIKELISER 684 ++ + RP LP L ++ + L KPF L L I ++E Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122 Query: 685 SQEQGQPRA 693 + + Sbjct: 123 KRRPSKLED 131
>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature. Length = 372 Score = 27.8 bits (61), Expect = 0.049 Identities = 12/28 (42%), Positives = 16/28 (57%) Query: 64 RDFFPKARRLLDDFEDSILNIRELAERQ 91 DF +AR L D D +L +REL R+ Sbjct: 109 EDFLRQARSLFPDPSDLVLVLRELLRRK 136
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 505 bits (1302), Expect = e-179 Identities = 181/491 (36%), Positives = 257/491 (52%), Gaps = 16/491 (3%) Query: 5 IKILLIDDDSQRRRDLAVILNFLGEENLSCSS--QDWQQVVGSLGSTREVLCVLIGNVNA 62 IL+ DDD+ R L L+ G + S+ W+ + G +++ +V Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGD------LVVTDVVM 57 Query: 63 PG-SLQGLLKTIAAWDEFLPVLLLGESSSVELP-EDMRRRVLSALEMPPSYSKLLDSLHR 120 P + LL I LPVL++ ++ + + L P ++L+ + R Sbjct: 58 PDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117 Query: 121 AQVYREMYDQARERGRHREPNLFRSLVGTSRAIQHVRQMMQQVADTDASVLILGESGTGK 180 A + R + LVG S A+Q + +++ ++ TD +++I GESGTGK Sbjct: 118 ALAEP----KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGK 173 Query: 181 EVVARNLHYHSKRRDAPFVPVNCGAIPAELLESELFGHEKGAFTGAITSRAGRFELANGG 240 E+VAR LH + KRR+ PFV +N AIP +L+ESELFGHEKGAFTGA T GRFE A GG Sbjct: 174 ELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGG 233 Query: 241 TLFLDEIGDMPLPMQVKLLRVLQERTFERVGSNKTQSIDVRIIAATHKNLENMIEIGSFR 300 TLFLDEIGDMP+ Q +LLRVLQ+ + VG DVRI+AAT+K+L+ I G FR Sbjct: 234 TLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFR 293 Query: 301 EDLYYRLNVFPIEMAPLRERVEDIPLLMNELISRMEHEKRGSIRFNSAAIMSLCRHAWPG 360 EDLYYRLNV P+ + PLR+R EDIP L+ + + E E RF+ A+ + H WPG Sbjct: 294 EDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPG 353 Query: 361 NVRELANLVERMAIMHPYGVIGVAELPKKFRY-VDDEDEQMVDSLRSEIEERVAINGHTP 419 NVREL NLV R+ ++P VI + + R + D + + + A+ + Sbjct: 354 NVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMR 413 Query: 420 N-FASGALLPPEGLDLKDYLGGLEQGLIQQALDDANGIVARAAERLRIRRTTLVEKMRKY 478 FAS P L +E LI AL G +AA+ L + R TL +K+R+ Sbjct: 414 QYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL 473 Query: 479 GMSRREGDEQA 489 G+S A Sbjct: 474 GVSVYRSSRSA 484
>PF06580#Sensor histidine kinase Length = 349 Score = 36.4 bits (84), Expect = 2e-04 Identities = 17/103 (16%), Positives = 31/103 (30%), Gaps = 23/103 (22%) Query: 302 LVENAV----QASAGRTRLKVHVYSRGNTLRLCISDNGRGMDQAALARIGEPFFTTKTTG 357 LVEN + ++ + T+ L + + G + Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK------------NTKES 310 Query: 358 TGLGLAVVTAVTRAHQG---GVQYRSRVGRGTCAIVSLPLIPA 397 TG GL V + G ++ + G+ + LIP Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV----LIPG 349
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 492 bits (1269), Expect = e-174 Identities = 171/479 (35%), Positives = 254/479 (53%), Gaps = 20/479 (4%) Query: 3 IKVLLVEDDRSLREALGETLELAGYGYRAVGSAEEALLAVESEPFSLVISDVNMPGMDGH 62 +L+ +DD ++R L + L AGY R +A + + LV++DV MP + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 63 QLLALLRNRHPQLPVLLMTAHGAVERAVDAMRQGAADYLVKPFEP--------KALLALV 114 LL ++ P LPVL+M+A A+ A +GA DYL KPF+ +AL Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 115 ARHALGRLGPAEGEGPIAVEPASIQLLNLASRVAKSDSTVLISGESGTGKEVLARYIHQN 174 R + +G + A ++ + +R+ ++D T++I+GESGTGKE++AR +H Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183 Query: 175 SPRADKPFIAINCAAIPDNMLEATLFGHEKGSFTGAIAAQAGKFEQADGGTILLDEISEM 234 R + PF+AIN AAIP +++E+ LFGHEKG+FTGA G+FEQA+GGT+ LDEI +M Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243 Query: 235 PMGLQAKLLRVLQEREVERVGARKPIILDIRVLATTNRDLAGEVAAGRFREDLFYRLSVF 294 PM Q +LLRVLQ+ E VG R PI D+R++A TN+DL + G FREDL+YRL+V Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303 Query: 295 PLAWQALRQRTADILPLAERLLAKHVNKMKHAPVRLSAEAQACLVSYPWPGNVRELDNAV 354 PL LR R DI L + + K R EA + ++PWPGNVREL+N V Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLDVKRFDQEALELMKAHPWPGNVRELENLV 362 Query: 355 QRALILQQGGVIQAQDFCL--AGPVGSVPAPVVQAPAPHMPVTSLADTAVA------AGG 406 +R L VI + + P A + + ++ + + Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422 Query: 407 SESAGALGDDLRRREFQMIIDTLRAERGRRKEAAERLGISPRTLRYKLAQMRDAGMDVE 465 +G L E+ +I+ L A RG + +AA+ LG++ TLR K +R+ G+ V Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK---IRELGVSVY 478
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 80.5 bits (198), Expect = 2e-23 Identities = 40/92 (43%), Positives = 51/92 (55%) Query: 20 QMDAMAAPKPVSGPQEAGASSFADMLGQAVNKVASTQQASSQLANAFEIGKSGVDLTDVM 79 Q+ A A SFA L A+++++ TQ A+ A F +G+ GV L DVM Sbjct: 12 QLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVM 71 Query: 80 ISSQKASVSFQALTQVRNKLVQAYQDIMQMPV 111 QKASVS Q QVRNKLV AYQ++M M V Sbjct: 72 TDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 514 bits (1324), Expect = e-180 Identities = 196/575 (34%), Positives = 299/575 (52%), Gaps = 39/575 (6%) Query: 27 LENLSEMTMLRQIGLMVGLAASVAIGFAVVLWSQQPDYRPLYGSLAGMDSKQIMDTLTAA 86 LE L+ + +I L+V +A+VAI A+VLW++ PDYR L+ +L+ D I+ LT Sbjct: 13 LEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQM 72 Query: 87 NINYTVEPNSGALLVKADDVQRARIQLAQAGVVQNDANIGFEILDKDQGLGTSQFMEATR 146 NI Y SGA+ V AD V R++LAQ G+ + +GFE+LD+++ G SQF E Sbjct: 73 NIPYRFANGSGAIEVPADKVHELRLRLAQQGLPK-GGAVGFELLDQEK-FGISQFSEQVN 130 Query: 147 YRRGLEGELARTISALNNVKGARVHLAIPKSSVFVRDDRKPSASVLVELYAGRSLEPSQV 206 Y+R LEGELARTI L VK ARVHLA+PK S+FVR+ + PSASV V L GR+L+ Q+ Sbjct: 131 YQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQI 190 Query: 207 LAIINLVATSVPELTKSQITVVDQKGTLLSDQAENSELTMAGKQFDYSRRMEGMLTQRVQ 266 A+++LV+++V L +T+VDQ G LL+ Q+ S + Q ++ +E + +R++ Sbjct: 191 SAVVHLVSSAVAGLPPGNVTLVDQSGHLLT-QSNTSGRDLNDAQLKFANDVESRIQRRIE 249 Query: 267 NILQPILGSDRYKAEVSAVVDFSAVESTAESFNPDQPA----LRSEQSVNEQRSSSSGSQ 322 IL PI+G+ A+V+A +DF+ E T E ++P+ A LRS Q ++ + Sbjct: 250 AILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPG 309 Query: 323 GVPGALSNQPPGPATAPQTAGGAGAAAAAIAPGQPLLDANGQQIMDPATGQPALAPYPAD 382 GVPGALSNQP P AP P N Q +T + + P Sbjct: 310 GVPGALSNQPAPPNEAPIAT-------------PPTNQQNAQNTPQTSTSTNSNSAGPRS 356 Query: 383 KRVQSTKNFELDRSISHTKQQQGRLTRLSVAVVVDDMVKTNAANGEVTRAPWSAADLARF 442 + T N+E+DR+I HTK G + RLSVAVVV+ + P +A + + Sbjct: 357 TQRNETSNYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADG-----KPLPLTADQMKQI 411 Query: 443 TRLVQDAVGFDASRGDSVSVINVPFSSERAEVLPEASFYSQPWFWDIVKQAVGVIFILIL 502 L ++A+GF RGD+++V+N PFS+ E F+ Q F D + A + +L++ Sbjct: 412 EDLTREAMGFSDKRGDTLNVVNSPFSAV-DNTGGELPFWQQQSFIDQLLAAGRWLLVLVV 470 Query: 503 VF----GVLRPVLNNIT-TGKSRELAGFGGDAELGGMGGLDGELSNDRVSLGGPQSILLP 557 + +RP L K+ + ++ LS D + L Sbjct: 471 AWILWRKAVRPQLTRRVEEAKAAQEQAQVRQE---TEEAVEVRLSKDEQLQQRRANQRL- 526 Query: 558 SPTEGYDAQLNAIKSLVAEDPGRVAQVVKEWINTD 592 G + I+ + DP VA V+++W++ D Sbjct: 527 ----GAEVMSQRIREMSDNDPRVVALVIRQWMSND 557
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 299 bits (768), Expect = e-103 Identities = 105/332 (31%), Positives = 204/332 (61%) Query: 7 VAKLSKVEKAAVLLLSLGETDAAQVLRHMGPKEVQKVGVAMAQMRNVHREQVEEVMSEFV 66 V+ L+ +KAA+LL+S+G +++V +++ +E++ + +A++ + E + V+ EF Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71 Query: 67 DIVGDQTSLGVGSDGYIRKMLTQALGEDKANGLIDRILLGGNTSGLDSLKWMEPRAVADV 126 +++ Q + G Y R++L ++LG KA +I+ + + + ++ +P + + Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNF 131 Query: 127 IRFEHPQIQAIVVAYLDADQAGEVLGHFDHKVRLDIILRVSSLNTVQPAALKELNQILEK 186 I+ EHPQ A++++YLD +A +L +V+ ++ R++ ++ P ++E+ ++LEK Sbjct: 132 IQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEK 191 Query: 187 QFSGNANTSRTTLGGIKRAADIMNFLDSSIEGALMDSIREVDEDLSVQIEDLMFVFNNLS 246 + + ++ T+ GG+ +I+N D E +++S+ E D +L+ +I+ MFVF ++ Sbjct: 192 KLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIV 251 Query: 247 DVDDRGIQALLREVSSDVLVLALKGSDEAIKEKIFKNMSKRAAELLRDDLEAKGPVRVSD 306 +DDR IQ +LRE+ L ALK D ++EKIFKNMSKRAA +L++D+E GP R D Sbjct: 252 LLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKD 311 Query: 307 VETAQKEILTIARRMAEAGEIVLGGKGGEEMI 338 VE +Q++I+++ R++ E GEIV+ G E+++ Sbjct: 312 VEESQQKIVSLIRKLEEQGEIVISRGGEEDVL 343
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 56.0 bits (134), Expect = 2e-11 Identities = 48/201 (23%), Positives = 90/201 (44%), Gaps = 17/201 (8%) Query: 37 PEPEPDPVDEPAEMEEVPLEEVQPLTLEELESIRQEAWNEGF------------ATGEKE 84 P+ E P+ EP EE +EE +P ++L ++ +A +G+ G +E Sbjct: 18 PQAEFVPIVEP---EETIIEEAEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQE 74 Query: 85 GFHSTQLKVRQEAEVALAGKIASLEMLMASLLNPIAEQDTQIEKAVIHLVEHIARQVIQR 144 G + EA+ A A ++ L++ + D+ I ++ + ARQVI + Sbjct: 75 GLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQ 134 Query: 145 ELATDSGQIASVLRDALKLLPMGANNLRIFINPQDFALVKAM--RERHEETWKILEDDTL 202 D+ + ++ L+ P+ + ++ ++P D V M W++ D TL Sbjct: 135 TPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPTL 194 Query: 203 LPGGCRIETEHSRIDASIETR 223 PGGC++ + +DAS+ TR Sbjct: 195 HPGGCKVSADEGDLDASVATR 215
>FLGFLIJ#Flagellar FliJ protein signature. Length = 147 Score = 44.8 bits (105), Expect = 1e-08 Identities = 36/134 (26%), Positives = 69/134 (51%) Query: 9 LAPVVEMAEAAERSAAQRLGHFQGQVNLANNKLQELDQFRHDYQQQWLQRGSSGVSGQWL 68 LA + ++AE AA+ LG + A +L+ L ++++Y+ S+G++ Sbjct: 7 LATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAGITSNRW 66 Query: 69 LGYQRFLSQLDVAVAQQYKSLEWHKVNLDKARGAWQEAYARVEGLRKLVQRYMDEARKLE 128 + YQ+F+ L+ A+ Q + L +D A +W+E R++ + L +R A E Sbjct: 67 INYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQSTAALLAE 126 Query: 129 DKREQKLLDELSQR 142 ++ +QK +DE +QR Sbjct: 127 NRLDQKKMDEFAQR 140
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 72.9 bits (179), Expect = 9e-16 Identities = 27/131 (20%), Positives = 58/131 (44%), Gaps = 3/131 (2%) Query: 10 ILIADDSASDRLLLATIIARQGHRVVSAANGLEAVAIFSTERPHLILMDAMMPLMDGFEA 69 IL+ADD A+ R +L ++R G+ V +N + L++ D +MP + F+ Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 70 ARRIKLMAGESLVPIIFLTSLTEGEALARCLDAGGDDFVSKPYN-TQVLAAKINAMNRLR 128 RIK +P++ +++ + + G D++ KP++ T+++ A+ + Sbjct: 66 LPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 129 LLQETVLQQRD 139 + Sbjct: 124 RRPSKLEDDSQ 134
>FLGHOOKFLIK#Flagellar hook-length control protein signature. Length = 375 Score = 52.9 bits (126), Expect = 1e-09 Identities = 57/178 (32%), Positives = 88/178 (49%), Gaps = 12/178 (6%) Query: 294 AALSQAAQPARVAATPT-AAPLMSQPLAMHQSGWTEGVVDRVMYLSSQNLKSAEIKLEPA 352 AA S P + PT AAP++S PL H+ W + + + + Q +SAE++L P Sbjct: 209 AAASPLITPHQTQPLPTVAAPVLSAPLGSHE--WQQSLSQHISLFTRQGQQSAELRLHPQ 266 Query: 353 ELGRLDIRVNMAPDQQTQVTFMSAHVGVREALESQMSRLRDSFSQQGLGQVDVNVSDQSQ 412 +LG + I + + D Q Q+ +S H VR ALE+ + LR ++ G+ N+S +S Sbjct: 267 DLGEVQISLKV-DDNQAQIQMVSPHQHVRAALEAALPVLRTQLAESGIQLGQSNISGESF 325 Query: 413 QQAQQQAQEQASRAQRGGRSGGTGSGDSADDVSIADAAVPVSQPAARVIGTSEIDYYA 470 QQ A +Q Q+ R+ DD ++ VPVS RV G S +D +A Sbjct: 326 SGQQQAASQQ----QQSQRTANHEPLAGEDDDTL---PVPVSL-QGRVTGNSGVDIFA 375
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 250 bits (640), Expect = 9e-84 Identities = 93/323 (28%), Positives = 164/323 (50%), Gaps = 9/323 (2%) Query: 5 DLLSQDEIDALLHGVDDG---MVQTESPGEPGSVKSYDLTSQDRIVRGRMPTLEMINERF 61 ++LSQDEID LL + G + + + YD D+ + +M TL +++E F Sbjct: 3 EVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETF 62 Query: 62 ARYTRISMFNLLRRSADVAVGGVQVMKFGEYVHSLYVPTSLNLAKIKPLRGTALFILDAK 121 AR T S+ LR V V V + + E++ S+ P++L + + PL+G A+ +D Sbjct: 63 ARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPS 122 Query: 122 LVFKLVDNFFGGDGRHAKIEGREFTPTELRVVRMVLDQAFIDLKEAWQAIMEVNFEYINS 181 + F ++D FGG G+ AK++ R+ T E V+ V+ + +++E+W ++++ Sbjct: 123 ITFSIIDRLFGGTGQAAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181 Query: 182 EVNPAMANIVGPSEAVVISTFHIELDGGGGDLHVTMPYSMIEPIREMLDAGF--QSDLDD 239 E NP A IV PSE VV+ T ++ G ++ +PY IEPI L + F S Sbjct: 182 ETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRRS 241 Query: 240 QDERWVNALKEDVLDVNVPLSTTIAQRQLPLRDILHMRPGDVIPIE---LAESLVLRANG 296 +++ L++ + V++ + + +L +RDIL +R GD+I + + + VL Sbjct: 242 STTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIGN 301 Query: 297 VPSFKVKLGSHKGKMALQVIEPI 319 F + G K+A Q++E I Sbjct: 302 RKKFLCQPGVVGKKIAAQILERI 324
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 117 bits (294), Expect = 1e-36 Identities = 63/153 (41%), Positives = 92/153 (60%), Gaps = 19/153 (12%) Query: 1 MADENDMTSAEDQALADEWAAALGE-AGEGGQDDIDALLAADAGNATNRMTMEEFGSVPK 59 M+D N+ + AL D WA AL E + DA+ G + Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQ-------- 52 Query: 60 NNAPVTLDGPNLDVILDIPVSISMEVGSTDINIRNLLQLNQGSVIELDRLAGEPLDVLVN 119 ++D+I+DIPV +++E+G T + I+ LL+L QGSV+ LD LAGEPLD+L+N Sbjct: 53 ----------DIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILIN 102 Query: 120 GTLIAHGEVVVVNEKFGIRLTDVISPSERIKKL 152 G LIA GEVVVV +K+G+R+TD+I+PSER+++L Sbjct: 103 GYLIAQGEVVVVADKYGVRITDIITPSERMRRL 135
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 260 bits (666), Expect = 4e-90 Identities = 138/247 (55%), Positives = 181/247 (73%), Gaps = 4/247 (1%) Query: 1 MGALRFLVLLLLVMIAPVALAADPLSIPAITLSNGADGQQEYSVSLQILLIMTALSFIPA 60 M L + +LL +I P+A A +P IT G Q +S+ +Q L+ +T+L+FIPA Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQ----LPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPA 56 Query: 61 FVMLMTSFTRIIIVFSILRQALGLQQTPSNQILTGMALFLTMFIMAPVFDKVNQDALQPY 120 +++MTSFTRIIIVF +LR ALG P NQ+L G+ALFLT FIM+PV DK+ DA QP+ Sbjct: 57 ILLMMTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPF 116 Query: 121 LAEQLTAQDAVAKAQVPIKDFMLAQTRTSDLELFMRLSKRTDIPTPDAAPLNILVPAFVI 180 E+++ Q+A+ K P+++FML QTR +DL LF RL+ + P+A P+ IL+PA+V Sbjct: 117 SEEKISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVT 176 Query: 181 SELKTAFQIGFMIFIPFLIIDLVVASVLMAMGMMMLSPLIISLPFKIMLFVLVDGWALIV 240 SELKTAFQIGF IFIPFLIIDLV+ASVLMA+GMMM+ P I+LPFK+MLFVLVDGW L+V Sbjct: 177 SELKTAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLV 236 Query: 241 GTLAGSF 247 G+LA SF Sbjct: 237 GSLAQSF 243
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 50.9 bits (122), Expect = 2e-12 Identities = 24/74 (32%), Positives = 39/74 (52%) Query: 7 VDLFREALWLTTMLVAILVVPSLICGLLVAMFQAATQINEQTLSFLPRLIVMLITLIAIG 66 V +AL+L +L + + I GLLV +FQ TQ+ EQTL F +L+ + + L + Sbjct: 5 VFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLS 64 Query: 67 PWLLKVFMEYMLSL 80 W +V + Y + Sbjct: 65 GWYGEVLLSYGRQV 78
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 137 bits (348), Expect = 9e-42 Identities = 100/256 (39%), Positives = 150/256 (58%), Gaps = 2/256 (0%) Query: 4 MLALTDAQISTWVASFMLPLFRIIAVLMTMPIIGTTLVPRRVRLYLAVAMTVAVAPVLPA 63 ML +T Q +W+ + PL R++A++ T PI+ VP+RV+L LA+ +T A+AP LPA Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 Query: 64 MPTVQALDLSALLLIGEQIIIGAGMGLALQLFFHIFVVAGQIISTQMGMGFASMVDPTNG 123 AL L +QI+IG +G +Q F AG+II QMG+ FA+ VDP + Sbjct: 61 NDVPV-FSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119 Query: 124 VSSATIGQFFTMLVTLLFLAMNGHLVVLEILVESFTTMPVGSGLLVNNFWELATGLGW-V 182 ++ + + ML LLFL NGHL ++ +LV++F T+P+G L +N + T G + Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179 Query: 183 MGSALRLVLPAITALLVINIAFGVMTRAAPQLNIFSIGFPLTLVLGMVILWMTMGDILNQ 242 + L L LP IT LL +N+A G++ R APQL+IF IGFPLTL +G+ ++ M I Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPF 239 Query: 243 YQPLASQALQALRDMV 258 + L S+ L D++ Sbjct: 240 CEHLFSEIFNLLADII 255
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 317 bits (813), Expect = e-108 Identities = 93/346 (26%), Positives = 176/346 (50%), Gaps = 4/346 (1%) Query: 9 DKTEEPTEKKVRDSRADGQIARSKELTTLVVMLMGSGGALVFGGGIAQMMFELMRDNFTI 68 +KTE+PT KK+RD+R GQ+A+SKE+ + +++ S + + +LM Sbjct: 4 EKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLML---IP 60 Query: 69 TRETLMDQDYMGKALLSSGL-HALVVVLPFLIAMLMAALVGPIMLGGWLFATKSLAPKFS 127 ++ + ++ + L + P L + A+ ++ G+L + +++ P Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120 Query: 128 RMNPAAGLKRMFSPHALVELLKSLAKFLIILAVALVVLSKERNDLVAIAHEPLEQAIIHS 187 ++NP G KR+FS +LVE LKS+ K +++ + +++ L+ + +E Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180 Query: 188 LQVVGWSSFWMACGLMFVAAADVPFVLWEAHKKLLMTKQEVRDEHKNSEGSPEIKQRIRQ 247 Q++ G + ++ AD F ++ K+L M+K E++ E+K EGSPEIK + RQ Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240 Query: 248 LQREMSQRRMMASIPEADVIITNPTHFAVALKYDPEKGGAPMLLAKGTDLVALKIREIAA 307 +E+ R M ++ + V++ NPTH A+ + Y + P++ K TD +R+IA Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300 Query: 308 HNQILILESPGLARSIYYSTELEQEIPAGLYLAVAQVLAYVYQIRQ 353 + IL+ LAR++Y+ ++ IPA A A+VL ++ + Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQNI 346
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 89.5 bits (222), Expect = 4e-24 Identities = 32/123 (26%), Positives = 56/123 (45%), Gaps = 3/123 (2%) Query: 6 KILIVDDFSTMRRIIKNLLRDLGFTNTSEADDGLTALPMLQSGAFDFLVTDWNMPGMSGI 65 IL+ DD + +R ++ L G+ + T + +G D +VTD MP + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 66 DLLRQVRQDERLKSLPVLMVTAEAKREQIIEAAQAGVNGYVVKPFTAQALKEKIEKIFER 125 DLL +++ + LPVL+++A+ I+A++ G Y+ KPF L I + Sbjct: 64 DLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 126 VNS 128 Sbjct: 122 PKR 124
>PF06580#Sensor histidine kinase Length = 349 Score = 46.4 bits (110), Expect = 2e-07 Identities = 16/79 (20%), Positives = 32/79 (40%), Gaps = 10/79 (12%) Query: 460 ETDLDKNLVEALADPLV--HLVRNAVDHGIETPEEREATGKSRGGKVILAAEQEGDHILL 517 E ++ +++ P++ LV N + HGI +GGK++L ++ + L Sbjct: 243 ENQINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTL 294 Query: 518 SISDDGKGMDPNVLRSIAV 536 + + G N S Sbjct: 295 EVENTGSLALKNTKESTGT 313
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 60.2 bits (146), Expect = 5e-12 Identities = 37/165 (22%), Positives = 58/165 (35%), Gaps = 11/165 (6%) Query: 2 AVKVLVVDDSGFFRRRVTEILSSDPNIQVVGTATNGKEAIEQALALKPDVITMDYEMPMM 61 +LV DD R + + LS V +N A D++ D MP Sbjct: 3 GATILVADDDAAIRTVLNQALS-RAGYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 62 DGITAVRHIMQRIP-TPVLMFSSLTHEGARVTLDALDAGAVDFLPKNFE-----DISRNP 115 + + I + P PVL+ S+ + A + GA D+LPK F+ I Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAI--KASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 116 QKVKQLLCEKINSISRSNRRLSGASSASAAPVSSSAAPAARTAAP 160 + K+ S+ L G S+A + A +T Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQE-IYRVLARLMQTDLT 162
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 63.8 bits (155), Expect = 1e-13 Identities = 31/128 (24%), Positives = 54/128 (42%), Gaps = 16/128 (12%) Query: 134 LNSSLLFGSGDAMPSDKAFTIIEKVAGIVKRFDNP---IHVEGFTDDQPISTAQFPTNWE 190 L S +LF A + ++++ + D + V G+TD I + + N Sbjct: 217 LKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDR--IGSDAY--NQG 272 Query: 191 LSSARSASIVRMLAIDGVNPARLASVGYGEFQPIAPNTSATGR---------AKNRRVVL 241 LS R+ S+V L G+ ++++ G GE P+ NT + A +RRV + Sbjct: 273 LSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEI 332 Query: 242 VISRNLDV 249 + DV Sbjct: 333 EVKGIKDV 340
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 273 bits (699), Expect = 1e-82 Identities = 201/725 (27%), Positives = 303/725 (41%), Gaps = 93/725 (12%) Query: 18 DINGATVKGSSQPAIRVGSFGTPSSGSTLKVRSSEVTGAGV--GISAGVF----GDVDIR 71 D+ + V V + G P++ S L + G + G +AGV V ++ Sbjct: 195 DLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTLDGGHITGGRAAGVAAMQGAVVHLQ 254 Query: 72 ATKVYGHAWSPLGSPGYGISAAGPNMVIAEGSYIVGDESGIRIIDPASGQLLEKESVITI 131 + +P G G + G + G G + + S + + Sbjct: 255 RATIRR-GDAPAGGAVPGGAVPGGAVPGGFG------PGGFGPVLDGWYGVDVSGSSVEL 307 Query: 132 DNSTVEGIG-GASIRVYYRDLLDVRADITVQNSSKLLSGNGNL--LEVAESSIVDFKVDN 188 S VE GA+IRV + V ++ G A + + Sbjct: 308 AQSIVEAPELGAAIRVGRGARVTVSGGSLSAPHGNVIETGGARRFAPQAAPLSITLQAGA 367 Query: 189 STLGGNLVSDDTST-LNVTLQNNASLTGDII-------------------------NGNI 222 G L+ + +TL A GDI+ G Sbjct: 368 HAQGKALLYRVLPEPVKLTLTGGADAQGDIVATELPSIPGTSIGPLDVALASQARWTGAT 427 Query: 223 LAVKS----GGNWQMVGDNAIKSLSMEG-GSVNF---AEEG-FHTLSLNELSGQGSFGMR 273 AV S W M ++ + +L + GSV+F AE G F L++N L+G G F M Sbjct: 428 RAVDSLSIDNATWVMTDNSNVGALRLASDGSVDFQQPAEAGRFKVLTVNTLAGSGLFRMN 487 Query: 274 VDLDKGVGDLIDVNGQASGQFGLRVRNTGLEVVSSDMEPLKVVHT-EGGDAQFSL--LGG 330 V D G+ D + V ASGQ L VRN+G E S+ L +V T G A F+L G Sbjct: 488 VFADLGLSDKLVVMQDASGQHRLWVRNSGSEPASA--NTLLLVQTPLGSAATFTLANKDG 545 Query: 331 RVDLGAFSYQLKQQGN-DWFIVGEDKVISPS--------------------------TQS 363 +VD+G + Y+L GN W +VG +P + Sbjct: 546 KVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRE 605 Query: 364 ALALFNAA---------PTVWMGELSTLRTRMGEIRGTG-RGGSWMRAYGSRLNATTGDG 413 A NAA T+W E + L R+GE+R GG+W R + R G Sbjct: 606 LSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRLNPDAGGAWGRGFAQRQQLDNRAG 665 Query: 414 VDYRQQISGLSLGADAPIEVSHGQLLFGVLGGYSKSDLDLSRGTSGKIDSYYAGAYGTWL 473 + Q+++G LGAD + V+ G+ G L GY++ D + G DS + G Y T++ Sbjct: 666 RRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTDSVHVGGYATYI 725 Query: 474 ADDGYYLDGVLKLNRFRNKAKVAMSDASQVKGDYSNSAVGGWVEFGRHIKLADDYFLEPF 533 AD G+YLD L+ +R N KVA SD VKG Y VG +E GR AD +FLEP Sbjct: 726 ADSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRFTHADGWFLEPQ 785 Query: 534 AQLSSVVVEGKDYRMDNDLKAKNDRTHSLLGKVGTSAGRTIALKDGGVLQPYVRVALAQE 593 A+L+ G YR N L+ +++ S+LG++G G+ I L G +QPY++ ++ QE Sbjct: 786 AELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQE 845 Query: 594 FSRSNEVSVNDAKFDNSLFGSRAELGAGVSVSLSERLQVHADFDYMKGKHVEQPWGANVG 653 F + V N L G+RAELG G++ +L ++A ++Y KG + PW + G Sbjct: 846 FDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHAG 905 Query: 654 LSLAF 658 ++ Sbjct: 906 YRYSW 910
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 29.3 bits (65), Expect = 0.044 Identities = 37/180 (20%), Positives = 69/180 (38%), Gaps = 11/180 (6%) Query: 238 RMKTCLTRLQDTAEHLNHQARQSNSLANASSTGLERQRVETEQVA-AAINEMAATTQEVA 296 + T + + L QA+ + + G + EQ A A + Sbjct: 149 KTDTAKSVYDAATKKLT-QAQNKLQSLDPADPGYAQAEAAVEQAGKEATEAKEALDKATD 207 Query: 297 SHVNRAAEATQQANELTRRGRDIAGETREAIQRLSTSVGETGLTVTQLAKDSDEIGGVVD 356 + V +A +A + G A Q S GE ++ +A+ + + ++ Sbjct: 208 ATVKAGTDAKAKAEKADNILTKFQGTANAASQN-QVSQGEQD-NLSNVARLTMLMAMFIE 265 Query: 357 VIKGIADQT--NLLALNAAIEAARAGEMGRGFAVVADEVRQLAQRTAESTGQIHGLIAKL 414 ++ +++ N LAL A++ R EM + A +E R+ AE T +I G I K+ Sbjct: 266 IVGKNTEESLQNDLALFNALQEGRQAEMEKKSAEFQEETRK-----AEETNRIMGCIGKV 320
>PF06917#Periplasmic pectate lyase Length = 555 Score = 29.9 bits (67), Expect = 0.017 Identities = 15/36 (41%), Positives = 17/36 (47%) Query: 252 LMADGFTYKPRQPVDWMVCDIVEKPARNAALLETWL 287 L+ADGF QPV W D P N A + WL Sbjct: 41 LLADGFDVLTHQPVVWEFPDGHHTPISNFASQQNWL 76
>PF01206#SirA family protein Length = 76 Score = 92.1 bits (229), Expect = 1e-28 Identities = 28/72 (38%), Positives = 45/72 (62%) Query: 10 VDAVLDASGLFCPEPVMMLHQKVRDLPAGGLLKVIATDPSTTRDIPKFCVFLGHELIEQQ 69 D LDA+GL CP P++ + + + AG +L V+ATDP + +D F GHEL+EQ+ Sbjct: 4 FDQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQK 63 Query: 70 AGDGTFLYWIRK 81 DGT+ + +++ Sbjct: 64 EEDGTYHFRLKR 75
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 33.2 bits (75), Expect = 0.007 Identities = 40/193 (20%), Positives = 71/193 (36%), Gaps = 20/193 (10%) Query: 393 RREVLLELLERLKLRPKTVDSWLDFVDGKDRLAITIAPLDEGLLLEDPALALVAESPLFG 452 RRE+ L+ + K +V + LD D A +APLD + + +L +V + Sbjct: 75 RREIELQFRDAEKKLEASVQAELDKADAALGPAKNLAPLD----VINRSLTIVGNALQQK 130 Query: 453 QRVMQRRRREKRTDGGNNDAVIKNLTELREGAPVVHIDHGVGRYLGLATLEVENQVAEFL 512 + + +++ + G N + + E+ E A +G Y+ E+E A + Sbjct: 131 NQKLLLNQKKITSLGAKN-FLTRTAEEIGEQAVREGNINGPEAYMRFLDREMEGLTAAY- 188 Query: 513 MLAYAEDAKLYVPVANLHLIARYTGSDDEMAPLHRLGSETWQKAKRKAAEQVRDVAAELL 572 + KL+ I+ + L + A KA EQ A Sbjct: 189 ------NVKLFTEA-----ISSLQIRMNT---LTAAKASIEAAAANKAREQAAAEAKRKA 234 Query: 573 DIYARRAAREGYA 585 + AR+ A A Sbjct: 235 EEQARQQAAIRAA 247
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 39.1 bits (91), Expect = 3e-05 Identities = 35/168 (20%), Positives = 64/168 (38%), Gaps = 1/168 (0%) Query: 34 FVSYLFRTVNAVIYVDLQADLSLPASSLGLLTGVYFLTFAAAQIPLGVMLDRYGPRSVQA 93 F S L V V D+ D + P +S + + LTF+ G + D+ G + + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 94 PMLLFSVLGSIIFSLSSTETGLLI-GRGLIGLGVAGSLMSAIKACAIWLPVERLPLSTAC 152 ++ + GS+I + + LLI R + G G A + A ++P E + Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 153 LLSIGGLGAMASTTPLHLLLDWFTWREAFLILALLTFCVAGIIHFSVP 200 + SI +G ++ + W LI + V ++ Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKK 191
>PF06580#Sensor histidine kinase Length = 349 Score = 29.4 bits (66), Expect = 0.022 Identities = 21/133 (15%), Positives = 43/133 (32%), Gaps = 14/133 (10%) Query: 191 SRIFTSVKRSVSIVGDLLDFTRTQLGSG----IPVRRRVDDLAQACEAMVEEARAYHPDR 246 + I ++ ++ L + R L + + + ++ ++ A DR Sbjct: 184 ALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELT----VVDSYLQLASIQFEDR 239 Query: 247 SIVLLSEPRLAASFDRSRMEQVISNLIGNAIKHGDAGRA----VTVTLTDEQGVACLSVH 302 M ++ L+ N IKHG A + + T + G L V Sbjct: 240 LQFENQINPAIMDVQVPPM--LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVE 297 Query: 303 NEGAPIDEGARAG 315 N G+ + + Sbjct: 298 NTGSLALKNTKES 310
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 34.0 bits (78), Expect = 9e-04 Identities = 38/135 (28%), Positives = 59/135 (43%), Gaps = 14/135 (10%) Query: 34 AIAKAFFPSDSAFASLMLSLATFGAGFLMRPLGAIFLGAYIDRHGRRKGLIVTLAMMAMG 93 + + S+ A + LA + LM+ A LGA DR GRR L+V+LA A+ Sbjct: 30 GLLRDLVHSNDVTAHYGILLALYA---LMQFACAPVLGALSDRFGRRPVLLVSLAGAAV- 85 Query: 94 TLLIACVPGYATLGVAAPLLVL-LGRLLQGFSAGVELGGVSVYLAEISTPGRKGFFVSWQ 152 YA + A L VL +GR++ G + G Y+A+I+ + + Sbjct: 86 --------DYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFM 136 Query: 153 SASQQAAVVFAGLLG 167 SA +V +LG Sbjct: 137 SACFGFGMVAGPVLG 151 Score = 29.8 bits (67), Expect = 0.024 Identities = 11/33 (33%), Positives = 19/33 (57%) Query: 276 CVGVSNFIWLPIMGSFSDRIGRKPLLIAATVLA 308 + F P++G+ SDR GR+P+L+ + A Sbjct: 51 LYALMQFACAPVLGALSDRFGRRPVLLVSLAGA 83
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 78.7 bits (194), Expect = 4e-19 Identities = 34/117 (29%), Positives = 63/117 (53%) Query: 2 KLLIVEDQSRTGQFLRQGLNEAGFDTEWVADGSAGQQRALSGDHALLILDVMLPDCDGWE 61 +L+ +D + L Q L+ AG+D ++ + + +GD L++ DV++PD + ++ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 ILESVRAAGLDTPVLFLTARDAIEDRVHGLELGADDYLVKPFAFSELLARVRTLLRR 118 +L ++ A D PVL ++A++ + E GA DYL KPF +EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 135 bits (341), Expect = 5e-39 Identities = 77/310 (24%), Positives = 120/310 (38%), Gaps = 81/310 (26%) Query: 47 FKNDGNLFGGSVGYFLTDDVEL--RLGYDEVHNVRSDSGKNIKGSNTALDALYHFNNPGD 104 +K G +GY +TDD+++ RLG R+D+ N+ G N Sbjct: 93 YKAQGVQLTAKLGYPITDDLDIYTRLG---GMVWRADTKSNVYGKN-------------- 135 Query: 105 MLRPYLSAGFSDQSIGQDARGGRNGSTFANVGGGAKLYFTDNFYARAGVEAQYNIDQGDT 164 D + GG + + + +T+N + D G Sbjct: 136 ----------HDTGVSPVFAGGVEYAITPEIATRLEYQWTNNIGDAHTIG--TRPDNGML 183 Query: 165 EWAPSVGIGVNFGGGS--KKVEAAPAPVAEVCSDSDNDGVCDNVDKCPDTPANVTVDADG 222 S+G+ FG G V APAP EV + Sbjct: 184 ----SLGVSYRFGQGEAAPVVAPAPAPAPEVQTKH------------------------- 214 Query: 223 CPAVAEVVRVELDVKFDFDKSVVKPSSYGDIKNLADFMQQY--PQTSTTVEGHTDSVGPD 280 ++ DV F+F+K+ +KP + L + S V G+TD +G D Sbjct: 215 -------FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSD 267 Query: 281 AYNQKLSERRANAVKQVLVNQYGVGASRVNSVGYGESRPVADNATESGR---------AV 331 AYNQ LSERRA +V L+++ G+ A ++++ G GES PV N ++ + A Sbjct: 268 AYNQGLSERRAQSVVDYLISK-GIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAP 326 Query: 332 NRRVEAEVEA 341 +RRVE EV+ Sbjct: 327 DRRVEIEVKG 336
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 40.5 bits (94), Expect = 2e-05 Identities = 37/116 (31%), Positives = 54/116 (46%), Gaps = 9/116 (7%) Query: 358 LATRLLRATGLLHRRNIIHRDIKPENLLLGD-DGELRLLDFGLAFCPGLSAANAEDLPG- 415 +A RLL T L + ++H DIKP N++ GE ++D GL + + E G Sbjct: 250 IAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPVVIDL------GLHSRSGEQPKGF 303 Query: 416 TPSYIAPE-AFNGAEPDPQQDLYAVGVTLYYLLTGQYPYGEIEAFQHRRFGTPIPA 470 T S+ APE + D++ V TL + + G EI+ Q RF T PA Sbjct: 304 TESFKAPELGVGNLGASEKSDVFLVVSTLLHCIEGFEKNPEIKPNQGLRFITSEPA 359
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 57.6 bits (139), Expect = 4e-11 Identities = 86/459 (18%), Positives = 158/459 (34%), Gaps = 77/459 (16%) Query: 1 MDTSFWKAG--HRPTLFAAFLYFDLSFMVWYLLGPMAVQIATDLHLTTQQRGLMVATPIL 58 M+TS+ ++ H L + S + +L IA D + + +L Sbjct: 1 MNTSYSQSNLRHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFML 60 Query: 59 AGAILRFFMGLLADQLSPKTAGIIGQVIVIGALLTAWQLGIRSYEQVLLLGVFLGMAGAS 118 +I G L+DQL K + G +I + + +G + +++ G A+ Sbjct: 61 TFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGF-VGHSFFSLLIMARFIQGAGAAA 119 Query: 119 F-AVALPLASQWYPPQHQGKAMG-IAGAGNSGTVLAALIAPVLAASFGWGNVFGLALIPL 176 F A+ + + +++ P +++GKA G I G + I ++A W + LIP+ Sbjct: 120 FPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL---LIPM 176 Query: 177 VLTLIAFTLMARNAPERSKPKSTADYLKAL------------GDRDSWWFMFFYSVTFGG 224 + + LM + + + K D + S F+ ++F Sbjct: 177 ITIITVPFLM-KLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLI 235 Query: 225 FI------------------------------------GLASALPGYFNDQYGLSPITAG 248 F+ G S +P D + LS G Sbjct: 236 FVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIG 295 Query: 249 YYT--AACVFGGSLMRPLGGALADRFGGIRTLTAMYAVAAIGIAAVGFNLPSS-WAALAL 305 + + +GG L DR G + L ++ F L ++ W + Sbjct: 296 SVIIFPGTMSVI-IFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTII 354 Query: 306 FVAAMLGLGAGNGAVFQLVPQRFR-KEIGVMTGLI------GMAGGIG--GFLLAAGL-- 354 V + GL + +V + +E G L+ GI G LL+ L Sbjct: 355 IVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLD 414 Query: 355 -----GSIKQNTGDYQLGLWLFASLAVLAWFGLMNVKRR 388 + Q+T Y L LF+ + V++W +NV + Sbjct: 415 QRLLPMEVDQSTYLYSNLLLLFSGIIVISWLVTLNVYKH 453
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 44.1 bits (104), Expect = 1e-07 Identities = 25/135 (18%), Positives = 57/135 (42%), Gaps = 3/135 (2%) Query: 3 RILLINDTPKKVGRLRTALIEAGFEVIDESGFIIDLPARVDAVRPDVILIDTESPGRDVM 62 IL+ +D L AL AG++V S L + A D+++ D P + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSN-AATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 63 EQVVLVSRDQPR-PIVMFTDEHDPGVMRQAIKSGVSAYIVEGIQAQRLQPILDVAMARFE 121 + + + + +P P+++ + ++ +A + G Y+ + L I+ A+A + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 122 SDQALRAQLHARDQQ 136 + + + ++D Sbjct: 124 RRPS-KLEDDSQDGM 137
>PF05616#Neisseria meningitidis TspB protein Length = 501 Score = 28.9 bits (64), Expect = 0.049 Identities = 14/29 (48%), Positives = 18/29 (62%), Gaps = 2/29 (6%) Query: 431 NTVRAIAGFSRDSNGNTWAVVAILNDPRP 459 N V+ +A F RDS GNT V ++ PRP Sbjct: 287 NPVQVVATFGRDSQGNTTVDVQVI--PRP 313
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 62.2 bits (151), Expect = 7e-12 Identities = 37/161 (22%), Positives = 58/161 (36%), Gaps = 10/161 (6%) Query: 966 HILIVDDHPANRLLLCEQLGFLGHHCEVAENGALGLECWLQNRFDLVVADCNMPVMNGYD 1025 IL+ DD A R +L + L G+ + N A DLVV D MP N +D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 1026 LTRAIRAQEQSRDSQPCTVWGFTANAQQEEVQRCRDAGMDDCLFKPISLSLLSDRLALLS 1085 L I+ + A + + G D L KP L+ L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKA-----SEKGAYDYLPKPFDLTELIGIIGRAL 119 Query: 1086 PLTRSTPAFNPGSVSR---LTGDRPEM--VKRLLSELLRSN 1121 + P+ L G M + R+L+ L++++ Sbjct: 120 AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTD 160
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 74.5 bits (183), Expect = 9e-18 Identities = 28/117 (23%), Positives = 55/117 (47%), Gaps = 1/117 (0%) Query: 7 SVFIIDDHPVVRLAVRMLLENENYEVVGETDNGVDAMQMVRECMPDLIILDISIPKLDGL 66 ++ + DD +R + L Y+V T N + + DL++ D+ +P + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 67 EVLARFNTMGLPSKILVLTSQTPKLFAIRCMQSGAAGYVCKQEDLSELLSSVKAVLS 123 ++L R +LV+++Q + AI+ + GA Y+ K DL+EL+ + L+ Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 48.7 bits (116), Expect = 8e-10 Identities = 20/100 (20%), Positives = 34/100 (34%), Gaps = 8/100 (8%) Query: 7 RILLVEDHPFQLIATQILLNNQGYFLLTPVLTASEAMAAMER-SPEPYDLILCDQRLPDL 65 IL+ +D L+ GY V S A + DL++ D +PD Sbjct: 5 TILVADDDAAIRTVLNQALSRAGY----DVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 66 DGLDLIEKAWKRGLIRHAVLLSGLAAQQLLDLEQLAIQLG 105 + DL+ + K +++S A + G Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNT---FMTAIKASEKG 97
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 57.6 bits (139), Expect = 5e-11 Identities = 49/301 (16%), Positives = 106/301 (35%), Gaps = 15/301 (4%) Query: 32 LLGVLLAVLVAGINEGVTRIAMADIRGAMFIGADEATWLVAAYSATSVAAMAFAPWFAVS 91 L+ + + + +NE V +++ DI W+ A+ T A + Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75 Query: 92 LSLRRFTLGAITAFIVLGLLCPFAPNYPSLLVL-RILQGLAAGCLPPMLMTVALRFLPPH 150 L ++R L I ++ ++ SLL++ R +QG A P ++M V R++P Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135 Query: 151 IKLYGLAGYALTATFGPSLGTPLAALWTEHFNWQWTFWQVIPPCLVAMIAIAHGIPQDPL 210 + G +G + + + +W + +IP + + + + + Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEV 193 Query: 211 RLERFRTFNWRGVLLGLPAIAALVIGILQGNRLDWFESTLICWLLGGGLVLLVAFMVNEW 270 R F+ +G++L I ++ + S LI +L + F+ + Sbjct: 194 R--IKGHFDIKGIILMSVGIVFFMLFTTSYSI-----SFLIVSVLSFLI-----FVKHIR 241 Query: 271 FTPVPFFKLQLLAGRNLSHALLTLGGVLIVLTAVASIPSSYLAQVHGYRPLQTAPLMLIV 330 PF L +L G + + S+ + VH + +++ Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301 Query: 331 A 331 Sbjct: 302 G 302
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 128 bits (323), Expect = 2e-35 Identities = 50/366 (13%), Positives = 104/366 (28%), Gaps = 81/366 (22%) Query: 46 VVAPKVSGFISQVLVEDNQPVKAGQLLAVID----------------------------- 76 + P + + +++V++ + V+ G +L + Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157 Query: 77 -----------------------DRDVQTALASAEAGVATAAAELEQVTALLQRQTAVID 113 + +V + + +T + Q L ++ A Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217 Query: 114 QARAALTASTAAVRFAEQERDRYEHLAGAGAGTVQNAQQARNRIDTANANHASASASLVA 173 A + R + D + L A + N+ A + L Sbjct: 218 TVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQ 277 Query: 174 ERKQVD--ILTARQHS-------------AEAGLKHARAARDQAQLQVSYTHIVAPIDGV 218 ++ + + + + + + + I AP+ Sbjct: 278 IESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVK 337 Query: 219 VGERAVR-VGNYVNPGSKLLSVVPLADAYVV-GNFQETQLTHVSVGQSVEVRVDTYPDE- 275 V + V G V L+ +VP D V Q + ++VGQ+ ++V+ +P Sbjct: 338 VQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTR 397 Query: 276 --VLKAHVQSIAPATGVTFAAVRPDNATGNFTKVVQRIPVKIVLDPGQPLAARLRVGMSV 333 L V++I D G V+ I + + + L GM+V Sbjct: 398 YGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGNKNI--PLSSGMAV 448 Query: 334 DASIDT 339 A I T Sbjct: 449 TAEIKT 454
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 32.2 bits (73), Expect = 0.006 Identities = 67/307 (21%), Positives = 129/307 (42%), Gaps = 29/307 (9%) Query: 10 AAVVFLFAAVIA--VPLAKRLKLGAVIGYLAA-GVVIGPSVLGLIGDTESVSHISELGVV 66 AA L V+A +P R K +IG + A G +GP++ G+I +H + Sbjct: 118 AAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMI------AHYIHWSYL 171 Query: 67 LLLFIIGLELSPKRLWVMRKAVFGVGTAQVLLTGLVIGAVALVAFGQSMNTAIVLGLGLA 126 LL+ +I + P + +++K V G + G+++ +V +V F + L Sbjct: 172 LLIPMITIITVPFLMKLLKKEVRIKGHFD--IKGIILMSVGIVFFMLFTTSY--SISFLI 227 Query: 127 LSSTAFGL----QSLAERKELNSPHGRT-AFAILLFQ----DIAAIPLIALVPFLAGGDH 177 +S +F + ++ G+ F I + +++VP++ H Sbjct: 228 VSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVH 287 Query: 178 ATSTQESINHGLRVLGSIAIVV---VGGRYLLR--PVFRIVAKTRIQEVSTATALLVVIG 232 ST E I + G++++++ +GG + R P++ + VS TA ++ Sbjct: 288 QLSTAE-IGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLET 346 Query: 233 TAWLMELVGVSMALGAFLAGLLLADSEYRHELEAQIEPFKGLLLGLFFISVGMG-ANIGL 291 T+W M ++ V + G +++ + + LL F+S G G A +G Sbjct: 347 TSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGG 406 Query: 292 LFSAPLV 298 L S PL+ Sbjct: 407 LLSIPLL 413
>TOXICSSTOXIN#Staphylococcal toxic shock syndrome toxin signature. Length = 234 Score = 30.8 bits (69), Expect = 0.005 Identities = 12/36 (33%), Positives = 16/36 (44%) Query: 124 YETYTTTNATRLITMDDGSRVEMDLGTELTYANYKD 159 Y + T ITM+DGS + DL + Y K Sbjct: 184 YRSSDKTGGYWKITMNDGSTYQSDLSKKFEYNTEKP 219
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1131 bits (2928), Expect = 0.0 Identities = 507/1030 (49%), Positives = 703/1030 (68%), Gaps = 8/1030 (0%) Query: 1 MSLFFIRRPNFAWVLALFILLAGLMALPALPVAQYPVVAPPQITITATYPGASAKVLVDS 60 M+ FFIRRP FAWVLA+ +++AG +A+ LPVAQYP +APP ++++A YPGA A+ + D+ Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTSVIEDELNGAKGMLYYESTSNSTGSAEINVTFNPGTNPDMAQVEVQNRIKKAEARLPQ 120 VT VIE +NG ++Y STS+S GS I +TF GT+PD+AQV+VQN+++ A LPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 PVLSQGLQVEQASSGFLMIFALSYTGDTANKDTVALADYAARNVNNEISRVNGVGRLQFF 180 V QG+ VE++SS +LM+ +D ++DY A NV + +SR+NGVG +Q F Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDD--ISDYVASNVKDTLSRLNGVGDVQLF 178 Query: 181 AAEAAMRVWIDPQKLVGYGLSIDDVNAAIRAQNVQVPAGSFGSTPGSSLQELTATLAVKG 240 A+ AMR+W+D L Y L+ DV ++ QN Q+ AG G TP Q+L A++ + Sbjct: 179 GAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238 Query: 241 TLDNPEEFGRIVLRANQDGSTVHLEDVARLAVGSQDYNFESRLDGKRAVAGAIQLSPGAN 300 NPEEFG++ LR N DGS V L+DVAR+ +G ++YN +R++GK A I+L+ GAN Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298 Query: 301 AIQTVKAVKQRLDELSVNFPEGVEYSIPYDTSRFVDVAIDKVIYTLIEAMVLVFMVMFLF 360 A+ T KA+K +L EL FP+G++ PYDT+ FV ++I +V+ TL EA++LVF+VM+LF Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358 Query: 361 LQNIRYTLIPTIVVPVCLAGTLAIMYVLGFSVNMMTMFGMVLAIGILVDDAIVVVENVER 420 LQN+R TLIPTI VPV L GT AI+ G+S+N +TMFGMVLAIG+LVDDAIVVVENVER Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418 Query: 421 IMAEEGLSPPAATVKAMGQVSGAILGITLVLAAVFLPLAFMGGSVGVIYQQFSLSLAVSI 480 +M E+ L P AT K+M Q+ GA++GI +VL+AVF+P+AF GGS G IY+QFS+++ ++ Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478 Query: 481 LFSGFLALTFTPALCATLLKPIPKGHTE-KRGFFGGFNRLFGKLTDRYDRVNSSLIKRAG 539 S +AL TPALCATLLKP+ H E K GFFG FN F + Y ++ G Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538 Query: 540 RYMLLYVGIVGLLGFFYLRLPESFVPVEDQGYLIIDVQLPPGATRLRTDATAKLLEDYML 599 RY+L+Y IV + +LRLP SF+P EDQG + +QLP GAT+ RT + DY L Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598 Query: 600 --SRETTDAVTMLLGFSFSGMGENAGLAFPTLKDWSER-GDGQSAADEAAAFNQHFAGLS 656 + ++V + GFSFSG +NAG+AF +LK W ER GD SA + Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658 Query: 657 DGTVMAVTPPPIEGLGTSGGFALRLQDRAGLGREALLAARNELLGKANGNP-KILYAMME 715 DG V+ P I LGT+ GF L D+AGLG +AL ARN+LLG A +P ++ Sbjct: 659 DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 716 GLAEAPQLRLNIDREKARTMGVSFESISSALATAFGSSVISDFANAGRQQRVVVQAEQGA 775 GL + Q +L +D+EKA+ +GVS I+ ++TA G + ++DF + GR +++ VQA+ Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 776 RMTPESVLQLYVPNSSGTLVPLGAFVSTHWEEGPVQIARYNGYPAFKISGDAPPGVSTGE 835 RM PE V +LYV +++G +VP AF ++HW G ++ RYNG P+ +I G+A PG S+G+ Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838 Query: 836 AMAEIERIVSQLPPGIGYEWTGLSYQEKVASGQATGLFALALLVVFLLLVALYESWAIPL 895 AMA +E + S+LP GIGY+WTG+SYQE+++ QA L A++ +VVFL L ALYESW+IP+ Sbjct: 839 AMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPV 898 Query: 896 VVMLIVPVGALGAVLAVTAVGLPNDVYFKVGLITIIGLAAKNAILIVEFAKELWE-QGHS 954 VML+VP+G +G +LA T NDVYF VGL+T IGL+AKNAILIVEFAK+L E +G Sbjct: 899 SVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKG 958 Query: 955 LRDAAMEAARLRFRPIVMTSLAFILGVVPLTLATGAGAASQRAIGTGVIGGMLSATLLGV 1014 + +A + A R+R RPI+MTSLAFILGV+PL ++ GAG+ +Q A+G GV+GGM+SATLL + Sbjct: 959 VVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAI 1018 Query: 1015 VLVPIFFVWV 1024 VP+FFV + Sbjct: 1019 FFVPVFFVVI 1028 Score = 85.3 bits (211), Expect = 7e-19 Identities = 64/328 (19%), Positives = 124/328 (37%), Gaps = 17/328 (5%) Query: 722 QLRLNIDREKARTMGVSFESISSALATA---FGSSVISDFANAGRQQRVVVQAEQGARMT 778 +R+ +D + ++ + + L + + QQ Q Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242 Query: 779 PESVLQLYVP-NSSGTLVPLG--AFVSTHWEEGPVQIARYNGYPAFKISGDAPPGVSTGE 835 PE ++ + NS G++V L A V E IAR NG PA + G + + Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGEN-YNVIARINGKPAAGLGIKLATGANALD 301 Query: 836 A----MAEIERIVSQLPPGIGYEW---TGLSYQEKVASGQATGLFALALLVVFLLLVALY 888 A++ + P G+ + T Q + T A+ L VFL++ Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIML--VFLVMYLFL 359 Query: 889 ESWAIPLVVMLIVPVGALGAVLAVTAVGLPNDVYFKVGLITIIGLAAKNAILIVE-FAKE 947 ++ L+ + VPV LG + A G + G++ IGL +AI++VE + Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 948 LWEQGHSLRDAAMEAARLRFRPIVMTSLAFILGVVPLTLATGAGAASQRAIGTGVIGGML 1007 + E ++A ++ +V ++ +P+ G+ A R ++ M Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 1008 SATLLGVVLVPIFFVWVLSVLRRKPHQQ 1035 + L+ ++L P +L + + H+ Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHEN 507
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 37.5 bits (87), Expect = 7e-05 Identities = 22/128 (17%), Positives = 51/128 (39%), Gaps = 4/128 (3%) Query: 103 KAALSKAQGDLARTEATLFEARATVKRYESLVEIEAVSRQTYDTARATLQNAVAAKRSAQ 162 + +A +L ++ L + + + + E + V++ + L+ Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKE--EYQLVTQLFKNEILDKLRQTTDNIGLLT 315 Query: 163 ADVETAQLNLGFATVRAPISGRIGRALV-TEGALVGQAETTLMATIQQLEPVFVDFTQPV 221 ++ + + +RAP+S ++ + V TEG +V AE TLM + + + + V Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQN 374 Query: 222 ADALHMRA 229 D + Sbjct: 375 KDIGFINV 382 Score = 34.8 bits (80), Expect = 5e-04 Identities = 21/132 (15%), Positives = 38/132 (28%), Gaps = 6/132 (4%) Query: 58 PGRVEPV-RVAQVRARVAGIVLTRNFEEGADVKAGAVLFQIDPAPFKAALSKAQGDLART 116 G++ R +++ IV +EG V+ G VL ++ +A K Q L + Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQA 146 Query: 117 EATLFEARATVKRYESLVEIEAVSRQTYDTARATLQNAVAAKRSAQADVET-----AQLN 171 + + E E + + + + T Q Sbjct: 147 RLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKE 206 Query: 172 LGFATVRAPISG 183 L RA Sbjct: 207 LNLDKKRAERLT 218
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 33.4 bits (76), Expect = 3e-04 Identities = 17/140 (12%), Positives = 47/140 (33%), Gaps = 7/140 (5%) Query: 25 TLKDIAQAAGVSKATLNRFCGTRANLIEMLLNHASDLMNQMIAEADLEHAPHVEALQRLV 84 +L +IA+AAGV++ + +++L + + + ++ E + ++ R + Sbjct: 33 SLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLREI 92 Query: 85 DNHLIHREMLVFLVFQWRPDTMDESCGGRRWLPYSDALDAFFLRGQ-------REGLFRI 137 H++ + + A L + + Sbjct: 93 LIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAK 152 Query: 138 DMSAAVLTETFASLLFGLVD 157 + A ++T A ++ G + Sbjct: 153 MLPADLMTRRAAIIMRGYIS 172
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 90.0 bits (223), Expect = 1e-23 Identities = 55/217 (25%), Positives = 84/217 (38%), Gaps = 5/217 (2%) Query: 56 GVFALVLHGAVIYWLSQKPTPALPVVPPEIPPMTIEFSRSAPPVQAPPPPPEPVVQPVTE 115 G L ++ + + P PA P+ + P +E ++ P P PEP +P+ E Sbjct: 26 GAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPE 85 Query: 116 PPPPVEDELAVKPPPPKPIPKPKPQPPKPVVKPVAKPVESTPAPPVPAPPVAAPAPPAPP 175 PP + P PKP PKP + +P AP + Sbjct: 86 PPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAA 145 Query: 176 APKPVTPASASAGYLRNPAPEYPSLAMRRGWEGTVMLRVHVLASGKPGEIQIQKSSGRES 235 KPVT ++ L P+YP+ A EG V ++ V G+ +QI + Sbjct: 146 TSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANM 205 Query: 236 LDDAALAAVKRWSFVPAKQGDVAQDGWVSVPIDFKIN 272 + A++RW + P K G + V I FKIN Sbjct: 206 FEREVKNAMRRWRYEPGKPG-----SGIVVNILFKIN 237
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 391 bits (1006), Expect = e-128 Identities = 194/672 (28%), Positives = 314/672 (46%), Gaps = 81/672 (12%) Query: 100 NFVDADIQAVVRALSRSTGQQFLVDPRVTGTLTLVSEGQVPAVQAYDMLLSALRMQGFSV 159 +F DIQ + +S++ + ++DP V GT+T+ S + Q Y LS L + GF+V Sbjct: 33 SFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLSVLDVYGFAV 92 Query: 160 VDVG-GVAHVVPEADAKLLGGPIYSPDKPA-GNGMLTRTFRLQYENAVNLIPVLRPIVSP 217 +++ GV VV DAK P+ S P G+ ++TR L A +L P+LR + Sbjct: 93 INMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLLRQLNDN 152 Query: 218 NNPINA--YPGNNTIVVTDYAENLTRVAQIIDGIDTPSAIDTDVVSVRNGIAVDIAGMVS 275 + Y +N +++T A + R+ I++ +D V + A D+ +V+ Sbjct: 153 AGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVPLSWASAADVVKLVT 212 Query: 276 EL---LDTQGGDPTQKISVIGDPRSNAIIIRAGSPERTELARNLIYKLDNAQSNPSNLHV 332 EL + +V+ D R+NA+++ G P + +I +LD Q+ N V Sbjct: 213 ELNKDTSKSALPGSMVANVVADERTNAVLVS-GEPNSRQRIIAMIKQLDRQQATQGNTKV 271 Query: 333 VYLRNAQAGKLAQALRGLLTGESDSGATDTARAMLSGMGGMSNKNEGQGTTSTSSGSGSA 392 +YL+ A+A L + L G+ + S K + + Sbjct: 272 IYLKYAKASDLVEVLTGISS------------------TMQSEKQAAKPVAALDKN---- 309 Query: 393 SGTGSNGYGQAGGTTANAGVSGQQGDQSTAFTASGVTIQADATTNTLLISAPEPLYRNLR 452 + I+A TN L+++A + +L Sbjct: 310 -----------------------------------IIIKAHGQTNALIVTAAPDVMNDLE 334 Query: 453 EVIDQLDQRRAQVVIESLIVEVSEDDANEFGVQWQTGNLSGSGVFGGANLGGSGLVSNPA 512 VI QLD RR QV++E++I EV + D G+QW N +G+ N G +S Sbjct: 335 RVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN---AGMTQFTNSGLP--ISTAI 389 Query: 513 GGTTIDVLPPGLNVGVVKGTVTIPGIG---EVLDLKVLARALKSKGGSNVLSTPNLLTLD 569 G ++ + + GI + +L AL S +++L+TP+++TLD Sbjct: 390 AGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLD 449 Query: 570 NEAASIFVGQTIPFVTGSYVTGGGGTSNNPFQTVEREEVGLKLNVRPQISEGGTVKLDIY 629 N A+ VGQ +P +TGS T G N F TVER+ VG+KL V+PQI+EG +V L+I Sbjct: 450 NMEATFNVGQEVPVLTGSQTTSGD----NIFNTVERKTVGIKLKVKPQINEGDSVLLEIE 505 Query: 630 QEVSSVDQRASV---DAGTVTNKRAIDTSILLDDGQIMVLGGLLQDGYNQSNDAVPWLSN 686 QEVSSV AS D G N R ++ ++L+ G+ +V+GGLL + + D VP L + Sbjct: 506 QEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGD 565 Query: 687 IPVLGVLFRNDRRQMTKTNLMVFLRPYIIRDSGAGRSITLNRYEYMRRAQG-SLQPERNW 745 IPV+G LFR+ ++++K NLM+F+RP +IRD R + +Y AQ E N Sbjct: 566 IPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENND 625 Query: 746 ALPDMQGPQLPP 757 A+ + ++ P Sbjct: 626 AMLNQDLLEIYP 637
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 371 bits (953), Expect = e-129 Identities = 184/406 (45%), Positives = 251/406 (61%), Gaps = 4/406 (0%) Query: 1 MNRYRYEAANAQGRIESGHLEADSRNAAFGVLRSRGLTALQVEPETVRAGSGGRGLFSAR 60 M +Y Y+A +AQG+ G EADS A +LR RGL L V+ G S R Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60 Query: 61 ----LSDTDLASVTRQLASLLGAGLPLDEALTATLEQAERKHIIQTLGAVRSDVRSGMRL 116 LS +DLA +TRQLA+L+ A +PL+EAL A +Q+E+ H+ Q + AVRS V G L Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120 Query: 117 AEALAARPGDFPDIYRALIAAGEESGDLAHVMERLADYIEDRNGLRSKILTAFIYPGVVG 176 A+A+ PG F +Y A++AAGE SG L V+ RLADY E R +RS+I A IYP V+ Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180 Query: 177 LVSVAIVIFLLSYVVPQVVSAFSQARQDLPGLTLAMLNASDFIRGWGWLCLLGLTISVWS 236 +V++A+V LLS VVP+VV F +Q LP T ++ SD +R +G LL L + Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240 Query: 237 WRIYLRNPAARLSWHARVLRLPLIGRFVLGLNTARFASTLAILGGAGVPLLRALDAARQT 296 +R+ LR R+S+H R+L LPLIGR GLNTAR+A TL+IL + VPLL+A+ + Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300 Query: 297 LSNDRLSESVTEATAKVREGVNLAAALRVEKVFPPLLIHLIASGEKTGALPPMLERAAQT 356 +SND ++ AT VREGV+L AL +FPP++ H+IASGE++G L MLERAA Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360 Query: 357 LSRDIERRAMGMTALLEPLMIVVMGGVVLVIVLAVLLPIIEINQLV 402 R+ + L EPL++V M VVL IVLA+L PI+++N L+ Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 74.1 bits (182), Expect = 2e-17 Identities = 28/138 (20%), Positives = 61/138 (44%), Gaps = 3/138 (2%) Query: 7 SVLIVEDNLALAANMFDYLEACGHTPDAAPDGKAATRLLLENTYDVIVLDWMMPRMDGIA 66 ++L+ +D+ A+ + L G+ + R + D++V D +MP + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 67 LLHHLRHEMGSPTPVMLLTAKDQLEDKLEGFESGADDYIVKPLALPELEIRLRVLAARSQ 126 LL ++ + PV++++A++ ++ E GA DY+ KP L E+ + A ++ Sbjct: 65 LLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT--ELIGIIGRALAE 121 Query: 127 PRSNVRQTLEVGDLRFDL 144 P+ + + L Sbjct: 122 PKRRPSKLEDDSQDGMPL 139
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 59.6 bits (144), Expect = 3e-13 Identities = 25/139 (17%), Positives = 49/139 (35%), Gaps = 6/139 (4%) Query: 14 PAQARSRATVDAIIQATTYILTKVGWDGLTTNAIAERAGVNIGSLYQFFPNKEAIIAELQ 73 + ++ T I+ + ++ G + IA+ AGV G++Y F +K + +E+ Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63 Query: 74 RRHVAATRTDLVNVLQDLPESP--TLRGALTMIVE-MLVAEHR---VAPAVHKAIHEELP 127 + + P P LR L ++E + E R + HK Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEM 123 Query: 128 LTVRRLDTDRDTLQRRFAE 146 V++ + E Sbjct: 124 AVVQQAQRNLCLESYDRIE 142
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 36.7 bits (85), Expect = 1e-04 Identities = 27/200 (13%), Positives = 65/200 (32%), Gaps = 21/200 (10%) Query: 67 ASGSVAPWEEAVIGAQVANLRLTDLRANVGDRVKRGQLLATFDADLLSADEERLKANWLQ 126 A+G + + + N + ++ G+ V++G +L A AD + +++ LQ Sbjct: 86 ANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQ 145 Query: 127 ADANRKRALLLKGT--------GGMSDQDVLQYETQADVTRAQLT-----------STQL 167 A + R +L + + D+ Q ++ +V R Q Sbjct: 146 ARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQK 205 Query: 168 QLRYARVIAPDDGVISARSATTGAVYGNGQEL--FRLIRQGRLEWRGELNAGQMAQVQSG 225 +L + A V++ + L F + + + + + V++ Sbjct: 206 ELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAV 265 Query: 226 QRIDLQLPDGTSASASIREL 245 + + + I Sbjct: 266 NELRVYKSQLEQIESEILSA 285 Score = 31.7 bits (72), Expect = 0.004 Identities = 12/85 (14%), Positives = 30/85 (35%), Gaps = 2/85 (2%) Query: 150 QYETQADVTRAQLTSTQLQLRYARVIAPDDGVISARSATT-GAVYGNGQELFRLIRQG-R 207 Q + +L + + + + + AP + T G V + L ++ + Sbjct: 306 QTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365 Query: 208 LEWRGELNAGQMAQVQSGQRIDLQL 232 LE + + + GQ +++ Sbjct: 366 LEVTALVQNKDIGFINVGQNAIIKV 390
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 618 bits (1596), Expect = 0.0 Identities = 253/1041 (24%), Positives = 455/1041 (43%), Gaps = 55/1041 (5%) Query: 7 SIRNPIPSILLFILLSLAGVMGFRALPIANMPDVDLPTVVITLTQPGAAPAQLETEVARK 66 IR PI + +L I+L +AG + LP+A P + P V ++ PGA ++ V + Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64 Query: 67 VENSLATLSGIKHIT-TSIVDGLVTINVEFILEKQLSDALIETKDAVDRVRSDLPTDLEQ 125 +E ++ + + +++ TS G VTI + F A ++ ++ + LP +++Q Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124 Query: 126 PSISAVRVGGDDATLLYAVASTK--MDEEALSWFVDDTINKTILGVPGIGKFERVGGVQR 183 IS V ++ S ++ +S +V + T+ + G+G + G Q Sbjct: 125 QGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA-QY 182 Query: 184 QVLVEVDPSSLAAQGATAAEVSRALKNVEQESSGGRGQMGSA------EQAVRTIATVRQ 237 + + +D L T +V LK + + G+ A ++ + Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242 Query: 238 AAELNRLPVVLG-NGRRVNLDQVAVVKDTYADRTQIATLDGKPVVGFRLFRAKGFDETRV 296 E ++ + + +G V L VA V+ + IA ++GKP G + A G + Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302 Query: 297 AAGVISALDQLHA-ADSTLSFTKVSGTVDYTHEQYEGSMHMLYEGALLAVLVVWWFLRDW 355 A + + L +L + T + + L+E +L LV++ FL++ Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362 Query: 356 RATLISASALPLSVLPTFLVMNWLGYSLNTLTLLALAVIVGILVDDAIVEIENIERHSRM 415 RATLI A+P+ +L TF ++ GYS+NTLT+ + + +G+LVDDAIV +EN+ER Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422 Query: 416 GK-PIKQAAGDAVTEIALAVMATTMTLVVVFLPTAMMSGVPGLFFKQFGWTAVVAVLSSL 474 K P K+A ++++I A++ M L VF+P A G G ++QF T V A+ S+ Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482 Query: 475 LVARILTPMMAAYLLKTHPDKQEPADGALMT-----------RYLSAVRWCLKHRGLTLG 523 LVA ILTP + A LLK + G Y ++V L G L Sbjct: 483 LVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLL 542 Query: 524 ATLLVFVASIAMVPLLETGLIPASDKGYSNINVELPPGSSLEATRSTVEAVSRVI--KDI 581 L+ + + L + +P D+G ++LP G++ E T+ ++ V+ + Sbjct: 543 IYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEK 602 Query: 582 PGIEHVFSTVGVAQSAGHGQTQAAELRRATMTLVLSDRGTRAGQTD----IENRIRGVLH 637 +E VF+ G + S GQ Q A + L R G + + +R + L Sbjct: 603 ANVESVFTVNGFSFS---GQAQNA----GMAFVSLKPWEERNGDENSAEAVIHRAKMELG 655 Query: 638 GIPGARF---------SLGSGGLGEKMALILSSDDPAALKATAQALERELRDVPG-LANI 687 I LG+ + + + AL L P L ++ Sbjct: 656 KIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSV 715 Query: 688 TSTASLERPEIVVRPDARQAAERGVTTATIGETVRIATNGDFDSQMAKLNLDNRQISIQV 747 + + + D +A GV+ + I +T+ A G + + R + V Sbjct: 716 RPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDF---IDRGRVKKLYV 772 Query: 748 RIPQAARQDLETIADLRVRGRDG-LVPLSSVAQLSVESGPTQIDRYDRRRYANVSA-DLG 805 + R E + L VR +G +VP S+ G +++RY+ + Sbjct: 773 QADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP 832 Query: 806 HMPLGQALTIARSLPAIQSMPSSVRLIETGDAEIMAELMEGFGMAIIIGLVCVYVVLVLL 865 G A+ + +L +P+ + TG + + I V V++ L L Sbjct: 833 GTSSGDAMALMENL--ASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAAL 890 Query: 866 FSDFFQPLTILFAIPLSVGGAFVALLLTRGMLSLPSLIGLVMLMGIVTKNSILLVEYSVM 925 + + P++++ +PL + G +A L + ++GL+ +G+ KN+IL+VE++ Sbjct: 891 YESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKD 950 Query: 926 GIRKQGLSVADALINACHKRVRPIIMTTLAMIAGMMPIALGLGADASFRQPMAIAVIGGL 985 + K+G V +A + A R+RPI+MT+LA I G++P+A+ GA + + + I V+GG+ Sbjct: 951 LMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGM 1010 Query: 986 MTSTALSLLVVPVAFTYIDEL 1006 +++T L++ VPV F I Sbjct: 1011 VSATLLAIFFVPVFFVVIRRC 1031
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 215 bits (550), Expect = 3e-73 Identities = 84/217 (38%), Positives = 130/217 (59%), Gaps = 7/217 (3%) Query: 7 NLIEIILVVATIGLIPLAVVTLTGFMKISVVLFLIRNALGVQQTPPNLVLYGIALILSVY 66 N I +I ++A L+P + + T F+K S+V ++RNALG+QQ P N+ L G+AL+LS++ Sbjct: 3 NDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLSMF 62 Query: 67 VTTPLIGDMYRQVQGRDLSLQNVQQLEELGSALRPPLQAHLSKYANENERGFFVQATETI 126 V P++ D Y + D++ ++ L + + +L KY++ FF A Sbjct: 63 VMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQLKR 122 Query: 127 WSPEA-------RADLRDDDLVVLIPAFVSSELTRAFEIGFLLYIPFLVVDLLVSNVLMA 179 E + ++ + L+PA+ SE+ AF+IGF LY+PF+VVDL+VS+VL+A Sbjct: 123 QYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVLLA 182 Query: 180 MGMSMVSPTLISIPLKIFLFVALSGWSRLMHGLILSY 216 +GM M+SP IS P+K+ LFVAL GW+ L GLIL Y Sbjct: 183 LGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 56.3 bits (136), Expect = 2e-14 Identities = 30/76 (39%), Positives = 44/76 (57%) Query: 7 LSLMNQALMTVLLLSAPALAVAIVVGLSVGLLQALTQIQDQTLPQVVKLVGVLLVIVFVG 66 + N+AL VL+LS VA ++GL VGL Q +TQ+Q+QTLP +KL+GV L + + Sbjct: 5 VFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLS 64 Query: 67 PLLAGQVAELGNQVLD 82 + G QV+ Sbjct: 65 GWYGEVLLSYGRQVIF 80
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 130 bits (329), Expect = 7e-39 Identities = 56/261 (21%), Positives = 107/261 (40%), Gaps = 8/261 (3%) Query: 11 EIAYPVISSASLAASRAMGVVIITPAFNRLGLTGMIRGCVAVAISVPMILPVFTAFTSMP 70 E ++ R + ++ P + + ++ +A+ I+ + + + Sbjct: 7 EQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVF 66 Query: 71 EHSGFFLAGLMVKELLIGLLIGLLFGIPFWAAEVAGELIDLQRGSTMEQLVDPLGQGEAS 130 FF L V+++LIG+ +G F A AGE+I LQ G + VDP Sbjct: 67 S---FFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMP 123 Query: 131 VMATLFTVMLIALFFMSGGFILMVDGYYHSYQLWPVTEFTPLFSSAALMSILALLDQVMR 190 V+A + ++ + LF G + ++ ++ P+ +S A +++ + Sbjct: 124 VLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPI--GGEPLNSNAFLALTKAGSLIFL 181 Query: 191 IGVLMVAPLLVAMLITDLMLAYLSRMAPSLHIFDLSLPVKNLFFAVLMVVYISFLIPVMI 250 G+++ PL+ +L +L L L+RMAP L IF + P+ LM + + P Sbjct: 182 NGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCE 241 Query: 251 DQLAQFRGTVEVLKALASEAP 271 F +L + SE P Sbjct: 242 H---LFSEIFNLLADIISELP 259
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 245 bits (628), Expect = 2e-81 Identities = 101/339 (29%), Positives = 187/339 (55%), Gaps = 1/339 (0%) Query: 5 SEEKSQPATDKKLRDARKKGQVAKSQDLVSGVVILLCTLCIAVLLPRARAQVEALIDLTA 64 S EK++ T KK+RDARKKGQVAKS+++VS +I+ + + L L+ + A Sbjct: 2 SGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPA 61 Query: 65 NIYIEPFADVWPRLLDHAEQIVLGITLPVVAVTVAAVILTNIVTMRGVVFSVEPVKPDIK 124 PF+ ++D+ + P++ V I +++V G + S E +KPDIK Sbjct: 62 EQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVV-QYGFLISGEAIKPDIK 120 Query: 125 RIHPGEGFKRIFAMRNLIEFLKGLVKVLLLALAFYIVGRQALQALMESSRCGAGCIESTF 184 +I+P EG KRIF++++L+EFLK ++KV+LL++ +I+ + L L++ CG CI Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180 Query: 185 YLVLKPLVFTVLAAFLLVGAVDVLMQRWLFGRDMKMSRSEQKRERKDVDGDPLIKRERQR 244 +L+ L+ F+++ D + + + +++KMS+ E KRE K+++G P IK +R++ Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240 Query: 245 QRREMQALATKLGLGRASLMIGIGGNWVVGVRYVRGETPVPVVVCRGSPEESVQLLAQAA 304 +E+Q+ + + R+S+++ + +G+ Y RGETP+P+V + + + + A Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300 Query: 305 PLGIAVWADAGLAEQIAKRSVAGDPVPENTFQAVADALV 343 G+ + LA + ++ +P +A A+ L Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLR 339
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 69.5 bits (170), Expect = 4e-17 Identities = 32/120 (26%), Positives = 52/120 (43%), Gaps = 7/120 (5%) Query: 6 STILVVEDDTIVRMLIVDVLEELEYTVLEAEDAMTALEIIKDETRTIDLMMTDQGLPDLK 65 +TILV +DD +R ++ L Y V +A T I DL++TD +PD Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD--GDLVVTDVVMPDEN 61 Query: 66 GTELAKKARALRPELPVLFASGYSENIEVPAGM-----HSIGKPFSIDDLRDKVKSVLDQ 120 +L + + RP+LPVL S + + + KPF + +L + L + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 76.8 bits (189), Expect = 2e-16 Identities = 36/115 (31%), Positives = 53/115 (46%), Gaps = 3/115 (2%) Query: 1045 KILVVDDDVRNIFALTSALEHKGAVVEIARNGLEAIAKLNEVEDIDLVLMDVMMPEMDGY 1104 ILV DDD L AL G V I N + D DLV+ DV+MP+ + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA-GDGDLVVTDVVMPDENAF 63 Query: 1105 EATIEIRKDPRWRKLPIIAVTAKAMKDDQERCLQAGSNDYLAKPIDLDRLFSLIR 1159 + I+K LP++ ++A+ + + G+ DYL KP DL L +I Sbjct: 64 DLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116 Score = 67.5 bits (165), Expect = 1e-13 Identities = 30/127 (23%), Positives = 53/127 (41%), Gaps = 5/127 (3%) Query: 778 ILVIEDEVRFAQILFDLAHELGYYCLVAHAADDGFNLAARFTPDAILLDMRLPDHSGLTV 837 ILV +D+ +L GY + A + A D ++ D+ +PD + + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 838 LQRLKELAPTRHIPVHVISVE---DRQEAALHMGAIGYAVKPTTREELKDVFAKLEAKLT 894 L R+K+ P +PV V+S + A GA Y KP EL + + A+ Sbjct: 66 LPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 895 QKVKRIL 901 ++ ++ Sbjct: 124 RRPSKLE 130 Score = 62.5 bits (152), Expect = 5e-12 Identities = 17/81 (20%), Positives = 33/81 (40%), Gaps = 2/81 (2%) Query: 899 RILLVEDDALQRDSIARLIGDDDIEITAVGFAQEALDLLRDNIYDCMIIDLKLPDMLGDE 958 IL+ +DDA R + + + ++ A + D ++ D+ +PD + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 959 LLKRMSTEEICSFPPVIVYTG 979 LL R+ PV+V + Sbjct: 65 LLPRIKKAR--PDLPVLVMSA 83
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 70.2 bits (172), Expect = 3e-15 Identities = 33/169 (19%), Positives = 60/169 (35%), Gaps = 19/169 (11%) Query: 7 AKLLIVDDLPENLLALEALIKREDRLVFKALSADEALSLLLQHEFALAILDVQMPGMNGF 66 A +L+ DD L + R V +A + + L + DV MP N F Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 67 ELAELMRSTEKTKSIPIVFVSAAGRELNYAFKGYESGAVDFLHKPLDIHAVKSKVNVFVD 126 +L ++ +P++ +SA A K E GA D+L KP D+ Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQN-TFMTAIKASEKGAYDYLPKPFDL------------ 108 Query: 127 LFRQRKAMKMQVEELERSRQEQEALLKRLQSTQGELEHAIRMRDDFMSI 175 + + + L ++ L Q + + M++ + + Sbjct: 109 ----TELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVL 153
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 64.1 bits (156), Expect = 2e-15 Identities = 30/117 (25%), Positives = 48/117 (41%), Gaps = 10/117 (8%) Query: 9 VLIVEDEPLILMLLADYLSGVGYRVLQAENGEQAFEILATKPHLDLMITDYRLPGGISGV 68 +L+ +D+ I +L LS GY V N + +A DL++TD +P + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG-DGDLVVTDVVMPDE-NAF 63 Query: 69 QIAEPAVKLRPELKVIFISGYPAEIIDSGSPIAA-KAPI---LAKPFTMETLQSQIQ 121 + K RP+L V+ +S + I A + L KPF + L I Sbjct: 64 DLLPRIKKARPDLPVLVMSAQN----TFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 68.7 bits (168), Expect = 3e-14 Identities = 29/116 (25%), Positives = 53/116 (45%), Gaps = 1/116 (0%) Query: 556 DGETVLIVEDDPAVRALVSEVLSELGYAFIEAGDSLSAVPILESGQRIDLLISDVGLPGM 615 G T+L+ +DD A+R ++++ LS GY ++ + + +G DL+++DV +P Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDE 60 Query: 616 NGRQLAEIARQLRPELKVLFITGYAEHAAARSGFLDTGMQLITKPFAFDHLTSKVR 671 N L ++ RP+L VL ++ A + KPF L + Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116 Score = 44.8 bits (106), Expect = 1e-06 Identities = 23/116 (19%), Positives = 45/116 (38%), Gaps = 5/116 (4%) Query: 26 MILKEAGYPATVARDLNELVAELETGAGLVIVADEALRTVDITPLLDLLGQQPAWSDLPI 85 L AGY + + L + G G ++V D + + LL + + A DLP+ Sbjct: 21 QALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRI--KKARPDLPV 78 Query: 86 VLLTHHGGPEQNPSARMGSLLGNVTFLERPFHPVTLVSLVATAVRGRRRQYEARAR 141 ++++ A G +L +PF L+ ++ A+ +R+ Sbjct: 79 LVMSAQNTFMTAIKASE---KGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLED 131
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 37.9 bits (88), Expect = 6e-05 Identities = 24/151 (15%), Positives = 54/151 (35%), Gaps = 20/151 (13%) Query: 63 GFGGSAAAVPVRVAPVTQGDFPIYYKALGTVTATNTINVRSRVAGELVKLNFQEGQMVKA 122 + V + G + + ++ + ++ +EG+ V+ Sbjct: 70 IAFILSVLGQVEIVATANGKL---------THSGRSKEIKPIENSIVKEIIVKEGESVRK 120 Query: 123 GDLLAEIDP-------RSYQVALQQAEGTLATNQALLKNAQLDVQRYRGLYAE---DSIA 172 GD+L ++ Q +L QA Q L ++ +L+ L E +++ Sbjct: 121 GDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVS 180 Query: 173 KQTLDTAESLVNQYQGTIKTNQAAVAEAKLN 203 ++ + SL+ + T + NQ E L+ Sbjct: 181 EEEVLRLTSLIKEQFSTWQ-NQKYQKELNLD 210 Score = 36.7 bits (85), Expect = 2e-04 Identities = 23/123 (18%), Positives = 52/123 (42%), Gaps = 11/123 (8%) Query: 138 LQQAEGTLATNQALLKNAQLDVQRYRGLYAEDSIAKQTLDTAESLVNQYQGTIKTNQAAV 197 L+ + L ++ + +A+ + Q L+ + + K + Q I + Sbjct: 268 LRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK---------LRQTTDNIGLLTLEL 318 Query: 198 AEAKLNLDFTRIRAPIAGRV-GLKQLDVGNLVAANDTTALVVITQTQPISVNFTLPEKDL 256 A+ + + IRAP++ +V LK G +V +T +V++ + + V + KD+ Sbjct: 319 AKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTALVQNKDI 377 Query: 257 SSV 259 + Sbjct: 378 GFI 380
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 830 bits (2146), Expect = 0.0 Identities = 292/1037 (28%), Positives = 515/1037 (49%), Gaps = 28/1037 (2%) Query: 3 MSRLFILRPVATTLSMLAIVLAGLIAYGLLPVSALPQVDYPTIRVMTLYPGASPQVMTSA 62 M+ FI RP+ + + +++AG +A LPV+ P + P + V YPGA Q + Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 63 VTAPLERQFGQMPGLTQMASTS-SGGASVITLRFSLEINMDVAEQEVQAAINGATNLLPT 121 VT +E+ + L M+STS S G+ ITL F + D+A+ +VQ + AT LLP Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 122 DLPAPPVYNKVNPADTPVLTLAITS--KTMLLPKLNDLVDTRMAQKISQISGVGMVSIAG 179 ++ + + + ++ S ++D V + + +S+++GVG V + G Sbjct: 121 EVQQQGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179 Query: 180 GQRQAVRIKVNPEALAANSLNLADVRTLISASNVNQPKGNFDGPTRVS------MLDAND 233 Q +RI ++ + L L DV + N G G + + A Sbjct: 180 AQYA-MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238 Query: 234 QLKSPEEYANLIL-AYKDGAPLRLKDVAQIVDGAENERLAAWANRNQAVLLNIQRQPGAN 292 + K+PEE+ + L DG+ +RLKDVA++ G EN + A N A L I+ GAN Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298 Query: 293 VIDVVDRIKTLLPGITDNLPAGLDVTVLTDRTQTIRASVTDVQHELLIAIVLVVLVTFLF 352 +D IK L + P G+ V D T ++ S+ +V L AI+LV LV +LF Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358 Query: 353 LRRLSATIIPSIAVPLSLVGTFGVMYLAGFSINNLTLMALTIATGFVVDDAIVMLENISR 412 L+ + AT+IP+IAVP+ L+GTF ++ G+SIN LT+ + +A G +VDDAIV++EN+ R Sbjct: 359 LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVER 418 Query: 413 HI-EEGETPLQAALKGAKQIGFTLISLTLSLIAVLIPLLFMADVVGRLFREFAITLAVAI 471 + E+ P +A K QI L+ + + L AV IP+ F G ++R+F+IT+ A+ Sbjct: 419 VMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAM 478 Query: 472 LISLLVSLTLTPMMCARLLKREPRE--EEQSRFYRASGAWIDWLVHVYGGGLRWVLKHQP 529 +S+LV+L LTP +CA LLK E E + F+ D V+ Y + +L Sbjct: 479 ALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTG 538 Query: 530 LTLLVAIGTLGLTVLLYIIVPKGFFPVQDTGVIQGISEAPQSVSFKAMSERQQALADIIL 589 LL+ + V+L++ +P F P +D GV + + P + + + + D L Sbjct: 539 RYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYL 598 Query: 590 KDPS--VVSLSSYIGVDGDNATLNSGRFLINLKPHGERD---LTAAEIIQRIQPEVDKLS 644 K+ V S+ + G N+G ++LKP ER+ +A +I R + E+ K+ Sbjct: 599 KNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR 658 Query: 645 DIRLFMQPVQDLTIEDRVSRTQYQFSM---SSPDAELLSEWSVRLADALAQRP-ELTDVA 700 D F+ P I + + T + F + + + L++ +L AQ P L V Sbjct: 659 D--GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716 Query: 701 SDLQDKGLQVYLVIDRDAASRVGVSVANITDALYDAFGQRQISTIYTQASQYRVVLQSAS 760 + + Q L +D++ A +GVS+++I + A G ++ + ++ +Q+ + Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776 Query: 761 ASELGPEALEQIHVKTTDGAQVKLSSLARVEQRQAQLAIAHIGQFPAVMMSFNLAPNIAL 820 + PE +++++V++ +G V S+ + P++ + AP + Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSS 836 Query: 821 GEAVEVIEQVQKDIGMPIGVQTQFQGAAQAFQASLSSTLLLILAAVVTMYIVLGVLYESY 880 G+A+ ++E + +P G+ + G + + S + L+ + V +++ L LYES+ Sbjct: 837 GDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESW 894 Query: 881 IHPITILSTLPSAAVGALLALLISGNDLGMIAIIGIILLIGIVKKNAIMMIDFALDAERN 940 P++++ +P VG LLA + + ++G++ IG+ KNAI++++FA D Sbjct: 895 SIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEK 954 Query: 941 RGVDPETAIYEAALLRFRPILMTTLAALFGAIPLMLATGSGAELRQPLGLVMVGGLLLSQ 1000 G A A +R RPILMT+LA + G +PL ++ G+G+ + +G+ ++GG++ + Sbjct: 955 EGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSAT 1014 Query: 1001 ILTLFTTPVIYLYFDRL 1017 +L +F PV ++ R Sbjct: 1015 LLAIFFVPVFFVVIRRC 1031
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 802 bits (2072), Expect = 0.0 Identities = 293/1032 (28%), Positives = 514/1032 (49%), Gaps = 28/1032 (2%) Query: 7 FIRRPVATVLLSLAIMLLGAVSFRLLPVAPLPNMDFPVIVVSASLPGASPEIMASSVATP 66 FIRRP+ +L++ +M+ GA++ LPVA P + P + VSA+ PGA + + +V Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64 Query: 67 LERSLGTIAGVNTMTSNS-SQGTTRVILQFDLDRDINGAAREVQAAINASRNLLPSGMRS 125 +E+++ I + M+S S S G+ + L F D + A +VQ + + LLP ++ Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124 Query: 126 MPTYKKVNPSQAPIMVLSLTST--VLEKGQLYDLASTILSQSLSQVTGVGEVQIGGSSLP 183 S + +MV S + + D ++ + +LS++ GVG+VQ+ G+ Sbjct: 125 QGISV-EKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY- 182 Query: 184 AVRIELEPQLLSQYGISLDEVRTAITGSNVRRPKGSVEND------QHNWQVQANDQLET 237 A+RI L+ LL++Y ++ +V + N + G + Q N + A + + Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242 Query: 238 AKDYAPLIIRY-QDGATLRLKDVAKVSDAVENRYNSGFFNDDRAVLLVINRQAGANIIET 296 +++ + +R DG+ +RLKDVA+V EN N A L I GAN ++T Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302 Query: 297 VAQIKAQLPALQAVLPASVKLDIAMDRSPVITATLHEAEMTLLIAVVLVVLVVFLFLGSF 356 IKA+L LQ P +K+ D +P + ++HE TL A++LV LV++LFL + Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362 Query: 357 RASLIPTLAVPVSLVGTFALMYLCGFSLNNLSLMALILATGLVVDDAIVVLENISRHIH- 415 RA+LIPT+AVPV L+GTFA++ G+S+N L++ ++LA GL+VDDAIVV+EN+ R + Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422 Query: 416 NGLDPMKAAFLGAKEVGFTLLSMNVSLVAVFVSILFMGGLVESLFREFSITLSVSIVVSL 475 + L P +A ++ L+ + + L AVF+ + F GG +++R+FSIT+ ++ +S+ Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482 Query: 476 VVSLTLTPMLCARWLKPREAEQE---NAFQRWSERVNDRMVAGYDRSLGWVLRHPRLTLV 532 +V+L LTP LCA LKP AE F W D V Y S+G +L L+ Sbjct: 483 LVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLL 542 Query: 533 SLLITIVVNIALYVVVPKTFLPQQDTGQLMGFVRGDDGLSFKVMQPKMEIFRRAVLADP- 591 + + + L++ +P +FLP++D G + ++ G + + Q ++ L + Sbjct: 543 IYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEK 602 Query: 592 ----AVQSVAGFIGGSGGTNNAFMIVRLKPIGER---KLSAEKVVERLRKNLPHVPGGRL 644 +V +V GF N V LKP ER + SAE V+ R + L + G + Sbjct: 603 ANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFV 662 Query: 645 FLAPDQDLQLGGGREQTSSQYQYIVQSGDLEVLREWYPKIVA-ALKSLPQLTAIDAREGR 703 + G + G + L + +++ A + L ++ Sbjct: 663 IPFNMPAIVELGTATGFDFELIDQAGLG-HDALTQARNQLLGMAAQHPASLVSVRPNGLE 721 Query: 704 GAQQVTLIVNRDTAKRLGIDMNMVTAVLNNAYSQRQVSTIYDSLNQYQVVMEVNPKYAQD 763 Q L V+++ A+ LG+ ++ + ++ A V+ D ++ ++ + K+ Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781 Query: 764 PVTLEQVQVITADGQRVPLSSIAHYERSLENDRVSHDGQFASENISFDLAEGVSLDQATV 823 P ++++ V +A+G+ VP S+ + R+ S I + A G S A Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841 Query: 824 AIERSVAAIGLPSGIISKMAGTANAFAATQKSQPWMILGALLAVYLVLGILYESYIHPLT 883 +E + LP+GI G + + P ++ + + V+L L LYES+ P++ Sbjct: 842 LMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 884 ILSTLPSAGVGALLTIYVLRSEFSLISLLGLFLLIGVVKKNAIMMIDLALHLERDQGMTP 943 ++ +P VG LL + + + ++GL IG+ KNAI++++ A L +G Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 944 QESIRSACLQRLRPILMTTMAAILGALPLLLSTAEGAEMRRPLGLTIIGGLVFSQVLTLY 1003 E+ A RLRPILMT++A ILG LPL +S G+ + +G+ ++GG+V + +L ++ Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1004 TTPVVYLYLDRL 1015 PV ++ + R Sbjct: 1020 FVPVFFVVIRRC 1031 Score = 93.0 bits (231), Expect = 3e-21 Identities = 79/510 (15%), Positives = 166/510 (32%), Gaps = 39/510 (7%) Query: 2 NLSAPFIRRPVATVLLSLAIMLLGAVSFRLLPVAPLPNMDFPVIVVSASLP-GASPEIMA 60 N + +L+ I+ V F LP + LP D V + LP GA+ E Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587 Query: 61 SSVAT----------PLERSLGTIAGVNTMTSNSSQGTTRVILQFDLDRDINGAAREVQA 110 + S+ T+ G + + G V L+ + Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLK---PWEERNGDENSAE 644 Query: 111 AINASRNLLPSGMRSMPTYKKVNPSQAPIMVLSLTSTVLEK------GQLYDLASTILSQ 164 A+ + +R P+ + + L L + +L Sbjct: 645 AVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGM 704 Query: 165 SLSQVTGVGEVQIGGSS-LPAVRIELEPQLLSQYGISLDEVRTAITGSNVRRPKGSVEND 223 + + V+ G ++E++ + G+SL ++ I+ + + Sbjct: 705 AAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDR 764 Query: 224 QHNWQV--QANDQL-ETAKDYAPLIIRYQDGATLRLKDVAKVSD----AVENRYNSGFFN 276 ++ QA+ + +D L +R +G + RYN Sbjct: 765 GRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYN----- 819 Query: 277 DDRAVLLVINRQAGANIIETVAQIKAQLPALQAVLPASVKLDIAMDRSPVITATLHEAEM 336 L + Q A + A + L + LPA + D S + ++A Sbjct: 820 ----GLPSMEIQGEAAPGTSSGDAMALMENLASKLPAGIGYDW-TGMSYQERLSGNQAPA 874 Query: 337 TLLIAVVLVVLVVFLFLGSFRASLIPTLAVPVSLVGTFALMYLCGFSLNNLSLMALILAT 396 + I+ V+V L + S+ + L VP+ +VG L + ++ L+ Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934 Query: 397 GLVVDDAIVVLENI-SRHIHNGLDPMKAAFLGAKEVGFTLLSMNVSLVAVFVSILFMGGL 455 GL +AI+++E G ++A + + +L +++ + + + G Sbjct: 935 GLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGA 994 Query: 456 VESLFREFSITLSVSIVVSLVVSLTLTPML 485 I + +V + ++++ P+ Sbjct: 995 GSGAQNAVGIGVMGGMVSATLLAIFFVPVF 1024 Score = 79.5 bits (196), Expect = 4e-17 Identities = 55/345 (15%), Positives = 123/345 (35%), Gaps = 22/345 (6%) Query: 707 QVTLIVNRDTAKRLGIDMNMVTAVLNNAYSQ----RQVSTIYDSLNQYQVVMEVNPKYAQ 762 + + ++ D + + V L Q + T Q + ++ + Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF-K 241 Query: 763 DPVTLEQVQV-ITADGQRVPLSSIAHYERSLENDR--VSHDGQFASENISFDLAEGVSLD 819 +P +V + + +DG V L +A E EN +G+ A+ +LD Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 820 QATVAIERSVAAI--GLPSGIISKMAGTANAF--AATQKSQPWMILGALLAVYLVLGILY 875 A AI+ +A + P G+ F + + + +L LV+ + Sbjct: 302 TAK-AIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVF-LVMYLFL 359 Query: 876 ESYIHPLTILSTLPSAGVGALLTIYVLRSEFSLISLLGLFLLIGVVKKNAIMMIDLALHL 935 ++ L +P +G + + +++ G+ L IG++ +AI++++ + Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 936 ERDQGMTPQESIRSACLQRLRPILMTTMAAILGALPLLLSTAEGAEMRRPLGLTIIGGLV 995 + + P+E+ + Q ++ M +P+ + R +TI+ + Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 996 FSQVLTLYTTPVVYLYLDRLRHR--------FSRWRGRRTDAALE 1032 S ++ L TP + L + F W D ++ Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVN 524
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 90.5 bits (224), Expect = 7e-24 Identities = 75/262 (28%), Positives = 112/262 (42%), Gaps = 31/262 (11%) Query: 3 KVLIITGGSRGIGAATAILAASQGYRICINYLSDHAAAERTCAQVRAQGAQAITVQADVS 62 K+ ITG ++GIG A A ASQG I + E+ + ++A+ A ADV Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 63 NEDEIIRLFARVDAELGRVTHLVNNAGTLAQACRVEEMSEFRMLKMMMSNVVGPMLCSKH 122 + I + AR++ E+G + LVN AG L + +S+ N G S+ Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGL-IHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 123 ALLRMSPHHGGQGGSIVNVSSAAA---RLGSAGEYVDYAASKGALDTFTLGLSKEVAPEG 179 M + GSIV V S A R A YA+SK A FT L E+A Sbjct: 127 VSKYMMDR---RSGSIVTVGSNPAGVPRTSMAA----YASSKAAAVMFTKCLGLELAEYN 179 Query: 180 IRVNAVRPGFIFTDFH--------------ALSGDPFRVSKLEGALPMGRGGTAEEVAEA 225 IR N V PG TD S + F+ +P+ + ++A+A Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKT-----GIPLKKLAKPSDIADA 234 Query: 226 ILWLLSDKASYATGTFIDLAGG 247 +L+L+S +A + T + + GG Sbjct: 235 VLFLVSGQAGHITMHNLCVDGG 256
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 41.5 bits (97), Expect = 1e-06 Identities = 16/53 (30%), Positives = 25/53 (47%) Query: 74 LAIADAARGLGLGKQLLQHAEQRAVERDCAYLRLEVRPDNLAAIGLYERSGYR 126 +A+A R G+G LL A + A E L LE + N++A Y + + Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 28.9 bits (65), Expect = 0.042 Identities = 20/64 (31%), Positives = 30/64 (46%), Gaps = 12/64 (18%) Query: 11 LRNGRILDVELGKLVSGQEVVIQGERIIDVRAEGEPAGPDDQVIDLGGKTLMPGLIDCHV 70 L++GRI +GK +G + G II GP +VI GK + G +D H+ Sbjct: 90 LKDGRI--AAIGK--AGNPDMQPGVTII--------VGPGTEVIAGEGKIVTAGGMDSHI 137 Query: 71 HVLA 74 H + Sbjct: 138 HFIC 141
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 40.6 bits (95), Expect = 9e-06 Identities = 29/161 (18%), Positives = 59/161 (36%), Gaps = 21/161 (13%) Query: 48 FFPTGSELTSYLLALATFGVGFFMRPVGGIVLGIYGDKHGRKAALSLTILLMAFGTLIIA 107 + Y + LA + M+ VLG D+ GR+ L +++ A I+A Sbjct: 35 LVHSNDVTAHYGILLALYA---LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMA 91 Query: 108 LTPSFAQIGYLAPVLIVLARLLQGFSAGGEMGSATAFLTEHAPAGRKAFYSSWIQASIGV 167 P ++ + R++ G + G A A++ + +A + ++ A G Sbjct: 92 TAPFLW--------VLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGF 142 Query: 168 AVLLGSTLGAILSSYLTQAQLESWGWRVPFLIGTLIGPVGF 208 ++ G LG ++ + PF + + F Sbjct: 143 GMVAGPVLGGLMGGF---------SPHAPFFAAAALNGLNF 174
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 28.4 bits (63), Expect = 0.032 Identities = 10/56 (17%), Positives = 24/56 (42%) Query: 38 TLGGLTASALLASLSPNYALAEQVEFTDPDIIAEYVNYPSPKGHGQVRGYLVRPAK 93 LG + + P + EQ++ +++YV++ +++GY + P Sbjct: 153 VLGLYSQEDSGSDGVPGAQVNEQLQQRASTTMSDYVSFSPIMNDNKLQGYRLNPGP 208
>PF04183#IucA / IucC family Length = 580 Score = 180 bits (459), Expect = 1e-51 Identities = 94/411 (22%), Positives = 147/411 (35%), Gaps = 40/411 (9%) Query: 95 ARRGQGSW--------QCPAFPEFVQQLLSACEHMTRASNDELLDQVMQ--SQHLTAAIV 144 A RG W +C P Q LL + + S D + + MQ L + Sbjct: 50 AERGIWGWLWIDAQTLRCADEPVLAQTLLMQLKQVLSMS-DATVAEHMQDLYATLLGDLQ 108 Query: 145 AHNMAGEHP--EPLSGYLASEQGLWFGHPNHPAPKARLWPKHLAQETYAPEFQAKTALHL 202 + ++ Q L GHP K R A E YAPE+ LH Sbjct: 109 LLKARRGLSASDLINLNADRLQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHW 168 Query: 203 FEVPMEGLRITS-NGLSDTQVMAGFVDQAKARP------------GHALICMHPVQAELF 249 V E + N + Q++ +D + + +HP Q + Sbjct: 169 LAVKREHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQQK 228 Query: 250 MQDRRVQRLLELGEVTDLGTTGPLASPTASMRTWYIEG--HDYFIKGSLNVRITNCVRKN 307 + + E G + LG G S+RT IK L + T+C R Sbjct: 229 IATDFIADFAE-GRMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGI 287 Query: 308 AWYELESTLIIDELFQRLQQTQP-ETLGGLSTVAEP--GSMSWAPKGVGETDAHWFREQT 364 + + + Q++ T G + EP G +S + ++E Sbjct: 288 PGRYIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQEML 347 Query: 365 GAILRENFCLRSGAD-RSVMAGTLFARDLRSRPLVHDFLQRFKGWELGDEDLLTWFDQYQ 423 G I REN C D V+ TL D ++PL ++ R D TW Q Sbjct: 348 GVIWRENPCRWLKPDESPVLMATLMECDENNQPLAGAYIDRS------GLDAETWLTQLF 401 Query: 424 ALLLRPVMALFFNHGVVMEPHLQNAVLIHDNGQPRQLLLRDFEG-VKLTDE 473 +++ P+ L +GV + H QN L G P+++LL+DF+G ++L E Sbjct: 402 RVVVVPLYHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKE 452
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 145 bits (366), Expect = 2e-40 Identities = 93/412 (22%), Positives = 179/412 (43%), Gaps = 19/412 (4%) Query: 11 WVVINVLLGTLTVSLSNSSLNPALPTFMEAFRIGPLMATWIVAAFMTSMGMTMPLTSFLS 70 W+ I L + LN +LP F P W+ AFM + + + LS Sbjct: 18 WLCILSFFSVLNEMV----LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73 Query: 71 QRVGRKRLYLWGVALFIGGSLLGALANSIA-LVIAARVVQGVASGLMIPLSLAIIFAVYE 129 ++G KRL L+G+ + GS++G + +S L+I AR +QG + L + ++ Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133 Query: 130 KHERGRVTGLWSAAVMLAPALGPLCGSLLLEWFSWRSLFLMNVPIGLLALLLGVGVLPAS 189 K RG+ GL + V + +GP G ++ + W +L+ +P+ + + + L Sbjct: 134 KENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKK 191 Query: 190 EPAERKPFDLIGYLLIASGIGLLMVAISRMHHAEALLDPLNQAMVLVAVACLIAFVRVEL 249 E + FD+ G +L++ GI M+ + + + ++V+V + FV+ Sbjct: 192 EVRIKGHFDIKGIILMSVGIVFFMLFTTSY----------SISFLIVSVLSFLIFVKHIR 241 Query: 250 RRKDPLLNLRLFNLRGYRLSVIVAVVQSVGMFECLVLLPLLVQTVMGYNPIWTGLSLLCT 309 + DP ++ L + + V+ + + + ++P +++ V + G ++ Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301 Query: 310 AAFAS-LFGQWGGKALDRHGPRKVVAIGLLLTGLSTLALGLLKSDAAIGVVFVLMMVRGA 368 + +FG GG +DR GP V+ IG+ +S L L + + +++ V G Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG- 360 Query: 369 GLGLSYMPITTAGLNALPEPMVTQGAAMNNISRRLVASLGIVIASLWLEFRL 420 GL + I+T ++L + G ++ N + L GI I L L Sbjct: 361 GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPL 412
>PF04183#IucA / IucC family Length = 580 Score = 478 bits (1231), Expect = e-165 Identities = 165/598 (27%), Positives = 257/598 (42%), Gaps = 39/598 (6%) Query: 29 IDAGRYTKVQRRVIGQLLQTLLYEAALPYTCVSLDEHRHLFVVPATDSAQAPVEYRCSGL 88 ++ + V RR++ ++L L YE + S + R+ +P ++R Sbjct: 1 MNHKDWDLVNRRLVAKMLSELEYEQV--FHAESQGDDRYCINLPG-------AQWRFIAE 51 Query: 89 LSSSFELIRLEHASLERVDKDGKRSVPDLHQALTELLSPFQDSPHLARFIQEIEQTQLKD 148 + + ++ +L D L L ++LS +A +Q++ T L D Sbjct: 52 -RGIWGWLWIDAQTLRCAD--EPVLAQTLLMQLKQVLS--MSDATVAEHMQDLYATLLGD 106 Query: 149 LQA-RNQSYKPAKPAHQLDVDALEQHFMDAHSYHPCYKSRIGFSLADNVKYGPEFATPIE 207 LQ + + A L+ D Q + H K R G+ +Y PE+A Sbjct: 107 LQLLKARRGLSASDLINLNADR-LQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFR 165 Query: 208 VVWIAVAKSSASVGHVRAMDIQQFVRDELGTQRWQAFAQTLAAQGKSIDDYQLMPVHPWQ 267 + W+AV + MDI Q + + Q + F+Q G ++ +PVHPWQ Sbjct: 166 LHWLAVKREHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLD-HNWLPLPVHPWQ 224 Query: 268 WDNVTVSTFFPELARGELIYLGTSSDQYKAQQSIRTLANANDPKKPYVKLAMSMTNTSST 327 W + F + A G ++ LG DQ+ AQQS+RTL NA+ +KL +++ NTS Sbjct: 225 WQQKIATDFIADFAEGRMVSLGEFGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCY 284 Query: 328 RILARHTVLNGPIIADWLQHLISTDSTARELGFVILGEVAGVSFDYRHLAESRSA--QTY 385 R + + GP+ + WLQ + +TD+T + G VILGE A + A A + Sbjct: 285 RGIPGRYIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYAALARAPYRYQ 344 Query: 386 GTLGAIWRESLHQYLKDDEQAVPFNGLSHVENRYGDGEQSPFIDAWVRQYGL--ENWTRQ 443 LG IWRE+ ++LK DE V L D P A++ + GL E W Q Sbjct: 345 EMLGVIWRENPCRWLKPDESPVLMATLMEC-----DENNQPLAGAYIDRSGLDAETWLTQ 399 Query: 444 LLQVTVPPIIHMLYAEGIGMESHGQNIVLITKDGWPQRIALKDFHDGVRYSPAHLGRPEL 503 L +V V P+ H+L G+ + +HGQNI L K+G PQR+ LKDF +R Sbjct: 400 LFRVVVVPLYHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEF----- 454 Query: 504 CPELVPLPASHAKLNRNSFIVTDDVNAVRDFSCDCFFFICLAEMAIFLRQQYQLDEALFW 563 PE+ LP + + F+ + L + + E F+ Sbjct: 455 -PEMDSLPQEVRD------VTSRLSADYLIHDLQTGHFVTVLRFISPLMVRLGVPERRFY 507 Query: 564 QMTADVILGYQKAHPQHRDRFGLFDVFAPTYEVEELTKRRL-LGDGERRFRPVPNPLH 620 Q+ A V+ Y K HPQ +RF LF +F P L +L D + R +PN L Sbjct: 508 QLLAAVLSDYMKKHPQMSERFALFSLFRPQIIRVVLNPVKLTWPDLDGGSRMLPNYLE 565
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.0 bits (62), Expect = 0.015 Identities = 17/70 (24%), Positives = 24/70 (34%), Gaps = 6/70 (8%) Query: 108 HGKGDAQLIYAATEHWAVTRGARWLRIGVVTDNPRAKRFWETQGFATVCEREGVTMGLKK 167 G G A L++ A E WA L + N A F+ F + V L Sbjct: 104 KGVGTA-LLHKAIE-WAKENHFCGLMLETQDINISACHFYAKHHF-IIG---AVDTMLYS 157 Query: 168 NTISTMIKAL 177 N + A+ Sbjct: 158 NFPTANEIAI 167
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 27.3 bits (60), Expect = 0.020 Identities = 14/34 (41%), Positives = 18/34 (52%), Gaps = 1/34 (2%) Query: 66 TDRAKPSNIKIGWSLDHCKAVLLINDYPHAIVDF 99 T P N K+ W D +AVLLI+D + VD Sbjct: 13 TASDMPQN-KVSWVPDPNRAVLLIHDMQNYFVDA 45
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 30.1 bits (68), Expect = 0.009 Identities = 26/125 (20%), Positives = 42/125 (33%), Gaps = 26/125 (20%) Query: 1 MSILVIGATGTVGSLIVQRLAAADAEVKAL---------VRQPGKASFPA--GVTEVVAD 49 M LV GA G +G + +RL A +V + + + A G D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 50 LTDVPSIR--------------AALTSVR-TLFLLNAVTPDEVTQALITLNLAQEAGIER 94 L D + +VR +L +A +T L L + I+ Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 95 IVYLS 99 ++Y S Sbjct: 121 LLYAS 125
>PF03309#Bvg accessory factor Length = 271 Score = 30.5 bits (69), Expect = 0.007 Identities = 17/80 (21%), Positives = 31/80 (38%), Gaps = 9/80 (11%) Query: 247 TLDLLTACPDLAGLYVAGGGIEGVVSALEEMRSRRARLPTVVCHDLTDL----TRSALQS 302 +D+++A G ++ G GV + + +R A L V + T +Q+ Sbjct: 137 CVDVVSA----KGEFLGGAIAPGVQVSSDAAAARSAALRRVELTRPRSVIGKNTVECMQA 192 Query: 303 GLVQAVLSHPVETLARRSME 322 G V V+ L R + Sbjct: 193 GAVFGFAGL-VDGLVNRIRD 211
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 36.2 bits (83), Expect = 2e-04 Identities = 22/121 (18%), Positives = 42/121 (34%), Gaps = 7/121 (5%) Query: 186 KAELNPDAIKQEAQATQQDAQNTAERSAQNPQQADEQLGGLMDRIKA--KGDQAWDAADR 243 E + KQE++ +++ Q+ E +AQN + A E + + + + Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKET 1095 Query: 244 QALVNLI-----KARGNKTDAEANQIVDQAQASYRQAYAKYQELKAQAEQKAREAAEVTA 298 Q K K + E Q V + + + + ++ QAE V Sbjct: 1096 QTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155 Query: 299 K 299 K Sbjct: 1156 K 1156
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1194 bits (3091), Expect = 0.0 Identities = 640/1032 (62%), Positives = 807/1032 (78%), Gaps = 3/1032 (0%) Query: 1 MSRFFIDRPIFAWVLAIIVMLAGIMAILTLPIAQYPTIAPPAIAITANYPGASAKTLEDT 60 M+ FFI RPIFAWVLAII+M+AG +AIL LP+AQYPTIAPPA++++ANYPGA A+T++DT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQKMKGLDRLSYIASTSESSGSVTITLTFENGTDADTAQVQVQNKLTLATPLLPS 120 VTQVIEQ M G+D L Y++STS+S+GSVTITLTF++GTD D AQVQVQNKL LATPLLP Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGVQVVKSSTNFLNILAFTSEDGRMNGADLSDYVSANIQEAIGRVDGVGDTTLFGA 180 EVQQQG+ V KSS+++L + F S++ D+SDYV++N+++ + R++GVGD LFGA Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWINPDKLASYKMTPIDVRNAIQAQNVQVSSGQLGALPAAGNQQLNATITSQTRL 240 QYAMRIW++ D L YK+TP+DV N ++ QN Q+++GQLG PA QQLNA+I +QTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 QTAEQFEDILLRTETDGSQVRLRDVAKVELGSESYSNTSRFNGKPAAGLAIKLATGANAL 300 + E+F + LR +DGS VRL+DVA+VELG E+Y+ +R NGKPAAGL IKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTVKAIDARIEELKPYWPEGVRVQKPYDITPFVRISIEEVVRTLVEAVVLVFLVMYLFLQ 360 DT KAI A++ EL+P++P+G++V PYD TPFV++SI EVV+TL EA++LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFGVLAVFGYSINTLTMFAMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTF +LA FGYSINTLTMF MVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 SEEGLEPKAAARQSMEQISGALVGIALVLAAVFIPMAFFSGSSGVIYRQFSITIVSAMTL 480 E+ L PK A +SM QI GALVGIA+VL+AVFIPMAFF GS+G IYRQFSITIVSAM L Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVAMILTPALCATLLKPVGINHGQERRGFFGWFNRAFDRGSNRYQGVVGHMLVRPWRY 540 SVLVA+ILTPALCATLLKPV H + + GFFGWFN FD N Y VG +L RY Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 MIGYGVIVLLVMLGFSKLPVGFLPDEDQGTLFALIQLPPGATEKRTDEVLRQVEQHFMVD 600 ++ Y +IV +++ F +LP FLP+EDQG +IQLP GAT++RT +VL QV +++ + Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKDAVSGVFTVSGFSFAGSGQNIGLAFVKLRPWNERSDESLTVTQVTARAWQAFSGIRDA 660 EK V VFTV+GFSF+G QN G+AFV L+PW ER+ + + V RA IRD Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 LIVPFAPPAVSELGNATGFDLMLQDRGNLGHDALMKARNQLLEKLSKDP-RLVAVRANGQ 719 ++PF PA+ ELG ATGFD L D+ LGHDAL +ARNQLL ++ P LV+VR NG Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 720 ENAPEFRLQIDAHKAGTLGLSMSDINDTFSMAWGSNYVNDFLDQGRVKKVMLQAEAPFRM 779 E+ +F+L++D KA LG+S+SDIN T S A G YVNDF+D+GRVKK+ +QA+A FRM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 780 LPQDIGRWYVRNSAGTMVSFAAFAKAEWTSGSPRLERYNGVSSIEILGMALPGQASSGEA 839 LP+D+ + YVR++ G MV F+AF + W GSPRLERYNG+ S+EI G A PG SSG+A Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPG-TSSGDA 839 Query: 840 LAIVEAAVAELPPGFGFEWTGLSRQEKASTGQTTLLYSLSILFVFLCLAALYESWSVPLS 899 +A++E ++LP G G++WTG+S QE+ S Q L ++S + VFLCLAALYESWS+P+S Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 900 VIMVIPLGVFGVLLGAVLTWKMNDVYFQVGLLTTIGLAAKNAILIVEFAKDLHDR-GTGI 958 V++V+PLG+ GVLL A L + NDVYF VGLLTTIGL+AKNAILIVEFAKDL ++ G G+ Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 959 IEATLQATRMRLRPILMTSFAFILGVLPLVLSSGAGAGAQNALGVAVTGGMLSGTILALF 1018 +EATL A RMRLRPILMTS AFILGVLPL +S+GAG+GAQNA+G+ V GGM+S T+LA+F Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1019 FVPLFFILVYRK 1030 FVP+FF+++ R Sbjct: 1020 FVPVFFVVIRRC 1031 Score = 74.1 bits (182), Expect = 2e-15 Identities = 85/514 (16%), Positives = 177/514 (34%), Gaps = 39/514 (7%) Query: 536 RPWRYMIGYGVIVLLVMLGFSKLPVGFLPDEDQGTLFALIQLPPGATEKRTDEVLRQVEQ 595 RP + ++++ L +LPV P + P + D V + +EQ Sbjct: 8 RPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIEQ 67 Query: 596 HFMVDEKDAVSGVFTVSGFSFAGSGQNIGLAFVKLRPWNERSDESLTVTQVTARAWQAFS 655 + + + +S S + I L F +D + QV + Sbjct: 68 NMN-----GIDNLMYMSSTSDSAGSVTITLTF------QSGTDPDIAQVQVQNK----LQ 112 Query: 656 GIRDALIVPFAPPAVSELGNATGF---DLMLQDRGNLGHDALMK-ARNQLLEKLSKDPRL 711 L +S +++ + + D D + + + + LS+ + Sbjct: 113 LATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGV 172 Query: 712 VAVRANGQENAPEFRLQIDAHKAGTLGLSMSDINDTFSMA----WGSNYVNDFLDQGRVK 767 V+ G + A R+ +DA L+ D+ + + G+ Sbjct: 173 GDVQLFGAQYA--MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQL 230 Query: 768 KVMLQAEAPFRMLPQDIGRWYVR-NSAGTMVSFAAFAKAEWTSGSPR-LERYNGVSSIEI 825 + A+ F+ P++ G+ +R NS G++V A+ E + + R NG + + Sbjct: 231 NASIIAQTRFKN-PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGL 289 Query: 826 LGMALPGQASSGEALAIVEAAVAELPPGF--GFEWTGLSR-----QEKASTGQTTLLYSL 878 G A++ + ++A +AEL P F G + Q TL Sbjct: 290 GIKLATG-ANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLF--E 346 Query: 879 SILFVFLCLAALYESWSVPLSVIMVIPLGVFGVLLGAVLTWKMNDVYFQVGLLTTIGLAA 938 +I+ VFL + ++ L + +P+ + G + G++ IGL Sbjct: 347 AIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLV 406 Query: 939 KNAILIVE-FAKDLHDRGTGIIEATLQATRMRLRPILMTSFAFILGVLPLVLSSGAGAGA 997 +AI++VE + + + EAT ++ ++ + +P+ G+ Sbjct: 407 DDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAI 466 Query: 998 QNALGVAVTGGMLSGTILALFFVPLFFILVYRKR 1031 + + M ++AL P + + Sbjct: 467 YRQFSITIVSAMALSVLVALILTPALCATLLKPV 500
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 39.4 bits (92), Expect = 2e-05 Identities = 19/74 (25%), Positives = 32/74 (43%), Gaps = 9/74 (12%) Query: 67 EIRPQVSGIVQKRSFTEGSTVKAGQVLYLIDPATYRATYNSDLAALAKAEASLTSVRLKN 126 EI+P + IV++ EG +V+ G VL + A K ++SL RL+ Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAE-------ADTLKTQSSLLQARLEQ 150 Query: 127 ERYKELAALDAVSR 140 RY+ ++ Sbjct: 151 TRYQ--ILSRSIEL 162 Score = 31.7 bits (72), Expect = 0.005 Identities = 13/34 (38%), Positives = 17/34 (50%), Gaps = 1/34 (2%) Query: 66 AEIRPQVSGIVQKRS-FTEGSTVKAGQVLYLIDP 98 + IR VS VQ+ TEG V + L +I P Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVP 361 Score = 29.0 bits (65), Expect = 0.036 Identities = 16/149 (10%), Positives = 46/149 (30%), Gaps = 44/149 (29%) Query: 96 IDPATYRATYNSDLAALAKAEASLTSVRLKNERYKELAALDAVSRQDYDDAVSSLGESRA 155 ++ RA + LA + + E + + + + L A+++ + + E+ Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVN 266 Query: 156 DVASAKANV-------------------------------------------ESSRINLT 172 ++ K+ + + Sbjct: 267 ELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQ 326 Query: 173 YTQVNAPITGRIGKSGI-TPGALVTANQT 200 + + AP++ ++ + + T G +VT +T Sbjct: 327 ASVIRAPVSVKVQQLKVHTEGGVVTTAET 355
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 62.0 bits (150), Expect = 4e-14 Identities = 23/113 (20%), Positives = 43/113 (38%), Gaps = 1/113 (0%) Query: 1 MRVLTDAKRDAIIDAAAQVFQEDGFEAASMAAIAARVGGSKSTLYRYYNSKEALFVAVSS 60 + R I+D A ++F + G + S+ IA G ++ +Y ++ K LF + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64 Query: 61 KAAKSQLLPSLEKLLATEDKDLSTVLTAFGKATLSVVASEAMIKTLRTVISES 113 + S + + A D +VL L +E + L +I Sbjct: 65 LSE-SNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHK 116
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 70.2 bits (172), Expect = 5e-15 Identities = 27/121 (22%), Positives = 47/121 (38%), Gaps = 2/121 (1%) Query: 409 SSERILIVEDRPDVAELAKMVLDDYGYASDIVLNAREALKKFESGSTYDLLFTDLIMPGG 468 + IL+ +D + + L GY I NA + +G DL+ TD++MP Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAG-DGDLVVTDVVMP-D 59 Query: 469 MNGVMLAREVKRRYPKIKVLLTTGYAESSIERTDIGGSEFDVVSKPCMPHDLARKVRQVL 528 N L +K+ P + VL+ + +D + KP +L + + L Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119 Query: 529 D 529 Sbjct: 120 A 120
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 159 bits (403), Expect = 4e-48 Identities = 77/344 (22%), Positives = 140/344 (40%), Gaps = 37/344 (10%) Query: 2 ILVTGGAGYIGAHIVLALLEHGNEVLVLDNLCNSSRETL---DRVANITGRHFDFIPGDV 58 LVTG AG+IG H+ LLE G++V+ +DNL N + R+ + F F D+ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNL-NDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 59 RSKATLHALFAEYPIEAVVHCAGLKAVGESVREPLRYFETNVSGSVNLCQAMAEAGVFNL 118 + + LFA E V AV S+ P Y ++N++G +N+ + + +L Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121 Query: 119 LFSSSATVYGEADRMPLDETCALGLPTNPYGHSKLMAEHVMKSAASSDPRWAIGLLRYFN 178 L++SS++VYG +MP ++ P + Y +K E +M S LR+F Sbjct: 122 LYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANE-LMAHTYSHLYGLPATGLRFFT 180 Query: 179 PIGAHPSGMLGESPRNTPNNLLPFLLQVANRQRPALHVFGSDYPTPDGTGIRDYLHVMDL 238 G P P ++ F A + ++ V+ G RD+ ++ D+ Sbjct: 181 VYG----------PWGRP-DMALFKFTKAMLEGKSIDVYN------YGKMKRDFTYIDDI 223 Query: 239 AEGHLQALARIGTQRGV---------------SIWNLGTGRGYSVLEVVKTFERISGVKV 283 AE ++ I ++N+G +++ ++ E G++ Sbjct: 224 AEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEA 283 Query: 284 PLVFEPRRSGDVAECWSDPGKALLELNWQARHDLEAMLTDAWRW 327 P + GDV E +D + + ++ + + W Sbjct: 284 KKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNW 327
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 55.2 bits (133), Expect = 3e-10 Identities = 81/445 (18%), Positives = 140/445 (31%), Gaps = 63/445 (14%) Query: 33 LVIALGITWLLDGLEVTLAGSVAGALKASPVLNLS-NSEIGLAGAAYIAGAVLGALFFGW 91 L++ L LD + + L V L V + + G+ A Y A G Sbjct: 7 LIVILSTV-ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65 Query: 92 LTDRLGRRKLFFITLALYISATFATAFSFSVWSFMLFRFLTGMGIGGEYTAINSTIQEFT 151 L+DR GRR + ++LA A + +W + R + G+ G + I + T Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADIT 124 Query: 152 P----ARYRGWVDLTINGTFWLGAALGAVGSIVLLDPQWVGAELGWRLCFGIGAVLGLFI 207 AR+ G++ G LG + F A L Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGG-----------LMGGFSPHAPFFAAAALNGLN 173 Query: 208 MLMRLWLPESPRWLMIHGRSEEARKIVEQIEADMQRRGHVLPAIEGKPLRLHARDHTPLG 267 L +L EA + ++L +G Sbjct: 174 FLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQL-------VG 226 Query: 268 EIFHTLFVSFRQRSLVGLTLLTAQAFFYNAIFFTYALVLTDFYDVPSERVGWYVLPLALG 327 ++ L+V F F ++A +L L Sbjct: 227 QVPAALWVIF-----------GEDRFHWDATTIGISLAAFG----------------ILH 259 Query: 328 NFCGPLLLGRLFDVVGRRIMISLTYGLSGVLLAISGYLFQQGLLDVTQQAIAWMVIFFFA 387 + ++ G + +G R + L G++ +GY+ L T+ +A+ ++ A Sbjct: 260 SLAQAMITGPVAARLGERRALML-----GMIADGTGYIL---LAFATRGWMAFPIMVLLA 311 Query: 388 SAA-ASSAYLTVAETFPLEIRALAIAVFYAFGTGLGGIIGPTLFGELIETHDRSNVLIGY 446 S A + E R + A T L I+GP LF + + + Sbjct: 312 SGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAW 371 Query: 447 LIGAGL--MLLAAFVQSIWGTAAER 469 + GA L + L A + +W A +R Sbjct: 372 IAGAALYLLCLPALRRGLWSGAGQR 396
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 35.6 bits (82), Expect = 4e-04 Identities = 70/415 (16%), Positives = 128/415 (30%), Gaps = 83/415 (20%) Query: 59 PGLIREGIFATGSQGLFGFSDQAAFASATFLGLF-FGASLVSPI----ADRFGRRAIFTC 113 PGL+R+ S+ L L+ +P+ +DRFGRR + Sbjct: 29 PGLLRD----------LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLV 78 Query: 114 ALIWYTVATVMMGLQTSAMGVIGMRFLVGIGLGVELVTIDAYLSELVPKRIRSSAFAF-- 171 +L V +M + R + GI G AY++++ R+ F F Sbjct: 79 SLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMS 137 Query: 172 -AFSIQFLAVPSVALMSWWLVPQDPLGYAGWRWVVISSAVFALFIWWLRSSLPESPRWLA 230 F +A P + + P P A ++ F + L S Sbjct: 138 ACFGFGMVAGPVLGGLMGGFSPHAPFFAAA----ALNGLNFLTGCFLLPES--------- 184 Query: 231 QHGRFVEAERVVDDLEARCLKDHKQPLDQPEPQTVAVEGKGRFADMWQPPFRRRALMLIA 290 K ++PL + +A W A ++ Sbjct: 185 -------------------HKGERRPLRREALNPLASF-------RWARGMTVVAALMAV 218 Query: 291 FHIFQAIGFFG------FG----NWLPAL--LSGQGVSVTHSLSYAFVITLAYPLGPLLF 338 F I Q +G FG +W +S + HSL+ A + Sbjct: 219 FFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMI-----------T 267 Query: 339 VKFANRFENKWQIVGSALSSMIFGTLFAFQTSAAGLIFCGIMITFSNAWLSFSYHSYQGE 398 A R + ++ ++ L AF T + F +++ S + + Sbjct: 268 GPVAARLGERRALMLGMIADGTGYILLAFATRGW-MAFPIMVLLASGGIGMPALQAMLSR 326 Query: 399 LFPTNIRARAVGFCYSFSRLSTVFSSLLIG-IFLEHFGTPGVLAFIVSSMLIVII 452 + + G + + L+++ LL I+ T A+I + L ++ Sbjct: 327 QVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLC 381
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 368 bits (947), Expect = e-125 Identities = 126/370 (34%), Positives = 197/370 (53%), Gaps = 17/370 (4%) Query: 162 ERIQHLSRGVEDQRQLVEVYKRAAGGRAPRELIGQSEVLERLQQEIQLVANSPLTVLVTG 221 + + + + K + L+G+S ++ + + + + + LT+++TG Sbjct: 109 TELIGIIGRALAEPKR-RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITG 167 Query: 222 ETGVGKELVAEAIHLHSPRAHKPLISLNCAALPETLVESELFGHVKGAFSGAVNGRSGKF 281 E+G GKELVA A+H + R + P +++N AA+P L+ESELFGH KGAF+GA +G+F Sbjct: 168 ESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRF 227 Query: 282 ELADGGTLFLDEVGELPLSVQSKLLRVLQSGQLQRVGSDQEHRVDVRIIAATNRNLAEEV 341 E A+GGTLFLDE+G++P+ Q++LLRVLQ G+ VG R DVRI+AATN++L + + Sbjct: 228 EQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSI 287 Query: 342 RSGRFRADLYHRLSVYPLQVPALRERGRDVLLLAGYFLEENRLRMGLRSLRLNPEAQRML 401 G FR DLY+RL+V PL++P LR+R D+ L +F+++ + GL R + EA ++ Sbjct: 288 NQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELM 346 Query: 402 LAHAWPGNVRELEHLISRAVLKALSGHAQRPRIL---------------TIEPQSLGLDE 446 AH WPGNVRELE+L+ R R I SL + + Sbjct: 347 KAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQ 406 Query: 447 AIDSLPLLPQALEVAAGVEGQGLKAAVDAYQRALIANALDRHQGRWTEVARELSVDRANL 506 A++ A A + + LI AL +G + A L ++R L Sbjct: 407 AVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTL 466 Query: 507 NRLSKRLGIR 516 + + LG+ Sbjct: 467 RKKIRELGVS 476
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 53.5 bits (128), Expect = 1e-10 Identities = 48/196 (24%), Positives = 78/196 (39%), Gaps = 11/196 (5%) Query: 2 KRILIIGATSAIAHACARLWAAQGCDFFLVARSADRLQ--VTAADLEGRGARAVTLHEMD 59 K I GA I A AR A+QG V + ++L+ V++ E R A A D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 60 ATHFAEHPRMLADCLQVLGQIDVVLIAHGTL---PDQRACEQDVGLALQEFITNSASVIA 116 + E + A + +G ID+++ G L +++ F NS V Sbjct: 69 SAAIDE---ITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWE---ATFSVNSTGVFN 122 Query: 117 LLTLLAKHFELQRCGTLAVISSVAGERGRPSNYLYGAAKAAVSTFCDGLQARLFKVGVHV 176 ++K+ +R G++ + S R S Y ++KAA F L L + + Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 177 LTIKPGFVDTPMTQGL 192 + PG +T M L Sbjct: 183 NIVSPGSTETDMQWSL 198
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 76.0 bits (187), Expect = 4e-18 Identities = 34/143 (23%), Positives = 61/143 (42%), Gaps = 2/143 (1%) Query: 2 TRILAIEDDAITAKEIVAELSSHGLEVDWVDNGRDGLARAVSGDYDLITLDRMLPEIDGL 61 IL +DDA + LS G +V N +GD DL+ D ++P+ + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 TIVTQLRAQGIATPILMISALSDVDERVRGLRAGGDDYLPKPFASDEMAARVEVLLRRSN 121 ++ +++ P+L++SA + ++ G DYLPKPF E+ + L + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL--AE 121 Query: 122 PVSAAKTVLQVADLELNLITREA 144 P + + + L+ R A Sbjct: 122 PKRRPSKLEDDSQDGMPLVGRSA 144
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 43.3 bits (102), Expect = 1e-06 Identities = 24/147 (16%), Positives = 49/147 (33%), Gaps = 8/147 (5%) Query: 109 NQVLARLDPREQRTGLESASADVAVRESRLRLAEQNYQR-QQRLLPKGYTNLSEYQQ-AR 166 Q +A+ EQ A ++ V +S+L E ++ ++ Sbjct: 246 KQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQ----LVTQLFKNEIL 301 Query: 167 SSLESARGDLASFKAQLATAREQVGYTELVAVANGVITARQA-EEGQVVQAAAPVFSLAH 225 L ++ +LA E+ + + A + + + EG VV A + + Sbjct: 302 DKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVP 361 Query: 226 DGEREAVFAAY-ESLLGTDRIGDRVTI 251 + + V A +G +G I Sbjct: 362 EDDTLEVTALVQNKDIGFINVGQNAII 388 Score = 36.7 bits (85), Expect = 1e-04 Identities = 13/83 (15%), Positives = 29/83 (34%) Query: 113 ARLDPREQRTGLESASADVAVRESRLRLAEQNYQRQQRLLPKGYTNLSEYQQARSSLESA 172 L+ ++R + A + E+ R+ + LL K + + A Sbjct: 205 KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA 264 Query: 173 RGDLASFKAQLATAREQVGYTEL 195 +L +K+QL ++ + Sbjct: 265 VNELRVYKSQLEQIESEILSAKE 287 Score = 32.1 bits (73), Expect = 0.004 Identities = 19/129 (14%), Positives = 39/129 (30%), Gaps = 10/129 (7%) Query: 92 SGKLVKR-FVDVGDRVHVNQVLARLDP-------REQRTGLESASADVAVRESRLRLAEQ 143 +VK V G+ V VL +L + ++ L A + + R E Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIEL 162 Query: 144 NYQRQQRLLPKGYTNLSEYQQARSSLESARGDLASFKAQLATAREQVGYTELVAVANGVI 203 N + +L + Y ++ + ++++ Q + A V+ Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELN--LDKKRAERLTVL 220 Query: 204 TARQAEEGQ 212 E Sbjct: 221 ARINRYENL 229
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 42.1 bits (99), Expect = 2e-06 Identities = 21/118 (17%), Positives = 43/118 (36%), Gaps = 11/118 (9%) Query: 99 SEQQNQLHARQAELSKAQSSWQQVRDEQLRYQQLFERGVGSRARLDQLSSDLRNQEALQQ 158 E N+L +++L + +S ++E QLF+ + + R + L E Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLE---- 317 Query: 159 RAGIALQQARDHLSYTRLLAEFDGLVTEWRA-EVGQVMAAGEPVVSLARPESREAVVD 215 L + + + + A V + + G V+ E ++ + PE V Sbjct: 318 -----LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIV-PEDDTLEVT 369 Score = 30.6 bits (69), Expect = 0.011 Identities = 25/96 (26%), Positives = 35/96 (36%), Gaps = 17/96 (17%) Query: 84 GKAVRKGDLLATLEPSEQQNQLHARQAELSKAQSSWQQVRDEQLRYQQLFERGVGSRARL 143 G++VRKGD+L L +A+ K QSS Q R EQ RYQ Sbjct: 115 GESVRKGDVLLKLTALGA-------EADTLKTQSSLLQARLEQTRYQ----------ILS 157 Query: 144 DQLSSDLRNQEALQQRAGIALQQARDHLSYTRLLAE 179 + + + L + L T L+ E Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKE 193
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 465 bits (1199), Expect = e-149 Identities = 237/1044 (22%), Positives = 436/1044 (41%), Gaps = 74/1044 (7%) Query: 12 LKHRTLVWYMMFVSLLMGSWSFLNLGREEDPSFAIKTMVIQARWPGATLNDTLQQVTDRL 71 ++ W + + ++ G+ + L L + P+ A + + A +PGA VT + Sbjct: 6 IRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVI 65 Query: 72 EKKLEEIDALDYVKSYTL-AGESTLFVFLKSETRSADIPEAWYQVRKKISDVRGELPAGI 130 E+ + ID L Y+ S + AG T+ + +S T D A QV+ K+ LP + Sbjct: 66 EQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGT---DPDIAQVQVQNKLQLATPLLPQEV 122 Query: 131 QGP-AFNDEFGDVFGSIYAFTTDGLSFRQ--LRDYVE-QVRADIRSVPNLGKIELLGAQR 186 Q ++ + + F +D Q + DYV V+ + + +G ++L GAQ Sbjct: 123 QQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182 Query: 187 EV-IYLNFSIRKLAALGIDQRQVLQSLQAQNSVTPAGVMEAGPE------RIAVRASGQF 239 + I+L+ L + V+ L+ QN AG + P ++ A +F Sbjct: 183 AMRIWLDAD--LLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 240 TSEEDLLAVNLRFGD--RFFRLSDLATVERRYADPPSSLFRFNGQPAIGLAVAMKQGGNI 297 + E+ V LR RL D+A VE + + + R NG+PA GL + + G N Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELG-GENYNVIARINGKPAAGLGIKLATGANA 299 Query: 298 QAFGTQLQQRIEELTTELPLGIDVHLVSSQADVVEKAIGGFTRALFEAILIVLVVSFISL 357 ++ ++ EL P G+ V V+ +I + LFEAI++V +V ++ L Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 358 G-VRAGLVVACSIPLVLALVFVFMEYSGITMQRISLGALIIALGLLVDDAMITVEMMVTR 416 +RA L+ ++P+VL F + G ++ +++ +++A+GLLVDDA++ VE + Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 417 L-EHGDSREQAATFAYTSTAFPMLTGTLVTVAGFVPIGLNHSSAGEYVFTMFAVIAVALL 475 + E ++A + + ++ +V A F+P+ S G I A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 476 LSWLVAVLFAPLIGVHLLKVSA--VHAAPGRWMRGFSRALVRALEH-----------RWW 522 LS LVA++ P + LLK + H G + F+ ++ H Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 523 VIGITTLIFIGSLFAGKLLQNQFFPDSDRPEILVDFYMPQNGSIEGTRQTMDRFESTLKD 582 + I LI G + L + F P+ D+ L +P + E T++ +D+ Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 583 DPDVLRWSSYVGKGAVRFYLPLDQQLSNPFYGQMVIV-----SHGGEARDRLIERLRQRF 637 + S + G Q N + + + + + +I R + Sbjct: 600 NEKANVESVFTVNG-----FSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMEL 654 Query: 638 RDDYVGVGGYVQPLNMGPPVGWPIQYRVSGPDIEQVRSQAMALAALLDTN---------- 687 G+V P NM V +G D E + + AL Sbjct: 655 GKIR---DGFVIPFNMPAIVE---LGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQH 708 Query: 688 -PNIGQVIYDWNEPGKVLKIDIAQDKVRQFGLSSEDVAQILNSLVSGTTITQLRDNTYLI 746 ++ V + E K+++ Q+K + G+S D+ Q +++ + GT + D + Sbjct: 709 PASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVK 768 Query: 747 DLVGRAESDERSSIQTLASLQIPTPNGSTVPLLSFATLSYEQEQPLVWRRDRLPTITLKA 806 L +A++ R + + L + + NG VP +F T + P + R + LP++ Sbjct: 769 KLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSME--- 825 Query: 807 SVLGKLQPAALVKQLKPDVDVFSASLPVRYSVATGGAVEASARSQGPILKVVPLMLLMVV 866 + G+ P ++ ++ LP G S +V + ++V Sbjct: 826 -IQGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVF 884 Query: 867 SFLMVQLHSVKKLMLVVSVVPLGLIGVVAALLISGYPLGFVAILGVLALIGIIIRNSVIL 926 L S + V+ VVPLG++GV+ A + ++G+L IG+ +N++++ Sbjct: 885 LCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILI 944 Query: 927 VTQIDDFMAA-GESPWASVIKATEHRCRPILLTAAAASLGMIPIA------REVFWGPMA 979 V D M G+ + + A R RPIL+T+ A LG++P+A + Sbjct: 945 VEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQN-AVG 1003 Query: 980 IAMIGGIAVATLLTLFFLPALYLV 1003 I ++GG+ ATLL +FF+P ++V Sbjct: 1004 IGVMGGMVSATLLAIFFVPVFFVV 1027 Score = 74.9 bits (184), Expect = 1e-15 Identities = 54/330 (16%), Positives = 124/330 (37%), Gaps = 26/330 (7%) Query: 702 KVLKIDIAQDKVRQFGLSSEDVAQIL---NSLVSGTTI---TQLRDNTYLIDLVGRAESD 755 ++I + D + ++ L+ DV L N ++ + L ++ + Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 756 ERSSIQTLASLQIPT-PNGSTVPLLSFATLSY-EQEQPLVWRRDRLPTITLKASVLGKLQ 813 + + + + +GS V L A + + ++ R + P L + Sbjct: 242 ---NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298 Query: 814 PAALVKQLKPDVDVFSASLP--VRYSVA--TGGAVEASARS-QGPILKVVPLMLLMVVSF 868 K +K + P ++ T V+ S + + + L+ L++ F Sbjct: 299 ALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLF 358 Query: 869 LMVQLHSVKKLMLVVSVVPLGLIGVVAALLISGYPLGFVAILGVLALIGIIIRNSVILVT 928 L +++ ++ VP+ L+G A L GY + + + G++ IG+++ +++++V Sbjct: 359 L----QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414 Query: 929 QIDDFMA-AGESPWASVIKATEHRCRPILLTAAAASLGMIPIA-----REVFWGPMAIAM 982 ++ M P + K+ ++ A S IP+A + +I + Sbjct: 415 NVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITI 474 Query: 983 IGGIAVATLLTLFFLPALYLVCYGIRPTGH 1012 + +A++ L+ L PAL H Sbjct: 475 VSAMALSVLVALILTPALCATLLKPVSAEH 504
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 40.6 bits (95), Expect = 9e-06 Identities = 60/368 (16%), Positives = 119/368 (32%), Gaps = 45/368 (12%) Query: 43 IAPDIGLSSTAASLIVSLTQIGYALGLFFLVPLGDLLENRKLMLLTTAVATLSLLSAAFA 102 IA D + + + + + +++G L D L ++L+L + + Sbjct: 40 IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVG 99 Query: 103 EQP-NLFLLVSLLVGFSSVSVQMLIPLA-AHLAPEESRGRVVGGIMGGLLLGILLARPIS 160 +L ++ + G + + L+ + A P+E+RG+ G I + +G + I Sbjct: 100 HSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIG 159 Query: 161 SLVADHFGWRAVFGSAAVVMIGISVVLATTMP-KRVPDH-------------------RA 200 ++A + W + + +I + ++ R+ H Sbjct: 160 GMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTT 219 Query: 201 TYGQLLFSLWTLLRTQPVLRQRA--------------------FYQACMFATFSLFWTAV 240 +Y + L V R +F T + F + V Sbjct: 220 SYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMV 279 Query: 241 PLELSRNHGLSQTQI-AIFALIGAI-GAIAAPISGRLADAGHTRIVSLGALLLGALSFLP 298 P + H LS +I ++ G + I I G L D V + ++SFL Sbjct: 280 PYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLT 339 Query: 299 GLIHPVYSVIGLAVTGV-VLDYCVQTSMVLGQRTVYALDAASRSRLNALYMTSIFIGGAI 357 + + + V VL T V+ +L +L + F+ Sbjct: 340 ASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399 Query: 358 GSAVASPL 365 G A+ L Sbjct: 400 GIAIVGGL 407
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 43.0 bits (101), Expect = 1e-07 Identities = 35/128 (27%), Positives = 53/128 (41%), Gaps = 16/128 (12%) Query: 5 QRGFTLLEVLLVISLLGVLLVLVAGALLG------ANRAVLKAERYTVGLDEMRAAQAFL 58 QRGFTLLE+++VI ++GVL LV L+G +AV LD + Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66 Query: 59 RSSIS--QALPLDTSAEDDAKS----GFFEGTAQD---LRFVATLPGELGGGIQLHTLGL 109 ++ ++L + A + G+ + D +V PGE G L + G Sbjct: 67 PTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGE-HGAYDLLSAGP 125 Query: 110 KGPEGDRD 117 G G D Sbjct: 126 DGEMGTED 133
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 33.8 bits (77), Expect = 8e-05 Identities = 16/32 (50%), Positives = 22/32 (68%) Query: 4 SQSGFTLLEMLAALTVMAVCSGVLLVAFGQSA 35 Q GFTLLEM+ L +M V +G++L+AF S Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASR 33
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 38.3 bits (89), Expect = 2e-06 Identities = 18/50 (36%), Positives = 31/50 (62%) Query: 1 MKSPVASRGFTLMEMLVVLVLMSIAVGLVGFGLQQGLSTASERRAVGDMV 50 M++ RGFTL+E++VV+V++ + LV L A +++AV D+V Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIV 50
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 117 bits (295), Expect = 5e-37 Identities = 43/143 (30%), Positives = 69/143 (48%), Gaps = 21/143 (14%) Query: 12 RRQSGFTLLEMLAVIVLLGIVATIVVRQVGGNVDKGKYGAGKAQLASLGMKIESYALDVG 71 +Q GFTLLE++ VIV++G++A++VV + GN +K + + +L ++ Y LD Sbjct: 5 DKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNH 64 Query: 72 SPPKTLQQLTDKPGNAAGWNGPYAKPSDL------------KDPFGHAFGYRFPGQHGSF 119 P T G + P P DP+G+ + PG+HG++ Sbjct: 65 HYPTT------NQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAY 118 Query: 120 DLIFYGQDGQPGGEGYSADLGNW 142 DL+ G DG+ G E D+ NW Sbjct: 119 DLLSAGPDGEMGTED---DITNW 138
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 321 bits (824), Expect = e-109 Identities = 135/404 (33%), Positives = 212/404 (52%), Gaps = 6/404 (1%) Query: 1 MSLFKYRALDAQGAPQNGTLEARDQDAAVAALQKRGLMVLQVDSAGLGGLRRALGS---- 56 M+ + Y+ALDAQG GT EA A L++RGL+ L VD + Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60 Query: 57 --GMLNGAALVSFTQQLATLLGAGQPLERSLGILLKQPGQPQTRALIERIREQVKAGKPL 114 L+ + L T+QLATL+ A PLE +L + KQ +P L+ +R +V G L Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120 Query: 115 SVALEEEGSQFSPLYISMVRAGEAGGALESTLRQLSDYLERSQLLRGEVINALIYPAFLV 174 + A++ F LY +MV AGE G L++ L +L+DY E+ Q +R + A+IYP L Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180 Query: 175 VGVLGSLALLLAYVVPQFVPIFKDLGVPIPLITEVILNLGQFLGSYGLAVFAGLIVLIWG 234 V + +++LL+ VVP+ V F + +PL T V++ + + ++G + L+ Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240 Query: 235 LVISMRDPQRRERHDRRVLGIRVIGPLLQRIEAARLTRTLGTLLSNGVALLQALVIARQV 294 + +R +RR RR+L + +IG + + + AR RTL L ++ V LLQA+ I+ V Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300 Query: 295 CTNRALQAQVEQAAESVKGGGTLAAAFGAQPLLPDLALQMIEVGEQAGELDSMLLKVADV 354 +N + ++ A ++V+ G +L A L P + MI GE++GELDSML + AD Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360 Query: 355 FDVEAKRGIDRMLAALVPALTVVMAGMVAVIMLAIMLPLMSLTS 398 D E + L P L V MA +V I+LAI+ P++ L + Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNT 404
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 47.4 bits (112), Expect = 2e-07 Identities = 33/142 (23%), Positives = 53/142 (37%), Gaps = 34/142 (23%) Query: 434 IQGMKAEYFSNANWSGDAAVTRTEQHVDLDWANDKDLPFESNLSGSDPYTSKGSTAGSLN 493 QG+ YFS+ N+ VT + DL S+ + Sbjct: 45 SQGLLGYYFSDLNFQAPMVVTSST---------TGDLSIPSS-----------ELENIPS 84 Query: 494 GDTSSTSIRYTGKITPTESGEQVFKVRADGAVRLWVNGKLIIDNGDGKPLPGNSIPPTIP 553 + S ++G I +S E F AD V +WV+ + +I+ NS Sbjct: 85 ENQYFQSAIWSGFIKVKKSDEYTFATSADNHVTMWVDDQEVIN------KASNSN----- 133 Query: 554 EFAKISLQAGQSYDVKLEYSRR 575 KI L+ G+ Y +K++Y R Sbjct: 134 ---KIRLEKGRLYQIKIQYQRE 152
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 91.2 bits (226), Expect = 5e-23 Identities = 43/175 (24%), Positives = 75/175 (42%), Gaps = 4/175 (2%) Query: 8 LVVVVLLALLMAGCGDRMELHRDLTEQDANEVLAELAGKNIDAQKRLDKGGVAVLVSTQD 67 V +V+ +L A D L +L++QD ++A+L NI R G A+ V Sbjct: 34 AVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPY--RFANGSGAIEVPADK 91 Query: 68 ISRAVRVLEAVGLPRRSRSTLGQVFRKEGVISSPLEERARYIYALSQELEQTLSQIDGVV 127 + L GLP+ + ++ +E S E+ Y AL EL +T+ + V Sbjct: 92 VHELRLRLAQQGLPK-GGAVGFELLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVK 150 Query: 128 VARVHVVLPERIAPGEPVQPASAAVFIKHRADLEPDSVLPR-IRRMVASSIPGMT 181 ARVH+ +P+ + SA+V + D + +V+S++ G+ Sbjct: 151 SARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAGLP 205
>PF07132#Harpin protein (HrpN) Length = 356 Score = 45.1 bits (106), Expect = 2e-07 Identities = 55/245 (22%), Positives = 101/245 (41%), Gaps = 35/245 (14%) Query: 31 GGAAGKIASLLGDSMFEKHGSGANIRDTENPLLGMVADHMDKNPGKYGKPDDATGKVNGW 90 + S LG + G+G N + + ++ ++ G G ++ Sbjct: 98 SSLGSGLGSALGGGLGGALGAGMNAMNPSAMMGSLLFSALEDLLG---------GGMSQQ 148 Query: 91 RDELSEDKYLNSEEKEAFTKGLEGLITEFLSGGSTGSASGGTGTGTGQSVGSGQNPASNW 150 + L +K +S E A+T+G+ ++ L G + + + G + G + A Sbjct: 149 QGGLFGNKQPSSPEISAYTQGVNDALSAILGNGLSQTKGQTSPLQLGNNGLQGLSGA--- 205 Query: 151 GAPAANSGNASGGGLQELLAALLGSLGEEKLDNLLQPNTSPNAKSGQTTFSFEDKDVLKE 210 G +L + L S+G++ L ++ N + ED+ + KE Sbjct: 206 ------------GAFNQLGSTLGMSVGQKAGLQELNNISTHNDSPTRYFVDKEDRKMAKE 253 Query: 211 VSRFMDMHPEEFGKPDGK----------SKDWMGELSE-GDNVMSKGESEQFQKAIDMIK 259 + +FMD +PE FGKP+ + K W LS+ D+ M+KG ++F KA+ MIK Sbjct: 254 IGQFMDQYPEVFGKPEYQKDNWQTAKQDDKSWAKALSKPDDDGMTKGSMDKFMKAVGMIK 313 Query: 260 GEIKG 264 + G Sbjct: 314 SAVAG 318
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 280 bits (719), Expect = 5e-94 Identities = 99/328 (30%), Positives = 151/328 (46%), Gaps = 45/328 (13%) Query: 23 IRKAAPLNVDMVLEGETGTGKDTLARRIHQLSGR-EGPLVAINCAAVPEQLAESELFGVM 81 + + ++ +++ GE+GTGK+ +AR +H R GP VAIN AA+P L ESELFG Sbjct: 153 LARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHE 212 Query: 82 AGAYTGASKSRAGYIEASHNGTLYLDEIDSMPLLLQAKLLRVLEMRGIERLGSTRFVPLN 141 GA+TGA G E + GTL+LDEI MP+ Q +LLRVL+ +G + + Sbjct: 213 KGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSD 272 Query: 142 LRVIVATQTPLEKLVEEGKFRRDLFFRLNVIKIQLPTLRSRLDHLPSLFERFVVETAEKH 201 +R++ AT L++ + +G FR DL++RLNV+ ++LP LR R + +P L F V+ AEK Sbjct: 273 VRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHF-VQQAEKE 331 Query: 202 GQPIPVRDPHVLNRLLSHRWPGNIRELKCAAERFVL------------------------ 237 G + D L + +H WPGN+REL+ R Sbjct: 332 GLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSP 391 Query: 238 -------------------GMPPLSSENDSQTENSIHLKSYLRQFEKALIQDCLSRHPKS 278 M + S L + E LI L+ + Sbjct: 392 IEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGN 451 Query: 279 IDSVINELGIPRRTLYHRMKSLSINSPE 306 + LG+ R TL +++ L ++ Sbjct: 452 QIKAADLLGLNRNTLRKKIRELGVSVYR 479
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 433 bits (1115), Expect = e-151 Identities = 86/430 (20%), Positives = 173/430 (40%), Gaps = 11/430 (2%) Query: 19 QFFTRAGWLLTLVGAGSFFLWASLAPLDQGIAVQGTVVVSGKRKAVQSLDSGVVSRILVT 78 + + + F+ + L ++ G + SG+ K ++ +++ +V I+V Sbjct: 55 RRPRLVAYFIMGFLVI-AFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113 Query: 79 EGQAVKEGEPLFRLDQTQVEADVQSLRAQYRMAWASLARWQSERDNLSEVNFPAELIAAG 138 EG++V++G+ L +L EAD ++ A R+Q ++ P + Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPD- 172 Query: 139 HGQDPDPRLAMVLEGQ----RQLFSSRRQALAREQAGLQASIEGAGAQLAGMRRARSDLL 194 +P V E + L + ++ + +++ A+ + + Sbjct: 173 -----EPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYE 227 Query: 195 AQADSLRQQLSNLQPLAQNGFIPRNRLLEYERQLSQVQQEMAQNAGETGRIEQGIVESRL 254 + + +L + L I ++ +LE E + + E+ + +IE I+ ++ Sbjct: 228 NLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKE 287 Query: 255 RLQQQREEYQKEVRTQWADAQVKTLTLEQQLASAGFSLQHSEILAPADGIAVNLGVHTEG 314 Q + ++ E+ + L +LA Q S I AP L VHTEG Sbjct: 288 EYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEG 347 Query: 315 AVVRAGQTLLEVVPQGTRLEVEGRLPVNLIDKVGSHLPVDILFTAFNQNSTPRVTGEVSL 374 VV +TL+ +VP+ LEV + I + I AF + G+V Sbjct: 348 GVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKN 407 Query: 375 ISADQLEDEKTGQPYYVLRTSVSDAVMEKLNGLVIKPGMPAEMFVRTGERSLLNYLFKPL 434 I+ D +ED++ G + V+ + + + + + GM ++TG RS+++YL PL Sbjct: 408 INLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPL 467 Query: 435 LDRAGSALTE 444 + +L E Sbjct: 468 EESVTESLRE 477
>MPTASEINHBTR#Metalloprotease inhibitor signature. Length = 122 Score = 96.6 bits (240), Expect = 2e-29 Identities = 37/100 (37%), Positives = 50/100 (50%), Gaps = 3/100 (3%) Query: 1 MATSLKLPSPAELSGKWRLFAQARPSEACELQLNTDAPQLGGDPACASRWLSDTPTGWFP 60 MA+S +PS A+++G+ + A A L GD ACA +WL D P W P Sbjct: 24 MASSFVVPSTAQMAGQLGIEATGS---GVCAGPAEQANALAGDVACAEQWLGDKPVSWSP 80 Query: 61 TPDGLAFTDKEGSGLIHFNHMGNQLYQARLPGGDLLTLAR 100 TPDG+ + EG+G+ H N Y R P G +TL R Sbjct: 81 TPDGIWLMNAEGTGITHLNRQKEGEYTGRTPSGADVTLQR 120
>CABNDNGRPT#NodO calcium binding signature. Length = 479 Score = 393 bits (1012), Expect = e-135 Identities = 247/476 (51%), Positives = 321/476 (67%), Gaps = 15/476 (3%) Query: 11 SAVQLAATGSSAFNQIDTFVHTYDRGGNLTINGKPSYSVDQAADYILRDDAAWVDRDGNG 70 + L+A SSA+N + F+ +DRG LT+NGK SYS+DQAA I R++ +W + G Sbjct: 12 AQHALSANTSSAYNSVYDFLRYHDRGDGLTVNGKTSYSIDQAAAQITRENVSWNGTNVFG 71 Query: 71 -TINLTYTFLTARPSGFDTSLGTFSAFNAQQKAQAVLSMQSWADVAKVTFTQAASGGDGH 129 + NLT+ FL + S G F FNA+Q QA LS+QSW+DVA +TFT+ + Sbjct: 72 KSANLTFKFLQSVSSIPSGDTG-FVKFNAEQIEQAKLSLQSWSDVANLTFTEVTGNKSAN 130 Query: 130 MTFGNYSDGSSG-----GAAFAYLPSGNSRYDGQSWYLTNNSYTVNLTPDNGNYGRQTLT 184 +TFGNY+ +SG A+AY P G SWY N S N P + YGRQT T Sbjct: 131 ITFGNYTRDASGNLDYGTQAYAYYPGNYQGA-GSSWYNYNQSNIRN--PGSEEYGRQTFT 187 Query: 185 HEIGHSLGLSHPGDYNAGEGNPTYNDVSYAEDTRGYSVMSYWSESNTDQNFVKGGSPTYS 244 HEIGH+LGL+HPG+YNAGEG+P+YND YAED+ +S+MSYW E+ T ++ Y Sbjct: 188 HEIGHALGLAHPGEYNAGEGDPSYNDAVYAEDSYQFSIMSYWGENETGADYNG----HYG 243 Query: 245 SGPLMDDIAAIQQLYGANMSTRAGDTVYGFNSTAGRDFYSATSASSKVVFSVWDGGGKDT 304 P++DDIAAIQ+LYGANM+TR GD+VYGFNS RDFY+AT +S ++FSVWD GG DT Sbjct: 244 GAPMIDDIAAIQRLYGANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWDAGGTDT 303 Query: 305 LDFSGFTQNQKINLNEASFSDVGGMVGNVSIAKGVLVENAVGGSGNDLLVGNAAANDLKG 364 DFSG++ NQ+INLNE SFSDVGG+ GNVSIA GV +ENA+GGSGND+LVGN+A N L+G Sbjct: 304 FDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTIENAIGGSGNDILVGNSADNILQG 363 Query: 365 GAGNDIIYGGGGADSLWGGAGADIFVFGASSDSNRAAQDTIRDFTRGQDKIDVSAISSLT 424 GAGND++YGG GAD+L+GGAG D FV+G+ DS AA D I DF +G DKID+SA + Sbjct: 364 GAGNDVLYGGAGADTLYGGAGRDTFVYGSGQDSTVAAYDWIADFQKGIDKIDLSAFRNEG 423 Query: 425 SLQFVN-AFSGHAGEAILSYTQSTNLGSLAIDFTGQGVADFLVGTVGQAVATDIVV 479 L FV F+G E +L + + ++ +L + G DFLV VGQA +DI+V Sbjct: 424 QLSFVQDQFTGKGQEVMLQWDAANSITNLWLHEAGHSSVDFLVRIVGQAAQSDIIV 479
>PF05043#Transcriptional activator Length = 493 Score = 28.0 bits (62), Expect = 0.048 Identities = 20/112 (17%), Positives = 44/112 (39%), Gaps = 8/112 (7%) Query: 5 NALRKLDMQDLMIFVSVFEQR---NLTLVSEALNVSQSTVSYCLKKLRANFEDDLFISTR 61 + L K + L + +FE + + + ++E LN ++ V L +++ F D +F S+ Sbjct: 3 DLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHSST 62 Query: 62 NGMRPTRKAMAMHGHVQQILHKVNICHDGLKL--FDPSSGQTTFTVCAPEYF 111 NG+R ++ + H + F + E++ Sbjct: 63 NGIRIIN---TDDSDIEMVYHHFFKHSTHFSILEFIFFNEGCQAESICKEFY 111
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 34.0 bits (78), Expect = 0.001 Identities = 39/220 (17%), Positives = 79/220 (35%), Gaps = 9/220 (4%) Query: 171 AIWAALALLALCFWVVQRHAFQSSGSSAAPRKQ------AFSQMPRAWMLGVFFGLGTAS 224 A L L CF + + H + A A ++ VFF + Sbjct: 167 AALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVG 226 Query: 225 YTCALAWLAPYYLENGWSEQDAGLLL-GFMTLMEVVSGLVTPALANRSRDKRLVLAVLLG 283 A W+ W G+ L F L + ++T +A R ++R ++ ++ Sbjct: 227 QVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMI- 285 Query: 284 LIMAGFVGLILMPQQLSLLWTGLLGLGIGGLFPMSLIVSMDHYDDPQQAGSLTAFVQGVG 343 G++ L+ + + + ++ L GG+ +L + D ++ G L + + Sbjct: 286 ADGTGYI-LLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALT 344 Query: 344 YLIAGLSPLLAGVIRDVTGSFAGAWWSLIGLVAVMLLMVV 383 L + + PLL I + + W + G +L + Sbjct: 345 SLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPA 384
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 70.4 bits (172), Expect = 3e-17 Identities = 31/178 (17%), Positives = 60/178 (33%), Gaps = 17/178 (9%) Query: 1 MTAPMRL-TDQKREAIVLAAIAEFGDRGFEVTSMDRIAARAEVSKRTVYNHFPSKEELFA 59 M + + R+ I+ A+ F +G TS+ IA A V++ +Y HF K +LF+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 60 EILQRL---WNCSPPQSDVVYHADVGLREQLRDLLTGKMRTLNDSSFLDLARVVVGATIH 116 EI + + + D LR++L L + + R+++ H Sbjct: 61 EIWELSESNIGELELEYQAKFPGD--PLSVLREILI---HVLESTVTEERRRLLMEIIFH 115 Query: 117 SPERAQVWLARINEREETFSAW-------IRAAQKDGRLKP-VDPGFAATQVHALLKS 166 E + ++ + L + AA + + Sbjct: 116 KCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 56.3 bits (136), Expect = 2e-10 Identities = 28/100 (28%), Positives = 43/100 (43%), Gaps = 17/100 (17%) Query: 4 DLLIRDAFVIDGSGATGYRADVAIHDGRILRIGAL--PD---------ASAIEEIDAHGL 52 D +I +A ++D G +AD+ + DGRI IG PD E I G Sbjct: 69 DTVITNALILDHWGI--VKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGK 126 Query: 53 VLAPGFIDVHTHDDTVVIRKPQMLPKISQGVTTVIVGNCG 92 ++ G +D H H I Q+ + G+T ++ G G Sbjct: 127 IVTAGGMDSHIH----FICPQQIEEALMSGLTCMLGGGTG 162
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.7 bits (66), Expect = 0.024 Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 3/76 (3%) Query: 269 AEALEPFRSAQPARPEGDPQPLPYLLDQTDQ-QGGALDIELEVLLLTEKMKAAGAQPHRL 327 AEAL + + + P + + + + +Q + + L LL E AA + Sbjct: 731 AEALHLYLAGERYFPSPEDEEIYFRPEQELRLVETGVQGRLWALLTREGAPAAEGAAQKG 790 Query: 328 AVSNSLNMYWTVAQMV 343 N+ + T+A +V Sbjct: 791 YSVNTT--FVTIADLV 804
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.9 bits (197), Expect = 2e-17 Identities = 38/149 (25%), Positives = 65/149 (43%), Gaps = 2/149 (1%) Query: 927 ILVVDDHIEHRKVISGMLAPLGFDVAQAANGQEAIRQVSLLHPDLILMDLSMPDMDGWAA 986 ILV DD R V++ L+ G+DV +N R ++ DL++ D+ MPD + + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 987 SRLIRRNALSQAPIIVLSANASGFADDKERNLQVCNDYLPKPVHLQRLLDRLQHHLQLTW 1046 I++ A P++V+SA + K DYLPKP L L+ + L Sbjct: 66 LPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAY-DYLPKPFDLTELIGIIGRALAEPK 123 Query: 1047 LRRAHNAPTPAPSPRVLPSRMDLEELYEL 1075 R + ++ ++E+Y + Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYRV 152
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 91.4 bits (227), Expect = 1e-23 Identities = 40/138 (28%), Positives = 61/138 (44%), Gaps = 6/138 (4%) Query: 4 MTRAAENGIILIVDDVPDNLALLSDALDEAGYMVLVALDGHSALTRIQRRRPDLILLDAM 63 MT A IL+ DD +L+ AL AGY V + + + I DL++ D + Sbjct: 1 MTGAT----ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVV 56 Query: 64 MPGMNGFETCRQIKAQPDTANIPVLFMTALTDSEHVVQGFEAGAIDYVTKPIQCTEVLAR 123 MP N F+ +IK ++PVL M+A ++ E GA DY+ KP TE++ Sbjct: 57 MPDENAFDLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114 Query: 124 VASHLRTARILQSARNAS 141 + L + S Sbjct: 115 IGRALAEPKRRPSKLEDD 132
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 29.3 bits (65), Expect = 0.042 Identities = 41/231 (17%), Positives = 82/231 (35%), Gaps = 19/231 (8%) Query: 242 EAGRLLKALAQMQANLRTTIMQISDSSNQLASASEEMTAVTEESSRGLVAQNDEVNQAAT 301 EA + K + + +A + +LA+ SEE AV + AQ++ V Sbjct: 152 EAEQRRKEIEREKAETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGE 211 Query: 302 AVTEMSAAVDEV-ARNAESASEESKRTQGYTEEGSARVAQTLKSIQKLNGNVEN------ 354 T S + AR+AE + KR + + SA+ + + ++KL+ + Sbjct: 212 IKTLNSRLSSSIHARDAEMKTLAGKRNE--LAQASAKYKELDELVKKLSPRANDPLQNRP 269 Query: 355 ----TSEQIQGLSNRAQ---SISKVVEVIRAIAEQTNLL--ALNAAIEAARAGEQGRGFA 405 T ++ R + ++ I I + A++ AG A Sbjct: 270 FFEATRRRVGAGKIREEKQKQVTASETRINRINADITQIQKAISQVSNNRNAGIARVHEA 329 Query: 406 VVADEVRALAHRTQVSTQEIEQMIAAIQTDSD-LAVKAMNTSKDLATESLG 455 + ++ ++ QT ++ K +++LA +S G Sbjct: 330 EENLKKAQNNLLNSQIKDAVDATVSFYQTLTEKYGEKYSKMAQELADKSKG 380
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 71.1 bits (174), Expect = 2e-15 Identities = 75/381 (19%), Positives = 132/381 (34%), Gaps = 42/381 (11%) Query: 16 KVIALLAGLSALSILSTNIILPAFPEMAAQLGVSSRELGLTFSSFFITFALAQLVVGPLA 75 +++ L LS S+L+ ++ + P++A ++F +TF++ V G L+ Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73 Query: 76 DRYGRKKLVLGGLSVFVIGTAVCGFAQS-FEILIVGRVIQALGICAAAVLARAIARDLFQ 134 D+ G K+L+L G+ + G+ + S F +LI+ R IQ G A L + Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133 Query: 135 GEALARAMSLIMVATAAAPGFSPLLGSVLTTALGWRAIFVIVAI---------------- 178 E +A LI A G P +G ++ + W + +I I Sbjct: 134 KENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEV 193 Query: 179 ----------AALSVALIYSRTLGETLPASSRVSRSVPEVFVAYGQLMR-DRRFILPGLS 227 L I L T + S + SV + + + F+ PGL Sbjct: 194 RIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLG 253 Query: 228 VSL-LMSGLFASFGA----------APAILMIGIGLTSLEAG--FYFAATVFVVFTAGIA 274 ++ M G+ P ++ L++ E G F T+ V+ I Sbjct: 254 KNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIG 313 Query: 275 APRLAHRFGIRNVTATGFAIALFGGLLLLLGPVNPSLGTYTLSMVIFLWGMGLANPLGTA 334 L R G V G L S + + + + T Sbjct: 314 G-ILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTI 372 Query: 335 ITMGPYGAQAGLASALLGFLT 355 ++ +AG +LL F + Sbjct: 373 VSSSLKQQEAGAGMSLLNFTS 393
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 452 bits (1165), Expect = e-144 Identities = 232/1050 (22%), Positives = 436/1050 (41%), Gaps = 61/1050 (5%) Query: 8 LSALAVRERSITLFLIFLIGVAGTLSFFKLGRAEDPPFTVKQLTIISAWPGATAQEMQDQ 67 ++ +R L ++ +AG L+ +L A+ P +++ + +PGA AQ +QD Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 68 VAEPLEKRMQELK--WYDRSETYTRAGLAFTMVSLQDKTPPSQVQEEFYQARKKVGDAAK 125 V + +E+ M + Y S + AG ++ Q T P Q Q + K+ A Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTS-DSAGSVTITLTFQSGTDPDIAQV---QVQNKLQLATP 116 Query: 126 TLPAGVIGPMVNDEFSDVTFAL---FALKAKGEPQRLLVRDAEA-LRQRLLHVPGVKKIN 181 LP V ++ E S ++ + F G Q + + ++ L + GV + Sbjct: 117 LLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQ 176 Query: 182 IVG-EKAERIFVSFSHERLATLGVSPQDIFAALNTQNVLTPAGSIETDGP------QVFL 234 + G + A RI++ + L ++P D+ L QN AG + + Sbjct: 177 LFGAQYAMRIWLDA--DLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASI 234 Query: 235 RLDGAFDKLEKIRNTPIAVQ--GRTLKLTDVATVERGYEDPATFMVRSQGEPALLLGVVM 292 F E+ + V G ++L DVA VE G E+ R G+PA LG+ + Sbjct: 235 IAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVI-ARINGKPAAGLGIKL 293 Query: 293 RDGWNGLDLGKALDAETASINAAMPLGMTLSKVTDQSVNIASSVDEFMIKFFVALLVVML 352 G N LD KA+ A+ A + P GM + D + + S+ E + F A+++V L Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353 Query: 353 VCFLSMG-WRVGVVVAAAVPLTLAIVFVVMEATGKNFDRITLGSLILALGLLVDDAIIAI 411 V +L + R ++ AVP+ L F ++ A G + + +T+ ++LA+GLLVDDAI+ + Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413 Query: 412 EMMV-VKMEEGYDRIKASAYAWSHTAAPMLAGTLVTAVGFMPNGFAQSTAGEYTSNMFWI 470 E + V ME+ +A+ + S ++ +V + F+P F + G Sbjct: 414 ENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSIT 473 Query: 471 VGIALIASWVVAVVFTPYLGVKLLPDIKPVEGGHAA--------IYDTPHYNRFRRILAR 522 + A+ S +VA++ TP L LL + + +D N + + + Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFD-HSVNHYTNSVGK 532 Query: 523 VIARKWLVAIVVIVTFVVAVLGMGLVKKQFFPTSDRPEVLIEVQMPYGTSNEQTSATTAK 582 ++ ++ + V+ + F P D+ L +Q+P G + E+T + Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592 Query: 583 VEAWLHKQDAAKIVTAYIGQGSPRFYLAMAPELPDPSFAKIVV-----LTDSQESRETLK 637 V + K + A + + + G + + + + A + + + S E + Sbjct: 593 VTDYYLKNEKANVESVFTVNG-----FSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647 Query: 638 HSIREAVAQ-----GLAPEARVRVTQLVFGPYSPFPVAYRVAGPDPDKLREIAQQVQTVM 692 H + + + + V + + AG D L + Q+ + Sbjct: 648 HRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELID--QAGLGHDALTQARNQLLGMA 705 Query: 693 QDSP-MMRTVNTDWGSRVPTLHFSLNQDRLQAVGLTSSAVAQQLQFLLSGVPITSVREDI 751 P + +V + ++Q++ QA+G++ S + Q + L G + + Sbjct: 706 AQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRG 765 Query: 752 RSVEVMGRAAGDIRLDPAKIAGFTLVGSGGQRIPLSQIGEVGVRMEDPILRRRDRLPTIT 811 R ++ +A R+ P + + + G+ +P S P L R + LP++ Sbjct: 766 RVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSME 825 Query: 812 VRGDIAEHLQPPDVSSKIIKELQPIIDNLPAGYRIDQAGSIEESAKATVALLPLFPIMIA 871 ++G+ A P S + ++ + LPAG D G + + L I Sbjct: 826 IQGEAA----PGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFV 881 Query: 872 VTLLIIILQVRSMSAMVMVFMTAPLGLIGVVPTLLLFNQPFGINALVGLIALSGILMRNT 931 V L + S S V V + PLG++GV+ LFNQ + +VGL+ G+ +N Sbjct: 882 VVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNA 941 Query: 932 LILIGQIDQ-NEKDGLDPFHAVVEATVQRARPVLLTALAAILAFIPLTHSVFWGT----- 985 ++++ EK+G A + A R RP+L+T+LA IL +PL S G+ Sbjct: 942 ILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNA 1001 Query: 986 LAYTLIGGTLGGTVMTLVFLPAMYSIWYKI 1015 + ++GG + T++ + F+P + + + Sbjct: 1002 VGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031 Score = 79.5 bits (196), Expect = 5e-17 Identities = 83/523 (15%), Positives = 188/523 (35%), Gaps = 44/523 (8%) Query: 524 IARKWLVAIVVIVTFVVAVLGMGLVKKQFFPTSDRPEVLIEVQMPYGTSNEQTSATTAKV 583 I R ++ I+ + L + + +PT P V + P + T + Sbjct: 6 IRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVI 65 Query: 584 EAWLHKQDAAKIVTAY-IGQGSPRFYLAMAPELPDPSFAKIVVLTDSQESRETLKHSIRE 642 E ++ D +++ GS L DP A++ V Q + L Sbjct: 66 EQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGT-DPDIAQVQVQNKLQLATPLL------ 118 Query: 643 AVAQGLAPEARVRVTQLVFGPYSPFPVAYRVAGPDPD-KLREIAQQVQTVMQDSPMMRTV 701 P+ + V S + + +P +I+ V + ++ + + Sbjct: 119 -------PQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVK--DTLSRL 169 Query: 702 N-----TDWGSRVPTLHFSLNQDRLQAVGLT----SSAVAQQLQFLLSGVPITSVREDIR 752 N +G++ + L+ D L LT + + Q + +G + + Sbjct: 170 NGVGDVQLFGAQY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQ 228 Query: 753 SVEVMGRAAGDIRLDPAKIAGFTLVGSG-GQRIPLSQIGEVGVRMED-PILRRRDRLPTI 810 + A + +P + TL + G + L + V + E+ ++ R + P Sbjct: 229 QLNASIIAQTRFK-NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAA 287 Query: 811 TVRGDIAEHLQPPDVSSKIIKELQPIIDNLPAGYRI----DQAGSIEESAKATVALLPLF 866 + +A D + I +L + P G ++ D ++ S V L Sbjct: 288 GLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLF-- 345 Query: 867 PIMIAVTLLIIILQVRSMSAMVMVFMTAPLGLIGVVPTLLLFNQPFGINALVGLIALSGI 926 I + L++ L +++M A ++ + P+ L+G L F + G++ G+ Sbjct: 346 -EAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGL 404 Query: 927 LMRNTLILIGQIDQ-NEKDGLDPFHAVVEATVQRARPVLLTALAAILAFIPL-----THS 980 L+ + ++++ +++ +D L P A ++ Q ++ A+ FIP+ + Sbjct: 405 LVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 981 VFWGTLAYTLIGGTLGGTVMTLVFLPAMYSIWYKIRPDQEPQA 1023 + + T++ ++ L+ PA+ + K + + Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHEN 507
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 43.7 bits (103), Expect = 8e-07 Identities = 16/94 (17%), Positives = 39/94 (41%), Gaps = 9/94 (9%) Query: 68 VSGKVLERLVDTGQTVKRGQPLMRLDPVDLGLQAQAQQQAVAAAVARAKQTADDEARNRD 127 + V E +V G++V++G L++L L A+A +++ +A+ R Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTA----LGAEADTLKTQSSLLQARLEQ-----TRY 153 Query: 128 LVAAGAISASAYDRIKSLADTAKADLSAAQAQAA 161 + + +I + +K + ++S + Sbjct: 154 QILSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187 Score = 33.3 bits (76), Expect = 0.001 Identities = 16/83 (19%), Positives = 30/83 (36%), Gaps = 2/83 (2%) Query: 178 GVVVDTLAEPGQVVSAGQPVVRLAKSGQREAIVHLPETLRPAVGSAAQARMYGNNAEVVP 237 +V + + + G+ V G +++L G + +L A Q R + + Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQA--RLEQTRYQILSRSIEL 162 Query: 238 AKLRLLSDSADPLTRTFEARYVL 260 KL L +P + VL Sbjct: 163 NKLPELKLPDEPYFQNVSEEEVL 185 Score = 32.5 bits (74), Expect = 0.003 Identities = 11/120 (9%), Positives = 36/120 (30%), Gaps = 7/120 (5%) Query: 99 LQAQAQQQAVAAAVARAKQTADDEARNRDLVAAGAISASAYDRIKSLADTAKADLSAAQA 158 ++A + + + + + + A+ + D+++ ++ Sbjct: 262 VEAVNELRVYKSQLEQIESEIL-SAKEEYQLVTQLFKNEILDKLR----QTTDNIGLLTL 316 Query: 159 QAAVARNATGYAVLLADADGVVVD-TLAEPGQVVSAGQPVVRLAKSGQR-EAIVHLPETL 216 + A +V+ A V + G VV+ + ++ + E + Sbjct: 317 ELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKD 376 Score = 29.0 bits (65), Expect = 0.037 Identities = 14/38 (36%), Positives = 18/38 (47%), Gaps = 1/38 (2%) Query: 68 VSGKVLERLVDT-GQTVKRGQPLMRLDPVDLGLQAQAQ 104 VS KV + V T G V + LM + P D L+ A Sbjct: 334 VSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTAL 371
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 59.6 bits (144), Expect = 3e-13 Identities = 21/158 (13%), Positives = 47/158 (29%), Gaps = 12/158 (7%) Query: 19 RDQVVEAATEHFGHYGFEKTTVSDLAKAIGFSKAYIYKFFDSKQAIGEVICSNRLAMIMT 78 R +++ A F G T++ ++AKA G ++ IY F K + I + I Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72 Query: 79 IVDAAIADAPTASEKLRRLFRAVVEAGSDLFFHDRKLHDIAAVATR-----DKWPSALAH 133 + A P + R ++ + + + + + + Sbjct: 73 LELEYQAKFP---GDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA 129 Query: 134 DA----RLRELIQQIVLEGRESGEFERKTPLDETVHAI 167 + I+Q + E+ + Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 165 bits (418), Expect = 5e-47 Identities = 75/319 (23%), Positives = 114/319 (35%), Gaps = 48/319 (15%) Query: 58 WGLGRIQAEQAYATGITGAGVKIGALDSGFDPSHPEASPSRFHAVTASGTYVDGSPFSVT 117 G+ IQA + G GVK+ LD+G D HP+ + G F+ Sbjct: 24 RGVEMIQAPAVWNQT-RGRGVKVAVLDTGCDADHPDLK----------ARIIGGRNFTDD 72 Query: 118 GAINPN----NDTHGTHVTGTMGAARDGVGVHGVAYNAQVYVGNTNKNDSFLFGPSPDPL 173 +P + HGTHV GT+ A + GV GVA A + + G Sbjct: 73 DEGDPEIFKDYNGHGTHVAGTIAATENENGVVGVAPEADLLIIKVLNKQ----GSGQYDW 128 Query: 174 YFRAVYGALADAGVRVINNSWGSQPSDVTYATYDGMRAAYAQHYNRGTWLDEAANVSRKG 233 + +Y A + V +I+ S G P DV + A + Sbjct: 129 IIQGIYYA-IEQKVDIISMSLGG-PEDV-----PELHEAVKKAVA-------------SQ 168 Query: 234 VINVFSAGNSGYANASVRASLPYFQPDLEGHWLAVSGLDDTNGQRYNQCGISKYWCITTP 293 ++ + +AGN G + P ++V ++ + + + P Sbjct: 169 ILVMCAAGNEGDGDDRT---DELGYPGCYNEVISVGAINF-DRHASEFSNSNNEVDLVAP 224 Query: 294 GRLINGTVPGGGYGIKSGTSMSAPHATGALALVMERFPY-----MNNEQALQVLLTTATQ 348 G I TVPGG Y SGTSM+ PH GALAL+ + + + L+ Sbjct: 225 GEDILSTVPGGKYATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIP 284 Query: 349 LDGSVTQAPNGNVGWGAAN 367 L S NG + A Sbjct: 285 LGNSPKMEGNGLLYLTAVE 303
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 153 bits (388), Expect = 4e-43 Identities = 70/373 (18%), Positives = 114/373 (30%), Gaps = 99/373 (26%) Query: 63 NADWGLGAINADQAYAAGYTGKDIKLGIFDQPVYAAHPEFSGTGKVINLVTSGIREYTDP 122 G+ I A + G+ +K+ + D A HP+ Sbjct: 21 EIPRGVEMIQAPAVWNQT-RGRGVKVAVLDTGCDADHPDLKAR----------------- 62 Query: 123 YIPVKAGDAFRYDGAPTLDSGGKLGNHGTHVGGIAGGSRDGGPMHGVAFNAQIISA---D 179 + G F D + HGTHV G + + + GVA A ++ + Sbjct: 63 ---IIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAATENENGVVGVAPEADLLIIKVLN 119 Query: 180 NGDPGPEDGIVLGNDGAVYQAGWNALVASGARVINNSWGIGITDRFDQGGKDPAFPHFTV 239 G D I+ G + +I+ S G G +D H Sbjct: 120 KQGSGQYDWII---------QGIYYAIEQKVDIISMSLG---------GPEDVPELH--- 158 Query: 240 QDAQLQFDQIRQILGTRPGGAYQGAIDAARSGVVTIFAAGNDYNLNNPDAMAGLGYFVPE 299 + A S ++ + AAGN+ P Sbjct: 159 ----------------------EAVKKAVASQILVMCAAGNE----GDGDDRTDELGYPG 192 Query: 300 IAPNWLTVAALQVNPNAAAAVSTPYTLSTFSSRCGYTASFCVSAPGTRIFSSVINGNSLE 359 ++V A+ + S FS+ + APG I S+V G Sbjct: 193 CYNEVISVGAINFD----------RHASEFSNSNNEV---DLVAPGEDILSTVPGGKY-- 237 Query: 360 NLTTDWANKNGTSMAAPHVAGSMAVLMERFPY-----MTGAQVADVLKTTATDLGAPGID 414 A +GTSMA PHVAG++A++ + +T ++ L LG Sbjct: 238 ------ATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLG--NSP 289 Query: 415 ALYGWGMINLGKA 427 + G G++ L Sbjct: 290 KMEGNGLLYLTAV 302
>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature. Length = 170 Score = 30.3 bits (68), Expect = 0.004 Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 1/56 (1%) Query: 154 ALDFCVKTTALQLARAGFVVVLYVPACRGISEEGSLAALSEMAQAGIL-IANNPQE 208 L F + T A Q+ + V L + + G +A + E+ QAG+ I+ + Sbjct: 48 KLSFDLSTEAKQVGDDLYEVCLNISVETTMESSGDVAFICEVKQAGVFTISGLEEM 103
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 33.3 bits (76), Expect = 2e-04 Identities = 27/150 (18%), Positives = 49/150 (32%), Gaps = 23/150 (15%) Query: 4 IAVYGGAFNPPHAGHANVMIHASRQARLTMVVPSYQHPYGKVMVDYDLRLQWLRLITDNV 63 A+Y G+F+P GH ++ I + + V ++P + M RL+ + ++ Sbjct: 2 NAIYPGSFDPITFGHLDI-IERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHL 60 Query: 64 RNQCGGELSVSDIERELFSQSPGPVYSFNLLTCLANTTGCAPKSIALVVGQDVADALPGF 123 N + + F L A L V D L Sbjct: 61 PN----------AQVDSFE---------GLTVNYARQRQAGAILRGLRVLSDFELELQMA 101 Query: 124 YLGPEL---LETFSVIIAPEQIGVRSTALR 150 L LET + + E + S+ ++ Sbjct: 102 NTNKTLASDLETVFLTTSTEYSFLSSSLVK 131
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 61.6 bits (149), Expect = 2e-13 Identities = 62/264 (23%), Positives = 98/264 (37%), Gaps = 27/264 (10%) Query: 4 LAGKRVLIVGVASKLSIASGIAAAMHREGAELAFTYQNDKLKGRVEEFAAGWGSGPELCF 63 + GK I G A I +A + +GA +A N + +V E F Sbjct: 6 IEGKIAFITGAAQ--GIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE-AF 62 Query: 64 PCDVASDEEINKVFEELSKKWDGLDVIVHSVGF---APGDQLDGDFTEATTREGFRIAHD 120 P DV I+++ + ++ +D++V+ G L + EAT F + Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEAT----FSVN-- 116 Query: 121 ISAYSFVALAKAGREMMKGRNGSLLTLSYLGAERTMPNYNVMGMAKASLEAGVRYLAGSL 180 S F A + MM R+GS++T+ A + +KA+ + L L Sbjct: 117 -STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL 175 Query: 181 GPEGTRVNAVSAGPIRTL-----------AASGIKNFRKMLAANEAQTPLRRNVTIDEVG 229 R N VS G T A IK L + PL++ ++ Sbjct: 176 AEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGS---LETFKTGIPLKKLAKPSDIA 232 Query: 230 NAGAFLCSDLASGISGEIMYVDGG 253 +A FL S A I+ + VDGG Sbjct: 233 DAVLFLVSGQAGHITMHNLCVDGG 256
>SECA#SecA protein signature. Length = 901 Score = 31.8 bits (72), Expect = 0.011 Identities = 18/49 (36%), Positives = 24/49 (48%), Gaps = 6/49 (12%) Query: 269 RRAAHILIEVN------DKLSDEQAKAKIEEIQQRLAKGEDFAALAKEF 311 RR ++ +N +KLSDE+ K K E + RL KGE L E Sbjct: 19 RRMRKVVNIINAMEPEMEKLSDEELKGKTAEFRARLEKGEVLENLIPEA 67
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 116 bits (293), Expect = 4e-38 Identities = 44/88 (50%), Positives = 61/88 (69%) Query: 2 NKSELIDAIAASADIPKAAAGRALDAVIESVTGALKAGDSVVLVGFGTFSVTDRPARTGR 61 NK +LI +A + ++ K + A+DAV +V+ L G+ V L+GFG F V +R AR GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKTLEIAAAKKPGFKAGKALKEAV 89 NPQTG+ ++I A+K P FKAGKALK+AV Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.035 Identities = 13/81 (16%), Positives = 29/81 (35%), Gaps = 6/81 (7%) Query: 292 DWLVQVPWKAQSKVRLDLARAEAILDADHYGLDEVKERILEYLAVQKRVKKIRGP----- 346 DW+ W ++ L D+ +++ + V ++ P Sbjct: 537 DWVKAQQWDEVPRLEKWLVHVLGKTPDDYKPRRLRYLQLVGKYILMGHVARVMEPGCKFD 596 Query: 347 -VLCLVGPPGVGKTSLAESIA 366 + L G G+GK++L ++ Sbjct: 597 YSVVLEGTGGIGKSTLINTLV 617
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.8 bits (72), Expect = 0.011 Identities = 15/82 (18%), Positives = 32/82 (39%), Gaps = 16/82 (19%) Query: 2 LLLWIVVLVVGIAWL------AHRRTDPLPALGVV--AVYLLAMGIFSHAPGWLLTIFWI 53 LLL ++ ++ + +L R G++ +V ++ +F+ + I + Sbjct: 171 LLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSV 230 Query: 54 LWLAIFI--------PMILPDL 67 L IF+ P + P L Sbjct: 231 LSFLIFVKHIRKVTDPFVDPGL 252
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.011 Identities = 9/44 (20%), Positives = 16/44 (36%), Gaps = 6/44 (13%) Query: 79 RDLLW------FFAGDCLHFMPDDEIDLYQALEERRYEAEQNDE 116 R L+ + AG+ P+DE ++ +E R Sbjct: 726 RGQLFAEALHLYLAGERYFPSPEDEEIYFRPEQELRLVETGVQG 769
>PF06917#Periplasmic pectate lyase Length = 555 Score = 28.0 bits (62), Expect = 0.031 Identities = 15/37 (40%), Positives = 22/37 (59%), Gaps = 2/37 (5%) Query: 150 PEFSDIAQDANLM--DDMIVQIPEALTALYLLCQAPD 184 PEF +IA++AN++ D + I L L +L Q PD Sbjct: 297 PEFGEIAREANVLFRDMRPLLIDNPLAMLDILRQQPD 333
>CABNDNGRPT#NodO calcium binding signature. Length = 479 Score = 71.5 bits (175), Expect = 7e-15 Identities = 32/80 (40%), Positives = 44/80 (55%), Gaps = 4/80 (5%) Query: 923 LLGADGDDVLLAAGGHDCLNGGNGNDVLIGGPGDDVLTGGEGQDRFMWLAGD----TGHD 978 +G G+D+L+ + L GG GNDVL GG G D L GG G+D F++ +G +D Sbjct: 343 AIGGSGNDILVGNSADNILQGGAGNDVLYGGAGADTLYGGAGRDTFVYGSGQDSTVAAYD 402 Query: 979 RVTDFNVGIDSLDLSHLLQG 998 + DF GID +DLS Sbjct: 403 WIADFQKGIDKIDLSAFRNE 422
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 54.2 bits (130), Expect = 5e-11 Identities = 19/80 (23%), Positives = 37/80 (46%) Query: 25 KASREGSEQRRQVILDAAMRIVVRDGVRAVRHRAVAAEAGVPLSATTYYFKDIDDLLTDA 84 + +++ +++ RQ ILD A+R+ + GV + +A AGV A ++FKD DL ++ Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62 Query: 85 FAQYVQRSADYLARLWQNTE 104 + + Sbjct: 63 WELSESNIGELELEYQAKFP 82
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 64.1 bits (156), Expect = 2e-13 Identities = 36/161 (22%), Positives = 66/161 (40%), Gaps = 13/161 (8%) Query: 19 VLLVDDQAMIGEAVRRGLANESSIDFHFCADPHQAISQAVQIKPTVILQDLVMPGLDGLT 78 +L+ DD A I + + L+ D ++ +++ D+VMP + Sbjct: 6 ILVADDDAAIRTVLNQALSRAG-YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 79 LVREYRSNPLTRDIPIIVLSTKEDPLIKSAAFAAGANDYLVK---LPDNIELVARILYHS 135 L+ + D+P++V+S + + A GA DYL K L + I ++ R L Sbjct: 65 LLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122 Query: 136 RSYLTLLQRDEAYRALRVSQ----QQLLDTNLVLQRLMNSD 172 + + L+ D V + Q++ L RLM +D Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRV---LARLMQTD 160
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 51.4 bits (123), Expect = 2e-09 Identities = 32/184 (17%), Positives = 62/184 (33%), Gaps = 22/184 (11%) Query: 2 KIAIVNDMPMAVEALRRALAFEPLHQVIWVAGNGAEAVRCCAEQTPDLILMDLIMPVMDG 61 I + +D L +AL+ + N A R A DL++ D++MP + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVR--ITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 62 VEATRQIMASTPCAIVIVTVDREQNVHRVFEAMGHGAMDVVDTPAIGAGNPKEAAAPLLR 121 + +I + P V+V + +A GA D + P L Sbjct: 63 FDLLPRIKKARPDLPVLVMSA-QNTFMTAIKASEKGAYDYLPKPFD-----------LTE 110 Query: 122 KILNIEWLMGQRNTHERTVATPLRESVRRDRLVAIGSSAGGPAALEILLKALPSNFPAAV 181 I I + + E +D + +G SA +L + + ++ Sbjct: 111 LIGIIGRALAEPKRRPSK-----LEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLT--- 162 Query: 182 VLVQ 185 +++ Sbjct: 163 LMIT 166
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 74.5 bits (183), Expect = 5e-16 Identities = 30/126 (23%), Positives = 60/126 (47%), Gaps = 3/126 (2%) Query: 661 SRKRVLVVDDSLTVRELERKLLVGRGYEVSVAVDGMDGWNALRSEDFDLLITDIDMPRMD 720 + +LV DD +R + + L GY+V + + W + + D DL++TD+ MP + Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 721 GIELVTLLRRDTRLQSLPVMVVSYKDREEDRRRGLDAGADYYLAKASFHDDALLDAVVEL 780 +L+ +++ LPV+V+S ++ + + GA YL K F L+ + Sbjct: 62 AFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK-PFDLTELIGIIGRA 118 Query: 781 IGDAQA 786 + + + Sbjct: 119 LAEPKR 124
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 31.1 bits (70), Expect = 0.006 Identities = 15/90 (16%), Positives = 27/90 (30%), Gaps = 2/90 (2%) Query: 264 PRETAATATAPAPVNKPIARPTPEPTPRTPAAQPASRNATAFAPGNKPAAATGNADSAAL 323 E APV +P P+P P+ + + A + Sbjct: 79 EPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPT 138 Query: 324 LVTIASLANEGRTVEARAACERYLQQHEPV 353 T + ++ T A R L +++P Sbjct: 139 SSTATAATSKPVTSVASGP--RALSRNQPQ 166
>PF07132#Harpin protein (HrpN) Length = 356 Score = 27.7 bits (61), Expect = 0.045 Identities = 21/74 (28%), Positives = 29/74 (39%) Query: 29 GKPSSGSDSLMDGLGSLLGGNKSGGQSSQGGLGGLLSGAGGGALAAGAMSLLRGKGSRGM 88 G+ S+ ++ L D + +++ G GGLGGL S GG L G GS Sbjct: 42 GQRSNIAEQLSDIMTTMMFMGSMMGGGLGGGLGGLGSSLGGLGGGLLGGGLGGGLGSSLG 101 Query: 89 GGKALKYGGLAALG 102 G GG Sbjct: 102 SGLGSALGGGLGGA 115
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 72.6 bits (178), Expect = 2e-15 Identities = 35/125 (28%), Positives = 54/125 (43%), Gaps = 4/125 (3%) Query: 575 TVMVVDDEPTVRLLITEVLEDLGYLVLQADRGSAALEILQSKAAIDLLVTDVGLPGGMNG 634 T++V DD+ +R ++ + L GY V + + + DL+VTDV +P N Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAA-GDGDLVVTDVVMP-DENA 62 Query: 635 RQVADAARAVRPDLKILFVTGYAENAALAHDTLEPGMY-VLPKPFSIAALTGRVTELLDS 693 + + RPDL +L ++ A E G Y LPKPF + L G + L Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQN-TFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 694 ANERL 698 R Sbjct: 122 PKRRP 126
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 744 bits (1923), Expect = 0.0 Identities = 296/867 (34%), Positives = 443/867 (51%), Gaps = 51/867 (5%) Query: 32 RRSRVCISLVLSCSCTAFAAGPDGPAITTPVKFNTAFIQGSEQPP-DLKEFLRANSVLPG 90 R+ R+ V AFAA P + + FN F+ Q DL F + PG Sbjct: 19 RKHRLAGFFVRLFVACAFAAQ--APLSSAELYFNPRFLADDPQAVADLSRFENGQELPPG 76 Query: 91 IYRVDIYVNRTLSGRRDVAFSKNRRSGQIEPCLTLEMLQGFGLDPARLP-ATGEPDEACF 149 YRVDIY+N RDV F+ I PCLT L GL+ A + D+AC Sbjct: 77 TYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACV 136 Query: 150 DLPAQVEFARVDYHPGALRLNISVPQAVMARSARGYVSPQLWDEGEPAAFVNYNANVVRR 209 L + + A G RLN+++PQA M+ ARGY+ P+LWD G A +NYN + Sbjct: 137 PLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSV 196 Query: 210 RNQN-LDSDQYYMGLRNGVNLGAWRLRNESSLLY-----GADRSWRYRGNRTFAQRDITA 263 +N+ +S Y+ L++G+N+GAWRLR+ ++ Y + +++ T+ +RDI Sbjct: 197 QNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIP 256 Query: 264 LKSQLTLGETFSDSQVFDSVRFRGASIASDDGMLPDSERNYAPVIRGTAETNATVEVRQN 323 L+S+LTLG+ ++ +FD + FRGA +ASDD MLPDS+R +APVI G A A V ++QN Sbjct: 257 LRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQN 316 Query: 324 GFLLYSGNVSPGPFEITDIYPSGSNGDLEVTIIEADGRRRSFTQAYASLPIMVPAGALRF 383 G+ +Y+ V PGPF I DIY +G++GDL+VTI EADG + FT Y+S+P++ G R+ Sbjct: 317 GYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRY 376 Query: 384 SLAAGQIDNDG--QDSPAFTSAALIYGLSERMTGFGGLQLAEDYQATNIGTGVNTG-IGA 440 S+ AG+ + Q+ P F + L++GL T +GG QLA+ Y+A N G G N G +GA Sbjct: 377 SITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGA 436 Query: 441 VSLDITHSVSQQKPQ-TLAGQSLRVRYANTLDVTDTTLAIAGYRYSTEQYRTLNQHVSET 499 +S+D+T + S GQS+R Y +L+ + T + + GYRYST Y Sbjct: 437 LSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSR 496 Query: 500 GDPVNGLP-----------------GGQPRDRLELNVTQVLPAQNASLSLTASEQRYWNL 542 + N R +L+L VTQ L + ++L L+ S Q YW Sbjct: 497 MNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLG-RTSTLYLSGSHQTYWGT 555 Query: 543 PGKTRQLYLSYNAAWRSLNYSLSVERNQDFGRSGDATPDTRIAFSVTLPLG--------T 594 Q N A+ +N++LS ++ + G D +A +V +P + Sbjct: 556 SNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGR---DQMLALNVNIPFSHWLRSDSKS 612 Query: 595 SPGSSRLSFNGVRSSAGDYSVQAGLNGQVMDDRDTFYSVQTGR----DSRSGSYGAGKVN 650 + S++ G + AG+ G +++D + YSVQTG D SGS G +N Sbjct: 613 QWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLN 672 Query: 651 TTLPYGRFEAGYSQGQDYDALTLSATGSVVAHAGGVNLGQPLGETFALVHVPDVEGARLR 710 YG GYS D L +G V+AHA GV LGQPL +T LV P + A++ Sbjct: 673 YRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVE 732 Query: 711 SFNNVATAANGYAVMPYAQPYRTNWVSLDTRQLGADIDLESAITQIVPRRGAVPLVRFKA 770 + V T GYAV+PYA YR N V+LDT L ++DL++A+ +VP RGA+ FKA Sbjct: 733 NQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKA 792 Query: 771 AVGRRVQFELVRADGSKVPLGASVEDGQGRALAVVDPSSQALVLSDQDSGRLHVRWSD-- 828 VG ++ + + +P GA V ++ +V + Q + +G++ V+W + Sbjct: 793 RVGIKLLM-TLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEE 851 Query: 829 -QRCEAPFVLPPRDPARAYERLKVTCQ 854 C A + LPP + +L C+ Sbjct: 852 NAHCVANYQLPPESQQQLLTQLSAECR 878
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 56.8 bits (137), Expect = 6e-11 Identities = 26/120 (21%), Positives = 53/120 (44%), Gaps = 8/120 (6%) Query: 55 VTGIGSV-LSLQSVVIRPQVDGILTRVLVKEGQQVKAGELLATLDDRSISASLEQARAQL 113 T G + S +S I+P + I+ ++VKEG+ V+ G++L L A A Sbjct: 84 ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTAL-------GAEADT 136 Query: 114 AQSKAQLDVAQLDLKRYRQLTEDNGISKQTYDQQQALVRQLSATAQGNEASINAAQVQLS 173 ++++ L A+L+ RY+ L+ ++K + + + + + + Q S Sbjct: 137 LKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS 196 Score = 37.9 bits (88), Expect = 6e-05 Identities = 15/83 (18%), Positives = 36/83 (43%), Gaps = 9/83 (10%) Query: 103 SASLEQARAQLAQSKAQLDVAQLDLKRYRQLTEDNGISKQTYDQQQALVRQLSATAQGNE 162 L ++QL Q ++++ A+ + + +T+ + D+ +RQ + Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEY---QLVTQL--FKNEILDK----LRQTTDNIGLLT 315 Query: 163 ASINAAQVQLSHTQIRSPVSGRV 185 + + + + IR+PVS +V Sbjct: 316 LELAKNEERQQASVIRAPVSVKV 338 Score = 37.5 bits (87), Expect = 7e-05 Identities = 12/101 (11%), Positives = 33/101 (32%), Gaps = 1/101 (0%) Query: 79 RVLVKEGQQVKAGELL-ATLDDRSISASLEQARAQLAQSKAQLDVAQLDLKRYRQLTEDN 137 L+KE + L+ A A++ + + V + L + L Sbjct: 188 TSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQ 247 Query: 138 GISKQTYDQQQALVRQLSATAQGNEASINAAQVQLSHTQIR 178 I+K +Q+ + + ++ + + ++ + Sbjct: 248 AIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEE 288
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 741 bits (1914), Expect = 0.0 Identities = 284/1033 (27%), Positives = 489/1033 (47%), Gaps = 32/1033 (3%) Query: 12 IDHPVATLLLTFALVLLGVIAFPRLPVAPLPEAEFPTIQVSAQLPGASPETMASSVATPL 71 I P+ +L L++ G +A +LPVA P P + VSA PGA +T+ +V + Sbjct: 6 IRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVI 65 Query: 72 EVQFSAIPGMTQMTSSSA-LGSTNLTLQFTLNKSIDTAAQEVQAAINTAAGRLPADMPNL 130 E + I + M+S+S GS +TL F D A +VQ + A LP ++ Sbjct: 66 EQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ- 124 Query: 131 PTWRKVNPADSPVLILSVSSS--LMPGTELSDVTETILARQLSQVEGVGQVFITGQQRPA 188 + S +++ S ++SD + + LS++ GVG V + G Q A Sbjct: 125 QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY-A 183 Query: 189 IRVQAAPEKLAALGLTLADIRQAVQQTSLNLAKGALYGKDSIS------TLSSNDQLFKP 242 +R+ + L LT D+ ++ + +A G L G ++ ++ + + P Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243 Query: 243 QDYAQLIV-SYKDGAPVHLSDVARVVNGSENAYVKAWSGDQQGVNIAIFRQPGANIVDTV 301 +++ ++ + DG+ V L DVARV G EN V A + + I GAN +DT Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTA 303 Query: 302 DRIQRELPRLQEMLPAAVDVSVLNDRTRTIRASLHEVELTLLIAVLLVVAVMALFLRQLS 361 I+ +L LQ P + V D T ++ S+HEV TL A++LV VM LFL+ + Sbjct: 304 KAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMR 363 Query: 362 ATLIVSAVLGVSLIASFAMMYLLGFSLNNLTLVAIVVSVGFVVDDAIVVVENIHRHL-EA 420 ATLI + + V L+ +FA++ G+S+N LT+ +V+++G +VDDAIVVVEN+ R + E Sbjct: 364 ATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMED 423 Query: 421 GQGMREAAIKGSGEIGFTVVSISFSLIAAFIPLLFMGGVVGRLFKEFALTATATILISVV 480 +EA K +I +V I+ L A FIP+ F GG G ++++F++T + + +SV+ Sbjct: 424 KLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVL 483 Query: 481 VSLTLAPTLAALFMR--APSHAKHSRPGFG------ERLLATYERGLRKALAHQRIMLGI 532 V+L L P L A ++ + H ++ FG + + Y + K L L I Sbjct: 484 VALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLI 543 Query: 533 FGLTLALAVVGYIVIPKGFFPVQDTAFALGTTEAAADISYPDMVEKHLALAKIVGADPAV 592 + L +A VV ++ +P F P +D L + A + + + + Sbjct: 544 YALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKA 603 Query: 593 LAFS--HSVGVSGSNQTIANGRFWISLKPRSERDV---SVSEFIDRLRPRLAKVPGIVLY 647 S G S S Q G ++SLKP ER+ S I R + L K+ + Sbjct: 604 NVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVI 663 Query: 648 LRAGQDINLSSGPSRSQYQYVLKSNDGPL-LNTWTQRLTEKLRENPA-FRDLSNDLQLGG 705 I + ++ + ++ G L +L ++PA + + Sbjct: 664 PFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLEDT 723 Query: 706 SVTHIDIDRSAAARFGLTTADVDQALYDAFGQRQISEYQTEVNQYKVILELDARQRGKAE 765 + +++D+ A G++ +D++Q + A G ++++ K+ ++ DA+ R E Sbjct: 724 AQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPE 783 Query: 766 SLAYFYLRSPLTNEMVPLSALAKVGAPQMGPLSISHDGMFPAANLSFNLASGVALGDAVR 825 + Y+RS EMVP SA G + P+ + A G + GDA+ Sbjct: 784 DVDKLYVRSA-NGEMVPFSAFTTS-HWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841 Query: 826 MLDEAKAEIGMPASIIGSFQGAAQAFQSSLANQPWLILAALVAVYIILGVLYESFVHPLT 885 +++ ++ +PA I + G + + S P L+ + V V++ L LYES+ P++ Sbjct: 842 LMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 886 IISTLPSAGIGALLLLWMMGQDFSIMALIGVVLLIGIVKKNGILLVDFALQAQREQGLTP 945 ++ +P +G LL + Q + ++G++ IG+ KN IL+V+FA ++G Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 946 QEAIYEACMTRFRPIIMTTLAALLGALPLMLGFGVGAELRQPLGIAVVGGLLVSQLLTLF 1005 EA A R RPI+MT+LA +LG LPL + G G+ + +GI V+GG++ + LL +F Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1006 TTPVIYLQLERLF 1018 PV ++ + R F Sbjct: 1020 FVPVFFVVIRRCF 1032 Score = 104 bits (262), Expect = 7e-25 Identities = 80/526 (15%), Positives = 178/526 (33%), Gaps = 49/526 (9%) Query: 1 MKRRGSVSAWCIDHPVATLLLTFALVLLGVIAFPRLPVAPLPEAEFPTIQVSAQLP-GAS 59 + + + LL+ +V V+ F RLP + LPE + QLP GA+ Sbjct: 523 VNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGAT 582 Query: 60 PETMASSVATPLEVQF-SAIPGMTQMTSSSALG----STNLTLQFTLNKSID--TAAQEV 112 E + + + + + + + + N + F K + + Sbjct: 583 QERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENS 642 Query: 113 QAAINTAAGRLPADMPNLPTWRKVNPADSPVLILSVSSSL---------MPGTELSDVTE 163 A+ R ++ + + ++ L ++ + L+ Sbjct: 643 AEAV---IHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARN 699 Query: 164 TILARQLSQVEGVGQVFITGQQ-RPAIRVQAAPEKLAALGLTLADIRQAVQQTSLNLAKG 222 +L + V G + +++ EK ALG++L+DI Q + Sbjct: 700 QLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTIS--------- 750 Query: 223 ALYGKDSISTLSSNDQLFK------------PQDYAQLIVSYKDGAPVHLSDVARVVNGS 270 G ++ ++ K P+D +L V +G V S Sbjct: 751 TALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVY 810 Query: 271 ENAYVKAWSGDQQGVNIAIFRQPGANIVDTVDRIQRELPRLQEMLPAAVDVSVLNDRTRT 330 + ++ ++G + I PG + D + ++ L LPA + + Sbjct: 811 GSPRLERYNG-LPSMEIQGEAAPGTSSGDAMALME----NLASKLPAGIGYDWT-GMSYQ 864 Query: 331 IRASLHEVELTLLIAVLLVVAVMALFLRQLSATLIVSAVLGVSLIASFAMMYLLGFSLNN 390 R S ++ + I+ ++V +A S + V V+ + ++ L + Sbjct: 865 ERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDV 924 Query: 391 LTLVAIVVSVGFVVDDAIVVVENI-HRHLEAGQGMREAAIKGSGEIGFTVVSISFSLIAA 449 +V ++ ++G +AI++VE + G+G+ EA + ++ S + I Sbjct: 925 YFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILG 984 Query: 450 FIPLLFMGGVVGRLFKEFALTATATILISVVVSLTLAPTLAALFMR 495 +PL G + ++ + ++++ P + R Sbjct: 985 VLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030 Score = 93.7 bits (233), Expect = 2e-21 Identities = 74/414 (17%), Positives = 144/414 (34%), Gaps = 30/414 (7%) Query: 625 VSVSEFIDRLRPRL---AKVPGIVLYLRAGQDINLSSGPSRSQYQYVLKSNDGPLLNTWT 681 V V + P L + GI + SS S++ Sbjct: 105 VQVQNKLQLATPLLPQEVQQQGISVE-------KSSSSYL---MVAGFVSDNPGTTQDDI 154 Query: 682 QRLTEKLRENPAFRDLSN--DLQLGGS--VTHIDIDRSAAARFGLTTADVDQALYDAFGQ 737 L+ D+QL G+ I +D ++ LT DV L Q Sbjct: 155 SDYVAS-NVKDTLSRLNGVGDVQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQ 213 Query: 738 RQISEYQTEVNQYKVILELDARQRGKAESLAYF---YLRSPLTNEMVPLSALAKVGAPQM 794 + L + + ++ F LR +V L +A+V ++ Sbjct: 214 IAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARV---EL 270 Query: 795 GPLSISHDGMF---PAANLSFNLASGVALGDAVRMLDEAKAEI--GMPASI-IGSFQGAA 848 G + + PAA L LA+G D + + AE+ P + + Sbjct: 271 GGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTT 330 Query: 849 QAFQSSLANQPWLILAALVAVYIILGVLYESFVHPLTIISTLPSAGIGALLLLWMMGQDF 908 Q S+ + A++ V++++ + ++ L +P +G +L G Sbjct: 331 PFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSI 390 Query: 909 SIMALIGVVLLIGIVKKNGILLVDFALQAQREQGLTPQEAIYEACMTRFRPIIMTTLAAL 968 + + + G+VL IG++ + I++V+ + E L P+EA ++ ++ + Sbjct: 391 NTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLS 450 Query: 969 LGALPLMLGFGVGAELRQPLGIAVVGGLLVSQLLTLFTTPVIYLQLERLFHRRH 1022 +P+ G + + I +V + +S L+ L TP + L + H Sbjct: 451 AVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEH 504
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.9 bits (197), Expect = 1e-19 Identities = 30/126 (23%), Positives = 58/126 (46%), Gaps = 2/126 (1%) Query: 2 RVLIIEDEEKTADYLRRGLTEQGYAVDVARDGIEGLHLGLENDYAVMVLDVMLPGLDGFG 61 +L+ +D+ L + L+ GY V + + D ++V DV++P + F Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VLRALRAR-KQTPVIMLTAREQVDDRIRGLREGADDYLGKPFSFLELVARL-QALTRRSG 119 +L ++ PV++++A+ I+ +GA DYL KPF EL+ + +AL Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 120 GHEPVQ 125 ++ Sbjct: 125 RPSKLE 130
>PF05272#Virulence-associated E family protein Length = 892 Score = 33.5 bits (76), Expect = 0.001 Identities = 14/56 (25%), Positives = 21/56 (37%), Gaps = 9/56 (16%) Query: 34 LILVGPSGCGKSTLMNCIAGLENITGGAILIDGEDVSGTSPKDRDIAMVFQSYALY 89 ++L G G GKSTL+N + GL+ + I +D Y Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDI---------GTGKDSYEQIAGIVAY 645
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 100 bits (251), Expect = 1e-26 Identities = 42/130 (32%), Positives = 68/130 (52%), Gaps = 2/130 (1%) Query: 7 SILLVDDDQEIRELLDTYLSRAGFQVRTVGDGAGFRQAFNEASSDLLILDVMLPDEDGFS 66 +IL+ DDD IR +L+ LSRAG+ VR + A + DL++ DV++PDE+ F Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 67 LCRWVRQHPRQPHVPIIMLTASSDEADRVIGLELGADDYLGKPFSPRELQARIKALLRRA 126 L +++ +P +P+++++A + + E GA DYL KPF EL I L Sbjct: 65 LLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122 Query: 127 QFGQERPGGD 136 + + D Sbjct: 123 KRRPSKLEDD 132
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 28.6 bits (64), Expect = 0.033 Identities = 21/70 (30%), Positives = 33/70 (47%), Gaps = 3/70 (4%) Query: 249 VLTVGGLGGVYIA-GGVVPRFTDFFMNSGFKRALAEKGVM--SDYFKGLPVWLVTAEYPG 305 VLTV + V I VVP+ + F++ L+ + +M SD + W++ A G Sbjct: 178 VLTVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAG 237 Query: 306 LMGAGVALQQ 315 M V L+Q Sbjct: 238 FMAFRVMLRQ 247
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 30.6 bits (69), Expect = 0.024 Identities = 28/115 (24%), Positives = 44/115 (38%), Gaps = 17/115 (14%) Query: 326 LSEVVPTLSHVYPNGKADINHFQAAGGMSFLIRELLAAGLLHENVNTVAGYGLSRYTKEP 385 +S+ P L + H + + E+ A L + Y + KEP Sbjct: 368 ISDSDPLLRYYVD----SATHEIILSFLGKVQMEVTCALLQEK-------YHVEIEIKEP 416 Query: 386 FLEDGKLVWREGPLESLDENILRPV-SRPFSAEGGLRVMEGNLGRGVMKVSAVAL 439 +++ E PL+ + I V PF A GL V LG G+ S+V+L Sbjct: 417 -----TVIYMERPLKKAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSL 466
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 332 bits (852), Expect = e-111 Identities = 118/356 (33%), Positives = 180/356 (50%), Gaps = 35/356 (9%) Query: 177 ERLSALHHDHAEGFDALLGESPAIRTLKARAQRIAALDAPLLIQGETGTGKELVARACHA 236 +R + D ++ L+G S A++ + R+ D L+I GE+GTGKELVARA H Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD 182 Query: 237 SSARHGEPFLALNCAALPENLAESELFGYAPGAFTGAQRGGKPGLMELANQGTVFLDEIG 296 R PF+A+N AA+P +L ESELFG+ GAFTGAQ G E A GT+FLDEIG Sbjct: 183 YGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRST-GRFEQAEGGTLFLDEIG 241 Query: 297 EMSPYLQAKLLRFLNDGSFRRVGGDREVKVNVRILSATHRDLEKMVSEGTFREDLFYRLN 356 +M Q +LLR L G + VGG ++ +VRI++AT++DL++ +++G FREDL+YRLN Sbjct: 242 DMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLN 301 Query: 357 VLNLEVPPLRERGQDILLLARYFMEQACAQIQRPVCRLAPGTYPALLGNRWPGNVRQLQN 416 V+ L +PPLR+R +DI L R+F++QA + V R + + WPGNVR+L+N Sbjct: 302 VVPLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELEN 360 Query: 417 VIFRAAAISESAVVDIGDLDIAG--------------------------------TAIAG 444 ++ R A+ V+ ++ A G Sbjct: 361 LVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFG 420 Query: 445 QSTVEVDSLEHAVESFEKDLLERLYADYPSTRQLATR-LHTSHTAIAHRLRKYGIP 499 + + + E L+ + A L + + ++R+ G+ Sbjct: 421 DALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 69.9 bits (171), Expect = 3e-17 Identities = 30/119 (25%), Positives = 50/119 (42%), Gaps = 2/119 (1%) Query: 2 AQILIIEDNAANMRLAELLLTSAGHGVTAATDAETGLRLAQECQPQLILMDIHLPGMDGL 61 A IL+ +D+AA + L+ AG+ V ++A T R L++ D+ +P + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 QATSLLKSDARTATIPVIALTAMAMKEDEEKIRLAGCDAYITKPLSYKELYRVIETLLA 120 +K +PV+ ++A K G Y+ KP EL +I LA Sbjct: 64 DLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 95.3 bits (237), Expect = 9e-23 Identities = 34/122 (27%), Positives = 59/122 (48%), Gaps = 2/122 (1%) Query: 4 TNATILIVDDDVHVRDLLEVLLQNQQYRTQTAESGEQALEMVEKHAPDLILLDIMMPGMD 63 T ATIL+ DDD +R +L L Y + + + DL++ D++MP + Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 64 GYEVASRLKSGKTTSNIPIIMLSALDERSARISGLEAGAEEYLNKPVDSAELWLRVRNLL 123 +++ R+K + ++P++++SA + I E GA +YL KP D EL + L Sbjct: 62 AFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119 Query: 124 RL 125 Sbjct: 120 AE 121
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 94.2 bits (234), Expect = 7e-25 Identities = 45/171 (26%), Positives = 75/171 (43%), Gaps = 16/171 (9%) Query: 66 KGALIGAAVVGAASAGYGY-YADKQEAALRASMANTGVEVQRQGDQIKLIMPGNITFATD 124 + G S G Y + + A + A EVQ + + ++ F + Sbjct: 171 AHTIGTRPDNGMLSLGVSYRFGQGEAAPVVAPAPAPAPEVQTK----HFTLKSDVLFNFN 226 Query: 125 SSAIASSFYSPLNNLANSLKQFNQSN--IEIIGYTDSTGSRQHNMDLSQQRAQSVATYLT 182 + + + L+ L + L + + + ++GYTD GS +N LS++RAQSV YL Sbjct: 227 KATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLI 286 Query: 183 SQGVDQAHLSVRGAGPDQPIASNADANGR---------AQNRRVEVNLKPI 224 S+G+ +S RG G P+ N N + A +RRVE+ +K I Sbjct: 287 SKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEVKGI 337
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 28.7 bits (64), Expect = 0.031 Identities = 14/66 (21%), Positives = 22/66 (33%), Gaps = 13/66 (19%) Query: 176 ELGAKQINPKATVAVVYTG--AWNDPVKERAATMALIDNGVDVVGQHVDS-------PTP 226 ++ A + K + + +G W K L G VVG S P Sbjct: 41 QVNAASSHTKPPLVIFLSGDGGWATLDKAVGG--ILQQQGWPVVG--WSSLKYYWKQKDP 96 Query: 227 QIVAQE 232 + V Q+ Sbjct: 97 KDVTQD 102
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 68.9 bits (168), Expect = 3e-16 Identities = 35/164 (21%), Positives = 65/164 (39%), Gaps = 12/164 (7%) Query: 38 KRRLRLMEGKRSVILDAALEIFSRYGVHGSSLDQVASLADVSKTNLLYYFSSKDDLYLNV 97 ++ + + R ILD AL +FS+ GV +SL ++A A V++ + ++F K DL+ + Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62 Query: 98 LRQLLEVWLSPLLHFTAD--KDPQQAIGAYLKAKLEMSRDHPAESRLFCMEVMQGAPLIQ 155 L + A DP + L LE + L ME++ Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLL--MEIIFHKCEFV 120 Query: 156 GELQHPLR-------DTVQTKVAVIQHWIDSGQL-APINPHHLI 191 GE+ + ++ ++H I++ L A + Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAA 164
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 76.2 bits (187), Expect = 2e-18 Identities = 50/197 (25%), Positives = 78/197 (39%), Gaps = 28/197 (14%) Query: 14 PDLQP-----ARDLPARPEALRMKAGETALVVVDMQNAYASLGGYLDLAGFDVSSTGPVI 68 P +QP A D+P + L++ DMQN + +D S + Sbjct: 4 PAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYF------VDAFTAGASPVTELS 57 Query: 69 ANIKKACATARAAGIPVIFFQNGWDPAYVEAGGPGSPNWHKSNALKTMRKRPELEGQLLA 128 ANI+K GIPV++ PGS N L G L Sbjct: 58 ANIRKLKNQCVQLGIPVVY-----------TAQPGSQNPDDRALLTDFW------GPGLN 100 Query: 129 KGGWDYQLVDELKPEPGDIVVPKIRYSGFFNSSFDSVLRSRGIRNLVFTGIATNVCVEST 188 G ++ +++ EL PE D+V+ K RYS F ++ ++R G L+ TGI ++ T Sbjct: 101 SGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVT 160 Query: 189 LRDGFHLEYFGVVLADA 205 + F + + DA Sbjct: 161 ACEAFMEDIKAFFVGDA 177
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 27.3 bits (60), Expect = 0.019 Identities = 9/21 (42%), Positives = 13/21 (61%) Query: 56 LETIKSVIETAGGTMDDVTFN 76 L +K V+ T G +DD +FN Sbjct: 59 LLKLKPVLITDEGKIDDKSFN 79
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 31.1 bits (70), Expect = 0.015 Identities = 14/71 (19%), Positives = 28/71 (39%), Gaps = 4/71 (5%) Query: 16 VVVLAFVLFTLYN----DYLQRATINQNLESSVGQAGQLTASSVQNWLSGRILVLENLTQ 71 V+ L FV F ++ D + Q +++ AG V G+++ ++ Sbjct: 9 VIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKLISVKTDVL 68 Query: 72 DVAYQGVGSDL 82 D+ G D+ Sbjct: 69 DLTINTRGGDV 79
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 33.8 bits (77), Expect = 4e-04 Identities = 28/104 (26%), Positives = 46/104 (44%), Gaps = 1/104 (0%) Query: 154 WTALLFPL-VLLPLAIATLGFSWLLAALGVYLRDVGQVIGVLTTVLLFLSPVLYPVAALP 212 W +LL+ L V+ +A ++ AL ++ T +LFLS ++PV LP Sbjct: 144 WLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLP 203 Query: 213 QVYQPWLKLNPLTYIIEESRNALLFGNWPDWQSLALAMLIASAI 256 V+Q + PL++ I+ R +L D A+ I I Sbjct: 204 IVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVI 247
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 135 bits (342), Expect = 5e-41 Identities = 82/248 (33%), Positives = 126/248 (50%), Gaps = 11/248 (4%) Query: 7 KVVVVTGAGSGIGEATAKRFAREGASVVLVGRNEEKLKKVHAQLEGEGHLVRA--ADVAD 64 K+ +TGA GIGEA A+ A +GA + V N EKL+KV + L+ E A ADV D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 65 LSDVEALFKEVASHFGRLDALVNNAGIVKSGKVTELEVQDWKELMSVDLDGVFYCTRSAM 124 + ++ + + G +D LVN AG+++ G + L ++W+ SV+ GVF +RS Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 125 PALIVSK-GNIVNVSSVSGMGGDWGMSFYNAAKGAITNFTRALALDHGADGVRVNAVCPS 183 ++ + G+IV V S M+ Y ++K A FT+ L L+ +R N V P Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188 Query: 184 LTRSELTDDMMDND--------ALMAKFKERIALGRPAEPEDIGDVIAFLASDDARFVTG 235 T +++ + ++ + FK I L + A+P DI D + FL S A +T Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248 Query: 236 VNLPVDGG 243 NL VDGG Sbjct: 249 HNLCVDGG 256
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 33.1 bits (75), Expect = 0.002 Identities = 38/150 (25%), Positives = 61/150 (40%), Gaps = 17/150 (11%) Query: 222 RTQVGVWYSELQDIYQQQFFNLLHSQTFGDWTLG-ANLGYFIGKEDGNKLAGDLDNKTAY 280 R Q G ++E + + +LL S+ G W G A Y++ NKL D+ N Sbjct: 83 RIQAGKGFNEFPNKEYDLYKSLLSSKIDGGWDWGNAARHYWVKDGQWNKLEVDMQNAVGT 142 Query: 281 ALLSA--RYGGSTFYVGLQKLSGDTAWMRVNGTSGGTLANDSYNSSYDNAKEKSWQLRHD 338 LS + G V +QK A +R+ +G +S+ S D+A + R D Sbjct: 143 YNLSGLINFTGGDLDVNMQK-----ATLRLGQFNG-----NSFTSYKDSADRTT---RVD 189 Query: 339 YNFAVLGVPG-LTLMNRYISGDNVHTGNIT 367 +N + + L + NR SG + Sbjct: 190 FNAKNILIDNFLEINNRVGSGAGRKASSTV 219
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 70.6 bits (173), Expect = 3e-16 Identities = 41/179 (22%), Positives = 72/179 (40%), Gaps = 23/179 (12%) Query: 13 RLLLTGAAGGLGKVLRETLR-------------PYANILRLSDIAEMAPAAGSHEEVQVC 59 + L+TGAAG +G + + L Y ++ E+ G ++ Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQF-HKI- 59 Query: 60 DLSDKNAVHQLVE--GVDAILHFG---GV--SVERPFEEILGANICGVFHIYEAARRHGV 112 DL+D+ + L + + V S+E P +N+ G +I E R + + Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYA-DSNLTGFLNILEGCRHNKI 118 Query: 113 KRVIFASSNHVIGFYKQDETIDAHSPRRPDSYYGLSKSYGEDMASFYFDRYGIETVSIR 171 + +++ASS+ V G ++ S P S Y +K E MA Y YG+ +R Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLR 177
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 81.0 bits (200), Expect = 4e-19 Identities = 40/168 (23%), Positives = 69/168 (41%), Gaps = 4/168 (2%) Query: 1 MSTLALLICDDSNMARKQLMRALPADWDVSVTMATQGQEGLEAIRSGLGKVVLLDLTMPV 60 M+ +L+ DD R L +AL + V + + I +G G +V+ D+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQAL-SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59 Query: 61 MDGYQTLAAIREEHLDAKVIVVSGDVQDEAVRRVMELGALAFLKKPADPDELKSTLERLG 120 + + L I++ D V+V+S + E GA +L KP D EL + R Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR-- 117 Query: 121 LLGKPSALPAAVAAQHTAGQGVISFQDAFRETVNVAMGRAAALLAKVL 168 L +P P+ + G ++ A +E V + R ++ Sbjct: 118 ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRV-LARLMQTDLTLM 164
>DNABINDNGFIS#DNA-binding protein FIS signature. Length = 98 Score = 26.5 bits (58), Expect = 0.020 Identities = 13/44 (29%), Positives = 24/44 (54%) Query: 31 LYTALRNGLPYEIFERLAQYTDLNRSTLAEHLGIAPATLQRRLK 74 LY + + + + + QYT N++ A +GI TL+++LK Sbjct: 50 LYELVLAEVEQPLLDMVMQYTRGNQTRAALMMGINRGTLRKKLK 93
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 155 bits (392), Expect = 3e-51 Identities = 54/149 (36%), Positives = 78/149 (52%), Gaps = 1/149 (0%) Query: 7 INEQDRQQ-IVDGLSHLLSDTYVLYLKTHNFHWNVTGPMFRTLHLLFEEQYTELATAVDS 65 N + Q + + L+ LS+ ++LY K H FHW V GP F TLH FEE Y A VD+ Sbjct: 4 ENAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVDT 63 Query: 66 IAERIRALGFPAPGTYSTYARLSSIKEEPGVPDAAEMIRQLVEGQEAVVRTARGLFPLLE 125 IAER+ A+G T Y +SI + A+EM++ LV + + ++ + L E Sbjct: 64 IAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLAE 123 Query: 126 KVSDEPTADLLTQRMQVHEKAAWMLRTLL 154 + D TADL ++ EK WML + L Sbjct: 124 ENQDNATADLFVGLIEEVEKQVWMLSSYL 152
>PERTACTIN#Pertactin signature. Length = 922 Score = 30.8 bits (69), Expect = 0.011 Identities = 19/59 (32%), Positives = 21/59 (35%) Query: 197 WRAQAALAEGKPAPIPEPGPAASAVGNYLVASPQRYNPPGVIDSQVELPRLLAAARREV 255 W A A P P P+PGP PQ PP Q E P A RE+ Sbjct: 560 WSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGREL 618
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 54.6 bits (131), Expect = 4e-11 Identities = 57/195 (29%), Positives = 74/195 (37%), Gaps = 3/195 (1%) Query: 43 VELALVEPEPPAPEPVIPPEPQPVEPVQPDEPPPPPVPVVDSEEAEPPPPPPKPVPKPEP 102 V + P P P V P +EP Q +PPP PV + E EP P PPK P Sbjct: 37 VHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPE-PEPIPEPPKEAPVVIE 95 Query: 103 KPKPEPKPRPKPAPAVAKPAEPVPAPRQPVVSAPVAPVAPPAPPAPPKVDTQGLEGGYLK 162 KPKP+PKP+PKP V +P V S + T Sbjct: 96 KPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVAS 155 Query: 163 GLRNELDGYKQYPTGRQASLERPSGEVIVWLLVDRQGRVLDSGIQSQASSMLLNRAATSS 222 G R QYP +A R G+V V V GRV + I S + + R ++ Sbjct: 156 GPRALSRNQPQYP--ARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNA 213 Query: 223 LRRIKQVKPFPEQAF 237 +RR + P Sbjct: 214 MRRWRYEPGKPGSGI 228
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.007 Identities = 13/37 (35%), Positives = 19/37 (51%) Query: 14 SHILRGLSFDVKVGEVTCLLGRNGVGKTTLLRVLMGL 50 H+ R + K L G G+GK+TL+ L+GL Sbjct: 583 GHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 36.5 bits (84), Expect = 2e-05 Identities = 13/63 (20%), Positives = 25/63 (39%), Gaps = 1/63 (1%) Query: 81 RHTVEHSVYVRADQRGKGLGPKLMSALIERARTCDKHMMVAAIESGNAASIALHERLGFT 140 +E + V D R KG+G L+ IE A+ ++ + N ++ + + F Sbjct: 89 YALIED-IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147 Query: 141 TTG 143 Sbjct: 148 IGA 150
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 36.8 bits (85), Expect = 1e-05 Identities = 14/62 (22%), Positives = 23/62 (37%), Gaps = 7/62 (11%) Query: 90 RAEVQKLMVSPAARGHGLGRQLME-AVEQAAVKLKRGLLHLDTEAGST---AEAFYRSMA 145 A ++ + V+ R G+G L+ A+E A + L E A FY Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIEWA---KENHFCGLMLETQDINISACHFYAKHH 145 Query: 146 YT 147 + Sbjct: 146 FI 147
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 1119 bits (2897), Expect = 0.0 Identities = 430/567 (75%), Positives = 487/567 (85%), Gaps = 2/567 (0%) Query: 2 KISRQAYADMFGPTVGDKVRLADTELWIEVEKDFTTYGEEVKFGGGKVIRDGMGQGQLL- 60 ++SR AYA+MFGPTVGDKVRLADTEL+IEVEKDFTT+GEEVKFGGGKVIRDGMGQ Q+ Sbjct: 4 RMSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTR 63 Query: 61 AAEVVDTLITNALIIDHWGIVKADVGLKNGRIAAIGKAGNPDIQPDVTIAVGAATEVIAG 120 VDT+ITNALI+DHWGIVKAD+GLK+GRIAAIGKAGNPD+QP VTI VG TEVIAG Sbjct: 64 EGGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAG 123 Query: 121 EGMILTAGGVDTHIHFICPQQIEEALMSGVTTMIGGGTGPATGTNATTVTPGPWHMARML 180 EG I+TAGG+D+HIHFICPQQIEEALMSG+T M+GGGTGPA GT ATT TPGPWH+ARM+ Sbjct: 124 EGKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMI 183 Query: 181 QAADAFPMNIGLTGKGNVSLPGPLIEQVKAGAIGLKLHEDWGTTPAAIDNCLSVADEYDV 240 +AADAFPMN+ GKGN SLPG L+E V GA LKLHEDWGTTPAAID CLSVADEYDV Sbjct: 184 EAADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEYDV 243 Query: 241 QVAIHTDTLNESGFVETTLAAFKNRTIHTYHTEGAGGGHAPDIIKACGSPNVLPSSTNPT 300 QV IHTDTLNESGFVE T+AA K RTIH YHTEGAGGGHAPDII+ CG PNV+PSSTNPT Sbjct: 244 QVMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTNPT 303 Query: 301 RPFTRNTIDEHLDMLMVCHHLDPSIAEDVAFAESRIRRETIAAEDILHDLGAFSMLSSDS 360 RP+T NT+ EHLDMLMVCHHL P+I ED+AFAESRIR+ETIAAEDILHD+GAFS++SSDS Sbjct: 304 RPYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDS 363 Query: 361 QAMGRVGEVIMRTWQTADKMKKQRGPLPQDGPGNDNFRAKRYIAKYTINPAITHGISHEV 420 QAMGRVGEV +RTWQTADKMK+QRG L ++ NDNFR KRYIAKYTINPAI HG+SHE+ Sbjct: 364 QAMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSHEI 423 Query: 421 GSIEVGKWADLVLWRPAFFGVKPTLILKGGAIAASLMGDANASIPTPQPVHYRPMFASYG 480 GS+EVGK ADLVLW PAFFGVKP ++L GG IAA+ MGD NASIPTPQPVHYRPMF +YG Sbjct: 424 GSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGAYG 483 Query: 481 SSLHATSMTFISQAAFDAGVPESLGLKKQIGVVKGCR-TVQKKDLIHNDYLPDIEVDPQT 539 S +S+TF+SQA+ DAG+ LG+ K++ V+ R + K +IHN P IEVDP+T Sbjct: 484 RSRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDPET 543 Query: 540 YQVKADGVLLWCEPADVLPMAQRYFLF 566 Y+V+ADG LL CEPA VLPMAQRYFLF Sbjct: 544 YEVRADGELLTCEPATVLPMAQRYFLF 570
>PF06580#Sensor histidine kinase Length = 349 Score = 31.0 bits (70), Expect = 0.007 Identities = 25/112 (22%), Positives = 40/112 (35%), Gaps = 29/112 (25%) Query: 270 MLQNLIGNALQHGAASHE----ITVRVIGGPDTVELVVHNEGKPIAEDAIGTIFDPLVRS 325 ++Q L+ N ++HG A I ++ TV L V N G ++ Sbjct: 259 LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------ 306 Query: 326 SEENSESRTTSTSLGLGLFIVKEVVNAHSG---SITVTSTIGDGTTFTVVLP 374 T S G GL V+E + G I ++ G V++P Sbjct: 307 ---------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 44.7 bits (105), Expect = 2e-06 Identities = 35/234 (14%), Positives = 75/234 (32%), Gaps = 27/234 (11%) Query: 30 SEAVQQSLDKIADRKLPDADQKALQQVLEQTLAFLASKQDSEQKLTALKQQLNQAPKQTS 89 SE + + A + + + A + + + + L A K L +A + Sbjct: 109 SEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAM 168 Query: 90 ENQRELSRLKESKIVPIAQRYGGLDVPQLEQMLSQRSTQQSDLQKELNDANSLSITAQTR 149 S + LE + +Q++L+K L A + S + Sbjct: 169 NFSTADSAKIK----------------TLEAEKAALEARQAELEKALEGAMNFSTADSAK 212 Query: 150 PERAQAEISANQNRIQQINAILKLGKDNGKALSADQRNLLNAELASINALNLLRRQELAG 209 + +AE +A R + + N A+ A I L + A Sbjct: 213 IKTLEAEKAALAARKADL-----------EKALEGAMNFSTADSAKIKTLEAEKAALEAR 261 Query: 210 NSQLQDLGNSQHDLLTEKVARQEQEIQDLQTLINDKRRAQSQKTVADLSLEAQK 263 ++L+ + T A+ + + L +K + Q V + + ++ + Sbjct: 262 QAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLR 315 Score = 36.6 bits (84), Expect = 5e-04 Identities = 33/273 (12%), Positives = 72/273 (26%), Gaps = 17/273 (6%) Query: 153 AQAEISANQNRIQQINAILKLGKDNGKALSADQRNLL--NAELASINA-----LNLLRRQ 205 + Q + + K+ + K + L N++L+ N + L + Sbjct: 35 VNTNEVSAVATRSQTDTLEKVQERADK-FEIENNTLKLKNSDLSFNNKALKDHNDELTEE 93 Query: 206 ELAGNSQLQDLGNSQHDLLTEKVARQEQEIQDLQTLINDKRRAQSQKTVADLSLEAQKSG 265 +L+ + K+ E DL+ + + + +LEA+K+ Sbjct: 94 LSNAKEKLRKN-DKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKA- 151 Query: 266 GSSLLATESAYNLQLSDYLLRGTDRLNELTQQNLKTKQQLDNLTQTDQALSEQINVLSGS 325 +L+ + + + L+ ++ L L + + Sbjct: 152 ----ALAARKADLEKALEGAMNFSTADSAKIKTLEAEKA--ALEARQAELEKALEGAMNF 205 Query: 326 LLLSKILYKQKQSLPHLELDKGLADEIANIRLYQFEINQKREQMSTPTAYVEKLLTTQPP 385 K ++ L AD + ++ T A L Q Sbjct: 206 STADSAKIKTLEAE-KAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAE 264 Query: 386 ENVTPQLRRTLLDLAITRSDLLERLNRELSALL 418 + + LE L A Sbjct: 265 LEKALEGAMNFSTADSAKIKTLEAEKAALEAEK 297
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 70.2 bits (172), Expect = 3e-14 Identities = 26/113 (23%), Positives = 55/113 (48%), Gaps = 2/113 (1%) Query: 1874 VMVVDDSVTVRKVTGRLLERHGMHVLTAKDGVDAMSLLQEHTPDIMLLDIEMPRMDGFEV 1933 ++V DD +R V + L R G V + + D+++ D+ MP + F++ Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 1934 ASQIRHDEQLKDLPIIMITSRSGQKHRDRAMAIGVNEYLSKPYQESVLLDSIA 1986 +I+ + DLP++++++++ +A G +YL KP+ + L+ I Sbjct: 66 LPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 81.0 bits (200), Expect = 3e-21 Identities = 33/119 (27%), Positives = 52/119 (43%), Gaps = 2/119 (1%) Query: 2 ARILIVDDSPTEMYKLTGMLEKHGHEVLKAENGADGVALARQEKPDAVLMDIVMPGLNGF 61 A IL+ DD L L + G++V N A D V+ D+VMP N F Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 QATRQLTKDADTAMIPVIMITTKDQETDKVWGKRQGARDYLTKPVDEDTLMKTLNAVLA 120 ++ K +PV++++ ++ + +GA DYL KP D L+ + LA Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 67.9 bits (166), Expect = 2e-16 Identities = 28/117 (23%), Positives = 48/117 (41%), Gaps = 2/117 (1%) Query: 6 TALKVMVIDDSKTIRRTAETLLRNVGCEVITAIDGFDALAKIADNHPRIIFVDIMMPRLD 65 T ++V DD IR L G +V + IA ++ D++MP + Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 66 GYQTCALIKNNRAFKSTPVIMLSSKDGLFDKARGRAVGSDQFLTKPFSKEELLSAIK 122 + IK R PV+++S+++ + G+ +L KPF EL+ I Sbjct: 62 AFDLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>RTXTOXINC#Gram-negative bacterial RTX toxin-activating protein C signature. Length = 170 Score = 28.3 bits (63), Expect = 0.024 Identities = 11/28 (39%), Positives = 14/28 (50%) Query: 196 IMAQGYLPAIKDGDKRILMVDGEPVPYC 223 + A LPAI+ +L D PV YC Sbjct: 30 LFAINVLPAIQANQYVLLTRDDYPVAYC 57
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 66.5 bits (162), Expect = 6e-15 Identities = 38/250 (15%), Positives = 75/250 (30%), Gaps = 41/250 (16%) Query: 23 RLGFTMMIAALIHLAIILGVGFTYVKPEHISQTLEITLATFKSEEKPKQADFLAQDDQQG 82 R + +++ IH A++ G+ +T V I L +P +A D + Sbjct: 13 RFPWPTLLSVCIHGAVVAGLLYTSV-------HQVIELPA---PAQPISVTMVAPADLE- 61 Query: 83 SGTLDKAETLKTTEVAPYQDTKVNKVTPPPASKPVVKQEAPKTAVATTAPSPQKTVAKRE 142 + V + P P P +EAP K ++ Sbjct: 62 -----------PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 110 Query: 143 EVKPDPAVKAAPTFDSAELSNEIASLEAELSAEQQLYAKRPKIHRLNAASTMRDKGAWYK 202 +P VK + P + A+ K Sbjct: 111 VEQPKRDVKPVES-----------------RPASPFENTAPARPTSSTATAATSKPVTSV 153 Query: 203 DDWRKKVERVGNLNYPEEARRKQIYGNLRLLVSINRDGSLYEVLVLESSGQPLLDQAAQR 262 + + R YP A+ +I G +++ + DG + V +L + + ++ + Sbjct: 154 ASGPRALSRN-QPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKN 212 Query: 263 IVRLAAPFAP 272 +R + P Sbjct: 213 AMR-RWRYEP 221
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 36.6 bits (84), Expect = 3e-04 Identities = 47/267 (17%), Positives = 100/267 (37%), Gaps = 21/267 (7%) Query: 381 TEQTSAGVNNQKVETDQVATAMHEMTATVQEVARNAEEASEAAVAADQQAREGERVVNEA 440 +E T N K E+ V + T T + A+EA ++ V A+ Q E + +E Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEA-KSNVKANTQTNEVAQSGSET 1092 Query: 441 IAQIERLASAVGNSSEAMGALKQESEKIGSVLDVIKSVA-QQTNLLALNAAIEAARAGEA 499 + + + + E K E+EK V V V+ +Q + E AR + Sbjct: 1093 K-ETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDP 1151 Query: 500 GRGFAVVADEVRSLAQRTQKSTEEIEAL------IVSLQSGTQQAASVMDSSRELSASSV 553 V E +S T + + + V+ + SV+++ + ++ Sbjct: 1152 ----TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207 Query: 554 DLTRRAGSSLENITKTVSAIQSMNQQIAAAAEQQSATAEEINRSIINVRDVSEQT--SAA 611 T + SS + + +++S+ + A + +RS + + D++ + Sbjct: 1208 QPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSN------DRSTVALCDLTSTNTNAVL 1261 Query: 612 SEETAASSIELARLGTHLQTLVSRFTV 638 S+ A + +G + +S+ + Sbjct: 1262 SDARAKAQFVALNVGKAVSQHISQLEM 1288
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 33.3 bits (76), Expect = 0.003 Identities = 21/205 (10%), Positives = 65/205 (31%), Gaps = 9/205 (4%) Query: 278 LSTLQGTRRDSEADSSRKTLSGVAALALLVGLLAAWIMTRQITE------PLRQTLIAAA 331 L L +++ ++ +L +L+ I ++ E P Q + Sbjct: 124 LLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEE 183 Query: 332 RIAQGDLSKDLETGRRDELGQLQNSMQAMTLSLRELIGGIGDGVSQIASAAEQLSAVT-- 389 + L K+ + +++ Q + ++ ++ I + +L + Sbjct: 184 VLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSL 243 Query: 390 -EQTCMGVNTQKDETDQVATAMNEMTATVQEVARNAQEASQSAAQADQQAQDGDRVVGQA 448 + + + ++ ++ A+NE+ ++ + E + + Q + Sbjct: 244 LHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK 303 Query: 449 ITQIEQLAREVVNSTQAMNQLKQES 473 + Q + + +Q S Sbjct: 304 LRQTTDNIGLLTLELAKNEERQQAS 328 Score = 31.3 bits (71), Expect = 0.013 Identities = 29/169 (17%), Positives = 62/169 (36%), Gaps = 29/169 (17%) Query: 82 VVDRLNEIEALLASLRKQSDEADALAS---------LESQSQLISLMEKTFTDLGADREA 132 V+ R+N E L + + D+ +L LE +++ + + +L + Sbjct: 219 VLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVN----ELRVYKSQ 274 Query: 133 RDQIRARLDQKSEQAVNAVTQVEKEVLKAVSQEQDNGERMDEFTNLSQLKHQIQIARYQV 192 +QI + + E+ + E+L + Q D N+ L ++ + Sbjct: 275 LEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTD---------NIGLLTLELAKNEERQ 325 Query: 193 QAYTFTGKEADETAAVTAIDEALKEMQQISQDQADENIQALVPANEALQ 241 QA + A V+ + LK + E + +VP ++ L+ Sbjct: 326 QA-------SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLE 367
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 38.1 bits (88), Expect = 1e-04 Identities = 33/129 (25%), Positives = 51/129 (39%), Gaps = 13/129 (10%) Query: 512 PAEPGKEPALLVAD--KAEDKKVAAKEAAAKEAAAK-----EAAKPAATKDTDQVEIAKA 564 PA P + VA+ K E K V E A E A+ + AK +T E+A++ Sbjct: 1030 PATPSET-TETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQS 1088 Query: 565 DAPKVEAAKPEA-AKGDASKPDAAKGEVAKTDAA---KADVAKDKDGKEIQQPETEAAPT 620 + E E K + AK E KT + V+ ++ E QP+ E A Sbjct: 1089 GSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAR- 1147 Query: 621 HPEPAKTLQ 629 +P ++ Sbjct: 1148 ENDPTVNIK 1156 Score = 34.7 bits (79), Expect = 0.002 Identities = 24/145 (16%), Positives = 44/145 (30%), Gaps = 4/145 (2%) Query: 495 KDSGKPTEMRAYLLREIPAEPGKEPALLVADKAEDKKVAAKEAAAKEAAAKEAAKPAATK 554 K+ TE A A+ K E + ++ + KE A + Sbjct: 1053 KNEQDATETTAQNREV--AKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEE 1110 Query: 555 DTDQVEIAKADAPKVEA-AKPEAAKGDASKPDA-AKGEVAKTDAAKADVAKDKDGKEIQQ 612 + PKV + P+ + + +P A E T K ++ + +Q Sbjct: 1111 KAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQ 1170 Query: 613 PETEAAPTHPEPAKTLQVMTETWSY 637 P E + +P + S Sbjct: 1171 PAKETSSNVEQPVTESTTVNTGNSV 1195
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 30.3 bits (68), Expect = 0.036 Identities = 23/105 (21%), Positives = 44/105 (41%), Gaps = 5/105 (4%) Query: 463 RINNKTNGITFRRWLFQANPKLTEMLVEAL----GPDVLDNAETRLKELEPFAEKSSFRK 518 NGITFR W + + ++ +E + G + ++ + E + K+S+ Sbjct: 901 LSIGHKNGITFRNWFEKESGDISNHEIEQIFDKSGRIITPDSLKKALEYQQRNNKASYVY 960 Query: 519 QMADQRLHSKRALAAIIHERLGIAVNPAAMFDVQVKRIHEYKRQL 563 S+ L +I+E + ++ A FDV+ +R QL Sbjct: 961 GNDALAYGSQGDLNPLINE-ISKIISAAGSFDVKEERTAASLLQL 1004
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 30.7 bits (69), Expect = 0.022 Identities = 16/86 (18%), Positives = 28/86 (32%) Query: 56 LATRKTVESLQQRLTLLEYPPVPSPQPVAEAQQTAPLPADSVIIAAQTTGPELIWDLPAE 115 L + V+ + + E P P P+P EA P + + Sbjct: 60 LEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVK 119 Query: 116 EAPAAPTAAPVATATTRQASTTPSSP 141 + P + TA R S+T ++ Sbjct: 120 PVESRPASPFENTAPARPTSSTATAA 145
>BCTLIPOCALIN#Bacterial lipocalin signature. Length = 171 Score = 116 bits (291), Expect = 3e-35 Identities = 57/150 (38%), Positives = 81/150 (54%), Gaps = 10/150 (6%) Query: 33 VDSVDLKQYQGTWYELARLPMFFQRKCAQSEAHYALKDDGNIAVTNRCRTIE-GEWQEAT 91 V +L Y G WYE+ARL F+R +Q A Y +++DG I+V NR + E GEW+EA Sbjct: 26 VSDFELNNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLNRGYSEEKGEWKEAE 85 Query: 92 GTASPQVPGKTDKLWVVFDNWFSRLLPGVAKGDYWVLDIG-DGYKTAVVGNPDRKYLWLL 150 G A L V F F G Y V ++ + Y A V P+ +YLWLL Sbjct: 86 GKAYFVNGSTDGYLKVSFFGPFY--------GSYVVFELDRENYSYAFVSGPNTEYLWLL 137 Query: 151 SRTPTVSESVKQDMLSKARQQGYDTSRLIW 180 SRTPTV + + ++++G+DT+RLI+ Sbjct: 138 SRTPTVERGILDKFIEMSKERGFDTNRLIY 167
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 48.8 bits (116), Expect = 6e-09 Identities = 27/169 (15%), Positives = 52/169 (30%), Gaps = 32/169 (18%) Query: 51 GEHICGGALIAPQWVLTAAHCLTNPEKKAHAVSIGLEQYRPEVIERERITVGDVFLHAGL 110 G I G ++ +LT H + HA+ + T + ++G Sbjct: 100 GTFIASGVVVGKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 111 NRGQYDIALVKLSRPAQSTEFLKLDSGQSPLPLYKNTP------VTLIGFGRTDEGVLAD 164 D+A+VK S Q+ P + N +T+ G+ Sbjct: 160 G----DLAIVKFSPNEQNKHI---GEVVKPATMSNNAETQVNQNITVTGYP---GDKPVA 209 Query: 165 VLYQGQGRILNDARCIYIPEGYPDTNFNPDNNICAGYNQAGGDSGGPLL 213 +++ +G+I + D + G+SG P+ Sbjct: 210 TMWESKGKI----TYLKGEAMQYDLSTTG------------GNSGSPVF 242
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 39.6 bits (92), Expect = 2e-05 Identities = 26/126 (20%), Positives = 37/126 (29%), Gaps = 7/126 (5%) Query: 359 SDEDAVPTGSPAQPPTVTTTAPPA--GVPAGQAAAQTPRSSIPAPTPAAPVTQPAPAAKP 416 S + +PAQP +VT AP A Q + P P P + AP Sbjct: 36 SVHQVIELPAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIE 95 Query: 417 APAPTQVATAKPAPAPAAKPAEKPAAAKPAAGGNWYSGQAPGHYVVQILGTSSEATAQAY 476 P P KP + A S T++ AT++ Sbjct: 96 KPKPKPKPKPKPVKKVEQPKRDVKPVESRPA-----SPFENTAPARPTSSTATAATSKPV 150 Query: 477 VAEQGG 482 + G Sbjct: 151 TSVASG 156
>PF05272#Virulence-associated E family protein Length = 892 Score = 27.3 bits (60), Expect = 0.042 Identities = 8/19 (42%), Positives = 11/19 (57%) Query: 4 LILVGPMGAGKSTIGRLLA 22 ++L G G GKST+ L Sbjct: 599 VVLEGTGGIGKSTLINTLV 617
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 271 bits (695), Expect = 1e-83 Identities = 101/403 (25%), Positives = 180/403 (44%), Gaps = 38/403 (9%) Query: 344 VPWDQALDLVLKTKGLDKRKVGSVLLVAPADEIAARERQELESL--------KQIAELAP 395 + W A D+V L+K S L + + A ER + + IA + Sbjct: 199 LSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQ 258 Query: 396 LRRE--------LLQVNYAKAADIAKLFQSVTS---AESKA-------DERGSITVDERT 437 L R+ ++ + YAKA+D+ ++ ++S +E +A D+ I +T Sbjct: 259 LDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQT 318 Query: 438 NNIIAYQTQDRLDELRRIVSQLDIPVRQVMIEARIVEANVDYDKQIGVRWGGRTDRSRKW 497 N +I D +++L R+++QLDI QV++EA I E +G++W + ++ Sbjct: 319 NALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQF 378 Query: 498 SVGGLDDNGDEAGNTGNDLTANIPFVDLGAPDATAGVGIGFLTNNALLDLELSAMEKTGN 557 + GL + AG + + A + G+ GF N + L+A+ + Sbjct: 379 TNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGN--WAMLLTALSSSTK 436 Query: 558 GEIVSQPKVVTSDKETAKILKGTEIPYQESSSSG-----ATTVSFKEASLSLEVTPQITP 612 +I++ P +VT D A G E+P S + TV K + L+V PQI Sbjct: 437 NDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINE 496 Query: 613 DNRIIMEVKVTKDEPDY----LNAVLGVPPIKKNEVNAKVLISDGETIVIGGVFSNTQSK 668 + +++E++ ++ LG VN VL+ GET+V+GG+ + S Sbjct: 497 GDSVLLEIEQEVSSVADAASSTSSDLGAT-FNTRTVNNAVLVGSGETVVVGGLLDKSVSD 555 Query: 669 VVEKVPFLGDVPYLGRLFRRDVVAEAKSELLVFLTPRIMNNQA 711 +KVP LGD+P +G LFR +K L++F+ P ++ ++ Sbjct: 556 TADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRD 598 Score = 43.8 bits (103), Expect = 2e-06 Identities = 31/179 (17%), Positives = 69/179 (38%), Gaps = 10/179 (5%) Query: 304 SLNFQDIDVRSVLQLIADFTNLNLVASDTVQGGITLRLQN-VPWDQALDL---VLKTKGL 359 S +F+ D++ + ++ N ++ +V+G IT+R + + +Q VL G Sbjct: 31 SASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLSVLDVYGF 90 Query: 360 DKRKVG-SVLLVAPADEIAARERQELESLKQIAELAPLRRELLQVNYAKAADIAKLFQSV 418 + VL V + + A + S + ++ + A D+A L + + Sbjct: 91 AVINMNNGVLKVVRSKD-AKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLAPLLRQL 149 Query: 419 TSAESKADERGSITVDERTNNIIAYQTQDRLDELRRIVSQLDIPVRQVMIEARIVEANV 477 GS+ E +N ++ + L IV ++D + ++ + A+ Sbjct: 150 ND----NAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDRSVVTVPLSWASA 204
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 32.8 bits (75), Expect = 0.002 Identities = 40/174 (22%), Positives = 67/174 (38%), Gaps = 37/174 (21%) Query: 182 LAAQLGNG---HDELTVAVVDIGATMTTLSVLHHGRIIYTREQLFGGRQLTEEI----QR 234 +AA +G G + VVDIG T ++V+ ++Y+ GG + E I +R Sbjct: 145 MAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRR 204 Query: 235 RYGLSMEE--AGLAKKQGG--LPDDYVSEVLEPFKD------------------ALVQQV 272 YG + E A K + G P D V E+ ++ AL + + Sbjct: 205 NYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPL 264 Query: 273 SRSLQFFFAAGQYNSVDH--------IMLAGGTASISGLEHLIQRRLGTPTQVA 318 + + A + + ++L GG A + L+ L+ G P VA Sbjct: 265 TGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVA 318
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 28.6 bits (63), Expect = 0.022 Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 1/43 (2%) Query: 145 LFSSNPRGENQEGWQGERYGSYHDLESWRALLTEAGFAELEHY 187 LF + P+ E GW+GE DLE R T+ FAE E + Sbjct: 107 LFGAKPQTELPLGWKGEPLSGAPDLEGMRVAETDK-FAEGESH 148
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 492 bits (1267), Expect = e-159 Identities = 241/1053 (22%), Positives = 444/1053 (42%), Gaps = 72/1053 (6%) Query: 7 LSEWALKHQSFVWYLMFVALLMGVFSYMKLGREEDPSFTIKTMVIQTRWPGATVDETLEQ 66 ++ + ++ F W L + ++ G + ++L + P+ + + +PGA + Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 67 VTDRIEKKLEELDSLDYVKSYT-RPGESTVMV-FLRDTTSAEAIPEIWYQVRKKIDDIRG 124 VT IE+ + +D+L Y+ S + G T+ + F T A QV+ K+ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQ----VQVQNKLQLATP 116 Query: 125 QFPQGLQGP-AFNDEFGDVYGSIYAFTADGFSMRQ--LRDYVEKVRVD-IRSVEGLGKVE 180 PQ +Q ++ Y + F +D Q + DYV D + + G+G V+ Sbjct: 117 LLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQ 176 Query: 181 MVGQQDEV-IYLNFSTRKLAALGLDQRQVVQSLQSQNAVTPAGVIEAGPE------RISV 233 + G Q + I+L+ L L V+ L+ QN AG + P S+ Sbjct: 177 LFGAQYAMRIWLDAD--LLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASI 234 Query: 234 RTSGQFASEKDLAAVNLRLNDRFY--RLSDIADITRGYTDPPKPLFRYNGKPAIGLAIAM 291 +F + ++ V LR+N RL D+A + G + + R NGKPA GL I + Sbjct: 235 IAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELG-GENYNVIARINGKPAAGLGIKL 293 Query: 292 KKGGNIQAFGKALHERMDATTAELPVGVGVHKVSDQAEVVDKAVGGFTSALFEAVIIVLV 351 G N KA+ ++ P G+ V D V ++ LFEA+++V + Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353 Query: 352 VSFISLG-VRAGLVVACSIPLVLAMVFVFMEYSGITMQRISLGALIIALGLLVDDAMITV 410 V ++ L +RA L+ ++P+VL F + G ++ +++ +++A+GLLVDDA++ V Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413 Query: 411 EMMVTRLEMGETKEQAATY-AYTSTAFPMLTGTLVTVAGFVPIGLNNSSAGEYTFTLFAV 469 E + + + + AT + + ++ +V A F+P+ S G Sbjct: 414 ENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSIT 473 Query: 470 IAVAMLVSWVVAVLFAPVIGVHILSSNIKPKSEEPGRVGRAFNS-----------SMIWA 518 I AM +S +VA++ P + +L E G FN+ S+ Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533 Query: 519 MRHRWLAIGITLALFAASLFSMQFVQSQFFPSSDRPEILVDLNLPQNASVNETRKVVDRF 578 + + I + A + + S F P D+ L + LP A+ T+KV+D+ Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593 Query: 579 -EASLKDD-PDIERWSTYIGQGALRFYLPLDQQLENPFYAQLVIVSKGLEERGALTARLQ 636 + LK++ ++E T G Q +N A + K EER + Sbjct: 594 TDYYLKNEKANVESVFTVNGFS-------FSGQAQNAGMAF--VSLKPWEERNGDENSAE 644 Query: 637 K---RLREDFVGI-GSYVQPLEMGPPV-----GRPLQYRV---SGEDVDKVRQHAIELAT 684 R + + I +V P M P + + + +G D + Q +L Sbjct: 645 AVIHRAKMELGKIRDGFVIPFNM-PAIVELGTATGFDFELIDQAGLGHDALTQARNQLLG 703 Query: 685 LLDQN-SHVGEVIYDWNEPGKVLRIDINQDKARQLGLSSEDVANLMNSVVSGSAVTQVRD 743 + Q+ + + V + E +++++Q+KA+ LG+S D+ +++ + G+ V D Sbjct: 704 MAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFID 763 Query: 744 DIYLINVVGRAEDAERGTPETLQNLQIVTPSGASIPLLAFATVGYELEQPLVWRRDRKPT 803 + + +A+ R PE + L + + +G +P AF T + P + R + P+ Sbjct: 764 RGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPS 823 Query: 804 ITVKGAVRDEIQPTDLVKQLKPEIDKFAAGLPVGYKVATGGTVEESSKAQGPIASVVPLM 863 + ++G E P ++ A+ LP G G + + ++V + Sbjct: 824 MEIQG----EAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAIS 879 Query: 864 LFLMATFLMIQLHSVQKMFLVASVAPLGLIGVVLALIPTGTPLGFVAILGVLALIGIIIR 923 ++ L S V V PLG++GV+LA ++G+L IG+ + Sbjct: 880 FVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAK 939 Query: 924 NSVILVTQI-DAYEISGYLPWDAVVEATEHRRRPILLTAAAASLGMIPIA------REVF 976 N++++V D E G +A + A R RPIL+T+ A LG++P+A Sbjct: 940 NAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQ 999 Query: 977 WGPMAYAMIGGIIIATLLTLLFLPALYVAWYRI 1009 + ++GG++ ATLL + F+P +V R Sbjct: 1000 N-AVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031 Score = 93.7 bits (233), Expect = 2e-21 Identities = 86/505 (17%), Positives = 180/505 (35%), Gaps = 30/505 (5%) Query: 6 NLSEWALKHQSFVWYLMFVALLMGVFS-YMKLGREEDPSFTIKTMVIQTRWP-GATVDET 63 N L + + L++ ++ G+ +++L P + + P GAT + T Sbjct: 528 NSVGKILGS-TGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERT 586 Query: 64 ---LEQVTDRIEKKLEELDSLDYVKS-YTRPGEST------VMVFLRD--TTSAEAIPEI 111 L+QVTD K + + + ++ G++ V + + + + Sbjct: 587 QKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAV 646 Query: 112 WYQVRKKIDDIRGQFPQGLQGPAFNDEFGDVYGSIYAFTADGFSMRQLRDYVEKVRVDIR 171 ++ + ++ IR F PA + G L ++ Sbjct: 647 IHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAA 706 Query: 172 SV-EGLGKVEMVGQQDEV-IYLNFSTRKLAALGLDQRQVVQSLQSQNAVTPAG-VIEAGP 228 L V G +D L K ALG+ + Q++ + T I+ G Sbjct: 707 QHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGR 766 Query: 229 E-RISVRTSGQFASE-KDLAAVNLRL-NDRFYRLSDIADITRGYTDPPKPLFRYNGKPAI 285 ++ V+ +F +D+ + +R N S Y L RYNG P++ Sbjct: 767 VKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYG--SPRLERYNGLPSM 824 Query: 286 GLAIAMKKGGNIQAFGKALHERMDATTAELPVGVGVHKVSDQAEVVDKAVGGFTSALFEA 345 + G + G A+ M+ ++LP G+G + + + + + + Sbjct: 825 EIQGEAAPGTSS---GDAM-ALMENLASKLPAGIGY-DWTGMSYQERLSGNQAPALVAIS 879 Query: 346 VIIVLVVSFISL-GVRAGLVVACSIPLVLAMVFVFMEYSGITMQRISLGALIIALGLLVD 404 ++V + + V +PL + V + + L+ +GL Sbjct: 880 FVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAK 939 Query: 405 DAMITVEMMVTRLEM-GETKEQAATYAYTSTAFPMLTGTLVTVAGFVPIGLNNSSAGEYT 463 +A++ VE +E G+ +A A P+L +L + G +P+ ++N + Sbjct: 940 NAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQ 999 Query: 464 FTLFAVIAVAMLVSWVVAVLFAPVI 488 + + M+ + ++A+ F PV Sbjct: 1000 NAVGIGVMGGMVSATLLAIFFVPVF 1024 Score = 90.7 bits (225), Expect = 2e-20 Identities = 84/526 (15%), Positives = 184/526 (34%), Gaps = 43/526 (8%) Query: 518 AMRHRWLAIGITLALFAASLFSMQFVQSQFFPSSDRPEILVDLNLP-QNASVNETRKVVD 576 +R A + + L A ++ + +P+ P + V N P +A + V Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT-VTQ 63 Query: 577 RFEASLKDDPDIERW---STYIGQGALRFYLPLDQQLENPFYAQLVIVSKGLEERGALTA 633 E ++ ++ S G + +P AQ+ + +K Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGT---DPDIAQVQVQNK--------LQ 112 Query: 634 RLQKRLREDFVGIGSYVQPLEMGPPVGRPLQYRVSGEDVDKVRQHAIE-LATLLDQNSHV 692 L ++ G V+ + G D + + + L + + V Sbjct: 113 LATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGV 172 Query: 693 GEVIYDWNEPGKVLRIDINQDKARQLGLSSEDVANLMNS----VVSGSAVTQVRDDIYLI 748 G+V + +RI ++ D + L+ DV N + + +G + Sbjct: 173 GDVQLFGAQ--YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQL 230 Query: 749 NVVGRAEDAERGTPETLQNLQI-VTPSGASIPLLAFATV--GYELEQPLVWRRDRKPTIT 805 N A+ + PE + + V G+ + L A V G E + KP Sbjct: 231 NASIIAQTRFK-NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARING-KPAAG 288 Query: 806 VKGAVRDEIQPTDLVKQLKPEIDKFAAGLPVGYKVA----TGGTVEES-SKAQGPIASVV 860 + + D K +K ++ + P G KV T V+ S + + + Sbjct: 289 LGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAI 348 Query: 861 PLMLFLMATFLMIQLHSVQKMFLVASVAPLGLIGVVLALIPTGTPLGFVAILGVLALIGI 920 L+ +M FL +++ + P+ L+G L G + + + G++ IG+ Sbjct: 349 MLVFLVMYLFL----QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGL 404 Query: 921 IIRNSVILVTQIDAYEISGYL-PWDAVVEATEHRRRPILLTAAAASLGMIPIA-----RE 974 ++ +++++V ++ + L P +A ++ + ++ A S IP+A Sbjct: 405 LVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 975 VFWGPMAYAMIGGIIIATLLTLLFLPALYVAWYRIKEPTDEQRREA 1020 + + ++ + ++ L+ L+ PAL + + + Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGG 510
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 41.0 bits (96), Expect = 6e-06 Identities = 19/128 (14%), Positives = 53/128 (41%), Gaps = 13/128 (10%) Query: 92 QNNVRGRQGDLANVQAQWINAQANARRQQELFDRGVGAQAQLDIALTNLKTAQSSLDQAK 151 N +R + L ++++ ++A+ + +LF + L L+ ++ Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEI---------LDKLRQTTDNIGLLT 315 Query: 152 AAEQQARDQLSYSDLRSDHDAVVTEWKVEA-GQVVTAGQEVVTLARPDIKEAVIDMPAQL 210 + ++ S +R+ V + KV G VVT + ++ + P+ + +++ A + Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIV-PE--DDTLEVTALV 372 Query: 211 ADQLPDDV 218 ++ + Sbjct: 373 QNKDIGFI 380 Score = 37.9 bits (88), Expect = 6e-05 Identities = 18/101 (17%), Positives = 31/101 (30%), Gaps = 7/101 (6%) Query: 62 VSGRIASRHVDVGSEVKKGDLLATLDPTDQQNNVRGRQGDLANVQAQWINAQANARRQQE 121 + + V G V+KGD+L L G + D Q+ + A+ R Q Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTAL-------GAEADTLKTQSSLLQARLEQTRYQI 155 Query: 122 LFDRGVGAQAQLDIALTNLKTAQSSLDQAKAAEQQARDQLS 162 L + S ++ ++Q S Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFS 196
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 43.7 bits (103), Expect = 9e-07 Identities = 22/144 (15%), Positives = 48/144 (33%), Gaps = 30/144 (20%) Query: 55 DIQARVQTQLSFRVNGKIIQRN---------VDVGDRVKANQVLARLDPKDLQINVDSAQ 105 +I A +L+ K I+ V G+ V+ VL +L + + Q Sbjct: 81 EIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQ 140 Query: 106 ASVA---AEQARVS------------------QTRAAFVRQQKLLPKGYTSQSEYDSAQA 144 +S+ EQ R + V ++++L + ++ + Q Sbjct: 141 SSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQN 200 Query: 145 ALRGSESSLKAAQAQLANAREQLS 168 E +L +A+ +++ Sbjct: 201 QKYQKELNLDKKRAERLTVLARIN 224 Score = 39.0 bits (91), Expect = 2e-05 Identities = 15/162 (9%), Positives = 52/162 (32%), Gaps = 5/162 (3%) Query: 47 AASVALTGDIQARVQTQLSFRVNGKIIQRNVDVGDRVKANQVLARLDPKDLQINVDSAQA 106 + + + ++ + +D + Q +A+ + + A Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVN 266 Query: 107 SVAAEQARVSQTRAAFVR-QQKLLPKGYTSQSEYDSAQAALRGSESSLKAAQAQLANARE 165 + ++++ Q + + +++ ++E LR + ++ +LA E Sbjct: 267 ELRVYKSQLEQIESEILSAKEEYQLVTQLFKNE---ILDKLRQTTDNIGLLTLELAKNEE 323 Query: 166 QLSYTALVAEAPGVITARQA-EVGQVVQATVPIFDLARDGER 206 + + + A + + G VV + + + + Sbjct: 324 RQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT 365