>PF00577#Outer membrane usher protein FimD Length = 878 Score = 764 bits (1975), Expect = 0.0 Identities = 331/870 (38%), Positives = 487/870 (55%), Gaps = 54/870 (6%) Query: 6 IVVGLTAGTCLIFSQNLMAEVSVFNPALLEIDHQSGVDIRQFNRANLMPPGVYSVDIFIN 65 V L L + FNP L D Q+ D+ +F +PPG Y VDI++N Sbjct: 26 FFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLN 85 Query: 66 GKMFERQDVTFVQDNPDADLHACFIAIKKTLSSFGIKVDALKSFNDVDETVCLDPAPRIE 125 +DVTF + + + C L+S G+ ++ N + + C+ I Sbjct: 86 NGYMATRDVTFNTGDSEQGIVPCLTR--AQLASMGLNTASVSGMNLLADDACVPLTSMIH 143 Query: 126 GSSWQFDSDKLQLNISIPQIYMDAMAYDYISPTRWDEGINALTINYDFSGSHTLRSDYGS 185 ++ Q D + +LN++IPQ +M A YI P WD GINA +NY+FSG+ + Sbjct: 144 DATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSV--QNRIG 201 Query: 186 QETDTSYLNLRNGLNIGPWRLRNYSTLN------TSDGRAEYNSISTWIQRDIAALRSQI 239 + +YLNL++GLNIG WRLR+ +T + +S + ++ I+TW++RDI LRS++ Sbjct: 202 GNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRL 261 Query: 240 MIGDTWTASDIFDSTQIRGARLYTDNDMLPASQNGFAPVVRGIAKSNATVIIRQNGYVIY 299 +GD +T DIFD RGA+L +D++MLP SQ GFAPV+ GIA+ A V I+QNGY IY Sbjct: 262 TLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIY 321 Query: 300 QSAVPQGAFEITDLNTASTGGDLDVTIKEEDGSEQRFTQPYASLAILKREGQTDVDVSVG 359 S VP G F I D+ A GDL VTIKE DGS Q FT PY+S+ +L+REG T ++ G Sbjct: 322 NSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAG 381 Query: 360 ELRDEDG--FTPDVLQAQILHGFSHGITLYGGMQAAENYGSAALGVGKDLGALGAISFDV 417 E R + P Q+ +LHG G T+YGG Q A+ Y + G+GK++GALGA+S D+ Sbjct: 382 EYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDM 441 Query: 418 THARANFSHDDTETGQSYRFLYSKRFDDTDTSLRLVGYRYSTEGYYTLNEWASRRNS--- 474 T A + D GQS RFLY+K +++ T+++LVGYRYST GY+ + R + Sbjct: 442 TQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYN 501 Query: 475 --------------PEDFWETGNRRSRVEGTLTQSLGRDYGNLYLTLSRQQYWHTDDVER 520 + + N+R +++ T+TQ LGR LYL+ S Q YW T +V+ Sbjct: 502 IETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSNVDE 560 Query: 521 LMQFGYSSSWKRLSWNVSWSYSNTARQRTGNNHASDNTSEQIYMLSLSVPLSGW------ 574 Q G +++++ ++W +S+S + A Q+ + + L++++P S W Sbjct: 561 QFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQ---------MLALNVNIPFSHWLRSDSK 611 Query: 575 --WGNSYATYSVSQNDNSGSSHQLGLSGTALERNNLSWNLMQSYNSHDDEVGGN---MSL 629 W ++ A+YS+S + N ++ G+ GT LE NNLS+++ Y D G+ +L Sbjct: 612 SQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATL 671 Query: 630 TYDGSYGTVNGSYNYSQNSQRLNYGIRGGILAHSEGVTLSQELGETIALVKAPGAAGLEI 689 Y G YG N Y++S + ++L YG+ GG+LAH+ GVTL Q L +T+ LVKAPGA ++ Sbjct: 672 NYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKV 731 Query: 690 DNMRGAATDWRGYTVKTQLNPYDENRVAISDNYFSKSNIELDNTVVTMVPTRGAVVKAEF 749 +N G TDWRGY V Y ENRVA+ N + N++LDN V +VPTRGA+V+AEF Sbjct: 732 ENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLA-DNVDLDNAVANVVPTRGAIVRAEF 790 Query: 750 VTHVGYRVLFRVLNANGKPVPFGAIAAIQDASLADSGIVGDRGELYLSGLPEKGQVTLSW 809 VG ++L L N KP+PFG A + S SGIV D G++YLSG+P G+V + W Sbjct: 791 KARVGIKLLMT-LTHNNKPLPFG--AMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKW 847 Query: 810 GENASTKCIFNYSLSTPESESGLIEQGVTC 839 GE + C+ NY L + L + C Sbjct: 848 GEEENAHCVANYQLPPESQQQLLTQLSAEC 877
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 29.2 bits (65), Expect = 0.048 Identities = 23/80 (28%), Positives = 31/80 (38%), Gaps = 10/80 (12%) Query: 367 LGDAEIGDNVNIGAGTITCNYDGANKFKTIIGDDVFVGSDTQLVAPVTVGKGATIAAGTT 426 LGD + D V + AG+ N G DV T G AT A T Sbjct: 616 LGDGD--DKVFLSAGSA--NIYAGK------GHDVVYYDKTDTGYLTIDGTKATEAGNYT 665 Query: 427 VTRNVGENALAISRVPQTQK 446 VTR +G + + V + Q+ Sbjct: 666 VTRVLGGDVKVLQEVVKEQE 685
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 28.1 bits (62), Expect = 0.025 Identities = 19/59 (32%), Positives = 24/59 (40%), Gaps = 8/59 (13%) Query: 68 FSVNGVAGANAFAN-GGGSTTCCGDVSGDTAEVIWTLSTTRTQYNAGMRLEKRNMTLPL 125 FSV + A AN G G G E QYNAG++L+ N+ L L Sbjct: 1466 FSVTPILSARYDANQGSGKINVNGYDFAYNVENQQ-------QYNAGLKLKYHNVKLSL 1517
>PHPHLIPASEA1#Bacterial phospholipase A1 protein signature. Length = 289 Score = 499 bits (1286), Expect = 0.0 Identities = 289/289 (100%), Positives = 289/289 (100%) Query: 1 MRTLQGWLLPVFMLPMAVYAQEATVKEVHDAPAVRGSIIANMLQEHDNPFTLYPYDTNYL 60 MRTLQGWLLPVFMLPMAVYAQEATVKEVHDAPAVRGSIIANMLQEHDNPFTLYPYDTNYL Sbjct: 1 MRTLQGWLLPVFMLPMAVYAQEATVKEVHDAPAVRGSIIANMLQEHDNPFTLYPYDTNYL 60 Query: 61 IYTQTSDLNKEAIASYDWAENARKDEVKFQLSLAFPLWRGILGPNSVLGASYTQKSWWQL 120 IYTQTSDLNKEAIASYDWAENARKDEVKFQLSLAFPLWRGILGPNSVLGASYTQKSWWQL Sbjct: 61 IYTQTSDLNKEAIASYDWAENARKDEVKFQLSLAFPLWRGILGPNSVLGASYTQKSWWQL 120 Query: 121 SNSEESSPFRETNYEPQLFLGFATDYRFAGWTLRDVEMGYNHDSNGRSDPTSRSWNRLYT 180 SNSEESSPFRETNYEPQLFLGFATDYRFAGWTLRDVEMGYNHDSNGRSDPTSRSWNRLYT Sbjct: 121 SNSEESSPFRETNYEPQLFLGFATDYRFAGWTLRDVEMGYNHDSNGRSDPTSRSWNRLYT 180 Query: 181 RLMAENGNWLVEVKPWYVVGNTDDNPDITKYMGYYQLKIGYHLGDAVLSAKGQYNWNTGY 240 RLMAENGNWLVEVKPWYVVGNTDDNPDITKYMGYYQLKIGYHLGDAVLSAKGQYNWNTGY Sbjct: 181 RLMAENGNWLVEVKPWYVVGNTDDNPDITKYMGYYQLKIGYHLGDAVLSAKGQYNWNTGY 240 Query: 241 GGAELGLSYPITKHVRLYTQVYSGYGESLIDYNFNQTRVGVGVMLNDLF 289 GGAELGLSYPITKHVRLYTQVYSGYGESLIDYNFNQTRVGVGVMLNDLF Sbjct: 241 GGAELGLSYPITKHVRLYTQVYSGYGESLIDYNFNQTRVGVGVMLNDLF 289
>SECA#SecA protein signature. Length = 901 Score = 30.2 bits (68), Expect = 0.004 Identities = 11/71 (15%), Positives = 30/71 (42%) Query: 14 AKARRKTREELDQEARDRKRQKKRRGHAPGSRAAGGNTTSGSKGQNAPKDPRIGSKTPIP 73 +K + + EE+++ + R+ + +R ++ + + + ++G P P Sbjct: 827 SKVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCP 886 Query: 74 LGVTEKVTKQH 84 G +K + H Sbjct: 887 CGSGKKYKQCH 897
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 601 bits (1550), Expect = 0.0 Identities = 206/478 (43%), Positives = 300/478 (62%), Gaps = 11/478 (2%) Query: 1 MQRGIVWVVDDDSSIRWVLERALAGAGLTCTTFENGAEVLEALASKTPDVLLSDIRMPGM 60 M + V DDD++IR VL +AL+ AG N A + +A+ D++++D+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVERAIS 120 + LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 HYQEQQQPRNIQLNGPTTDIIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180 + + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A Sbjct: 121 EPKRRPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179 Query: 181 LHRHSPRAKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTIRQGRFEQADGGTLFLDE 240 LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239 Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHR 300 IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L+Q + +G FREDL++R Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299 Query: 301 LNVIRVHLPPLRERREDIPRLARHFLQVAARELGVEAKLLHPETEAALTRLAWPGNVRQL 360 LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K E + WPGNVR+L Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358 Query: 361 ENTCRWLTVMAAGQEVLIQDLPGELFESTVAESTSQMQPDSWATLLAQWADRALRS---- 416 EN R LT + + + + EL + S + ++Q + +R Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418 Query: 417 -----GHQNLLSEAQPELERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469 L E+E L+ AL T+G++ +AA LLG RNTL +K++ELG+ Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>PF06580#Sensor histidine kinase Length = 349 Score = 28.3 bits (63), Expect = 0.042 Identities = 34/190 (17%), Positives = 72/190 (37%), Gaps = 41/190 (21%) Query: 171 IIEQADRLRNLVDRL---LGPQLPGTRVTE-SIHKVAERV---VTLVSMELPDNVRLIRD 223 I+E + R ++ L + L + + S+ V + L S++ D ++ Sbjct: 186 ILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ 245 Query: 224 YDPSLPELAHDPDQIEQVLLN-IVRNALQ---ALGPEGGEIILRTRTAFQLTLHGERYRL 279 +P++ ++ Q+ +L+ +V N ++ A P+GG+I+L+ Sbjct: 246 INPAIMDV-----QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGT------KDNGTVT- 293 Query: 280 AARIDVEDNGPGIPPHLQDTLFYPMVSGREGGTGLGLSIARNLIDQHSGK---IEFTSWP 336 ++VE+ G + ++ TG GL R + G I+ + Sbjct: 294 ---LEVENTGSLALKNTKE------------STGTGLQNVRERLQMLYGTEAQIKLSEKQ 338 Query: 337 GHTEFSVYLP 346 G V +P Sbjct: 339 GKVNAMVLIP 348
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 180 bits (458), Expect = 4e-51 Identities = 97/445 (21%), Positives = 170/445 (38%), Gaps = 81/445 (18%) Query: 4 KLRNIAIIAHVDHGKTTLVDKLLQQSGTFDSRAETQE--RVMDSNDLEKERGITILAKNT 61 K+ NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKAFAYGL 121 + +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + G+ Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121 Query: 122 KPIVVINKVDRPGARPDWVVDQVFD-------------LFVNLDATDEQLD--------- 159 I INK+D+ G V + + L+ N+ T+ Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181 Query: 160 --------------------------------FPIVYASALNGIAGLDHEDMAEDMTPLY 187 FP+ + SA N I G+D+ L Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNI-GIDN---------LI 231 Query: 188 QAIVDHVPAPDVDLDGPFQMQISQLDYNSYVGVIGIGRIKRGKVKPNQQVTIIDSEGKTR 247 + I + + ++ +++Y+ + R+ G + V I + E Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-- 289 Query: 248 NAKVGKVLGHLGLERIETDLAEAGDIVAITGLGELNISDTVCDTQNVEALPALSVDEPTV 307 K+ ++ + E + D A +G+IV + L ++ + DT+ + + P + Sbjct: 290 --KITEMYTSINGELCKIDKAYSGEIVILQNEF-LKLNSVLGDTKLLPQRERIENPLPLL 346 Query: 308 SMFFCVNTSPFCGKEGKFVTSRQILDRLNKELVHNVALRVEETEDADAFRVSGRGELHLS 367 + + D L LR +S G++ + Sbjct: 347 QTTVEPSKPQQREMLLDALLEISDSDPL---------LRYYVDSATHEIILSFLGKVQME 397 Query: 368 VLIENMRRE-GFELAVSRPKVIFRE 391 V ++ + E+ + P VI+ E Sbjct: 398 VTCALLQEKYHVEIEIKEPTVIYME 422 Score = 32.5 bits (74), Expect = 0.005 Identities = 13/75 (17%), Positives = 29/75 (38%), Gaps = 1/75 (1%) Query: 398 EPYENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 457 EPY + + +++ + ++ + V L IP+R + +RS+ Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595 Query: 458 MTSGTGLLYSTFSHY 472 T+G + + Y Sbjct: 596 FTNGRSVCLTELKGY 610
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.5 bits (66), Expect = 0.028 Identities = 31/161 (19%), Positives = 64/161 (39%), Gaps = 15/161 (9%) Query: 227 NVFFVYAVYCGLTFFIPFLKNIYLLP----------VALVGAYGIINQYCLKMIGGPIGG 276 N+ F+ V CG F + ++P A +G+ I +I G IGG Sbjct: 255 NIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGG 314 Query: 277 MISDKILKSPSKYLCYTFIISTAALVLLIMLPHESMPVYLGMACTLGFGAIVFTQRAVFF 336 ++ D+ + P L + + + L E+ ++ + G + FT+ Sbjct: 315 ILVDR--RGPLYVLNIGVTFLSVSFLTASFLL-ETTSWFMTIIIVFVLGGLSFTK--TVI 369 Query: 337 APIGEAKIAENKTGAAMALGSFIGYAPAMFCFSLYGYILDL 377 + I + + + + GA M+L +F + ++ G +L + Sbjct: 370 STIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSI 410
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 42.0 bits (98), Expect = 2e-06 Identities = 32/155 (20%), Positives = 64/155 (41%), Gaps = 5/155 (3%) Query: 79 LTPEQRQLLEQMQADMRQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQQSR 138 + +QAD+ P+ E+ ++ P + +AE + Q+S+ Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK--QESK 1049 Query: 139 TTEQSWQQQT-RTSQAAPVQAQPRQSKPASTQQPYQDLLQTPAHTTAQSKPQQAAPVARA 197 T E++ Q T T+Q V + + + A+TQ + T ++ ++ A V + Sbjct: 1050 TVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKE 1109 Query: 198 ADAPKPTAEKKDERRWMVQCGSFRGAEQAETVRAQ 232 A T + ++ + Q + EQ+ETV+ Q Sbjct: 1110 EKAKVETEKTQEVPKVTSQVSPKQ--EQSETVQPQ 1142
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 79.5 bits (196), Expect = 3e-18 Identities = 57/198 (28%), Positives = 87/198 (43%), Gaps = 13/198 (6%) Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGAARAFDQIDNAPEEKARGITINTS 66 +N+G + HVD GKTTLT ++ T L G R DN E+ RGITI T Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59 Query: 67 HVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126 + +D PGH D++ + + +DGAIL+++A DG QTR R++ Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119 Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQYDFPGDDTPIVRGSALKALEGDAEWE 186 G+P I F+NK D + L V +++E LS + + +W+ Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176 Query: 187 AKILELAGFLDSYIPEPE 204 I L+ Y+ Sbjct: 177 TVIEGNDDLLEKYMSGKS 194
>SECETRNLCASE#Bacterial translocase SecE signature. Length = 127 Score = 165 bits (420), Expect = 2e-56 Identities = 127/127 (100%), Positives = 127/127 (100%) Query: 1 MSANTEAQGSGRGLEAMKWVVVVALLLVAIVGNYLYRDIMLPLRALAVVILIAAAGGVAL 60 MSANTEAQGSGRGLEAMKWVVVVALLLVAIVGNYLYRDIMLPLRALAVVILIAAAGGVAL Sbjct: 1 MSANTEAQGSGRGLEAMKWVVVVALLLVAIVGNYLYRDIMLPLRALAVVILIAAAGGVAL 60 Query: 61 LTTKGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVS 120 LTTKGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVS Sbjct: 61 LTTKGKATVAFAREARTEVRKVIWPTRQETLHTTLIVAAVTAVMSLILWGLDGILVRLVS 120 Query: 121 FITGLRF 127 FITGLRF Sbjct: 121 FITGLRF 127
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 27.9 bits (62), Expect = 0.011 Identities = 12/57 (21%), Positives = 22/57 (38%), Gaps = 7/57 (12%) Query: 57 VILKAAGANKVAVIKAVRGATGLGLKEAKDLVESAPAALKEGVSKDDAEALKKALEE 113 I A GAN + KA++ + +L P +K D ++ ++ E Sbjct: 290 GIKLATGANALDTAKAIK-------AKLAELQPFFPQGMKVLYPYDTTPFVQLSIHE 339
>OMADHESIN#Yersinia outer membrane adhesin signature. Length = 455 Score = 30.6 bits (68), Expect = 0.004 Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 8/58 (13%) Query: 127 ALGHVFPTQTKQMPSAPQGLEQLARHVERLADYPTVAIGGISLARAPAVIATGVGSIA 184 ALG +P + GL A+ + ++AIG + A A +A G GSIA Sbjct: 46 ALGLEYPVRPP--VPGAGGLNASAKGIH------SIAIGATAEAAKGAAVAVGAGSIA 95
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 29.5 bits (66), Expect = 0.040 Identities = 21/84 (25%), Positives = 35/84 (41%), Gaps = 17/84 (20%) Query: 459 FYTLGPLTTDIAPGYDHFT---SGIGAAMIGWFGCAMLCYVTPKEHLGLPNKEDVKQGLI 515 F +LG + T +AP YD+F + + ++ G V P + L V Q Sbjct: 161 FASLGMVVTALAPSYDYFIFYQTLVITPILFLSGA-----VFPVDQL-----PIVFQTAA 210 Query: 516 TYKIAAHAADLAK----GHPGAQI 535 + +H+ DL + GHP + Sbjct: 211 RFLPLSHSIDLIRPIMLGHPVVDV 234
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 120 bits (302), Expect = 2e-39 Identities = 50/89 (56%), Positives = 66/89 (74%) Query: 2 NKTQLIDVIAEKAELSKTQAKAALESTLAAITESLKEGDAVQLVGFGTFKVNHRAERTGR 61 NK LI +AE EL+K + AA+++ +A++ L +G+ VQL+GFG F+V RA R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEIKIAAANVPAFVSGKALKDAVK 90 NPQTG+EIKI A+ VPAF +GKALKDAVK Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>PF06580#Sensor histidine kinase Length = 349 Score = 36.0 bits (83), Expect = 3e-04 Identities = 49/262 (18%), Positives = 104/262 (39%), Gaps = 43/262 (16%) Query: 197 ILFALATVLLA-SVLSFFW-YRRYLRSRQLLQDEMKRKEKLVALGHLAAGV-AHEIRNPL 253 I+F + V S+L F W + + + ++ Q +M + L L A + H + N L Sbjct: 120 IIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNAL 179 Query: 254 SSIKGLAKYFAERAPAGGEAHQLAQVM---AKEADRLNRVVSELLELVKPTHLALQAVDL 310 ++I+ L +A L+++M + ++ +++ L +V ++L L ++ Sbjct: 180 NNIRALILEDPTKAREM--LTSLSELMRYSLRYSNARQVSLADELTVVD-SYLQLASIQF 236 Query: 311 NTLINHSLQLVSQDANSREIQLRFTANDTLPEIQADPDRLTQVLL-NLYLNAIQAIGQHG 369 + Q+ + ++Q+ P L Q L+ N + I + Q G Sbjct: 237 EDRLQFENQI---NPAIMDVQV--------------PPMLVQTLVENGIKHGIAQLPQGG 279 Query: 370 VISVTVSESGAGVKISVTDSGKGIAADQLEAIFTPYFTTKAEGTGLGLAVVHNIVEQHGG 429 I + ++ V + V ++G + E TG GL V ++ G Sbjct: 280 KILLKGTKDNGTVTLEVENTGSLALKNT------------KESTGTGLQNVRERLQMLYG 327 Query: 430 ---TIQVASQEGKGSTFTLWLP 448 I+++ ++GK + +P Sbjct: 328 TEAQIKLSEKQGKV-NAMVLIP 348
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.7 bits (79), Expect = 6e-04 Identities = 13/35 (37%), Positives = 18/35 (51%) Query: 32 VVFVGPSGCGKSTLLRMIAGLETITSGDLFIGEKR 66 VV G G GKSTL+ + GL+ + IG + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 26.7 bits (59), Expect = 0.020 Identities = 5/33 (15%), Positives = 13/33 (39%), Gaps = 1/33 (3%) Query: 17 ELVEKR-QRFATILSIIMLAVYIGFILLIAFAP 48 EL+E R +++ ++ + +L Sbjct: 47 ELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQ 79
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 29.9 bits (67), Expect = 0.006 Identities = 6/21 (28%), Positives = 11/21 (52%) Query: 179 FGNLDDPNSEISQLLRQKPTY 199 GNL++P ++ L+ P Sbjct: 75 TGNLEEPAVMVNYFLQGDPYQ 95
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 70.2 bits (172), Expect = 4e-16 Identities = 31/109 (28%), Positives = 50/109 (45%), Gaps = 4/109 (3%) Query: 4 VLIIDDDAMVAELNRRYVAQIPGFQCCGTASTLEKAKEIIFNSDTPIDLILLDIYMQKEN 63 +L+ DDDA + + + +++ G+ S I + DL++ D+ M EN Sbjct: 6 ILVADDDAAIRTVLNQALSRA-GYDVR-ITSNAATLWRWI--AAGDGDLVVTDVVMPDEN 61 Query: 64 GLDLLPVLHNARCKSDVIVISSAADAATIKDSLHYGVVDYLIKPFQASR 112 DLLP + AR V+V+S+ T + G DYL KPF + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE 110
>PF06580#Sensor histidine kinase Length = 349 Score = 41.0 bits (96), Expect = 7e-06 Identities = 21/99 (21%), Positives = 38/99 (38%), Gaps = 18/99 (18%) Query: 442 LIENALE-ALGP-EPGGEISVTLHYRHGWLHCEVNDDGPGIAPDKIDHIFDKGVSTKGSE 499 L+EN ++ + GG+I + +G + EV + G + Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN------------TKES 310 Query: 500 RGVGLALVKQQVENLGG---SIAVESEPGIFTQFFVQIP 535 G GL V+++++ L G I + + G V IP Sbjct: 311 TGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 26.4 bits (58), Expect = 0.012 Identities = 9/28 (32%), Positives = 16/28 (57%) Query: 32 LAIIEHTDVDESLKGQGIGKQLVAKVVE 59 A+IE V + + +G+G L+ K +E Sbjct: 89 YALIEDIAVAKDYRKKGVGTALLHKAIE 116
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 30.2 bits (68), Expect = 0.022 Identities = 36/190 (18%), Positives = 66/190 (34%), Gaps = 14/190 (7%) Query: 44 NHAISLFSAYA-SLVYVTPILGGWLADRLLGNRTAVIAGALLMTLGHVVLGIDTNSTFSL 102 H L + YA P+LG +DR G R ++ + + ++ + L Sbjct: 43 AHYGILLALYALMQFACAPVLGAL-SDRF-GRRPVLLVSLAGAAVDYAIMAT-APFLWVL 99 Query: 103 YLALAIIICGYGLFKSNISCLLGELYDEND-HRRDGGFSLLYAAGNIGSIAAPIACGLAA 161 Y+ + G+ + + + D D R F + A G +A P+ GL Sbjct: 100 YIGRIV----AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMG 155 Query: 162 QWYGWHVGFALAGGGMFIGLLIFLSGHRHFQSTRSMDKKALTSVKF-ALPVWSWLVVMLC 220 + H F A + L FL+G + +++ L L + W M Sbjct: 156 G-FSPHAPFFAAA---ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTV 211 Query: 221 LAPVFFTLLL 230 +A + + Sbjct: 212 VAALMAVFFI 221
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 51.6 bits (123), Expect = 1e-08 Identities = 60/392 (15%), Positives = 118/392 (30%), Gaps = 80/392 (20%) Query: 33 EQAKAAKPAQPEVVEALQSALNALEERKGSL-ERIKQYQEVIDNYPKLSATLRAQLNNMR 91 E + A +Q + +E +Q + E +L + + L +L+N + Sbjct: 39 EVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAK 98 Query: 92 DEPRSVSPGMSTDALNQEILQVSSQLLDKSRQAQQEQERAREIADSLNQLPQQQTDARRQ 151 ++ R +S A + L+ D + + + + + L ++ + Sbjct: 99 EKLRKNDKSLSEKASKIQELEARKA--DLEKALEGAMNFSTADSAKIKTLEAEKAALAAR 156 Query: 152 LNEIERRLGTLTGNTPLNQAQNFALQSDSARLKALVDELELAQLSANNRQELARLRSELA 211 ++E+ L + + A+ L+++ A L+A ELE E A S Sbjct: 157 KADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELE-------KALEGAMNFSTAD 209 Query: 212 EKESQQLDAYLQALRNQLNSQRQLEAERALESTELLAENSADLPKDIVAQFKINRELSAA 271 + + L+A AL + + ST + L + A EL A Sbjct: 210 SAKIKTLEAEKAALAARKADLEKALEGAMNFSTA-DSAKIKTLEAEKAALEARQAELEKA 268 Query: 272 LNQQAQRMDLVASQQRQATSQTLQVRQALNTLREQSQWLGSSNLLGEALRAQVARLPEMP 331 L A +TL+ +A Sbjct: 269 LEGAMNFST-----ADSAKIKTLEAEKA-------------------------------- 291 Query: 332 KPQQLDTEMAQLRVQRLRYEDLLNKQPLLRQIHQADGQPLTAEQNRILEAQLRTQRELLN 391 L+ E A L Q+++L A ++ R L+ Sbjct: 292 ---ALEAEKADLE-----------------------------HQSQVLNANRQSLRRDLD 319 Query: 392 SLLQGGDTLLLELTKLKVSNGQLEDALKEVNE 423 + + L E KL+ N E + + + Sbjct: 320 ASREAKKQLEAEHQKLEEQNKISEASRQSLRR 351 Score = 42.7 bits (100), Expect = 8e-06 Identities = 48/239 (20%), Positives = 92/239 (38%), Gaps = 23/239 (9%) Query: 20 ATAPDSKQISQELEQAKAAKPAQPEVVEALQSALNALEERKGSLER-IKQYQEVIDNYPK 78 A A + + LE A A ++ L++ ALE R+ LE+ ++ Sbjct: 222 ALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSA 281 Query: 79 LSATLRAQLNNMRDEPRSVSPGMSTDALNQEILQVSSQLLDKSRQAQQEQERAREIADSL 138 TL A+ + E + + Q + LD SR+A+++ E + + Sbjct: 282 KIKTLEAEKAALEAEKADL---EHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQ 338 Query: 139 NQLPQQQTDARRQLNEIERRLGTLTGNTPLNQAQNFALQSDSARLKAL--VDELELAQLS 196 N++ ++A RQ + R L L+++ +L+ + E L Sbjct: 339 NKI----SEASRQ--SLRRDLDASR-------EAKKQLEAEHQKLEEQNKISEASRQSLR 385 Query: 197 AN---NRQELARLRSELAEKESQQLDAYLQALRNQLNSQRQLEAERALESTELLAENSA 252 + +R+ ++ L E S +L A + + S++ E E+A +L AE A Sbjct: 386 RDLDASREAKKQVEKALEEANS-KLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKA 443
>SECA#SecA protein signature. Length = 901 Score = 32.2 bits (73), Expect = 0.005 Identities = 26/144 (18%), Positives = 54/144 (37%), Gaps = 6/144 (4%) Query: 282 HVIDAADVRVQENIEAVNTVLEEIDAHEIPTLLVMNKIDMLEDFEPRIDRDEENK-PIRV 340 ++D +DV N + IDA+ P L ++ + + R+ D + PI Sbjct: 665 ELLDVSDVSETINSIREDVFKATIDAYIPPQSL--EEMWDIPGLQERLKNDFDLDLPIAE 722 Query: 341 WLSAQTGAGIPQLFQALTERLSGEVAQHTLRLPPQEGRLRSRFYQLQAIEKEWMEEDGSV 400 WL + L + + + + + + R + LQ ++ W E ++ Sbjct: 723 WLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSLWKEHLAAM 782 Query: 401 SLQVRMPIVDWRRLCKQEPALIDY 424 +R I R +++P +Y Sbjct: 783 D-YLRQGIH-LRGYAQKDP-KQEY 803
>cloacin#Cloacin signature. Length = 551 Score = 31.6 bits (71), Expect = 0.006 Identities = 25/81 (30%), Positives = 30/81 (37%), Gaps = 10/81 (12%) Query: 17 GSSKPGGNSEGNGNKGGRDQGPPDLDDIFRKLSKKLGGLGGGKGTGSGGGSSSQGP---- 72 S G +SE N GG G G GGG GTG G S+ P Sbjct: 33 ASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTG-GNLSAVAAPVAFG 91 Query: 73 -----RPQLGGRVVTIAAAAI 88 P GG V+I+A A+ Sbjct: 92 FPALSTPGAGGLAVSISAGAL 112
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.6 bits (69), Expect = 0.029 Identities = 12/55 (21%), Positives = 24/55 (43%), Gaps = 1/55 (1%) Query: 165 VVPDDSRLSFDILIPPDQIMGARMGFVVVVELTQRPTRRTKAV-GKIVEVLGDNM 218 +VP+D L L+ I +G ++++ P R + GK+ + D + Sbjct: 359 IVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAI 413
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 62.4 bits (151), Expect = 8e-14 Identities = 60/248 (24%), Positives = 103/248 (41%), Gaps = 33/248 (13%) Query: 6 GKTVLILGGSRGIGAAIVRRFVTDGANVR-FTYAGSKD---AAKRLAQETGATAVFTDSA 61 GK I G ++GIG A+ R + GA++ Y K + A+ A A D Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 62 DRDAVIDVV----RKSGALDILVVNAGIGVFGEALELNADDIDRLFKINIHAPYHASVEA 117 D A+ ++ R+ G +DILV AG+ G L+ ++ + F +N ++AS Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 118 ARQMP--EGGRILIIGS--VNGDRMPVAG----------MARGLARDFGPRGITINVVQP 163 ++ M G I+ +GS R +A + L + I N+V P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 164 GPIDTDA--------NPANGPMRDMLHSL---MAIKRHGQPEEVAGMVAWLAGPEASFVT 212 G +TD N A ++ L + + +K+ +P ++A V +L +A +T Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 213 GAMHTIDG 220 +DG Sbjct: 248 MHNLCVDG 255
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 50.8 bits (121), Expect = 4e-10 Identities = 20/100 (20%), Positives = 38/100 (38%), Gaps = 6/100 (6%) Query: 5 KQSRVPGRPRRFAPEQAISAAKVLFHQKGFDAVSVAEVTDYLGINPPSLYAAFGSKAGLF 64 ++++ + R + + A LF Q+G + S+ E+ G+ ++Y F K+ LF Sbjct: 3 RKTKQEAQETR---QHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59 Query: 65 SRVLNEYVGTEAIPLVDILRDDRPVGECLAEVLKEAARRY 104 S + L + P VL+E Sbjct: 60 SEIWELSESN-IGELELEYQAKFP--GDPLSVLREILIHV 96
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 145 bits (368), Expect = 7e-45 Identities = 86/256 (33%), Positives = 134/256 (52%), Gaps = 8/256 (3%) Query: 7 LAGKNILITGSAQGIGFLLATGLGKYGAQIIINDITAERAELAVEKLRQEGIRAVAAPFN 66 + GK ITG+AQGIG +A L GA I D E+ E V L+ E A A P + Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 67 VTHKHEIDTAVEHIEKDIGPIDVLVNNAGIQRRHPFTEFPEQEWNDVIAVNQTAVFLVSQ 126 V ID IE+++GPID+LVN AG+ R ++EW +VN T VF S+ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 127 AVTRHMVERKAGKVINICSMQSELGRDTITPYAASKGAVKMLTRGMCVELARHNIQVNGI 186 +V+++M++R++G ++ + S + + R ++ YA+SK A M T+ + +ELA +NI+ N + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 187 APGYFKTEMTKALVEDE--------AFTAWLCKRTPAARWGDPQELIGAAVFLSSKASDF 238 +PG +T+M +L DE P + P ++ A +FL S + Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGH 245 Query: 239 VNGHLLFVDGGMLVAV 254 + H L VDGG + V Sbjct: 246 ITMHNLCVDGGATLGV 261
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 50.6 bits (121), Expect = 5e-09 Identities = 49/310 (15%), Positives = 121/310 (39%), Gaps = 20/310 (6%) Query: 69 FFGAMADKYGRKPMMMWAIIIYSVGTGLSGIATNLYMLAVCRFIVGL-GMSGEYACASTY 127 GA++D++GR+P+++ ++ +V + A L++L + R + G+ G +G A A Y Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG--AVAGAY 119 Query: 128 AVESWPKNLQSKASAFLVSGFSVGNIIAAQIIPQFAEVYGWRNSFFI-----GLLPVLLV 182 + + +++ F+ + F G ++A ++ + FF GL + Sbjct: 120 IADITDGDERARHFGFMSACFGFG-MVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGC 178 Query: 183 LWIRKSAPESQEWIEDKYKDKSTFLSVFRKPHLSISMIVFLVCFCLFGANWPINGLLPSY 242 + +S + + + + + FR + + F + + L Sbjct: 179 FLLPESHKGERR--PLRREALNPL-ASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI 235 Query: 243 LADNGVNTVVISTLMTIAGLG---TLTGTIFFGFVGDKIGVKKAFVVGLITSFVFLCPLF 299 ++ + + +++A G +L + G V ++G ++A ++G+I L Sbjct: 236 FGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLA 295 Query: 300 FISVKNSSLIGLCLFGLM-FTNLGIAGLVPKFIYDYFPTKLRGLGTGLIYNLGATGGMAA 358 F + + + L + ++ + + + +L+G L + + Sbjct: 296 FATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALT----SLTSIVG 351 Query: 359 PVLATYISGY 368 P+L T I Sbjct: 352 PLLFTAIYAA 361
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 64.2 bits (156), Expect = 6e-14 Identities = 44/240 (18%), Positives = 91/240 (37%), Gaps = 13/240 (5%) Query: 36 TPQRIVVLELSFADALAAVDVSPIGIADDNDAKRILPEVRAHLKPWQSVGTRAQPSLEAI 95 P RIV LE + L A+ + P G+AD + + + E VG R +P+LE + Sbjct: 34 DPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSE-PPLPDSVIDVGLRTEPNLELL 92 Query: 96 AALKPDLIIADSSRHAGVYIALQQIAPVLLLKSR--NETYAENLQSAAIIGEMVGKKREM 153 +KP ++ S+ + L +IAP + A +S + +++ + Sbjct: 93 TEMKPSFMVW-SAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAA 151 Query: 154 QARLEQHKERMAQWASQLPKGTR---VAFGTSREQQFNLHTQETWTGSVLASLGLNVPAA 210 + L Q+++ + + K + + + + +L G +P A Sbjct: 152 ETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYG--IPNA 209 Query: 211 MAGAS----MPSIGLEQLLAVNPAWLLVAHYREESIVKRWQQDPLWQMLTAAQKQQVASV 266 G + ++ +++L A +L + + PLWQ + + + V Sbjct: 210 WQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDMDALMATPLWQAMPFVRAGRFQRV 269
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 32.9 bits (75), Expect = 0.004 Identities = 19/89 (21%), Positives = 29/89 (32%), Gaps = 9/89 (10%) Query: 546 GSFGTVQYSQIGKAVQSGNVEPEKARTWELGTRYDDGALTAEMGLFLINFNNQYDSNQTN 605 G F + NV EK + L + YD+ AL A + Q D+ Sbjct: 187 GFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALYASV------AVQQQDAKLVE 240 Query: 606 DTVTARGKTRHTGLETQARYDLGTLTPTL 634 + T + Y G +TP + Sbjct: 241 E---NYSHNSQTEVAATLAYRFGNVTPRV 266
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 33.7 bits (77), Expect = 5e-04 Identities = 10/51 (19%), Positives = 18/51 (35%), Gaps = 2/51 (3%) Query: 17 MNASGLNQGTSGNVSARYTGGMLITPSGIAYSKMTPDMIVFVDDKGIPEAG 67 + S + GG+L +G+ + D +V V G +A Sbjct: 682 IGYSHSDDIKQLYYGVS--GGVLAHANGVTLGQPLNDTVVLVKAPGAKDAK 730
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.8 bits (64), Expect = 0.008 Identities = 20/77 (25%), Positives = 30/77 (38%), Gaps = 10/77 (12%) Query: 85 LAVIPEYQGMGVGGRLIRTGIE--------HLRLMGCQTVFVLGHATYYPRHGFEPCAGD 136 +AV +Y+ GVG L+ IE L L Q + + +Y +H F A D Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLE-TQDINISA-CHFYAKHHFIIGAVD 152 Query: 137 KGYPAPYPIPEEHKACW 153 + +P E W Sbjct: 153 TMLYSNFPTANEIAIFW 169
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 27.6 bits (61), Expect = 0.007 Identities = 17/83 (20%), Positives = 30/83 (36%), Gaps = 10/83 (12%) Query: 21 QVLIGDIYLRQQCDVFWLGY------TISPAYTRQGYAIEAITATIDWIKENGFSLIKAG 74 IG I +R W GY ++ Y ++G + I+W KEN F + Sbjct: 74 NNCIGRIKIRSN----WNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLE 129 Query: 75 VNPENTLSKKLLIRIGFNFSSIE 97 N + + F +++ Sbjct: 130 TQDINISACHFYAKHHFIIGAVD 152
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.4 bits (66), Expect = 0.027 Identities = 69/407 (16%), Positives = 120/407 (29%), Gaps = 45/407 (11%) Query: 40 LFVLFIFSFLDRINIGFAGL---TMGRDLGLS---ATMFGLATTLFYAAYVIFGIPSNIM 93 L V+ LD + IG + RDL S +G+ L+ + Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 94 LSIVGARRWIATIMVLWGIASTATMFATGPT--SLYVLRILVGITEAGFLPGILLYLTFW 151 G R ++ L G A + AT P LY+ RI+ GIT A Y+ Sbjct: 67 SDRFGRR--PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATG-AVAGAYIADI 123 Query: 152 FPAYFRARANALFMVAMPVTTALGSIVSGYILSLDGVMALKGWQWLFLLEGFPSVLLGVM 211 RAR G ++ G M F + L + Sbjct: 124 TDGDERARHFGFMSACFGFGMVAGPVLGGL-------MGGFSPHAPFFAAAALNGLNFLT 176 Query: 212 VWFWLDDSPDKAKWLTKEDKKCLQEMMDNDRLTLVQPEGAISHHAMQQRSMWREIFTPVV 271 F L +S + R + P + W T V Sbjct: 177 GCFLLPESHKGER--------------RPLRREALNPLASFR---------WARGMTVVA 213 Query: 272 MMYTLAYFCLTNTLSAISIWTPQILQSFNQGSSNITIGLLAAVPQICTILGMVYWSRHSD 331 + + + ++W F+ TIG+ A I L + Sbjct: 214 ALMAVFFIMQLVGQVPAALWVIFGEDRFHW--DATTIGISLAAFGILHSLAQAMITGPVA 271 Query: 332 RRQERRHHTALPYLFAAAGWLLASATDHNMIQMLGIIMASTGSFSAMAIFWTTPDQSISL 391 R R L + G++L + + +++ ++G A+ Q Sbjct: 272 ARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEE 331 Query: 392 RARAIGIAVINATGNIGSALSPFMIGWLKDLT-GSFNSGLWFVAALL 437 R + ++ T ++ S + P + + + ++N W A L Sbjct: 332 RQGQLQGSLAALT-SLTSIVGPLLFTAIYAASITTWNGWAWIAGAAL 377
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 29.8 bits (67), Expect = 0.011 Identities = 32/182 (17%), Positives = 59/182 (32%), Gaps = 34/182 (18%) Query: 78 QIKQLLDVGTQT---LLVPMVQNADEAREAVRATRYPPAGIRGVGSALARASRWNRIPDY 134 Q++ LL T ++ PM+ +E R+A + + G ++ + Sbjct: 374 QLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIEVG----- 428 Query: 135 LQKANDQMCVLVQIETREAMKNLPQILDVEGVDGVFIGPADL----------SADMGYAG 184 + +E + VD IG DL + + Y Sbjct: 429 -----------IMVEIPSTAVAANLFA--KEVDFFSIGTNDLIQYTMAADRMNERVSYLY 475 Query: 185 NPQHPEVQAAIEQAIVQIRESGKAPGI---LIANEQLAKRYLELGALFVAVGVDTTLLAR 241 P HP + ++ I GK G+ + +E L LG ++ + L AR Sbjct: 476 QPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPAR 535 Query: 242 AA 243 + Sbjct: 536 SQ 537
>PF07201#Hypersensitivity response secretion protein HrpJ Length = 293 Score = 30.2 bits (68), Expect = 0.007 Identities = 9/51 (17%), Positives = 24/51 (47%) Query: 138 LHAVDARVNELEELLPLLMKDKLLAKGVSHLLSSQLTRILRTHAAMSVLGH 188 + V+ +VN+ +P L + + +++ +S L +S + + A + Sbjct: 80 VSDVEEQVNQYLSKVPELEQKQNVSELLSLLSNSPNISLSQLKAYLEGKSE 130
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 143 bits (362), Expect = 6e-40 Identities = 83/387 (21%), Positives = 149/387 (38%), Gaps = 84/387 (21%) Query: 5 IGIDLGTTNSCVAIMDGTTPRVLENAEGDRTTPSIIAYTQDGET------LVGQPAKRQA 58 + IDLGT N+ + + + E PS++A QD VG AK+ Sbjct: 13 LSIDLGTANTLIYVKGQGIV-LNE--------PSVVAIRQDRAGSPKSVAAVGHDAKQML 63 Query: 59 VTNPQNTLFAIKRLIGRRFQDEEVQRDVSIMPFKIIAADNGDAWVEVKGQKMAPPQISAE 118 P N + AI+ + +D I F + + Sbjct: 64 GRTPGN-IAAIRPM-----------KDGVIADFFVTEK------------------MLQH 93 Query: 119 VLKKMKKTAEDYLGEPVTEAVITVPAYFNDAQRQATKDAGRIAGLEVKRIINEPTAAALA 178 +K++ + P ++ VP +R+A +++ + AG +I EP AAA+ Sbjct: 94 FIKQVHS---NSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIG 150 Query: 179 YGL--DKGTGNRTIAVYDLGGGTFDISIIEIDEVDGEKTFEVLATNGDTHLGGEDFDSRL 236 GL + TG+ V D+GGGT ++++I ++ V + +GG+ FD + Sbjct: 151 AGLPVSEATGS---MVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEAI 198 Query: 237 INYLVEEFKKDQGIDLRNDPLAMQRLKEAAEKAKIELSSA----QQTDVNLPYITADATG 292 INY+ + G + AE+ K E+ SA + ++ + Sbjct: 199 INYVRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGV 245 Query: 293 PKHMNIKVTRAKLESLVEDLVNRSIEPLKVALQD-AGLSVSDIDD--VILVGGQTRMPMV 349 P+ + + LE+L E + + + VAL+ SDI + ++L GG + + Sbjct: 246 PRGFTLN-SNEILEALQEP-LTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNL 303 Query: 350 QKKVAEFFGKEPRKDVNPDEAVAIGAA 376 + + E G +P VA G Sbjct: 304 DRLLMEETGIPVVVAEDPLTCVARGGG 330
>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature. Length = 52 Score = 61.4 bits (149), Expect = 4e-17 Identities = 18/46 (39%), Positives = 30/46 (65%) Query: 23 HKAMIVALIVICITAVVAALVTRKDLCEVHIRTGQTEVAVFTAYES 68 +++ ++++C+T ++ +TRK LCE+ R G EVA F AYES Sbjct: 5 RSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYES 50
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 255 bits (653), Expect = 1e-79 Identities = 87/387 (22%), Positives = 164/387 (42%), Gaps = 20/387 (5%) Query: 1 MAAWRYASQDYRTFSDHLYENDKHYH----------QSDYNDFYDIG--RKNSLSANIMQ 48 + +RY++ Y F+D Y Y+ + + D+Y++ ++ L + Q Sbjct: 476 LVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQ 535 Query: 49 PLSNNLGNVSLSALWRNYWGRSGNAKDYQFSYSNNWQHISYTFSASQSYDENNKEEER-F 107 L + LS + YWG S + +Q + ++ I++T S S + + K ++ Sbjct: 536 QL-GRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQML 594 Query: 108 NLFISIPF--YWGDDIAKTRHQINLSNSTSFSKDGYSSNNTGITGIAGEHDQLNYGI--- 162 L ++IPF + D + S S S +G +N G+ G E + L+Y + Sbjct: 595 ALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTG 654 Query: 163 YVNQQQQNNDTSLGTNLSWRTPIAIIDGSYSHSKNAWQSGGSISSGLVVWSGGINITNQL 222 Y N+ ++ L++R + YSHS + Q +S G++ + G+ + L Sbjct: 655 YAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPL 714 Query: 223 SDTFAILDAPGLEGAHINGQKYNRTNSKGQVVYDLIIPHRENHLVLDIANSESETELQGN 282 +DT ++ APG + A + Q RT+ +G V +REN + LD +L Sbjct: 715 NDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNA 774 Query: 283 RQIIAPYRGAVSYVQFTTDQRKPWYIQALRPDGSPLTFGYDVLDLQENNIGVVGQGSRLF 342 + P RGA+ +F + L + PL FG V + G+V +++ Sbjct: 775 VANVVPTRGAIVRAEFKARVGIKLLMT-LTHNNKPLPFGAMVTSESSQSSGIVADNGQVY 833 Query: 343 IRVDEIPTGIKVALNDEQNLFCTITFQ 369 + + ++V +E+N C +Q Sbjct: 834 LSGMPLAGKVQVKWGEEENAHCVANYQ 860
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 30.7 bits (69), Expect = 0.002 Identities = 14/32 (43%), Positives = 19/32 (59%) Query: 8 NSAVLVHFTLKLDDGTTAESTRNNGKPALFRL 39 + V V +T L DGT +ST GKPA F++ Sbjct: 144 SDTVTVEYTGTLIDGTVFDSTEKAGKPATFQV 175
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 32.5 bits (74), Expect = 0.009 Identities = 20/110 (18%), Positives = 38/110 (34%), Gaps = 18/110 (16%) Query: 34 CKALREEGYRVILVNS-----------NPATIMTDPEMADATYIEPIHWEVVRKIIEKER 82 +AL GY V + ++ + ++TD M D + + I+K R Sbjct: 20 NQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD------LLPRIKKAR 73 Query: 83 PDAVLPTMGGQTALNCALELERQGVLEEFGVTM-IGATADAIDKAEDRRR 131 PD + M Q A++ +G + + I +A + Sbjct: 74 PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 29.4 bits (66), Expect = 0.040 Identities = 19/103 (18%), Positives = 39/103 (37%), Gaps = 18/103 (17%) Query: 300 ILIADKQSVGERAVKGICGQVDGSVV------PGFIGLEAGQS-AFGDIYAWFGRVLGWP 352 + I++K+ + + + ++G + G I + + + G P Sbjct: 281 VRISEKEKIK---ITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSV---LGDTKLLP 334 Query: 353 L-EQLAAQHPELKAQINASQKQ----LLPALTEAWAKNPSLDH 390 E++ P L+ + S+ Q LL AL E +P L + Sbjct: 335 QRERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRY 377
>PF05616#Neisseria meningitidis TspB protein Length = 501 Score = 29.3 bits (65), Expect = 0.022 Identities = 26/118 (22%), Positives = 47/118 (39%), Gaps = 21/118 (17%) Query: 82 YGRHPEAREWYHQWVYFRPRAYWHEWLNWPSIFVNTGFFRPDEAHQPHFSDLFGQ-IINA 140 Y R PE +E + R YW + N P ++ +F+ + +F G ++ Sbjct: 158 YSRFPEVKELMESQMERLARPYWEKLRNRPDMY----YFKNYNFKRCYFGLNGGDCLVAK 213 Query: 141 G-----------QGEGRYSELLAINLLEQLLLRRMEA-----INESLHPPMDNRVREA 182 G QG +Y E + LE++L +++A I + +P +V A Sbjct: 214 GDDGRTFISFSLQGNSKYKEEMDAKKLEEILSLKVDANPDKYIKATGYPGYSEKVEVA 271
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 29.2 bits (65), Expect = 0.020 Identities = 26/99 (26%), Positives = 39/99 (39%), Gaps = 20/99 (20%) Query: 110 MVKIEGGEWL----VETVKMLTERAVPVCGHLGLTPQSVNIFGGYKVQGRGDEAGDQL-L 164 V +E G L + V L AV GL P +V + D++G L Sbjct: 176 TVTLEPGRALDEGQISAVVHLVSSAVA-----GLPPGNVTLV---------DQSGHLLTQ 221 Query: 165 SDALALEAAGAQLLVLECVPVELAKRITEALAIPVIGIG 203 S+ + AQL V + +RI L+ P++G G Sbjct: 222 SNTSGRDLNDAQLKFANDVESRIQRRIEAILS-PIVGNG 259
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 790 bits (2043), Expect = 0.0 Identities = 267/850 (31%), Positives = 426/850 (50%), Gaps = 38/850 (4%) Query: 15 RIATFCALLYCNSAFCAELVEYDHTFLMGQNASNIDLSRYSEGNPAIPGMYDVSVYVNDQ 74 R+ CA AE + ++ FL + DLSR+ G PG Y V +Y+N+ Sbjct: 29 RLFVACAFAAQAPLSSAE-LYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNG 87 Query: 75 PIINQSITFIEIEGKKNAQACITLKNLLQFHINSPDINNEKAVLLARDETLGNCLNLTEI 134 + + +TF + ++ C+T L +N+ + LLA D C+ LT + Sbjct: 88 YMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASV--SGMNLLADDA----CVPLTSM 141 Query: 135 IPQASVRYDVNEQRLDIDVPQAWVMKNYQNYVDPSLWENGINAAMLSYNLNGYHSETP-G 193 I A+ + DV +QRL++ +PQA++ + Y+ P LW+ GINA +L+YN +G + G Sbjct: 142 IHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIG 201 Query: 194 RRNDSIYAAFNGGMNLGAWRLRASGNYNWMTDSGS-----NYDFKNRYIQRDIASLRSQL 248 + Y G+N+GAWRLR + +++ + S + N +++RDI LRS+L Sbjct: 202 GNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRL 261 Query: 249 ILGESYTTGETFDSVSIRGIRLYSDSRMLPPTLASFAPIIHGVANTNAKVTITQGGYKIY 308 LG+ YT G+ FD ++ RG +L SD MLP + FAP+IHG+A A+VTI Q GY IY Sbjct: 262 TLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIY 321 Query: 309 ETTVPPGAFVIDDLSPSGYGSDLIVTVEESDGSKRTFSQPFSSVVQMLRPGVGRWDISGG 368 +TVPPG F I+D+ +G DL VT++E+DGS + F+ P+SSV + R G R+ I+ G Sbjct: 322 NSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAG 381 Query: 369 QVLKDD-IQDEPNLFQASYYYGLNNYLTGYTGIQITDNNYTAGLLGLGLNT-SVGAFSFD 426 + + Q++P FQ++ +GL T Y G Q+ D Y A G+G N ++GA S D Sbjct: 382 EYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLAD-RYRAFNFGIGKNMGALGALSVD 440 Query: 427 VTHSNVRIPDDKTYQGQSYRVSWNKLFEETSTSLNIAAYRYSTQNYLGLNDALTLIDEVK 486 +T +N +PDD + GQS R +NK E+ T++ + YRYST Y D Sbjct: 441 MTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGY 500 Query: 487 HPE-----QDLEPKSMRNYSRM---KNQVTVSINQPLKFEKKDYGSFYLSGSWSDYWASG 538 + E ++PK Y+ + ++ +++ Q L + YLSGS YW + Sbjct: 501 NIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQL----GRTSTLYLSGSHQTYWGTS 556 Query: 539 QNRSNYSIGYSNSASWGSYSVSAQRSWNE-DGDTDDSVYLSFTIPIEKLLGTEQRTS-GF 596 + G + + ++++S + N D + L+ IP L ++ ++ Sbjct: 557 NVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRH 616 Query: 597 QSIDTQMSSDFKGNNQLNVSSSGYS-DNARVSYSVNTGYTMNKASKDLSYVGGYASYESP 655 S MS D G G ++ +SYSV TGY S +Y Sbjct: 617 ASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGG 676 Query: 656 WGSLAGSVSANSDNSRQVSLSTDGGFVLHSGGLTFSNDSFSDSDTLAVVQAPGAQGARIN 715 +G+ S + D +Q+ GG + H+ G+T +DT+ +V+APGA+ A++ Sbjct: 677 YGNANIGYSHSDDI-KQLYYGVSGGVLAHANGVTLGQPL---NDTVVLVKAPGAKDAKVE 732 Query: 716 YGNST-IDRWGYGVTSALSPYHENRIALDINDLENDVELKSTSAVAVPRQGSVVFADFET 774 D GY V + Y ENR+ALD N L ++V+L + A VP +G++V A+F+ Sbjct: 733 NQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKA 792 Query: 775 VQGQSAIMNITRSDGKNIPFAADIYDEQGNVIGNVGQGGQAFVRGIEQQGNISIKWLEES 834 G +M +T + K +PF A + E G V GQ ++ G+ G + +KW EE Sbjct: 793 RVGIKLLMTLTH-NNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEE 851 Query: 835 KPVSCLAHYQ 844 C+A+YQ Sbjct: 852 NA-HCVANYQ 860
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 507 bits (1308), Expect = 0.0 Identities = 293/296 (98%), Positives = 293/296 (98%) Query: 1 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA Sbjct: 1 MSGLPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60 Query: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSSEMLARIAPGR 120 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS EMLARIAPGR Sbjct: 61 DTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGR 120 Query: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFILSMKPRFVKRGARPLLLT 180 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFI SMKPRFVKRGARPLLLT Sbjct: 121 GFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLT 180 Query: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH Sbjct: 181 TLIDPRHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDH 240 Query: 241 DNSKDMDALMATPLWQAMPFVRTGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296 DNSKDMDALMATPLWQAMPFVR GRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA Sbjct: 241 DNSKDMDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 34.3 bits (78), Expect = 4e-04 Identities = 12/55 (21%), Positives = 28/55 (50%), Gaps = 4/55 (7%) Query: 186 NDYYRKVKELRAKNQITLPVILKNERQINVFLRT----EDIDLINVINEETLLQQ 236 + ++ EL A N T+ +K ++N+ +R D + I V +E+++++ Sbjct: 589 QNIKNQLAELNATNIYTVLDKIKLNAKMNILIRDKRFHYDRNNIAVGADESVVKE 643
>PF06580#Sensor histidine kinase Length = 349 Score = 31.8 bits (72), Expect = 0.016 Identities = 15/95 (15%), Positives = 29/95 (30%), Gaps = 9/95 (9%) Query: 18 RPAMPRFKVSAFWLLILAWIFL-LVWIWWKGPTWTLYEEQWLKPLANRWLATAAWG---- 72 + + ++ A + + +VW W L KP+A + Sbjct: 66 QGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSIIFNVV 125 Query: 73 IIALMW----LTVRVMKRLQQLEKMQKQQREEAVD 103 ++ MW K +Q E Q + A + Sbjct: 126 VVTFMWSLLYFGWHFFKNYKQAEIDQWKMASMAQE 160
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 35.0 bits (80), Expect = 0.002 Identities = 33/213 (15%), Positives = 71/213 (33%), Gaps = 11/213 (5%) Query: 432 SALTTLSHQQEAEIRQLERELRIGLRTDTSRMTEVLVQYDETLTALDELEAAWHQQQTLV 491 + ++ Q E+ Q E + T+T V + + E Q Sbjct: 1072 AKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSP 1131 Query: 492 REIIALRQQLLGVAEDDAAP---LPDADTVEDTQPESESESESESESESESESESEQDNT 548 ++ + Q + P + + + +T ++E ++ S + + +ES NT Sbjct: 1132 KQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNT 1191 Query: 549 GAEPADEAGSE-----QPEETAETVSPVQRLAHLTAELDALHNDRLLVSPHVDKKQIAA- 602 G + + QP +E+ S + H + HN + D+ +A Sbjct: 1192 GNSVVENPENTTPATTQPTVNSES-SNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALC 1250 Query: 603 -VIAEWTGVPLNRLSQNEMSVITDLPKWLGDTI 634 + + T L+ V ++ K + I Sbjct: 1251 DLTSTNTNAVLSDARAKAQFVALNVGKAVSQHI 1283
>OUTRSURFACE#Outer surface protein signature. Length = 273 Score = 36.4 bits (84), Expect = 6e-04 Identities = 42/184 (22%), Positives = 73/184 (39%), Gaps = 21/184 (11%) Query: 395 RVTITDSLNRR--EVLYTEGEGGLKRVVKKEHADGSITRSEYDEAGRL--KAQTDAAGRR 450 ++TI D L++ E+ +G+ + R V + D + T ++E G L K T G + Sbjct: 87 KLTIADDLSKTTFELFKEDGKTLVSRKVSSK--DKTSTDEMFNEKGELSAKTMTRENGTK 144 Query: 451 TEYRLHMASGAVTAVTGPDGRTVRYGYNSQRQVTSVTYPDGLRSSREYDEKGRLTAETSR 510 EY + G A T+ + + ++ +G + L+ E ++ Sbjct: 145 LEYTEMKSDGTGKAKEVLKNFTLEGKVANDK--VTLEVKEGTVT---------LSKEIAK 193 Query: 511 SGETTRYSYDDPASEL--PTGIQDATGSTKQM-AWSRYGQLLAFTDCSGYTTRYEYDRYG 567 SGE T D ++ TG D+ ST + S+ L FT T +YD G Sbjct: 194 SGEVTVALNDTNTTQATKKTGAWDSKTSTLTISVNSKKTTQLVFTK-QDTITVQKYDSAG 252 Query: 568 QQIA 571 + Sbjct: 253 TNLE 256 Score = 34.1 bits (78), Expect = 0.003 Identities = 23/100 (23%), Positives = 46/100 (46%), Gaps = 10/100 (10%) Query: 572 VHREEGISTYSSYNPRGQLVSQKDAQGRETRYEYSAAGDLTATVSPDGKRSTIEYDKRGR 631 V ++ ST +N +G+L ++ + T+ EY+ ++ + + K + G+ Sbjct: 114 VSSKDKTSTDEMFNEKGELSAKTMTRENGTKLEYT---EMKSDGTGKAKEVLKNFTLEGK 170 Query: 632 ------PVSVTEGGLTRSMGYDAAGRITV-LTNENGSQST 664 + V EG +T S +G +TV L + N +Q+T Sbjct: 171 VANDKVTLEVKEGTVTLSKEIAKSGEVTVALNDTNTTQAT 210
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 549 bits (1416), Expect = 0.0 Identities = 232/384 (60%), Positives = 269/384 (70%), Gaps = 34/384 (8%) Query: 1 MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVYGKVKAMHYMSDNASKDGDQSYIRFG 60 MK+ LALV+ ++A+ + AAEIYNKDGNKLD+YGKV +HY SD++SKDGDQ+Y+R G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 FKGETQINDQLTGYGRWEAEFAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALY 120 FKGETQINDQLTGYG+WE N E + A TRLAFAGLK+ D GSFDYGRN G LY Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 121 DVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVIDGLNLTLQYQGKNEN-- 178 DVE WTDM PEFGGDS DN+MT RA+G+ATYRNTDFFG++DGLN LQYQGKNE+ Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 179 --------------RDVKKQNGDGFGTSLTYDFGGSDFAISGAYTNSDRTNEQNLQSR-- 222 D++ NGDGFG S TYD G F+ AYT SDRTNEQ Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDI-GMGFSAGAAYTTSDRTNEQVNAGGTI 239 Query: 223 GTGKRAEAWATGLKYDANNIYLATFYSETRKMTP-------ITGGFANKTQNFEAVAQYQ 275 G +A+AW GLKYDANNIYLAT YSETR MTP GG ANKTQNFE AQYQ Sbjct: 240 AGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQ 299 Query: 276 FDFGLRPSLGYVLSKGKDIE----GIGDEDLVNYIDVGATYYFNKNMSAFVDYKINQLDS 331 FDFGLRP++ +++SKGKD+ D+DLV Y DVGATYYFNKN S +VDYKIN LD Sbjct: 300 FDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLDD 359 Query: 332 DNKL----NINNDDIVAVGMTYQF 351 D+ I+ DDIVA+GM YQF Sbjct: 360 DDPFYKDAGISTDDIVALGMVYQF 383
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 37.5 bits (87), Expect = 6e-05 Identities = 28/127 (22%), Positives = 48/127 (37%), Gaps = 17/127 (13%) Query: 119 DTLRALLDNNI---------VPVINENDAVATAEIKVGDNDNLSALAAILAGADKLLLLT 169 +T++ L++ + VPVI E+ + E V D D A AD ++LT Sbjct: 177 ETIKKLVERGVIVIASGGGGVPVILEDGEIKGVE-AVIDKDLAGEKLAEEVNADIFMILT 235 Query: 170 DQKGLYTADPRSNPQAELIKDVYGIDDALRAIAGDSVSGLGTGGMSTKLQAA-DVACRAG 228 D G + + +++V +++ + G M K+ AA G Sbjct: 236 DVNGAALY--YGTEKEQWLREV-KVEELRKYYEEG---HFKAGSMGPKVLAAIRFIEWGG 289 Query: 229 IDTIIAA 235 IIA Sbjct: 290 ERAIIAH 296 Score = 30.2 bits (68), Expect = 0.013 Identities = 16/76 (21%), Positives = 33/76 (43%), Gaps = 13/76 (17%) Query: 4 SQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQ----LHAAGHRIVIVTSG-------- 51 + +V+ LG + L ++ + +++ VR+ A+ + A G+ +VI Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61 Query: 52 -AIAAGREHLGYPELP 66 + AG+ G P P Sbjct: 62 LHMDAGQATYGIPAQP 77
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 62.9 bits (153), Expect = 4e-12 Identities = 30/247 (12%), Positives = 72/247 (29%), Gaps = 23/247 (9%) Query: 487 TLNLNSLWSKLGTFSISYNDDRRYNSHYYTADYYQNVYSGTFGSLGLRAGIQRYNNGDSN 546 L + + T +S + Y + +Q + F + N Sbjct: 530 QLTVTQQLGRTSTLYLSG-SHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQK 588 Query: 547 ANTGKYIALDLSLPLGNWFSAGMTHQNGYTMANLSARKQFDEGT------------IRTV 594 + +AL++++P +W + Q + A+ S + + Sbjct: 589 -GRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNL 647 Query: 595 GANLSRAISGDTGDDKTLSGGAYAQFDARYASGTLNVNSAADGYINTNLTANGSVGWQGK 654 ++ +G + +G A + Y + + S +D +G V Sbjct: 648 SYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGY-SHSDDIKQLYYGVSGGVLAHAN 706 Query: 655 NIAASGRTDGNAGVIFNTGLED---DGQISAKINGRIFPLNGKRNYLPLSPYGRYEVELQ 711 + + ++ G +D + Q + + R G + Y V L Sbjct: 707 GVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWR-----GYAVLPYATEYRENRVALD 761 Query: 712 NSKNSLD 718 + + + Sbjct: 762 TNTLADN 768
>INTIMIN#Intimin signature. Length = 939 Score = 556 bits (1433), Expect = 0.0 Identities = 232/820 (28%), Positives = 364/820 (44%), Gaps = 53/820 (6%) Query: 41 PVMAARAQHAVQPRLSMGNTTVTADNNVEKNVASFAANAGTFLSSQPDS-----DATRNF 95 P++AA +L+ + VT N + ++AA L SQ S D ++ Sbjct: 131 PLVAAGGVAGHTNKLTKMSPDVTKSNMTDDKALNYAAQQAASLGSQLQSRSLNGDYAKDT 190 Query: 96 ITGMATAKANQEIQEWLGKYGTARVKLNVDKDFSLKDSSLEMLYPIYDTPTNMLFTQGAI 155 G+A +A+ ++Q WL YGTA V L +F SSL+ L P YD+ + F Q Sbjct: 191 ALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD--GSSLDFLLPFYDSEKMLAFGQVGA 248 Query: 156 HRTDDRTQSNIGFGWRHFSGNDWMAGVNTFIDHDLSRSHTRIGVGAEYWRDYLKLSANGY 215 D R +N+G G R F M G N FID D S +TR+G+G EYWRDY K S NGY Sbjct: 249 RYIDSRFTANLGAGQRFFLPE-NMLGYNVFIDQDFSGDNTRLGIGGEYWRDYFKSSVNGY 307 Query: 216 IRASGWKKSPDIEDYQERPANGWDIRAEGYLPAWPQLGASLMYEQYYGDEVGLFGKDKRQ 275 R SGW +S + +DY ERPANG+DIR GYLP++P LGA LMYEQYYGD V LF DK Q Sbjct: 308 FRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDNVALFNSDKLQ 367 Query: 276 KDPHAISAEVTYTPVPLLTLSAGHKQGKSGENDTRFGLEVNYRIGEPLEKQLDTDSIRER 335 +P A + V YTP+PL+T+ ++ G END + ++ Y+ +P +Q++ + E Sbjct: 368 SNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQQIEPQYVNEL 427 Query: 336 RMLAGSRYDLVERNNNIVLEYRKSEVIRIALPERIEGKGGQTLSLGLVVSKATHGLKNVQ 395 R L+GSRYDLV+RNNNI+LEY+K +++ + +P I G T + L+V K+ +GL + Sbjct: 428 RTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHDINGTERSTQKIQLIV-KSKYGLDRIV 486 Query: 396 WEAPSLLAEGGKITGQGSQ----WQVTLPAYRPGKDNYYAISAVAYDNKGNASKRVQTEV 451 W+ +L ++GG+I GSQ +Q LPAY G N Y ++A AYD GN+S V + Sbjct: 487 WDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTI 546 Query: 452 VITGAGMSADRTALTLDGQSRIQMLANGNEQRPLVLSLR----DAEGQPVTGMKDQIKTE 507 + G D+ +T + A+G E +++ PV+ Sbjct: 547 TVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVS--------- 597 Query: 508 LTFKPAGNIVTRTLKATKSQAKPTLGEFTETEAGVYQSVFTTGTQSGEATITVSVDGMSK 567 NIV+ T + + A +G + G+ ++ M+ Sbjct: 598 ------FNIVSGTAVLSANSAN-------TNGSGKATVTLKSDK-PGQVVVSAKTAEMTS 643 Query: 568 TVTAELRATMMDVANSTLSANEPSGDVVADGQQAYTLTLTAVDTDGNPVTGEASRLRFVP 627 + A + S VA+GQ A T T+ V PV+ + Sbjct: 644 ALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVK-VMKGDKPVSNQEVTFT--- 699 Query: 628 QDTNGVTIGTISEIKPGVYSATVSSTRAGNVVVRAFSEQYQLGTLQQTLKFVAGP-LDAA 686 ++ T G T++ST G +V A + ++F +D Sbjct: 700 TTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDG 759 Query: 687 HSSITLNPDKPVVGGTVTAIWTVKDAYDNPVTSLTPE---APSLAGAAAVGSTASGWTNN 743 + I V G + +W + + + + A+V +++ T Sbjct: 760 NIEIVGTG----VKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLK 815 Query: 744 GDGTWTAQITLGSTAGELEVMPKLNGQDAAANAAKVTVVADALSSNQSKVSVAEDHVKAG 803 GT T + + N N +K DA+++ ++ Sbjct: 816 EKGTTTISVISSDNQTATYTIATPNSL-IVPNMSKRVTYNDAVNTCKNFGGKLPSSQNEL 874 Query: 804 ESTTVTLIAKDAHGNAISGLSLSASLTGTASEGATVSSWT 843 E+ A + + S ++ + + TA + + + T Sbjct: 875 ENVFKAWGAANKYEYYKSSQTIISWVQQTAQDAKSGVAST 914 Score = 80.5 bits (198), Expect = 4e-17 Identities = 82/396 (20%), Positives = 128/396 (32%), Gaps = 45/396 (11%) Query: 905 KTTTELTFTVKDAYGNPVTGMKPDAPVFSGAASTGTERPSTGDWTETSNGVYVATLTLGS 964 T TVK G+ S +GT S +G TL Sbjct: 575 TEAITYTATVKK------NGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDK 628 Query: 965 AAGQLSVMPRVNGQNAVAQPLVLNVAGDASKAEIRDMTVKVDNQLANGQSTNQVTLTVVD 1024 + +A+ V+ V D +KA I ++ +ANGQ +T TV Sbjct: 629 PGQVVVSAKTAEMTSALNANAVIFV--DQTKASITEIKADKTTAVANGQDA--ITYTVKV 684 Query: 1025 TY-GNPLQGQNVTLTLPKGVTSKTGNTVTTDAAGKADIELMSTVAGEHSITASVNNAQ-- 1081 P+ Q VT T G S + T TD G A + L ST G+ ++A V++ Sbjct: 685 MKGDKPVSNQEVTFTTTLGKLSNS--TEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVD 742 Query: 1082 -KTVTVKFKADFSTGQASLEVDSAAPKVANGKDAFTLTATVEDKNGNPVPGSLVTFNLPR 1140 K V+F + ++E+ V G T ++ NL Sbjct: 743 VKAPEVEFFTTLTIDDGNIEI------VGTGVKGKLPTVWLQYG----------QVNLK- 785 Query: 1141 GVKPLTGDNVWVKANGE----GKAELQVVSVTAGTYEITASAGNSQPSDTQTITFVADKA 1196 G W AN + QV GT I+ + D QT T+ Sbjct: 786 -ASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISS-----DNQTATYTIATP 839 Query: 1197 TATVSGIEVMGNYALADGKAKQTYKVTVTDANNNLVKDSEVTLTASPASLNLEPNGTATT 1256 + + + D ++ N +++ A+ + + T + Sbjct: 840 NSLIV-PNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSSQTIIS 898 Query: 1257 NEQGQAIFTATTTVAATYTLKAQVSQTNGQVSTKTA 1292 Q Q A + VA+TY L Q N + S A Sbjct: 899 WVQ-QTAQDAKSGVASTYDLVKQNPLNNIKASESNA 933 Score = 76.6 bits (188), Expect = 5e-16 Identities = 73/337 (21%), Positives = 121/337 (35%), Gaps = 21/337 (6%) Query: 976 NGQNAVAQPLVLNVAGDAS-KAEIRDMTVKVDNQLANGQSTNQVTLTVVDTYGNPLQGQN 1034 N N V + + G + + D T + A+G T TV G Sbjct: 537 NSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKN-GVAQANVP 595 Query: 1035 VTLTLPKGVTSKTGNTVTTDAAGKADIELMSTVAGEHSITASVNNAQKTVTVKFKADFST 1094 V+ + G + N+ T+ +GKA + L S G+ ++A + Sbjct: 596 VSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQ 655 Query: 1095 GQASLEVDSAAPK--VANGKDAFTLTATVEDKNGNPVPGSLVTFNLPRGVKPLTGDNVWV 1152 +AS+ A VANG+DA T T V K PV VTF G + + Sbjct: 656 TKASITEIKADKTTAVANGQDAITYTVKV-MKGDKPVSNQEVTFTTTLGKLSNSTE---- 710 Query: 1153 KANGEGKAELQVVSVTAGTYEITASAGNSQPSDTQTITFVADKATATVSGIEVMGNYALA 1212 K + G A++ + S T G ++A + T IE++G Sbjct: 711 KTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGT---- 766 Query: 1213 DGKAKQTYKVTVTDANNNLVK---DSEVTLTAS-PASLNLEPN-GTATTNEQGQAIFTAT 1267 G + V + NL + + T ++ PA +++ + G T E+G + Sbjct: 767 -GVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVI 825 Query: 1268 T--TVAATYTLKAQVSQTNGQVSTKTAESKFVADDKN 1302 + ATYT+ S +S + + V KN Sbjct: 826 SSDNQTATYTIATPNSLIVPNMSKRVTYNDAVNTCKN 862 Score = 58.5 bits (141), Expect = 2e-10 Identities = 60/374 (16%), Positives = 109/374 (29%), Gaps = 54/374 (14%) Query: 779 VTVVADALSSNQSKV---SVAEDHVKAGESTTVTLIA------KDAHGNAISGLSLSASL 829 +TV+++ +Q V + + KA + +T A +S +S Sbjct: 546 ITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVPVSFNIVS--- 602 Query: 830 TGTASEGATVSSWTEKGDGSYVATLTTGGKTGELRVMPLFNGQPAATEAAQLTVIAGEMS 889 GTA A +S G G TL + + A A + + Sbjct: 603 -GTAVLSA--NSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALN--ANAVIFVDQTK 657 Query: 890 SANSTLVADNKAPTVKTTTELTFTVKDAY-GNPVTGMKPDAPVFSGAASTGTERPSTGDW 948 ++ + + AD +T+TVK PV+ + +T + S Sbjct: 658 ASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEV-------TFTTTLGKLSNSTE 710 Query: 949 TETSNGVYVATLTLGSAAGQLSVMPRVNGQN-AVAQPLVLNVAG---DASKAEIRDMTVK 1004 +NG TLT + G+ V RV+ V P V D EI Sbjct: 711 KTDTNGYAKVTLT-STTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEI------ 763 Query: 1005 VDNQLANGQSTNQVTLTV-VDTYGNPLQGQNVTLTLPKGVTSKTGNTVTTDA-AGKADIE 1062 + G T+ + G N T S + DA +G+ ++ Sbjct: 764 ----VGTGVKGKLPTVWLQYGQVNLKASGGNGKYTW----RSANPAIASVDASSGQVTLK 815 Query: 1063 LMSTVAGEHSITASVNNAQKTVTVKFKADFSTGQASLEVDSAAPKVANGKDAFTLTATVE 1122 G +I+ ++ Q T + + + + Sbjct: 816 ----EKGTTTISVISSDNQ---TATYTIATPNSLIVPNMSKRV-TYNDAVNTCKNFGGKL 867 Query: 1123 DKNGNPVPGSLVTF 1136 + N + + Sbjct: 868 PSSQNELENVFKAW 881 Score = 54.7 bits (131), Expect = 2e-09 Identities = 40/178 (22%), Positives = 63/178 (35%), Gaps = 13/178 (7%) Query: 1168 TAGTYEITASA----GNSQPSDTQTITFVADKATATVSGI---EVMGNYALADGKAKQTY 1220 + Y++TA A GNS + TIT +++ G+ A ADG TY Sbjct: 521 GSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITY 580 Query: 1221 KVTVTDANNNLVKDSEVTLTAS-PASLNLEPNGTATTNEQGQAIFTATTTVAATYTLKAQ 1279 TV S A L+ +A TN G+A T + + A+ Sbjct: 581 TATVKKNGVAQANVPVSFNIVSGTAVLSAN---SANTNGSGKATVTLKSDKPGQVVVSAK 637 Query: 1280 VSQTNGQVSTKTAESKFVADDKNAVLTASSDMQSLVADGKSTAKLEVTLMSANNPVGG 1337 + FV K ++ +D + VA+G+ V +M + PV Sbjct: 638 --TAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSN 693
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.4 bits (63), Expect = 0.025 Identities = 12/42 (28%), Positives = 19/42 (45%) Query: 3 RQKILQQLLEWIECNLEHPISIEDIAQKSGYSRRNIQLLFRN 44 RQ IL L S+ +IA+ +G +R I F++ Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKD 54
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 64.3 bits (156), Expect = 5e-15 Identities = 32/172 (18%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 10 RRRQLIDATLEAINEVGMHDATIAQIARRAGVSTGIISHYFRDKNGLLEATMRDITSQLR 69 R+ ++D L ++ G+ ++ +IA+ AGV+ G I +F+DK+ L S + Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71 Query: 70 DAVLNRLHALPQGSAEQRLQAIVGGNFDETQVSSAAMKAWLAFWASSMHQP-------ML 122 + L P G L+ I+ + T V+ + + Sbjct: 72 ELELEYQAKFP-GDPLSVLREILIHVLEST-VTEERRRLLMEIIFHKCEFVGEMAVVQQA 129 Query: 123 YRLQQVSSRRLLSNLVSEFRRE---LPREQAQEAGYGLAALIDGL---WLRA 168 R + S + + + A + I GL WL A Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 128 bits (323), Expect = 4e-32 Identities = 120/469 (25%), Positives = 187/469 (39%), Gaps = 65/469 (13%) Query: 854 QAGNVLVVKGNYHGNNGQLVMNTVLNGDDSVTDKLVVEGDTSGTTAVTVNNAGGTGAKTL 913 +AG V+ N +G MN D ++DKLVV D SG + V N+G + Sbjct: 466 EAGRFKVLTVNTLAGSGLFRMNV--FADLGLSDKLVVMQDASGQHRLWVRNSGS-EPASA 522 Query: 914 NGIELIHVDGKSEGEFVQA---GRIVAGAYDYTLARGQGANSGNWYLTSGSDSPELQPEP 970 N + L+ S F A G++ G Y Y LA +G W L P +P P Sbjct: 523 NTLLLVQTPLGSAATFTLANKDGKVDIGTYRYRLA---ANGNGQWSLVGAKAPPAPKPAP 579 Query: 971 DPMPNPEPNPNPEPN-PNPTPTPGPDLNVDNDLRPEAGSYIANLAAANTMFTTRLHERLG 1029 P P P P P+P P P P G +L+ AAAN T Sbjct: 580 QPGPQPPQPPQPQPEAPAPQPPAGRELS----------------AAANAAVNTGGVGLAS 623 Query: 1030 NTYYTDMVTGEQKQTTMWMRHEGGHNKWRDGSGQLKTQSNRYV---------LQLGGDVA 1080 +Y + ++ + + + G W G Q + NR +LG D A Sbjct: 624 TLWYAESNALSKRLGELRLNPDAG-GAWGRGFAQRQQLDNRAGRRFDQKVAGFELGADHA 682 Query: 1081 QWSQNGSDSWHVGVMAGYGNSDSKTISSRTGYRAKASVNGYSTGLYATWYADDESRNGAY 1140 G WH+G +AGY D G+ + G YAT+ AD +G Y Sbjct: 683 VAVAGGR--WHLGGLAGYTRGDRGFTGDGGGH-----TDSVHVGGYATYIAD----SGFY 731 Query: 1141 LDSWAQYSWFDN--TVKGDDLQS--ESYKSKGFTASLEAGYKHKLAEFNGSQGTRNEWYV 1196 LD+ + S +N V G D + Y++ G ASLEAG + A + W++ Sbjct: 732 LDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLEAGRRFTHA---------DGWFL 782 Query: 1197 QPQAQVTWMGVKADKHRESNGTLVHSNGDGNVQTRLGVKTWLKSHHKMDDGKSREFQPFV 1256 +PQA++ +R +NG V G +V RLG L+ +++ R+ QP++ Sbjct: 783 EPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLG----LEVGKRIELAGGRQVQPYI 838 Query: 1257 EVNWLHNSKDFST-SMDGVSVTQDGARNIAEIKTGVEGQLNANLNVWGN 1304 + + L T +G++ + AE+ G+ L +++ + Sbjct: 839 KASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYAS 887
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 30.6 bits (69), Expect = 0.009 Identities = 15/50 (30%), Positives = 24/50 (48%), Gaps = 2/50 (4%) Query: 6 TEENLLAFTTAARFGSFSKAAEELGLTTSAISYTIKRMETGLDVVLFTRS 55 E L+ A G+ KAA+ LGL + + I+ + G+ V +RS Sbjct: 436 MEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIREL--GVSVYRSSRS 483
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 336 bits (864), Expect = e-112 Identities = 122/401 (30%), Positives = 199/401 (49%), Gaps = 54/401 (13%) Query: 164 DLAEEAGMTGIFIYSAATVRQAFSDALDMTRMSLRHNTHDATRNALRTRYVLGDMLGQSP 223 A +A G + Y ++ + + +L ++ ++G+S Sbjct: 88 MTAIKASEKGAYDYLPKPFDL--TELIGIIGRALAEPKRRPSK-LEDDSQDGMPLVGRSA 144 Query: 224 QMEQVRQTILLYARSSAAVLIEGETGTGKELAAQAIHREYFARHDARQGKKSHPFVAVNC 283 M+++ + + ++ ++I GE+GTGKEL A+A+H + R+ PFVA+N Sbjct: 145 AMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD-----YGKRRNG---PFVAINM 196 Query: 284 GAIAESLLEAELFGYEEGAFTGSRRGGRAGLFEIAHGGTLFLDEIGEMPLPLQTRLLRVL 343 AI L+E+ELFG+E+GAFTG++ G FE A GGTLFLDEIG+MP+ QTRLLRVL Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTR-STGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVL 255 Query: 344 EEKEVTRVGGHQPVPVDVRVISATHCKLEEDMQQGRFRRDLFYRLSILRLQLPPLRERVA 403 ++ E T VGG P+ DVR+++AT+ L++ + QG FR DL+YRL+++ L+LPPLR+R Sbjct: 256 QQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAE 315 Query: 404 DILPLAESFLKVSLAALSAPFSAALRQGLQASETVLVHYDWPGNIRELRNMMERLALFLS 463 DI L F++ ++ L+ + + WPGN+REL N++ RL Sbjct: 316 DIPDLVRHFVQ-QAEKEGLDVKRFDQEALEL----MKAHPWPGNVRELENLVRRLTALYP 370 Query: 464 VEP-TPDLTPQFLQLLLPELARESAKTPAPRLLTP------------------------- 497 + T ++ L+ +P+ E A + L Sbjct: 371 QDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYD 430 Query: 498 -----------QQALEKFNGDKTAAANYLGISRTTFWRRLK 527 AL G++ AA+ LG++R T ++++ Sbjct: 431 RVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIR 471
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 29.8 bits (67), Expect = 0.023 Identities = 11/33 (33%), Positives = 19/33 (57%), Gaps = 1/33 (3%) Query: 65 LIHGKLPTRDE-LAAYKTKLKALRGLPANVRTV 96 + +LPT +E AYK ++ + G P +RT+ Sbjct: 303 MDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTL 335
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 35.6 bits (82), Expect = 3e-04 Identities = 44/192 (22%), Positives = 72/192 (37%), Gaps = 22/192 (11%) Query: 4 LKNTNFWMFGLFFFFYFFI-MGAYFPFFPIWLHDINHISK--SDTGIIFAAISLFSLLFQ 60 +K + L + +G P P L D+ H + + GI+ A +L Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60 Query: 61 PLFGLLSDKLGLRKYLLWIITGMLVMFAPFFIFIFGPLLQYNILVGSIVGGIYLGFCFNA 120 P+ G LSD+ G R LL + + I P L + + +G IV GI A Sbjct: 61 PVLGALSDRFGRRPVLL---VSLAGAAVDYAIMATAPFL-WVLYIGRIVAGIT-----GA 111 Query: 121 GAPAVEAFIEKVSRRSNFEFGRARMFG----CVGWALCAS--IVGIMFTINNQFVFWLGS 174 A+I ++ RAR FG C G+ + A + G+M + F+ + Sbjct: 112 TGAVAGAYIADITDGDE----RARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAA 167 Query: 175 GCALILAVLLFF 186 + + F Sbjct: 168 ALNGLNFLTGCF 179
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 30.0 bits (67), Expect = 0.015 Identities = 19/69 (27%), Positives = 30/69 (43%) Query: 254 DIVRELRERTELPIGAYQVSGEYAMIKFAALAGAIDEEKVVLESLGSIKRAGADLIFSYF 313 + EL + +L + QV G A F +D E L I+ A +IF+ Sbjct: 466 NQFLELEKTKQLRLDTDQVYGNIATYNFENGRVRVDTGSNWSEVLPQIQETTARIIFNGK 525 Query: 314 ALDLAEKKI 322 L+L E++I Sbjct: 526 DLNLVERRI 534
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 122 bits (307), Expect = 2e-30 Identities = 103/442 (23%), Positives = 169/442 (38%), Gaps = 59/442 (13%) Query: 572 TVNGNNDINNGDEVDNSSVDNVVA---ATGNYKVRIDNSTGAGAIADYAGKQLIYIDDTK 628 T+ G+ D D +V A+G +++ + NS G+ A L+ Sbjct: 477 TLAGSGLFRMNVFADLGLSDKLVVMQDASGQHRLWVRNS---GSEPASANTLLLVQTPLG 533 Query: 629 TNATFSAAN---KADLGAYTYQAEQRGNTV------------------------------ 655 + ATF+ AN K D+G Y Y+ GN Sbjct: 534 SAATFTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQP 593 Query: 656 ------VLQQMELTDYANMALSIP--SANTNIWNLEQDTVGTRLTNSRHGLADNGGAWVS 707 EL+ AN A++ + +W E + + RL R D GGAW Sbjct: 594 EAPAPQPPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRL-NPDAGGAWGR 652 Query: 708 YFGGNFNGDNGTIN-YDQDVNGIMVGVDTKIDGNNAKWIVGAAAGFAKGDMN---DRSGQ 763 F DN +DQ V G +G D + +W +G AG+ +GD D G Sbjct: 653 GFAQRQQLDNRAGRRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGH 712 Query: 764 VDQDSQTAYIYSSAHFANNVF-VDGSLSYSHFNNDLSATMSNGTYVDGSTNSDAWGFGLK 822 D Y + + A++ F +D +L S ND S+G V G + G L+ Sbjct: 713 TDSVHVGGY---ATYIADSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLE 769 Query: 823 AGYDFKLGDAGYVTPYGSISGLFQSGDDYQLSNDMKVDGQSYDSMRYELGVDAGYTFTYS 882 AG F D ++ P ++ G Y+ +N ++V + S+ LG++ G + Sbjct: 770 AGRRFTHADGWFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELA 829 Query: 883 EDQALTPYFKLAYVYDDSNNDNDVNGDSIDNGTEGSAVRV--GLGTQFSFTKNFSAYTDA 940 + + PY K + + + + V+ + I + TE R GLG + + S Y Sbjct: 830 GGRQVQPYIKASVL-QEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYASY 888 Query: 941 NYLGGGDVDQDWSANVGVKYTW 962 Y G + W+ + G +Y+W Sbjct: 889 EYSKGPKLAMPWTFHAGYRYSW 910
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 29.8 bits (67), Expect = 0.028 Identities = 31/137 (22%), Positives = 54/137 (39%), Gaps = 24/137 (17%) Query: 245 IWLPLGLVIGLLAAMFVLRILRRIQSPHHRLQDAIENRDICVHYQPIVSLANGKIVGAEA 304 W+ L L+ G +A +LR R+ + + P++ G+I Sbjct: 228 PWMLLALLAGFMAFRVMLR------QEKRRVS-----FHRRLLHLPLI----GRIARGLN 272 Query: 305 LARWPQTDGSWLSPDSFIPLAQQTGLS-EPLTLLIIRSVFEDMGDWLRQHPQQHISINLE 363 AR+ +T + S +PL Q +S + ++ R D +R+ H + LE Sbjct: 273 TARYARTLSILNA--SAVPLLQAMRISGDVMSNDYARHRLSLATDAVREGVSLHKA--LE 328 Query: 364 STVLTSEKIPQLLREMI 380 T L P ++R MI Sbjct: 329 QTAL----FPPMMRHMI 341
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1369 bits (3546), Expect = 0.0 Identities = 802/1033 (77%), Positives = 915/1033 (88%), Gaps = 1/1033 (0%) Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLAILKLPVAQYPTIAPPAVTISASYPGADAKTVQDT 60 M NFFI RPIFAWV+AII+M+AG LAIL+LPVAQYPTIAPPAV++SA+YPGADA+TVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSNSDSTGTVQITLTFESGTDADIAQVQVQNKLQLAMPLLPQ 120 VTQVIEQNMNGIDNLMYMSS SDS G+V ITLTF+SGTD DIAQVQVQNKLQLA PLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGVSVEKSSSSFLMVVGVINTDGTMTQEDISDYVAANMKDAISRTSGVGDVQLFGS 180 EVQQQG+SVEKSSSS+LMV G ++ + TQ+DISDYVA+N+KD +SR +GVGDVQLFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWMNPNELNKFQLTPVDVITAIKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240 QYAMRIW++ + LNK++LTPVDVI +K QN Q+AAGQLGGTP + GQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 TSTEEFGKILLKVNQDGSRVLLRDVAKIELGGENYDIIAEFNGQPASGLGIKLATGANAL 300 + EEFGK+ L+VN DGS V L+DVA++ELGGENY++IA NG+PA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTAAAIRAELAKMEPFFPSGLKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360 DTA AI+A+LA+++PFFP G+K++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL 480 E+ LPPKEAT KSM QIQGALVGIAMVLSAVF+PMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATMLKPIAKGDHGEGKKGFFGWFNRMFEKSTHHYTDSVGGILRSTGR 540 SVLVALILTPALCAT+LKP++ H E K GFFGWFN F+ S +HYT+SVG IL STGR Sbjct: 481 SVLVALILTPALCATLLKPVSAE-HHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 541 YLVLYLIIVVGMAYLFVRLPSSFLPDEDQGVFMTMVQLPAGATQERTQKVLNEVTHYYLT 600 YL++Y +IV GM LF+RLPSSFLP+EDQGVF+TM+QLPAGATQERTQKVL++VT YYL Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 601 KEKNNVESVFAVNGFGFAGRGQNTGIAFVSLKDWADRPGEENKVEAITMRATRAFSQIKD 660 EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 661 AMVFAFNLPAIVELGTATGFDFELIDQAGLGHEKLTQARNQLLAEAAKHPDMLTSVRPNG 720 V FN+PAIVELGTATGFDFELIDQAGLGH+ LTQARNQLL AA+HP L SVRPNG Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719 Query: 721 LEDTPQFKIDIDQEKAQALGVSINDINTTLGAAWGGSYVNDFIDRGRVKKVYVMSEAKYR 780 LEDT QFK+++DQEKAQALGVS++DIN T+ A GG+YVNDFIDRGRVKK+YV ++AK+R Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFR 779 Query: 781 MLPDDIGDWYVRAADGQMVPFSAFSSSRWEYGSPRLERYNGLPSMEILGQAAPGKSTGEA 840 MLP+D+ YVR+A+G+MVPFSAF++S W YGSPRLERYNGLPSMEI G+AAPG S+G+A Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839 Query: 841 MELMEQLASKLPTGVGYDWTGMSYQERLSGNQAPSLYAISLIVVFLCLAALYESWSIPFS 900 M LME LASKLP G+GYDWTGMSYQERLSGNQAP+L AIS +VVFLCLAALYESWSIP S Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 901 VMLVVPLGVIGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMDKEGKGL 960 VMLVVPLG++G LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLM+KEGKG+ Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 961 IEATLDAVRMRLRPILMTSLAFILGVMPLVISTGAGSGAQNAVGTGVMGGMVTATVLAIF 1020 +EATL AVRMRLRPILMTSLAFILGV+PL IS GAGSGAQNAVG GVMGGMV+AT+LAIF Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1021 FVPVFFVVVRRRF 1033 FVPVFFVV+RR F Sbjct: 1020 FVPVFFVVIRRCF 1032
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 44.0 bits (104), Expect = 6e-07 Identities = 33/212 (15%), Positives = 71/212 (33%), Gaps = 23/212 (10%) Query: 100 TYQATYDSAKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTA 159 + Y A +L + + Q+ Q +++ ++ L +Q + Sbjct: 256 EQENKYVEAVNELR--VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGL 313 Query: 160 AKAAVETARINLAYTKVTSPISGRIGKSNV-TEGALVQNGQATALATVQQLDPIYVDVTQ 218 + + + +P+S ++ + V TEG +V + T + V + D + V Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALV 372 Query: 219 SSNDFLRLKQELA----------NGTLKQENGKAKVSLITSDGIKFPQDGTLEFSDVTVD 268 + D + KV I D I+ + G + ++++ Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTRYGYLV---GKVKNINLDAIEDQRLGLVFNVIISIE 429 Query: 269 QTTGSITLRAIFPNPDHTLLPGMFVRARLEEG 300 + S + I L GM V A ++ G Sbjct: 430 ENCLSTGNKNIP------LSSGMAVTAEIKTG 455 Score = 34.4 bits (79), Expect = 8e-04 Identities = 24/125 (19%), Positives = 43/125 (34%), Gaps = 13/125 (10%) Query: 49 PLQITTELPGR-TSAYRIAEVRPQVSGIILKRNFKEGSDIEAGVSLYQIDPATYQATYDS 107 ++I G+ T + R E++P + I+ + KEG + G L ++ +A Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA---- 134 Query: 108 AKGDLAKAQAAANIAQLTVNRYQKLLGTQYISKQEYDQALADAQQANAAVTAAKAAVETA 167 D K Q++ A+L RYQ L E ++ Sbjct: 135 ---DTLKTQSSLLQARLEQTRYQILS-----RSIELNKLPELKLPDEPYFQNVSEEEVLR 186 Query: 168 RINLA 172 +L Sbjct: 187 LTSLI 191
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 222 bits (567), Expect = 5e-76 Identities = 215/215 (100%), Positives = 215/215 (100%) Query: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPATNE 215
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 32.1 bits (73), Expect = 0.014 Identities = 18/125 (14%), Positives = 40/125 (32%), Gaps = 6/125 (4%) Query: 28 QNTAFARASSNGDLPTKADLQAQLDSLNKQKDLSAQDKLVQQDLTDTLATLDKIDRIKEE 87 N RA L + + L L+ + A L++ ++ E Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSR--LDDFSSLLHKQAIAKHAVLEQENKYVEA 264 Query: 88 TVQLRQKVAEAPEKMRQATAALTALSDIDND--EETRKIL--STLSLRQLETRVAQALDD 143 +LR ++ + + +A + E L +T ++ L +A+ + Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324 Query: 144 LQNAQ 148 Q + Sbjct: 325 QQASV 329
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 39.7 bits (92), Expect = 4e-05 Identities = 40/251 (15%), Positives = 78/251 (31%), Gaps = 31/251 (12%) Query: 404 PLPETTSQVLAARQQLQRVQGATKAKKSEPAA----ATRARPVNNAALERLASVTDRVQA 459 P E +Q + + + P+ AR + A + A T Sbjct: 983 PEVEKRNQTVDTTN----ITTPNNIQADVPSVPSNNEEIARV-DEAPVPPPAPATPSETT 1037 Query: 460 RPVPSALEKAPAKKEAYRWKATTPVMQQKE--------VVATPKALKKA---LEHEKTPE 508 V ++ E AT Q +E V A + + A E ++T Sbjct: 1038 ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQT 1097 Query: 509 LAAKLAA---------EAIERDAWAAQVSQLSLPKLVEQVALNAWKE-ESDNAVCLHLRS 558 K A E+ +V+ PK + + E +N ++++ Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157 Query: 559 SQRHLNNRGAQQKLAEALST-LKGSTVELTIVEDDNPAVRTPLEWRQAIYEEKLAQARES 617 Q N ++ A+ S+ ++ E T V N V P A + + + Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217 Query: 618 IIADNNIQTLR 628 + + +++R Sbjct: 1218 KPKNRHRRSVR 1228
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 77.0 bits (189), Expect = 7e-19 Identities = 48/212 (22%), Positives = 81/212 (38%), Gaps = 7/212 (3%) Query: 3 KSVLITGCSSGIGLESALELKRQGFHVLAGCRKPDDVERMNN----MGFT--GVLIDLDS 56 K ITG + GIG A L QG H+ A P+ +E++ + D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 57 PESVDRAADEVIALTDNCLYGIFNNAGFGMYGPLSTISRAQMEQQFSANFFGAHQLTMRL 116 ++D + + + N AG G + ++S + E FS N G + + Sbjct: 69 SAAIDEITARIEREMGP-IDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 117 LPAMLPHGEGRIVMTSSVMGLISTPGRGAYAASKYALEAWSDALRMELRHSGIKVSLIEP 176 M+ G IV S + AYA+SK A ++ L +EL I+ +++ P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 177 GPIRTRFTDNVNQTQSDKPVENPGIAARFTLG 208 G T ++ ++ G F G Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTG 219
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.014 Identities = 12/20 (60%), Positives = 13/20 (65%) Query: 41 LVGESGSGKSTLLAILAGLD 60 L G G GKSTL+ L GLD Sbjct: 601 LEGTGGIGKSTLINTLVGLD 620
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 55.1 bits (133), Expect = 3e-10 Identities = 39/163 (23%), Positives = 59/163 (36%), Gaps = 32/163 (19%) Query: 4 DLIIKNGTVILENEARVVDIAVKGGKIAAIG-------QD-----LGDAKEVMDASGLVV 51 D +I N ++ DI +K G+IAAIG Q +G EV+ G +V Sbjct: 69 DTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIV 128 Query: 52 SPGMVDAHTHISEPGRSHWEGYETGTRAAAKGGITTMIEMPLNQLPATVDRAS------- 104 + G +D+H H P + A G+T M+ PA A+ Sbjct: 129 TAGGMDSHIHFICPQQIE---------EALMSGLTCMLGGGTG--PAHGTLATTCTPGPW 177 Query: 105 -IELKFDAAKGKLTIDAAQLGGLVSYNIDRLHELDEVGVVGFK 146 I +AA ++ A G + L E+ G K Sbjct: 178 HIARMIEAADA-FPMNLAFAGKGNASLPGALVEMVLGGATSLK 219
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 385 bits (991), Expect = e-137 Identities = 125/310 (40%), Positives = 175/310 (56%), Gaps = 16/310 (5%) Query: 2 KTLVVALGGNALLQRGEALTAENQYRNIASAVPALARL-ARSYRLAIVHGNGPQVGLLAL 60 K +V+ALGGNAL QRG+ + E N+ +A + AR Y + I HGNGPQVG L L Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62 Query: 61 QNLAWKE---VEPYPLDVLVAESQGMIGYMLAQSLSAQPQM----PPVTTVLTRIEVSPD 113 A + + P+DV A SQG IGYM+ Q+L + + V T++T+ V + Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQALKNELRKRGMEKKVVTIITQTIVDKN 122 Query: 114 DPAFLQPEKFIGPVYQPEEQEALEAAYGWQMKRD-GKYLRRVVASPQPRKILDSEAIELL 172 DPAF P K +GP Y E + L GW +K D G+ RRVV SP P+ +++E I+ L Sbjct: 123 DPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKL 182 Query: 173 LKEGHVVICSGGGGVPVTEDG---AGSEAVIDKDLAAALLAEQINADGLVILTDADAVYE 229 ++ G +VI SGGGGVPV + G EAVIDKDLA LAE++NAD +ILTD + Sbjct: 183 VERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAAL 242 Query: 230 NWGTPQQCAIRHATPDELAPFAKAD----GSMGPKVTAVSGYVRSRGKPAWIGALSRIED 285 +GT ++ +R +EL + + GSMGPKV A ++ G+ A I L + + Sbjct: 243 YYGTEKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAIIAHLEKAVE 302 Query: 286 TLAGEAGTCI 295 L G+ GT + Sbjct: 303 ALEGKTGTQV 312
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 826 bits (2134), Expect = 0.0 Identities = 403/856 (47%), Positives = 572/856 (66%), Gaps = 20/856 (2%) Query: 20 ICYSSLAILPSFLSYAESYFNPAFLLENGTSVADLSRFERGNHQPAGVYRVDLWRNDEFI 79 + A + LS AE YFNP FL ++ +VADLSRFE G P G YRVD++ N+ ++ Sbjct: 31 FVACAFA-AQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYM 89 Query: 80 GSQDIVFESTTENTGDKSGGLMPCFNQVLLERIGLNSSAFPELAQQQNNKCINLLKAVPD 139 ++D+ F NTGD G++PC + L +GLN+++ + ++ C+ L + D Sbjct: 90 ATRDVTF-----NTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHD 144 Query: 140 ATINFDFAAMRLNITIPQIALLSSAHGYIPPEEWDEGIPALLLNYNFTGN----RGNGND 195 AT D RLN+TIPQ + + A GYIPPE WD GI A LLNYNF+GN R GN Sbjct: 145 ATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNS 204 Query: 196 SYFFSEL-SGINIGPWRLRNNGSWNYFRGNG--YHSEQWNNIGTWVQRAIIPLKSELVMG 252 Y + L SG+NIG WRLR+N +W+Y + +W +I TW++R IIPL+S L +G Sbjct: 205 HYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLG 264 Query: 253 DGNTGSDIFDGVGFRGVRLYSSDNMYPDSQQGFAPTVRGIARTAAQLTIRQNGFIIYQSY 312 DG T DIFDG+ FRG +L S DNM PDSQ+GFAP + GIAR AQ+TI+QNG+ IY S Sbjct: 265 DGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNST 324 Query: 313 VSPGAFEITDLHPTSSNGDLDVTIDEHDGNQQNYTIPYSTVPILQREGRFKFDLTAGDFR 372 V PG F I D++ ++GDL VTI E DG+ Q +T+PYS+VP+LQREG ++ +TAG++R Sbjct: 325 VPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYR 384 Query: 373 SGNSQQSSPFFFQGTALGGLPQEFTAYGGTQLSANYTAFLLGLGRNLGNWGAVSLDVTHA 432 SGN+QQ P FFQ T L GLP +T YGGTQL+ Y AF G+G+N+G GA+S+D+T A Sbjct: 385 SGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQA 444 Query: 433 RSQLADDSRHEGDSIRFLYAKSMNTFGTNFQLMGYRYSTQGFYTLDDVAYRRMEGYEYDY 492 S L DDS+H+G S+RFLY KS+N GTN QL+GYRYST G++ D Y RM GY Sbjct: 445 NSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGY-NIE 503 Query: 493 DYDGEHRDEPIIVNYHNLRFSRKDRLQLNISQSLNDFGSLYISGTHQKYWNTSDSDTWYQ 552 DG + +P +Y+NL ++++ +LQL ++Q L +LY+SG+HQ YW TS+ D +Q Sbjct: 504 TQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQ 563 Query: 553 VGYTSSWVGISYSLSFSWNESVGIPDNERIVGLNVSVPFNVLTKRRYTWENALDRAYASF 612 G +++ I+++LS+S ++ ++++ LNV++PF+ R ++ A AS+ Sbjct: 564 AGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWL--RSDSKSQWRHASASY 621 Query: 613 NANRNSNGQNSWLAGVGGTLLEGHNLSYHVSQG----DTSNNGYTGSATANWQAAYGTLG 668 + + + NG+ + LAGV GTLLE +NLSY V G N+G TG AT N++ YG Sbjct: 622 SMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNAN 681 Query: 669 VGYNYDRDQHDVNWQLSGGVVGHENGITLSQPLGDTNVLIKAPGAGGVRIENQTGILTDW 728 +GY++ D + + +SGGV+ H NG+TL QPL DT VL+KAPGA ++ENQTG+ TDW Sbjct: 682 IGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDW 741 Query: 729 RGYAVMPYATVYRYNRIALDTNTMGNSIDVEKNISSVVPTQGALVRANFDTRIGVRALIT 788 RGYAV+PYAT YR NR+ALDTNT+ +++D++ +++VVPT+GA+VRA F R+G++ L+T Sbjct: 742 RGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMT 801 Query: 789 VTQGGKPVPFGSLVRENSTGITSMVGDDGQVYLSGAPLSGELLVQWGNGANSRCIAHYVL 848 +T KP+PFG++V S+ + +V D+GQVYLSG PL+G++ V+WG N+ C+A+Y L Sbjct: 802 LTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQL 861 Query: 849 PKQSLQQAVTVISAVC 864 P +S QQ +T +SA C Sbjct: 862 PPESQQQLLTQLSAEC 877
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 60.6 bits (147), Expect = 4e-13 Identities = 26/122 (21%), Positives = 55/122 (45%), Gaps = 2/122 (1%) Query: 1 MKPTSVIIMDTHPIIRMSIEVLLQKNSELQIVLKTDDYRITIDYLRTRPVDLIIMDIDLP 60 M ++++ D IR + L + V T + ++ DL++ D+ +P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GTDGFTFLKRIKQIQSTVKVLFLSSKSECFYAGRAIQAGANGFVSKCNDQNDIFHAVQMI 120 + F L RIK+ + + VL +S+++ A +A + GA ++ K D ++ + Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 121 LS 122 L+ Sbjct: 119 LA 120
>PF07201#Hypersensitivity response secretion protein HrpJ Length = 293 Score = 30.2 bits (68), Expect = 0.034 Identities = 17/72 (23%), Positives = 25/72 (34%), Gaps = 1/72 (1%) Query: 629 PAAVSDLRAALELEPNNSNIQAALGYALWDSGDIAQSREMLEQAHKGLPDDPALIRQLAY 688 VS+L + L N ++ Y S + ++ +ML L P L Sbjct: 100 KQNVSELLSLL-SNSPNISLSQLKAYLEGKSEEPSEQFKMLCGLRDALKGRPELAHLSHL 158 Query: 689 VNQRLDDMPATQ 700 V Q L M Q Sbjct: 159 VEQALVSMAEEQ 170
>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature. Length = 52 Score = 57.9 bits (140), Expect = 5e-16 Identities = 18/50 (36%), Positives = 27/50 (54%) Query: 1 MLTKYALVAVIVLCLTVLGFTLLVGDSLCEFTVKERNIEFKAVLAYEPKK 50 + + V+++CLT+L FT L SLCE ++ E A +AYE K Sbjct: 3 LPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYESGK 52
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 274 bits (701), Expect = 3e-96 Identities = 105/183 (57%), Positives = 131/183 (71%), Gaps = 1/183 (0%) Query: 4 MKTTHTSLPFAGHTLHFVEFDPANFCEQDLLWLPHYAQLQHAGRKRKAEHLAGRIAAIYA 63 M T+H LPFAGH LH V+FD ++F E DLLWLPH+ +L+ AGRKRKAEHLAGRIAA++A Sbjct: 1 MLTSHFPLPFAGHRLHIVDFDASSFREHDLLWLPHHDRLRSAGRKRKAEHLAGRIAAVHA 60 Query: 64 LREYGYKCVPAIGELRQPVWPAEVYGSISHCGATALAVVSRQPIGVDIEEIFSAQTATEL 123 LRE G + VP +G+ RQP+WP ++GSISHC TALAV+SRQ IG+DIE+I S TATEL Sbjct: 61 LREVGVRTVPGMGDKRQPLWPDGLFGSISHCATTALAVISRQRIGIDIEKIMSQHTATEL 120 Query: 124 TDNIITPAEHERLADCGLAFSLALTLAFSAKESAFKA-SEIQTDAGFLDYQIISWNKQQV 182 +II E + L L F LALTLAFSAKES +KA S+ T GF ++ S + Sbjct: 121 APSIIDSDERQILQASLLPFPLALTLAFSAKESVYKAFSDRVTLPGFNSAKVTSLTATHI 180 Query: 183 IIH 185 +H Sbjct: 181 SLH 183
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.3 bits (84), Expect = 2e-04 Identities = 82/394 (20%), Positives = 145/394 (36%), Gaps = 38/394 (9%) Query: 27 FISIVSLGLLGVAVPVQIQMMTHSTWQV---GLSVTLTGGAMFVGLMVGGVLADRYERKK 83 + V +GL+ +P ++ + HS G+ + L F V G L+DR+ R+ Sbjct: 15 ALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRP 74 Query: 84 VILLARGTCGIGFIGLCLNALL--PEPSLLAIYLLGLWDGFFASLGVTALLAATPALVGR 141 V+L + G ++ + P L +Y+ + G + G A A + Sbjct: 75 VLL-------VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVA-GAYIADITDG 126 Query: 142 ENLMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNYGLAAAGTFITLLPLLSLPALPP 201 + + G V P++GGL+ GG + + AA L L LP Sbjct: 127 DERARHFGFMSACFGFGMVAGPVLGGLM---GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183 Query: 202 PPQPREHPLK----SLLAGFRFLLASPLVGGIALLGGLLTMAS----AVRVLYPALADNW 253 + PL+ + LA FR+ +V + + ++ + A+ V++ D + Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG--EDRF 241 Query: 254 QMSAAQIGFLYAAIP-LGAAIGALTSGKLAHSARPGLLMLLSTLGS---FLAIGLFGLMP 309 A IG AA L + A+ +G +A ++L + ++ + Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 310 MWILGVVCLALFGWLSAVSSLLQYTMLQTQTPEAMLGRINGLWTAQNVTGDAIGAALLGG 369 M +V LA G ML Q E G++ G A +G L Sbjct: 302 MAFPIMVLLASGGIGMPALQ----AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357 Query: 370 LGAMMTPVASASASGFGLLIIGVLLLLVLVELRR 403 + A + + +G+ + L LL L LRR Sbjct: 358 IYA----ASITTWNGWAWIAGAALYLLCLPALRR 387
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 62.3 bits (151), Expect = 3e-13 Identities = 56/266 (21%), Positives = 96/266 (36%), Gaps = 35/266 (13%) Query: 40 HTLESQPQRIVSTSVTLTGSLLAIDAPVIASGATTPNNRVADDQGFLRQWSKVAKERKLQ 99 H P RIV+ LLA+ VAD + R W E L Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINY-RLW---VSEPPLP 75 Query: 100 RLYIG-----EPSAEAVAAQMPDLILISATGGDSALALYDQLSTIAPTLIINYDDKS--- 151 I EP+ E + P ++ SA G S + L+ IAP N+ D Sbjct: 76 DSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPS----PEMLARIAPGRGFNFSDGKQPL 131 Query: 152 --WQSLLTQLGEITGHEKQAAERIAQFDKQLAAAKEQIKLPPQPVTAIVYTAAAHSANLW 209 + LT++ ++ + A +AQ++ + + K + + ++ Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191 Query: 210 TPESAQGQMLEQLGFTLAKLPAGLNASQSQGKRHDIIQLGGENLAAGLNGESLFLFAGDQ 269 P S ++L++ G NA Q + + + LAA + + L + Sbjct: 192 GPNSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNS 243 Query: 270 KDADAIYANPLLAHLPAVQNKQVYAL 295 KD DA+ A PL +P V+ + + Sbjct: 244 KDMDALMATPLWQAMPFVRAGRFQRV 269
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 443 bits (1140), Expect = e-160 Identities = 146/299 (48%), Positives = 195/299 (65%), Gaps = 18/299 (6%) Query: 1 MAIPKLQAYALPESHDIPKNKVDWAFEPQRAALLIHDMQDYFVSFWGENCPMMEQVIANI 60 MAIP +Q Y +P + D+P+NKV W +P RA LLIHDMQ+YFV + + ++ ANI Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60 Query: 61 AALRDYCKQHNIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQKVVDRLTPDADDTV 120 L++ C Q IPV YTAQP Q+ +DRALL D WGPGL P ++K++ L P+ DD V Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120 Query: 121 LVKWRYSAFHRSPLEQMLKESGRNQLIITGVYAHIGCMTTATDAFMRDIKPFMVADALAD 180 L KWRYSAF R+ L +M+++ GR+QLIITG+YAHIGC+ TA +AFM DIK F V DA+AD Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180 Query: 181 FSRDEHLMSLKYVAGRSGRVVMTEELL------PAPVPASKA-----------ALREVIL 223 FS ++H M+L+Y AGR VMT+ LL PA V + A +R+ I Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240 Query: 224 PLLDESDEPFDDD-NLIDYGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWKLLS 281 LL E+ E D +L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W KLL+ Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLT 299
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 363 bits (932), Expect = e-130 Identities = 108/258 (41%), Positives = 149/258 (57%), Gaps = 20/258 (7%) Query: 5 GKNVWVTGAGKGIGYATALAFVEAGAKVTGFD---------------QAFTQEQYPFATE 49 GK ++TGA +GIG A A GA + D +A E +P Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63 Query: 50 VMDVADAGQVAQVCQRLLAETERLDVLINAAGILRMGATDQLSKEDWQQTFAVNVGGAFN 109 DV D+ + ++ R+ E +D+L+N AG+LR G LS E+W+ TF+VN G FN Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122 Query: 110 LFQQTMNQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALSVGLELAGSGVRC 169 + +R G+IVTV S+ A PR M+AY +SKAA +GLELA +RC Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 170 NVVSPGSTDTDMQRTLWVSDDAEEQRIRGFGEQFKLGIPLGKIARPQEIANTILFLASDL 229 N+VSPGST+TDMQ +LW ++ EQ I+G E FK GIPL K+A+P +IA+ +LFL S Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 230 ASHITLQDIVVDGGSTLG 247 A HIT+ ++ VDGG+TLG Sbjct: 243 AGHITMHNLCVDGGATLG 260
>BCTLIPOCALIN#Bacterial lipocalin signature. Length = 171 Score = 28.8 bits (64), Expect = 0.014 Identities = 18/98 (18%), Positives = 39/98 (39%), Gaps = 13/98 (13%) Query: 30 QGITIIKTFDAPGGMKGYLGKYQDMGVTIYLTPDGKHAISG--YMYNEKGENLSNTLIEK 87 + + + F+ YLGK+ ++ + G ++ + N+ G ++ N Sbjct: 21 ESVKPVSDFEL----NNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLN----- 71 Query: 88 EIYAPAGREMWQRMEQSHWLLDGKKDAPVIVYVFADPF 125 Y+ + W+ E + ++G D + V F PF Sbjct: 72 RGYSEE-KGEWKEAEGKAYFVNGSTDGYLKVSFFG-PF 107
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 91.4 bits (227), Expect = 1e-23 Identities = 35/125 (28%), Positives = 58/125 (46%), Gaps = 1/125 (0%) Query: 2 TNVLIVEDEQAIRRFLRTALEGDGMRVYEAETLQRGLLEAATRKPDLIILDLGLPDGDGI 61 +L+ +D+ AIR L AL G V A DL++ D+ +PD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 EFIRDLRQWSA-VPVIVLSARSEESDKIAALDAGADDYLSKPFGIGELQARLRVALRRHS 120 + + +++ +PV+V+SA++ I A + GA DYL KPF + EL + AL Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 121 ATAAP 125 + Sbjct: 124 RRPSK 128
>PF06580#Sensor histidine kinase Length = 349 Score = 31.8 bits (72), Expect = 0.011 Identities = 10/48 (20%), Positives = 21/48 (43%), Gaps = 4/48 (8%) Query: 785 LLENAVKYAGAQAE----IGIDAHVEGENLQLDVWDNGPGLPPGQEQT 828 L+EN +K+ AQ I + + + L+V + G +++ Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKES 310
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 27.9 bits (62), Expect = 0.038 Identities = 10/37 (27%), Positives = 18/37 (48%) Query: 91 RRGAIHVISAFSTMHSLVIGQIKTDKKSNEITAIPEL 127 R + ++ SL+IG I D+ S ++ +P L Sbjct: 442 SRTVVDTVARVGHGQSLIIGGIYRDELSVALSKVPLL 478
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 609 bits (1571), Expect = 0.0 Identities = 240/873 (27%), Positives = 390/873 (44%), Gaps = 74/873 (8%) Query: 4 YRLSLVSCLVVAMPCALA-----------VEFNLNVLDKSMRDRIDISLLKEKGVIAPGE 52 R ++ V + A A + FN L + D+S + + PG Sbjct: 18 IRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGT 77 Query: 53 YFVSVAVNNNQISNGQKINWHKNDDK--TIPCINDLLVDKFGLKPEVRQSLPLI--NQCV 108 Y V + +NN ++ + + ++ D + +PC+ + GL + L+ + CV Sbjct: 78 YRVDIYLNNGYMAT-RDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACV 136 Query: 109 DFSSR-PEMLFNFDQANQQLNITIPQAWLAWHSENWTPPSTWKEGVAGVLMDYNLFASSY 167 +S + D Q+LN+TIPQA+++ + + PP W G+ L++YN +S Sbjct: 137 PLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSV 196 Query: 168 RPQDGSSSTNLNAYGTAGINTGAWRLRSDYQLNQTDSDDNHEQSG--EISRTYLFRPLPQ 225 + + G +S +G+N GAWRLR + + SD + + T+L R + Sbjct: 197 QNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIP 256 Query: 226 LGSKLTLGETDFSSNIFDGFSYTGAALASDDRMLPWELRGYAPQISGIAQTNATVTISQS 285 L S+LTLG+ +IFDG ++ GA LASDD MLP RG+AP I GIA+ A VTI Q+ Sbjct: 257 LRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQN 316 Query: 286 GRVIYQKKVPPGPFIIDDLNQ-SVQGTLDVKVTEEDGRVNNFQVSAASTPFLTRQGQVRY 344 G IY VPPGPF I+D+ G L V + E DG F V +S P L R+G RY Sbjct: 317 GYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRY 376 Query: 345 KLAAGQPRPSMSHQTENETFFSNEVSWGMLSNTSLYGGLLLSGDDYHSAAIGIGQNMLWL 404 + AG+ R + Q E FF + + G+ + ++YGG L+ D Y + GIG+NM L Sbjct: 377 SITAGEYRSG-NAQQEKPRFFQSTLLHGLPAGWTIYGGTQLA-DRYRAFNFGIGKNMGAL 434 Query: 405 GALSFDVTWASSHFDTQQDERGLSYRFNYSKQVDATNSTISLAAYRFSDRHFHSYANYLD 464 GALS D+T A+S G S RF Y+K ++ + + I L YR+S + ++A+ Sbjct: 435 GALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTY 494 Query: 465 HKYND--------------------SDAQDEKQTISLSVGQPITPLNLNLYANLLHQTWW 504 + N + A +++ + L+V Q + LY + HQT+W Sbjct: 495 SRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQL-GRTSTLYLSGSHQTYW 553 Query: 505 NADASTTANITAGFNVDIGDWRDISISTSFNTTHYE-DKDRDNQIYLSISLPFGNGGR-- 561 AG N + DI+ + S++ T K RD + L++++PF + R Sbjct: 554 GTSNVDE-QFQAGLNT---AFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSD 609 Query: 562 ---------VGYDMQNSSHS-TTHRMSWNDTLDERN--SWGMSAGL-QSDRPDNGAQVSG 608 Y M + + T+ TL E N S+ + G ++G+ Sbjct: 610 SKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYA 669 Query: 609 NYQHLSSAGEWDISGTYAANDYSSVSSSWSGSFTATQYGAAFHRRSSTNEPRLMVSTDGV 668 + G +I ++ ++D + SG A G + N+ ++V G Sbjct: 670 TLNYRGGYGNANIGYSH-SDDIKQLYYGVSGGVLAHANGVTLGQP--LNDTVVLVKAPGA 726 Query: 669 ADIPVQGNLDY-TNHFGIAVVPLISSYQPSTVAVNMNDLPDGVTVAENVIKETWIEGAIG 727 D V+ T+ G AV+P + Y+ + VA++ N L D V + V GAI Sbjct: 727 KDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIV 786 Query: 728 YKSLASRSGKDVNVIIRNASGQFPPLGADIRQDDSGISVGMVGEEGHAWLSGVAENQNFT 787 +R G + + + + + P GA + +S S G+V + G +LSG+ Sbjct: 787 RAEFKARVGIKLLMTLT-HNNKPLPFGAMV-TSESSQSSGIVADNGQVYLSGMPLAGKVQ 844 Query: 788 VVWG--DSQHCSLH--LPEH-MEDTANRLILPC 815 V WG ++ HC + LP + +L C Sbjct: 845 VKWGEEENAHCVANYQLPPESQQQLLTQLSAEC 877
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 35.0 bits (80), Expect = 7e-05 Identities = 43/179 (24%), Positives = 74/179 (41%), Gaps = 26/179 (14%) Query: 14 SLLFTAPVYAADEGSGEIHFKGEVIEAPCEIHQDDIDKEVELGQVTTSHINQS-HHSDAV 72 ++L + V+AAD + FKG++I C + + EV G + ++ QS + Sbjct: 15 AVLMSQHVHAADN----LTFKGKLIIPACTVQ----NAEVNWGDIEIQNLVQSGGNQKDF 66 Query: 73 AVDLRLVNCDLENSSNGSGGKISKVAVTFDSSAKTTGADPILNNTSTGEATGVGVRLMNK 132 VD+ NC + + VT S+ TG ++ NTST G+ + L N Sbjct: 67 TVDM---NCPYS---------LGTMKVTITSNG-QTGNSILVPNTSTASGDGLLIYLYNS 113 Query: 133 DQSNI----VLGTATPDIDLAPTSSEQTLNFFAWMEQIDQATPVTPGAVTANATYVLDY 187 + S I LG+ + T+ + + +A + + G +A AT V Y Sbjct: 114 NNSGIGNAVTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQSLQAGTFSATATLVASY 172
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 31.4 bits (71), Expect = 0.003 Identities = 11/41 (26%), Positives = 23/41 (56%), Gaps = 1/41 (2%) Query: 14 VDDAPRMQDYTLEAEEGRDM-MLLDALIQLKEKDPSLSFRR 53 +++ + T+E + + MLLDAL+++ + DP L + Sbjct: 339 IENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 60.1 bits (145), Expect = 9e-12 Identities = 34/199 (17%), Positives = 69/199 (34%), Gaps = 8/199 (4%) Query: 99 EQERLKQLEKERLAAQEQKKQAEEAAKQAELKQKQAEEAAAKAAADAKAKAEADAKAAEE 158 E E+ Q QA+ + E A A ++ E Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVP----SNNEEIARVDEAPVPPPAPATPSETTET 1039 Query: 159 AAK--KAAADAKKKAEAEAAKAAAEAQKKAEAAAAALKKKAEAAEAA--AAEARKKAATE 214 A+ K + +K E +A + A+ ++ A+ A + +K + E A +E ++ TE Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTE 1099 Query: 215 AAEKAKAEAEKKAAAEKAAADKKAAAEKAAADKKAAEKAAAEKAAADKKAAAEKAAADKK 274 E A E E+KA E + + K+ + +A ++ + + Sbjct: 1100 TKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQ 1159 Query: 275 AAAAKAAAEKAAAAKAAAE 293 + A + A + ++ Sbjct: 1160 SQTNTTADTEQPAKETSSN 1178 Score = 57.0 bits (137), Expect = 9e-11 Identities = 30/236 (12%), Positives = 85/236 (36%), Gaps = 11/236 (4%) Query: 68 QSQESSAKRSDEQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQA 127 Q+ S ++E+ ++ +E ++ + +K E+ A + Sbjct: 1004 QADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKN--EQDATET 1061 Query: 128 ELKQKQ-AEEAAAKAAAD------AKAKAEADAKAAEEAAKKAAADAKKKAEAEAAKAAA 180 + ++ A+EA + A+ A++ +E E + A + ++KA+ E K Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQE 1121 Query: 181 EAQKKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAA 240 + ++ + ++++E + A AR+ T ++ +++ A E+ A + + Sbjct: 1122 VPKVTSQVSPK--QEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNV 1179 Query: 241 EKAAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKAAAEADD 296 E+ + + + A ++ K + ++ + Sbjct: 1180 EQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVE 1235 Score = 56.2 bits (135), Expect = 2e-10 Identities = 28/228 (12%), Positives = 75/228 (32%), Gaps = 2/228 (0%) Query: 66 RMQSQESSAKRSDEQRKMKEQQAAEELREKQAAEQERLKQLEKERLAAQEQKKQAEEAAK 125 R ++E+ + + + Q+ E +E Q E + +EKE A E +K E Sbjct: 1066 REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKV 1125 Query: 126 QAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAK--KAAADAKKKAEAEAAKAAAEAQ 183 +++ KQ + + A+ + + E ++ A + E + + Sbjct: 1126 TSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTE 1185 Query: 184 KKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAEAEKKAAAEKAAADKKAAAEKA 243 ++ + E A + + + K + ++ ++ +++ Sbjct: 1186 STTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRS 1245 Query: 244 AADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAAEKAAAAKAA 291 +D +A A+ A + A ++ ++ + Sbjct: 1246 TVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLEMNNEGQ 1293 Score = 55.5 bits (133), Expect = 2e-10 Identities = 32/265 (12%), Positives = 86/265 (32%), Gaps = 14/265 (5%) Query: 51 DAVMVDSGAVVEQYKRMQSQESSAKRSDEQRKMKEQQAAE-ELREKQAAEQER------L 103 D V A + ++ ++K+ + + EQ A E + ++ A++ + Sbjct: 1021 DEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANT 1080 Query: 104 KQLEKERLAAQEQKKQAEEAAKQAELKQKQAEEAAAKAAADAKAKAEADAKAAEEAAKKA 163 + E + ++ ++ Q K+ +K+ KA + + E ++ + K+ Sbjct: 1081 QTNEVAQSGSETKETQ-TTETKETATVEKE-----EKAKVETEKTQEVPKVTSQVSPKQE 1134 Query: 164 AADA-KKKAEAEAAKAAAEAQKKAEAAAAALKKKAEAAEAAAAEARKKAATEAAEKAKAE 222 ++ + +AE K+ ++ + A+ ++ + Sbjct: 1135 QSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNS 1194 Query: 223 AEKKAAAEKAAADKKAAAEKAAADKKAAEKAAAEKAAADKKAAAEKAAADKKAAAAKAAA 282 + A + +++ K + + + + A + A + Sbjct: 1195 VVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTS 1254 Query: 283 EKAAAAKAAAEADDIFGELSSGKNA 307 A + A A F L+ GK Sbjct: 1255 TNTNAVLSDARAKAQFVALNVGKAV 1279
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 24.5 bits (53), Expect = 0.032 Identities = 7/23 (30%), Positives = 11/23 (47%) Query: 10 VGTITFVYSVTKRGWVFPGLSVI 32 VG + F+ T F +SV+ Sbjct: 209 VGIVFFMLFTTSYSISFLIVSVL 231
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 28.6 bits (64), Expect = 0.007 Identities = 18/66 (27%), Positives = 26/66 (39%), Gaps = 7/66 (10%) Query: 57 IMLAQHALLIAISSDLNAYGVVCEFDWN----DGNGQEGWPPMDGSEGIRITD---IDTS 109 + LA L I + D +G +F DG GQ G+ IT+ +D Sbjct: 22 VRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTREGGAVDTVITNALILDHW 81 Query: 110 GIFDSD 115 GI +D Sbjct: 82 GIVKAD 87
>PF06291#Lambda prophage Bor protein Length = 102 Score = 28.1 bits (62), Expect = 0.021 Identities = 17/69 (24%), Positives = 31/69 (44%), Gaps = 6/69 (8%) Query: 244 EQLNTLNQRQSIDRKKETRRDNFYVTGVRRSGETNQDINIDVIRVSNGESFTIKTSADIT 303 +Q T+ + + KET +F+V+G+ Q +D ++ G +KT T Sbjct: 24 QQTFTVGNKPTAVTPKETITHHFFVSGI------GQKKTVDAAKICGGAENVVKTETQQT 77 Query: 304 STDELLEFL 312 + LL F+ Sbjct: 78 FVNGLLGFI 86
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 28.6 bits (64), Expect = 0.018 Identities = 17/77 (22%), Positives = 27/77 (35%), Gaps = 11/77 (14%) Query: 2 KNIAAQMVNFDREQM-----------RRIANNMPEQYDEKPQVQQVAQIINGVFSQLLAT 50 N+A ++ DR +++A+ E Y V V +IIN + Sbjct: 165 TNVARRIALMDRTSPEVVREVERVLEKKLASLSSEDYTSAGGVDNVVEIINMADRKTEKF 224 Query: 51 FPASLANRDQNEVNEIR 67 SL D EI+ Sbjct: 225 IIESLEEEDPELAEEIK 241
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 25.8 bits (56), Expect = 0.049 Identities = 19/72 (26%), Positives = 28/72 (38%), Gaps = 7/72 (9%) Query: 10 RWGAWAANNHEDVTWSSIAAGFKGLIPSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHD 69 +W WA N + FK IP K ++P D + + A + +H+ Sbjct: 195 QWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA----GHITMHN 250 Query: 70 LLVDYYVGGMTF 81 L VD GG T Sbjct: 251 LCVD---GGATL 259
>BCTLIPOCALIN#Bacterial lipocalin signature. Length = 171 Score = 231 bits (589), Expect = 1e-80 Identities = 84/151 (55%), Positives = 111/151 (73%), Gaps = 1/151 (0%) Query: 25 PKGVQPITNFDANRYLGKWYEIARLENRFERGLEQVSATYGKRNDGGIRVLNRGYDPTKN 84 P+ V+P+++F+ N YLGKWYE+ARL++ FERGL QV+A Y RNDGGI VLNRGY K Sbjct: 20 PESVKPVSDFELNNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLNRGYSEEKG 79 Query: 85 KWSESEGKAYFTGDTKTAALKVSFFGPFYGGYNVIKLDDE-YKYALVSGPNREYLWILAR 143 +W E+EGKAYF + LKVSFFGPFYG Y V +LD E Y YA VSGPN EYLW+L+R Sbjct: 80 EWKEAEGKAYFVNGSTDGYLKVSFFGPFYGSYVVFELDRENYSYAFVSGPNTEYLWLLSR 139 Query: 144 TPTIPDKVKADYVRTAQKLGFNVNELLWVKQ 174 TPT+ + ++ +++ GF+ N L++V+Q Sbjct: 140 TPTVERGILDKFIEMSKERGFDTNRLIYVQQ 170
>INTIMIN#Intimin signature. Length = 939 Score = 30.0 bits (67), Expect = 0.011 Identities = 29/202 (14%), Positives = 61/202 (30%), Gaps = 29/202 (14%) Query: 66 DWTATGQGQKSAGDTSFT----LAWMPGEQGQQALLAWFNEGDTRAYKIRFPNGTVDVFR 121 G G+ + S + + AL + A I + Sbjct: 611 SANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSAL-------NANAV-IFVDQTKASITE 662 Query: 122 GWVSSIGKAVTAKEVITRTVKVTNVGRPSMAEDRSTVTAATGMTVTP--------ASASV 173 ++ IT TVKV +P ++ + T ++ + A ++ Sbjct: 663 IKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTL 722 Query: 174 VKGQSTTLTVAFQPEGV-------TDKSFRAVSADKTKATVSVSGMTITV--NGVAAGKV 224 V+ + V + F ++ D + +G+ + + G+V Sbjct: 723 TSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKLPTVWLQYGQV 782 Query: 225 NIPVVSGNGEFAAVAEITVTAS 246 N+ GNG++ + AS Sbjct: 783 NLKASGGNGKYTWRSANPAIAS 804
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 40.0 bits (93), Expect = 4e-05 Identities = 55/377 (14%), Positives = 127/377 (33%), Gaps = 36/377 (9%) Query: 236 SGLTAMARQFHNVTAEQIAYVAQLQRSGDEAGALQAANEAATKGFDDQTRRLKENMGTLE 295 S R+ +E+ + + +L+ + + + + L+ L Sbjct: 95 SNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALA 154 Query: 296 TWADRTARAFKSMWDAVLDI-GRPDTAQEMLIKAEAAFKKADDIWNLRKDDYFVNDEARA 354 +A + + + T + EA + + + + Sbjct: 155 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIK 214 Query: 355 RYWDDREKARLAL-EVARKKAEQQSQQDKNAQQQSDTEASRLKYTEEAQKAYERLQTPLE 413 ++ ++ + + ++ + EA + + + L+ + Sbjct: 215 TLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMN 274 Query: 414 KYTARQEELNKALKDGKILQADYNTLMAAAKKDYEATLKKPKQSGVKVSAGDRQEDSAHA 473 TA ++ + L+A+ L + A + + R D++ Sbjct: 275 FSTADSAKIKTLEAEKAALEAEKADLEHQ-SQVLNANRQSLR----------RDLDASRE 323 Query: 474 ALLTLQAELRTLEKHAGANEKISQQ-RRDL-------WKAESQFAVLEEAAQRRQLSAQE 525 A L+AE + LE+ +E Q RRDL + E++ LEE + + S Q Sbjct: 324 AKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQS 383 Query: 526 KS--LLAHKDETLEYKRQLAALGDKVTYQERLNALAQQADKFAQQQRAKRAAIDAKSRGL 583 L A ++ + ++ L K+ E+LN +++ K ++++A Sbjct: 384 LRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKA------------ 431 Query: 584 TDRQAEREATEQRLKEQ 600 + QA+ EA + LKE+ Sbjct: 432 -ELQAKLEAEAKALKEK 447
>PF06291#Lambda prophage Bor protein Length = 102 Score = 27.3 bits (60), Expect = 0.018 Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 5/40 (12%) Query: 102 MTGILFSLGASMVLGGVAQML-----APKARTPRTQTTDN 136 M +LFS +M++ G AQ P A TP+ T + Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHH 45
>SURFACELAYER#Lactobacillus surface layer protein signature. Length = 439 Score = 35.8 bits (82), Expect = 0.001 Identities = 35/143 (24%), Positives = 46/143 (32%), Gaps = 30/143 (20%) Query: 993 SVNANAGTLNNVTVNENCTIKGMLEATQV----RGDF---------VKAVSKSFPKQAGT 1039 + + L NVT + +K L+A ++ G F VKA S K A Sbjct: 235 AAQYDKKQLTNVTFDTETAVKDALKAQKIEVSSVGYFKAPHTFTVNVKATSNKNGKSATL 294 Query: 1040 WGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYDDPGSGNNPGGTRYTGYGFEVRK 1099 PN V S I+ N YD + G R Sbjct: 295 PVTVTVPNVADPVVPSQSKT---------IMHNAYFYDKDA--------KRVGTDKVTRY 337 Query: 1100 NGVLIASRETKGAIPGSYSAVID 1122 N V +A TK A SY VI+ Sbjct: 338 NTVTVAMNTTKLANGISYYEVIE 360
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 141 bits (357), Expect = 4e-45 Identities = 68/202 (33%), Positives = 106/202 (52%), Gaps = 34/202 (16%) Query: 1 MRK-VCAAILSAAICLAVSGAPAWASEHQSTLSAGYLHASTNVPG-SDDLNGINVKYRYE 58 M+K C + L+A LA + + A+ ST++ GY A ++ G + + G N+KYRYE Sbjct: 1 MKKIACLSALAA--VLAFTAGTSVAA--TSTVTGGY--AQSDAQGQMNKMGGFNLKYRYE 54 Query: 59 FTDT-LGLVTSFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMA 117 ++ LG++ SF+Y T S T D +N+++ + AGP+ R+N+W S Y + Sbjct: 55 EDNSPLGVIGSFTY--------TEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVV 106 Query: 118 GVAYSRVSTFSGDYLHVTDNKGKKHDVLTGSDDGRHSNTSLAWGAGVQFNPTESVAIDIA 177 GV Y + T T+ KHD S+ ++GAG+QFNP E+VA+D + Sbjct: 107 GVGYGKFQT--------TEYPTYKHD---------TSDYGFSYGAGLQFNPMENVALDFS 149 Query: 178 YEGSGSGDWRTDGFIVGVGYKF 199 YE S +I GVGY+F Sbjct: 150 YEQSRIRSVDVGTWIAGVGYRF 171
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 29.5 bits (66), Expect = 0.014 Identities = 12/38 (31%), Positives = 22/38 (57%) Query: 169 LSSDNIKYSLNIIQYSLNRHLSTWNIVYTKTISVFWLV 206 + ++N K + +++ SLN LS W ++Y+K W V Sbjct: 1 MKTENAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYV 38
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.7 bits (77), Expect = 0.001 Identities = 34/150 (22%), Positives = 65/150 (43%), Gaps = 6/150 (4%) Query: 191 LLIGVVVLAMAFAEGSANDWL-PLLMVDGHGFSP-TSGSLIYAGFTLGMTVGRFTGGWFI 248 +IGV+ + F + + P +M D H S GS+I T+ + + + GG + Sbjct: 258 FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILV 317 Query: 249 DRYSRVAVVR-ASALM--GALGIGLIIFVDSAWVA-GVSVVLWGLGASLGFPLTISAASD 304 DR + V+ + L ++ S ++ + VL GL + TI ++S Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSL 377 Query: 305 TGPDAPTRVSVVATTGYLAFLVGPPLLGYL 334 +A +S++ T +L+ G ++G L Sbjct: 378 KQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 49.6 bits (118), Expect = 6e-10 Identities = 14/83 (16%), Positives = 30/83 (36%), Gaps = 4/83 (4%) Query: 2 RRANDPQRREKIIQATLEAVKLYGIHAVTHRKIATLAGVPLGSMTYYFSGIDELLLEAFS 61 + + R+ I+ L G+ + + +IA AGV G++ ++F +L E + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW- 63 Query: 62 SFTEIMSRQYQAFFSDVSDAQGA 84 E+ + Sbjct: 64 ---ELSESNIGELELEYQAKFPG 83
>DNABINDNGFIS#DNA-binding protein FIS signature. Length = 98 Score = 29.2 bits (65), Expect = 0.020 Identities = 20/72 (27%), Positives = 37/72 (51%), Gaps = 9/72 (12%) Query: 100 QQRFNPDMVILADVNAQPSHISRPLMQRIE-----YFSSL-GRP--KAYSRYLRETIKPC 151 +QR N D++ ++ VN+Q +PL ++ YF+ L G+ Y L E +P Sbjct: 3 EQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQPL 62 Query: 152 LER-LEHVRDSQ 162 L+ +++ R +Q Sbjct: 63 LDMVMQYTRGNQ 74
>cloacin#Cloacin signature. Length = 551 Score = 32.8 bits (74), Expect = 0.004 Identities = 21/67 (31%), Positives = 25/67 (37%), Gaps = 5/67 (7%) Query: 570 GGDGQSGNIGVSGEGGTSYWGGGGRAGAGGGVRGRAFGS-----GGGGAYDAGYSGTSMT 624 GGDG+ N G G G G GG G + S GGG + G S Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62 Query: 625 GGKGADG 631 G G +G Sbjct: 63 GNGGGNG 69 Score = 32.0 bits (72), Expect = 0.007 Identities = 31/82 (37%), Positives = 34/82 (41%), Gaps = 8/82 (9%) Query: 552 GGNGGVPSTGDIRITGGDGGDGQSGNIGVSGEGGTSY---WGGGGRAGAG-GGVRGRAFG 607 G N G ST I GG G G G G G +S WGGG +G GG G G Sbjct: 8 GHNTGAHSTSG-NINGGPTGLGVGGG-ASDGSGWSSENNPWGGGSGSGIHWGGGSGH--G 63 Query: 608 SGGGGAYDAGYSGTSMTGGKGA 629 +GGG G SGT A Sbjct: 64 NGGGNGNSGGGSGTGGNLSAVA 85 Score = 30.1 bits (67), Expect = 0.028 Identities = 24/89 (26%), Positives = 30/89 (33%) Query: 538 GGEGGGKVGVTNTNGGNGGVPSTGDIRITGGDGGDGQSGNIGVSGEGGTSYWGGGGRAGA 597 GG+G G ++ GN TG G G G S G G S GG +G Sbjct: 3 GGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGH 62 Query: 598 GGGVRGRAFGSGGGGAYDAGYSGTSMTGG 626 G G G G G + + G Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAPVAFG 91
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 38.8 bits (90), Expect = 8e-06 Identities = 30/159 (18%), Positives = 53/159 (33%), Gaps = 20/159 (12%) Query: 7 RLDKNDAAVLLVDHQAGLLSLVRDIEP--DKFKNNVLALGDLAKYFNLPTILTT---SFE 61 D N A +L+ D Q + + N+ L + +P + T S Sbjct: 25 VPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQN 84 Query: 62 TGPNGPLV----PELKAQFPDAPYIAR----PGNI-------NAWDNEDFVKAVKATGKK 106 L P L + + I ++ +A+ + ++ ++ G+ Sbjct: 85 PDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRD 144 Query: 107 QLIIAGVVTEVCVAFPALSAIEEGFDVFVVTDASGTFNE 145 QLII G+ + A A E F V DA F+ Sbjct: 145 QLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSL 183
>INTIMIN#Intimin signature. Length = 939 Score = 29.7 bits (66), Expect = 0.006 Identities = 34/169 (20%), Positives = 58/169 (34%), Gaps = 4/169 (2%) Query: 10 ITVVCATSSVMAADDNAITDGKVTFNGKVIAPACTLVAATKDSVVTLPNVSATKL--QTN 67 +V T+ + A N GK T K P +V+A + + N +A QT Sbjct: 598 FNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTK 657 Query: 68 GAVSGVKTDVPIALEGCDVTVTKNATFTFSGTADGVQPTAFANQATTDAATNVALQM--Y 125 +++ +K D A+ +T Q F + + Y Sbjct: 658 ASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGY 717 Query: 126 LPDGSTSVTPGTETSNIQLADSAEQTVTFKVDYIATGKATSGNVNAVTN 174 TS TPG + +++D A +V++ T GN+ V Sbjct: 718 AKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIVGT 766
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 756 bits (1953), Expect = 0.0 Identities = 386/789 (48%), Positives = 526/789 (66%), Gaps = 17/789 (2%) Query: 15 GVPSFIGGLVVFVSAAFNAQAETWFDPAFFKDDPSMVADLSRFEKGQKITPGVYRVDIVL 74 G + F + A + AE +F+P F DDP VADLSRFE GQ++ PG YRVDI L Sbjct: 25 GFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYL 84 Query: 75 NQTIVDTRNVNFVEITPEKGIAACLTTESLDAMGVNTDAFPAFKQLDKQACVPLAEIIPD 134 N + TR+V F E+GI CLT L +MG+NT + L ACVPL +I D Sbjct: 85 NNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHD 144 Query: 135 ASVTFNVNKLRLEISVPQIAIKSNARGYVPPERWDEGINALLLGYSFSGANSIHSSADSD 194 A+ +V + RL +++PQ + + ARGY+PPE WD GINA LL Y+FSG + + + Sbjct: 145 ATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNS 204 Query: 195 SGDSYFLNLNSGVNLGPWRLRNNSTWSR-----SSGQTAEWKNLSSYLQRAVIPLKGELT 249 +LNL SG+N+G WRLR+N+TWS SSG +W++++++L+R +IPL+ LT Sbjct: 205 --HYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLT 262 Query: 250 VGDDYTAGDFFDSVSFRGVQLASDDNMLPDSLKGFAPVVRGIAKSNAQITIKQNGYTIYQ 309 +GD YT GD FD ++FRG QLASDDNMLPDS +GFAPV+ GIA+ AQ+TIKQNGY IY Sbjct: 263 LGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYN 322 Query: 310 TYVSHGAFEISDLYSTSSSGDLLVEIKEADGSVNSYSVPFSSVPLLQRQGRIKYAVTLAK 369 + V G F I+D+Y+ +SGDL V IKEADGS ++VP+SSVPLLQR+G +Y++T + Sbjct: 323 STVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGE 382 Query: 370 YRTNSNEQQESKFAQATLQWGGPWGTTWYGGGQYAEYYRAAMFGLGFNLGDFGAISFDAT 429 YR+ + +Q++ +F Q+TL G P G T YGG Q A+ YRA FG+G N+G GA+S D T Sbjct: 383 YRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMT 442 Query: 430 QAKSTLADQSEHKGQSYRFLYAKTLNQLGTNFQLMGYRYSTSGFYTLSDTMYKHMDGY-- 487 QA STL D S+H GQS RFLY K+LN+ GTN QL+GYRYSTSG++ +DT Y M+GY Sbjct: 443 QANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNI 502 Query: 488 EFNDGDDEDTPMWSRYYNLFYTKRGKLQVNISQQLGEYGSFYLSGSQQTYWHTDQQDRLL 547 E DG + P ++ YYNL Y KRGKLQ+ ++QQLG + YLSGS QTYW T D Sbjct: 503 ETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQF 562 Query: 548 QFGYNTQIKDLSLGVSWNYSKSRGQPDADQVFALNFSLPLNLLLPRSNDSYTRKKNYAWM 607 Q G NT +D++ +S++ +K+ Q DQ+ ALN ++P + L + S R +A Sbjct: 563 QAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWR---HASA 619 Query: 608 TSNTSIDNEGHITQNLGLTETLLDDGNLSYSVQQGYNSEGKTANGS---ASMDYKGAFAD 664 + + S D G +T G+ TLL+D NLSYSVQ GY G +GS A+++Y+G + + Sbjct: 620 SYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGN 679 Query: 665 ARVGYNYSDNGSQQQLNYALSGSLVAHSQGITLGQSLGETNVLIAAPGAENTRVANSTGL 724 A +GY++SD+ +QL Y +SG ++AH+ G+TLGQ L +T VL+ APGA++ +V N TG+ Sbjct: 680 ANIGYSHSDD--IKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGV 737 Query: 725 KTDWRGYTVVPYATSYRENRIALDAASLKRNVDLENAVVNVVPTKGALVLAEFNAHAGAR 784 +TDWRGY V+PYAT YRENR+ALD +L NVDL+NAV NVVPT+GA+V AEF A G + Sbjct: 738 RTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIK 797 Query: 785 VLMKTSKQG 793 +LM + Sbjct: 798 LLMTLTHNN 806
>CLENTEROTOXN#Clostridium enterotoxin signature. Length = 319 Score = 31.9 bits (72), Expect = 0.004 Identities = 13/48 (27%), Positives = 22/48 (45%) Query: 295 VGVVVTDSQNNIISPAGGTLPLSIPDDADSIARMNVYPVSTTGVPPET 342 + V TD + I+ A T L++ D +S N+Y ++ P T Sbjct: 188 LTVPSTDIEKEILDLAAATERLNLTDALNSNPAGNLYDWRSSNSYPWT 235
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 41.9 bits (98), Expect = 4e-06 Identities = 17/49 (34%), Positives = 29/49 (59%) Query: 353 TLTNGALEASNVDLSKELVNMIVAQRNYQSNAQTIKTQDQILNTLVNLR 401 L+N S V+L +E N+ Q+ Y +NAQ ++T + I + L+N+R Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546 Score = 37.2 bits (86), Expect = 9e-05 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 4/56 (7%) Query: 6 AVSGLNAAATNLDVIGNNIANSATYGFKSGTASFAD----MFAGSKVGLGVKVAGI 57 A+SGLNAA L+ NNI++ G+ T A + AG VG GV V+G+ Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 43.8 bits (103), Expect = 4e-07 Identities = 18/81 (22%), Positives = 36/81 (44%), Gaps = 14/81 (17%) Query: 3 SSLWIAKTGLDAQQTNMDVIANNLANVSTNGFKRQRAVFEDLLYQTIRQPGAQSSEQTTL 62 S + A +GL+A Q ++ +NN+++ + G+ RQ + + +TL Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTI--------------MAQANSTL 47 Query: 63 PSGLQIGTGVRPVATERLHSQ 83 +G +G GV +R + Sbjct: 48 GAGGWVGNGVYVSGVQREYDA 68 Score = 41.1 bits (96), Expect = 3e-06 Identities = 11/41 (26%), Positives = 21/41 (51%) Query: 220 ETSNVNVAEELVNMIQVQRAYEINSKAVSTTDQMLQKLTQL 260 S VN+ EE N+ + Q+ Y N++ + T + + L + Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 348 bits (894), Expect = e-125 Identities = 231/232 (99%), Positives = 232/232 (100%) Query: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY Sbjct: 1 MQKNAAHTYAISSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINY 60 Query: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA Sbjct: 61 GYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNA 120 Query: 121 RADVEASGGNTFNGKGGANASNTFNGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180 RADVEASGGNTFNGKGGANASNTF+GTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF Sbjct: 121 RADVEASGGNTFNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRF 180 Query: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM Sbjct: 181 SGVVNPRTISGSNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSPM 232
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 427 bits (1100), Expect = e-152 Identities = 157/363 (43%), Positives = 213/363 (58%), Gaps = 9/363 (2%) Query: 4 FLSALILLLVTTAAQAERIRDLTSVQGVRQNSLIGYGLVVGLDGTGDQTTQTPFTTQTLN 63 F + L A RI+D+ S+Q R N LIGYGLVVGL GTGD +PFT Q++ Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72 Query: 64 NMLSQLGITVPTGTNMQLKNVAAVMVTASLPPFGRQGQTIDVVVSSMGNAKSLRGGTLLM 123 ML LGIT G + KN+AAVMVTA+LPPF G +DV VSS+G+A SLRGG L+M Sbjct: 73 AMLQNLGITTQGGQS-NAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIM 131 Query: 124 TPLKGVDSQVYALAQGNILVGGAGASAGGSSVQVNQLNGGRITNGAVIERELPSQFGVGN 183 T L G D Q+YA+AQG ++V G A +++ R+ NGA+IERELPS+F Sbjct: 132 TSLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSV 191 Query: 184 TLNLQLNDEDFSMAQQIADTINRVR----GYGSATALDARTIQVRVPSGNSSQVRFLADI 239 L LQL + DFS A ++AD +N G A D++ I V+ P + R +A+I Sbjct: 192 NLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRV-ADLTRLMAEI 250 Query: 240 QNMQVNVTPQDAKVVINSRTGSVVMNREVTLDSCAVAQGNLSVTVNRQANVSQPDTPFGG 299 +N+ V T AKVVIN RTG++V+ +V + AV+ G L+V V V QP PF Sbjct: 251 ENLTVE-TDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQP-APFSR 308 Query: 300 GQTVVTPQTQIDLRQSGGSLQSVRSSASLNNVVRALNALGATPMDLMSILQSMQSAGCLR 359 GQT V PQT I Q G + ++ L +V LN++G +++ILQ ++SAG L+ Sbjct: 309 GQTAVQPQTDIMAMQEGSKV-AIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQ 367 Query: 360 AKL 362 A+L Sbjct: 368 AEL 370
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 511 bits (1318), Expect = 0.0 Identities = 312/313 (99%), Positives = 312/313 (99%) Query: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG Sbjct: 1 MISDSKLLASAAWDAQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDG 60 Query: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEEPTPAAPMKFPLET 120 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEE TPAAPMKFPLET Sbjct: 61 LFSSEHTRLYTSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLET 120 Query: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL Sbjct: 121 VVRYQNQALSQLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQAAL 180 Query: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL Sbjct: 181 ESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSSYL 240 Query: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK Sbjct: 241 EALSDYVGLLTRNPRYAAVTTAASAEQGAQALQDAGYATDPHYARKLTNMIQQMKSISDK 300 Query: 301 VSKTYSMNIDNLF 313 VSKTYSMNIDNLF Sbjct: 301 VSKTYSMNIDNLF 313
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 679 bits (1753), Expect = 0.0 Identities = 542/546 (99%), Positives = 543/546 (99%) Query: 2 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVNVS 61 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGV VS Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60 Query: 62 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 121 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120 Query: 122 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 181 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180 Query: 182 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 241 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240 Query: 242 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 301 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQL 300 Query: 302 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAILQNTKNKGDVAIGATVTDASGVLATD 361 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPA+LQNTKNKGDVAIGATVTDAS VLATD Sbjct: 301 ALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATD 360 Query: 362 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 421 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV Sbjct: 361 YKISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAIV 420 Query: 422 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 481 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN Sbjct: 421 NMDVLITDEAKIAMASEEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGN 480 Query: 482 KTATLKTSSTTQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 541 KTATLKTSS TQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD Sbjct: 481 KTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFD 540 Query: 542 ALINIR 547 ALINIR Sbjct: 541 ALINIR 546
>FLAGELLIN#Flagellin signature. Length = 507 Score = 45.4 bits (107), Expect = 2e-07 Identities = 41/226 (18%), Positives = 80/226 (35%), Gaps = 9/226 (3%) Query: 7 MMYQQNMRGITNSQAEWMKYGEQMSTGKRVVNPSDDPIAASQAVVLSQAQAQNSQYTLAR 66 ++ Q N+ +S + + E++S+G R+ + DD + A + +Q + Sbjct: 11 LLTQNNLNKSQSSLSSAI---ERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNA 67 Query: 67 TFATQKVSLEESVLSQVTTAIQNAQEKIVYASNGTLSDDDRASLATDIQGLRDQLLNLAN 126 E L+++ +Q +E V A+NGT SD D S+ +IQ +++ ++N Sbjct: 68 NDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSN 127 Query: 127 TTDGNGRYIFAGYKTETAPFSEANGDYVGGTESIKQQVDASRSMVIGHTGDKIFDSITSN 186 T NG + + +G E+I + +G G + + Sbjct: 128 QTQFNGVKVLSQDNQMKIQVGANDG------ETITIDLQKIDVKSLGLDGFNVNGPKEAT 181 Query: 187 AVAEPDGSASETNLFAMLDSAIAALKTPVADSEADKETAAAALDKT 232 + T A + + TA DK Sbjct: 182 VGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKV 227
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 65.9 bits (160), Expect = 7e-13 Identities = 47/261 (18%), Positives = 84/261 (32%), Gaps = 26/261 (9%) Query: 551 VAPAPKAAPATPATPAQPGLLSRFFGALKALFSGSEETKPTEQP-APKAEAKPERQQDRR 609 P + S E + E P P A A P Sbjct: 991 TVDTTNITTPNNIQADVPSVPSN----------NEEIARVDEAPVPPPAPATPSETT--- 1037 Query: 610 KPRQNNRRDRNERRDTRSERTEGSDNREENRRNRRQAQQQTAETRESRQQAEV------T 663 N ++++++ D E +NR A++ + + + Q EV T Sbjct: 1038 -----ETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092 Query: 664 EKARTTDEQQAPRRERSRRRNDDKRQAQQEAKALNVEEQSVQETEQEERVRPVQPRRKQR 723 ++ +TT+ ++ E+ + + + Q+ K + + QE + + + R Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPK-VTSQVSPKQEQSETVQPQAEPARENDP 1151 Query: 724 QLNQKVRYEQSVAEEAVVAPVVEETAAAEPIVQEAPAPRTELVKVPLPVVAQTAPEQQEE 783 +N K Q+ P E ++ E V E+ T V P A Q Sbjct: 1152 TVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTV 1211 Query: 784 NNADNRDNGGMPRRSRRSPRH 804 N+ + RRS RS H Sbjct: 1212 NSESSNKPKNRHRRSVRSVPH 1232 Score = 43.9 bits (103), Expect = 4e-06 Identities = 56/338 (16%), Positives = 101/338 (29%), Gaps = 54/338 (15%) Query: 513 PSEEEFAERKRPEQPALATFAMPDVPPAPT-PAEPAAPVVAPAPKAAPATPAT------- 564 P E+ + DVP P+ E A AP P APATP+ Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042 Query: 565 ---------------PAQPGLLSRFFG------------ALKALFSGSE-------ETKP 590 + +R + SGSE ETK Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102 Query: 591 TEQPAPKAEAKPERQQDRRKPR-QNNRRDRNERRDTRSERTEGSDNREENRRNRRQAQQQ 649 T + +AK E ++ + P+ + + E+ +T + E + + N ++ Q Q Sbjct: 1103 TATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTV-NIKEPQSQ 1161 Query: 650 TAETRESRQQAEVTEKARTTDEQQAPRRERSRRRNDDKRQAQQEAKALNVEEQSVQETEQ 709 T T ++ Q A+ T + EQ N + A + E+ Sbjct: 1162 TNTTADTEQPAKET---SSNVEQPVTESTTVNTGNSVVENPENTTPA-TTQPTVNSESSN 1217 Query: 710 EERVRPVQPRRKQRQLNQKVRYEQSVAEEAVVAPVVEETAAAEPIVQEAPA-PRTELVKV 768 + + R + R + + + + A + +A A + + V Sbjct: 1218 KPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAV--LSDARAKAQFVALNV 1275 Query: 769 PLPV---VAQTAPEQQEENNADNRDNGGMPRRSRRSPR 803 V ++Q + + N + S R Sbjct: 1276 GKAVSQHISQLEMNNEGQYNVWVSNTSMNKNYSSSQYR 1313 Score = 37.4 bits (86), Expect = 4e-04 Identities = 36/224 (16%), Positives = 64/224 (28%), Gaps = 19/224 (8%) Query: 515 EEEFAERKRPEQPALATFAMPDVPPAPTPAEPAAPVVAPAPKAAPA-----------TPA 563 E+E + E+ V P +E P PA + P T A Sbjct: 1107 EKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTA 1166 Query: 564 TPAQPGLLSRFFGALKALFSGSEETKPTEQPAPKAEAKPERQQDRRKPRQNNRRDRNERR 623 QP + S + T + P+ Q N ++R+ RR Sbjct: 1167 DTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRH-RR 1225 Query: 624 DTRSERT--EGSDNREENRRNRRQAQQQTAETRESRQQAEVTEKARTTDEQQAPRRERSR 681 RS E + +R + T A + + +A + S+ Sbjct: 1226 SVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQ 1285 Query: 682 RRNDDKRQAQQEAKALNVEEQSVQETEQEERVRPVQPRRKQRQL 725 ++ + + V S+ + + R + Q QL Sbjct: 1286 LEMNN-----EGQYNVWVSNTSMNKNYSSSQYRRFSSKSTQTQL 1324
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 93.2 bits (231), Expect = 2e-21 Identities = 139/662 (20%), Positives = 228/662 (34%), Gaps = 107/662 (16%) Query: 137 DVDITTHGDNAHAIAARQGTVSFNQGEIYTTGPDAATAKIYNGGTVTLKNTSAVAHQGSG 196 D + + +V Q + AA ++ G VT+ S A G+ Sbjct: 285 PGGFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAA-IRVGRGARVTVSGGSLSAPHGNV 343 Query: 197 IVLESSIN--GQEATVDILSGSSLRSANEILYHKNETSNVTITDSEVSSAADVFINNIKG 254 I + Q A + I + + + L ++ V +T ++ AD + + Sbjct: 344 IETGGARRFAPQAAPLSITLQAGAHAQGKALLYRVLPEPVKLT---LTGGADAQGDIVAT 400 Query: 255 HLTVDATNSKITGSANISTDVN----THTYLSLS-DNSTWDIKADSTVSNLTV--DNSTV 307 L S +++ T SLS DN+TW + +S V L + D S Sbjct: 401 ELPSIPGTSIGPLDVALASQARWTGATRAVDSLSIDNATWVMTDNSNVGALRLASDGSVD 460 Query: 308 YISRADGRDVEPTRLTITENYVGNNGVLHLRTELGDDNSATDKVVIDGNTSGTTRVKVTN 367 + A+ + +T N + +G+ + D +DK+V+ + SG R+ V N Sbjct: 461 FQQPAEAGRFK----VLTVNTLAGSGLFRMNVFA--DLGLSDKLVVMQDASGQHRLWVRN 514 Query: 368 AGGSGAYTLNGIEIISVEGESNGEFI---KDSRIFAGAYEYSLTRGNTEATNKNWYLTNF 424 +G S + N + ++ S F KD ++ G Y Y L N W L Sbjct: 515 SG-SEPASANTLLLVQTPLGSAATFTLANKDGKVDIGTYRYRLA----ANGNGQWSLVGA 569 Query: 425 QA-----------TSGGETNSGGSSAPTVAPTPVLRLEAGSYVA----NLAAANTLFVMR 469 +A + AP P L A + A + A+TL+ Sbjct: 570 KAPPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNTGGVGLASTLWYAE 629 Query: 470 LN---DRAGEMRYIDPVTEQERSSRLWLRQIGGHNAWRDSNGQLRTTSHRY-------VS 519 N R GE+R ++P GG AW Q + +R V+ Sbjct: 630 SNALSKRLGELR-LNPDA-------------GG--AWGRGFAQRQQLDNRAGRRFDQKVA 673 Query: 520 QLGAELLTGGFTDSDSWRLGVMAGYARDYNSTHSSVSDYRSKGSVRGYCAGLYATWFADD 579 W LG +AGY R + G G YAT+ AD Sbjct: 674 GFELGADHAVAVAGGRWHLGGLAGYTR----GDRGFTGDGG-GHTDSVHVGGYATYIADS 728 Query: 580 ISKKGAYIDAWAQYSWFKN----------SVKGDELAYESYSAKGATVSLEAGYGFALNK 629 G Y+DA + S +N +VKG Y G SLEAG F Sbjct: 729 ----GFYLDATLRASRLENDFKVAGSDGYAVKGK------YRTHGVGASLEAGRRFTHAD 778 Query: 630 SFGLEAAKYTWIFQPQAQAIWMGVDHNAHTEANGSRIENDANNNIQTRLGFRTFIRTQEK 689 W +PQA+ A+ ANG R+ ++ +++ RLG R + Sbjct: 779 G---------WFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRIELA 829 Query: 690 NSGPHGDDFEPFVEMNWIHNSK-DFAVSMNGVKVEQDGASNLGEIKLGVNGNLNPAASVW 748 G +P+++ + + V NG+ + E+ LG+ L S++ Sbjct: 830 G----GRQVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLY 885 Query: 749 GN 750 + Sbjct: 886 AS 887
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 40.8 bits (95), Expect = 3e-07 Identities = 27/101 (26%), Positives = 44/101 (43%), Gaps = 1/101 (0%) Query: 8 TRSIYRELGATLSYNMRLGNGMEIEPCLKAAVRKEFVDDNRVKVNSDGNFVNDLSGRRGI 67 S+ LG + + L G +++P +KA+V +EF V N + +L G R Sbjct: 811 GSSVLGRLGLEVGKRIELAGGRQVQPYIKASVLQEFDGAGTVHTNGIAH-RTELRGTRAE 869 Query: 68 YQAGIKASFSSTLSGHFGVGYSHGAGVESPWNAVAGVNWSF 108 G+ A+ S + YS G + PW AG +S+ Sbjct: 870 LGLGMAAALGRGHSLYASYEYSKGPKLAMPWTFHAGYRYSW 910
>SECA#SecA protein signature. Length = 901 Score = 57.2 bits (138), Expect = 2e-12 Identities = 16/28 (57%), Positives = 20/28 (71%) Query: 125 IDGTRPQFGRNDPCPCGSGKKFKKCCGQ 152 + GRNDPCPCGSGKK+K+C G+ Sbjct: 872 AQTGERKVGRNDPCPCGSGKKYKQCHGR 899
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 90.3 bits (224), Expect = 3e-22 Identities = 40/152 (26%), Positives = 64/152 (42%), Gaps = 3/152 (1%) Query: 10 ILIVEDEPVFRSLLDSWFSSLGATTVLAADGVDALELLGGFTPDLMICDIAMPRMNGLKL 69 IL+ +D+ R++L+ S G + ++ + DL++ D+ MP N L Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 70 LEHIRNRGDQTPVLVISATENMADIAKALRLGVEDVLLKPVKDLNRLREMVFACLYPSMF 129 L I+ PVLV+SA KA G D L KP DL L ++ L + Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGRAL--AEP 122 Query: 130 NSRVEEEERLFRDWDAMVDNPAAAAKLLQELQ 161 R + E +D +V AA ++ + L Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 30.6 bits (69), Expect = 0.009 Identities = 9/16 (56%), Positives = 11/16 (68%) Query: 55 VVGESGCGKSTFARAI 70 + GESG GK ARA+ Sbjct: 165 ITGESGTGKELVARAL 180
>adhesinmafb#Neisseria meningitidis: adhesin MafB signature. Length = 467 Score = 32.0 bits (72), Expect = 5e-04 Identities = 16/57 (28%), Positives = 20/57 (35%), Gaps = 2/57 (3%) Query: 73 GPMPAVDSNDPGAAGFTGSTVIAEFESLEAAQAWADADPYVAAGVYEHVSVKPFKKV 129 P+PA G GS E + EA W +P A V +V KV Sbjct: 268 APLPA--EGKFAVIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAVFNVAAAAKV 322
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 256 bits (654), Expect = 2e-88 Identities = 236/239 (98%), Positives = 237/239 (99%) Query: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVAPADLEPPQA 60 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMV PADLEPPQA Sbjct: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA 60 Query: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDIKPVESR 120 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRD+KPVESR Sbjct: 61 VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESR 120 Query: 121 PASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 180 PASPFENTAPAR TSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF Sbjct: 121 PASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKF 180 Query: 181 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 239 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ Sbjct: 181 DVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTEIQ 239
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 342 bits (880), Expect = e-118 Identities = 126/341 (36%), Positives = 182/341 (53%), Gaps = 23/341 (6%) Query: 6 DNLLGEANSFLEVLEQVSHLAPLDKPVLIIGERGTGKELIASRLHYLSSRWQGPFISLNC 65 L+G + + E+ ++ L D ++I GE GTGKEL+A LH R GPF+++N Sbjct: 137 MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINM 196 Query: 66 AALNENLLDSELFGHEAGAFTGAQKRHPGRFERADGGTLFLDELATAPMMVQEKLLRVIE 125 AA+ +L++SELFGHE GAFTGAQ R GRFE+A+GGTLFLDE+ PM Q +LLRV++ Sbjct: 197 AAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQ 256 Query: 126 YGELERVGGSQPLQVNVRLVCATNADLPAMVNEGTFRADLLDRLAFDVVQLPPLRERESD 185 GE VGG P++ +VR+V ATN DL +N+G FR DL RL ++LPPLR+R D Sbjct: 257 QGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAED 316 Query: 186 IMLMAEHFAIQMCREIKLPLFPGFTERARETLLNYRWPGNIRELKNVVERSVYRHGTSDY 245 I + HF Q +E F + A E + + WPGN+REL+N+V R + Sbjct: 317 IPDLVRHFVQQAEKEGLDVK--RFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVI 374 Query: 246 PLDDIIID---PFKRRPPEDAIAVSETTSLPTLPLD------------------LREFQM 284 + I + P E A A S + S+ + Sbjct: 375 TREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLA 434 Query: 285 QQEKELLQLSLQQGKYNQKRAAELLGLTYHQFRALLKKHQI 325 + E L+ +L + NQ +AA+LLGL + R +++ + Sbjct: 435 EMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 32.0 bits (72), Expect = 5e-04 Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 7/93 (7%) Query: 16 LFTATLQAADVTITVNGRVVAKPCTIQT-KEANVNLGDLYTRNLQQPGSASGWHNITLSL 74 L T+ ADV I + G V PCTI + V+ G++ N + ++ G +S+ Sbjct: 11 LLTSVAVLADVQINIRGNVYIPPCTINNGQNIVVDFGNI---NPEHVDNSRGEVTKNISI 67 Query: 75 TDCPVETSAVTAIVTGSTDNTGYYKNEGTAENI 107 + CP ++ ++ VTG+T G +N A NI Sbjct: 68 S-CPYKSGSLWIKVTGNTMGVG--QNNVLATNI 97
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 942 bits (2436), Expect = 0.0 Identities = 498/869 (57%), Positives = 653/869 (75%), Gaps = 10/869 (1%) Query: 15 QVLILPRFARLTIALGLATAVFPVDAEYYFNPRFLSNDLAESVDLSAFTKGREAPPGTYR 74 + + F RL +A A AE YFNPRFL++D DLS F G+E PPGTYR Sbjct: 20 KHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYR 79 Query: 75 VDIYLNDEFMTSRDITFIADDNNADLIPCLSTDLLVSLGIKKSALLDNKEHSAEKHVPDN 134 VDIYLN+ +M +RD+TF D+ ++PCL+ L S+G+ +++ + ++ + Sbjct: 80 VDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASV-------SGMNLLAD 132 Query: 135 SACTPLQDRLADASTEFDVGQQHLSLSVPQIYVGRMARGYVSPELWEEGINAGLLNYSFN 194 AC PL + DA+ + DVGQQ L+L++PQ ++ ARGY+ PELW+ GINAGLLNY+F+ Sbjct: 133 DACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFS 192 Query: 195 GNSINNRSNHNAGKSNYAYLNLQSGINIGSWRLRDNSTWSYNSGSSNSSNSNKWQHINTS 254 GNS N G S+YAYLNLQSG+NIG+WRLRDN+TWSYNS S+S + NKWQHINT Sbjct: 193 GNS---VQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTW 249 Query: 255 AERDIIPLRSRLTVGDSYTDGDIFDSVNFRGLKINSTEAMLPDSQHGFAPVIHGIARGTA 314 ERDIIPLRSRLT+GD YT GDIFD +NFRG ++ S + MLPDSQ GFAPVIHGIARGTA Sbjct: 250 LERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTA 309 Query: 315 QVSVKQNGYDVYQTTVPPGPFTIDDINSAANGGDLQVTIKEADGSIQTLYVPYSSVPVLQ 374 QV++KQNGYD+Y +TVPPGPFTI+DI +A N GDLQVTIKEADGS Q VPYSSVP+LQ Sbjct: 310 QVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQ 369 Query: 375 RAGYTRYALAMGEYRSGNNLQSSPKFIQGSLMHGLEGNWTPYGGMQIAEDYQAFNLGIGK 434 R G+TRY++ GEYRSGN Q P+F Q +L+HGL WT YGG Q+A+ Y+AFN GIGK Sbjct: 370 REGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGK 429 Query: 435 DLGLFGAFSFDITQANTTLADGTRHSGQSVKSVYSKSFYQTGTNIQVAGYRYSTQGFYNL 494 ++G GA S D+TQAN+TL D ++H GQSV+ +Y+KS ++GTNIQ+ GYRYST G++N Sbjct: 430 NMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNF 489 Query: 495 SDSAYSRMSGYTVKPPTGDTNEQTQFIDYFNLFYSKRGQEQISISQQLGNYGTTFFSASR 554 +D+ YSRM+GY ++ G + +F DY+NL Y+KRG+ Q++++QQLG T + S S Sbjct: 490 ADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSH 549 Query: 555 QSYWNTSRSDQQISFGLNVPFGDITTSLNYSYSNNIWQNDRDHLLAFTLNVPFSHWMRTD 614 Q+YW TS D+Q GLN F DI +L+YS + N WQ RD +LA +N+PFSHW+R+D Sbjct: 550 QTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSD 609 Query: 615 SQSAFRNSNASYSMSNDLKGGMTNLSGVYGTLLPDNNLNYSVQVGNTHGGNTSSGTSGYS 674 S+S +R+++ASYSMS+DL G MTNL+GVYGTLL DNNL+YSVQ G GG+ +SG++GY+ Sbjct: 610 SKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYA 669 Query: 675 SLNYRGAYGNTNVGYSRNGDSSQIYYGMSGGIIAHADGITFGQPLGDTMVLVKAPGADNV 734 +LNYRG YGN N+GYS + D Q+YYG+SGG++AHA+G+T GQPL DT+VLVKAPGA + Sbjct: 670 TLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDA 729 Query: 735 KIENQTGIHTDWRGYAILPFATEYRENRVALNANSLADNVELDETVVTVIPTHGAIARAT 794 K+ENQTG+ TDWRGYA+LP+ATEYRENRVAL+ N+LADNV+LD V V+PT GAI RA Sbjct: 730 KVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAE 789 Query: 795 FNAQIGGKVLMTLKYGNKSVPFGAIVTHGENKNGSIVAENGQVYLTGLPQSGKLQVSWGK 854 F A++G K+LMTL + NK +PFGA+VT +++ IVA+NGQVYL+G+P +GK+QV WG+ Sbjct: 790 FKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGE 849 Query: 855 DKNSNCIVEYKLPEVSPGTLLNQQTAICR 883 ++N++C+ Y+LP S LL Q +A CR Sbjct: 850 EENAHCVANYQLPPESQQQLLTQLSAECR 878
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 53.3 bits (128), Expect = 7e-10 Identities = 41/192 (21%), Positives = 83/192 (43%), Gaps = 8/192 (4%) Query: 36 LSDIAQSFHMQTAQVGIMLTIYAWVVALMSLPFMLMTSQVERRKLLICLFVVFIASHVLS 95 L DIA F+ A + T + ++ + + ++ Q+ ++LL+ ++ V+ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 96 FLSWS-FTVLVISRIGVAFAHAIFWSITASLAIRMAPAGKRAQALSLIATGTALAMVLGL 154 F+ S F++L+++R A F ++ + R P R +A LI + A+ +G Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 155 PLGRIVGQYFGWRMTFFAIGIGALITLLCLIKLLPLLPSEHSGSLKSLPLLFRRPALMSI 214 +G ++ Y W I + +IT+ L+KLL + LMS+ Sbjct: 157 AIGGMIAHYIHWSY-LLLIPMITIITVPFLMKLLK------KEVRIKGHFDIKGIILMSV 209 Query: 215 YLLTVVVVTAHY 226 ++ ++ T Y Sbjct: 210 GIVFFMLFTTSY 221
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.8 bits (67), Expect = 0.031 Identities = 13/50 (26%), Positives = 22/50 (44%), Gaps = 5/50 (10%) Query: 33 QRLELLENELSQNKQELKATQNELGVYKSRLSTLQKSITENKYKSASLAE 82 + +LE E + NEL VYKS+L ++ I K + + + Sbjct: 250 AKHAVLEQE-----NKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.5 bits (100), Expect = 2e-06 Identities = 42/239 (17%), Positives = 82/239 (34%), Gaps = 18/239 (7%) Query: 7 RSTSALLASSLLLTIGRGATLPFMTIYLSRQYSLSVDLI---GYAMTIALTIGVVFSLGF 63 R +L++ L +G G +P + L R S D+ G + + + + Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLL-RDLVHSNDVTAHYGILLALYALMQFACAPVL 63 Query: 64 GILADKFDKKRYMLLAITAFASGFIAIPLVNNVTLVVLFFALINCAYSVFATVLKAWFAD 123 G L+D+F ++ +L+++ A + + + ++ + + + A A+ AD Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIAD 122 Query: 124 NLSSTSKTKIFSINYTMLNIGWTIGPPLGTLLVMQSINLPFWLAAICSAFPMLFIQIWVK 183 + + F G GP LG L+ S + PF+ AA + L + Sbjct: 123 ITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182 Query: 184 RSEK---------IIATETGSVWSPKVLLQDKALLWFTCSGFLASFVSGAFASCISQYV 233 S K + W + A L F+ V A+ + Sbjct: 183 ESHKGERRPLRREALNPLASFRW--ARGMTVVAALMAV--FFIMQLVGQVPAALWVIFG 237
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 100 bits (249), Expect = 2e-27 Identities = 70/244 (28%), Positives = 114/244 (46%), Gaps = 16/244 (6%) Query: 2 IVLVTGATAGFGECITRRFIQQGHKVIATGRRQERLQELKDELGDNLYIAQ---LDVRNR 58 I +TGA G GE + R QG + A E+L+++ L A+ DVR+ Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69 Query: 59 AAIEEMLASLPAEWCNIDILVNNAGLALGMEPAHKASVEDWETMIDTNNKGLVYMTRAVL 118 AAI+E+ A + E IDILVN AG+ L H S E+WE N+ G+ +R+V Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 119 PGMVERNHGHIINIGSTAGSWPYAGGNVYGATKAFVRQFSLNLRTDLHGTAVRVTDIEPG 178 M++R G I+ +GS P Y ++KA F+ L +L +R + PG Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188 Query: 179 LVGGTEFSNVRFKGDDGKAE------KTYQNTVALT----PEDVSEAV-WWVSTLPAHVN 227 T+ + ++G + +T++ + L P D+++AV + VS H+ Sbjct: 189 ST-ETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 228 INTL 231 ++ L Sbjct: 248 MHNL 251
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 48.0 bits (114), Expect = 4e-08 Identities = 33/118 (27%), Positives = 55/118 (46%), Gaps = 16/118 (13%) Query: 44 VGAFIFGKMGDRIGRKKVLFITITMMGICTTLIGVLPTYAQIGVFAPILLVTLRIIQGLG 103 +G ++GK+ D++G K++L I + + + V ++ + + A R IQG G Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMA-------RFIQGAG 116 Query: 104 AGAEISGAGTMLAEYAPKGKR----GIISSFVAMGTNCGTLSATAI-----WAFMFFI 152 A A + ++A Y PK R G+I S VAMG G I W+++ I Sbjct: 117 AAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI 174
>ENTEROVIROMP#Enterobacterial virulence outer membrane protein signature. Length = 171 Score = 139 bits (352), Expect = 3e-44 Identities = 64/201 (31%), Positives = 102/201 (50%), Gaps = 32/201 (15%) Query: 1 MRK-VCAAILSAAICLAVSGAPAWASEQQATLSAGYLHARTNASGSDNLNGINVKYRYEF 59 M+K C + L+A LA + + A+ +T++ GY + + + G N+KYRYE Sbjct: 1 MKKIACLSALAA--VLAFTAGTSVAA--TSTVTGGYAQSDAQGQMNK-MGGFNLKYRYEE 55 Query: 60 TDT-LGLITSFSYANAEDEQKTHYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMAG 118 ++ LG+I SF+Y T S T D +N+++ + AGP+ R+N+W S Y + G Sbjct: 56 DNSPLGVIGSFTY--------TEKSRTASSGDYNKNQYYGITAGPAYRINDWASIYGVVG 107 Query: 119 MAYSRVSTFSGDYLRVTDNKGKKHDVLTGSDDGRHSNTSLAWGAGVQFNPTESVAIDVAY 178 + Y + T T+ KHD S+ ++GAG+QFNP E+VA+D +Y Sbjct: 108 VGYGKFQT--------TEYPTYKHD---------TSDYGFSYGAGLQFNPMENVALDFSY 150 Query: 179 EGSGSGDWRTDGFIVAVGYKF 199 E S +I VGY+F Sbjct: 151 EQSRIRSVDVGTWIAGVGYRF 171
>cloacin#Cloacin signature. Length = 551 Score = 41.6 bits (97), Expect = 2e-05 Identities = 33/142 (23%), Positives = 61/142 (42%), Gaps = 4/142 (2%) Query: 519 DQQRLNDLQEKKRQKDLQDAK--EQAERNYQEQQKRRNAENAALNRMNETEAARHQREIA 576 DQ + +E +RQ++ E AERNY+ + N N + R E +A Q + Sbjct: 294 DQVKQRQDEENRRQQEWDATHPVEAAERNYERARAELNQANEDVARNQERQAKAVQVYNS 353 Query: 577 RINAMQYADQAVRDA-AIQRENERYEKALASGKKKTRETRNDEATRLLLQYSQQQAQVEG 635 R + + A++ + DA A ++ R+ +G + + +A R + +QA + Sbjct: 354 RKSELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRAQTDVNNKQAAFDA 413 Query: 636 QIAAARQSAGIATERMTEAHKQ 657 A + A A E+ K+ Sbjct: 414 -AAKEKSDADAALSSAMESRKK 434
>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature. Length = 52 Score = 64.1 bits (156), Expect = 4e-18 Identities = 19/46 (41%), Positives = 32/46 (69%) Query: 23 QKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEP 68 + +++ ++++CLT+++ +TRK LCE+R R G EVA F AYE Sbjct: 5 RSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYES 50
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 119 bits (301), Expect = 3e-39 Identities = 34/89 (38%), Positives = 55/89 (61%) Query: 4 TKAEMSEYLFDKLGLSKRDAKELVELFFEEIRRALENGEQVKLSGFGNFDLRDKNQRPGR 63 K ++ + + L+K+D+ V+ F + L GE+V+L GFGNF++R++ R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 64 NPKTGEDIPITARRVVTFRPGQKLKSRVE 92 NP+TGE+I I A +V F+ G+ LK V+ Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>INVEPROTEIN#Salmonella/Shigella invasion protein E (InvE) signature. Length = 372 Score = 29.3 bits (65), Expect = 0.020 Identities = 18/81 (22%), Positives = 34/81 (41%), Gaps = 13/81 (16%) Query: 158 ETTSALHTYFNVGDIAKVSVSGLGDCFIDKVNDAKED-----------VLTDGIQTFPDR 206 E ++AL + N D K S S L + F ++V + + V ++ F + Sbjct: 57 EMSAALAQFRNRRDYEKKS-SNLSNSF-ERVLEDEALPKAKQILKLISVHGGALEDFLRQ 114 Query: 207 TDRVYLNPQDCSVINDEALNR 227 ++ +P D ++ E L R Sbjct: 115 ARSLFPDPSDLVLVLRELLRR 135
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 27.9 bits (62), Expect = 0.006 Identities = 24/88 (27%), Positives = 32/88 (36%), Gaps = 12/88 (13%) Query: 1 MRLLILTLSLITLAGCTVTRQAHVSEVDAATGIVRLVYNQAFLQHAHTDRYVSRGIADRA 60 +RL L L L GC + D G R +YN D Y+ R +A Sbjct: 3 LRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYN---FNFNVLDPYIVRPVA--- 56 Query: 61 CQQEGYTHAIPFGQPVGN-CSLFAGSLC 87 + +P QP N S F G+L Sbjct: 57 ---VAWRDYVP--QPARNGLSNFTGNLE 79
>SECA#SecA protein signature. Length = 901 Score = 60.3 bits (146), Expect = 7e-13 Identities = 27/70 (38%), Positives = 31/70 (44%), Gaps = 5/70 (7%) Query: 155 RVEKMSPEAFEESVDAIRLAALDLH---AYWMAHPQEKAVQQPI--KAEEKPGRNDPCPC 209 +V+ PE EE R+ A L A E K GRNDPCPC Sbjct: 828 KVQVRMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPC 887 Query: 210 GSGKKFKQCC 219 GSGKK+KQC Sbjct: 888 GSGKKYKQCH 897
>PF01206#SirA family protein Length = 76 Score = 92.5 bits (230), Expect = 7e-29 Identities = 16/71 (22%), Positives = 37/71 (52%) Query: 7 DYRLDMVGEPCPYPAVATLEAMPQLKKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQ 66 D LD G CP P + + + + GE+L V++ P S+ + ++ G+ +L+ ++ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 67 DGPTIRYLIQK 77 + T + +++ Sbjct: 65 EDGTYHFRLKR 75
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 117 bits (294), Expect = 5e-38 Identities = 103/103 (100%), Positives = 103/103 (100%) Query: 2 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 61 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL Sbjct: 1 SAIQGIEGVISQLQATAMSARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTL 60 Query: 62 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 104 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV Sbjct: 61 GEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 754 bits (1948), Expect = 0.0 Identities = 478/555 (86%), Positives = 514/555 (92%), Gaps = 5/555 (0%) Query: 3 ATAAQTKSLEWLNRLRANPKIPLIVAGSAAVAVMVALILWAKAPDYRTLFSNLSDQDGGA 62 +TA Q K LEWLNRLRANP+IPLIVAGSAAVA++VA++LWAK PDYRTLFSNLSDQDGGA Sbjct: 5 STATQPKPLEWLNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGA 64 Query: 63 IVSQLTQMNIPYRFSEASGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 122 IV+QLTQMNIPYRF+ SGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ Sbjct: 65 IVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQ 124 Query: 123 FSEQVNYQRALEGELSRTIETIGPVKGARVHLAMPKPSLFVREQKSPSASVTVNLLPGRA 182 FSEQVNYQRALEGEL+RTIET+GPVK ARVHLAMPKPSLFVREQKSPSASVTV L PGRA Sbjct: 125 FSEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRA 184 Query: 183 LDEGQISAIVHLVSSAVAGLPPGNVTLVDQGGHLLTQSNTSGRDLNDAQLKYASDVEGRI 242 LDEGQISA+VHLVSSAVAGLPPGNVTLVDQ GHLLTQSNTSGRDLNDAQLK+A+DVE RI Sbjct: 185 LDEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRI 244 Query: 243 QRRIEAILSPIVGNGNIHAQVTAQLDFASKEQTEEQYRPNGDESHAALRSRQLNESEQSG 302 QRRIEAILSPIVGNGN+HAQVTAQLDFA+KEQTEE Y PNGD S A LRSRQLN SEQ G Sbjct: 245 QRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVG 304 Query: 303 SGYPGGVPGALSNQPAPANNAPISTPPANQNNRQQ--QASTTSNS---GPRSTQRNETSN 357 +GYPGGVPGALSNQPAP N API+TPP NQ N Q Q ST++NS GPRSTQRNETSN Sbjct: 305 AGYPGGVPGALSNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSN 364 Query: 358 YEVDRTIRHTKMNVGDVQRLSVAVVVNYKTLPDGKPLPLSNEQMKQIEDLTLEAMGFSEK 417 YEVDRTIRHTKMNVGD++RLSVAVVVNYKTL DGKPLPL+ +QMKQIEDLT EAMGFS+K Sbjct: 365 YEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKPLPLTADQMKQIEDLTREAMGFSDK 424 Query: 418 RGDSLNVVNSPFNSSDESGGELPFWQQQAFIDQLLAAGRWLLVLLVAWLLWRKAVRPQLT 477 RGD+LNVVNSPF++ D +GGELPFWQQQ+FIDQLLAAGRWLLVL+VAW+LWRKAVRPQLT Sbjct: 425 RGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLT 484 Query: 478 RRAEAMKAVQQQAQAREEVEDAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 537 RR E KA Q+QAQ R+E E+AVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR Sbjct: 485 RRVEEAKAAQEQAQVRQETEEAVEVRLSKDEQLQQRRANQRLGAEVMSQRIREMSDNDPR 544 Query: 538 VVALVIRQWINNDHE 552 VVALVIRQW++NDHE Sbjct: 545 VVALVIRQWMSNDHE 559
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 340 bits (874), Expect = e-119 Identities = 117/329 (35%), Positives = 197/329 (59%), Gaps = 2/329 (0%) Query: 1 MSNLTGTDKSVILLMTIGEDRAAEVFKHLSQREVQTLSAAMANVTQISNKQLTDVLAEFE 60 +S LTG K+ ILL++IG + +++VFK+LSQ E+++L+ +A + I+++ +VL EF+ Sbjct: 12 VSALTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFK 71 Query: 61 QEAEQFAALNINANDYLRSVLVKALGEERAASLLEDILETRDTASGIETLNFMEPQSAAD 120 + + DY R +L K+LG ++A ++ + L + + E + +P + + Sbjct: 72 ELMMAQEFIQKGGIDYARELLEKSLGTQKAVDIINN-LGSALQSRPFEFVRRADPANILN 130 Query: 121 LIRDEHPQIIATILVHLKRAQAADILTLFDERLRHDVMLRIATFGGVQPAALAELTEVLN 180 I+ EHPQ IA IL +L +A+ IL+ ++ +V RIA P + E+ VL Sbjct: 131 FIQQEHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLE 190 Query: 181 GLLDGQ-NLKRSKMGGVRTAAEIINLMKTQQEEAVITAVREFDGELAQKIIDEMFLFENL 239 L + + GGV EIIN+ + E+ +I ++ E D ELA++I +MF+FE++ Sbjct: 191 KKLASLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDI 250 Query: 240 VDVDDRSIQRLLQEVDSESLLIALKGAEQPLREKFLRNMSQRAADILRDDLANRGPVRLS 299 V +DDRSIQR+L+E+D + L ALK + P++EK +NMS+RAA +L++D+ GP R Sbjct: 251 VLLDDRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRK 310 Query: 300 QVENEQKAILLIVRRLAETGEMVIGSGED 328 VE Q+ I+ ++R+L E GE+VI G + Sbjct: 311 DVEESQQKIVSLIRKLEEQGEIVISRGGE 339
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 376 bits (965), Expect = e-136 Identities = 227/228 (99%), Positives = 228/228 (100%) Query: 1 MSDNLPWKTWTPDDLAPPQAEFVPMVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60 MSDNLPWKTWTPDDLAPPQAEFVP+VEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI Sbjct: 1 MSDNLPWKTWTPDDLAPPQAEFVPIVEPEETIIEEAEPSLEQQLAQLQMQAHEQGYQAGI 60 Query: 61 AEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120 AEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL Sbjct: 61 AEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRL 120 Query: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT Sbjct: 121 MQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGAT 180 Query: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV Sbjct: 181 LSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATRWQELCRLAAPGVV 228
>FLGFLIJ#Flagellar FliJ protein signature. Length = 147 Score = 200 bits (509), Expect = 2e-69 Identities = 145/147 (98%), Positives = 146/147 (99%) Query: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 Query: 61 MTSNRWINYQQFIQTLEKAITQHRQQLNQWKQKVDIALNSWREKKQRLQAWQTLQERQST 120 +TSNRWINYQQFIQTLEKAITQHRQQLNQW QKVDIALNSWREKKQRLQAWQTLQERQST Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120 Query: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147 AALLAENRLDQKKMDEFAQRAAMRKPE Sbjct: 121 AALLAENRLDQKKMDEFAQRAAMRKPE 147
>FLGHOOKFLIK#Flagellar hook-length control protein signature. Length = 375 Score = 463 bits (1193), Expect = e-166 Identities = 362/375 (96%), Positives = 367/375 (97%) Query: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK Sbjct: 1 MIRLAPLITADVDTTTLPGGKASDAAQDFLALLSEALAGETTTDKAAPQLLVATDKPTTK 60 Query: 61 GEPLVSEILADAQQADLLIPVDETPPVINDEQSTSTPLTTAQTMTLAAVAGNNTAKDEKA 120 GEPL+S+I++DAQQA+LLIPVDETPPVINDEQSTSTPLTTAQTM LAAVA NT KDEKA Sbjct: 61 GEPLISDIVSDAQQANLLIPVDETPPVINDEQSTSTPLTTAQTMALAAVADKNTTKDEKA 120 Query: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDVPSTVLPAEKPTLFTKLTSAQLTTAQPDDAP 180 DDLNEDVTASLSALFAMLPGFDNTPKVTD PSTVLP EKPTLFTKLTS QLTTAQPDDAP Sbjct: 121 DDLNEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAP 180 Query: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW Sbjct: 181 GTPAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEW 240 Query: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA 300 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA Sbjct: 241 QQSLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAA 300 Query: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS Sbjct: 301 LPVLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVPVS 360 Query: 361 LQVRVTGNSGVDIFA 375 LQ RVTGNSGVDIFA Sbjct: 361 LQGRVTGNSGVDIFA 375
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 381 bits (979), Expect = e-135 Identities = 85/324 (26%), Positives = 148/324 (45%), Gaps = 10/324 (3%) Query: 5 ILSQAEIDALLNGDS--EVKDEPTASVSGESDIRPYDPNTQRRVVRERLQALEIINERFA 62 +LSQ EID LL S + E +S I YD + +E+++ L +++E FA Sbjct: 4 VLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETFA 63 Query: 63 RHFRMGLFNLLRRSPDITVGAIRIQPYHEFARNLPVPTNLNLIHLKPLRGTGLVVFSPSL 122 R L LR + V ++ Y EF R++P P+ L +I + PL+G ++ PS+ Sbjct: 64 RLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPSI 123 Query: 123 VFIAVDNLFGGDGRFPTKVEGREFTHTEQRVINRMLKLALEGYSDAWKAINPLEVEYVRS 182 F +D LFGG G+ KV+ R+ T E V+ ++ L ++W + L + Sbjct: 124 TFSIIDRLFGGTGQ-AAKVQ-RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQI 181 Query: 183 EMQVKFTNITTSPNDIVVNTPFHVEIGNLTGEFNICLPFSMIEPLRELLVNPPLENS--R 240 E +F I P+++VV ++G G N C+P+ IEP+ L + +S R Sbjct: 182 ETNPQFAQI-VPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240 Query: 241 NEDQNWRDNLVRQVQHSQLELVANFADISLRLSQILKLKPGDVLPIEKP---DRIIAHVD 297 + + L ++ +++VA + L + IL L+ GD++ + D + + Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300 Query: 298 GVPVLTSQYGTLNGQYALRIEHLI 321 Q G + + A +I I Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 212 bits (542), Expect = 1e-74 Identities = 125/137 (91%), Positives = 134/137 (97%) Query: 1 MSDMNNPADDNNGAMDDLWAEALSEQKSTSSKSAADAVFQQFGGGDVSGTLQDIDLIMDI 60 MSDMNNP+D+N GA+DDLWA+AL+EQK+T++KSAADAVFQQ GGGDVSG +QDIDLIMDI Sbjct: 1 MSDMNNPSDENTGALDDLWADALNEQKATTTKSAADAVFQQLGGGDVSGAMQDIDLIMDI 60 Query: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV Sbjct: 61 PVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGV 120 Query: 121 RITDIITPSERMRRLSR 137 RITDIITPSERMRRLSR Sbjct: 121 RITDIITPSERMRRLSR 137
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 334 bits (858), Expect = e-119 Identities = 243/245 (99%), Positives = 245/245 (100%) Query: 1 MRRLLSVAPVLLWLVTPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60 MRRLLSVAPVLLWL+TPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM Sbjct: 1 MRRLLSVAPVLLWLITPLAFAQLPGITSQPLPGGGQSWSLPVQTLVFITSLTFIPAILLM 60 Query: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEK 120 Query: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANSGPLQGPEAVPMRILLPAYVTSELK 180 ISMQEALEKGAQPLREFMLRQTREADLGLFARLAN+GPLQGPEAVPMRILLPAYVTSELK Sbjct: 121 ISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELK 180 Query: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA Sbjct: 181 TAFQIGFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLA 240 Query: 241 QSFYS 245 QSFYS Sbjct: 241 QSFYS 245
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 67.1 bits (164), Expect = 1e-18 Identities = 22/78 (28%), Positives = 42/78 (53%) Query: 4 ESVMMMGTEAMKVALALAAPLLLVALVTGLIISILQAATQINEMTLSFIPKIIAVFIAII 63 + ++ G +A+ + L L+ +VA + GL++ + Q TQ+ E TL F K++ V + + Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 IAGPWMLNLLLDYVRTLF 81 + W +LL Y R + Sbjct: 62 LLSGWYGEVLLSYGRQVI 79
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 201 bits (514), Expect = 1e-66 Identities = 258/261 (98%), Positives = 261/261 (100%) Query: 1 MLQVTSDQWLSWLSLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 MLQVTS+QWLSWL+LYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 Query: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120 Query: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180 Query: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 Query: 241 EHLFSEMFNLLADIISELPLI 261 EHLFSE+FNLLADIISELPLI Sbjct: 241 EHLFSEIFNLLADIISELPLI 261
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.047 Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 15/62 (24%) Query: 320 AKYILTPVLWKYLYRYAKKHQARGNGFGYGMVYPNNPQSVTRTLSARYYKDGAEILIDRG 379 A+Y + PVLW Y+ R+ K + G+ VY +R +DG+E RG Sbjct: 166 ARYQVGPVLWGYVVRFIK---SDGDKLTLPYVY------------SRSQRDGSEAWKWRG 210 Query: 380 WD 381 WD Sbjct: 211 WD 212
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 34.4 bits (79), Expect = 2e-04 Identities = 22/92 (23%), Positives = 36/92 (39%), Gaps = 9/92 (9%) Query: 37 AQKLAADDDVDMLVILTACYFHDIVSLAKNHPQRQRSSILAAEETRRLLREEFEQFPA-- 94 +KLA + + D+ +ILT + +L + Q + EE R+ E F A Sbjct: 219 GEKLAEEVNADIFMILTDV---NGAALYYGTEKEQWLREVKVEELRKYYEE--GHFKAGS 273 Query: 95 --EKIEAVCHAIAAHSFSAQIAPLTTEAKIVQ 124 K+ A I A IA L + ++ Sbjct: 274 MGPKVLAAIRFIEWGGERAIIAHLEKAVEALE 305
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 540 bits (1392), Expect = 0.0 Identities = 263/402 (65%), Positives = 302/402 (75%), Gaps = 26/402 (6%) Query: 1 MKRKVLAMLVPALLVAGAANAAEVYNKDGNKLDLYGKVVGLHYFSDDSGSDGDMSYARIG 60 MKRKVLA+++PALL AGAA+AAE+YNKDGNKLDLYGKV GLHYFSDDS DGD +Y R+G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 FKGETQIADQFTGYGQWEFNIGANGPESDKGNTATRLAFAGLGFGQNGTFDYGRNYGVVY 120 FKGETQI DQ TGYGQWE+N+ AN E + N+ TRLAFAGL FG G+FDYGRNYGV+Y Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 121 DVEAWTDMLPEFGGDTYAGADNFMNGRANGVATYRNNGFFGQVDGLNFALQYQGNNEDDG 180 DVE WTDMLPEFGGD+Y ADN+M GRANGVATYRN FFG VDGLNFALQYQG NE Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 181 LFGQEGSGNG--NGRKLAKENGDGFGMSTSYDFDFGLSLGAAYSNSDRTDNQVHKGLHNI 238 N NG + +NGDGFG+ST+YD G S GAAY+ SDRT+ QV+ G Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGTI- 239 Query: 239 RYGDRFDATAGGETAEAWTVGAKYDANNVYLAAMYAETRNMTGYGDADA-----IANKTQ 293 AGG+ A+AWT G KYDANN+YLA MY+ETRNMT YG D +ANKTQ Sbjct: 240 ---------AGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQ 290 Query: 294 NFEVVAQYQFDFGLRPSIAYLQSKGKDLGGVNSDNFDSQGNHHYTNKDLVKYVDIGMTYY 353 NFEV AQYQFDFGLRP++++L SKGKDL N + +KDLVKY D+G TYY Sbjct: 291 NFEVTAQYQFDFGLRPAVSFLMSKGKDLTY---------NNVNGDDKDLVKYADVGATYY 341 Query: 354 FNKNMSTYVDYKINLLDNDDDFYKENGIATDDIVAVGLVYQF 395 FNKN STYVDYKINLLD+DD FYK+ GI+TDDIVA+G+VYQF Sbjct: 342 FNKNFSTYVDYKINLLDDDDPFYKDAGISTDDIVALGMVYQF 383
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 84.5 bits (209), Expect = 9e-21 Identities = 30/117 (25%), Positives = 60/117 (51%), Gaps = 1/117 (0%) Query: 39 KILLIEDNQRTQEWVTQGLSEAGYVIDAVSDGRDGLYLALKDDYALIILDIMLPGMDGWQ 98 IL+ +D+ + + Q LS AGY + S+ D L++ D+++P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 99 ILQTLRTA-KQTPVICLTARDSVDDRVRGLDSGANDYLVKPFSFSELLARVRAQLRQ 154 +L ++ A PV+ ++A+++ ++ + GA DYL KPF +EL+ + L + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>INTIMIN#Intimin signature. Length = 939 Score = 699 bits (1805), Expect = 0.0 Identities = 222/790 (28%), Positives = 351/790 (44%), Gaps = 70/790 (8%) Query: 148 QQIASTSQQIGSLLAEDMNSEQAANMARGWASSQASGAMTDWLSRFGTARITLGVDEDFS 207 QQ AS Q+ S +N + A + A G A +QAS + WL +GTA + L +F Sbjct: 168 QQAASLGSQLQS---RSLNGDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGNNFD 224 Query: 208 LKNSQFDFLHPWYETPDNLFFSQHTLHRTDERTQINNGLGWRHFTPTWLSGINFFFDHDL 267 S DFL P+Y++ L F Q D R N G G R F P + G N F D D Sbjct: 225 --GSSLDFLLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDF 282 Query: 268 SRYHSRAGIGAEYWRDYLKLSSNGYLRLTNWRSAPELDNDYEARPANGWDVRAEGWLPAW 327 S ++R GIG EYWRDY K S NGY R++ W + DY+ RPANG+D+R G+LP++ Sbjct: 283 SGDNTRLGIGGEYWRDYFKSSVNGYFRMSGWHESYN-KKDYDERPANGFDIRFNGYLPSY 341 Query: 328 PHLGGKLVYEQYYGDEVALFDKDDRQSNPHAITAGLNYTPFPLMTFSAEQRQGKQGENDT 387 P LG KL+YEQYYGD VALF+ D QSNP A T G+NYTP PL+T + R G END Sbjct: 342 PALGAKLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDL 401 Query: 388 RFAVDFTWQPGSAMQKQLDPNEVAARRSLAGSRYDLVDRNNNIVLEYRKKELVRLTLTDP 447 +++ F +Q +Q++P V R+L+GSRYDLV RNNNI+LEY+K++++ L + Sbjct: 402 LYSMQFRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPHD 461 Query: 448 VTGKSGEVKSLVSSLQTKYALKGYNFEATALEAAGGKVVTTG----KDILVTLPAYRFTS 503 + G + + +++KY L ++ +AL + GG++ +G +D LPAY Sbjct: 462 INGTERSTQKIQLIVKSKYGLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAY---- 517 Query: 504 TPETDNTWPIEVTAEDVKGNFSNREQ-SMVVVQAPTLSQKDSSVSLSTQTLSADSHSTAT 562 N + + A D GN SN ++ V+ + + + SA + T Sbjct: 518 VQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEA 577 Query: 563 LTFIAH------DAAGNPVIGLVLSTRHEGVQDITLSDWKDNGDGSYTQILTTGAMSGTL 616 +T+ A A PV ++S G ++ + NG G T L + + Sbjct: 578 ITYTATVKKNGVAQANVPVSFNIVS----GTAVLSANSANTNGSGKATVTLKSDKPGQVV 633 Query: 617 TLMPQLNGVDAAKAPAVVNIISVSSSRTHSSIKIDKDRYLSGNPIEVTVELR-DKNDKPV 675 A A AV I + + + IK DK ++ +T ++ K DKPV Sbjct: 634 VSAKTAEMTSALNANAV--IFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPV 691 Query: 676 KEQKQQLNTAVSIDNVKPGVTTDWKETADGVYKATYTAYTKGSGL-TAKLLMQNWNEDLH 734 Q+ T + K +T+ K +G K T T+ T G L +A++ + Sbjct: 692 SNQEVTFTTTLG----KLSNSTE-KTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAP 746 Query: 735 TAGFIIDANPQSAKIATLSASNNGVLANENAANTVSVNVADEGSNPINDHTVTFAVLSGS 794 F I + G L + + ++ Sbjct: 747 EVEFFTTLTIDDGNIEIVGTGVKGKLPTV---------------------WLQYGQVNLK 785 Query: 795 ATSFNNQNTAKTDVNGLATFDLKSSK---QEDNTVEVTLENGVKQTLIISFVGDSSTAQV 851 A+ N + T ++ +A+ D S + +E T +++ + QT ++ + + + Sbjct: 786 ASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQT--ATYTIATPNSLI 843 Query: 852 DLQKSKNEVVADGNDSATMTATVRDAKGNLLNDVKVTF----------NVNSAAAKLSQT 901 SK D ++ + N L +V + + + + + QT Sbjct: 844 VPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAWGAANKYEYYKSSQTIISWVQQT 903 Query: 902 EVNSHDGSAT 911 ++ G A+ Sbjct: 904 AQDAKSGVAS 913 Score = 158 bits (401), Expect = 7e-41 Identities = 98/389 (25%), Positives = 158/389 (40%), Gaps = 44/389 (11%) Query: 1126 TAEAILLNGNRD-----TKIVNIAPDASNAQVTLNITAQQV--VTNNSDSVQLTATVK-- 1176 TA A NGN T V + + TA + + ++++ TATVK Sbjct: 528 TARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKN 587 Query: 1177 --DPSNHPVAGITVNFTIPQDVAANFTLENNGIAITQANGEAHVTLKGKKAGTHTVTA-T 1233 +N PV+ V+ T ++AN A T +G+A VTLK K G V+A T Sbjct: 588 GVAQANVPVSFNIVSGT--AVLSAN-------SANTNGSGKATVTLKSDKPGQVVVSAKT 638 Query: 1234 LGNNNASDAQPVTFVADKDSAVVVLQTSKAEIIGNGVDETTLTATVKDPFDNVVKDLPVT 1293 +A +A V FV +++ ++ K + NG D T T V D V + VT Sbjct: 639 AEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKG-DKPVSNQEVT 697 Query: 1294 FSTNPADTQLSQSTSNTNDSGVAEVTLKGTVLGVHTVEATLLNGNGYTTTVNIAPDASNA 1353 F+T +LS ST T+ +G A+VTL T G V A + + V Sbjct: 698 FTTTLG--KLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVA---VDVKAPEVEFFT 752 Query: 1354 QVTLNIPAQQVVTNNSDSVQLTAMVK-DPSNHPVAGITVNFTMPQDVAANFTLENNGIAI 1412 +T++ ++V T ++ N +G +T + N IA Sbjct: 753 TLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNGKYT--------WRSANPAIAS 804 Query: 1413 TQANGEAHVTLKGKKAGTHTVTATLGNNNTSDSQPVTFVADKASAQVVLQISKDEITGNG 1472 A+ VTLK K GT T++ +SD+Q T+ ++ +V +SK + Sbjct: 805 VDAS-SGQVTLKEK--GTTTISVI-----SSDNQTATYTIATPNSLIVPNMSKRVTYNDA 856 Query: 1473 VDSATLTATVKDQFDNEVNNLPVTFSSAS 1501 V++ NE+ N+ + +A+ Sbjct: 857 VNTCKNFGGKLPSSQNELENVFKAWGAAN 885 Score = 103 bits (258), Expect = 6e-24 Identities = 82/400 (20%), Positives = 144/400 (36%), Gaps = 42/400 (10%) Query: 699 WKETADGVYKATYTAYTKGSGLTAKLLMQNWNEDLHTAGFIIDANPQ---SAKIATLSAS 755 + + VYK T AY + + +L+ + +N Q + +A Sbjct: 517 YVQGGSNVYKVTARAYDRNGNSSNNVLLT----------ITVLSNGQVVDQVGVTDFTAD 566 Query: 756 NNGVLANENAANTVSVNVADEGSNPINDHTVTFAVLSGSATSFNNQNTAKTDVNGLATFD 815 A+ A T + V G N V+F ++SG+A + N+A T+ +G AT Sbjct: 567 KTSAKADGTEAITYTATVKKNGVAQANV-PVSFNIVSGTAVL--SANSANTNGSGKATVT 623 Query: 816 LKSSKQEDNTVEVTLENGVK--QTLIISFVGDSSTAQVDLQKSKNEVVADGNDSATMTAT 873 LKS K V + FV + + +++ K VA+G D +T T Sbjct: 624 LKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQD--AITYT 681 Query: 874 VRDAK-GNLLNDVKVTFNVNSAAAKLSQTEV-NSHDGSATATLTSLKNGDYTVTASVSSG 931 V+ K +++ +VTF + KLS + +G A TLTS G V+A VS Sbjct: 682 VKVMKGDKPVSNQEVTFT--TTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDV 739 Query: 932 SQANQQVNFIGDQSTAALTFSVPSGDITVTNTAPLHMTATLQDKNGNPLKDKEITFSVPN 991 + + + LT + +I T T LQ + + Sbjct: 740 AVDVKAPEV---EFFTTLTIDDGNIEIVGTGVKGKLPTVWLQ-------YGQVNLKASGG 789 Query: 992 DVASRFSISNSGKGMTDSNGTAIASLTGTLAGTHMITARLANSNVSDTQPMTFVADKDRA 1051 + + +N D++ +T GT I+ SD Q T+ + Sbjct: 790 NGKYTWRSANPAIASVDASS---GQVTLKEKGTTTISVI-----SSDNQTATYTIATPNS 841 Query: 1052 VVVLQTSRAEIIGNGVDETTLTATVKDPFDNVVKNLSVVF 1091 ++V S+ + V+ N ++N+ + Sbjct: 842 LIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAW 881 Score = 101 bits (253), Expect = 2e-23 Identities = 70/371 (18%), Positives = 121/371 (32%), Gaps = 30/371 (8%) Query: 1030 RLANSNVSDTQPMTFVADK---DRAVVV-LQTSRAEIIGNGVDETTLTATVKDPFDNVVK 1085 R NS+ + +T +++ D+ V + +G + T TATVK Sbjct: 534 RNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKN-GVAQA 592 Query: 1086 NLSVVFRTSPADTQLSLNARNTNENGITEVTLKGTVLGVHTAEAILLNGNR--DTKIVNI 1143 N+ V F LS N+ NTN +G VTLK G A + V Sbjct: 593 NVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALNANAVIF 652 Query: 1144 APDASNAQVTLNITAQQVVTNNSDSVQLTATVKDPSNHPVAGITVNFTIPQDVAANFTLE 1203 + + V N D++ T V + PV+ V FT Sbjct: 653 VDQTKASITEIKADKTTAVANGQDAITYTVKVMK-GDKPVSNQEVTFT------TTLGKL 705 Query: 1204 NNGIAITQANGEAHVTLKGKKAGTHTVTATLGNNNASDAQPVTFVADKDSAVVVLQTSKA 1263 +N T NG A VTL G V+A + + P + + + Sbjct: 706 SNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEV----EFFTTLTIDDGNI 761 Query: 1264 EIIGNGVDETTLTATVK-DPFDNVVKDLPVTFSTNPADTQLSQSTSNTNDSGVAEVTLKG 1322 EI+G GV T ++ +L + + + + D+ +VTLK Sbjct: 762 EIVGTGVKGKLPTVWLQYGQV-----NLKASGGNGKYTWRSANPAIASVDASSGQVTLKE 816 Query: 1323 TVLGVHTVEATLLNGNGYTTTVNIAPDASNAQVTLNIPAQQVVTNNSDSVQLTAMVKDPS 1382 G T+ + T N+ + N+ + + ++ + S Sbjct: 817 K--GTTTISVI----SSDNQTATYTIATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSS 870 Query: 1383 NHPVAGITVNF 1393 + + + + Sbjct: 871 QNELENVFKAW 881 Score = 85.9 bits (212), Expect = 1e-18 Identities = 86/394 (21%), Positives = 124/394 (31%), Gaps = 47/394 (11%) Query: 1428 AGTHTVTATLGNNNTSDSQPVTFVADKASAQVVLQIS--------KDEITGNGVDSATLT 1479 + + VTA + N + S V S V+ K +G ++ T T Sbjct: 522 SNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITYT 581 Query: 1480 ATVKDQFDNEVNNLPVTFSSASSGLTLTPGVSNTNESGIAQATLAGVAFGEKTVTASLAN 1539 ATVK + N PV+F+ S L+ +NTN SG A TL G+ V+A A Sbjct: 582 ATVKKNGVAQANV-PVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAE 640 Query: 1540 NGASDNKTVHFIGDTAAAKIIELTPVPDSIIAGTPQNSSGSVITATV-VDNNGFPVKGVT 1598 ++ N D A I E+ + +A IT TV V PV Sbjct: 641 MTSALNANAVIFVDQTKASITEIKADKTTAVANGQ-----DAITYTVKVMKGDKPVSNQE 695 Query: 1599 VNFTSNAATAEMTNGGQAVTNEQGKATVTYT-NTRSSIESGARPDTVEASLENGSSTLST 1657 V FT+ + T+ G A VT T T AR V ++ T Sbjct: 696 VTFTTTLGKLSNS---TEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFT 752 Query: 1658 SINVNADASTAHLTLLQALFDTVSAGETTSLYIEVKNNYGNGVPQQEVTLSVSPSEGVPP 1717 ++ ++ V +P V L Sbjct: 753 TLTIDDGNIEI-----------------------VGTGVKGKLPT--VWLQYGQVNLKAS 787 Query: 1718 SNNAIYTTNHDGNFYASFTATKAGV---YQLTATLENGDSMQQTVTYVPNVANAEITLAA 1774 N YT AS A+ V + T T+ S QT TY N+ I Sbjct: 788 GGNGKYTWRSANPAIASVDASSGQVTLKEKGTTTISVISSDNQTATYTIATPNSLIVPNM 847 Query: 1775 SKDPVIADNNDLTTLTATVADTEGNAIANTEVTF 1808 SK D + + N + N + Sbjct: 848 SKRVTYNDAVNTCKNFGGKLPSSQNELENVFKAW 881 Score = 79.3 bits (195), Expect = 2e-16 Identities = 88/466 (18%), Positives = 166/466 (35%), Gaps = 38/466 (8%) Query: 2219 SGGKVRTNSSGQA--------PVVLTSNKVGTYTVTASFHNGVT----IQTQTTVKVTGN 2266 GG+++ + S A V + V T A NG + + T T + Sbjct: 495 QGGQIQHSGSQSAQDYQAILPAYVQGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQV 554 Query: 2267 SSTAHVASFIADPSTIAATNSDLSTLKATVEDGSGNLIEGLTVYFALKSGSATLTSLTAV 2326 V F AD ++ A ++ T ATV+ +G + V F + SG+A L++ +A Sbjct: 555 VDQVGVTDFTADKTSAKADGTEAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSAN 613 Query: 2327 TDQNGIATTSVKGAMTGSVTVSAVTTAGGMQTVDITLVAGPADASQSVLKNNRSSLKGDF 2386 T+ +G AT ++K G V VSA TA ++ V S+ + Sbjct: 614 TNGSGKATVTLKSDKPGQVVVSA-KTAEMTSALNANAVIFVDQTKASITEIKADKTTAVA 672 Query: 2387 TDSAELHLVLHDISGNPIKVSEGMEFVQSGTNVPYMKISAIDYSQNINGDYKATITGGGE 2446 + + + G+ ++ + F + + S + NG K T+T Sbjct: 673 NGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKL-----SNSTEKTDTNGYAKVTLTSTTP 727 Query: 2447 GIATLIPVLNGVHQAGLSTTIQFTRAEDKIMSGTVSVNGTDLPTTTFPSQGFTGAYYQLN 2506 G + + ++ V + ++F I G + + GT + P+ L Sbjct: 728 GKSLVSARVSDVAVDVKAPEVEF-FTTLTIDDGNIEIVGTGV-KGKLPTVWLQYGQVNLK 785 Query: 2507 NDNFAPGKTAADYEFSSSASWVDVDAIGKVTFKNVGSNWERITATPKSGGPSYVYEIRVK 2566 + G + ++ A + G+VT K G+ I+ + Sbjct: 786 ---ASGGNGKYTWRSANPAIASVDASSGQVTLKEKGTT--TISVISSDNQTATYTIATPN 840 Query: 2567 SWWVNSGDAFMIYSLAENFCSSNGYTLPRADHLNHSRSRGIGSLYSEWGDMGHYTTEAGF 2626 S V + + Y+ A N C + G LP + + + +++ WG Y Sbjct: 841 SLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNE-------LENVFKAWGAANKYEYYKSS 893 Query: 2627 RSNMYW-----SSSPANSSEQYVVSLATGDQSVFEKLGFAYATCYK 2667 ++ + W + + + Y + ++ AYATC K Sbjct: 894 QTIISWVQQTAQDAKSGVASTYDLVKQNPLNNIKASESNAYATCVK 939 Score = 67.8 bits (165), Expect = 6e-13 Identities = 56/264 (21%), Positives = 88/264 (33%), Gaps = 18/264 (6%) Query: 1698 NGVPQQEVTLSVSPSEGVPPSNNAIYTTNHDGNFYASFTATKAGVYQLTATLENGDSMQQ 1757 NGV Q V +S + G + TN G + + K G ++A S Sbjct: 587 NGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEMTSALN 646 Query: 1758 T--VTYVPNVANAEITLAASKDPVIADNNDLTTLTATVADTEGNAIANTEVTFTLPEDVK 1815 V +V + + A K +A+ D T T V ++N EVTFT Sbjct: 647 ANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKV-MKGDKPVSNQEVTFT------ 699 Query: 1816 ANFTLSDGGKAVTDTEGKAKVTLKGTKAGAHTVTASMAGGKSE--QLVVNFIADTLTAQV 1873 TDT G AKVTL T G V+A ++ + V F Sbjct: 700 TTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDG 759 Query: 1874 NLNVTEDNFIANNVGMTRLQATVTDGNGN-PLANEAVTFTLPADVSASFTLGQGGSAITD 1932 N+ + + V + G N + +T + A ++ S Sbjct: 760 NIEI-----VGTGVKGKLPTVWLQYGQVNLKASGGNGKYTWRSANPAIASV-DASSGQVT 813 Query: 1933 INGKAEVTLSGTKSGSYPVTVSVN 1956 + K T+S S + T ++ Sbjct: 814 LKEKGTTTISVISSDNQTATYTIA 837 Score = 61.6 bits (149), Expect = 4e-11 Identities = 74/359 (20%), Positives = 129/359 (35%), Gaps = 37/359 (10%) Query: 1743 YQLTATLENGDSMQQTVTYVPNVANAEITLAASKDPVIADNNDLT-----------TLTA 1791 Y + + Q + P N TL+ S+ ++ NN++ + Sbjct: 403 YSMQFRYQFDKPWSQQIE--PQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNIPH 460 Query: 1792 TVADTEGNA---IANTEVTFTLPEDVKANFTL-SDGGKAVTDTEGKAK---VTLKGTKAG 1844 + TE + + + L V + L S GG+ A+ L G Sbjct: 461 DINGTERSTQKIQLIVKSKYGLDRIVWDDSALRSQGGQIQHSGSQSAQDYQAILPAYVQG 520 Query: 1845 AH-----TVTASMAGGKS---EQLVVNFIADTLTAQ----VNLNVTEDNFIANNVGMTRL 1892 T A G S L + +++ + + + A+ Sbjct: 521 GSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKADGTEAITY 580 Query: 1893 QATVTDGNGNPLANEAVTFTLPADVSASFTLGQGGSAITDINGKAEVTLSGTKSGSYPVT 1952 ATV NG AN V+F + VS + L SA T+ +GKA VTL K G V+ Sbjct: 581 TATVKK-NGVAQANVPVSFNI---VSGTAVLSAN-SANTNGSGKATVTLKSDKPGQVVVS 635 Query: 1953 VSVNNYGVSDTKQVTLIADAGTAKLASLTSVYSFVVSTTEGATMTASVTDANGNPVEGIK 2012 + + D A + + + + V+ + A PV + Sbjct: 636 AKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKGDKPVSNQE 695 Query: 2013 VSFRGTSVTLSSTSVETDDRGFAEILVTSTEVGLKTVSASLADKPTEVISRLLNASADV 2071 V+F T LS+++ +TD G+A++ +TST G VSA ++D +V + + + Sbjct: 696 VTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTL 754 Score = 56.2 bits (135), Expect = 2e-09 Identities = 44/164 (26%), Positives = 63/164 (38%), Gaps = 5/164 (3%) Query: 2186 TGTTLTATLTSANGTPVEGQVINFSVTPEGATLSGGKVRTNSSGQAPVVLTSNKVGTYTV 2245 T TAT+ NG ++F++ A LS TN SG+A V L S+K G V Sbjct: 576 EAITYTATVKK-NGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVV 634 Query: 2246 TASFHNGVTIQTQTTVKVTGNSSTAHVASFIADPSTIAATNSDLSTLKATVEDGSGNLIE 2305 +A + + + + A + AD +T A D T V G + Sbjct: 635 SAKTAEMTS-ALNANAVIFVDQTKASITEIKADKTTAVANGQDAITYTVKVMKG-DKPVS 692 Query: 2306 GLTVYFALKSGSATLTSLTAVTDQNGIATTSVKGAMTGSVTVSA 2349 V F G + + T TD NG A ++ G VSA Sbjct: 693 NQEVTFTTTLGKLSNS--TEKTDTNGYAKVTLTSTTPGKSLVSA 734 Score = 52.4 bits (125), Expect = 3e-08 Identities = 72/339 (21%), Positives = 113/339 (33%), Gaps = 36/339 (10%) Query: 2016 RGTSVTLSSTSVETDDRGFAEILVTSTEVGLKTVSASLADKPTEVISRLLNASADVNSAT 2075 +G S T+ D G + V T L T+ + +A AD + Sbjct: 519 QGGSNVYKVTARAYDRNGNSSNNVLLTITVLSNGQVVDQVGVTDFTADKTSAKAD-GTEA 577 Query: 2076 ITSLEIPEGQVMVAQDVAVKAHVNDQFGNPVAHQPVTFSAEPPEHMTISQNIVSTDTHGI 2135 IT A V G A+ PV+F+ +S N +T+ G Sbjct: 578 IT----------------YTATVKKN-GVAQANVPVSFNIVSGT-AVLSANSANTNGSGK 619 Query: 2136 AEVSMTPERNGSYMVKASLANGASLEKQLEAI---DEKLTLTASSPLIGVYAPTG-TTLT 2191 A V++ ++ G +V A A S I K ++T G +T Sbjct: 620 ATVTLKSDKPGQVVVSAKTAEMTSALNANAVIFVDQTKASITEIKADKTTAVANGQDAIT 679 Query: 2192 ATL-TSANGTPVEGQVINFSVTPEGATLSGGKVRTNSSGQAPVVLTSNKVGTYTVTASFH 2250 T+ PV Q + F+ T LS +T+++G A V LTS G V+A Sbjct: 680 YTVKVMKGDKPVSNQEVTFTTT--LGKLSNSTEKTDTNGYAKVTLTSTTPGKSLVSARVS 737 Query: 2251 NGVTIQTQTTVKVTGNSSTAHVASFIADPSTIAATNSDLSTLKATVEDGSGNL-IEGLTV 2309 + V+ + I T ++ G NL G Sbjct: 738 DVAVDVKAPEVEFFT------TLTIDDGNIEIVGTGVKGKLPTVWLQYGQVNLKASGGNG 791 Query: 2310 YFALKSGSATLTSLTAVTDQNGIATTSVKGAMTGSVTVS 2348 + +S + ++ +V +G T KG T SV S Sbjct: 792 KYTWRSAN---PAIASVDASSGQVTLKEKGTTTISVISS 827
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 35.6 bits (82), Expect = 4e-04 Identities = 39/259 (15%), Positives = 96/259 (37%), Gaps = 18/259 (6%) Query: 79 LGGVIFGHFGDRLGRKRMLMLTVWMMGIATALIGILPSFSTIGWWAPILLVTLRAIQGFA 138 +G ++G D+LG KR+L+ + + + + + SF ++ I+ ++ A Sbjct: 64 IGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSL----LIMARFIQGAGAAA 119 Query: 139 VGGEWGGAALLSVESAPKNKK-AFYSSGVQVGYGVGLLLSTGLVSLISMMTTDEQFLSWG 197 + + K S V +G GVG + + I Sbjct: 120 FPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI------------H 167 Query: 198 WRIPFLFSIVLVLGALWVRNGMEESAEFEQQQHYQAAAKKRIPVIEALLRHPGAFLKIIA 257 W L ++ ++ ++ +++ + + + ++ +L + + Sbjct: 168 WSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLI 227 Query: 258 LRLCELLTMYIVTAFALNYSTQNMGLPRELFLNIGLLVGGLSCLTIPCFAWLADRFGRRR 317 + + L +++ + + GL + + IG+L GG+ T+ F + + Sbjct: 228 VSVLSFL-IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDV 286 Query: 318 VYITGALIGTLSAFPFFMA 336 ++ A IG++ FP M+ Sbjct: 287 HQLSTAEIGSVIIFPGTMS 305
>cdtoxina#Cytolethal distending toxin A signature. Length = 258 Score = 27.7 bits (61), Expect = 0.017 Identities = 15/61 (24%), Positives = 24/61 (39%), Gaps = 5/61 (8%) Query: 74 VELLPVEITPDEQKEPVAAIAPSLSTSTQTRVSASSCKVEFRHGNMTLENPSPELLTVLI 133 VE P +PDE P+ P+L T+ + ++L N +LT+ Sbjct: 40 VEGGPTVPSPDEPGLPLPGPGPALPTNGAIPIPEPGTAPA-----VSLMNMDGSVLTMWS 94 Query: 134 R 134 R Sbjct: 95 R 95
>PF06580#Sensor histidine kinase Length = 349 Score = 35.2 bits (81), Expect = 4e-04 Identities = 10/78 (12%), Positives = 32/78 (41%), Gaps = 8/78 (10%) Query: 19 QAEALRQKDQQLSLVEETEAFLRSALTRAEEKIEEDEREIEHLRA--QIEKLRRMLFGTR 76 +A L + ++ +R +L + + E+ + + Q+ ++ F Sbjct: 183 RALILEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQ---FE-- 237 Query: 77 SEKLRREVELAEALLKQR 94 ++L+ E ++ A++ + Sbjct: 238 -DRLQFENQINPAIMDVQ 254
>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature. Length = 522 Score = 29.8 bits (66), Expect = 0.028 Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 1/51 (1%) Query: 109 KNKEELLKAQKKALLSKLNK-ELRKGADESETLRQLEVLQKNSAEEPVRYK 158 KN EL+K Q++ L ++ + E + ++ L+Q+E L K AEE VR + Sbjct: 197 KNLSELIKQQRENELDQMERLEDMQEQAQANALKQIEELNKKQAEEAVRQR 247
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 41.6 bits (97), Expect = 2e-05 Identities = 120/616 (19%), Positives = 194/616 (31%), Gaps = 75/616 (12%) Query: 321 TVTGINRLGAFSVVEGKADNVVLENGGRLDVLTGHTATNTRVDDGGTLDVRNGGTATTVS 380 VT + GA + V + + +GG + G A + R Sbjct: 207 NVTAVPASGAPAAVSVLGASELTLDGGH--ITGGRAAGVAAMQGAVVHLQRATIRRGDAP 264 Query: 381 MGNGGVLLADSGAAVSGTRSDGKAFSIGGGQA----DALMLEKGSSFTLNAGDTATDTTV 436 G A G AV G G + G +E S A Sbjct: 265 AGGAVPGGAVPGGAVPGGFGPGGFGPVLDGWYGVDVSGSSVELAQSIVEAPELGAAIRVG 324 Query: 437 NGGLFTARGGTLAGTTTLNNGAILTLSGKTV---NNDTLTIR-EGDALLQGGSLTGNGSV 492 G T GG+L+ G ++ G L+I + A QG +L Sbjct: 325 RGARVTVSGGSLSAPH----GNVIETGGARRFAPQAAPLSITLQAGAHAQGKALLYRVLP 380 Query: 493 EKSGSGTLTVSNTTLTQKAVNLNEGTLTLNDSTVTTDVIAQRGTALKLTGSTVLNGAIDP 552 E LT++ Q + E S DV GA Sbjct: 381 EPV---KLTLTGGADAQGDIVATELPSIPGTSIGPLDVALASQARWT--------GATRA 429 Query: 553 TNVTLASGATWNIPDNATVQSVVDDLSHAGQIHF-TSTRTGKFVPATLKVKNLNGQNGTI 611 + ATW + DN+ V ++ L+ G + F G+F L V L G +G Sbjct: 430 VDSLSIDNATWVMTDNSNVGAL--RLASDGSVDFQQPAEAGRF--KVLTVNTLAG-SGLF 484 Query: 612 SLRVRPDMAQNNADRLVIDGGRATGKTILNLVNAGNSASGLATSGKGIQVVEAINGATTE 671 + V D+ + D+LV+ A+G+ L + N+G+ + T + V + A T Sbjct: 485 RMNVFADLGLS--DKLVVMQD-ASGQHRLWVRNSGSEPASANTL---LLVQTPLGSAATF 538 Query: 672 EGAFVQGNRLQAGAFNYSLNRDSDESWYLRSENAYRAEVPLYASMLTQAMDYDRIVAGSR 731 A ++ G + Y L + + W L A A P Sbjct: 539 TLANK-DGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEAPA 597 Query: 732 SHQTGVNGENNSVRLSIQGGHLGHDNNGGIARG-----------ATPESSGSYG--FVRL 778 + + ++ G +G + A P++ G++G F + Sbjct: 598 PQPPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRLNPDAGGAWGRGFAQR 657 Query: 779 E------GDLMRTEVAG--------MSVTAGVYGAAGHSSVDVKDDDGSRAGTVRDDAGS 824 + G +VAG ++V G + G + D + G D+ Sbjct: 658 QQLDNRAGRRFDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDGGGHTDSVH 717 Query: 825 LGGYLNLVHTSSGLWADIVAQGTRH-------SMKASSDNNDFRARGWGWLGSLETGLPF 877 +GGY + SG + D + +R + +R G G SLE G F Sbjct: 718 VGGYATYI-ADSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGA--SLEAGRRF 774 Query: 878 SITDNLMLEPQLQYTW 893 + D LEPQ + Sbjct: 775 THADGWFLEPQAELAV 790
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 28.6 bits (64), Expect = 0.026 Identities = 25/126 (19%), Positives = 51/126 (40%), Gaps = 16/126 (12%) Query: 95 LVDSALAHRIPRIIFTSSTSVYGDAQG---TVKETT--PRNPVTNSGRVLEELEDWLHNL 149 +++ ++I +++ SS+SVYG + + ++ P + + + E + +L Sbjct: 109 ILEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHL 168 Query: 150 PGTSVDILRLAGLVGP-GRHPGRFF-------AGKTAP---DGEHGVNLVHLEDVIGAIT 198 G LR + GP GR F GK+ G+ + +++D+ AI Sbjct: 169 YGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAII 228 Query: 199 LLLQAP 204 L Sbjct: 229 RLQDVI 234
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 95.2 bits (237), Expect = 2e-24 Identities = 70/334 (20%), Positives = 126/334 (37%), Gaps = 62/334 (18%) Query: 4 NVLLIGASGFVGT----RLLE----------------TAIADFNIKNLDKQQSHFYPEIT 43 L+ GA+GF+G RLLE ++ ++ L + F+ Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHK--- 58 Query: 44 QIGDVRDQQALDQALA--GFDTVVLLAAEH--RDDVSPTSLYYDVNVQGTRNVLAAMEKN 99 D+ D++ + A F+ V + R + Y D N+ G N+L N Sbjct: 59 --IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN 116 Query: 100 GVKNIIFTSSVAVYGLNKHNP-DENHPHD-PFNHYGKSKWQAEEVLREWYNKA---PTER 154 ++++++ SS +VYGLN+ P + D P + Y +K +A E++ Y+ P Sbjct: 117 KIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATK-KANELMAHTYSHLYGLPA-- 173 Query: 155 SLTIIRPTVIFGERNRGN--VYNLLKQIAGGKFMMV-GAGTNYKSMAYVGNIVEFIKYKL 211 T +R ++G R + ++ K + GK + V G + Y+ +I E I Sbjct: 174 --TGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQ 231 Query: 212 KNVA-----------------AGYEVYNYVDKPDLNMNQLVAEVEQSLNKKIPSIHLPYP 254 + A Y VYN + + + + +E +L + LP Sbjct: 232 DVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQ 291 Query: 255 LGMLGGYCFDI--LSKITGKKYAVS-SVRVKKFC 285 G + D L ++ G + VK F Sbjct: 292 PGDVLETSADTKALYEVIGFTPETTVKDGVKNFV 325
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 47.9 bits (114), Expect = 5e-08 Identities = 47/369 (12%), Positives = 106/369 (28%), Gaps = 87/369 (23%) Query: 4 SYKSRWVIVIVVVIAAIAAFWFWQGRNDSQSAAPG-----ATKQAQQSPAGGRRG---MR 55 S + R V ++ IA G+ + + A G + + ++ Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113 Query: 56 SG-------PLA---PVQAATAVEQAVPRYLTGLGTITAANTVTVRSRVDG--QLMALHF 103 G L + A + L T ++ ++ +L Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173 Query: 104 QEGQQVKAGDLLAEI------------DPSQFKVALAQAQGQLA-------KDKATLANA 144 Q V ++L Q ++ L + + + + + Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 145 RRDLARYQQLAKTNLVSRQELDAQQALVSETEGTIKADEASVA----------------- 187 + L + L +++ + Q+ E ++ ++ + Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 188 --------------------------SAQLQLDWSRITAPVDGRV-GLKQVDVGNQISSG 220 + + S I APV +V LK G +++ Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 221 DTTGIVVITQTHPIDLVFTLPESDIATVVQAQKAGKPLVVEAWDRTNSKKL-SEGTLLSL 279 +T +V++ + +++ + DI + Q A + VEA+ T L + ++L Sbjct: 354 ETL-MVIVPEDDTLEVTALVQNKDIGFINVGQNA--IIKVEAFPYTRYGYLVGKVKNINL 410 Query: 280 DNQIDATTG 288 D D G Sbjct: 411 DAIEDQRLG 419
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 916 bits (2370), Expect = 0.0 Identities = 299/1036 (28%), Positives = 513/1036 (49%), Gaps = 29/1036 (2%) Query: 13 SRLFIMRPVATTLLMVAILLAGIIGYRALPVSALPEVDYPTIQVVTLYPGASPDVMTSAV 72 + FI RP+ +L + +++AG + LPV+ P + P + V YPGA + V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVITLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131 T +E+ + L MSS S S G+ ITL FQ D+A+ +VQ + AT LLP + Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 132 LPNPPVYSKVNPADPPIMTLAVTSTAMPMTQVE--DMVETRVAQKISQISGVGLVTLSGG 189 + + S + +M S TQ + D V + V +S+++GVG V L G Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 190 QRPAVRVKLNAQAIAALGLTSETVRTAITGANVNSAKGSLDGP------SRAVTLSANDQ 243 Q A+R+ L+A + LT V + N A G L G ++ A + Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 244 MQSAEEYRQLII-AYQNGAPIRLGDVATVEQGAENSWLGAWANKEQAIVMNVQRQPGANI 302 ++ EE+ ++ + +G+ +RL DVA VE G EN + A N + A + ++ GAN Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 303 ISTADSIRQMLPQLTESLPKSVKVTVLSDRTTNIRASVDDTQFELMMAIALVVMIIYLFL 362 + TA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 363 RNIPATIIPGVAVPLSLIGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422 +N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAITLAVAIL 481 + E P A K +I ++ + L AV IP+ F G G ++R+F+IT+ A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 482 ISAVVSLTLTPMMCARML---SQESLRKQNRFSRASEKMFDRIIAAYGRGLAKVLNHPWL 538 +S +V+L LTP +CA +L S E + F FD + Y + K+L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 539 TLSVALSTLLLSVLLWVFIPKGFFPVQDNGIIQGTLQAPQSSSFANMAQRQRQVADVILQ 598 L + + V+L++ +P F P +D G+ +Q P ++ + QV D L+ Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 599 DPA--VQSLTSFVGVDGTNPSLNSARLQINLKPLDERDDR---VQKVIARLQTAVDKVPG 653 + V+S+ + G + + N+ ++LKP +ER+ + VI R + + K+ Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR- 658 Query: 654 VDLFLQPTQDLTIDTQVSRTQYQFTLQ---ATSLDALSTWVPQLMEKLQQLP-QLSDVSS 709 D F+ P I + T + F L DAL+ QL+ Q P L V Sbjct: 659 -DGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 710 DWQDKGLVAYVNVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEHNTE 769 + + + VD++ A LG+S++D++ + A G ++ + ++ ++ + + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 770 ITPGLAALDTIRLTSSDGGVVPLSSIAKVEQRFAPLSINHLDQFPVTTISFNVPDNYSLG 829 +D + + S++G +VP S+ + + + P I S G Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 830 DAVQAIMDTEKTLNLPVDITTQFQGSTLAFQSALGSTVWLIVAAVVAMYIVLGILYESFI 889 DA A+M+ + LP I + G + + + L+ + V +++ L LYES+ Sbjct: 838 DA-MALMENLAS-KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 890 HPITILSTLPTAGVGALLALMIAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949 P++++ +P VG LLA + + DV ++G++ IG+ KNAI++++FA ++ Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 950 GMSPRDAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLIVSQV 1009 G +A A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + + Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015 Query: 1010 LTLFTTPVIYLLFDRL 1025 L +F PV +++ R Sbjct: 1016 LAIFFVPVFFVVIRRC 1031
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 922 bits (2385), Expect = 0.0 Identities = 290/1035 (28%), Positives = 508/1035 (49%), Gaps = 36/1035 (3%) Query: 6 LFIYRPVATILLSVAITLCGILGFRMLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65 FI RP+ +L++ + + G L LPVA P + P + VSA+ PGA +T+ +V Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLERSLGRIAGVSEMTSSS-SLGSTRIILQFDFDRDINGAARDVQAAINAAQSLLPSGMP 124 +E+++ I + M+S+S S GS I L F D + A VQ + A LLP + Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGELYDFASTQLAPTISQIDGVGDVDVGGSSL 182 + S + +M+ SD +Q ++ D+ ++ + T+S+++GVGDV + G+ Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182 Query: 183 PAVRVGLNPQALFNQGVSLDDVRTAISNANVRKPQG------ALEDGTHRWQIQTNDELK 236 A+R+ L+ L ++ DV + N + G AL I K Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 237 TAAEYQPLIIHYN-NGGAVRLGDVATVTDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295 E+ + + N +G VRL DVA V ++ N KPA L I+ AN + Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 296 TVDSIRAKLPELQETIPAAIDLQIAQDRSPTIRASLEEVEQTLIISVALVILVVFLFLRS 355 T +I+AKL ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++ Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 356 GRATIIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENIARHL- 414 RAT+IP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R + Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 415 EAGMKPLQAALQGTREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIVIS 474 E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 475 LLVSLTLTPMMCGWMLKASKPREQKRLRGFG----RMLVALQQGYGKSLKWVLNHTRLVG 530 +LV+L LTP +C +LK + GF Y S+ +L T Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 531 VVLLGTIALNIWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----VMRGKLQDFMKII 586 ++ +A + L++ +P +F PE+D GV + IQ + + V+ ++K Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 587 RD-DPAVDNVTGFT-GGSRVNSGMMFITLKPRDERS---ETAQQIIDRLRVKLAKEPGAN 641 + +V V GF+ G N+GM F++LKP +ER+ +A+ +I R +++L K Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 642 LFLMAVQDIRVGGRQSNASYQYTLLSDDLAALREWEPKIRKKLATL-----PELADVNSD 696 + + I G + ++ L D + + R +L + L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 697 QQDNGAEMNLVYDRDTMARLGIDVQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRY 756 ++ A+ L D++ LG+ + N ++ A G ++ K+ ++ D ++ Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 757 TQDISALEKMFVINNEGKAIPLSYFAKWQPANAPLSVNHQGLSAASTISFNLPTGKSLSD 816 ++K++V + G+ +P S F + + I G S D Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838 Query: 817 ASAAIDRAMTQLGVPSTVRGSFAGTAQVFQETMNSQVILIIAAIATVYIVLGILYESYVH 876 A A ++ ++L P+ + + G + + + N L+ + V++ L LYES+ Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 877 PLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRHGN 936 P++++ +P VG LLA LFN + ++G++ IG+ KNAI++V+FA + Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 937 LTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996 EA A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 997 TLYTTPVVYLFFDRL 1011 ++ PV ++ R Sbjct: 1017 AIFFVPVFFVVIRRC 1031 Score = 79.9 bits (197), Expect = 3e-17 Identities = 77/446 (17%), Positives = 162/446 (36%), Gaps = 26/446 (5%) Query: 592 VDNVTGFTGGS-RVNSGMMFITLKPRDERSETAQQIIDRLRVKLAKEPGANLFLMAVQDI 650 +DN+ + S S + +T + + Q+ ++L++ P + Q I Sbjct: 72 IDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE----VQQQGI 127 Query: 651 RVGGRQSNASYQYTLLSDDLAALREW-----EPKIRKKLATLPELADVNSDQQDNGAE-- 703 V S+ +SD+ ++ ++ L+ L + DV GA+ Sbjct: 128 SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQL----FGAQYA 183 Query: 704 MNLVYDRDTMARLGID----VQAANSLLNNAFGQRQISTIYQPMNQYKVVMEVDPRYTQD 759 M + D D + + + + + + T P Q + R+ Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243 Query: 760 ISALEKMFVINNEGKAIPLSYFAK--WQPANAPLSVNHQGLSAASTISFNLPTGKSLSDA 817 + +N++G + L A+ N + G AA +L D Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL-DT 302 Query: 818 SAAIDRAMTQL--GVPSTVRGSFA-GTAQVFQETMNSQVILIIAAIATVYIVLGILYESY 874 + AI + +L P ++ + T Q +++ V + AI V++V+ + ++ Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362 Query: 875 VHPLTILSTLPSAGVGALLALELFNAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRH 934 L +P +G L F + + + G++L IG++ +AI++V+ Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422 Query: 935 GNLTPQEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQ 994 L P+EA ++ ++ + +P+ GG + + ITIV + +S Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482 Query: 995 LLTLYTTPVVYLFFDRLRLRFSRKPK 1020 L+ L TP + + + K Sbjct: 483 LVALILTPALCATLLKPVSAEHHENK 508
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 123 bits (310), Expect = 8e-33 Identities = 98/435 (22%), Positives = 190/435 (43%), Gaps = 25/435 (5%) Query: 20 FMQSLDTTIVNTALPSMAQSLGESPLHMHMVIVSYVLTVAVMLPASGWLADKVGVRNIFF 79 F L+ ++N +LP +A + P + V +++LT ++ G L+D++G++ + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 80 TAIVLFTLGSLFCALSGTLNELL-LARALQGVGGAMMVPVGRLTVMKIVPREQYMAAMTF 138 I++ GS+ + + LL +AR +QG G A + + V + +P+E A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 139 VTLPGQVGPLLGPALGGLLVEYASWHWIFLINIPVGIIGAIATLL-LMPNYTMQTRRFDL 197 + +G +GPA+GG++ Y HW +L+ IP+ I + L+ L+ FD+ Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201 Query: 198 SGFLLLAVGMAVLTLALDGSKGTGLSPLTIAGLVAVGVVALVLYLLHARNNNRALFSLKL 257 G +L++VG+ L + L V V++ ++++ H R L Sbjct: 202 KGIILMSVGIVFFMLFTTSYSISFLI---------VSVLSFLIFVKHIRKVTDPFVDPGL 252 Query: 258 FRTRTFSLGLAGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316 + F +G+ M P ++ S G +++ P + + I Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312 Query: 317 VVQVVNRFGYRRVLVATTLGLSLVTLLFMTTALL----GWYYVLPFVLFLQGMVNSTRFS 372 +V+R G VL +G++ +++ F+T + L W+ + V L G+ S + Sbjct: 313 GGILVDRRGPLYVL---NIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKT 367 Query: 373 SMNTLTLKDLPDNLASSGNSLLSMIMQLSMSIGVTIAGLLLGLFGSQHVSVDSGTTQTVF 432 ++T+ L A +G SLL+ LS G+ I G LL + + Q+ + Sbjct: 368 VISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTY 427 Query: 433 MYT--WLSMALIIAL 445 +Y+ L + II + Sbjct: 428 LYSNLLLLFSGIIVI 442
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 31.0 bits (70), Expect = 0.010 Identities = 27/95 (28%), Positives = 35/95 (36%), Gaps = 20/95 (21%) Query: 164 RQTSWLIVALATLLAALATFLLA------RGLLAPVKRLVDGTHKLAAGDFTTRVTPTSE 217 RQ + L+ A L AL L+A V+ V H LA + P S Sbjct: 75 RQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD---AMKCFPGSF 131 Query: 218 DEL-----------GKLAQDFNQLASTLEKNQQMR 241 + L G L N+LA E+ QQMR Sbjct: 132 ERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMR 166
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.6 bits (186), Expect = 6e-18 Identities = 28/136 (20%), Positives = 65/136 (47%), Gaps = 1/136 (0%) Query: 11 PRILIVEDEPKLGQLLIDYLRAASYAPTLISHGDQVLPYVRQTPPDLILLDLMLPGTDGL 70 IL+ +D+ + +L L A Y + S+ + ++ DL++ D+++P + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 71 TLCREIR-RFSDIPIVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTILRRCK 129 L I+ D+P+++++A+ + + E GA DY+ KP+ E++ + L K Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 130 PQRELQQQDAESPLII 145 + + D++ + + Sbjct: 124 RRPSKLEDDSQDGMPL 139
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 33.5 bits (76), Expect = 0.004 Identities = 29/144 (20%), Positives = 54/144 (37%), Gaps = 4/144 (2%) Query: 36 RETQKSLRELNGQASRIEGFRKTSAQLAVTGHALEKARQEAEALATQFKNTERPTRAQAK 95 + + ++ +++I+ A LA LEKA + A +T + A+ Sbjct: 197 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKA 256 Query: 96 VLE----SAKRAAEDLQAKYNRLTDSVKRQQRELAAVGINTRNLAHDEQGLKNRISETTA 151 LE ++A E + +K + E AA+ +L H Q L Sbjct: 257 ALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRR 316 Query: 152 QLNRQRDALARVSAQQAKLNAVKQ 175 L+ R+A ++ A+ KL + Sbjct: 317 DLDASREAKKQLEAEHQKLEEQNK 340 Score = 29.6 bits (66), Expect = 0.049 Identities = 58/305 (19%), Positives = 107/305 (35%), Gaps = 12/305 (3%) Query: 6 KLQVLLRAVDQASRPFKSIRTASKSLSGDIRETQKSLRELNGQAS-RIEGFRKTSAQLAV 64 A + ++ + A + D+ + + + S +I+ A L Sbjct: 201 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 260 Query: 65 TGHALEKARQEAEALATQFKNTERPTRAQAKVLESAKRAAEDLQAKYNRLTDSVKRQ--- 121 LEKA + A +T + A+ LE+ K E N S++R Sbjct: 261 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDA 320 Query: 122 -QRELAAVGINTRNLAHDEQGLKNRISETTAQLNRQRDALARVSAQQAKLNAVKQRYQAG 180 + + + L + + L+ R+A ++ A+ KL + +A Sbjct: 321 SREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEAS 380 Query: 181 -KELAGNMASVGAAGVGIAAAGTMAGVKL-----LMPGYEFAQKNSELQAVLGVAKDSAE 234 + L ++ + A + A A KL L E ++K +E + AK AE Sbjct: 381 RQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAE 440 Query: 235 MAALRKQ-ARQLGDNTAASADDAAGAQIIIAKAGGDVDAIQAATPVTLNMALANRRTMEE 293 AL+++ A+Q + A A+ +Q AK G + P N+ M+E Sbjct: 441 AKALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKE 500 Query: 294 NAALL 298 L Sbjct: 501 TKRQL 505
>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature. Length = 591 Score = 28.2 bits (62), Expect = 0.017 Identities = 13/35 (37%), Positives = 18/35 (51%), Gaps = 2/35 (5%) Query: 105 SISLMLTERTLVSEVDGALH--VKNIPEPPPPEPV 137 ++ + RTL E DG V N+ PPPP P+ Sbjct: 340 ILTQLCAARTLAYEGDGYRRAPVNNMMPPPPPPPM 374
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 28.3 bits (63), Expect = 0.042 Identities = 12/99 (12%), Positives = 36/99 (36%), Gaps = 4/99 (4%) Query: 136 TEYLEFCRTAKHNPLNRFKLSPENLISVATPVELEFEDLPETVFTALTEKVKSIFGRKQA 195 + + ++ K K + + + ED + + + ++ +++ + K Sbjct: 236 KDLFKEIQSNKFEFNCYTK--NNSFVGFYCLNLMSKEDYKKIQYDSSSKLLENFYYAKDK 293 Query: 196 SD--DARLNDVHEAVTAVAEHVQEKLSATEQRLAEMENA 232 SD ++ +D+ + V +K L + E+ Sbjct: 294 SDRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKCEDK 332
>SECA#SecA protein signature. Length = 901 Score = 28.3 bits (63), Expect = 0.015 Identities = 11/60 (18%), Positives = 23/60 (38%) Query: 21 GKHLAVPRWQDTCDYYNQMMERERLTVCFHAQLKQRHATMRFEEMNDVERERLVCAIDEL 80 L + W D ++ RER+ +++ + E M E+ ++ +D L Sbjct: 715 DLDLPIAEWLDKEPELHEETLRERILAQSIEVYQRKEEVVGAEMMRHFEKGVMLQTLDSL 774
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 31.2 bits (70), Expect = 0.003 Identities = 18/63 (28%), Positives = 27/63 (42%), Gaps = 2/63 (3%) Query: 156 AQGCENKNVIIIGAGT-IGLLAIQCAVALGAKSVTAIDISSEKLALAKSFGAMQTFNSSE 214 A+G E K I GA IG + + GA + A+D + EKL S + ++ Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAH-IAAVDYNPEKLEKVVSSLKAEARHAEA 61 Query: 215 MSA 217 A Sbjct: 62 FPA 64
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 34.4 bits (79), Expect = 8e-04 Identities = 53/268 (19%), Positives = 89/268 (33%), Gaps = 17/268 (6%) Query: 29 LSKSGFSAGEIGWSYACTAIAAILSPILVGSITDRFFSAQKVLAVLMFAGALLMYFAAQQ 88 L S G A A+ ++G+++DRF ++ + ++ AGA + Y Sbjct: 35 LVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAI--- 89 Query: 89 TTFAGFFPLLLAYSLTYMPTIALTNSIAFANVPDVERDFPRIRVMGTIG-WIASGLACGF 147 A F +L + T A T ++A A + D+ R R G + G+ G Sbjct: 90 MATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG- 147 Query: 148 LPQMLGY-ADISPTNIPLLITAGSSALLGVFAFFLPDTPPKSTGKMDIKVMLGLDALILL 206 P + G SP + P A + L + FL K + + L A Sbjct: 148 -PVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRW 205 Query: 207 RDKN------FLVFFFCSFLFAMPLAFYYIFANGYLTEVGMKNATGWMTLGQFSEIFFML 260 VFF + +P A + IF G + + Sbjct: 206 ARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAM 265 Query: 261 ALPFFTKRFGIKKVLLLGLVTAAIRYGF 288 R G ++ L+LG++ Y Sbjct: 266 ITGPVAARLGERRALMLGMIADGTGYIL 293 Score = 34.0 bits (78), Expect = 0.001 Identities = 32/153 (20%), Positives = 53/153 (34%), Gaps = 20/153 (13%) Query: 253 FSEIFFMLALPFFTKRFGIKKVLLLGLVTAAIRYGFFIYGSADEYFTYALLFLGILLHGV 312 + L + RFG + VLL+ L AA+ Y +L++G ++ G+ Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-----FLWVLYIGRIVAGI 108 Query: 313 SYDFYYVTAYIYVDKKAPVHMRTAAQGLITLCCQGFGSLLGYRLGGVMMEKMFAYQEPVN 372 + V D R G ++ C GFG + G LGG+M F+ P Sbjct: 109 TGATGAVAGAYIAD-ITDGDERARHFGFMS-ACFGFGMVAGPVLGGLMGG--FSPHAP-- 162 Query: 373 GLTFNWSGMWTFGAVMIAIIAVLFMIFFRESDN 405 + A + + + ES Sbjct: 163 ---------FFAAAALNGLNFLTGCFLLPESHK 186
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 26.4 bits (58), Expect = 0.029 Identities = 13/42 (30%), Positives = 21/42 (50%), Gaps = 1/42 (2%) Query: 6 KMLLGVLLLVTSAAWAAPATAGSTNTSGISKYE-LSSFIADF 46 ++L G LLL++S +WA ++K E L + DF Sbjct: 11 RVLTGTLLLLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDF 52
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 28.5 bits (63), Expect = 0.044 Identities = 18/79 (22%), Positives = 34/79 (43%), Gaps = 8/79 (10%) Query: 93 NITLSNNQ---TSFTSGYSVTVTPAASNAKVNVSAGGGGSVMINGVATLSSA-----SSS 144 NI LS N+ T T + T++ S ++ + S G + + + + S+S Sbjct: 297 NIILSKNEDQSTQNTDSQTRTISKNTSTSRTHTSEVHGNAEVHASFFDIGGSVSAGFSNS 356 Query: 145 TRGSAAVQFLLCLLGGKSW 163 + A+ L L G ++W Sbjct: 357 NSSTVAIDHSLSLAGERTW 375
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 715 bits (1847), Expect = 0.0 Identities = 242/843 (28%), Positives = 390/843 (46%), Gaps = 35/843 (4%) Query: 2 LRMTPLASAI---VALLLGIEAYAAEETFDTHFMIGGMKDQQVANIRL--DDNQPLPGQY 56 R+ + A +AE F+ F+ Q VA++ + + PG Y Sbjct: 21 HRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADD--PQAVADLSRFENGQELPPGTY 78 Query: 57 DIDIYVNKQWRGKYEIIVKDNPQET----CLSREVIKRLGIN-----SDNFASGKQCLTF 107 +DIY+N + ++ E CL+R + +G+N N + C+ Sbjct: 79 RVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPL 138 Query: 108 EQLVQGGSYTWDIGVFRLDFSVPQAWVEELESGYVPPENWERGINAFYTSYYVSQYYSDY 167 ++ + D+G RL+ ++PQA++ GY+PPE W+ GINA +Y S Sbjct: 139 TSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQN 198 Query: 168 KASGNSKSTYVRFNSGLNLLGWQLHSDASFSKTNNNPGV-----WKSNTLYLERGFAQLL 222 + GNS Y+ SGLN+ W+L + ++S +++ W+ +LER L Sbjct: 199 RIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLR 258 Query: 223 GTLRVGDMYTSSDIFDSVRFRGVRLFRDMQMLPNSKQNFTPRVQGIAQSNALVTIEQNGF 282 L +GD YT DIFD + FRG +L D MLP+S++ F P + GIA+ A VTI+QNG+ Sbjct: 259 SRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGY 318 Query: 283 VVYQKEVPPGPFAITDLQLAGGGADLDVSVKEADGSVTTYLVPYAAVLNMLQPGVSKYDF 342 +Y VPPGPF I D+ AG DL V++KEADGS + VPY++V + + G ++Y Sbjct: 319 DIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSI 378 Query: 343 AAGRSHIEGASKHSD-FVQAGYQYGFNNLLTLYGGSMVANNYYAFTLGTGWNT-RIGAIS 400 AG A + F Q+ +G T+YGG+ +A+ Y AF G G N +GA+S Sbjct: 379 TAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALS 438 Query: 401 VDATKSHSKQDNGDVFDGQSYQIAYNKFVSQTSTRFGLAAWRYSSRDYRTFNDHVWANNK 460 VD T+++S + DGQS + YNK ++++ T L +RYS+ Y F D ++ Sbjct: 439 VDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMN 498 Query: 461 DNYRRDENDVYDI----ADYYQNDFGRKNSFSANMSQSLPEGWGSVSLSTLWRDYWGRSG 516 ++ V + DYY + ++ ++Q L ++ LS + YWG S Sbjct: 499 GYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSN 557 Query: 517 SSKDYQLSYSNNLRRISYTLAASQAYDENHHE-EKRFNIFISIPFD--WGDDVSTPRRQI 573 + +Q + I++TL+ S + ++ + ++IPF D + R Sbjct: 558 VDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHA 617 Query: 574 YMSNSTTFDDQGFASNNTGLSGTVGSRDQFNYGVNLSHQHQGN---ETTAGANLTWNAPV 630 S S + D G +N G+ GT+ + +Y V + G+ +T A L + Sbjct: 618 SASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGY 677 Query: 631 ATVNGSYSQSSTYRQAGASVSGGIVAWSGGVNLANRLSETFAVMNAPGIKDAYVNGQKYR 690 N YS S +Q VSGG++A + GV L L++T ++ APG KDA V Q Sbjct: 678 GNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGV 737 Query: 691 TTNRNGVVVYDGMTPYRENHLMLDVSQSDSEAELRGNRKIAAPYRGAVVLVNFDTDQRKP 750 T+ G V T YREN + LD + +L P RGA+V F + Sbjct: 738 RTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKA-RVGI 796 Query: 751 WFIKALRADGQPLTFGYEVNDIHGHNIGVVGQGSQLFIRTNEIPPSVNVAIDKQQGLSCT 810 + L + +PL FG V + G+V Q+++ + V V +++ C Sbjct: 797 KLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCV 856 Query: 811 ITF 813 + Sbjct: 857 ANY 859
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 64.1 bits (156), Expect = 3e-14 Identities = 22/113 (19%), Positives = 47/113 (41%), Gaps = 2/113 (1%) Query: 9 VMIVDDHPLMRRGVRQLLELDPGFEVVAEAGDGASAIDLANRLDIDVILLDLNMKGMSGL 68 +++ DD +R + Q L G++V + A+ D D+++ D+ M + Sbjct: 6 ILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 69 DTLNALRRDGVTAQIIILTVSDASSDVFALIDAGADGYLLKDSDPEVLLEAIR 121 D L +++ +++++ + + GA YL K D L+ I Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 28.0 bits (62), Expect = 0.008 Identities = 12/31 (38%), Positives = 18/31 (58%), Gaps = 1/31 (3%) Query: 34 KHIVLWLGLALACLGLAMVLWLLVL-QNVPV 63 + I+ +L + L C LAM+ W + L N PV Sbjct: 15 RRILFYLLMLLFCQQLAMIFWRIGLPDNAPV 45
>AUTOINDCRSYN#Autoinducer synthesis protein signature. Length = 216 Score = 33.7 bits (77), Expect = 1e-04 Identities = 14/79 (17%), Positives = 32/79 (40%), Gaps = 12/79 (15%) Query: 1 MIEWQDLHHSKLSVSQLYALLQLRCAVFV--------VEQNCPYQDIDGDDLTGDNRHIL 52 M+E D++H+ LS ++ L LR F + D + + ++ Sbjct: 1 MLEIFDVNHTLLSETKSGELFTLRKETFKDRLNWAVQCTDGMEFDQYDNN----NTTYLF 56 Query: 53 GWKNDELVAYARILKSDDD 71 G K++ ++ R +++ Sbjct: 57 GIKDNTVICSLRFIETKYP 75
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 405 bits (1043), Expect = e-147 Identities = 249/251 (99%), Positives = 249/251 (99%) Query: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR Sbjct: 1 MKLRLSALALGTTLLVGCASSGTDQQGRSDPLEGFNRTMYNFNFNVLDPYIVRPVAVAWR 60 Query: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM Sbjct: 61 DYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGGFIDVAGM 120 Query: 121 ANPKLQRTEPHRFGSTLGHYGAGYGPYVQLPFYGSFTLRDDGGDMADGLYPVLSWLTWPM 180 ANPKLQRTEPHRFGSTLGHYG GYGPYVQLPFYGSFTLRDDGGDMAD LYPVLSWLTWPM Sbjct: 121 ANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVLSWLTWPM 180 Query: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA Sbjct: 181 SVGKWTLEGIETRAQLLDSDGLLRQSSDPYIMVREAYFQRHDFIANGGELKPQENPNAQA 240 Query: 241 IQDDLKDIDSE 251 IQDDLKDIDSE Sbjct: 241 IQDDLKDIDSE 251
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 121 bits (305), Expect = 5e-32 Identities = 92/404 (22%), Positives = 167/404 (41%), Gaps = 17/404 (4%) Query: 19 VTIALSLATFMQMLDSTISNVAIPTISGFLGASTDEGTWVITSFGVANAIAIPVTGRLAQ 78 + I L + +F +L+ + NV++P I+ WV T+F + +I V G+L+ Sbjct: 15 ILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 79 RIGELRLFLLSVTFFSLSSLMCSLS-TNLDVLIFFRVVQGLMAGPLIPLSQSLLLRNYPP 137 ++G RL L + S++ + + +LI R +QG A L ++ R P Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 138 EKRTFALALWSMTVIIAPICGPILGGYICDNFSWGWIFLINVPMGIIVLTLCLTLLKGRE 197 E R A L V + GP +GG I W +L+ +PM I+ L L +E Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKE 192 Query: 198 TETSPVKMNLPGLTLLVLGVGGLQIMLDKGRDLDWFNSSTIIILTVVSVISLISLVIWES 257 ++ G+ L+ +G+ + ML F +S I +VSV+S + V Sbjct: 193 VRIKG-HFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIR 241 Query: 258 TSENPILDLSLFKSRNFTIGIVSITCAYLFYSGAIVLMPQLLQETMGYNAIWAGLAYAPI 317 +P +D L K+ F IG++ + +G + ++P ++++ + G Sbjct: 242 KVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFP 301 Query: 318 GIMPLLIS-PLIGRYGNKIDMRLLVTFSFLMYAVCYYWRSVTFMPTIDFTGIILPQFFQG 376 G M ++I + G ++ ++ +V + S T F II+ G Sbjct: 302 GTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGG 361 Query: 377 FAVACFFLPLTTISFSGLPDNKFANASSMSNFFRTLSGSVGTSL 420 + ++TI S L + S+ NF LS G ++ Sbjct: 362 LSFTK--TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 78.3 bits (193), Expect = 6e-18 Identities = 62/413 (15%), Positives = 122/413 (29%), Gaps = 96/413 (23%) Query: 13 RRKYFSLLAVVLFIAFSGAYAYWSMELEDMISTDDAYVT-GNADPISAQVSGSVTVVNHK 71 RR ++ F+ + + ++E + + + G + I + V + K Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLG-QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113 Query: 72 DTNYVRQGDILVSLDKTDATIALNKA---------------------------------- 97 + VR+GD+L+ L A K Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173 Query: 98 ------------------KNNLANIVRQTNKLYLQDKQYSAEVASARIQ---YQQSLEDY 136 K + Q + L + AE + + Y+ Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 137 NRRV----PLAKQGVISKE----------TLEHTKDTLISSKAALNAAIQAYKANKALVM 182 R+ L + I+K + S + + I + K LV Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 183 N-------TPLNR-QPQVVEAADATKEAWLALKRTDIKSPVTGYIAQRSVQ-VGETVSPG 233 L + + + + + I++PV+ + Q V G V+ Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 234 QSLMAVVPARQ-MWVNANFKETQLTDVRIGQSVNIISDLYGENVVFHGRVTGINMGTGNA 292 ++LM +VP + V A + + + +GQ+ I + F G +G Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVE------AFPYTRYGYLVGK--- 404 Query: 293 FSLLPAQNATGNWIKIVQRVPVEVSLDPKELMEH----PLRIGLSMTATIDTR 341 + + +V V +S++ L PL G+++TA I T Sbjct: 405 VKNINLDAIEDQRLGLVFNVI--ISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 49.1 bits (117), Expect = 3e-09 Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 31/148 (20%) Query: 4 IIIDDHPLAIAAIRNLLIKNDIEILAELTEGGSAVQRVETLKPDIVIIDVDIPGVNGIQV 63 ++ DD + L + ++ + + + + D+V+ DV +P N + Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRIT-SNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 64 LETLRKRQYSGIIIIVSAKNDHFYGKHCADAGANGFVSKKEGMNNIIAAIEAAKNGYCYF 123 L ++K + ++++SA+N + AI+A++ G + Sbjct: 66 LPRIKKARPDLPVLVMSAQNT------------------------FMTAIKASEKGAYDY 101 Query: 124 ---PFSLNRFVGSLTSDQQKLDSLSKQE 148 PF L + + L ++ Sbjct: 102 LPKPFDLTE---LIGIIGRALAEPKRRP 126
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.9 bits (197), Expect = 2e-17 Identities = 30/105 (28%), Positives = 51/105 (48%) Query: 960 SILIADDHPTNRLLLKRQLNLLGYDVDEATDGVQALHKVSMQHYDLLITDVNMPNMDGFE 1019 +IL+ADD R +L + L+ GYDV ++ ++ DL++TDV MP+ + F+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 1020 LTRKLREQNSSLPIWGLTANAQANEREKGLSCGMNLCLFKPLTLD 1064 L ++++ LP+ ++A K G L KP L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLT 109
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 49.8 bits (119), Expect = 4e-09 Identities = 33/116 (28%), Positives = 52/116 (44%), Gaps = 9/116 (7%) Query: 63 VRDGIVWDFFGAVTIVRRHLDTLEQQFGLRFS-HAATSFPPGTDP---RISINVLESAGL 118 ++DG++ DFF +++ + + +R S P G R + AG Sbjct: 76 MKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGA 135 Query: 119 EVSHVLDEPTAVA---DLLQLDNAG--VVDIGGGTTGIAIVKKGKVTYSADEATGG 169 +++EP A A L + G VVDIGGGTT +A++ V YS+ GG Sbjct: 136 REVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGG 191
>cloacin#Cloacin signature. Length = 551 Score = 27.8 bits (61), Expect = 0.030 Identities = 39/157 (24%), Positives = 65/157 (41%), Gaps = 18/157 (11%) Query: 34 QQGKNEEQRQHDEW-------VAERNREIQQEKQRRANAQAAANK-RAATAAANKKARQD 85 +Q ++EE R+ EW AERN E + + +AN A N+ R A A +R+ Sbjct: 297 KQRQDEENRRQQEWDATHPVEAAERNYERARAELNQANEDVARNQERQAKAVQVYNSRKS 356 Query: 86 KLD------AEASADKKRDQSYEDELHS---LEIQKQKLALAKEEARVKRENEFIDQELK 136 +LD A+A A+ K+ + + + Q L + + V + D K Sbjct: 357 ELDAANKTLADAIAEIKQFNRFAHDPMAGGHRMWQMAGLKAQRAQTDVNNKQAAFDAAAK 416 Query: 137 HKAAQTDVVQSEADANRNMTEGGRDLMKSVGKAEENK 173 K + D S A +R E + ++ E+NK Sbjct: 417 EK-SDADAALSSAMESRKKKEDKKRSAENNLNDEKNK 452
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 114 bits (288), Expect = 5e-30 Identities = 81/371 (21%), Positives = 144/371 (38%), Gaps = 74/371 (19%) Query: 23 GIDLGTTNSLVATVRSGQAETLADHEGRHLLPSVVHYQQQGHS-------VGYDARTNAA 75 IDLGT N+L+ G + +E PSVV +Q VG+DA+ Sbjct: 14 SIDLGTANTLIYVKGQG----IVLNE-----PSVVAIRQDRAGSPKSVAAVGHDAK-QML 63 Query: 76 LDTANTISSVKRLMGRSLADIQQRYPHLPYQFQASENGLPMIETAAGLLNPVRVSADILK 135 T I++++ + +AD V+ +L+ Sbjct: 64 GRTPGNIAAIRPMKDGVIADF-------------------------------FVTEKMLQ 92 Query: 136 ALAARATEALAGE-LDGVVITVPAYFDDAQRQGTKDAARLAGLHVLRLLNEPTAAAIAYG 194 + V++ VP +R+ +++A+ AG + L+ EP AAAI G Sbjct: 93 HFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAG 152 Query: 195 LDSGQEGVIAVYDLGGGTFDISILRLSRGVFEVLATGGDSALGGDDFDHLLADYIREQAG 254 L + V D+GGGT +++++ L+ V +GGD FD + +Y+R G Sbjct: 153 LPVSEATGSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYG 207 Query: 255 --IPDRSDNRVQRELLDAAIAAKIALSDADSVTVNVAG---WQG-----EISREQFNELI 304 I + + R++ E+ A + + V G +G ++ + E + Sbjct: 208 SLIGEATAERIKHEI-------GSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEAL 260 Query: 305 APLVKRTLLACRRALKDAGVE-ADEVLE--VVMVGGSTRVPLVRERVGEFFGRPPLTSID 361 + + A AL+ E A ++ E +V+ GG + + + E G P + + D Sbjct: 261 QEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAED 320 Query: 362 PDKVVAIGAAI 372 P VA G Sbjct: 321 PLTCVARGGGK 331
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 26.3 bits (58), Expect = 0.032 Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 11/87 (12%) Query: 4 KTLTAAAAVLLMLTAGCSTLERVVYRPDINQGNYLTANDVSKIRV--GMTQQQVAYALGT 61 K + AVL + AG LER ++ Q + + + VS+ + GMT ++ A Sbjct: 69 KVV-LCGAVLARVDAGDEQLERKIH---YRQQDLVDYSPVSEKHLADGMTVGELCAA--A 122 Query: 62 PLMSDPFGTNTWFYVFRQQPGHEGVTQ 88 MSD N + G G+T Sbjct: 123 ITMSDNSAANL---LLATVGGPAGLTA 146
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 28.7 bits (64), Expect = 0.014 Identities = 20/105 (19%), Positives = 35/105 (33%), Gaps = 17/105 (16%) Query: 1 MKPRQRQAAILEYLQKQGKCSVEEL-----AQYFDTTGTTIRKDLVILEHAGTVIRTYGG 55 M QR I E + + +EL ++ T T+ +D+ E + T G Sbjct: 1 MNKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIK--ELHLVKVPTNNG 58 Query: 56 ---VVLNKEESDPPIDHKTLINTHKKELIAEAAVSFIHDGDSIIL 97 L ++ P+ K + +A V I+L Sbjct: 59 SYKYSLPADQRFNPLS-------KLKRSLMDAFVKIDSASHLIVL 96
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 373 bits (958), Expect = e-127 Identities = 125/388 (32%), Positives = 193/388 (49%), Gaps = 33/388 (8%) Query: 149 IAALAAGALS----------NALLIEQLESQNMMPGDATPFEAVKQTQMIGLSPGMTQLK 198 I A GA +I + ++ ++ ++G S M ++ Sbjct: 91 IKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIY 150 Query: 199 KEIEIVAASDLNVLISGETGTGKELVAKAIHEASPRAVNPLVYLNCAALPESVAESELFG 258 + + + +DL ++I+GE+GTGKELVA+A+H+ R P V +N AA+P + ESELFG Sbjct: 151 RVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFG 210 Query: 259 HVKGAFTGAISNRSGKFEMADNGTLFLDEIGELSLALQAKLLRVLQYGDIQRVGDDRSLR 318 H KGAFTGA + +G+FE A+ GTLFLDEIG++ + Q +LLRVLQ G+ VG +R Sbjct: 211 HEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIR 270 Query: 319 VDVRVLAATNRDLREEVLAGRFRADLFHRLSVFPLSVPPLRERGDDVILLAGYFCEQCRL 378 DVR++AATN+DL++ + G FR DL++RL+V PL +PPLR+R +D+ L +F +Q Sbjct: 271 SDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAE- 329 Query: 379 RLGLSRVVLSAGARNLLQHYSFPGNVRELEHAIHRAVVLSRATRSGDEVIL-----EAQH 433 + GL A L++ + +PGNVRELE+ + R L E+I E Sbjct: 330 KEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPD 389 Query: 434 FAFPEVTLPPPEAAAVPVVKQNLR-----------------EATEAFQRETIRQALAQNH 476 + + V++N+R + I AL Sbjct: 390 SPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATR 449 Query: 477 HNWAACARMLETDVANLHRLAKRLGLKD 504 N A +L + L + + LG+ Sbjct: 450 GNQIKAADLLGLNRNTLRKKIRELGVSV 477
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.4 bits (63), Expect = 0.034 Identities = 17/93 (18%), Positives = 29/93 (31%), Gaps = 7/93 (7%) Query: 3 TTMLEVAKRAGVSKATVSRVLSG-----NGYVSQETKDRVFQAVEESGYRPNLLARNLSA 57 T++ E+AK AGV++ + + + +E P L Sbjct: 32 TSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLRE 91 Query: 58 KSTQTLGLVVTNTLYHGIYFSELLFHAARMAEE 90 L VT + E++FH E Sbjct: 92 ILIHVLESTVTEERRRLLM--EIIFHKCEFVGE 122
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 82.3 bits (203), Expect = 4e-19 Identities = 44/142 (30%), Positives = 63/142 (44%), Gaps = 14/142 (9%) Query: 415 PEQKMEVTASLQAQTVRLDSMSLFDVGQARLKDGSTKVL---VDALVNIRAKPGWLILVA 471 +Q + L S LF+ +A LK L L N+ K G ++V Sbjct: 200 VAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDG-SVVVL 258 Query: 472 GYTDATGDEKSNQQLSLRRAEAVRNWMLQTSDIPATCFAVQGLGESQPAATNDTPQGR-- 529 GYTD G + NQ LS RRA++V ++ L + IPA + +G+GES P N + Sbjct: 259 GYTDRIGSDAYNQGLSERRAQSVVDY-LISKGIPADKISARGMGESNPVTGNTCDNVKQR 317 Query: 530 -------AVNRRVEISLVPRSD 544 A +RRVEI + D Sbjct: 318 AALIDCLAPDRRVEIEVKGIKD 339
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 34.8 bits (80), Expect = 0.001 Identities = 37/189 (19%), Positives = 67/189 (35%), Gaps = 34/189 (17%) Query: 512 IMALRQEGADSTDLQQQLRTHQGFAPLLALDVDARAVATVVADWTG----IPLSSLL--- 564 + + ++ DL +++ + P+L + + + A G +P L Sbjct: 52 VTDVVMPDENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTEL 111 Query: 565 RDEQSDLLSMEQSLENR----------VVGQSPALCAIAQRL-RAAKTGLTPENGPQGVF 613 L+ + ++ +VG+S A+ I + L R +T LT Sbjct: 112 IGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLT--------L 163 Query: 614 LLTGPSGTGKTETALTLADTLFGGEKSLITINLSEYQEPHTVSQLKGSPPGYVGYGQGGV 673 ++TG SGTGK A L D + IN++ S+L G + G Sbjct: 164 MITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGA 215 Query: 674 LTEAVRKRP 682 T A + Sbjct: 216 FTGAQTRST 224
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 110 bits (276), Expect = 2e-31 Identities = 71/257 (27%), Positives = 129/257 (50%), Gaps = 11/257 (4%) Query: 3 LSAFSLEGKVAVVTGCDTGLGQGMALGLAQAGCDIVGI--NIVEPTETIKQVTALGRRFL 60 ++A +EGK+A +TG G+G+ +A LA G I + N + + + + A R Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60 Query: 61 SLTADLRKIDGIPALLDRAVAEFGHIDILVNNAGLIRREDALEFSEKDWDDVMNLNIKSV 120 + AD+R I + R E G IDILVN AG++R S+++W+ ++N V Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120 Query: 121 FFMSQAAAKHFIAQGNGGKIINIASMLSFQGGIRVPSYTASKSGVMGVTRLMANEWAKHN 180 F S++ +K+ + + G I+ + S + + +Y +SK+ + T+ + E A++N Sbjct: 121 FNASRSVSKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179 Query: 181 INVNAIAPGYMATNNTQQLRADEQRSAEILD--------RIPAGRWGLPSDLMGPIVFLA 232 I N ++PG T+ L ADE + +++ IP + PSD+ ++FL Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239 Query: 233 SSASDYVNGYTIAVDGG 249 S + ++ + + VDGG Sbjct: 240 SGQAGHITMHNLCVDGG 256
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 71.5 bits (175), Expect = 2e-18 Identities = 28/164 (17%), Positives = 65/164 (39%), Gaps = 9/164 (5%) Query: 1 MSTETIEIFNNSDEWANQLKHALSKGENLALLHGLTPDILDRIYAYAFDYHEKGNITDAE 60 M ET + + E+ ++ L G +A+L+ ++ D L+++Y+ AF+ ++ G DA Sbjct: 1 MQQETTD----TQEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAH 56 Query: 61 IYYKFLCIYAFENHEYLKDFASVCQPKKKYQQAYDLYKLSYNYSPYDDYSVIYRMGQCQI 120 ++ LC+ + + + Q +Y A Y + + +C + Sbjct: 57 KVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDI-KEPRFPFHAAECLL 115 Query: 121 GAKNIDNAMQCFYH----IINNCEDDSVKSKAQAYIELLNDNSE 160 + A + I + E + ++ + +E + E Sbjct: 116 QKGELAEAESGLFLAQELIADKTEFKELSTRVSSMLEAIKLKKE 159
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 33.0 bits (75), Expect = 0.001 Identities = 22/126 (17%), Positives = 49/126 (38%), Gaps = 5/126 (3%) Query: 4 ISLLLFILLLCGCKQQE-LLNHLDQQQANDVLAVLQRHNINAEKKDQGKTGFSIFVEPTD 62 +++++ ++L L ++L Q ++A L + NI + I V Sbjct: 35 VAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPYRFANGSGA---IEVPADK 91 Query: 63 FASAVDWLKIYNLPGKPDIQISQMFPADALVSSPRAEKARLYSAIEQRLEQSLKIMDGII 122 L LP + + + S +E+ A+E L ++++ + + Sbjct: 92 VHELRLRLAQQGLPKGGAVGFE-LLDQEKFGISQFSEQVNYQRALEGELARTIETLGPVK 150 Query: 123 SSRVHV 128 S+RVH+ Sbjct: 151 SARVHL 156
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 96.0 bits (239), Expect = 5e-27 Identities = 33/82 (40%), Positives = 48/82 (58%) Query: 2 IANPTHIAIGIYFKPHLSTIPLISVRETNEVALAVRKYAKEIGIPIITDKKLARKIYATH 61 +ANPTHIAIGI +K + +PL++ + T+ VRK A+E G+PI+ LAR +Y Sbjct: 261 VANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDA 320 Query: 62 RRYDYVSFENIDEILRLLLWLE 83 Y+ E I+ +L WLE Sbjct: 321 LVDHYIPAEQIEATAEVLRWLE 342
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 74.8 bits (184), Expect = 3e-20 Identities = 25/49 (51%), Positives = 39/49 (79%) Query: 2 IIVLIFDFIAEYFLFMKDMKMDKQEVKREYKEQEGNPEIKSKRRERIRK 50 +++ I D+ EY+ ++K++KM K E+KREYKE EG+PEIKSKRR+ ++ Sbjct: 196 VVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQFHQE 244
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 111 bits (280), Expect = 6e-32 Identities = 40/146 (27%), Positives = 70/146 (47%), Gaps = 5/146 (3%) Query: 2 ANKTEKPTQKKLQDASKKGQILKSRDLTISVIMLVG--TLYLGYVFDVHHIMSILEYILD 59 KTE+PT KK++DA KKGQ+ KS+++ + +++ L + H ++ + Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIPAE 62 Query: 60 HNAKPNIWD---YFKAMGVGWLKTNIPFLLVCMFTTILVSWFQSKMQLATEAVKFKFDSL 116 + P + + + P L V I Q ++ EA+K + Sbjct: 63 QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKI 122 Query: 117 NPVNGLKRIFGLKTVKEFVKAILYIV 142 NP+ G KRIF +K++ EF+K+IL +V Sbjct: 123 NPIEGAKRIFSIKSLVEFLKSILKVV 148
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 140 bits (354), Expect = 9e-43 Identities = 48/248 (19%), Positives = 100/248 (40%), Gaps = 4/248 (1%) Query: 1 MGEAILYQLHSLLAATALGFCRLAPTFYLLPFFASGNIPTVVRHPIIIVVSCALVQHYHY 60 M + Q S L R+ P + ++P V+ + ++++ A+ Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 Query: 61 ELLNLNEIDIALFAAREIIIGLFIACLLASPFWIFLAIGSFIDNQRGATLSSTLDPATGV 120 + + A ++I+IG+ + + F G I Q G + ++ +DPA+ + Sbjct: 61 NDVPVFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120 Query: 121 DTSELARLFNLFSAAVYLTKGGMNFILETLWQSYNLWPSGNFNF--PKLEPLFSYINNIM 178 + LAR+ ++ + ++LT G +++ L +++ P G L + I Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIF 180 Query: 179 THTIVYASPVIAVMLGGEAVLGLLARYASQLNAFAISLTVKSALAFLILIIYFA--PILA 236 + ++ A P+I ++L LGLL R A QL+ F I + + ++ Sbjct: 181 LNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFC 240 Query: 237 ERVMPLSF 244 E + F Sbjct: 241 EHLFSEIF 248
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 78.7 bits (194), Expect = 4e-23 Identities = 59/86 (68%), Positives = 73/86 (84%) Query: 1 MDDIVFAGNRALYLILVMSAGPIAVATFVGLLVGLFQTVTQLQEQTLPFGVKLLCVSICF 60 MDD+VFAGN+ALYL+L++S P VAT +GLLVGLFQTVTQLQEQTLPFG+KLL V +C Sbjct: 1 MDDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCL 60 Query: 61 FLMSGWYGEKLYSFGIEMLNLAFARG 86 FL+SGWYGE L S+G +++ LA A+G Sbjct: 61 FLLSGWYGEVLLSYGRQVIFLALAKG 86
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 226 bits (578), Expect = 2e-77 Identities = 151/223 (67%), Positives = 181/223 (81%), Gaps = 5/223 (2%) Query: 1 MSNSISLIAILSLFTLLPFIIASGTCFIKFSIVFVIVRNALGLQQVPSNMTLNGVALLLS 60 M N ISLIA+L+ TLLPFIIASGTCF+KFSIVFV+VRNALGLQQ+PSNMTLNGVALLLS Sbjct: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60 Query: 61 MFVMMPVGKEIYNNSQNENLSFNNVASVVNFVETGMSGYKSYLIKYSEPELVSFFEKIQK 120 MFVM P+ + Y ++E+++FN+++S+ V+ G+ GY+ YLIKYS+ ELV FFE Q Sbjct: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120 Query: 121 VNSSEDNEEIIDDD-----NISIFSLLPAYALSEIKSAFIIGFYIYLPFVVVDLVISSVL 175 + E + D SIF+LLPAYALSEIKSAF IGFY+YLPFVVVDLV+SSVL Sbjct: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180 Query: 176 LTLGMMMMSPVTISTPIKLILFVAMDGWTMLSKGLILQYFDLS 218 L LGMMMMSPVTISTPIKL+LFVA+DGWT+LSKGLILQY D++ Sbjct: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQYMDIA 223
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 82.7 bits (204), Expect = 2e-22 Identities = 40/104 (38%), Positives = 61/104 (58%) Query: 8 EEDFETDDFDIKKNESEIYDENDDQMINSFEDLPVKIEFVLGKKIMNLYEIDELCAKRII 67 E + DI+ E E + + LPVK+EFVL +K + L E++ + ++++ Sbjct: 198 EGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYRKNVTLAELEAMGQQQLL 257 Query: 68 SLLPESEKNIKIRVNGALTGYGELVEVDDKLGVEIHSWLSGNNN 111 SL +E N++I NG L G GELV+++D LGVEIH WLS + N Sbjct: 258 SLPTNAELNVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESGN 301
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 56.9 bits (137), Expect = 6e-13 Identities = 31/118 (26%), Positives = 50/118 (42%) Query: 4 LRKVNRNTHTFETTFQNWKENGEDVALLMPEFSAKWLPIAEESGSWSGWVLLGEIFPLIS 63 +R+++R T + +G + L P W+ +++ WS W+ G+ +S Sbjct: 5 VRQIDRREWLLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIKPGDWLEHVS 64 Query: 64 SELAGMALMPETERLIGEWLSLSSSPLNLKHPELKYNRLHVGKVFDGVLSPAQPLIRI 121 LAG A+ E L+ WL+ + P L P L RL V G P L+ I Sbjct: 65 PALAGAAVSAGAEHLVVPWLAATERPFELPVPHLSCRRLCVENPVPGSALPEGKLLHI 122
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 30.2 bits (68), Expect = 0.003 Identities = 17/48 (35%), Positives = 23/48 (47%), Gaps = 3/48 (6%) Query: 1 MRRTR--AGFTLLEMLVAIAIFASLA-LMAQQVTNGVTRVNSAVAGHD 45 MR T GFTLLE++V I I LA L+ + + + A D Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSD 48
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 33.0 bits (75), Expect = 1e-04 Identities = 13/24 (54%), Positives = 18/24 (75%) Query: 2 KRGFTLLEVMLALAIFALAATAVL 25 +RGFTLLE+ML L + ++A VL Sbjct: 3 QRGFTLLEMMLILLLMGVSAGMVL 26
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 76.5 bits (188), Expect = 4e-20 Identities = 41/196 (20%), Positives = 71/196 (36%), Gaps = 41/196 (20%) Query: 1 MPERGFTLLEIMLVIFLIGLASAGVVQTFATDSESPAKKAAQDFLTRFAQFKDRAVIEGK 60 M +RGFTLLE+ML++ L+G+++ V+ F + A + F + + R + G+ Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60 Query: 61 TLGVLIDPPGYQFMQRRQGQWSPVSSTRLSAQVTVPKQVQMLLQPGSDIWQKEYALELQR 120 GV + P +QF+ + + P D W L L+ Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPA-------------------PADDGWSGYRWLPLRA 101 Query: 121 RRL----TLHDIELEL-----QKEAKKKTPQIRFSPFEPATPFTLRFYSAAQNACWAVKL 171 R+ ++ +L L + P + P TPF L L Sbjct: 102 GRVATSGSIAGGKLNLAFAQGEAWTPGDNPDVLIFPGGEMTPFRLT-------------L 148 Query: 172 AHDGALSLNQCDERMP 187 ++ N E +P Sbjct: 149 GEAPGIAFNARGESLP 164
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 217 bits (554), Expect = 3e-76 Identities = 90/146 (61%), Positives = 109/146 (74%), Gaps = 3/146 (2%) Query: 6 RTQKPRAGFTLLEVMVVIVILGVLASLVVPNLLGNKEKADRQKAISDIVALENALDMYRL 65 R + GFTLLE+MVVIVI+GVLASLVVPNL+GNKEKAD+QKA+SDIVALENALDMY+L Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61 Query: 66 DNGRYPTTEQGLEALIQQPANMADSRNYRTGGYIKRLPKDPWGNDYQYLSPGEKGLFDVY 125 DN YPTT QGLE+L++ P + NY GYIKRLP DPWGNDY ++PGE G +D+ Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121 Query: 126 TLGADGQENGEGAGADIGNWNLQEFQ 151 + G DG+ E DI NW L + + Sbjct: 122 SAGPDGEMGTED---DITNWGLSKKK 144
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 453 bits (1168), Expect = e-161 Identities = 224/406 (55%), Positives = 300/406 (73%), Gaps = 1/406 (0%) Query: 1 MALFYYQALERNGRKTKGMIEADSARHARQLLRGKELIPVHI-EARMNTSSGGMLQRRRH 59 MA ++YQAL+ G+K +G EADSAR ARQLLR + L+P+ + E R + G Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60 Query: 60 AHRRVAAADLALFTRQLATLVQAAMPLETCLQAVSEQSEKLHVKSLGMALRSRIQEGYTL 119 R++ +DLAL TRQLATLV A+MPLE L AV++QSEK H+ L A+RS++ EG++L Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120 Query: 120 SDSLREHPRVFDSLFCSMVAAGEKSGHLDVVLNRLADYTEQRQRLKSRLLQAMLYPLVLL 179 +D+++ P F+ L+C+MVAAGE SGHLD VLNRLADYTEQRQ+++SR+ QAM+YP VL Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180 Query: 180 VVATGVVTILLTAVVPKIIEQFDHLGHALPASTRTLIAMSDALQASGVYWLAGLLGLLVL 239 VVA VV+ILL+ VVPK++EQF H+ ALP STR L+ MSDA++ G + L LL + Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240 Query: 240 GQRLLKNPAMRLRWDQTVLRLPVTGRVARGLNTARFSRTLSILTASSVPLLEGIQTAAAV 299 + +L+ R+ + + +L LP+ GR+ARGLNTAR++RTLSIL AS+VPLL+ ++ + V Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300 Query: 300 SANRYVEQQLLLAADRVREGSSLRAALAELRLFPPMMLYMIASGEQSGELETMLEQAAVN 359 +N Y +L LA D VREG SL AL + LFPPMM +MIASGE+SGEL++MLE+AA N Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360 Query: 360 QEREFDTQVGLALGLFEPALVVMMAGVVLFIVIAILEPMLQLNNMV 405 Q+REF +Q+ LALGLFEP LVV MA VVLFIV+AIL+P+LQLN ++ Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 574 bits (1482), Expect = 0.0 Identities = 295/668 (44%), Positives = 431/668 (64%), Gaps = 34/668 (5%) Query: 24 LLPLVLAAALCSSPVWAEEATFTANFKDTDLKSFIETVGANLNKTIIMGPGVQGKVSIRT 83 L L++ AAL P AEE F+A+FK TD++ FI TV NLNKT+I+ P V+G +++R+ Sbjct: 11 SLTLLIFAALLFRPAAAEE--FSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRS 68 Query: 84 MTPLNERQYYQLFLNLLEAQGYAVVPMENDVLKVVKSSAAKVEPLPLVGEGSDNYAGDEM 143 LNE QYYQ FL++L+ G+AV+ M N VLKVV+S AK +P+ + + GDE+ Sbjct: 69 YDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPG-IGDEV 127 Query: 144 VTKVVPVRNVSVRELAPILRQMIDSAGSGNVVNYDPSNVIMLTGRASVVERLTEVIQRVD 203 VT+VVP+ NV+ R+LAP+LRQ+ D+AG G+VV+Y+PSNV+++TGRA+V++RL +++RVD Sbjct: 128 VTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVD 187 Query: 204 HAGNRTEEVIPLDNASASEIARVLESLTKNSGENQ-PATLKSQIVADERTNSVIVSGDPA 262 +AG+R+ +PL ASA+++ +++ L K++ ++ P ++ + +VADERTN+V+VSG+P Sbjct: 188 NAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPN 247 Query: 263 TRDKMRRLIRRLDSEMERSGNSQVFYLKYSKAEDLVDVLKQVSGTLTAAKEEAEGTVGSG 322 +R ++ +I++LD + GN++V YLKY+KA DLV+VL +S T+ + K+ A+ + Sbjct: 248 SRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPV-AAL 306 Query: 323 REIVSIAASKHSNALIVTAPQDIMQSLQSVIEQLDIRRAQVHVEALIVEVAEGSNINFGV 382 + + I A +NALIVTA D+M L+ VI QLDIRR QV VEA+I EV + +N G+ Sbjct: 307 DKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGI 366 Query: 383 QWASKDAGLMQFANGTQIPIGTLGAAISQAKPQKGSTVISENGATTINPDTNGDLST-LA 441 QWA+K+AG+ QF N + +PI T A + +G +S+ LA Sbjct: 367 QWANKNAGMTQFTN-SGLPISTAIAG-------------------ANQYNKDGTVSSSLA 406 Query: 442 QLLSGFSGTAVGVVKGDWMALVQAVKNDSSSNVLSTPSITTLDNQEAFFMVGQDVPVLTG 501 LS F+G A G +G+W L+ A+ + + +++L+TPSI TLDN EA F VGQ+VPVLTG Sbjct: 407 SALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466 Query: 502 STVGSNNSNPFNTVERKKVGIMLKVTPQINEGNAVQMVIEQEVSKVEGQTS-----LDVV 556 S S N FNTVERK VGI LKV PQINEG++V + IEQEVS V S L Sbjct: 467 SQTTSG-DNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGAT 525 Query: 557 FGERKLKTTVLANDGELIVLGGLMDDQAGESVAKVPLLGDIPLIGNLFKSTADKKEKRNL 616 F R + VL GE +V+GGL+D ++ KVPLLGDIP+IG LF+ST+ K KRNL Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585 Query: 617 MVFIRPTILRDGMAADGVSQRKYNYMRAEQIYR--DEQGLSLMPHTAQPVLPAQNQALPP 674 M+FIRPT++RD S +Y Q + E +++ + P Q+ A Sbjct: 586 MLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENNDAMLNQDLLEIYPRQDTAAFR 645 Query: 675 EVRAFLNA 682 +V A ++A Sbjct: 646 QVSAAIDA 653
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 103 bits (257), Expect = 3e-28 Identities = 64/277 (23%), Positives = 109/277 (39%), Gaps = 36/277 (12%) Query: 1 MLLIISAKVAHSLWRYFSFSAEYTDVSPSANKPPRADAKTFDKNDVQLISQQNWFGKYQP 60 ++L+ ++A WR S P +A + ND L FG Sbjct: 22 LMLLFCQQLAMIFWR-IGLPDNAPVSSVQIT-PAQARQQPVTLNDFTL------FGVSPE 73 Query: 61 V--ATPVKQPEPVPVAETRLNVVLRGIAFG---ARPGAVIEEGGKQQVYLQGETLGSHNA 115 A + + + + LN+ L G+ G +R A+I + +Q E + +NA Sbjct: 74 KNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEVPGYNA 133 Query: 116 VIEEINRDHVMLRYQGKIERLSLAEEERSTVAATNKKAVSDEAKQAVAEPAVSAPVEIPT 175 I I D V+L+YQG+ E L L +E S V+++ +Q Sbjct: 134 KIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVPGAQVNEQLQQ--------------- 178 Query: 176 AVRQALAKDPQKIFNYIQLTPVRKEG-IVGYAVKPGADRSLFDASGFKEGDIAIALNQQD 234 + + +Y+ +P+ + + GY + PG F G ++ D+A+ALN D Sbjct: 179 -------RASTTMSDYVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVALNGLD 231 Query: 235 FTDPRAMIALMRQLPSMDSIQLTVLRKGARHDISIAL 271 D M ++ + + LTV R G R DI + Sbjct: 232 LRDAEQAKKAMERMADVHNFTLTVERDGQRQDIYMEF 268
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 280 bits (717), Expect = 8e-97 Identities = 110/271 (40%), Positives = 150/271 (55%), Gaps = 12/271 (4%) Query: 1 MLFDVFQQYPAAMPILATVGGLIIGSFLNVVIWRYPIML-RQQMAEFHGEMPSAQSKI-- 57 +L ++ P L + L+IGSFLNVVI R PIML R+ AE+ + Sbjct: 3 LLLELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDE 62 Query: 58 ---SLALPRSHCPHCQQTIRIRDNIPLLSWLMLKGRCRDCQAKISKRYPLVELLTALAFL 114 +L +PRS CPHC I +NIPLLSWL L+GRCR CQA IS RYPLVELLTAL + Sbjct: 63 PPYNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSV 122 Query: 115 LASLVWPESGWALAVMILSAWLIAASVIDLDHQWLPDVFTQGVLWTGLIAAWAQQSPLTL 174 ++ LA ++L+ L+A + IDLD LPD T +LW GL+ ++L Sbjct: 123 AVAMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNL-LGGFVSL 181 Query: 175 QDAVTGVLVGFIAFYSLRWIAGVVLRKEALGMGDVLLFAALGSWVGPLSLPNVALIASCC 234 DAV G + G++ +SL W ++ KE +G GD L AALG+W+G +LP V L++S Sbjct: 182 GDAVIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLV 241 Query: 235 GLIYAVI-----TKRGSTTLPFGPCLSLGGI 260 G + S +PFGP L++ G Sbjct: 242 GAFMGIGLILLRNHHQSKPIPFGPYLAIAGW 272
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 48.8 bits (116), Expect = 5e-08 Identities = 23/60 (38%), Positives = 29/60 (48%), Gaps = 3/60 (5%) Query: 32 SSDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTPD---PEPTPEPEPEPVP 88 S T + V+P P P EP PEP P PEP E P+P P+P+P+PV Sbjct: 50 ISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVK 109 Score = 42.3 bits (99), Expect = 7e-06 Identities = 14/57 (24%), Positives = 19/57 (33%), Gaps = 1/57 (1%) Query: 33 SDTPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTPDPEPTPEPEPEPVPT 89 +D P + PE +P P PEP PEP + E + V Sbjct: 58 ADLEPPQAVQ-PPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQ 113 Score = 41.1 bits (96), Expect = 2e-05 Identities = 20/117 (17%), Positives = 37/117 (31%), Gaps = 6/117 (5%) Query: 35 TPPVDSGTGSLPEVKPDPTPNPEPTPEPTPDPEPTPEPTPDP--EPTPEPEPEPVPTKTG 92 P + +P +P PEP +PEP PEP P+P E E K Sbjct: 45 APAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPK 104 Query: 93 YLTLGGSLRVTGDITCNDESSDGFTFTPGDKVTCVAGNNTTIATFDTQSEAARSLRA 149 + + D+ + +P + ++T ++ + + Sbjct: 105 PKPVKKVEQPKRDVKPVESRPA----SPFENTAPARPTSSTATAATSKPVTSVASGP 157 Score = 38.0 bits (88), Expect = 2e-04 Identities = 15/61 (24%), Positives = 23/61 (37%), Gaps = 6/61 (9%) Query: 29 SGSSSDTPPVDSGTGSLPEVKPDPTPNPEPTPE-PTPDPEPTPEPTPDPEPTPEPEPEPV 87 + P V PE +P P P E P P+P P+P + +P+ + Sbjct: 65 AVQPPPEPVV----EPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP-KPVKKVEQPKRDVK 119 Query: 88 P 88 P Sbjct: 120 P 120 Score = 35.7 bits (82), Expect = 8e-04 Identities = 17/40 (42%), Positives = 17/40 (42%) Query: 50 PDPTPNPEPTPEPTPDPEPTPEPTPDPEPTPEPEPEPVPT 89 P P T D EP P PEP EPEPEP P Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI 83 Score = 30.7 bits (69), Expect = 0.034 Identities = 11/40 (27%), Positives = 13/40 (32%) Query: 52 PTPNPEPTPEPTPDPEPTPEPTPDPEPTPEPEPEPVPTKT 91 P P + + P P P P EPEP P Sbjct: 44 PAPAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPI 83
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 37.4 bits (87), Expect = 1e-04 Identities = 26/194 (13%), Positives = 58/194 (29%), Gaps = 40/194 (20%) Query: 12 TGLLLLLALAFVLFYEAINGFHDTANAVATVIY------TRAMQPQLAVVMAAFFNFFGV 65 L++AL+ +L + F + + ++A+ + V+ FF Sbjct: 30 VSTALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFP 89 Query: 66 LLGGLSVAYAIVHML-------------------PTDLLLNMGSTHGLAMVFSMLLAAII 106 LL ++ H++ P + + S L +L ++ Sbjct: 90 LLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVL 149 Query: 107 WNLGTWFFGLPASSSHTLIGAIIGIGLTNALLTGSSVMDALNLREVTKIFSSLIVSPIVG 166 ++ W + ++ L + L + +I L+V VG Sbjct: 150 LSILIWIIIKG------NLVTLLQ-------LPTCGIECITPL--LGQILRQLMVICTVG 194 Query: 167 LVIAGGLIFLLRRY 180 V+ + Y Sbjct: 195 FVVISIADYAFEYY 208
>DPTHRIATOXIN#Diphtheria toxin signature. Length = 567 Score = 27.8 bits (61), Expect = 0.020 Identities = 15/48 (31%), Positives = 24/48 (50%), Gaps = 1/48 (2%) Query: 92 MTFTVGELDGVSQYLSCSLMSPLSHSMSIEEG-QRLTDDCARMILSLP 138 + V + + + L SL PL + EE +R D +R++LSLP Sbjct: 124 LALKVDNAETIKKELGLSLTEPLMEQVGTEEFIKRFGDGASRVVLSLP 171
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 28.5 bits (63), Expect = 0.011 Identities = 36/163 (22%), Positives = 66/163 (40%), Gaps = 35/163 (21%) Query: 14 AMILSNNVFADEGHGIVKFKGEVISAPCSIKPGDEDLTVNLGEVADTVLKSDQKSLAE-- 71 A+++S +V A + + FKG++I C++ ++ VN G++ L + + Sbjct: 15 AVLMSQHVHAADN---LTFKGKLIIPACTV----QNAEVNWGDIEIQNLVQSGGNQKDFT 67 Query: 72 -----PFTIHLQDCMLSQGGTTYSKAKVTFTTANTMTGQTDLLKNTKETEIGGATGVGVR 126 P+++ ++ G T + V T+ + G L N+ + IG A Sbjct: 68 VDMNCPYSLGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNA------ 121 Query: 127 ILDSQSGEVTLGTPVV---ITFNNTNS----YQELNFKARMES 162 VTLG+ V IT Y +L +K M+S Sbjct: 122 --------VTLGSQVTPGKITGTAPARKITLYAKLGYKGNMQS 156
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 641 bits (1655), Expect = 0.0 Identities = 225/837 (26%), Positives = 389/837 (46%), Gaps = 56/837 (6%) Query: 17 IYCSLSVLIIGCASAYAVEFNKDLIEAEDRENVNLSQFETDGQLPVGKYSLNALINNKRT 76 ++ + + S+ + FN + + + +LS+FE +LP G Y ++ +NN Sbjct: 30 LFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYM 89 Query: 77 PIHLDLQWVLIDN--QTAVCLTPEQLTLLGFTDEIIEVAQQNLIDGCYPIEK-EKQITTY 133 D+ + D+ CLT QL +G + D C P+ T Sbjct: 90 A-TRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDATAQ 148 Query: 134 LDKGKMQLSISAPQAWLKYKDANWTPPELWDHGIAGAFLDYNLYASHYAPHQGDNSQNIS 193 LD G+ +L+++ PQA++ + + PPELWD GI L+YN + G NS Sbjct: 149 LDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAY 208 Query: 194 SYGQAGVNLGAWRLRTDYQYDQSFNNGKS-QANNLDFPRIYLFRPIPAINAKLTIGQYDT 252 Q+G+N+GAWRLR + + + ++ S N +L R I + ++LT+G T Sbjct: 209 LNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYT 268 Query: 253 ESSIFDSFHFSGVSLKSDENMLPPDLRGYAPQITGVAQTNAKVTVSQNNRIIYQENVPPG 312 + IFD +F G L SD+NMLP RG+AP I G+A+ A+VT+ QN IY VPPG Sbjct: 269 QGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPG 328 Query: 313 PFAITNLFNT-LQGQLDVKVEEEDGQVTQWQVASNSIPYLTRKGQIRYTTAMGKPTSVGG 371 PF I +++ G L V ++E DG + V +S+P L R+G RY+ G+ S G Sbjct: 329 PFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRS-GN 387 Query: 372 DSLQQPFFWTGEFSWGWLNNVSLYGGSVLTNRDYQSLAAGVGFNLNSLGSLSFDVTRSDA 431 ++P F+ G ++YGG+ L +R Y++ G+G N+ +LG+LS D+T++++ Sbjct: 388 AQQEKPRFFQSTLLHGLPAGWTIYGGTQLADR-YRAFNFGIGKNMGALGALSVDMTQANS 446 Query: 432 QLHNQDKETGYSYRANYSKRFESTGSQLTFAGYRFSDKNFVTMNEYIND----------- 480 L + + G S R Y+K +G+ + GYR+S + + Sbjct: 447 TLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQD 506 Query: 481 ---------TNHYTNYQNEKESYIVTFNQYLESLRLNTYVSLARNTYWDAS-SNVNYSLS 530 T++Y N++ +T Q L Y+S + TYW S + + Sbjct: 507 GVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSNVDEQFQAG 565 Query: 531 LSRDFDIGPLKNVSTSLTFSRIN--WEEDNQDQLYLNISIPWGTSR-----------TLS 577 L+ F ++++ +L++S W++ L LN++IP+ + S Sbjct: 566 LNTAF-----EDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASAS 620 Query: 578 YGMQRNQDNKISHTASWYDS--SDRNNSWSVSASGDNDEFKDMKASLRASYQHNTENGRL 635 Y M + + ++++ A Y + D N S+SV + ++ A+ + G Sbjct: 621 YSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNA 680 Query: 636 YLSGTSQRDSYYSLNASWNGSFTATRHGAAFHDYSGSADSRFMIDADGAEDIPLNNKRAV 695 + + D L +G A +G D+ ++ A GA+D + N+ V Sbjct: 681 NIGYSHSDD-IKQLYYGVSGGVLAHANGVTLGQPLN--DTVVLVKAPGAKDAKVENQTGV 737 Query: 696 -TNRYGIGVIPSVSSYITTSLSVDTRNLPENVDIENSVITTTLTEGAIGYAKLDTRKGYQ 754 T+ G V+P + Y +++DT L +NVD++N+V T GAI A+ R G + Sbjct: 738 RTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIK 797 Query: 755 IMGVIRLADGSHPPLGISVKDKTSHKELGLVADGGFVYLNGIQDDSKLTLRWGDKSC 811 ++ + + P G V ++S + G+VAD G VYL+G+ K+ ++WG++ Sbjct: 798 LLMTLT-HNNKPLPFGAMVTSESS-QSSGIVADNGQVYLSGMPLAGKVQVKWGEEEN 852
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 52.0 bits (124), Expect = 5e-09 Identities = 47/287 (16%), Positives = 93/287 (32%), Gaps = 16/287 (5%) Query: 197 PNNAFDAEGLTKLTQETERRRRERNEVEQDVEVAVREKNRDALSRKLEIEQQEAFMTLEQ 256 N A+ + + E R A + + ++ E +QE+ + Sbjct: 999 TPNNIQAD-VPSVPSNNEEIARVDEAPVPPPAPATPSETTETVA---ENSKQESKTVEKN 1054 Query: 257 EQQVKTRTAEQNARIAAFEAERRREAE-QTRILAERQIQETEIEREQAVRSRKVEAEREV 315 EQ TA+ R A EA+ +A QT +A+ + E + + + VE E + Sbjct: 1055 EQDATETTAQN--REVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKA 1112 Query: 316 RIKEIEQQQVTEIANQTKSIAIAAKSEQ---QSQAEARANLALAEAVSAQQNVETTRQTA 372 +++ + Q+V ++ +Q +++ Q + E + + E S T Q A Sbjct: 1113 KVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPA 1172 Query: 373 EADRAKQVALIAAAQDAET------KAVELTVRAKAEKEAAEMQAAAIVELAEATRKKGL 426 + + + + T T +E + R Sbjct: 1173 KETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPH 1232 Query: 427 AEAEAQRALNDAINVLSDEQTSLKFKLALLQALPAVIEKSVEPMKSI 473 A + ND V + TS L A ++ K++ Sbjct: 1233 NVEPATTSSNDRSTVALCDLTSTNTNAVLSDARAKAQFVALNVGKAV 1279
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 29.0 bits (65), Expect = 0.028 Identities = 10/37 (27%), Positives = 20/37 (54%) Query: 347 GVFDILHAGHVSYLANARKLGDRLIVAVNSDASTKRL 383 G FD + GH+ + +L D++ VAV + + + + Sbjct: 7 GSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPM 43
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.0 bits (96), Expect = 7e-06 Identities = 58/329 (17%), Positives = 107/329 (32%), Gaps = 37/329 (11%) Query: 34 PTLMEELNISTQQ---YSYIIAAYSAAYTVMQPVAGYVLDVLGTK----IGYAMFAVLWA 86 P L+ +L S Y ++A Y+ PV G + D G + + A AV +A Sbjct: 29 PGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYA 88 Query: 87 VFCGATALAGSWGGLAIA--RGAVGAAEAAMIPAGLKASSEWFPAKERSIAVGYFNVGSS 144 + A L + G +A GA GA A I ++ ER+ G+ + Sbjct: 89 IMATAPFLWVLYIGRIVAGITGATGAVAGAYI-------ADITDGDERARHFGFMSACFG 141 Query: 145 IGAMIAPPLVVWAIVMHSWQMAFIISGALSFIWAMAWLIFYKHPRDQKHLTDEERDYIIN 204 G M+A P++ + S F + AL+ + + K R +N Sbjct: 142 FG-MVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES--HKGERRPLRREALN 198 Query: 205 GQEAQHQVSTAKKMSVGQILRNRQFWGIALPRFLAEPAWGTFNAWIPLFMFKVYGFNLKE 264 + ++ + F+ + L + W +F + ++ Sbjct: 199 PLASFRWARGMTVVAALMAV----FFIMQLVGQVPAALWV-------IFGEDRFHWDATT 247 Query: 265 IAMFAWMPMLFADLGCILGGYLPPLFQRWFGVNLIVSRKMVV-TLGAVLMIGPGMIGLFT 323 I + F L + + G + M+ G +L+ + Sbjct: 248 IGI---SLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMA- 303 Query: 324 NPYVAIMLLCIGGFAHQALSGALITLSSD 352 + ++LL GG AL L + Sbjct: 304 --FPIMVLLASGGIGMPALQAMLSRQVDE 330
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 76.8 bits (189), Expect = 2e-16 Identities = 75/434 (17%), Positives = 142/434 (32%), Gaps = 32/434 (7%) Query: 285 NSRVDAYRNEQLLGSFYLNSGSQFIDTSSFPPGSYSVALKVYENNQLTRTELVPFTKTGG 344 ++V +N + + + G I+ S + + + E + T+ VP++ Sbjct: 308 TAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPL 367 Query: 345 LT-DGNAQWFLQAGKTTSQVS-DDESSAYQLGVRLPLHPQYELYAGLANADDVSAFELGN 402 L +G+ ++ + AG+ S + ++ +Q + L + +Y G AD AF G Sbjct: 368 LQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGI 427 Query: 403 NWTADLGGVGNLAISASVFRNDDGGKGDMQQANWS-NPGWPTLGF------YRTNSDG-- 453 G ++ ++ + D + D Q + N G YR ++ G Sbjct: 428 GKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYF 487 Query: 454 -DACTTDSRESYNALSCYESISATVSQNFVGWNMMLGYTRTQNNTDDSLRWDKQQSFENN 512 A TT SR + + + + V F + + R + + + + + + Sbjct: 488 NFADTTYSRMNGYNIETQDGV-IQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLS 546 Query: 513 YLRQTT--AQSISETVQLSASRAFVMRDWILSTSVGVFHRNDNGGDNDDNGLYLSFS--L 568 QT ++ E Q + AF ++ ++ + D L L+ + Sbjct: 547 GSHQTYWGTSNVDEQFQAGLNTAFED----INWTLSYSLTKNAWQKGRDQMLALNVNIPF 602 Query: 569 SDTPAMDSNNNSHSTNVSTDYRYSEQDGDQTSWQLSHTFYNDSFSHKEL--GVTVGGLNT 626 S DS + + S + + T D+ + G GG Sbjct: 603 SHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGN 662 Query: 627 DTINSAVNGRWDGQYGNVYATVSDSYDRKNHDHLSAFTGTYSSTLAVSRYGVNLGASGTD 686 + G YGN S S + L S + GV LG D Sbjct: 663 SGSTGYATLNYRGGYGNANIGYSHS---DDIKQLYY---GVSGGVLAHANGVTLGQPLND 716 Query: 687 DLLGAVLVDVKGFS 700 VLV G Sbjct: 717 ---TVVLVKAPGAK 727 Score = 31.4 bits (71), Expect = 0.020 Identities = 39/222 (17%), Positives = 68/222 (30%), Gaps = 35/222 (15%) Query: 299 SFYLNSGSQFIDTSSF------PPGSYSVALKVYENNQLTRTELVPFTKTGGLTDGNAQW 352 F + D S F PPG+Y V + + NN T V F Sbjct: 52 RFLADDPQAVADLSRFENGQELPPGTYRVDIYL--NNGYMATRDVTFNTGDSEQG----- 104 Query: 353 FLQAGKTTSQVSDDESSAYQLGVRLPLHPQYELYAGLANADDVSAFELGNNWTADLG-GV 411 + T +Q++ +G+ L A A S D+G Sbjct: 105 -IVPCLTRAQLA-------SMGLNTASVSGMNLLADDACVPLTSMIH-DATAQLDVGQQR 155 Query: 412 GNLAISASVFRNDDGGKGDMQQANWSNPGWPTLGFYRTNSDGDACTTDSRESYNALSCYE 471 NL I + N +G + W L Y + + +R N+ Y Sbjct: 156 LNLTIPQAFMSNRA--RGYIPPELWDPGINAGLLNYNFS----GNSVQNRIGGNSHYAYL 209 Query: 472 SISATVSQNFVGW----NMMLGYTRTQNNTDDSLRWDKQQSF 509 ++ + + N W N Y + +++ +W ++ Sbjct: 210 NLQSGL--NIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTW 249
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 536 bits (1383), Expect = 0.0 Identities = 173/397 (43%), Positives = 254/397 (63%), Gaps = 11/397 (2%) Query: 7 VLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVN-GGEPAP--LAHHSYEGA 63 +LVINCGSSS+K+ ++++ D VL G+A+ I ++ L+ N GE ++ A Sbjct: 3 ILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKDA 62 Query: 64 LKAIAFELEKRNLN-----DSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLH 118 +K + L + + +GHR+ HGG FT S +ITD+V+ I LAPLH Sbjct: 63 IKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPLH 122 Query: 119 NYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTS 178 N AN+ GI++ Q+ P V VAVFDT+FHQTM AYLY +P++YY + +R+YGFHGTS Sbjct: 123 NPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGTS 182 Query: 179 HRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSG 238 H+YVSQRA +LN + ++ HLGNG+SI AV+NG+S+DTSMG TPLEGL MGTRSG Sbjct: 183 HKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRSG 242 Query: 239 DVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLR-VLEKAWHEGHERAQLAI 297 +D +S++ + N S ++ ++NK+SG+ GISG+SSD R + + A+ G +RAQLA+ Sbjct: 243 SIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLAL 302 Query: 298 KTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGVEIDTEMNNRS 357 F +R+ + I +AA++ +D I+FT GIGEN IR +++ L LG ++D E N Sbjct: 303 NVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKVR 362 Query: 358 NSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGK 394 E I+S+ +++V V+PTNEE MIA D + + Sbjct: 363 GE--EAIISTADSKVNVMVVPTNEEYMIAKDTEKIVE 397
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 29.0 bits (65), Expect = 0.012 Identities = 8/22 (36%), Positives = 13/22 (59%) Query: 4 VLITGATGLVGGHLLRMLINEP 25 L+TGA G +G H+ + L+ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG 24
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 73.4 bits (180), Expect = 2e-15 Identities = 69/313 (22%), Positives = 109/313 (34%), Gaps = 77/313 (24%) Query: 396 IMGHVDHGKTSLLDYI-----RSTKVASGEAG-------------GITQHIGAYHVETEN 437 ++ HVD GKT+L + + T++ S + G GIT G + EN Sbjct: 8 VLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQWEN 67 Query: 438 GMITFLDTPGHAAFTSMRARGAQATDIVVLVVAADDGVMPQTIEAIQHAKAAQVPVVVAV 497 + +DTPGH F + R D +L+++A DGV QT + +P + + Sbjct: 68 TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIFFI 127 Query: 498 NKIDKPEADPDRV----KNELSQYGI-----------------LPEEWG----------- 525 NKID+ D V K +LS + E+W Sbjct: 128 NKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGNDDLLE 187 Query: 526 ---------------GESQFV---------HVSAKAGTGIDELLDAILLQAEVLELKAVR 561 ES H SAK GID L++ I + Sbjct: 188 KYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVIT--NKFYSSTHRG 245 Query: 562 KGMASGAVIESFLDKGRGPVATVLVREGTLHKGDIVL-CGFEYGRVRAMRNELGQEVLEA 620 + G V + + R +A + + G LH D V E ++ M + E+ + Sbjct: 246 QSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYTSINGELCKI 305 Query: 621 GPSIPVEILGLSG 633 + EI+ L Sbjct: 306 DKAYSGEIVILQN 318
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 44.7 bits (105), Expect = 7e-07 Identities = 39/199 (19%), Positives = 68/199 (34%), Gaps = 19/199 (9%) Query: 143 DLAGNATDQANGVQPAPGTTSAENTQQDVSL-----------------PPISSTPTQGQT 185 DL ++ N T+ N Q DV PP +TP++ Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038 Query: 186 PAATDGQQRVEVQGDLNNALTQPQNQQQLNNVAVNSTLPTEPATVAPVRNGNASRETAKT 245 A + +Q + T+ Q + S + T ++G+ ++ET T Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098 Query: 246 QTAERPATTRPARQQAVIEPKKPQATVKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPA 305 +T E + + + E + V ++ P + + +P A A P Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEP 1158 Query: 306 TKETATTAPVQTASPAQTT 324 +T TTA T PA+ T Sbjct: 1159 QSQTNTTA--DTEQPAKET 1175 Score = 43.5 bits (102), Expect = 2e-06 Identities = 42/203 (20%), Positives = 67/203 (33%), Gaps = 10/203 (4%) Query: 126 APSTSSSDQTASGEKSIDLAGNATDQANGVQPAPGTTSAENTQQDVSLPPISST-PTQGQ 184 P+ +D + + ++A D+A PAP T S + S T Q Sbjct: 999 TPNNIQADVPSVPSNNEEIA--RVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQ 1056 Query: 185 TPAATDGQQRVEVQGDLNNALTQPQN----QQQLNNVAVNSTLPTEPATVAPVRNGNASR 240 T Q R + +N Q Q +T E ATV + A Sbjct: 1057 DATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE--KEEKAKV 1114 Query: 241 ETAKTQTAERPATTRPARQQAVIEPKKPQATVKTEPKPVAQTPKRTEPAAPVASTKAPAA 300 ET KTQ + + +Q+ E +PQA E P + A T+ PA Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQS-ETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAK 1173 Query: 301 TSTPATKETATTAPVQTASPAQT 323 ++ ++ T + + Sbjct: 1174 ETSSNVEQPVTESTTVNTGNSVV 1196 Score = 32.7 bits (74), Expect = 0.003 Identities = 29/139 (20%), Positives = 41/139 (29%), Gaps = 7/139 (5%) Query: 206 TQPQNQQQLNNVAV------NSTLPTEPATVAPVRNGNASRETAKTQTAERPATTRPARQ 259 QPQ + N S T T P + +++ E T++ Sbjct: 1139 VQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVEN 1198 Query: 260 QAVIEPKKPQATVKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPATKETATTAPVQTAS 319 P Q TV +E + R V PA TS+ A T + Sbjct: 1199 PENTTPATTQPTVNSESSNKPKNRHRRS-VRSVPHNVEPATTSSNDRSTVALCDLTSTNT 1257 Query: 320 PAQTTATPAAGGKTAGNVG 338 A + A A NVG Sbjct: 1258 NAVLSDARAKAQFVALNVG 1276
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 32.1 bits (73), Expect = 8e-04 Identities = 27/91 (29%), Positives = 40/91 (43%), Gaps = 18/91 (19%) Query: 32 FYDSDQEIEKRTGADVGWVFDLEGEEGFRD----------REEKVINELTEKQGIVLATG 81 FYD + KR + GW+ + G+R E + I +L E+ IV+A+G Sbjct: 136 FYDEETA--KRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKLVERGVIVIASG 193 Query: 82 GGSVKSRETRNRLSARGVVVYLETTIEKQLA 112 GG V + +GV E I+K LA Sbjct: 194 GGGVPVILEDGEI--KGV----EAVIDKDLA 218
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 287 bits (735), Expect = 1e-93 Identities = 80/301 (26%), Positives = 132/301 (43%), Gaps = 18/301 (5%) Query: 107 LENRSITLQYADAGELAKAGEKLLSAKGSMTVDKRTNRLLLRDNKTALSALEQWVAQMDL 166 L + +I D + +A SA+ + D N +++RD+ + ++ + +D Sbjct: 219 LSDATIQQVTVDNQRIPQAAT-RASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDK 277 Query: 167 PVGQVELSAHIVTINEKSLRELGVKWTLADAQHAGGVGQVTTLGSDLSVATATTHVGFNI 226 P ++E++ IV IN L ELGV W + + T G ++A+ G Sbjct: 278 PSARIEVALSIVDINADQLTELGVDWRVGIRTGNNHQVVIKTTGDQSNIASN----GALG 333 Query: 227 GRINGRLLDL---ELSALEQKQQLDIIASPRLLASHLQPASIKQGSEIPYQVSSGESGAT 283 ++ R LD ++ LE + +++ P LL A I SE Y +G+ A Sbjct: 334 SLVDARGLDYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDH-SETYYVKVTGKEVA- 391 Query: 284 SVEFKEAVLG--MEVTPTVLQKG---RIRLKLHISQNVPGQVLQQADGEVLAIDKQEIET 338 E K G + +TP VL +G I L LHI +G + I + ++T Sbjct: 392 --ELKGITYGTMLRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIEG-IPTISRTVVDT 448 Query: 339 QVEVKSGETLALGGIFTRKNKSGQDSVPLLGDIPWFGQLFRHDGKEDERRELVVFITPRL 398 V G++L +GGI+ + VPLLGDIP+ G LFR + R + I PR+ Sbjct: 449 VARVGHGQSLIIGGIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRI 508 Query: 399 V 399 + Sbjct: 509 I 509 Score = 33.3 bits (76), Expect = 0.002 Identities = 23/116 (19%), Positives = 43/116 (37%), Gaps = 20/116 (17%) Query: 10 KPQKVTLMVDDVPVAQVLQALAEQEKLNLVVSPDVSGTVSLHLTDVPWKQALQTVVKSAG 69 P + + +L +VVS ++ VS + LQ + Sbjct: 32 LPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKINDKVSGQFEHDNPQDFLQHIASLYN 91 Query: 70 LITRQEGNILSVHSIAWQNNNIARQEAEQARAQANLPLENRSITLQYADAGELAKA 125 L+ +GN+L + ++N+ +A +R I LQ ++A EL +A Sbjct: 92 LVWYYDGNVLYI----FKNSEVA----------------SRLIRLQESEAAELKQA 127
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 38.4 bits (89), Expect = 4e-06 Identities = 21/92 (22%), Positives = 34/92 (36%), Gaps = 16/92 (17%) Query: 55 VACIDGDVVGHLTIDVQQRPRRSHVADFGICVDSRWKNRGVASALMREMIE------MCD 108 + ++ + +G + I + + D + D R K GV +AL+ + IE C Sbjct: 69 LYYLENNCIGRIKIR-SNWNGYALIEDIAVAKDYRKK--GVGTALLHKAIEWAKENHFCG 125 Query: 109 NWLRVDRIELTVFVDNAPAIKVYKKYGFEIEG 140 L I N A Y K+ F I Sbjct: 126 LMLETQDI-------NISACHFYAKHHFIIGA 150
>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature. Length = 293 Score = 31.2 bits (70), Expect = 0.010 Identities = 27/82 (32%), Positives = 37/82 (45%), Gaps = 17/82 (20%) Query: 275 RTPISGDYRGYQVYSMPPPSSGGIHIVQILNI--LENFDMKKYGF-GSADAMQIMAEAEK 331 R P+ G+ R + SMPPP G H +I N+ + FD G+ G A I++E EK Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNLKFFKQFD----GYVGGQTAWGILSELEK 131 Query: 332 YVYADRSEYLGDPDFVKVPWQA 353 Y P F WQ+ Sbjct: 132 GRY---------PTFSYQDWQS 144
>PF04619#Dr-family adhesin Length = 160 Score = 28.4 bits (63), Expect = 0.017 Identities = 12/60 (20%), Positives = 22/60 (36%), Gaps = 4/60 (6%) Query: 29 VGAKYGHKMIEFDAKLSKDGEIFLLHDDNLERTSNGWGVAGELNWQD----LLRVDAGSW 84 +G ++ D + G+ FL+ D+N ++ W + D GSW Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.8 bits (69), Expect = 0.009 Identities = 13/43 (30%), Positives = 20/43 (46%), Gaps = 7/43 (16%) Query: 33 IVMIGPSGCGKSTLLRMVAGLERVTEGDIWINDQRVTEMEPKD 75 +V+ G G GKSTL+ + GL+ + +D KD Sbjct: 599 VVLEGTGGIGKSTLINTLVGLD-------FFSDTHFDIGTGKD 634
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 39.3 bits (91), Expect = 2e-05 Identities = 39/160 (24%), Positives = 66/160 (41%), Gaps = 14/160 (8%) Query: 134 GHLLSQPFNSSTPVLYYNKDAFKKAGLDPEQPPKTWQDLADYAAKLKASGMKCGYASGWQ 193 G L++ P L YNKD PPKTW+++ +LKA G + + Sbjct: 127 GKLIAYPIAVEALSLIYNKDLLP-------NPPKTWEEIPALDKELKAKGKSALMFNLQE 179 Query: 194 GWIQLENFSAWNGLPFASKNNGFDGTDAVLEF--NKPEQVKHIAMLEEMNKKGDFSYVGR 251 + +A G F +N +D D ++ K + +++ + D Y Sbjct: 180 PYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY--- 236 Query: 252 KDESTEKFYNGDCAMTTASSGSLANIREYAKFNYGVGMMP 291 + F G+ AMT + +NI + +K NYGV ++P Sbjct: 237 -SIAEAAFNKGETAMTINGPWAWSNI-DTSKVNYGVTVLP 274
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 51.6 bits (123), Expect = 6e-09 Identities = 36/183 (19%), Positives = 62/183 (33%), Gaps = 12/183 (6%) Query: 19 EQTPEKETEVQNEQTVVEEIVQAQEPVKASEQAVEE----QPQAHTEAEAET-FAADVVE 73 P T + +TV E Q + V+ +EQ E + EA++ E Sbjct: 1025 VPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNE 1084 Query: 74 VTEQVAESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEEVSPEEWQAEAE 133 V + +E+++ Q + V +E V E+ + +VSP++ Q+E Sbjct: 1085 VAQSGSETKETQTTE--TKETATVEKEEKAKVETEKTQ---EVPKVTSQVSPKQEQSETV 1139 Query: 134 TVEIVEAAEEEAAK--EEITDEELEAQALAAEAAEEAVIVVPPAEEEQPVEEIAQEQEKP 191 + A E + +E + A E + V P E V E P Sbjct: 1140 QPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENP 1199 Query: 192 TKE 194 Sbjct: 1200 ENT 1202 Score = 46.2 bits (109), Expect = 2e-07 Identities = 37/171 (21%), Positives = 60/171 (35%), Gaps = 19/171 (11%) Query: 20 QTPEK-ETEVQNEQTVVEEIVQAQEPVKASEQAVEEQPQAHTEAEAETFAADVVEVTEQV 78 TP + +V + + EEI + E T AE + VE EQ Sbjct: 998 TTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQD 1057 Query: 79 AESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEEVSPEEWQAEAETVEIV 138 A AQ EV + + V+ + + + +E E E TVE Sbjct: 1058 ATETTAQ-NREVAKEAKSNVKANTQTNEVAQSGS------ETKETQTTE-TKETATVE-- 1107 Query: 139 EAAEEEAAKEEITDEELEAQALAAEAAEEAVIVVPPAEEEQPVEEIAQEQE 189 +EE AK E + + + + ++ +E QP E A+E + Sbjct: 1108 ---KEEKAKVETEKTQEVPKVTSQVSPKQE-----QSETVQPQAEPAREND 1150 Score = 45.1 bits (106), Expect = 6e-07 Identities = 27/157 (17%), Positives = 50/157 (31%), Gaps = 6/157 (3%) Query: 17 QKEQTPEKETE----VQNEQTVVEEIVQAQEPVKASEQAVEEQPQAHTEAEAETFAADVV 72 + ++T E E V+ E+T V +Q K EQ+ QPQA E + Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPK-QEQSETVQPQAEPARENDPTVNIKE 1157 Query: 73 EVTEQVAESEKAQPEAEVVAQPEPVVEETPEPVAIEREELPLPEDVNAEEVSPEEWQAEA 132 ++ ++ QP E + E V E+ V + PE+ P + Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTES-TTVNTGNSVVENPENTTPATTQPTVNSESS 1216 Query: 133 ETVEIVEAAEEEAAKEEITDEELEAQALAAEAAEEAV 169 + + + + + A + Sbjct: 1217 NKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLT 1253 Score = 38.5 bits (89), Expect = 7e-05 Identities = 34/196 (17%), Positives = 61/196 (31%), Gaps = 23/196 (11%) Query: 17 QKEQTPEK------ETEVQNEQTVVE------------EIVQAQEPVKASEQAVEEQPQA 58 Q+ +T EK ET QN + E E+ Q+ K ++ ++ Sbjct: 1046 QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETAT 1105 Query: 59 HTEAE-AETFAADVVEVTEQVAESEKAQPEAEVV-AQPEPVVEETPEPVAIEREELPLPE 116 + E A+ EV + ++ Q ++E V Q EP E P E + Sbjct: 1106 VEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQS---QT 1162 Query: 117 DVNAEEVSPEEWQAEAETVEIVEAAEEEAAKEEITDEELEAQALAAEAAEEAVIVVPPAE 176 + A+ P + + + E+ + + E A P Sbjct: 1163 NTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNR 1222 Query: 177 EEQPVEEIAQEQEKPT 192 + V + E T Sbjct: 1223 HRRSVRSVPHNVEPAT 1238 Score = 31.2 bits (70), Expect = 0.013 Identities = 33/149 (22%), Positives = 50/149 (33%), Gaps = 15/149 (10%) Query: 52 VEEQPQAHTEAEAETFAADVVEVT-EQVAESEKAQPEAEVVAQPEPVVE-ETPEPVAIER 109 VE++ Q T +V E A+ + V P P ET E VA Sbjct: 985 VEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENS 1044 Query: 110 EELPLPEDVNAEEVSPEEWQAEAETVEIVEAAEEEAAKEEITD-EELEAQALAAEAAE-- 166 ++ ++ V E A T + E A+E A + + E +E E Sbjct: 1045 KQ-------ESKTVEKNEQDATETTAQNREVAKE-AKSNVKANTQTNEVAQSGSETKETQ 1096 Query: 167 --EAVIVVPPAEEEQPVEEIAQEQEKPTK 193 E +EE+ E + QE P Sbjct: 1097 TTETKETATVEKEEKAKVETEKTQEVPKV 1125
>SHIGARICIN#Ribosome inactivating protein family signature. Length = 289 Score = 25.9 bits (57), Expect = 0.039 Identities = 6/21 (28%), Positives = 13/21 (61%) Query: 7 FFIVIIGLIVVAASFRFMQQR 27 +V+I AA ++F++Q+ Sbjct: 173 ALMVLIQSTSEAARYKFIEQQ 193
>PF01206#SirA family protein Length = 76 Score = 105 bits (265), Expect = 3e-34 Identities = 24/72 (33%), Positives = 41/72 (56%) Query: 9 DHTLDALGLRCPEPVMMVRKTVRNMQPGETLLIIADDPATTRDIPGFCTFMEHELVAKET 68 D +LDA GL CP P++ +KT+ M GE L ++A DP + +D F HEL+ ++ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 69 DGLPYRYLIRKG 80 + Y + +++ Sbjct: 65 EDGTYHFRLKRA 76
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 54.4 bits (131), Expect = 3e-10 Identities = 80/398 (20%), Positives = 148/398 (37%), Gaps = 32/398 (8%) Query: 27 LRLNLRIVSIVMFNFASYLTIGLPLAVLPGYVHDVM--GFSAFWAGLVISLQYFATLLSR 84 ++ N ++ I+ + IGL + VLPG + D++ G++++L Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60 Query: 85 PHAGRYADLLGPKKIVVFGLCGCFLSGLGYLTAGLTASLPVISLLLLCLGRVILGI-GQS 143 P G +D G + +++ L G + + Y L V L +GR++ GI G + Sbjct: 61 PVLGALSDRFGRRPVLLVSLAG---AAVDYAIMATAPFLWV-----LYIGRIVAGITGAT 112 Query: 144 FAGTGSTLWGVGVVGSL--HIGRVISWNGIVTYGAMAMGAPLGVVFYHWGGLQALALIIM 201 A G+ + + H G + + G +G +G H A AL + Sbjct: 113 GAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGL 172 Query: 202 GVALVAILLAIPRPTVK--ASKGKPLPFRAVLGSVWLYGMALALA-----SAGFGVIATF 254 LL + + P + + + +A +A V A Sbjct: 173 NFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAAL 232 Query: 255 ITLFYDAK-GWDGAAFALTLFSCAFVGT---RLLFPNGINRIGGLNVAMICFSVEIIGLL 310 +F + + WD ++L + + + ++ R+G M+ + G + Sbjct: 233 WVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYI 292 Query: 311 LVGVATMPWMAKIG-VLLAGAGFSLVFPALGVVAVKAVPQQNQGAALATYTVFMDLSLGV 369 L+ AT WMA VLLA G + PAL + + V ++ QG + L+ + Sbjct: 293 LLAFATRGWMAFPIMVLLASGGIGM--PALQAMLSRQVDEERQGQLQGSLAALTSLT-SI 349 Query: 370 TGPLAGLVMSWAGVPV----IYLAAAGLVAIALLLTWR 403 GPL + A + ++A A L + L R Sbjct: 350 VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.018 Identities = 10/34 (29%), Positives = 19/34 (55%) Query: 25 QAVLNNVSLTLKSGETVALLGRSGCGKSTLARLL 58 Q + ++ +++ T+ + G SG GK +AR L Sbjct: 147 QEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 49.9 bits (119), Expect = 5e-09 Identities = 41/171 (23%), Positives = 73/171 (42%), Gaps = 7/171 (4%) Query: 200 REREHGTVEHLLVMPITPFEIMMAKI-WSMGLVVLVVSGLSLVLMVKGVLGVPIEGSIPL 258 R T E +L + +I++ ++ W+ L +G+ +V G + L Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY----TQWLSLL 148 Query: 259 FMLGV-ALSLFATTSIGIFMGTIARSMPQLGLLVILVLLPLQMLSGGSTPRESMPQMVQD 317 + L V AL+ A S+G+ + +A S LV+ P+ LSG P + +P + Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 318 IMLTMPTTHFVSLAQAILYRGAGFEIVWPQFLTLMAIGGAFF-TIALLRFR 367 +P +H + L + I+ ++ + I FF + ALLR R Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.045 Identities = 9/26 (34%), Positives = 14/26 (53%) Query: 37 ARCMVGLIGPDGVGKSSLLSLISGAR 62 V L G G+GKS+L++ + G Sbjct: 595 FDYSVVLEGTGGIGKSTLINTLVGLD 620
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 84.5 bits (209), Expect = 4e-20 Identities = 72/408 (17%), Positives = 139/408 (34%), Gaps = 81/408 (19%) Query: 6 RHLAWWVVGLLAVAAIVAWWLLRPAGVP-EGFAVSNGRIEATEVDIASKIAGRIDTILVK 64 R +A++++G L +A I++ G +GR + I + I+VK Sbjct: 58 RLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKE----IKPIENSIVKEIIVK 113 Query: 65 EGQFVREGEVLAKMDTRV----------------LQEQRLEAI----------------- 91 EG+ VR+G+VL K+ L++ R + + Sbjct: 114 EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDE 173 Query: 92 -------------------AQIKEAQSAVAAAQALLEQRQSETRAAQSLVNQRQAELDSV 132 Q Q+ + L+++++E + +N+ + Sbjct: 174 PYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 133 AKRHTRSRSLAQRGAISAQQLDDDRAAAESARAALESAKAQVSASKAAIEAARTNIIQ-- 190 R SL + AI+ + + A L K+Q+ ++ I +A+ Sbjct: 234 KSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVT 293 Query: 191 -----------AQTRVEAAQATERRIAADID--DSELKAPRDGRV-QYRVAEPGEVLAAG 236 QT T + S ++AP +V Q +V G V+ Sbjct: 294 QLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTA 353 Query: 237 GRVLNMVDLSDVY-MTFFLPTEQAGTLKLGGEARLILDAAPDLRIPATISFVASVAQFTP 295 ++ +V D +T + + G + +G A + ++A P R V V Sbjct: 354 ETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYG---YLVGKVKNINL 410 Query: 296 KTVETSDERLKLMFRVKARIPPELLQQHLEYV--KTGLPGVAWVRVNE 341 +E D+RL L+F V I L + + +G+ A ++ Sbjct: 411 DAIE--DQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGM 456
>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature. Length = 52 Score = 73.7 bits (181), Expect = 7e-22 Identities = 18/48 (37%), Positives = 32/48 (66%) Query: 19 LGMPQKYRLLSLIVICFTLLFFTWMIRDSLCELHIKQGSYELAAFLAC 66 + +P+ + ++++C TLL FT++ R SLCE+ + G E+AAF+A Sbjct: 1 MKLPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAY 48
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 27.5 bits (61), Expect = 0.017 Identities = 19/66 (28%), Positives = 26/66 (39%), Gaps = 1/66 (1%) Query: 77 MTCLTVFIISVALLMVGLWNATLLLSEKGFYGLAFFLSLFGAVAVQKNIRDAGINPPKET 136 MT T II LL L + + GLA FL+ F V I P E Sbjct: 61 MTSFTRIIIVFGLLRNALGTPSAP-PNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEE 119 Query: 137 QVTQEE 142 +++ +E Sbjct: 120 KISMQE 125
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 57.9 bits (140), Expect = 8e-11 Identities = 44/147 (29%), Positives = 70/147 (47%), Gaps = 18/147 (12%) Query: 3 IATAGHVDHGKTTLLQAI---TGV------------NADRLPEEKKRGMTIDLGYAYWPQ 47 I HVD GKTTL +++ +G D E++RG+TI G + Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65 Query: 48 PDGRVPGFIDVPGHEKFLSNMLAGVGGIDHALLVVACDDGVMAQTREHLAILQLTGNPML 107 + +V ID PGH FL+ + + +D A+L+++ DGV AQTR L+ G P + Sbjct: 66 ENTKV-NIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTI 124 Query: 108 TVALTKADRVDEARVNEVERQVKEVLR 134 + K D+ ++ V + +KE L Sbjct: 125 -FFINKIDQNG-IDLSTVYQDIKEKLS 149
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 104 bits (260), Expect = 7e-28 Identities = 77/348 (22%), Positives = 127/348 (36%), Gaps = 67/348 (19%) Query: 2 IIVTGGAGFIGSNIVKALNDKGITDILVVDNLKD--------------GTKFVNLVDLDI 47 +VTG AGFIG ++ K L + G ++ +DNL D +D+ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 48 ADYMDKEDFLIQIMAGEEFGDVEAIFHEGACSSTTEWDGKYMMDNNYQYSK-------EL 100 AD + + + A F E +F + +Y ++N + Y+ + Sbjct: 62 ADR----EGMTDLFASGHF---ERVFISPHRLAV-----RYSLENPHAYADSNLTGFLNI 109 Query: 101 LHYCLEREIP-FLYASSAATYGGRTSD-FIESREYEKPLNVYGYSKFLFDEYVRQILPEA 158 L C +I LYASS++ YG F + P+++Y +K + Sbjct: 110 LEGCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLY 169 Query: 159 NSQIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGESPKLFEGSENFKRDFVYVGDVA 218 G R+F VYGP + MA F + G+S ++ KRDF Y+ D+A Sbjct: 170 GLPATGLRFFTVYGPWG--RPDMA--LFKFTKAMLEGKSIDVY-NYGKMKRDFTYIDDIA 224 Query: 219 DVNL------------WFLENGVSG-------IFNLGTGRAESFQAVADATLAY-HKKGQ 258 + + W +E G ++N+G A + + Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAK 284 Query: 259 IEYIPFPDKLKGRYQAFTQADLTNLRAA-GYDKPFKTVAEGVTEYMAW 305 +P G T AD L G+ P TV +GV ++ W Sbjct: 285 KNMLPLQ---PGDVL-ETSADTKALYEVIGF-TPETTVKDGVKNFVNW 327
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 32.0 bits (72), Expect = 0.002 Identities = 26/76 (34%), Positives = 36/76 (47%), Gaps = 9/76 (11%) Query: 134 KEEARDIAVVEVNQYVESSKFEKEIKRILYEKIEPVQFDLLAQISSIDEIKLSQQEHNQS 193 KE RD + Q + K IKR L +K+E V +L S DE K + N+ Sbjct: 661 KEANRDARAIAYAQNL------KGIKRELSDKLENVNKNLKDFDKSFDEFKNGK---NKD 711 Query: 194 FEKLEQELDGLKRNLK 209 F K E+ L LK ++K Sbjct: 712 FSKAEETLKALKGSVK 727
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 39.8 bits (93), Expect = 1e-05 Identities = 71/391 (18%), Positives = 129/391 (32%), Gaps = 33/391 (8%) Query: 20 LLVAFLTGIAGALQTPTLSIFLADELKARPIM--VGFFFTGSAIMGILVSQFLARHSDKQ 77 L L + L P L L D + + + G A+M + L SD+ Sbjct: 11 LSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRF 70 Query: 78 GDRKLLILLCCLFGVLACTLFAWNRNYFILLSTGVLLSSFASTANPQMFALAREHADRTG 137 G R ++L+ + + A ++L ++ A AD T Sbjct: 71 GRR-PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV----AGITGATGAVAGAYIADITD 125 Query: 138 RET-VMFSTFLRAQISLAWVIGPPLAYELAMGFSFKVMYLTAAIAFVVCGLIVWLFLP-- 194 + F+ A V GP L + GFS + AA + L LP Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 195 --SIQRNIPVVT-QPVEILPSTHRKRDTRLLFVVCSMMWAANNLYMINMPLFIIDELHLT 251 +R + P+ L V +M + +F D H Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWD 244 Query: 252 DKLAGEMI-GIAAGLEIPMMLIAGYYMKRIGKRLLMLIAIVSGMCFYASVLMATTPAVEL 310 G + + +I G R+G+R +++ +++ Y +L+A + Sbjct: 245 ATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGY--ILLAFATRGWM 302 Query: 311 ELQILNAIFLGILCGIGMLYFQDLMPEKI---------GSATTLYANTSRVGWIIAGSVD 361 + L GIGM Q ++ ++ GS L + TS VG ++ ++ Sbjct: 303 ---AFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIY 359 Query: 362 GIMVEIWSYHALFWLAIGMLGIAMICLLFIK 392 + W+ W I + ++CL ++ Sbjct: 360 AASITTWNG----WAWIAGAALYLLCLPALR 386 Score = 32.1 bits (73), Expect = 0.004 Identities = 18/102 (17%), Positives = 34/102 (33%) Query: 17 AAFLLVAFLTGIAGALQTPTLSIFLADELKARPIMVGFFFTGSAIMGILVSQFLARHSDK 76 AA + V F+ + G + IF D +G I+ L + Sbjct: 213 AALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAA 272 Query: 77 QGDRKLLILLCCLFGVLACTLFAWNRNYFILLSTGVLLSSFA 118 + + ++L + L A+ ++ VLL+S Sbjct: 273 RLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGG 314
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 38.3 bits (89), Expect = 4e-05 Identities = 33/208 (15%), Positives = 71/208 (34%), Gaps = 13/208 (6%) Query: 49 IIVEFLPVSLLTP----MAQDLGISEGVA---GQSVTVTAFVAMFASLFITQTIQATDRR 101 + ++ + + L+ P + +DL S V G + + A + + + RR Sbjct: 14 VALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRR 73 Query: 102 YVVILFAVLLTLSCLLVSFANSFSLLLIGRACLGLALGGFWAMSASLTMRLVPPRTVPKA 161 V+++ + +++ A +L IGR G+ G A++ + + + Sbjct: 74 PVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARH 132 Query: 162 LSVIFGAVSIALVIAAPLGSFLGELIGWRNVFNAAAVMG----VLCIFWIIKSLPSLPGE 217 + +V LG +G F AAA + + F + +S Sbjct: 133 FGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP 191 Query: 218 PSHQKQNTFRLLQRPGVMAGMIAIFMSF 245 + N + M + A+ F Sbjct: 192 LRREALNPLASFRWARGMTVVAALMAVF 219
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 37.8 bits (88), Expect = 1e-04 Identities = 30/105 (28%), Positives = 43/105 (40%), Gaps = 17/105 (16%) Query: 22 AVSRGDAVADYIIDNVSILDLINGGEISGPIVIKGRYIAGVG-AEYAD---------APA 71 V+R D +I N ILD + G + I +K IA +G A D P Sbjct: 60 QVTREGGAVDTVITNALILD--HWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117 Query: 72 LQRIDARGATAVPGFIDAHLHIESSMMTPVTFETATLPRGLTTVI 116 + I G G +D+H+H + P E A L GLT ++ Sbjct: 118 TEVIAGEGKIVTAGGMDSHIH----FICPQQIEEA-LMSGLTCML 157
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.1 bits (78), Expect = 0.001 Identities = 28/168 (16%), Positives = 61/168 (36%), Gaps = 17/168 (10%) Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108 N++ D+ + + + F +T+ +G + +D K+ L F +I++ C Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--C 90 Query: 109 MLGFSASMGSGSVSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164 +G SL +M + F Q G + + + ++ P+ RG G Sbjct: 91 FGSVIGFVGHSFFSLLIM------ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144 Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRY 212 +G + A+Y+ + + + P + I+ L Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFLMK 187
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 40.6 bits (95), Expect = 9e-06 Identities = 65/408 (15%), Positives = 137/408 (33%), Gaps = 60/408 (14%) Query: 29 RHILLTIWLGYALFY--FTRKSFNAAVPEILANGVLSRSDIGLLATLFYITYGVSKFVSG 86 RH + IWL F+ N ++P+I + + + T F +T+ + V G Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 87 IVSDRSNARYFMGIGLIATGIMNILFGFSTSLWAFAMLWVLNAFFQGWGS---PVCARLL 143 +SD+ + + G+I +++ S F L ++ F QG G+ P ++ Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127 Query: 144 TAWY-SRTERGGWWALWNTAHNVGGALIPIVMAASALHYGWRAGMMIAGCMAIVVGIFLC 202 A Y + RG + L + +G + P + A + W ++ M ++ + Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPF- 184 Query: 203 WRLRDRPQALGLPAVGEWRHDALEIAQQQEGAGLTRKEILTKYVLLNPYIWLLSFCYVLV 262 + L +I G L I+ + Y VL Sbjct: 185 --------LMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232 Query: 263 YVV-----RAAINDWGNLYMSETLGVDLVTANTAVTMFELGGFI-----------GALVA 306 +++ R + + + + + + + + + GF+ A Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292 Query: 307 GWGSDKLFNGNRGPMNLIFAAGILL-SVGSLWLMPFASYVMQATCFFTIGFFVFGPQMLI 365 GS +F G + + GIL+ G L+++ + + F T F + + Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFL-SVSFLTASFLLETTSWFM 351 Query: 366 ---------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 395 G++ + ++ AGA + ++L Sbjct: 352 TIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399
>PF06580#Sensor histidine kinase Length = 349 Score = 39.8 bits (93), Expect = 2e-05 Identities = 28/142 (19%), Positives = 56/142 (39%), Gaps = 11/142 (7%) Query: 365 LRPRQLDDLTLEQAIRSLMREMELEGRGIVSHLEWRIDESALSENQRVTLFRVCQEGLNN 424 LR ++L + + ++L L++ + + +V + Q + N Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPM-LVQTLVEN 266 Query: 425 IVKHA-----DASAVTLQGWQQDERLMLVIEDDGSGLPPGSGQ-QGFGLTGMRERVTALG 478 +KH + L+G + + + L +E+ GS + + G GL +RER+ L Sbjct: 267 GIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQMLY 326 Query: 479 G---TLHISCLHG-TRVSVSLP 496 G + +S G V +P Sbjct: 327 GTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 61.4 bits (149), Expect = 2e-13 Identities = 29/174 (16%), Positives = 59/174 (33%), Gaps = 20/174 (11%) Query: 2 ITVALIDDHLIVRSGFAQLLGLEPDLQVVAEFGSGREALAGLPGRGVQVCICDISMPDIS 61 T+ + DD +R+ Q L V + + + + D+ MPD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLELLSQLPK---GMATIMLSVHDSPALVEQALNAGARGFLSKRCSPDELIAAVHTVATG 118 +LL ++ K + +++S ++ +A GA +L K ELI + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR---- 117 Query: 119 GCYLTPDIAIKLASGRQDPLTKRERQVAEKLAQG---MAVKEIAAELGLSPKTV 169 A+ R L + + + + + A L + T+ Sbjct: 118 --------ALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTL 163
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 526 bits (1357), Expect = 0.0 Identities = 183/468 (39%), Positives = 253/468 (54%), Gaps = 35/468 (7%) Query: 8 ILVVDDDISHCTILQALLRGWGYNVALANSGRQALEQVREQVFDLVLCDVRMAEMDGIAT 67 ILV DDD + T+L L GY+V + ++ + DLV+ DV M + + Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 68 LKEIKALNPAIPVLIMTAYSSVETAVEALKTGALDYLIKPLDFDNLQATLEKALAHTHSI 127 L IK P +PVL+M+A ++ TA++A + GA DYL KP D L + +ALA Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125 Query: 128 DAETPAVTASQFGMVGKSPAMQHLLSEIALVAPSEATVLIHGDSGTGKELVARAIHASSA 187 ++ + +VG+S AMQ + +A + ++ T++I G+SGTGKELVARA+H Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGK 185 Query: 188 RSEKPLVTLNCAALNESLLESELFGHEKGAFTGADKRREGRFVEADGGTLFLDEIGDISP 247 R P V +N AA+ L+ESELFGHEKGAFTGA R GRF +A+GGTLFLDEIGD+ Sbjct: 186 RRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPM 245 Query: 248 MMQVRLLRAIQEREVQRVGSNQTISVDVRLIAATHRDLAAEVNAGRFRQDLYYRLNVVAI 307 Q RLLR +Q+ E VG I DVR++AAT++DL +N G FR+DLYYRLNVV + Sbjct: 246 DAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPL 305 Query: 308 EVPSLRQRREDIPLLAGHFLQRFAERNRKAVKGFTPQAMDLLIHYDWPGNIRELENAVER 367 +P LR R EDIP L HF+Q+ + VK F +A++L+ + WPGN+RELEN V R Sbjct: 306 RLPPLRDRAEDIPDLVRHFVQQAEKEGLD-VKRFDQEALELMKAHPWPGNVRELENLVRR 364 Query: 368 AVVLLTGEYISERELPLAIASTPIPLGQSQDIQP-------------------------- 401 L + I+ + + S + Sbjct: 365 LTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALP 424 Query: 402 --------LVEVEKEVILAALEKTGGNKTEAARQLGITRKTLLAKLSR 441 L E+E +ILAAL T GN+ +AA LG+ R TL K+ Sbjct: 425 PSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 30.7 bits (69), Expect = 0.001 Identities = 15/54 (27%), Positives = 20/54 (37%), Gaps = 5/54 (9%) Query: 78 IDPDVCGCGVGRMLVEHALSMAPE-----LTTNVNEQNEQAVGFYKKVGFKVTG 126 + D GVG L+ A+ A E L + N A FY K F + Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 32.3 bits (73), Expect = 0.004 Identities = 14/58 (24%), Positives = 23/58 (39%) Query: 289 ETSTPDLELARRFAQAIHAKYPGKLLAYNCSPSFNWQKNLDDKTIASFQQQLSDMGYK 346 ET+ PD+ L A P L Y + N D +T + + QL+++ Sbjct: 544 ETTKPDMTLKEALKIAFGFNEPNGNLQYQGKDITEFDFNFDQQTSQNIKNQLAELNAT 601
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 35.6 bits (82), Expect = 4e-04 Identities = 20/87 (22%), Positives = 42/87 (48%), Gaps = 3/87 (3%) Query: 279 VIGVMLSIFQQFVGINVVLYYAPEVFKTLGASTDIALLQTIIVGVINLTFTVLAIMT--- 335 +I ++ ++ VGI +++ P + + L S D+ I++ + L A + Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 336 VDKFGRKPLQIIGALGMAIGMFSLGTA 362 D+FGR+P+ ++ G A+ + TA Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATA 93
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 31.1 bits (70), Expect = 0.011 Identities = 22/124 (17%), Positives = 43/124 (34%), Gaps = 21/124 (16%) Query: 128 GDEWQLALSDGETGKNYLSDAFKFGGEQKLQLKETTAQPEGERANLRVITQNRQALSDIT 187 ++WQ+ T DA L+L T + L+ + A+ ++ Sbjct: 367 NNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPV---SDAIVNMD 423 Query: 188 AILPDGNKVMMSSLRQFSGTQPLYTLDGDGTLTNNQSGVKYRPNNQ--------IGFYQS 239 ++ D K+ M+S GD N Q+ + + N++ Y S Sbjct: 424 VLITDEAKIAMAS----------EEDAGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYAS 473 Query: 240 ITAD 243 + +D Sbjct: 474 LVSD 477
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 756 bits (1953), Expect = 0.0 Identities = 396/396 (100%), Positives = 396/396 (100%) Query: 1 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK 60 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK Sbjct: 1 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK 60 Query: 61 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW 120 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW Sbjct: 61 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW 120 Query: 121 DAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP 180 DAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP Sbjct: 121 DAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP 180 Query: 181 YFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE 240 YFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE Sbjct: 181 YFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE 240 Query: 241 AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKE 300 AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKE Sbjct: 241 AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKE 300 Query: 301 LAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIP 360 LAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIP Sbjct: 301 LAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIP 360 Query: 361 QMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396 QMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK Sbjct: 361 QMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 1098 bits (2841), Expect = 0.0 Identities = 873/878 (99%), Positives = 875/878 (99%) Query: 1 MSYLNLRLYQRNTQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQA 60 MSYLNLRLYQRNTQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQA Sbjct: 1 MSYLNLRLYQRNTQCLHIRKHRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQA 60 Query: 61 VADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLN 120 VADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLN Sbjct: 61 VADLSRFENGQELPPGTYRVDIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLN 120 Query: 121 TASVSGMNLLADDACVPLTSMIHDATAHLDVGQQRLNLTIPQAFMSNRARGYIPPELWDP 180 TASVSGMNLLADDACVPLTSMIHDATA LDVGQQRLNLTIPQAFMSNRARGYIPPELWDP Sbjct: 121 TASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDP 180 Query: 181 GINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDRSSGSK 240 GINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSD SSGSK Sbjct: 181 GINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSK 240 Query: 241 NKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGVQLASDDNMLPDSQRGFAPV 300 NKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRG QLASDDNMLPDSQRGFAPV Sbjct: 241 NKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPV 300 Query: 301 IHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTV 360 IHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTV Sbjct: 301 IHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTV 360 Query: 361 PYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRY 420 PYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRY Sbjct: 361 PYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRY 420 Query: 421 RAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYR 480 RAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYR Sbjct: 421 RAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYR 480 Query: 481 YSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRS 540 YSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR+ Sbjct: 481 YSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRT 540 Query: 541 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNI 600 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNI Sbjct: 541 STLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSLTKNAWQKGRDQMLALNVNI 600 Query: 601 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD 660 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD Sbjct: 601 PFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGD 660 Query: 661 GNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVL 720 GNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVL Sbjct: 661 GNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVL 720 Query: 721 VKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP 780 VKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP Sbjct: 721 VKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVP 780 Query: 781 TRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLA 840 TRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLA Sbjct: 781 TRGAIVRAEFKARVGIKLLMTLTHNNKPLPFGAMVTSESSQSSGIVADNGQVYLSGMPLA 840 Query: 841 GKVQVKWGEEENAHCIANYQLPPESQQQLLTQLSAECR 878 GKVQVKWGEEENAHC+ANYQLPPESQQQLLTQLSAECR Sbjct: 841 GKVQVKWGEEENAHCVANYQLPPESQQQLLTQLSAECR 878
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 33.1 bits (75), Expect = 4e-04 Identities = 30/158 (18%), Positives = 49/158 (31%), Gaps = 9/158 (5%) Query: 3 WCKRGYVLAAILALASATIQAADVTITVNGKVVAKPCTVSTTNATVDLGDLYSFSLMSAG 62 W R + A LA + +TI + VT VN + + + + G Sbjct: 258 WMGRLQYVGAYLAPSYSTINTSKVTGEVNFNHLTVGDHNAAQAGIIASNKTH------IG 311 Query: 63 AASAWHDVALELTNCPVG--TSRVTASFSGAADSTGYYKNQGTAQNIQLELQDDSGNTLN 120 W L + P G + S + Q ++QN + N+ Sbjct: 312 TLDLWQSAGLNIIAPPEGGYKDKPNDKPSNTTQNNAKNDKQESSQNNSNTQVINPPNSAQ 371 Query: 121 TGATKTVQVDDSSQSAHFPLQVRALTVNGGATQGTIQA 158 + QV D + V +N A GTI+ Sbjct: 372 KTEIQPTQVIDGPFAGGKNTVVNINRINTNA-DGTIRV 408
>SURFACELAYER#Lactobacillus surface layer protein signature. Length = 439 Score = 28.1 bits (62), Expect = 0.047 Identities = 19/79 (24%), Positives = 32/79 (40%), Gaps = 1/79 (1%) Query: 211 SQNLGYYLSGTTADAGNSIFTNTASFSPAQGVGVQLTRNGTIIPANNTVSLGAVGTSAVS 270 S+N G ++ +A+ N FT PA V V L ++G ++ + + + Sbjct: 133 SENAGKEITIGSAN-PNVTFTEKTGDQPASTVKVTLDQDGVAKLSSVQIKNVYAIDTTYN 191 Query: 271 LGLTANYARTGGQVTAGNV 289 + TG VT G V Sbjct: 192 SNVNFYDVTTGATVTTGAV 210
>PF06580#Sensor histidine kinase Length = 349 Score = 31.4 bits (71), Expect = 0.008 Identities = 10/49 (20%), Positives = 25/49 (51%) Query: 230 LVPLIPAIIMISTTIANIWLVKDTPAWEVVNFIGSSPIAMFIAMVVAFV 278 + +I ++ I +W V +T W ++ FI + P+A + + ++ + Sbjct: 73 MGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLALSII 121
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 29.2 bits (65), Expect = 0.015 Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 4/45 (8%) Query: 145 PLLVSHGIALGCLVSTILGLPAWAERRLRLRNCSISRVDYQESLW 189 P +V GIA G V T+ GL W ++ N D + +W Sbjct: 42 PAIVG-GIATGAAVGTVSGLLGWGLKQAEEAN---KTPDKPDKVW 82
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 84.9 bits (210), Expect = 3e-21 Identities = 34/139 (24%), Positives = 60/139 (43%) Query: 1 MQRETVWLVEDEQGIADTLVYMLQQEGFAVEVFERGLPVLDKARQLVPDVMILDVGLPDI 60 M T+ + +D+ I L L + G+ V + + D+++ DV +PD Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 SGFELCRQLLALHPALPVLFLTARSEEVDRLLGLEIGADDYVAKPFSPREVCARVRTLLR 120 + F+L ++ P LPVL ++A++ + + E GA DY+ KPF E+ + L Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 RVKKFSSPSPVIRIGHFEL 139 K+ S L Sbjct: 121 EPKRRPSKLEDDSQDGMPL 139
>PF06580#Sensor histidine kinase Length = 349 Score = 32.9 bits (75), Expect = 0.002 Identities = 41/177 (23%), Positives = 71/177 (40%), Gaps = 40/177 (22%) Query: 317 LENRQEVVLTAVDVAALFR---RVSEARTVQLAE--KNITLHVT--------PTEVNVAA 363 LE+ + ++ L R R S AR V LA+ + ++ + Sbjct: 187 LEDPTKAREMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQI 246 Query: 364 EPALLDQALGNLL-----DNA----IDFTPESGRITLSAEVDQEHVTLKVLDTGSGIPDY 414 PA++D + +L +N I P+ G+I L D VTL+V +TGS Sbjct: 247 NPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALK- 305 Query: 415 ALSRIFERFYSLPRANGQKSSGLGLAFVSE-VARLFNGEVTLR-NVQEGGVLASLRL 469 N ++S+G GL V E + L+ E ++ + ++G V A + + Sbjct: 306 ---------------NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 81.8 bits (202), Expect = 4e-20 Identities = 30/122 (24%), Positives = 60/122 (49%), Gaps = 1/122 (0%) Query: 1 MQTPHILIVEDELVTRNTLKSIFEAEGYDVFEATDGAEMHQILSEYDINLVIMDINLPGK 60 M IL+ +D+ R L GYDV ++ A + + ++ D +LV+ D+ +P + Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 NGLLLARELRE-QANVALMFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLS 119 N L +++ + ++ ++ ++ ++ + I E GA DY+ KPF+ EL L+ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 120 RT 121 Sbjct: 121 EP 122
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 29.4 bits (66), Expect = 0.016 Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 10/69 (14%) Query: 187 FISGTGFATDYRRLSGHALKGSEIIRLVEESDPVAELALRRYELRLAKSLAHVVNILDP- 245 +G ++D+R L A + D A+LAL + R+ K++ + Sbjct: 273 VYGISGISSDFRDLEDAAF---------KNGDKRAQLALNVFAYRVKKTIGSYAAAMGGV 323 Query: 246 DVIVLGGGM 254 DVIV G+ Sbjct: 324 DVIVFTAGI 332
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 51.0 bits (122), Expect = 5e-09 Identities = 75/356 (21%), Positives = 127/356 (35%), Gaps = 35/356 (9%) Query: 33 ILSLALGTFGLGMAEFGIMGVLTELAHNVGISIPAAGH---MISYYALGVVVGAPIIALF 89 + ++AL G+G+ IM VL L ++ S H +++ YAL AP++ Sbjct: 11 LSTVALDAVGIGL----IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 90 SSRYSLKHILLFLVALCVIGNAMFTLSSSYLMLAIGRLVSGFPHGAFFGVGAIVLSKIIK 149 S R+ + +LL +A + A+ + +L IGR+V+G GA + I Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIAD--IT 124 Query: 150 PGKVTAAVAGMVSGMTVANLLGIPLGTYLSQEFSWRYTFLLIAVFNIAVMASVYFWVPDI 209 G A G +S ++ P+ L FS F A N + F +P+ Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 210 RDEAKGKLREQ----------FHFLRSPAPWLI--FAATMFGNAGVFAWFSYVKPYMMFI 257 + LR + + A + F + G W + Sbjct: 185 HKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFG------E 238 Query: 258 SGFSETAMTFIMMLVGLGM---VLGNMLSGRVSGRYSPLRIAAVTDFIIVLALLMLFFCG 314 F A T + L G+ + M++G V+ R R + ++L F Sbjct: 239 DRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298 Query: 315 GMKITSLIFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAF--NLGSAVG 368 + I + G+ LQ +L + E G G +A +L S VG Sbjct: 299 RGWMAFPIMVLLASGGIG--MPALQAMLSRQV-DEERQGQLQGSLAALTSLTSIVG 351
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 39.4 bits (92), Expect = 7e-05 Identities = 34/199 (17%), Positives = 71/199 (35%), Gaps = 14/199 (7%) Query: 671 QQEAQSWQQRQNELTALQNRIQQLTPILETLPQSDDLPHSEETVALDNWRQVHEQCLALH 730 + + Q + Q R Q L+ +E + E + +V + Sbjct: 133 EADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192 Query: 731 SQQQTLQQQDVLAAQSLQKAQAQFDTAL--------QASVFDDQQAFLAALMDEQTLTQL 782 Q T Q Q +L K +A+ T L + V + ++L+ +Q + Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA-- 250 Query: 783 EQLKQNLENQRRQAQTLVTQTAETLAQHQQHRPDGLALTVTVEQIQQEL-AQTHQKLREN 841 K + Q + V + +Q +Q + L+ + + Q + KLR+ Sbjct: 251 ---KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQT 307 Query: 842 TTSQGEIRQQLKQDADNRQ 860 T + G + +L ++ + +Q Sbjct: 308 TDNIGLLTLELAKNEERQQ 326 Score = 39.0 bits (91), Expect = 1e-04 Identities = 25/204 (12%), Positives = 59/204 (28%), Gaps = 18/204 (8%) Query: 487 EARIKTLEAQRAQLQAGQPCPLCGSTSHPAVEAYQTLEPGVNQSRLLALENEVKKLGEEG 546 EA ++ Q + Q ++E + E + +E + L Sbjct: 133 EADTLKTQSSLLQARLEQ---TRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT- 188 Query: 547 AALRGQLDALTKQLQRDENEAQSLRQDEQALTQQWQAVTASLNITLQPQDDIQPWLDAQD 606 + ++ Q Q + E R + + + + DD L Q Sbjct: 189 SLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQA 248 Query: 607 -------EHERQL-RLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYALTLP 658 E E + +++ + Q+ +I+ +++ + Q L Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF------KNEILD 302 Query: 659 QEDEEESWLATRQQEAQSWQQRQN 682 + + + E ++RQ Sbjct: 303 KLRQTTDNIGLLTLELAKNEERQQ 326 Score = 36.7 bits (85), Expect = 5e-04 Identities = 27/229 (11%), Positives = 61/229 (26%), Gaps = 22/229 (9%) Query: 171 ELLEELTGTEIYGQISAMVFEQHKSARTELEKLQAQASGVALLTPEQVQSLTASLQVLTN 230 ++L +LT AR E + Q + + L +++ + Sbjct: 122 DVLLKLTALGAEADTLKTQ-SSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVS 180 Query: 231 EEKQLLTAQQQEQQSLNWLTRLDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPAR 290 EE+ L ++Q W + + + +++ LA + + Sbjct: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSR--VEKSRLD 238 Query: 291 NLRPHWERIAEHSAALAHIRQQIEEVNTRLQSTMALRASIRHHAAKQSAELQQQQQSLNT 350 + + A A+ + E L+ + I E Q Q Sbjct: 239 DFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ---- 294 Query: 351 WLQEHDRFRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNAL 399 + ++L + +L E++ A Sbjct: 295 -----LFKNEILDKLRQTTDN----------IGLLTLELAKNEERQQAS 328 Score = 32.5 bits (74), Expect = 0.009 Identities = 16/150 (10%), Positives = 42/150 (28%), Gaps = 5/150 (3%) Query: 731 SQQQTLQQQDVLAAQSLQKAQAQFDTA----LQASVFDDQQAFLAALMDEQTLTQLEQLK 786 + Q + A + Q + L D+ F +E+ L +K Sbjct: 134 ADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVS-EEEVLRLTSLIK 192 Query: 787 QNLENQRRQAQTLVTQTAETLAQHQQHRPDGLALTVTVEQIQQELAQTHQKLRENTTSQG 846 + + Q + A+ + L L + ++ Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKH 252 Query: 847 EIRQQLKQDADNRQQQQTLMQQIAQMTQQV 876 + +Q + + + + Q+ Q+ ++ Sbjct: 253 AVLEQENKYVEAVNELRVYKSQLEQIESEI 282 Score = 32.5 bits (74), Expect = 0.011 Identities = 16/138 (11%), Positives = 47/138 (34%), Gaps = 8/138 (5%) Query: 375 QTSDREHLRQWQQQLTHAEQKLNALAAITLTLTADEV------ASALAQHAEQRPLRQHL 428 + Q L A + ++ ++ +++ Q+ + + + Sbjct: 129 ALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLT 188 Query: 429 VALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKTICEQEA 488 + Q Q + Q ++ + E+ A +N + + +L D ++ ++A Sbjct: 189 SLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQA 248 Query: 489 --RIKTLEAQRAQLQAGQ 504 + LE + ++A Sbjct: 249 IAKHAVLEQENKYVEAVN 266
>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature. Length = 405 Score = 29.7 bits (66), Expect = 0.022 Identities = 13/70 (18%), Positives = 23/70 (32%), Gaps = 4/70 (5%) Query: 149 KQQHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIYIGTLDAFP 208 K+ ++ I ++Y + + + I T D + + I A Sbjct: 135 KEAQMMNEIAEFYAAPFKKTRAINEKEAFECI-YDSRTRSA--GKD-IVSVKINIDKAKK 190 Query: 209 AQNFPPADYI 218 N P DYI Sbjct: 191 ILNLPECDYI 200
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 94.9 bits (236), Expect = 1e-24 Identities = 33/149 (22%), Positives = 62/149 (41%), Gaps = 9/149 (6%) Query: 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGIQ 63 ILV +D+A IR ++ L + G+ + + + DL++ D ++P + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 FIKHLKRESMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRI 123 + +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I + Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120 Query: 124 SPMAVEEVIEMQGLSLDPTSHRVMAGEEP 152 E L D + G Sbjct: 121 -----EPKRRPSKLEDDSQDGMPLVGRSA 144
>PF06580#Sensor histidine kinase Length = 349 Score = 34.1 bits (78), Expect = 0.001 Identities = 19/105 (18%), Positives = 33/105 (31%), Gaps = 26/105 (24%) Query: 325 LVYNAVNH----TPEGTHITVRWQRVPHGAEFSVEDNGPGIAPEHIPRLTERFYRVDKAR 380 LV N + H P+G I ++ + VE+ G Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306 Query: 381 SRQTGGSGLGLAIVKHAVNH---HESRLNIESTVGKGTRFSFVIP 422 +G GL V+ + E+++ + GK +IP Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAM-VLIP 348
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 39.0 bits (91), Expect = 3e-05 Identities = 71/347 (20%), Positives = 135/347 (38%), Gaps = 20/347 (5%) Query: 62 KFLWSPLMDRYTPPFFGRRRGWLLATQILLLVAIAAMGFLEPGTQLRWMAALAVVIAFCS 121 +F +P++ + F RR LL + V A M W+ + ++A + Sbjct: 56 QFACAPVLGALSDRF--GRRPVLLVSLAGAAVDYAIMAT----APFLWVLYIGRIVAGIT 109 Query: 122 ASQDIVFDAWKTDVLPAEERGAGAAISVLGYRLGMLVSGGLALWLADKWLGWQGMYWLMA 181 + V A+ D+ +ER + GM+ L + ++ A Sbjct: 110 GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG--FSPHAPFFAAA 167 Query: 182 AL-LIPCIIATLLAPEP--TDTIPVPKTLEQAVVAPLRDFFGRNNAWLILLLIVLYKLGD 238 AL + + L PE + P+ + + + A L+ + ++ +G Sbjct: 168 ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQ 227 Query: 239 AFAMSLTTTFLIRGVGFDAGEVGVVNKTLGLLATIVGALYGGILMQRLSLFRALLIFGIL 298 A +L F +DA +G+ G+L ++ A+ G + RL RAL+ G++ Sbjct: 228 VPA-ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM-LGMI 285 Query: 299 QGASNAGYWLLSITDKHLYSMGAAVFFENLCGGMGTSAFVALLMTLCNKSFSATQFALLS 358 A GY LL+ + + V GG+G A A+L ++ L+ Sbjct: 286 --ADGTGYILLAFATRGWMAFPIMVLL--ASGGIGMPALQAMLSRQVDEERQGQLQGSLA 341 Query: 359 ALSAVGRVYVGPVAGWFVEAHGWSTF--YLFSVAAAVPGLILLLVCR 403 AL+++ + VGP+ + A +T+ + + AA+ L L + R Sbjct: 342 ALTSLTSI-VGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRR 387
>PF06291#Lambda prophage Bor protein Length = 102 Score = 26.5 bits (58), Expect = 0.027 Identities = 11/34 (32%), Positives = 18/34 (52%) Query: 3 KKILFPLVALFMLAGCAKPPTTIEVSPTITLPQQ 36 KK+LF ++ GCA+ T+ PT P++ Sbjct: 7 KKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKE 40
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.043 Identities = 16/73 (21%), Positives = 29/73 (39%), Gaps = 13/73 (17%) Query: 60 ERSALPTPHEIRNHLDDYVIGQEQAKKVLAVAVYNHYKRLRNGDTSNGVELGKSNILLIG 119 E P+ E + ++G+ A + +Y RL D +++ G Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQ----EIYRVLARLMQTD---------LTLMITG 167 Query: 120 PTGSGKTLLAETL 132 +G+GK L+A L Sbjct: 168 ESGTGKELVARAL 180
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 34.3 bits (78), Expect = 0.002 Identities = 34/133 (25%), Positives = 68/133 (51%), Gaps = 15/133 (11%) Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPD- 249 LE A +E + +L R +++ ++ S+ +Q++A ++L E + + Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344 Query: 250 ENEALKRKIDAAKMPKEAKEKAEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWNARSK 308 ++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ + Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397 Query: 309 VKKDLRQAQEILD 321 V+K L +A L Sbjct: 398 VEKALEEANSKLA 410
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 117 bits (294), Expect = 3e-38 Identities = 49/88 (55%), Positives = 67/88 (76%) Query: 2 NKSQLIDKIAAGADISKAAAGRALDAIIASVTESLKEGDDVALVGFGTFAVKERAARTGR 61 NK LI K+A +++K + A+DA+ ++V+ L +G+ V L+GFG F V+ERAAR GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEITIAAAKVPSFRAGKALKDAV 89 NPQTG+EI I A+KVP+F+AGKALKDAV Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90
>PF06580#Sensor histidine kinase Length = 349 Score = 29.1 bits (65), Expect = 0.044 Identities = 29/183 (15%), Positives = 67/183 (36%), Gaps = 34/183 (18%) Query: 306 EELTRMAKMVSDML-FLAQADNNQLIPEKKMLNLADEVGKVFDFFEALAEDR-GVELRFV 363 + M +S+++ + + N + + LADE+ V + + LA + L+F Sbjct: 191 TKAREMLTSLSELMRYSLRYSNARQVS------LADELTVVDSYLQ-LASIQFEDRLQFE 243 Query: 364 GDECQVAGDPLMLRRALSNLLSNALRY----TPPGEAIVVRCQTVDHQVQVTVENPGTPI 419 D + + L+ N +++ P G I+++ + V + VEN G+ Sbjct: 244 NQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA 303 Query: 420 APEHLPRLFDRFYRVDPSRQRKGEGSGIGLAIVK---SIVVAHKGTVAVTSDARGTRFVI 476 E +G GL V+ ++ + + ++ ++ Sbjct: 304 LKN------------------TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMV 345 Query: 477 MLP 479 ++P Sbjct: 346 LIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 85.7 bits (212), Expect = 2e-21 Identities = 35/117 (29%), Positives = 62/117 (52%) Query: 2 KLLIVEDEKKTGEYLTKGLTEAGFVVDLADNGLNGYHLAMTGDYDLIILDIMLPDVNGWD 61 +L+ +D+ L + L+ AG+ V + N + GD DL++ D+++PD N +D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 IVRMLRSANKGMPILLLTALGTIEHRVKGLELGADDYLVKPFAFAELLARVRTLLRR 118 ++ ++ A +P+L+++A T +K E GA DYL KPF EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 38.7 bits (90), Expect = 4e-05 Identities = 25/189 (13%), Positives = 60/189 (31%), Gaps = 13/189 (6%) Query: 254 QAQTVNSDSLQSVKLPA-GLSSQILLQRPDIMEAEHALM-----AANANIGAARAAFFPS 307 + +S + +K + +I+++ + + L+ A A+ ++ Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQS----- 141 Query: 308 ISLTSGISTASSDLSSLFNASSGMWNFIPKIEIPIFNAGRNQANLDIAEIRQQQSVVNYE 367 SL + + + + P F + L + + ++Q Sbjct: 142 -SLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQN 200 Query: 368 QKIQNAFKEVADALALRQSLNDQISAQQRYLASLQITLQRARALYQHGAVSYLEVLDAER 427 QK Q + A R ++ +I+ + + L +L A++ VL+ E Sbjct: 201 QKYQ-KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQEN 259 Query: 428 SLFATRQTL 436 L Sbjct: 260 KYVEAVNEL 268
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 698 bits (1802), Expect = 0.0 Identities = 214/1059 (20%), Positives = 441/1059 (41%), Gaps = 54/1059 (5%) Query: 1 MIEWIIRRSVANRFLVLMGALFLSIWGTWTIINTPVDALPDLSDVQVIIKTSYPGQAPQI 60 M + IRR + A+ L + G I+ PV P ++ V + +YPG Q Sbjct: 1 MANFFIRR----PIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQT 56 Query: 61 VENQVTYPLTTTMLSVPGAKTVRGFSQ-FGDSYVYVIFEDGTDPYWARSRVLEYLNQVQG 119 V++ VT + M + + S G + + F+ GTDP A+ +V L Sbjct: 57 VQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116 Query: 120 KLPAGVSAELGP-DATGVGWIYEYALVDRSGKHDLADLRSLQDWFLKYELKTIPDVAEVA 178 LP V + + + ++ V + D+ +K L + V +V Sbjct: 117 LLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQ 176 Query: 179 SVGGVVKEYQVVIDPQRLAQYGISLAEVKSALDASNQEAGGSSIELA------EAEYMVR 232 G ++ +D L +Y ++ +V + L N + + + + Sbjct: 177 LFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235 Query: 233 ASGYLQTLDDFNHIVLKASENGVPVYLRDVAKVQIGPEMRRGIAELNGEGEVAGGVVILR 292 A + ++F + L+ + +G V L+DVA+V++G E IA +NG+ AG + L Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLA 294 Query: 293 SGKNAREVIAAVKDKLETLKSSLPEGVEIVTTYDRSQLIDRAIDNLSGKLLEEFIVVAVV 352 +G NA + A+K KL L+ P+G++++ YD + + +I + L E ++V +V Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354 Query: 353 CALFLWHVRSALVAIISLPLGLCIAFIVMHFQGLNANIMSLGGIAIAVGAMVDAAIVMIE 412 LFL ++R+ L+ I++P+ L F ++ G + N +++ G+ +A+G +VD AIV++E Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414 Query: 413 NAHKRLEEWQHQHPDATLDNKTRWQVITDASVEVGPALFISLLIITLSFIPIFTLEGQEG 472 N + + E D + + ++ AL ++++ FIP+ G G Sbjct: 415 NVERVMME----------DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 473 RLFGPLAFTKTYAMAGAALLAIVVIPILMGYWIRGKIPPESSNPLNRF----------LI 522 ++ + T AMA + L+A+++ P L ++ + E F + Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKP-VSAEHHENKGGFFGWFNTTFDHSV 523 Query: 523 RVYHPLLLKVLHWPKTTLLVAALSVLTVLWPLNKVGGEFLPQINEGDLLYMPSTLPGISA 582 Y + K+L LL+ AL V ++ ++ FLP+ ++G L M G + Sbjct: 524 NHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQ 583 Query: 583 AEAASMLQKTDKLIM--SVPEVARVFGKTGKAETATDSAPLEMVETTIQLKPQDQW-RPG 639 +L + + V VF G + + + LKP ++ Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQN---AGMAFVSLKPWEERNGDE 640 Query: 640 MTMDKIIEELDNTVRLPGLANLWVPPIRNRIDMLSTGIKSPIGIKVSGTVLADI-DAMAE 698 + + +I + + + +++ + I +G + A + Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQ 700 Query: 699 QIEEVARTVPGVASALAERLEGGRYINVEINREKAARYGMTVADVQLFVTSAVGGAMVGE 758 + A+ + S LE +E+++EKA G++++D+ +++A+GG V + Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760 Query: 759 TVEGIARYPINLRYPQSWRDSPQALRQLPILTPMKQQITLADVADVKVSTGPSMLKTENA 818 ++ + ++ +R P+ + +L + + + + + G L+ N Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNG 820 Query: 819 RPTSWIYIDARDRDMVSVVHDLQKAIAEKVQLKPGTSVAFSGQFELLERANHKLKLMVPM 878 P+ I +A L + +A K L G ++G + ++ +V + Sbjct: 821 LPSMEIQGEAAPGTSSGDAMALMENLASK--LPAGIGYDWTGMSYQERLSGNQAPALVAI 878 Query: 879 TLMIIFVLLYLAFRRVGEALLIISSVPFALVGGIWLLWWMGFHLSVATGTGFIALAGVAA 938 + +++F+ L + + ++ VP +VG + V G + G++A Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938 Query: 939 EFGVVMLMYLRHAIEAEPSLNNPQTFSEQKLDEALYHGAVLRVRPKAMTVAVIIAGLLPI 998 + ++++ + + +E E + + EA +R+RP MT I G+LP+ Sbjct: 939 KNAILIVEFAKDLMEKE----------GKGVVEATLMAVRMRLRPILMTSLAFILGVLPL 988 Query: 999 LWGTGAGSEVMSRIAAPMIGGMITAPLLSLFIIPAAYKL 1037 GAGS + + ++GGM++A LL++F +P + + Sbjct: 989 AISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 47.2 bits (112), Expect = 3e-08 Identities = 36/146 (24%), Positives = 63/146 (43%), Gaps = 5/146 (3%) Query: 197 AREREQGTLDQLLVSPLTTWQIFIGKAVPALIVATFQATIVLAIGIWAYQIPFAGSLALF 256 R Q T + +L + L I +G+ A A IG+ A + + L+L Sbjct: 92 GRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGA---GIGVVAAALGYTQWLSLL 148 Query: 257 YFTMVI--YGLSLVGFGLLISSLCSTQQQAFIGVFVFMMPAILLSGYVSPVENMPVWLQN 314 Y VI GL+ G+++++L + + + P + LSG V PV+ +P+ Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 315 LTWINPIRHFTDITKQIYLKDASLDI 340 P+ H D+ + I L +D+ Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDV 234
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.012 Identities = 20/90 (22%), Positives = 28/90 (31%), Gaps = 21/90 (23%) Query: 293 TPRFEDAFIDLLGGAGTSESPLGAILHTVEGTPGETVIEAKELTKKFGDFAATDHVNFAV 352 PR E + +LG P + + + K HV + Sbjct: 547 VPRLEKWLVHVLGKTPDDYKP-------------RRLRYLQLVGKYI----LMGHVARVM 589 Query: 353 KRGEIFG----LLGPNGAGKSTTFKMMCGL 378 + G F L G G GKST + GL Sbjct: 590 EPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619 Score = 29.7 bits (66), Expect = 0.046 Identities = 11/23 (47%), Positives = 13/23 (56%) Query: 34 YVTGLVGPDGAGKTTLMRMLAGL 56 Y L G G GK+TL+ L GL Sbjct: 597 YSVVLEGTGGIGKSTLINTLVGL 619
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 62.5 bits (152), Expect = 6e-13 Identities = 42/259 (16%), Positives = 92/259 (35%), Gaps = 25/259 (9%) Query: 82 ALMQAKAGVSVAQAQYDLMLAGYRDEEIAQAAAAVKQAQAAYDYAQNFYNRQQGLWKSRT 141 Q + + +A+ +LA E + + + + L + Sbjct: 201 QKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENK 260 Query: 142 ISA--NDLENARSSRDQAQATLKSAQDKLRQYRSGNREQ---DIAQAKASLEQAQAQLAQ 196 N+L +S +Q ++ + SA+++ + + + + Q ++ +LA+ Sbjct: 261 YVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAK 320 Query: 197 AELNLQDSTLIAPSDGTLLTRAV-EPGTVLNEGGTVFTVSLT-RPVWVRAYVDERNLDQA 254 E Q S + AP + V G V+ T+ + + V A V +++ Sbjct: 321 NEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFI 380 Query: 255 QPGRKVLLYTDGRPDKPYH---GQIGFVSPTAEFTPKTVETPDLRTDLVYRLRIVVT--- 308 G+ ++ + P Y G++ ++ A D R LV+ + I + Sbjct: 381 NVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVFNVIISIEENC 432 Query: 309 ----DADDALRQGMPVTVQ 323 + + L GM VT + Sbjct: 433 LSTGNKNIPLSSGMAVTAE 451
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 72.7 bits (178), Expect = 7e-18 Identities = 33/214 (15%), Positives = 77/214 (35%), Gaps = 17/214 (7%) Query: 9 KGEQAKKQLIAAALAQFGEYGMNATT-REIAAQAGQNIAAITYYFGSKEDLYLACAQWIA 67 + ++ ++ ++ AL F + G+++T+ EIA AG AI ++F K DL+ + Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67 Query: 68 DFIGEQFRPHAEEAERLFAQPQPDRAAIRELILRACRNMIKLLTQDDTVNLSKFISREQL 127 IGE E + P + +RE+++ + + + + + F E + Sbjct: 68 SNIGELEL---EYQAKFPGDP---LSVLREILIHVLESTVTEERRRLLMEII-FHKCEFV 120 Query: 128 SPTAAYHLVHEQVISPLHSHLTRLIAAWTGCDANDTRMILHTHALIGEILAFRLGKETIL 187 A + + + + + +A L T + + G Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKH--CIEAKMLPADLMTRRAAIIMRGYISG----- 173 Query: 188 LRTGWTAFDEEKTELINQTVTCHIDLILQGLSQR 221 L W + + + ++ ++L+ Sbjct: 174 LMENWLFAPQSFD--LKKEARDYVAILLEMYLLC 205
>SECA#SecA protein signature. Length = 901 Score = 29.8 bits (67), Expect = 0.025 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 4/67 (5%) Query: 246 QQVLVFTRTKHGANHLAEQLNKDGIRSAAIHG-NKSQGARTRALADFKSGDIRVLVATDI 304 Q VLV T + + ++ +L K GI+ ++ + A A A + + V +AT++ Sbjct: 450 QPVLVGTISIEKSELVSNELTKAGIKHNVLNAKFHANEAAIVAQAGYPAA---VTIATNM 506 Query: 305 AARGLDI 311 A RG DI Sbjct: 507 AGRGTDI 513
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 43.2 bits (102), Expect = 8e-07 Identities = 41/201 (20%), Positives = 64/201 (31%), Gaps = 34/201 (16%) Query: 16 AFLFLFAPTAFAAEQTVEAPSVDARAW----------ILMDYASGKVLAEGNADEKLDPA 65 + L A A P + I MD ASG+ L ADE+ Sbjct: 7 CIISLLATLPLAV-HASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMM 65 Query: 66 SLTKIMTSYVVGQALKADKIKLTDMVTVGKDAWATGNPALRGSSVMFLKPGDQVSVADLN 125 S K++ V + A +L + + +P V D ++V +L Sbjct: 66 STFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSP------VSEKHLADGMTVGELC 119 Query: 126 KGVIIQSGNDACIALADYVAGSQESFIGLMNGYAKKLGLTNTT---FQTVHGLDAPGQF- 181 I S N A L V G + + +++G T ++T PG Sbjct: 120 AAAITMSDNSAANLLLATVGGPAG-----LTAFLRQIGDNVTRLDRWETELNEALPGDAR 174 Query: 182 --STARDMA------LLGKAL 194 +T MA L + L Sbjct: 175 DTTTPASMAATLRKLLTSQRL 195
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 39.5 bits (92), Expect = 2e-05 Identities = 62/356 (17%), Positives = 129/356 (36%), Gaps = 51/356 (14%) Query: 48 QAGIDWVPTSMTAYLAGGMFLQWLLGPLSDRIGRRPVMLAGVVWFIITCLAILLAQNIEQ 107 A +WV T+ + G + G LSD++G + ++L G++ + + + Sbjct: 48 PASTNWVNTAFMLTFSIGTAV---YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFS 104 Query: 108 FTLL-RFLQGISLCFIGAVGYAAIQESFEEAVCIKITALMANVALIAPLLGPLVGAAWIH 166 ++ RF+QG A+ + + K L+ ++ + +GP +G H Sbjct: 105 LLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAH 164 Query: 167 VLPWEGMFVLFAALAAISFFGLQQAMPETATRIGE------------------------- 201 + W +L + I+ L + + + G Sbjct: 165 YIHW-SYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTSYSI 223 Query: 202 ---KLSLKELGRDYKLVL------------KNGRFVAGALALGFVSLPLLAWIAQSPIII 246 +S+ K + KN F+ G L G + + +++ P ++ Sbjct: 224 SFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMM 283 Query: 247 ITGEQLSSYEYG-LLQVPIFGALIAGNLLLARLTSRRTVRSLIIMGGWPIIIGLLVAAAA 305 QLS+ E G ++ P ++I + L RR ++ +G + + L A+ Sbjct: 284 KDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTAS-- 341 Query: 306 TVISSHAYLWMTAGLSIYAFGIGLANAGLVRLTLFASDMSKGTVSAAMGMLQMLIF 361 + +MT + ++ G GL+ V T+ +S + + A M +L F Sbjct: 342 -FLLETTSWFMTIII-VFVLG-GLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSF 394
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 0.010 Identities = 9/18 (50%), Positives = 12/18 (66%) Query: 31 LVLLGPSGAGKSSLLRVL 48 +VL G G GKS+L+ L Sbjct: 599 VVLEGTGGIGKSTLINTL 616
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 29.1 bits (65), Expect = 0.017 Identities = 20/54 (37%), Positives = 27/54 (50%), Gaps = 9/54 (16%) Query: 2 RRVFWLIAAALLLAGCAGEKGIVEKEGYQLDTRRQAQAAYPRIKVLVIHYTADD 55 R+V L+ ALL AG A I K+G +LD Y ++ L HY +DD Sbjct: 3 RKVLALVIPALLAAGAAHAAEIYNKDGNKLDL-------YGKVDGL--HYFSDD 47
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 75.2 bits (185), Expect = 2e-17 Identities = 70/363 (19%), Positives = 123/363 (33%), Gaps = 65/363 (17%) Query: 1 MKVLVTGATSGLGRNAVEFLCQKGISVRA---------TGRNEAMGKLLEKMGAEFVPAD 51 MK LVTGA +G + + L + G V +A +LL + G +F D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 52 LTELVSSQAKVMLAGIDTLWHCS-------SFTSPWGTQQAFDLANVRATRRLGEWAVAW 104 L + + ++ S +P A+ +N+ + E Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPH----AYADSNLTGFLNILEGCRHN 116 Query: 105 GVRNFIHISSPSLYFDYHHHRDIKEDFRPHRFANEFARSKAASEEVINMLSQANPQTRFT 164 +++ ++ SS S+Y + D + +A +K A+E + + S T Sbjct: 117 KIQHLLYASSSSVYGL-NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSH-LYGLPAT 174 Query: 165 ILRPQSLFGPHDK--VFIPRLAHMMHHYGSILLPHGGSALVDMTYYENAVHAMWLASQEA 222 LR +++GP + + + + M SI + + G D TY ++ A+ Sbjct: 175 GLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVI 234 Query: 223 CDKLPS--------------GRVYNITNGEHRTLRSIVQKLIDELNIDCRIRSVPYPMLD 268 RVYNI N L +Q L D L I+ + +P D Sbjct: 235 PHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGD 294 Query: 269 MIARSMERLGRKSAKEPPLTHYGVSKLNFDFTLDITRAQEELGYQPVITLDEGIEKTAAW 328 + T D E +G+ P T+ +G++ W Sbjct: 295 V----------------LETS-----------ADTKALYEVIGFTPETTVKDGVKNFVNW 327 Query: 329 LRD 331 RD Sbjct: 328 YRD 330
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 55.6 bits (134), Expect = 1e-10 Identities = 29/125 (23%), Positives = 52/125 (41%), Gaps = 17/125 (13%) Query: 4 RILVLGASGYIGQHLVRTLSQQGHQILA---------AARHVDRLAKLQLANVSCHKVDL 54 + LV GA+G+IG H+ + L + GHQ++ + RL L HK+DL Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 55 SWPDNLPALLQD--IDTVYFLVH------SMGEGGDFIAQERQVALNVRDALREVPVKQL 106 + + + L + V+ H S+ + LN+ + R ++ L Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQHL 121 Query: 107 IFLSS 111 ++ SS Sbjct: 122 LYASS 126
>CHLAMIDIAOMP#Chlamydia major outer membrane protein signature. Length = 393 Score = 28.4 bits (63), Expect = 0.044 Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 8/67 (11%) Query: 137 GVNGDAVDPKSVTSWADL------WKPEYKGSLLLTDDAREVFQMALRKLGYSGNTTDPK 190 G GD DP T+W D + ++ +L D + FQM + +GN T P Sbjct: 42 GFGGDPCDP--CTTWCDAISMRMGYYGDFVFDRVLKTDVNKEFQMGDKPTSTTGNATAPT 99 Query: 191 EIEAAYN 197 + A N Sbjct: 100 TLTAREN 106
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 0.017 Identities = 10/36 (27%), Positives = 19/36 (52%), Gaps = 1/36 (2%) Query: 46 LTLLGPSGCGKTTVLRLIAGLE-TVDSGRIMLDNED 80 + L G G GK+T++ + GL+ D+ + +D Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKD 634
>PF06580#Sensor histidine kinase Length = 349 Score = 28.7 bits (64), Expect = 0.048 Identities = 11/69 (15%), Positives = 22/69 (31%), Gaps = 20/69 (28%) Query: 389 NACKYCLE------FVEISARQTDEHLYIVVEDDGPGIPLSKREVIFDRGQRVDTLRPGQ 442 N K+ + + + + + + + VE+ G + +E Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE--------------ST 311 Query: 443 GVGLAVARE 451 G GL RE Sbjct: 312 GTGLQNVRE 320
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 86.8 bits (215), Expect = 5e-22 Identities = 31/124 (25%), Positives = 62/124 (50%) Query: 2 RVLVVEDNALLRHHLKVQIQDAGHQVDDAEDAKEADYYLNEHLPDIAIVDLGLPDEDGLS 61 +LV +D+A +R L + AG+ V +A ++ D+ + D+ +PDE+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 LIRRWRSNDVSLPILVLTARESWQDKVEVLSAGADDYVTKPFHIEEVMARMQALMRRNSG 121 L+ R + LP+LV++A+ ++ ++ GA DY+ KPF + E++ + + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 LASQ 125 S+ Sbjct: 125 RPSK 128
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 143 bits (362), Expect = 2e-39 Identities = 62/206 (30%), Positives = 102/206 (49%), Gaps = 1/206 (0%) Query: 258 GKAFYYQPVLCTVQAKSTLTVEEEQDRLRQAIDFTLLDLMTLTAKAEASGLDDIAAIFSG 317 KAF + ++ S V E ++L A++ + +L + + EAS D A IF+ Sbjct: 17 AKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAA 76 Query: 318 HHTLLDDPELLAAASELLQHEHCTAEYAWQQVLKELSQQYQQLDDEYLQARYIDVDDLLH 377 H +LDDPEL+ +++E AEYA ++V ++ +D+EY++ R D+ D+ Sbjct: 77 HLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEYMKERAADIRDVSK 136 Query: 378 RTLVHLT-QTKEELPQFNSPTILLAENIYPSTVLQLDPAVVKGICLSAGSPVSHSALIAR 436 R L HL L T+++AE++ PS QL+ VKG G SHSA+++R Sbjct: 137 RVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHSAIMSR 196 Query: 437 ELGIGWICQQGEKLYAIQPEETLTLD 462 L I + E IQ + + +D Sbjct: 197 SLEIPAVVGTKEVTEKIQHGDMVIVD 222
>adhesinmafb#Neisseria meningitidis: adhesin MafB signature. Length = 467 Score = 27.3 bits (60), Expect = 0.043 Identities = 10/47 (21%), Positives = 25/47 (53%) Query: 138 VESLRQSSEQNLSVPVALEAASRIAESAAQSTITMQARKGRASYLGE 184 E++ + ++N + +EA +A +A + + A+ G+A+ G+ Sbjct: 293 REAVDRWIQENPNAAETVEAVFNVAAAAKVAKLAKAAKPGKAAVSGD 339
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 244 bits (625), Expect = 7e-76 Identities = 91/363 (25%), Positives = 155/363 (42%), Gaps = 33/363 (9%) Query: 308 QMRQLMTSQLGKVSHTFAHMPQDDPQTRRLIHFGRQAARSSFPVLLCGEEGVGKALLSQA 367 + S+L S + + + + ++ +++ GE G GK L+++A Sbjct: 120 AEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179 Query: 368 IHNESERAAGPYIAVNCELYGDAALAEEFIG---GDRTDNENGRLSRLELAHGGTLFLEK 424 +H+ +R GP++A+N + E G G T + R E A GGTLFL++ Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239 Query: 425 IEYLAVELQSALLQVIKQGVITRLDARRLIPIDVKVIATTTADLAMLVEQNRFSRQLYYA 484 I + ++ Q+ LL+V++QG T + R I DV+++A T DL + Q F LYY Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299 Query: 485 LHAFEITIPPLRMRRGSIPALVNNKLRSLEKRFSTRLKIDDDALARLVSCAWPGNDFELY 544 L+ + +PPLR R IP LV + ++ EK + D +AL + + WPGN EL Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRELE 359 Query: 545 SVIENLALSSDNGRIRVSDLPEHLFTEQATDDVSATRLSTS------------------- 585 +++ L I + L +E + + Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419 Query: 586 -----------LSFAEVEKEAIINAAQVTGGRIQEMSALLGIGRTTLWRKMKQHGIDAGQ 634 AE+E I+ A T G + + LLG+ R TL +K+++ G+ + Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYR 479 Query: 635 FKR 637 R Sbjct: 480 SSR 482
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 214 bits (547), Expect = 5e-60 Identities = 243/984 (24%), Positives = 399/984 (40%), Gaps = 119/984 (12%) Query: 14 RLAELKIRSPSIQLIKFGAIGLNAIIFSPLLIAADTGSQYGTNITINDGDRI---TGDTA 70 + A L+ + ++ L GA ++ I Q+G +I +D + +G T Sbjct: 10 KAAPLRRTTLAMALGALGAAPAAHADWNNQSIVKTGERQHGIHIQGSDPGGVRTASGTTI 69 Query: 71 DPSGN-LYGVMTPAGNTPGNINLGNDVTVN---VNDASGYAKGIIIQGKNSSLTANRLTV 126 SG G++ N + N + ++D + K L A+ T+ Sbjct: 70 KVSGRQAQGILLE--NPAAELQFRNGSVTSSGQLSDDGIRRFLGTVTVKAGKLVADHATL 127 Query: 127 DVVSQT---SAIGINLIGDYTHADLGTGSTIKSNDDGIIIGHSSTLTATQFTIENSNGIG 183 V T I + + G+ A + ST++ G+ I + +T + I + G+ Sbjct: 128 ANVGDTWDDDGIALYVAGEQAQASIAD-STLQGAG-GVQIERGANVTVQRSAIVD-GGLH 184 Query: 184 LTINDYGTSVDLGSGSKITTDGS-TGVYIGGLNGNNANGAARFTATDLTID---VQGYSA 239 + DL + D + T V G + A++LT+D + G A Sbjct: 185 IGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPA----AVSVLGASELTLDGGHITGGRA 240 Query: 240 MGINVQKNSVVDLGTNSTIKTNGDNAHGLWSFGQVSANAL-------TVDVTGAAANGVE 292 G+ + +VV L +TI+ A G G V A+ GV+ Sbjct: 241 AGVAAMQGAVVHL-QRATIRRGDAPAGGAVPGGAVPGGAVPGGFGPGGFGPVLDGWYGVD 299 Query: 293 VRGGTTTIGADSHISSAQGGGLVTSGSDATINFSGTAAQRNSIFSGGSYGASAQTATAVV 352 V G + + A S + + + G + G A + SG + +G +T A Sbjct: 300 VSGSSVEL-AQSIVEAPELGAAIRVGRGARVTVSGGSLS-------APHGNVIETGGARR 351 Query: 353 NM-QNTDITVD-RNGSLALGLWALSGGRITGDSLAITGAAGARGIYAMTNSQIDLTSDLV 410 Q +++ + G+ A G L L +TG A A+G T + + Sbjct: 352 FAPQAAPLSITLQAGAHAQGKALLYRVLPEPVKLTLTGGADAQGDIVATELPSIPGTSI- 410 Query: 411 IDMSTPDQMAIATQHDDGYAASRINASGRMLINGSVLSKGGLINLDMHPGSVWTGSSLSD 470 P +A+A+ + WTG++ Sbjct: 411 ----GPLDVALAS------------------------------------QARWTGAT--R 428 Query: 471 NVNGGKLDVAMNNSVWNVTSNSNLDTLAL-SHSTVDFASHGSTAGTFATLNVENLSGNST 529 V+ +D N+ W +T NSN+ L L S +VDF + AG F L V L+G+ Sbjct: 429 AVDSLSID----NATWVMTDNSNVGALRLASDGSVDFQQ-PAEAGRFKVLTVNTLAGSGL 483 Query: 530 FIMRADVVGEGNGVNNKGDLLNISGSSAGNHVLAIRNQGSEATTGNEVLTVVKTTDGAAS 589 F M D L + ++G H L +RN GSE + N +L V AA+ Sbjct: 484 FRMNV------FADLGLSDKLVVMQDASGQHRLWVRNSGSEPASANTLLLVQTPLGSAAT 537 Query: 590 FSASS---QVELGGYLYDVRKNG-TNWELYASGTVPEPTPNPEPTPAPAQPPIVNPDPTP 645 F+ ++ +V++G Y Y + NG W L + P P P P+P P P QPP P+ Sbjct: 538 FTLANKDGKVDIGTYRYRLAANGNGQWSLVGAKAPPAPKPAPQPGPQPPQPPQPQPEA-- 595 Query: 646 EPDPTPNPTPTPKPTTTADAGGNYLNVGYL--LNYVENRTLMQRMGDLRNQSKDGNIWLR 703 P P P + + A+A N VG L Y E+ L +R+G+LR G W R Sbjct: 596 ---PAPQPPAGRELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRLNPDAGGAWGR 652 Query: 704 SYG--GSLDSFASGKLSGFDMGYSGIQFGGDKRLSDVM-PLYVGLYIGSTHASPDYSG-G 759 + LD+ A + FD +G + G D ++ ++G G T ++G G Sbjct: 653 GFAQRQQLDNRAGRR---FDQKVAGFELGADHAVAVAGGRWHLGGLAGYTRGDRGFTGDG 709 Query: 760 DGTARSDYMGMYASYMAQNGFYSDLVIKASRQKNSFHVLDSQNNGVNANGTANGMSISLE 819 G S ++G YA+Y+A +GFY D ++ASR +N F V S V +G+ SLE Sbjct: 710 GGHTDSVHVGGYATYIADSGFYLDATLRASRLENDFKVAGSDGYAVKGKYRTHGVGASLE 769 Query: 820 AGQRFNLSPTGYGFYIEPQTQLTYSHQNEMAMKASNGLNIHLNHYESLLGRASMILGYDI 879 AG+RF + G+++EPQ +L A +A+NGL + S+LGR + +G I Sbjct: 770 AGRRFTHAD---GWFLEPQAELAVFRAGGGAYRAANGLRVRDEGGSSVLGRLGLEVGKRI 826 Query: 880 T-AGNSQLNVYVKTGAIREFSGDTEYLLNNSREKYSFKGNGWNNGVGVSAQYNKQHTFYL 938 AG Q+ Y+K ++EF G N + +G G+G++A + H+ Y Sbjct: 827 ELAGGRQVQPYIKASVLQEFDGAGTVHTNGIAHRTELRGTRAELGLGMAAALGRGHSLYA 886 Query: 939 EADYTQGNLFDQK-QVNGGYRFSF 961 +Y++G + GYR+S+ Sbjct: 887 SYEYSKGPKLAMPWTFHAGYRYSW 910
>INTIMIN#Intimin signature. Length = 939 Score = 258 bits (660), Expect = 3e-79 Identities = 120/378 (31%), Positives = 196/378 (51%), Gaps = 21/378 (5%) Query: 79 GEQAKAFALGKVRDALSQQVNQHVESWLSPWGNASVDVKVDNEGHFTGSRGSWFVPLQDN 138 G+ AK ALG + Q + +++WL +G A V+++ N F GS + +P D+ Sbjct: 184 GDYAKDTALGIAGN----QASSQLQAWLQHYGTAEVNLQSGNN--FDGSSLDFLLPFYDS 237 Query: 139 DRYLTWSQLGLTQQDDGLVSNVGVGQRWARGNWLVGYNTFYDNLLDENLQRAGFGAEAWG 198 ++ L + Q+G D +N+G GQR+ ++GYN F D + R G G E W Sbjct: 238 EKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDNTRLGIGGEYWR 297 Query: 199 EYLRLSANFYQPFAAWHE--QTATQEQRMARGYDLTARMRMPFYQHLNTSVSVEQYFGDR 256 +Y + S N Y + WHE ++R A G+D+ +P Y L + EQY+GD Sbjct: 298 DYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGAKLMYEQYYGDN 357 Query: 257 VDLFNSGTGYHNPVALSLGLNYTPVPLVTVTAQHKQGESGENQNNLGLNLNYRFGVPLKK 316 V LFNS NP A ++G+NYTP+PLVT+ ++ G EN + Y+F P + Sbjct: 358 VALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQFRYQFDKPWSQ 417 Query: 317 QLSAGEVAESQSLRGSRYDNPQRNNLPTLEYRQRKTLTVFLATPPWDLKPGETVPLKLQI 376 Q+ V E ++L GSRYD QRNN LEY+++ L++ + + T ++L + Sbjct: 418 QIEPQYVNELRTLSGSRYDLVQRNNNIILEYKKQDILSLNI-PHDINGTERSTQKIQLIV 476 Query: 377 RSRYGIRQLIWQGDTQILS-----LTPGAQANSAEGWTLIMPDWQNGEGASNHWRLSVVV 431 +S+YG+ +++W D+ + S G+Q SA+ + I+P + +G SN ++++ Sbjct: 477 KSKYGLDRIVWD-DSALRSQGGQIQHSGSQ--SAQDYQAILPAYV--QGGSNVYKVTARA 531 Query: 432 EDNQGQRVSSNEITLTLV 449 D G SSN + LT+ Sbjct: 532 YDRNGN--SSNNVLLTIT 547
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 73.7 bits (181), Expect = 2e-17 Identities = 32/117 (27%), Positives = 56/117 (47%), Gaps = 2/117 (1%) Query: 7 ATILLIDDHPMLRTGVKQLISMAPDITVVGEASNGEQGIELAESLDPDLILLDLNMPGMN 66 ATIL+ DD +RT + Q +S A + SN + D DL++ D+ MP N Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRI--TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 67 GLETLDKLREKSLSGRIVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALHQA 123 + L ++++ ++V S N + A ++GA YL K + +L+ + +A Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118
>PF06580#Sensor histidine kinase Length = 349 Score = 53.3 bits (128), Expect = 1e-09 Identities = 36/172 (20%), Positives = 73/172 (42%), Gaps = 23/172 (13%) Query: 424 PESSRELLSQIRNELNASWAQLRELLTTFRLQLTEPGLRPALEASCEEYSAKFGFPVKLD 483 P +RE+L+ + + S + +LT +++ + S +F ++ + Sbjct: 190 PTKAREMLTSLSELMRYSLRYSNARQVSLADELT------VVDSYLQLASIQFEDRLQFE 243 Query: 484 YQLPPRL----VPSHQAIHLLQIAREALSNALKH-----SQASEVVVTVAQNDNQVKLTV 534 Q+ P + VP L+Q E N +KH Q ++++ +++ V L V Sbjct: 244 NQINPAIMDVQVPPM----LVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296 Query: 535 QDNGCGVPENAIRSNHYGMIIMRDRAQSLRG-DCRVRRRESGGTEVVVTFIP 585 ++ G +N S G+ +R+R Q L G + +++ E G + IP Sbjct: 297 ENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 31.0 bits (70), Expect = 0.011 Identities = 35/166 (21%), Positives = 60/166 (36%), Gaps = 22/166 (13%) Query: 258 IMSLLYLATFGSFIGFSAGFAMLSKTQFPDVQILQYAFFGPFIGALARSA---GGALSDR 314 I+S + L+ + I A A L K + + FFG F S ++ Sbjct: 474 IVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533 Query: 315 LGGTRVTLVNFILMAIFSGLLFLTLPTD----GQGGSFMAFFAVFLALFLTAGLGSGSTF 370 LG T L+ + L+ +LFL LP+ G F+ L +G+T Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTM----------IQLPAGATQ 583 Query: 371 QMISVIFRKLTMDRVKAEGGSDER-----AMREAATDTAAALGFIS 411 + + ++T +K E + E + A + F+S Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVS 629
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 89.5 bits (222), Expect = 4e-24 Identities = 30/105 (28%), Positives = 51/105 (48%), Gaps = 3/105 (2%) Query: 7 KFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGVDALNKLQAGGYGFVISDWNMPNMDGL 66 LV DD + +R ++ L G++ V + + AG V++D MP+ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYD-VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 67 ELLKTIRADGAMSALPVLMVTAEAKKENIIAAAQAGASGYVVKPF 111 +LL I+ LPVL+++A+ I A++ GA Y+ KPF Sbjct: 64 DLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 65.2 bits (159), Expect = 7e-14 Identities = 35/188 (18%), Positives = 72/188 (38%), Gaps = 23/188 (12%) Query: 1 MSKIRVLSVDDSALMRQIMTEIINSHSDMEMVATAPDPLVARDLIKKFNPDVLTLDVEMP 60 M+ +L DD A +R ++ + ++ V + I + D++ DV MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 RMDGLDFLEKLMRLRPMPVVMVSSLTGKGS-EVTLRALELGAIDFVTKPQLGIGEGMLAY 119 + D L ++ + RP V+V ++ + + ++A E GA D++ KP + E + Sbjct: 59 DENAFDLLPRIKKARPDLPVLV--MSAQNTFMTAIKASEKGAYDYLPKP-FDLTELIGII 115 Query: 120 SEMIAEKVRTAAKASLAAHKPLSAPTTLKAGPLLSSEKLIAIGASTGGTEAIRHVLQPLP 179 +AE R +K + + +G S E R + + + Sbjct: 116 GRALAEPKRRPSKLEDDSQDGMP-----------------LVGRSAAMQEIYRVLARLMQ 158 Query: 180 LSSPALLI 187 ++ Sbjct: 159 TDLTLMIT 166
>PF06580#Sensor histidine kinase Length = 349 Score = 42.5 bits (100), Expect = 4e-06 Identities = 23/151 (15%), Positives = 49/151 (32%), Gaps = 52/151 (34%) Query: 361 ELDKSLIERIIDPLT--HLVRNSLDHGIELPEKRLAAGKNSVGNLILSAEHQGGNICIEV 418 +++ ++++ + P+ LV N + HGI G ++L G + +EV Sbjct: 245 QINPAIMDVQVPPMLVQTLVENGIKHGIA--------QLPQGGKILLKGTKDNGTVTLEV 296 Query: 419 TDDGAGLNRERILAKAASQGLTVSENMSDDEVAMLIFAPGFSTAEQVTDVSGRGVGMDVV 478 + G+ + G G+ V Sbjct: 297 ENTGSLALKNTK--------------------------------------ESTGTGLQNV 318 Query: 479 KRNIQEMGG---HVEIQSKQGTGTTIRILLP 506 + +Q + G +++ KQG +L+P Sbjct: 319 RERLQMLYGTEAQIKLSEKQG-KVNAMVLIP 348
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.010 Identities = 22/93 (23%), Positives = 35/93 (37%), Gaps = 11/93 (11%) Query: 46 LISISSPKELIQIAEYFRTPLATAVTGGDRISNSESPIPGGGDDYTQSQGEVNKQPNIEE 105 L +SSP A P + G + ++ PGGGDD GE +++ Sbjct: 384 LADVSSPTAAAGGAGGGEPPKKRDPSAG---AGTDPGGPGGGDD-----GEDPFGEWLDD 435 Query: 106 LKKRM---EQSRLRKLRGDLDQLIESDPKLRAL 135 R+ + L+ R L + + S P L Sbjct: 436 EVARLRLRGRWLLKPRRAALIEALRSAPALAGC 468
>PF05844#YopD protein Length = 295 Score = 33.1 bits (75), Expect = 0.001 Identities = 12/28 (42%), Positives = 22/28 (78%), Gaps = 2/28 (7%) Query: 76 MDLLALLYRLMAKSRQMGMFSLERDIEN 103 ++LL +L+R+ K+R++G+ L+RD EN Sbjct: 74 VELLLILFRIAQKARELGV--LQRDNEN 99
>FLAGELLIN#Flagellin signature. Length = 507 Score = 166 bits (422), Expect = 1e-48 Identities = 166/407 (40%), Positives = 221/407 (54%), Gaps = 5/407 (1%) Query: 2 AQVINTNSLSLITQNNINKNQSALSSSIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 61 AQVINTNSLSL+TQNN+NK+QS+LSS+IERLSSGLRINSAKDDAAGQAIANRFTSNIKGL Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60 Query: 62 TQAARNANDGISVAQTTEGALSEINNNLQRVRELTVQATTGTNSESDLSSIQDEIKSRLE 121 TQA+RNANDGIS+AQTTEGAL+EINNNLQRVREL+VQAT GTNS+SDL SIQDEI+ RLE Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120 Query: 122 EIDRVSSQTQFNGVNVLAKDGKMNIQVGANDGQTITIDLKKIDSSTLNLSSFDATNLGTS 181 EIDRVS+QTQFNGV VL++D +M IQVGANDG+TITIDL+KID +L L F+ + Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEA 180 Query: 182 VKDGATINKQVAVGAGDFKDKASGSLGTLKLVEKDGKYYVNDTTNSKYYDAEVDTSKGEI 241 + + G + A+ + + Y +A + Sbjct: 181 TVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDD 240 Query: 242 NFNSTNESGTTPTAATEVTTVGRDVKLDASALKANQSLVVYKDKSGNDAYIIQTKDVTTN 301 N+T T +T T + + K + D + + Sbjct: 241 AENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVS 300 Query: 302 QSTFNAANISDAGVLSIGASTTAPSNLTADPLKALDDAIASVDKFRSSLGAVQNRLDSAI 361 + L++ T +N+ A L++ + SV + + + + Sbjct: 301 TTINGEKV-----TLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKL 355 Query: 362 ANLNNTTTNLSEAQSRIQDADYATEVSNMSKAQIIQQAGNSVLAKAN 408 ++L E++ + A+Y + + A Sbjct: 356 SDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGV 402 Score = 97.8 bits (243), Expect = 3e-24 Identities = 87/276 (31%), Positives = 123/276 (44%), Gaps = 1/276 (0%) Query: 145 NIQVGANDGQTITIDLKKIDSSTLNLSSFDATNLGTSVKDGATINKQVAVGAGDFKDKAS 204 D + ++ + +A + ++K G + G D + Sbjct: 232 ANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKT 291 Query: 205 GSLGTLKLVEKDGKYYVNDTTNSKYYDAEVDTSKGEINFNSTNESGTTPTAATEVTTVGR 264 G+ G K+ V T A + + + S + T Sbjct: 292 GNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNE 351 Query: 265 DVKLDASALKANQSLVVYKDKSGNDAYIIQTKDVTTNQSTFNAANISDAGVLSIGASTTA 324 KL +G + D T + + +GV ++ A Sbjct: 352 SAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFIDKTASGVSTLINEDAA 411 Query: 325 PSNLTAD-PLKALDDAIASVDKFRSSLGAVQNRLDSAIANLNNTTTNLSEAQSRIQDADY 383 + + PL ++D A++ VD RSSLGA+QNR DSAI NL NT TNL+ A+SRI+DADY Sbjct: 412 AAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADY 471 Query: 384 ATEVSNMSKAQIIQQAGNSVLAKANQVPQQVLSLLQ 419 ATEVSNMSKAQI+QQAG SVLA+ANQVPQ VLSLL+ Sbjct: 472 ATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507
>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature. Length = 538 Score = 32.7 bits (74), Expect = 0.003 Identities = 27/95 (28%), Positives = 43/95 (45%), Gaps = 2/95 (2%) Query: 214 NGMEVSVAAQNAQLTVNNVAIENSSNTISDALENITLNLNDVTTGNQTLTITQDTSKAQT 273 N E +VAA+N + + A+ + +S AL T++L V+T LT T T ++ Sbjct: 236 NSSERAVAARNKAEELVSAALYSRPELLSQALSGKTVDLKIVSTS--LLTPTSLTGGEES 293 Query: 274 AIKDWVNAYNSLIDTFSSLTKYTAVDAGADSQSSS 308 +KD VNA L TK ++ + S Sbjct: 294 MLKDQVNALKGLNSKRGEPTKLLIRNSDGLLKEVS 328
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.2 bits (68), Expect = 0.017 Identities = 10/57 (17%), Positives = 17/57 (29%), Gaps = 2/57 (3%) Query: 164 RFTLLPIFRIPVKMQKVSAASPLTQKPDQARRRF--RLGMLVFFGMLGWALLTAMNQ 218 R L R + + + A L + P R R M ++L + Sbjct: 26 RKQLDTPVREKDENEFLPAHLELIETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEI 82
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 50.9 bits (122), Expect = 4e-09 Identities = 33/129 (25%), Positives = 57/129 (44%), Gaps = 20/129 (15%) Query: 132 AMMLH-IRQQAQAQLPEAITQAVIGRPINFQGLGGDEANAQAQGILERAAKRAGFRDVVF 190 M+ H I+Q + ++ P+ + E A + +A+ AG R+V Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQV---ERRA-----IRESAQGAGAREVFL 140 Query: 191 QYEPVAAGLDYEATLQEEKRVLVVDIGGGTTDCSLLLMGPQWRSRLDREASLLGHSGCRI 250 EP+AA + + E +VVDIGGGTT+ +++ + ++ S RI Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189 Query: 251 GGNDLDIAL 259 GG+ D A+ Sbjct: 190 GGDRFDEAI 198 Score = 34.7 bits (80), Expect = 5e-04 Identities = 32/137 (23%), Positives = 56/137 (40%), Gaps = 23/137 (16%) Query: 332 RLSYRLV---RSAEECKIALSSV--AETRASLPFISDELAT------LISQQGLESALSQ 380 R +Y + +AE K + S + + LA ++ + AL + Sbjct: 203 RRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQE 262 Query: 381 PLARILEQVQLALDNAQEKPDV--------IYLTGGSARSPLIKKALAEQLPGIPIAGGD 432 PL I+ V +AL+ Q P++ + LTGG A + + L E+ GIP+ + Sbjct: 263 PLTGIVSAVMVALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIPVVVAE 319 Query: 433 D-FGSVTAGLARWAEVV 448 D V G + E++ Sbjct: 320 DPLTCVARGGGKALEMI 336
>PF07201#Hypersensitivity response secretion protein HrpJ Length = 293 Score = 31.0 bits (70), Expect = 0.016 Identities = 21/128 (16%), Positives = 37/128 (28%), Gaps = 11/128 (8%) Query: 594 VSLSAAVALLERRSQAIHAPALDRGAVLGALMRLEH-PNASAEAALTMLAQLSPAQSGEA 652 L L+ R + H L A+ M E A +T A Sbjct: 137 KMLCGLRDALKGRPELAHLSHLVEQAL--VSMAEEQGETIVLGARITPEAYRESQSGVNP 194 Query: 653 LQGLLALARHQLACQPTFIAGFSSHLNQLSDADFINALPDLRAAMA--------WLPPRE 704 LQ L R + A +S + + D + + L+ A++ + Sbjct: 195 LQPLRDTYRDAVMGYQGIYAIWSDLQKRFPNGDIDSVILFLQKALSADLQSQQSGSGREK 254 Query: 705 RGTLAHQV 712 G + + Sbjct: 255 LGIVISDL 262
>INTIMIN#Intimin signature. Length = 939 Score = 27.3 bits (60), Expect = 0.028 Identities = 19/94 (20%), Positives = 31/94 (32%) Query: 36 LNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKL 95 + + AITY K K K S ++ F + KT AK + K Sbjct: 671 VANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTDTNGYAKVTLTSTTPGKS 730 Query: 96 TYTDTYAQENVTIDMEKVDFKALQGISGINVSAE 129 + + V + +V+F I N+ Sbjct: 731 LVSARVSDVAVDVKAPEVEFFTTLTIDDGNIEIV 764
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 71.4 bits (175), Expect = 1e-16 Identities = 41/177 (23%), Positives = 77/177 (43%), Gaps = 12/177 (6%) Query: 2 IKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRIS 61 +L+ DD+ R L L ++ ++ SNA + D++ D+ MP + Sbjct: 4 ATILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLEMVGMLDPEHRPYI--VFLTAFD--EYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQ 117 +++ + + RP + + ++A + AIKA E+ A+DYL KP D L + R Sbjct: 62 AFDLLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 118 ERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVT--SHEGKE 172 E ++ L ++Q + + G S + +A + + +T S GKE Sbjct: 121 EPKRRPSKLEDDSQDGMPLV---GRSAAMQEIYRVLARLMQTDLTLMITGESGTGKE 174
>PF06580#Sensor histidine kinase Length = 349 Score = 220 bits (562), Expect = 4e-69 Identities = 63/216 (29%), Positives = 115/216 (53%), Gaps = 3/216 (1%) Query: 343 LGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQA 402 L G + + + +M ++++ L AQ+NPHF+FNALN I+A+I D +A Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKA 193 Query: 403 SQLVQYLSTFFRKNLKR-PSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQ 461 +++ LS R +L+ + V+LADE+ V++YLQ+ +F+ RLQ I + Sbjct: 194 REMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDV 253 Query: 462 QLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGL-YQPVTNASGL 520 Q+P +Q +VEN IKHG +QL G++ + ++ + LE+E+ L + ++G Sbjct: 254 QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313 Query: 521 GMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLP 556 G+ V +RL+ +G + I ++ + + +P Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 43.6 bits (103), Expect = 5e-07 Identities = 42/195 (21%), Positives = 76/195 (38%), Gaps = 18/195 (9%) Query: 1 MPKFRVSLFSLALMLAVPFAPQAVAKTAAATTASQPEIASGSAMI-VDLNTNKVIYSNHP 59 M R+ + SL + +P A A + S+ +++ MI +DL + + + + Sbjct: 1 MRYIRLCIISL--LATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRA 58 Query: 60 DLVRPIASISKLMTAMVVLDARLPLDEKLKVDISQTPEMKGVYSRV---RLNSEISRKDM 116 D P+ S K++ VL DE+L+ I + YS V L ++ ++ Sbjct: 59 DERFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGEL 118 Query: 117 LLLALMSSENRAAASLAHHYPGGYKAFIKAMNAKAKSLGMNNTRFV--EPTGLS-----V 169 A+ S+N +AA+L GG + A + +G N TR E Sbjct: 119 CAAAITMSDN-SAANLLLATVGG----PAGLTAFLRQIGDNVTRLDRWETELNEALPGDA 173 Query: 170 HNVSTARDLTKLLIA 184 + +T + L Sbjct: 174 RDTTTPASMAATLRK 188
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 28.3 bits (63), Expect = 0.019 Identities = 5/33 (15%), Positives = 16/33 (48%), Gaps = 2/33 (6%) Query: 152 WLHNLDQHLKHW-VWLILVVVL-VVGVRWWLKR 182 L + ++ + W++L ++ + R L++ Sbjct: 215 VLMGMSDAVRTFGPWMLLALLAGFMAFRVMLRQ 247
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 114 bits (287), Expect = 5e-33 Identities = 71/253 (28%), Positives = 115/253 (45%), Gaps = 12/253 (4%) Query: 3 QVAIITASDSGIGKECALLLAQQGFDIGITWHSDEEGAKDTAREVVSHGVRAEIVQLDLG 62 ++A IT + GIG+ A LA QG I ++ E+ K + AE D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKA-EARHAEAFPADVR 67 Query: 63 KLPEGAQALEKLIQRLGRIDVLVNNAGAMTKAPFLDMAFDEWRKIFTVDVDGAFLCSQIA 122 + ++ + +G ID+LVN AG + ++ +EW F+V+ G F S+ Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 123 ARQMVKQGQGGRIINITSVHEHTPLPDASAYTAAKHALGGLTKTMALELVRHKILVNAVA 182 ++ M+ + + G I+ + S P +AY ++K A TK + LEL + I N V+ Sbjct: 128 SKYMMDR-RSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 183 PGAIATPM-------NGMDDSDVKPDAEP---SIPLRRFGATHEIASLVAWLCSEGANYT 232 PG+ T M + +K E IPL++ +IA V +L S A + Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 233 TGQSLIVDGGFML 245 T +L VDGG L Sbjct: 247 TMHNLCVDGGATL 259
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 29.0 bits (65), Expect = 0.024 Identities = 32/127 (25%), Positives = 53/127 (41%), Gaps = 5/127 (3%) Query: 122 GAKAMREAVPAHLPVSVKVRLGWDSGEK-KFEIADAVQQAGATELVVHGRTKEQGY-RAE 179 G EA+ ++ + +G + E+ K EI A E+ V GR +G R Sbjct: 190 GGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGF 249 Query: 180 HIDWQAIGE-IRQRLNIPVIANGEIWDWQSAQQCIAISGCDAVMIGRGALNIPNLSRVVK 238 ++ I E +++ L V A + + IS V+ G GAL + NL R++ Sbjct: 250 TLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL-LRNLDRLL- 307 Query: 239 YNEPRMP 245 E +P Sbjct: 308 MEETGIP 314
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 45.6 bits (108), Expect = 2e-07 Identities = 32/166 (19%), Positives = 70/166 (42%), Gaps = 4/166 (2%) Query: 34 LDTIARNFSLSASSAGFIVTAAQLGYAAGLLFLVPLGDMFERRRLIVSMTLLAAGGMLIT 93 L IA +F+ +S ++ TA L ++ G L D +RL++ ++ G +I Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 94 ASSQSLAMMLLGTAL---TGLFSVVAQILVPLAATLASPDKRGKVVGTIMSGLLLGILLA 150 S +L+ G + A ++V + A + RGK G I S + +G + Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMV-VVARYIPKENRGKAFGLIGSIVAMGEGVG 155 Query: 151 RTVAGLLANLGGWRTVFWVASVLMALMALALWRGLPQMKSETHLNY 196 + G++A+ W + + + + + + +++ + H + Sbjct: 156 PAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.018 Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 12/94 (12%) Query: 23 PYQEILLTRLCMHMQSKLLENRNKMLKAQGINETLFMALITLESQENHSIQPSELSCALG 82 P QE+ L + + L R A+G + + T + ++L ALG Sbjct: 756 PEQELRLVETGVQGRLWALLTREGAPAAEGAAQKGYSVNTTFVTI-------ADLVQALG 808 Query: 83 -----SSRTNATRIADELEKRGWIERRESDNDRR 111 SS ++ D L + GW RE+ RR Sbjct: 809 ADPGKSSPMLEGQVRDWLNENGWEYLRETSGQRR 842
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 78.7 bits (194), Expect = 5e-18 Identities = 64/412 (15%), Positives = 120/412 (29%), Gaps = 97/412 (23%) Query: 25 LLLTLLFIIIAVAIGIYWFLVLRHFEETDDA----YVAGNQIQIMSQVSGSVTKVWADNT 80 L FI+ + I VL E A +G +I + V ++ Sbjct: 57 PRLVAYFIMGFLVIAFILS-VLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115 Query: 81 DFVKEGDVLVTLDPTDARQAFEKA------------------------------------ 104 + V++GDVL+ L A K Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPY 175 Query: 105 ----------------KTALASSVRQTHQLMINSKQLQANIEVQKIALAKA-------QS 141 K ++ Q +Q +N + +A + + +S Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235 Query: 142 DYNRRVPLGNANLIGREELQHARDAVTSAQAQLDVAIQQYNANQAMILGTKLEDQPAVQQ 201 + L + I + + + A +L V Q ++ IL K E Q Q Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295 Query: 202 AATEVRN------------------AWLALERTRIVSPMTGYVSRRAVQ-PGAQISPTTP 242 E+ + + + I +P++ V + V G ++ Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355 Query: 243 LMAVVPA-TNMWVDANFKETQIANMRIGQPVTITTDIYGDDVKY---TGKVVGLDMGTGS 298 LM +VP + V A + I + +GQ I + + +Y GKV + + Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF-PYTRYGYLVGKVKNI-----N 409 Query: 299 AFSLLPAQNATGNWIKVVQRLPVRIELDQKQLEQYPLRIGLSTLVSVNTTNR 350 ++ G V+ + + PL G++ + T R Sbjct: 410 LDAIE--DQRLGLVFNVIISIEENCLST--GNKNIPLSSGMAVTAEIKTGMR 457
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 132 bits (333), Expect = 9e-36 Identities = 97/405 (23%), Positives = 169/405 (41%), Gaps = 23/405 (5%) Query: 17 IALSLATFMQVLDSTIANVAIPTIAGNLGSSLSQGTWVITSFGVANAISIPLTGWLAKRV 76 I L + +F VL+ + NV++P IA + + WV T+F + +I + G L+ ++ Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76 Query: 77 GEVKLFLWSTIAFAIASWACGVS-SSLNMLIFFRVIQGIVAGPLIPLSQSLLLNNYPPAK 135 G +L L+ I S V S ++LI R IQG A L ++ P Sbjct: 77 GIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKEN 136 Query: 136 RSIALALWSMTVIVAPICGPILGGYISDNYHWGWIFFINVPIGVAVVLMTLQTLRGRETR 195 R A L V + GP +GG I+ HW + + +P+ + + L L +E R Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVR 194 Query: 196 TERRRIDAVGLALLVIGIGSLQIMLDRGKELDWFSSQEIIILTVVAVVAICFLIVWELTD 255 + D G+ L+ +GI + ML F++ I +V+V++ + Sbjct: 195 I-KGHFDIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKV 243 Query: 256 DNPIVDLSLFKSRNFTIGCLCISLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGI 315 +P VD L K+ F IG LC + + G + ++P ++++V+ + G G Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303 Query: 316 IPVILS-PIIGRFAHKLDMRRLVTFSFIMYAVCFYWRAYTFEPGMDFGASAWPQFIQGF- 373 + VI+ I G + ++ +V F ++ S + I F Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFL-----LETTSWFMTIIIVFV 358 Query: 374 --AVACFFMPLTTITLSGLPPERLAAASSLSNFTRTLAGSIGTSI 416 ++ ++TI S L + A SL NFT L+ G +I Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAI 403
>LUXSPROTEIN#Bacterial autoinducer-2 (AI-2) production protein LuxS signature. Length = 171 Score = 292 bits (750), Expect = e-105 Identities = 132/170 (77%), Positives = 148/170 (87%) Query: 2 PLLDSFTVDHTRMEAPAVRVAKTMNTPHGDAITVFDLRFCVPNKEVMPERGIHTLEHLFA 61 PLLDSFTVDHTRM APAVRVAKTM TP GD ITVFDLRF PNK+++ E+GIHTLEHL+A Sbjct: 1 PLLDSFTVDHTRMNAPAVRVAKTMQTPKGDTITVFDLRFTAPNKDILSEKGIHTLEHLYA 60 Query: 62 GFMRNHLNGNGVEIIDISPMGCRTGFYMSLIGTPDEQRVADAWKAAMEDVLKVQDQNQIP 121 GFMRNHLNG+ VEIIDISPMGCRTGFYMSLIGTP EQ+VADAW AAMEDVLKV++QN+IP Sbjct: 61 GFMRNHLNGDSVEIIDISPMGCRTGFYMSLIGTPSEQQVADAWIAAMEDVLKVENQNKIP 120 Query: 122 ELNVYQCGTYQMHSLQEAQDIARSILERDVRINSNEELALPEEKLQELHI 171 ELN YQCGT MHSL EA+ IA++ILE V +N N+ELALPE L+EL I Sbjct: 121 ELNEYQCGTAAMHSLDEAKQIAKNILEVGVAVNKNDELALPESMLRELRI 170
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 777 bits (2007), Expect = 0.0 Identities = 320/849 (37%), Positives = 470/849 (55%), Gaps = 48/849 (5%) Query: 31 SGMLCTTANAEEYYFDPIMLETTKSGMQTTDLSRFSKKYAQLPGTYQVDIWLNKKKVSQK 90 + ++ E YF+P L DLSRF PGTY+VDI+LN ++ + Sbjct: 35 AFAAQAPLSSAELYFNPRFLAD--DPQAVADLSRFENGQELPPGTYRVDIYLNNGYMATR 92 Query: 91 KITFTAN-AEQLLQPQFTVEQLRELGIKVDEIPALAEKDDDSVINSLEQIIPGTAAEFDF 149 +TF +EQ + P T QL +G+ + + DD+ + L +I A+ D Sbjct: 93 DVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVP-LTSMIHDATAQLDV 151 Query: 150 NHQRLNLSIPQIALYRDARGYVSPSRWDDGIPTLFTNYSFTGSDNRYRQGNRSQRQYLNM 209 QRLNL+IPQ + ARGY+ P WD GI NY+F+G+ + R G S YLN+ Sbjct: 152 GQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNL 211 Query: 210 QNGANFGPWRLRNYSTWTRNDQTSS------WNTISSYLQRDIKALKSQLLLGESATSGS 263 Q+G N G WRLR+ +TW+ N SS W I+++L+RDI L+S+L LG+ T G Sbjct: 212 QSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGD 271 Query: 264 IFSSYTFTGVQLASDDNMLPNSQRGFAPTVRGIANSSAIVTIRQNGYVIYQSNVPAGAFE 323 IF F G QLASDDNMLP+SQRGFAP + GIA +A VTI+QNGY IY S VP G F Sbjct: 272 IFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFT 331 Query: 324 INDLYPSSNSGDLEVTIEESDGTQRRFIQPYSSLPMMQRPGHLKYSATAGRYRADANSDS 383 IND+Y + NSGDL+VTI+E+DG+ + F PYSS+P++QR GH +YS TAG YR+ Sbjct: 332 INDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGNAQQE 391 Query: 384 KEPEFAEATAIYGLNNTFTLYGGLLGSEDYYALGIGIGGTLGALGALSMDINRADTQFDN 443 K P F ++T ++GL +T+YGG ++ Y A GIG +GALGALS+D+ +A++ + Sbjct: 392 K-PRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPD 450 Query: 444 RHSFHGYQWRTQYIKDIPETNTNIAVSYYRYTNDGYFSFDEA------------------ 485 G R Y K + E+ TNI + YRY+ GYF+F + Sbjct: 451 DSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQ 510 Query: 486 ----NTRNWDYNSRQKSEIQFNISQTIFDGVSLYASGSQQDYWGNNDKNRNISVGVSGQQ 541 T ++ ++ ++Q ++Q + +LY SGS Q YWG ++ + G++ Sbjct: 511 VKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAF 570 Query: 542 WGIGYSLNYQYSRYTDQN-NDRALSLNLSIPLERWLPRSR--------VSYQMTSQKDRP 592 I ++L+Y ++ Q D+ L+LN++IP WL SY M+ + Sbjct: 571 EDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGR 630 Query: 593 TQHEMRLDGSLLDDGRLSYSLEQSLDDDNNHNS----SLNASYRSPYGTFSAGYSYGNDS 648 + + G+LL+D LSYS++ + NS +YR YG + GYS+ +D Sbjct: 631 MTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDI 690 Query: 649 SQYNYGVTGGVVIHPHGVTLSQYLGNAFALIDANGASGVRIQNYPGIATDPFGYAVVPYL 708 Q YGV+GGV+ H +GVTL Q L + L+ A GA +++N G+ TD GYAV+PY Sbjct: 691 KQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYA 750 Query: 709 TTYQENRLSVDTTQLPDNVDLEQTTQFVVPNRGAMVAARFNANIGYRVLVTVSDRNGKPL 768 T Y+ENR+++DT L DNVDL+ VVP RGA+V A F A +G ++L+T+ N KPL Sbjct: 751 TEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTL-THNNKPL 809 Query: 769 PFGALASNDETGQQSIVDEGGILYLSGISSKSQSWTVRWGNQADQQCQFAFSTPDSEPTT 828 PFGA+ +++ + IV + G +YLSG+ + V+WG + + C + P Sbjct: 810 PFGAMVTSESSQSSGIVADNGQVYLSGMPLAGK-VQVKWGEEENAHCVANYQLPPESQQQ 868 Query: 829 SVLQGTAQC 837 + Q +A+C Sbjct: 869 LLTQLSAEC 877
>FIMBRIALPAPF#Escherichia coli: P pili tip fibrillum papF protein signature. Length = 167 Score = 28.5 bits (63), Expect = 0.024 Identities = 42/160 (26%), Positives = 66/160 (41%), Gaps = 21/160 (13%) Query: 208 VKLSIQGNLTAPQSCKINQGDVIKVNFGFINGQKFTTRNAMPDGFTPVDFDITYDCGDTS 267 V+++I+GN+ P C IN G I V+FG IN + V +I+ C S Sbjct: 21 VQINIRGNVYIP-PCTINNGQNIVVDFGNINPEHVDNSRG------EVTKNISISCPYKS 73 Query: 268 KIKNSLQMRIDGTTGVVDQYNLVARRRSSDNAPDVGIRIENLGGGVANIPFQNG------ 321 SL +++ G T V Q N++A N GI + G + NG Sbjct: 74 ---GSLWIKVTGNTMGVGQNNVLA-----TNITHFGIALYQGKGMSTPLTLGNGSGNGYR 125 Query: 322 ILPVDPSGHGTINMRAWPVNLVGGELETGKFQGTATITVI 361 + + T + P G L G F+ TA++++I Sbjct: 126 VTAGLDTARSTFTFTSVPFRNGSGILNGGDFRTTASMSMI 165
>BINARYTOXINB#Binary toxin B family signature. Length = 764 Score = 30.0 bits (67), Expect = 0.041 Identities = 11/72 (15%), Positives = 24/72 (33%), Gaps = 4/72 (5%) Query: 487 AGVNGGSGIALTGSPITPRATTDSGMTTNNPTLQTTPTDDQFTNNGGRVDAVYIVATPGE 546 + V+G + + + I + + ++ T D + G R A + + Sbjct: 330 SEVHGNAEVHASFFDIGGSVSAGFSNSNSS----TVAIDHSLSLAGERTWAETMGLNTAD 385 Query: 547 IAFIKPMIAMRN 558 A + I N Sbjct: 386 TARLNANIRYVN 397
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 27.6 bits (61), Expect = 0.036 Identities = 26/111 (23%), Positives = 44/111 (39%), Gaps = 22/111 (19%) Query: 42 NKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIA-------NDRL 94 K+L GN + A T + IA + V AI+ D+ Sbjct: 277 TKVL--GNVGKGISQYIIAQRAAQGLSTSAAAAGLIA----SAVTLAISPLSFLSIADKF 330 Query: 95 HD----EVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVA 141 E Y+++ + LG+ GD LLA + A++A++T T++A Sbjct: 331 KRANKIEEYSQRFKKLGYDGDSLLAAFHKETG-----AIDASLTTISTVLA 376
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 72.0 bits (176), Expect = 6e-16 Identities = 32/184 (17%), Positives = 63/184 (34%), Gaps = 38/184 (20%) Query: 90 GLGSGVIINASKGYVLTNNHVINQAQKISIQL------------NDGREFDAKLIGSDDQ 137 + SGV++ K +LTN HV++ L +G ++ + Sbjct: 102 FIASGVVVG--KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 138 SDIALLQIQN-------PSKLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIVSALG 190 D+A+++ + ++++ + +V G P V+ + Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VATMW 212 Query: 191 RSGLNLEGLEN-FIQTDASINRGNSGGALLNLNGELIGINTAILAPGGGSVGIGFAIPSN 249 S + L+ +Q D S GNSG + N E+IGI+ G+ Sbjct: 213 ESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWG---------GVPNEFNGA 263 Query: 250 MART 253 + Sbjct: 264 VFIN 267
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 52.7 bits (126), Expect = 8e-10 Identities = 31/160 (19%), Positives = 59/160 (36%), Gaps = 26/160 (16%) Query: 77 RTLGSGVIMDQRGYIITNKHVINDADQIIVALQ------------DGRVFEALLVGSDSL 124 + SGV++ + ++TNKHV++ AL+ +G + Sbjct: 101 TFIASGVVVG-KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGE 159 Query: 125 TDLAVLKI-------NATGGLPTIPINARRVPHIGDVVLAIGNPYNLGQTITQGIISATG 177 DLA++K + + ++ + + G P + T + G Sbjct: 160 GDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGD-KPVATMW--ESKG 216 Query: 178 RIGLNPTGRQNFLQTDASINHGNSGGALVNSLGELMGINT 217 +I + +Q D S GNSG + N E++GI+ Sbjct: 217 KI---TYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 28.1 bits (62), Expect = 0.045 Identities = 38/167 (22%), Positives = 61/167 (36%), Gaps = 27/167 (16%) Query: 3 VAVLGAAGGIGQALALLLKTQLPSGSELSLYDIAPVTPGVAVDLSHIPTAVKIKGFSGED 62 + GAA GIG+A+A L G+ ++ D P V S A + F + Sbjct: 11 AFITGAAQGIGEAVARTL---ASQGAHIAAVDYNP-EKLEKVVSSLKAEARHAEAFPADV 66 Query: 63 ATPA------------LEGADVVLISAGVARK------PGMDRSDLFNVNAGIVKNLVQQ 104 A + D+++ AGV R + F+VN+ V N + Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 105 VSKTCPK----ACIGIITNPVNTT-VAIAAEVLKKAGVYDKNKLFGV 146 VSK + + + +NP ++AA KA K G+ Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGL 173
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 169 bits (430), Expect = 4e-57 Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 5/141 (3%) Query: 15 KALLKEEKFSSQGEIVAALQEQGFDNINQSKVSRMLTKFGAVRTRNAKMEMVYCLPAELG 74 + ++ + +Q E+V L++ G+ N+ Q+ VSR + + V+ Y LPA+ Sbjct: 11 REIITANEIETQDELVDILKKDGY-NVTQATVSRDIKELHLVKVPTNNGSYKYSLPADQR 69 Query: 75 VPTTSSPLKNLV---LDIDYNDAVVVIHTSPGAAQLIARLLDSLGKAEGILGTIAGDDTI 131 S ++L+ + ID ++V+ T PG AQ I L+D+L E I+GTI GDDTI Sbjct: 70 FNPLSKLKRSLMDAFVKIDSASHLIVLKTMPGNAQAIGALMDNLDWEE-IMGTICGDDTI 128 Query: 132 FTTPANGFTVKDLYEAILELF 152 K + + ILEL Sbjct: 129 LIICRTHDDTKVVQKKILELL 149
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 53.3 bits (128), Expect = 4e-10 Identities = 28/163 (17%), Positives = 59/163 (36%), Gaps = 16/163 (9%) Query: 6 RKFSRTAITVVLVILAFIAIFNAWVYYTE----SPWTRDARFSADVVAIAPDVSGLITQV 61 SR V I+ F+ I + + S I P + ++ ++ Sbjct: 51 TPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEI 110 Query: 62 NVHDNQLVKKGQILFTIDQPR-------YQKALEEAQADVAYYQVLAQEKRQEAGRRNRL 114 V + + V+KG +L + Q +L +A+ + YQ+L++ E + L Sbjct: 111 IVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRS--IELNKLPEL 168 Query: 115 GVQAMSREEIDQANNVL---QTVLHQLAKAQATRDLAKLDLER 154 + + VL + Q + Q + +L+L++ Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211 Score = 51.4 bits (123), Expect = 2e-09 Identities = 28/147 (19%), Positives = 54/147 (36%), Gaps = 15/147 (10%) Query: 100 LAQEKRQEAGRRNRLGVQ-AMSREEIDQANNVLQT-VLHQLAKAQAT-------RDLAKL 150 E R + ++ + ++EE + + +L +L + + Sbjct: 264 AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEE 323 Query: 151 DLERTVIRAPADGWVTNLNVYT-GEFITRGSTAVALVKQNSFY-VLAYMEETKLEGVRPG 208 + +VIRAP V L V+T G +T T + +V ++ V A ++ + + G Sbjct: 324 RQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVG 383 Query: 209 YRAEIT----PLGSNKVLKGTVDSVAA 231 A I P L G V ++ Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNINL 410
>DNABINDNGFIS#DNA-binding protein FIS signature. Length = 98 Score = 157 bits (399), Expect = 3e-54 Identities = 98/98 (100%), Positives = 98/98 (100%) Query: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 Query: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 129 bits (325), Expect = 1e-39 Identities = 78/209 (37%), Positives = 122/209 (58%), Gaps = 3/209 (1%) Query: 1 MAKRTKAEALKTRQELIETAIAQFAQHGVSKTTLNDIADAANVTRGAIYWHFENKTQLFN 60 MA++TK EA +TRQ +++ A+ F+Q GVS T+L +IA AA VTRGAIYWHF++K+ LF+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EMW-LQQPSLRELIQEHLTAGLEHDPFQQLREKLIVGLQYIAKIPRQQALLKILYHKCEF 119 E+W L + ++ EL E A DP LRE LI L+ R++ L++I++HKCEF Sbjct: 61 EIWELSESNIGELELE-YQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119 Query: 120 NDEM-LAEGVIREKMGFNPQTLREVLQACQQQGCVANNLDLDVVMIIINGAFSGIVQNWL 178 EM + + R + + + L+ C + + +L II+ G SG+++NWL Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179 Query: 179 MNMAGYDLYKQAPALVDNVLRMFMPDENI 207 +DL K+A V +L M++ + Sbjct: 180 FAPQSFDLKKEARDYVAILLEMYLLCPTL 208
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 42.9 bits (101), Expect = 1e-06 Identities = 38/217 (17%), Positives = 70/217 (32%), Gaps = 38/217 (17%) Query: 98 ATYQANYDSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADA-RQADAAV 156 K +L + E+ A + Q + I D RQ + Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLV----------TQLFKNEILDKLRQTTDNI 311 Query: 157 IAAKATVESARINLAYTKVTAPISGRIGK-STVTEGALVTNGQTTELATVQQLDPIYVDV 215 + + + AP+S ++ + TEG +VT +T + V + D + V Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTA 370 Query: 216 TQSSND--FMRLKQSVEQGNLHKENATSNVELVMENGQTYP-LKGTLQ--FSDVTVDEST 270 + D F+ + Q+ +++ Y L G ++ D D+ Sbjct: 371 LVQNKDIGFINVGQNAI------------IKVEAFPYTRYGYLVGKVKNINLDAIEDQRL 418 Query: 271 GSIT--LRAV------FPNPQHTLLPGMFVRARIDEG 299 G + + ++ N L GM V A I G Sbjct: 419 GLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455 Score = 34.4 bits (79), Expect = 7e-04 Identities = 22/127 (17%), Positives = 43/127 (33%), Gaps = 13/127 (10%) Query: 46 TAPLEVKTELPGR-TNAYRIAEVRPQVSGIVLNRNFTEGSDVQAGQSLYQIDPATYQANY 104 +E+ G+ T++ R E++P + IV EG V+ G L ++ +A Sbjct: 77 LGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEA-- 134 Query: 105 DSAKGELAKSEAAAAIAHLTVKRYVPLVGTKYISQQEYDQAIADARQADAAVIAAKATVE 164 + K++++ A L RY L E ++ + Sbjct: 135 -----DTLKTQSSLLQARLEQTRYQIL-----SRSIELNKLPELKLPDEPYFQNVSEEEV 184 Query: 165 SARINLA 171 +L Sbjct: 185 LRLTSLI 191 Score = 29.0 bits (65), Expect = 0.031 Identities = 11/34 (32%), Positives = 15/34 (44%), Gaps = 1/34 (2%) Query: 65 AEVRPQVSGIVLNRN-FTEGSDVQAGQSLYQIDP 97 + +R VS V TEG V ++L I P Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVP 361
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1408 bits (3645), Expect = 0.0 Identities = 1029/1034 (99%), Positives = 1030/1034 (99%) Query: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPDTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 EVQQQGISVEKSSSSYLMVAGFVSDNP TTQDDISDYVASNVKDTLSRLNGVGDVQLFGA Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRL 240 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATLLKPTSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 SVLVALILTPALCATLLKP SAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNSDENSAEAVIHRAKMELGKIRDG 660 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERN DENSAEAVIHRAKMELGKIRDG Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKVYVQADAKFRM 780 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKK+YVQADAKFRM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1021 VPVFFVVIRRCFKG 1034 VPVFFVVIRRCFKG Sbjct: 1021 VPVFFVVIRRCFKG 1034
>adhesinb#Adhesin B signature. Length = 310 Score = 27.5 bits (61), Expect = 0.004 Identities = 14/68 (20%), Positives = 26/68 (38%), Gaps = 10/68 (14%) Query: 1 MKR---LIPVALLTALLAGCAHDSPCVPVYDDQGRLVHTNTCMKGTTQDNWETAGAIAGG 57 MK+ L+ + L LA C+ + +V TN+ + T++ IAG Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKN-------IAGD 53 Query: 58 AAAVAGLT 65 + + Sbjct: 54 KINLHSIV 61
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 146 bits (371), Expect = 2e-46 Identities = 63/143 (44%), Positives = 85/143 (59%), Gaps = 2/143 (1%) Query: 3 ATLPFLILYACLSVLLFLWDAKHGLLPDRFTCPLLWSGLLFSQVCNPDGLADALWGAIIG 62 TL L+L L L F+ D LLPD+ T PLLW GLLF+ + L DA+ GA+ G Sbjct: 133 GTLAALLLTWVLVALTFI-DLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAG 191 Query: 63 YGTFAVIYWGYRILRHKEGLGYGDVKFLAALGAWHTWTFLPRLVFLAASFACGAVVVGLL 122 Y +YW +++L KEG+GYGD K LAALGAW W LP +V L +S + +GL+ Sbjct: 192 YLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALP-IVLLLSSLVGAFMGIGLI 250 Query: 123 MRGKESLKNPLPFGPFLAAAGFV 145 + P+PFGP+LA AG++ Sbjct: 251 LLRNHHQSKPIPFGPYLAIAGWI 273
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 38.3 bits (89), Expect = 3e-06 Identities = 19/103 (18%), Positives = 43/103 (41%), Gaps = 10/103 (9%) Query: 44 EYHESIDEMKHADKYIERILFLEGIPN--LQDLGKL------GIGEDVEEMLQSDLRLEL 95 E ++ E D ER+L + G P +++ + G EM+Q+ + Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109 Query: 96 EGAKDLREAIAYADSVHDYVSRDMMIEILADEEGHIDWLETEL 138 + + + + I A+ D + D+ + ++ + E + L + L Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 613 bits (1583), Expect = 0.0 Identities = 178/698 (25%), Positives = 304/698 (43%), Gaps = 81/698 (11%) Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68 + NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 69 TAFWSGMAKQYEPHRINIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128 + W ++NIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++ Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114 Query: 129 QANKYKVPRIAFVNKMDRMGANFLKVVNQIKTRLGANPVPLQLAIGAEEHFTGVVDLVKM 188 K +P I F+NK+D+ G + V IK +L A V Q V M Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164 Query: 189 KAINWNDADQGVTFEYEDIPADMVELANEWHQNLIESAAEASEELMEKYLGGEELTEAEI 248 N+ +++Q ++ E +++L+EKY+ G+ L E+ Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200 Query: 249 KGALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308 + R N + V GSA N G+ +++ + + S Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243 Query: 309 RHASDDEPFSALAFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKAARERFGRIVQMHA 368 FKI L + R+YSGV++ D+V S K + + + Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSIN 299 Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCDPDAPIILERMEFPEPVISIAVEPKT 424 + +I + +G+I L V GDT P ER+E P P++ VEP Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354 Query: 425 KADQEKMGLALGRLAKEDPSFRVWTDEESNQTIIAGMGELHLDIIVDRMKREFNVEANVG 484 +E + AL ++ DP R + D +++ I++ +G++ +++ ++ +++VE + Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIK 414 Query: 485 KPQVAYRETIRQKVTDVEGKHAKQSGGRGQYGHVVIDMYPLEPGSNPKGYEFINDIKGGV 544 +P V Y E +K E + + + + + PL GS G ++ + + G Sbjct: 415 EPTVIYMERPLKK---AEYTIHIEVPPNPFWASIGLSVSPLPLGS---GMQYESSVSLGY 468 Query: 545 IPGEYIPAVDKGIQEQLKAGPLAGYPVVDMGIRLHFGSYHDVDSSELAFKLAASIAFKEG 604 + + AV +GI+ + G L G+ V D I +G Y+ S+ F++ A I ++ Sbjct: 469 LNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQV 527 Query: 605 FKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRGMLKGQESEVTGVKIHAEVPLSEMF 664 KKA LLEP + ++ P+E D + + + + V + E+P + Sbjct: 528 LKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKNNEVILSGEIPARCIQ 587 Query: 665 GYATQLRSLTKGRASYTMEFLKYDEAPSNVAQAVIEAR 702 Y + L T GR+ E Y + V + R Sbjct: 588 EYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQPR 622
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 29.0 bits (65), Expect = 0.022 Identities = 14/62 (22%), Positives = 29/62 (46%), Gaps = 1/62 (1%) Query: 160 ASSVEDLVTQTLEFTIEEVNADRNV-SNNAKNRQIVLNLYEKGIFDIKDAINQVADRLNI 218 A +V+D VTQ +E + ++ + S + + + L + D A QV ++L + Sbjct: 54 AQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQL 113 Query: 219 SK 220 + Sbjct: 114 AT 115
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 132 bits (334), Expect = 5e-40 Identities = 79/226 (34%), Positives = 124/226 (54%), Gaps = 9/226 (3%) Query: 28 AAKPATTADSKAAFKNDDQKSAYALGASLGRYMENSLKEQEKLGIKLDKDQLIAGVQDAF 87 A A A + D K +Y++GA LG K + GI ++ D L G+QD Sbjct: 14 AMSTAMAATDATSLTTDKDKLSYSIGADLG-------KNFKNQGIDINPDVLAKGMQDGM 66 Query: 88 A-DKSKLSDQEIEQTLQAFEARVKSSAQAKMEKDAADNEAKGKEYREKFAKEKGVKTSST 146 + + L++++++ L F+ + + A+ K A +N+AKG + + G+ + Sbjct: 67 SGAQLILTEEQMKDVLSKFQKDLMAKRSAEFNKKAEENKAKGDAFLSANKSKPGIVVLPS 126 Query: 147 GLVYQVVEAGKGEAPKDSDTVVVNYKGTLIDGKEFDNSYTRGEPLSFRLDGVIPGWTEGL 206 GL Y++++AG G P SDTV V Y GTLIDG FD++ G+P +F++ VIPGWTE L Sbjct: 127 GLQYKIIDAGTGAKPGKSDTVTVEYTGTLIDGTVFDSTEKAGKPATFQVSQVIPGWTEAL 186 Query: 207 KNIKKGGKIKLVIPPELAYGKAGVPG-IPPNSTLVFDVELLDVKPA 251 + + G ++ +P +LAYG V G I PN TL+F + L+ VK A Sbjct: 187 QLMPAGSTWEVFVPADLAYGPRSVGGPIGPNETLIFKIHLISVKKA 232
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 30.7 bits (69), Expect = 0.021 Identities = 13/69 (18%), Positives = 29/69 (42%), Gaps = 6/69 (8%) Query: 261 TAIDPFKGLLLG---LFFISVGMSLNLGVLYTHL-LWVVISVVVLVAVKILVLYLLARLY 316 A+ P L + L+FIS + L +++ + W +++ V+ ++ L Sbjct: 318 AAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKA-- 375 Query: 317 GVRSSERMQ 325 S +M+ Sbjct: 376 QYTSMAKMR 384
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 31.9 bits (72), Expect = 0.001 Identities = 32/135 (23%), Positives = 51/135 (37%), Gaps = 16/135 (11%) Query: 11 YAHPESQDSVANRVLLKPATQLSNVTVHDLYAHYPDFFIDIPREQALLREHEVIVFQH-- 68 Y P + D N+V P + + +HD+ ++ D F L + + Sbjct: 9 YQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCV 68 Query: 69 ----PLYTYSCPALLKEWLDRVLSRGFASGPGGNQLAGKYWRSVITTGEPESA------Y 118 P+ + P DR L F GPG N +G Y +IT PE + Sbjct: 69 QLGIPVVYTAQPGSQNP-DDRALLTDFW-GPGLN--SGPYEEKIITELAPEDDDLVLTKW 124 Query: 119 RYDALNRYPMSDVLR 133 RY A R + +++R Sbjct: 125 RYSAFKRTNLLEMMR 139
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 32.3 bits (73), Expect = 0.007 Identities = 29/152 (19%), Positives = 54/152 (35%), Gaps = 22/152 (14%) Query: 504 KVEPFDGDLEDYQQWLSDVQKQENQTDDAPKENANSAQARKDQKRREAELRAQTQPLRKE 563 + D + ++ E + D ++ R+ +R R + L E Sbjct: 272 AMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAE 331 Query: 564 IARLEKEME---------------------KLNAQLAQAEEKLGDSELYDQSRKAELTAC 602 +LE++ + +L A+ + EE+ SE QS + +L A Sbjct: 332 HQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDAS 391 Query: 603 LQQQASAKSGLEECEMAWLEAQEQLEQMLLEG 634 + + + LEE L A E+L + L E Sbjct: 392 REAKKQVEKALEEANSK-LAALEKLNKELEES 422 Score = 31.2 bits (70), Expect = 0.015 Identities = 13/125 (10%), Positives = 39/125 (31%), Gaps = 7/125 (5%) Query: 513 EDYQQWLSDVQKQENQTDDAPKENANSAQARKDQKRREAELRAQTQPLRKEIARLEKEME 572 + + ++ + E A A + D ++ + +++ Sbjct: 127 KALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST-------ADSAKIK 179 Query: 573 KLNAQLAQAEEKLGDSELYDQSRKAELTACLQQQASAKSGLEECEMAWLEAQEQLEQMLL 632 L A+ A E + + E + TA + + ++ + ++ LE + Sbjct: 180 TLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMN 239 Query: 633 EGQSN 637 ++ Sbjct: 240 FSTAD 244
>PF07299#Fibronectin-binding protein (FBP) Length = 219 Score = 31.8 bits (72), Expect = 0.002 Identities = 10/46 (21%), Positives = 21/46 (45%), Gaps = 2/46 (4%) Query: 71 PEANDFGLLEQTFIEYGQSGKGKSRKYLHTYDEAVPWNQVPGTFTP 116 P+ + + E ++ KG SRK++ ++ + + GTF Sbjct: 112 PDMEELDMKELSY--LSWIDKGSSRKFIIAKNDKNKFVGLQGTFQS 155
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 29.4 bits (66), Expect = 0.024 Identities = 26/109 (23%), Positives = 42/109 (38%), Gaps = 24/109 (22%) Query: 215 VITAENGIVFRENLLFTHRGLSGPAVLQISSYWQPGEFVSINLLPDVDLETFL--NEQRN 272 ++ E I RE RG GP +L + ++ + + + L T + N Q Sbjct: 58 LLNLEEAITLRE------RGWKGP-ILMLEGFFHAQD---LEIYDQHRLTTCVHSNWQLK 107 Query: 273 AHPNQSLKNTLAVHL------------PKRLVERLQQLGQIPDVSLKQL 309 A N LK L ++L P R++ QQL + +V L Sbjct: 108 ALQNARLKAPLDIYLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTL 156
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 42.5 bits (100), Expect = 2e-06 Identities = 47/275 (17%), Positives = 94/275 (34%), Gaps = 32/275 (11%) Query: 44 PVSQVAFSFGLLSLGLAIS----SSVAGKLQERFGVKRVTMASGILLGLGFFLTAHSNNL 99 + V +G+L A+ + V G L +RFG + V + S + + + A + L Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96 Query: 100 MMLWLS---AGVLVGLADGAGYLL----TLSNCVKWFPERKGLISAFAIGSYGLGSLGFK 152 +L++ AG+ AG + + F + LG Sbjct: 97 WVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG---- 152 Query: 153 FIDTQLLETVGLEKTFVIWGAIALVMIVFGATLMKDAPKQEVKTSNGVVEKDYTLAESMR 212 L+ F A+ + + G L+ ++ K E + R Sbjct: 153 -----LMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWAR 207 Query: 213 --KPQYWMLAVMFLTACMSG----LYVIGVAKDIAQSLAHLDVVSAANAVTVISIAN-LS 265 ++AV F+ + L+VI + H D + ++ I + L+ Sbjct: 208 GMTVVAALMAVFFIMQLVGQVPAALWVI-----FGEDRFHWDATTIGISLAAFGILHSLA 262 Query: 266 GRLVLGILSDKIARIRVITIGQVISLVGMAALLFA 300 ++ G ++ ++ R + +G + G L FA Sbjct: 263 QAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297 Score = 36.0 bits (83), Expect = 2e-04 Identities = 37/155 (23%), Positives = 64/155 (41%), Gaps = 9/155 (5%) Query: 241 AQSLAHLDVVSAANAVTVISIANLSGRLVLGILSDKIARIRVITIGQVISLVGMAALLFA 300 AH ++ A A+ + A + G L SD+ R V+ + + V A + A Sbjct: 39 NDVTAHYGILLALYALMQFACAPVLGAL-----SDRFGRRPVLLVSLAGAAVDYAIMATA 93 Query: 301 PLNAVTFFAAIACVAFNFGGTITVFPSLVSEFFGLNNLAKNYGVIYLGFGIGSIFGSIIA 360 P V + I VA G T V + +++ + A+++G + FG G + G ++ Sbjct: 94 PFLWVLYIGRI--VAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLG 151 Query: 361 SLFGGF--YVTFYVIFALLILSLALSTTIRQPEQK 393 L GGF + F+ AL L+ + K Sbjct: 152 GLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 27.8 bits (62), Expect = 0.039 Identities = 19/90 (21%), Positives = 37/90 (41%), Gaps = 13/90 (14%) Query: 119 SMYNEFGDSTTTLTDPLWHASVSSLGWRVDSRLGDLRPWAQISYNQQFGENIWKAQSGLS 178 S+ + D+ + H S + + + R G++ P ++SY F + Sbjct: 228 SVAVQQQDAKLV-EENYSHNSQTEVAATLAYRFGNVTP--RVSYAHGFKGSF-------- 276 Query: 179 RMTATNQNGNWLDVTVGADMLLNQNIAAYA 208 ATN N ++ V VGA+ ++ +A Sbjct: 277 --DATNYNNDYDQVVVGAEYDFSKRTSALV 304
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.9 bits (80), Expect = 5e-05 Identities = 16/52 (30%), Positives = 22/52 (42%), Gaps = 5/52 (9%) Query: 76 VAPKAVRRGIGKALMQYV-----QQRYPHLMLEVYQKNQPAIDFYRAQGFHI 122 VA ++G+G AL+ + + LMLE N A FY F I Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 113 bits (284), Expect = 3e-32 Identities = 41/122 (33%), Positives = 61/122 (50%), Gaps = 11/122 (9%) Query: 108 LNMPNNVTFDSSSATLKPAGANTLTGVAMVLKEY--PKTAVNVIGYTDSTGGHDLNMRLS 165 + ++V F+ + ATLKP G L + L +V V+GYTD G N LS Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274 Query: 166 QQRADSVASALIIQGVDASRIRTQGLGPANPIASNSTAEGK---------AQNRRVEITL 216 ++RA SV LI +G+ A +I +G+G +NP+ N+ K A +RRVEI + Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334 Query: 217 SP 218 Sbjct: 335 KG 336
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 59.9 bits (145), Expect = 6e-12 Identities = 41/184 (22%), Positives = 81/184 (44%), Gaps = 1/184 (0%) Query: 5 RNVNLLLMLVLLVAVGQMAQTIYIPAIADMARDLNVREGAVQSVMGAYLLTYGVSQLFYG 64 R+ +L+ L +L + + + ++ D+A D N + V A++LT+ + YG Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 65 PISDRVGRRPVILVGMSIFMLATLVA-VTTSSLTVLIAASAMQGMGTGVGGVMARTLPRD 123 +SD++G + ++L G+ I +++ V S ++LI A +QG G + + Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVAR 130 Query: 124 LYERTQLRHANSLLNMGILVSPLLAPLIGGLLDTMWNWRACYLFLLVLCAGVTFSMARWM 183 + A L+ + + + P IGG++ +W L ++ V F M Sbjct: 131 YIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLK 190 Query: 184 PETR 187 E R Sbjct: 191 KEVR 194