>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 29.3 bits (65), Expect = 0.030 Identities = 32/118 (27%), Positives = 49/118 (41%), Gaps = 5/118 (4%) Query: 87 QAEVTVTLSGVLERGGKLDARRTLALARIDSLAPQREIARLDLLAETARRYLAITAAIRQ 146 Q LS ++ R G +L R DS P + A R+ +A AAI Sbjct: 615 QESAKAQLSILINRSGSWADVARQSLQRFDSTRPVVKFGTEQYTA-IHRQMMAAHAAITL 673 Query: 147 REIAELDIEQRKRTVDAARRRLEAGASPESVVLTAKAALAEAELDRDRAAQAERTARL 204 +E++E + R TVD+ ++ G S L + + + E R+ AER RL Sbjct: 674 QEVSEFTDDMRNFTVDSIPLLIQLGRSS----LMDEHLVEQREKLRELTTIAERLNRL 727
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 767 bits (1983), Expect = 0.0 Identities = 247/1058 (23%), Positives = 436/1058 (41%), Gaps = 57/1058 (5%) Query: 5 IIRTSIANRWLVMTMTVVLIAIGVWSFNQLPIDATPDITNVQVQVNTAAPGYSPLEAEQR 64 + I + ++L+ G + QLP+ P I V V+ PG + Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 65 VTYPIETAMAGLPKMENFRSIS-RYGLSQITVVFKDGTDIYFARQQVAERLQQVKSQIPA 123 VT IE M G+ + S S G IT+ F+ GTD A+ QV +LQ +P Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 124 NIEPTLGPIATGMGEIFSYTIDADPKAKKTDGTPYTATDLRTLQDWVIRPQLRNIPGVTE 183 ++ + +D T D+ ++ L + GV + Sbjct: 121 EVQQQGISVEKSSSSYLMVA------GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGD 174 Query: 184 VNTLGGYKREVHITPDPSRLRSLGLTLDDVVKALQLNNQNVGAGYIER----NGQQFLVR 239 V G + + I D L LT DV+ L++ N + AG + GQQ Sbjct: 175 VQLFGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233 Query: 240 IP--GQVADISQIEQVVL-ARREGAVIRMRDVAKVADGAELRTGAATQNGHEVVLGTVVM 296 I + + + +V L +G+V+R++DVA+V G E A NG + + Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293 Query: 297 LIGSNSRDVSQAAAAKLKDAAKSLPAGVTATPVYDRTKLVDRTIATVAKNLTEGAVLVIV 356 G+N+ D ++A AKL + P G+ YD T V +I V K L E +LV + Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353 Query: 357 ILFLLLGNFRAALITALVIPLAMLFTLTGMARGGISANLMSLG--ALDFGLIVDGAVIII 414 +++L L N RA LI + +P+ +L T +A G S N +++ L GL+VD A++++ Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413 Query: 415 ENCLSRFGHRQHELGRQMTLAERFETTASATAEVIRPSLFGLGIITAVYLPIFALTGVEG 474 EN E E T + +++ + +++AV++P+ G G Sbjct: 414 ENV---------ERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 475 KMFHPMAITVVLALTGAMVLALTFVPAAIALMLGG---KVEEKENWLMGWLRR------- 524 ++ +IT+V A+ ++++AL PA A +L + E + GW Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVN 524 Query: 525 RYEPLLDMSLRRGKWVAVGAILLLACSGVLFTRLGSEFVPNLDEGDFAMQAMRIPGTSLT 584 Y + L + L++A VLF RL S F+P D+G F G + Sbjct: 525 HYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQE 584 Query: 585 QSVNMQRLLEQRLLKVPEIERVFSKIGTAEVASDPMPPSIGDTFVMVKPRDQWPDPDKPK 644 ++ + + LK E V S + + G FV +KP ++ + Sbjct: 585 RTQKVLDQVTDYYLKN-EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSA 643 Query: 645 AELVAQVEQLVARVPGNNYEFTQPIQM-RTNELISGVRADVA-INVYGDDLATLLKIGQQ 702 ++ + + + ++ F P M EL + D I+ G L + Q Sbjct: 644 EAVIHRAKMELGKIRDG---FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQ 700 Query: 703 IEAVAKKVTGA-ADVRVEQASGLPLLEVVPNRLALASYGLTTDDVQSTVATAVGGEVAGK 761 + +A + + VR ++ ++ + G++ D+ T++TA+GG Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760 Query: 762 LFEGDRRFDVVVRLPESLRQDPAALESLPIPLGPASDPQASGVSGPRTIPLSSVAKVVAS 821 + R + V+ R P ++ L + A+G +P S+ Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYV-------RSANGE----MVPFSAFTTSHWV 809 Query: 822 EGANQINRYNGKRRIAVTANVRDRDLGGFVSELQGVINANVQPPSGYWIEYGGSFEQLIS 881 G+ ++ RYNG + + G L + N + P+G ++ G Q Sbjct: 810 YGSPRLERYNGLPSMEIQGEAAPGTSSGDAMAL--MENLASKLPAGIGYDWTGMSYQERL 867 Query: 882 ASKRLAIVVPATLVIIFALLFWAFRSVKDSAIVFSGVPLALTGGILALTVRGIPLSISAG 941 + + +V + V++F L + S V VPL + G +LA T+ + Sbjct: 868 SGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFM 927 Query: 942 IGFIALSGVAVLNGLVLISFIRGLRE-QGEPLESAVREGALSRLRPVLMTAFVASLGFVP 1000 +G + G++ N ++++ F + L E +G+ + A RLRP+LMT+ LG +P Sbjct: 928 VGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLP 987 Query: 1001 MALNVGAGSEVQRPLATVVIGGIVSSTLLTLVVLPVLY 1038 +A++ GAGS Q + V+GG+VS+TLL + +PV + Sbjct: 988 LAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFF 1025 Score = 91.1 bits (226), Expect = 1e-20 Identities = 73/522 (13%), Positives = 169/522 (32%), Gaps = 40/522 (7%) Query: 3 ERIIRTSIANRWLVMTMTVVLIAIGVWSFNQLPIDATPDITNVQVQVNTAAPGYSPLEAE 62 + + + + + +++A V F +LP P+ P + E Sbjct: 527 TNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERT 586 Query: 63 QRVTYPIETAMAGLPKMENFRSISRYG-----------LSQITVV-FKDGTDIYFARQQV 110 Q+V + K + G ++ +++ +++ + + V Sbjct: 587 QKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAV 646 Query: 111 AERLQQVKSQIP-ANIEPTLGPIATGMGEIFSYTIDADPKAKKTDGTPYTATDLRTLQDW 169 R + +I + P P +G + D + G + A L ++ Sbjct: 647 IHRAKMELGKIRDGFVIPFNMPAIVELGTATGF----DFELIDQAGLGHDA--LTQARNQ 700 Query: 170 VIRPQLRNIPGVTEVNTLGGYKR-EVHITPDPSRLRSLGLTLDDVVKALQLNNQNVGAGY 228 ++ ++ + V G + + D + ++LG++L D+ + + Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760 Query: 229 IERNGQQFLVRIPGQ---VADISQIEQVVLARREGAVIRMRDVAKVADGAELRTGAATQN 285 G+ + + ++++ + G ++ + + Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVY-----GSPRL 815 Query: 286 GHEVVLGTVVMLIGSNSRDVSQAAAAKLKDAAKSLPAGVTATPVYDRTKLVDRTIATVAK 345 L ++ + + S A A +++ A LPAG+ + + Sbjct: 816 ERYNGLPSMEIQGEAAPGTSSGDAMALMENLASKLPAGIG-YDWTGMSYQERLSGNQAPA 874 Query: 346 NLTEGAVLVIVILFLLLGNFRAALITALVIPLAMLFTLTGMARGGISANLMSLGAL--DF 403 + V+V + L L ++ + LV+PL ++ L ++ + L Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934 Query: 404 GLIVDGAVIIIENCLSRFGHRQHELGRQMTLAERFETTASATAEVIRPSLFGLGIITAVY 463 GL A++I+E + G+ E T A +RP L Sbjct: 935 GLSAKNAILIVE----FAKDLMEKEGK-----GVVEATLMAVRMRLRPILMTSLAFILGV 985 Query: 464 LPIFALTGVEGKMFHPMAITVVLALTGAMVLALTFVPAAIAL 505 LP+ G + + I V+ + A +LA+ FVP + Sbjct: 986 LPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 453 bits (1168), Expect = e-145 Identities = 233/1039 (22%), Positives = 430/1039 (41%), Gaps = 63/1039 (6%) Query: 13 LTLFTALLVLVGGVLTFLNFPSQEEPSVTIRDAVVQLAYPGMPTEKVETLLARPVEENLR 72 A+++++ G L L P + P++ V YPG + V+ + + +E+N+ Sbjct: 11 FAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIEQNMN 70 Query: 73 SLAGIKKIV-TTVRPGSAILQITAHDSVADLPALWLRVRAKAAEVGGALPAG---TMGPF 128 + + + T+ GS + +T D ++V+ K LP Sbjct: 71 GIDNLMYMSSTSDSAGSVTITLTFQSGT-DPDIAQVQVQNKLQLATPLLPQEVQQQGISV 129 Query: 129 VDDDFGRVSVASIAVTAPGFSMSEMRGPL-RKMREQLYTLPGVERVSLYGLQEDRIYIAF 187 + VA PG + ++ + +++ L L GV V L+G + + I Sbjct: 130 EKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-AQYAMRIWL 188 Query: 188 DRVRLVEAGLSPASVIDQLRRQNVVVPGGLVSASGMA------MTVATSGEVGNVQALKQ 241 D L + L+P VI+QL+ QN + G + + ++ N + + Sbjct: 189 DADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFGK 248 Query: 242 VLINTQGSGGAREIALGALAQVQVMPADPPETAAIYQGQPAVVVAVSMASGYNVVSFGKA 301 V + G + L +A+V + + A G+PA + + +A+G N + KA Sbjct: 249 VTLRVNSDGS--VVRLKDVARV-ELGGENYNVIARINGKPAAGLGIKLATGANALDTAKA 305 Query: 302 LREKLVETASLLPTGFQQHVVTFQADVVDREMSKMHQVMGETVVIVMAVVMLFLG-WRTG 360 ++ KL E P G + V + ++ + + E +++V V+ LFL R Sbjct: 306 IKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRAT 365 Query: 361 LIVGAIVPLTILGSLILMRVLNVELQTVSIAAIILALGLLVDNGIVIAEDIERRL-MAGE 419 LI VP+ +LG+ ++ + T+++ ++LA+GLLVD+ IV+ E++ER + Sbjct: 366 LIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKL 425 Query: 420 ERFHACVEAGRTLAIPLLTSSLVIVLAFSPFFFGQTSTNEYMRSLAVVLAITLLGSWLLS 479 A ++ + L+ ++V+ F P F ST R ++ + + S L++ Sbjct: 426 PPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVA 485 Query: 480 ITVTPLLCLYFARAHAGEHNEDN------YNSKFYRA---YRAVIERLLEFKLLFISTML 530 + +TP LC + + EH+E+ +N+ F + Y + ++L ++ Sbjct: 486 LILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYA 545 Query: 531 LALGGAVVVLSSIPYDFLPKSDRLQFQIPVTLRAGADSRQTLQSVETMSRW-LADKRANP 589 L + G VV+ +P FLP+ D+ F + L AGA +T + ++ ++ + L +++AN Sbjct: 546 LIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANV 605 Query: 590 EIVDSIGYVADGGPRIVLGLNPPLPGSNIAYFTVSVKPKTD-------IDQVIDRVRQYV 642 E V ++ + G N VS+KP + + VI R + + Sbjct: 606 ESVFTVNGFSFSGQ-----------AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMEL 654 Query: 643 RKTLPDVRAEPKR----FSLG-ATEAGVAVYRVTGADEQVLRTAADQIADALRSLPGTL- 696 + D P LG AT + G L A +Q+ P +L Sbjct: 655 -GKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713 Query: 697 DVTDDWQARIPRYIVQVDQLKARRAGVSSDDIAQALQLRYSGVPASQIRDDGVDVPILLR 756 V + ++ ++VDQ KA+ GVS DI Q + G + D G + ++ Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773 Query: 757 GDAGERAGNGSPAD--TLVYPQSGGKPLPLSAIASIQHDSEPSTLMRRNLERAITVTGRN 814 DA R P D L + G+ +P SA + L R N ++ + G Sbjct: 774 ADAKFR---MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEA 830 Query: 815 PDSTTSEMVAALADKIAKITLPPGYRIELGGEIEDSAEANQALLEYMPHALGAILLLFIW 874 T+S AL + +A LP G + G + + + + L Sbjct: 831 APGTSSGDAMALMENLAS-KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAA 889 Query: 875 QFNSFRKLFIVVSAIPFVLIGAALALLVTGYPFGFMATFGLLALAGIIVNNAVLLLERI- 933 + S+ V+ +P ++G LA + GLL G+ NA+L++E Sbjct: 890 LYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAK 949 Query: 934 EAELAEGLPRREAVISAAVKRLRPIVMTKLTCIVGLIPLMLFAGP---LWTGMAITMIGG 990 + EG EA + A RLRPI+MT L I+G++PL + G + I ++GG Sbjct: 950 DLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGG 1009 Query: 991 LALGTLVTLGMIPILYDVL 1009 + TL+ + +P+ + V+ Sbjct: 1010 MVSATLLAIFFVPVFFVVI 1028 Score = 108 bits (272), Expect = 4e-26 Identities = 82/418 (19%), Positives = 161/418 (38%), Gaps = 28/418 (6%) Query: 619 AYFTVSVKPKTDIDQVIDRVR---QYVRKTLPDVRAEPKRFSLGATEAGVAVYRVTGAD- 674 T++ + TD D +V+ Q LP + ++ + + V + Sbjct: 88 VTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNP 147 Query: 675 ----EQVLRTAADQIADALRSLPGTLDVTDDWQARIPRYIVQVDQLKARRAGVSSDDIAQ 730 + + A + D L L G DV R + D L ++ D+ Sbjct: 148 GTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQYAMRIWLDADLLNKY--KLTPVDVIN 205 Query: 731 ALQLRYSGVPASQIRDDGVDVPILLRGDAGERAGNGSPAD---TLVYPQSGGKPLPLSAI 787 L+++ + A Q+ L + +P + + S G + L + Sbjct: 206 QLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDV 265 Query: 788 ASIQHDSEP-STLMRRNLERAITVT-GRNPDSTTSEMVAALADKIAKI--TLPPGYRIEL 843 A ++ E + + R N + A + + + A+ K+A++ P G ++ Sbjct: 266 ARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVL- 324 Query: 844 GGEIEDSAEANQALLEYMPHAL-GAILLLFIWQF---NSFRKLFIVVSAIPFVLIGAALA 899 D+ Q + + L AI+L+F+ + + R I A+P VL+G Sbjct: 325 --YPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAI 382 Query: 900 LLVTGYPFGFMATFGLLALAGIIVNNAVLLLERIEAELAE-GLPRREAVISAAVKRLRPI 958 L GY + FG++ G++V++A++++E +E + E LP +EA + + + Sbjct: 383 LAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGAL 442 Query: 959 VMTKLTCIVGLIPLMLFAG---PLWTGMAITMIGGLALGTLVTLGMIPILYDVLFGLR 1013 V + IP+ F G ++ +IT++ +AL LV L + P L L Sbjct: 443 VGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPV 500 Score = 89.1 bits (221), Expect = 5e-20 Identities = 87/526 (16%), Positives = 183/526 (34%), Gaps = 50/526 (9%) Query: 2 NLTRSALASSRLTLFTALLVLVGGVLTFLN-----FPSQEEPSVTIRDAVVQLAYPGMPT 56 N L S+ L L++ G V+ FL P +++ ++QL G Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLT---MIQLP-AGATQ 583 Query: 57 EKVETLLARPVEENLRSLAGIKKIVTTVR--------PGSAILQIT------AHDSVADL 102 E+ + +L + + L++ + V TV + + ++ + Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSA 643 Query: 103 PALWLRVRAKAAEVGGALPAGTMGPFVDD--DFGRVSVASIAVTAPGF-SMSEMRGPLRK 159 A+ R + + ++ P + + I G ++++ R L Sbjct: 644 EAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLG 703 Query: 160 MREQLYTLPGVERVSLYGLQ-EDRIYIAFDRVRLVEAGLSPASVIDQL------RRQNVV 212 M Q + V GL+ + + D+ + G+S + + + N Sbjct: 704 MAAQH--PASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDF 761 Query: 213 VPGGLVSASGMAMTVATSGEVGNVQALKQVLINTQGSGGAREIALGALAQVQVMPADPPE 272 + G V + A + + + ++ + S + A + Sbjct: 762 IDRGRVKKLYVQ---ADAKFRMLPEDVDKLYVR---SANGEMVPFSAFTTSHWVYG--SP 813 Query: 273 TAAIYQGQPAVVVAVSMASGYNVVSFGKALREKLVETASLLPTGFQQHVVTFQADVVDRE 332 Y G P++ + A G AL E L LP G + T + Sbjct: 814 RLERYNGLPSMEIQGEAAPGT-SSGDAMALMENLASK---LPAGIG-YDWTGMSYQERLS 868 Query: 333 MSKMHQVMGETVVIV-MAVVMLFLGWRTGLIVGAIVPLTILGSLILMRVLNVELQTVSIA 391 ++ ++ + V+V + + L+ W + V +VPL I+G L+ + N + + Sbjct: 869 GNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMV 928 Query: 392 AIILALGLLVDNGIVIAEDI-ERRLMAGEERFHACVEAGRTLAIPLLTSSLVIVLAFSPF 450 ++ +GL N I+I E + G+ A + A R P+L +SL +L P Sbjct: 929 GLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPL 988 Query: 451 FFGQTSTNEYMRSLAVVLAITLLGSWLLSITVTPLLCLYFARAHAG 496 + + ++ + + ++ + LL+I P+ + R G Sbjct: 989 AISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRCFKG 1034
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 47.5 bits (113), Expect = 4e-08 Identities = 16/105 (15%), Positives = 33/105 (31%) Query: 66 GGRIKAIYVDVGDRVREGQLLAQLDLEPARLRLQQAQANAASAAADLRERKIQLDQQTAM 125 +K I V G+ VR+G +L +L A + Q++ A + +I Sbjct: 104 NSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN 163 Query: 126 FTDGATSQATLTTATVAADAARARLQVAESDRALAQRALRQADIR 170 V+ + + + + Q Q ++ Sbjct: 164 KLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELN 208 Score = 34.0 bits (78), Expect = 8e-04 Identities = 20/140 (14%), Positives = 42/140 (30%), Gaps = 14/140 (10%) Query: 89 LDLEPARLRLQQAQANAASAAADLRERKIQLDQQTAMFTDGATSQAT--LTTATVAADAA 146 L+ E + S + + ++ + T ++ L T Sbjct: 255 LEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL 314 Query: 147 RARLQVAESDRALAQRALRQADIRAPFDGNVVARLQQPHVD--VPAGQGVLQLEGQGRTQ 204 L E + + IRAP V +L+ V + ++ + + T Sbjct: 315 TLELAKNEER-------QQASVIRAPVSV-KVQQLKVHTEGGVVTTAETLMVIVPEDDTL 366 Query: 205 VV-ALLPP-QVADLSPGSTV 222 V AL+ + ++ G Sbjct: 367 EVTALVQNKDIGFINVGQNA 386
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 85.7 bits (212), Expect = 2e-21 Identities = 34/115 (29%), Positives = 59/115 (51%), Gaps = 1/115 (0%) Query: 1 MAAKKVLVVEDDADSASVLEAYLRREGFDVAIAADGIRAVQLHAQWKPDLVLLDMMLPAL 60 M +LV +DDA +VL L R G+DV I ++ + A DLV+ D+++P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 SGIEVLSAIRLVG-DTPVIMVTAIGDEPEKLGALRYGADDYVVKPYSPKEVVARV 114 + ++L I+ D PV++++A + A GA DY+ KP+ E++ + Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 30.2 bits (68), Expect = 0.012 Identities = 22/84 (26%), Positives = 33/84 (39%), Gaps = 6/84 (7%) Query: 220 AQGRQLLKRGGRF-VDIHPTPAKFLQSVFNSTLKVVVCKPRKEILKKVAVAAQDGLLKTT 278 Q RQLL+ G + + +S + RK L +A L T Sbjct: 26 RQARQLLRERGLVPLSVDENRGDQQKS-----GSTGLSLRRKIRLSTSDLALLTRQLATL 80 Query: 279 VGASVPLKAAIDLLAQLENGKRLG 302 V AS+PL+ A+D +A+ L Sbjct: 81 VAASMPLEEALDAVAKQSEKPHLS 104
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 27.5 bits (61), Expect = 0.021 Identities = 13/71 (18%), Positives = 27/71 (38%), Gaps = 6/71 (8%) Query: 75 KAKKAAPS-SSTIKVLRQEIAELKKLKAELITVIASQRAELDRARKSLIELGADPIVRSM 133 K KK+ + + + +E+ L + + A E++ +K LIE G ++ Sbjct: 392 KLKKSEEAANEQLLQNEEELNYLYSVLTNINN--ADNYDEIEEIKKELIETG---YIKFK 446 Query: 134 QRNFRNKRAKE 144 + K Sbjct: 447 KIYKSKKSKTS 457
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 183 bits (467), Expect = 1e-57 Identities = 88/346 (25%), Positives = 136/346 (39%), Gaps = 42/346 (12%) Query: 5 LVTGGAGFIGGNFVLEAVARGVRVVNLDALT--YAGNLNTL-ASLDGNPDHVFVKGDIGD 61 LVTG AGFIG + + G +VV +D L Y +L L P F K D+ D Sbjct: 4 LVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLAD 63 Query: 62 GPLVASLLHEHQPDAVLNFAAESHVDRSIEGPGAFIQTNVVGTLALLEAVRDHWKALPKE 121 + L + V V S+E P A+ +N+ G L +LE R Sbjct: 64 REGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRH-------- 115 Query: 122 RQDAFRFLHVSTDEVYGTLGETGKFTETTPYA-PNSPYSASKAASDHLVRAFRHTYGLPV 180 L+ S+ VYG L F+ P S Y+A+K A++ + + H YGLP Sbjct: 116 -NKIQHLLYASSSSVYG-LNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173 Query: 181 LTTNCSNNYGPYHFPEKLIPLVIAKALAGEPLPVYGDGKQVRDWLFVSDHCEAIRTVL-- 238 YGP+ P+ + L G+ + VY GK RD+ ++ D EAI + Sbjct: 174 TGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDV 233 Query: 239 ----------------AKGKVGETYNVGGNSERQNIEVVQAICALLDQHRPRDDGKPRAS 282 A YN+G +S + ++ +QA+ L + Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG----------IEA 283 Query: 283 QITYVTDRPGHDRRYAIDASKLKNELGWEPAYTFEQGIAQTVHWYL 328 + + +PG + D L +G+ P T + G+ V+WY Sbjct: 284 KKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYR 329
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 56.6 bits (136), Expect = 1e-11 Identities = 48/204 (23%), Positives = 78/204 (38%), Gaps = 7/204 (3%) Query: 4 VLIIGATSAIAEATARRYAARGAAVHLLGRQAPRLETIAADLTTRGARTSIGVLDVNDNA 63 I GA I EA AR A++GA + + +LE + + L DV D+A Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70 Query: 64 RHGEVLDAAWAALGGVDVVLIAHGTLPDQAACNASVELSLREFATNGTSTIALCAALVPR 123 E+ +G +D+++ G L + S E F+ N T ++ Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130 Query: 124 LTS--GATLAVISSVAGDRGRASNYLYGSAKAAVTAYLSGLGQRLRPQGINVLTIKPGFV 181 + ++ + S R S Y S+KAA + LG L I + PG Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGST 190 Query: 182 DTPMTAAFKKGALWAKPDQIAKGI 205 +T M + +LWA + + I Sbjct: 191 ETDM-----QWSLWADENGAEQVI 209
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 51.3 bits (123), Expect = 2e-09 Identities = 62/293 (21%), Positives = 114/293 (38%), Gaps = 60/293 (20%) Query: 7 KIVLTGAAGLVGQNLIVEMKQQGYTQLVAIDK---------HEHNLEILRKLHPDVKTIL 57 K ++TGAAG +G ++ + + G+ Q+V ID + LE+L + P + Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGH-QVVGIDNLNDYYDVSLKQARLELLAQ--PGFQFHK 58 Query: 58 ADLAEPGVWSEAFA--GARLIVQLHAQITGKF-----RTLFDRNNLQATENVLKACVDHQ 110 DLA+ ++ FA + ++ ++ D NL N+L+ C ++ Sbjct: 59 IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADS-NLTGFLNILEGCRHNK 117 Query: 111 IPYMVHISSSVV------NSVATDD--------YTETKKLQEALV----RNSGIPHCVLR 152 I ++++ SSS V +TDD Y TKK E + G+P LR Sbjct: 118 IQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLR 177 Query: 153 PTLMFG-WFDPK--HLGWLSRFMARTPVFPIPGDGKFMRQPLYERDFCRCIVQCIEREPA 209 ++G W P + + + + GK R Y D I++ + P Sbjct: 178 FFTVYGPWGRPDMALFKFTKAMLEGKSI-DVYNYGKMKRDFTYIDDIAEAIIRLQDVIPH 236 Query: 210 GD------------------IYDIVGATRVDYVDIIRTIKRAKQLRTVIVHIP 244 D +Y+I ++ V+ +D I+ ++ A + +P Sbjct: 237 ADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLP 289
>PF02370#M protein repeat Length = 168 Score = 30.5 bits (68), Expect = 0.006 Identities = 20/98 (20%), Positives = 38/98 (38%), Gaps = 5/98 (5%) Query: 224 QLGKLIGANIDIGTLAQEQGRLIGLVNEHQAARQIINQEVVDLKANLEQRIDALHRA--N 281 Q L+G N D L + +G+ + E + R+ + + Q D ++ Sbjct: 49 QYRALMGENQD---LRKREGQYQDKIEELEKERKEKQERPERREKFERQHQDKHYQEQQK 105 Query: 282 LLGEQLSQTQEILALREKDNQELNASLLSMTRELERMR 319 ++ Q + K+ Q +AS + R+LE R Sbjct: 106 KHQQEQQQLEAEKQKLAKEKQISDASRQGLNRDLEASR 143
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 36.8 bits (85), Expect = 4e-05 Identities = 26/89 (29%), Positives = 39/89 (43%) Query: 151 LTVLYFPLVIFPLVLVSAGVTWFFAALGVYYRDIGQITGLLATVLLFMSPALYPVSSLPP 210 L++LY VI L A + AL Y L+ T +LF+S A++PV LP Sbjct: 145 LSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPI 204 Query: 211 SMQKLIYLNPLTFIIEQSRNVLMWGLPPD 239 Q PL+ I+ R +++ D Sbjct: 205 VFQTAARFLPLSHSIDLIRPIMLGHPVVD 233
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 31.1 bits (70), Expect = 0.004 Identities = 17/73 (23%), Positives = 23/73 (31%), Gaps = 8/73 (10%) Query: 198 QGQYLNTSW-GDFGDYDGDLSRANAIAEYRFTKNFGIFAGYDWFKLDVDREGSDGLVGLK 256 QY +T + + G + A A Y+ G GYDW G G Sbjct: 36 WSQYHDTGFINNNGPTHENQLGAGAFGGYQVNPYVGFEMGYDWL-------GRMPYKGSV 88 Query: 257 QEFKGPVAGVTLA 269 + GV L Sbjct: 89 ENGAYKAQGVQLT 101
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 74.5 bits (183), Expect = 3e-16 Identities = 34/118 (28%), Positives = 56/118 (47%), Gaps = 1/118 (0%) Query: 460 LVFEDMDTNRLVIGNLLTRAGHRVSFQVDGTDAVQRIREAAPDLVFLDLHMPGTSGWDAL 519 LV +D R V+ L+RAG+ V + + I DLV D+ MP + +D L Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLL 66 Query: 520 REARDAMSALPPIIVLTADTRTDSMRDASAAGVAGYLPKPINAHELLALLAQHASHAR 577 + A L P++V++A + AS G YLPKP + EL+ ++ + + + Sbjct: 67 PRIKKARPDL-PVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 94.0 bits (233), Expect = 5e-25 Identities = 65/259 (25%), Positives = 104/259 (40%), Gaps = 13/259 (5%) Query: 11 NPSPLQDRVVVITGGAQGIGRGIAQAVLGAGGSVVIGDLDADAGKACLQ-EWALPRRSAF 69 N ++ ++ ITG AQGIG +A+ + G + D + + + + A R + Sbjct: 2 NAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA 61 Query: 70 VRCDAARQAQATRLIEAALKRFGRIDGLVNNAGVPDPHIAALPQLDWDTWNSRLS-SLHG 128 D A + + G ID LVN AGV + L + W + S + G Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVAGVL--RPGLIHSLSDEEWEATFSVNSTG 119 Query: 129 AFLCSKQALPALRQAPEGGAIINIASTRAWQSEPHSEAYAAAKGGLVAFTHALALSEGPH 188 F S+ + G+I+ + S A AYA++K V FT L L + Sbjct: 120 VFNASRSVSKYMM-DRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEY 178 Query: 189 -VRVNSISPGWISTEAWRA--PQRRRAPKLSRRDHAQH----PAGRVGTPEDIAQLAVYL 241 +R N +SPG T+ + A ++ + P ++ P DIA AV Sbjct: 179 NIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIAD-AVLF 237 Query: 242 LAPQLSGFVTGQDFIVDGG 260 L +G +T + VDGG Sbjct: 238 LVSGQAGHITMHNLCVDGG 256
>INTIMIN#Intimin signature. Length = 939 Score = 31.2 bits (70), Expect = 0.026 Identities = 10/30 (33%), Positives = 18/30 (60%) Query: 194 GPGTYTITAVATDNNGNTGNSQAVSVSITQ 223 G Y +TA A D NGN+ N+ +++++ Sbjct: 521 GSNVYKVTARAYDRNGNSSNNVLLTITVLS 550
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 115 bits (290), Expect = 2e-33 Identities = 75/251 (29%), Positives = 120/251 (47%), Gaps = 8/251 (3%) Query: 16 GKIALVTGGSSGIGLAAAKRLALEGATVV---ISGRRQQELDRAVAEIGHGATAVRADIS 72 GKIA +TG + GIG A A+ LA +GA + + + +++ ++ A A AD+ Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 73 VGAELDAVMDGIATAHGRLDLLLANAGGGEFAPIESITEAGFDKYFNINVKGTLLTVQKA 132 A +D + I G +D+L+ AG I S+++ ++ F++N G + Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 133 LPLMGA--GSAIVVTGSIAANQGVSNFGVYAATKAALRSFVRTWASELRARDIRVNLIAP 190 M +IV GS A ++ YA++KAA F + EL +IR N+++P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 191 GVVVTPAYRS---ELGMSEEDIDAYLDQIKQKAPLGRSASPDEMAKAMSFLASDDASYIT 247 G T S + +E+ I L+ K PL + A P ++A A+ FL S A +IT Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 248 GIELTVDGGLT 258 L VDGG T Sbjct: 248 MHNLCVDGGAT 258
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 37.6 bits (87), Expect = 7e-06 Identities = 15/61 (24%), Positives = 25/61 (40%), Gaps = 1/61 (1%) Query: 82 SVEHSIYVHRDHRGKGLGRLLLQALIAAAQARGVHVLVGGIDASNQASIALHEQFGFTHA 141 +E I V +D+R KG+G LL I A+ L+ N ++ + + F Sbjct: 91 LIED-IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIG 149 Query: 142 G 142 Sbjct: 150 A 150
>PF07520#Virulence protein SrfB Length = 1041 Score = 30.7 bits (69), Expect = 0.024 Identities = 30/140 (21%), Positives = 45/140 (32%), Gaps = 33/140 (23%) Query: 303 ADSDADSGFR-----NIGFNDYLSQLQAQRSPMDSRPQVAVVVAAGEISGGEQPAGRIGG 357 + D + R IG QL +R + P + A I AG+I Sbjct: 922 SAQDPTAIVRMHSPVYIGAR----QLPLERWT--TTPLYRLDFANDSI------AGKIKL 969 Query: 358 ESTAALLRQARDDDEVKAVVLRVDSPGGEVFASEQIRREVV---ALKQAGKPV-----VV 409 L+R+ D DE E +E++R A G + V+ Sbjct: 970 PVKVELVREDDDFDE--------AETSLEKLRAERVREVFRVDAAEDAEGTMIKNDDVVL 1021 Query: 410 SMGDLAASGGYWISMNADRI 429 S+ L YW+ RI Sbjct: 1022 SLHTLGFEDEYWLDTGVFRI 1041
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 82.4 bits (203), Expect = 2e-19 Identities = 31/193 (16%), Positives = 71/193 (36%), Gaps = 40/193 (20%) Query: 110 LGSGVIIDAQKGYVLTNHHVIENADDVQVTL------------GDGRTVKAEFIGSDADT 157 + SGV++ K +LTN HV++ L +G + + Sbjct: 103 IASGVVVG--KDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEG 160 Query: 158 DIALIRIKAD--------NLTDIKLADSNALRVGDFVVAIGNPFG---FTQTVTSGIVSA 206 D+A+++ + + ++++ +V + G P T + G ++ Sbjct: 161 DLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMWESKGKITY 220 Query: 207 VGRSGIRGLGYQNFIQTDASINPGNSGGALVNLQGQLVGINTASFNPQGSMAGNIGLGLA 266 + +Q D S GNSG + N + +++GI+ + N + + Sbjct: 221 L---------KGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHWGGVPNE----FNGAVFIN 267 Query: 267 --IPSNLARNVVE 277 + + L +N+ + Sbjct: 268 ENVRNFLKQNIED 280
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 62.2 bits (151), Expect = 1e-13 Identities = 26/116 (22%), Positives = 50/116 (43%), Gaps = 2/116 (1%) Query: 2 IRVLLAEDQALLRGALVALLGLEDDIAVVGSAGDGESAWRELQRLQPDVLVTDIEMPGLT 61 +L+A+D A +R L L V + + WR + D++VTD+ MP Sbjct: 4 ATILVADDDAAIRTVLNQALSRAG--YDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLELAQRIQRQALPVRVMIVTTFARPGFLRRALDAGVAGYLLKDAPAEQLVDALRQ 117 +L RI++ + V++++ +A + G YL K +L+ + + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 31.2 bits (70), Expect = 0.006 Identities = 26/126 (20%), Positives = 34/126 (26%), Gaps = 18/126 (14%) Query: 46 PTVTGAGGPVRPAAAPTESTAAAG------------RASTAPAPSPSPASGPASVPTPAV 93 P V V T + A R AP P P+PA+ + T A Sbjct: 983 PEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAE 1042 Query: 94 AGP--SASTSAAPPAATTAAAQAPRGAAE----TSAAANAAAKPANGVATTAAQPSPAPS 147 S + AT AQ A E A +G T Q + Sbjct: 1043 NSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKE 1102 Query: 148 PPAAER 153 E+ Sbjct: 1103 TATVEK 1108
>PF03309#Bvg accessory factor Length = 271 Score = 114 bits (288), Expect = 4e-33 Identities = 57/252 (22%), Positives = 92/252 (36%), Gaps = 25/252 (9%) Query: 5 LFDLGNSRFKYAPLHGNRAGQ--VQAWAHGAE--------AMDAAALAALPSGRI--AYV 52 D+ N+ + G+ VQ W E A+ L + R+ A Sbjct: 4 AIDVRNTHTVVGLISGSGDHAKVVQQWRIRTEPEVTADELALTIDGLIGDDAERLTGASG 63 Query: 53 ASVAAPALTQRMIACLQERFTQVRIVRTTAECA-GIRIAYADPSRFGVDRFLALLGARG- 110 S P++ + L++ + V V GI + +P G DR + L A Sbjct: 64 LSTV-PSVLHEVRVMLEQYWPNVPHVLIEPGVRTGIPLLVDNPKEVGADRIVNCLAAYHK 122 Query: 111 -DAPVLVAGVGTALTIDVLGADGLHHGGRIAASPTTMREALHARAEQLPA---SGGDYVE 166 +V G+++ +DV+ A G GG IA +A AR+ L + V Sbjct: 123 YGTAAIVVDFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAALRRVELTRPRSV- 181 Query: 167 LAIDTDDALTSG----CDGAAVALIERSLQHAQRSLGVPVRLLVHGGGAPPLLPLLPDA- 221 + +T + + +G G L+ R G V ++ G AP +LP L Sbjct: 182 IGKNTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSGADVAVVATGHTAPLVLPDLRTVE 241 Query: 222 TFRAALVLDGLA 233 + L LDGL Sbjct: 242 HYDRHLTLDGLR 253
>PF07201#Hypersensitivity response secretion protein HrpJ Length = 293 Score = 30.2 bits (68), Expect = 0.013 Identities = 20/124 (16%), Positives = 39/124 (31%), Gaps = 11/124 (8%) Query: 177 GKGGLDARQAQILSQMYDSTPLAAAAREGLALRQQVTAQLREEME---QAG-RGAASART 232 GK + Q ++L + D+ L +Q + EE G R A Sbjct: 127 GKSEEPSEQFKMLCGLRDALKGRPELAHLSHLVEQALVSMAEEQGETIVLGARITPEAYR 186 Query: 233 FADETRRMATLMRERYRLGFVDVGG----WDT-HANQGSVEGGLANNLRNLGEGLAAYAD 287 + +R+ YR + G W + + + + + L + L+A Sbjct: 187 ESQSGVNPLQPLRDTYRDAVMGYQGIYAIWSDLQKRFPNGD--IDSVILFLQKALSADLQ 244 Query: 288 ALGP 291 + Sbjct: 245 SQQS 248
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 97.2 bits (242), Expect = 1e-25 Identities = 29/117 (24%), Positives = 58/117 (49%) Query: 2 RILLVEDEAPLRETLAARLKREGFAVDAAQDGEEGLYMGREVPFDVGIIDLGLPKMSGME 61 IL+ +D+A +R L L R G+ V + D+ + D+ +P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 LIKALRDEGKKFPVLILTARSSWQDKVEGLKQGADDYLVKPFHVEELLARVNALLRR 118 L+ ++ PVL+++A++++ ++ ++GA DYL KPF + EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 29.2 bits (65), Expect = 0.044 Identities = 12/79 (15%), Positives = 20/79 (25%), Gaps = 4/79 (5%) Query: 510 PPPRRPPEVRAGAATPVKKAAKKTARKVGKAPAKKLASASARRASAAAKPSQDAATSSGG 569 P P PE A ++K K K P +R + + + Sbjct: 78 PEPEPIPEPPKEAPVVIEKPKPKPKPK----PKPVKKVEQPKRDVKPVESRPASPFENTA 133 Query: 570 AAKKTVRKRATSKVAKAAG 588 A+ T + Sbjct: 134 PARPTSSTATAATSKPVTS 152
>PF06776#Invasion associated locus B Length = 214 Score = 33.8 bits (77), Expect = 4e-04 Identities = 16/66 (24%), Positives = 21/66 (31%) Query: 21 AAGAFARGGALQDTPARPAPQLLAANERRRAPDTVAVSLDAALAACSAAGRDPASLPSIF 80 A A+Q PA +P L + R + A A S D A Sbjct: 17 TNHAVPALKAIQMGPAELSPMLASCRRLARRNGARLMLAGAMAIALSFGWSDRADAQGAV 76 Query: 81 TSTYGD 86 S +GD Sbjct: 77 RSVHGD 82
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 117 bits (293), Expect = 5e-34 Identities = 80/249 (32%), Positives = 128/249 (51%), Gaps = 14/249 (5%) Query: 6 RRALVTGGSGDLGGAICRHLAAQGRHVIVHANRNLTRADEVVAAIVADGGSAQAVAFDVA 65 + A +TG + +G A+ R LA+QG H+ + N + ++VV+++ A+ A+A DV Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAA-VDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 66 DAQASAAALERLL-EAGPIQIVVNNAGIHDDAPMAGMNAEQWHRVIDVSLHGFFNVTQPL 124 D+ A R+ E GPI I+VN AG+ + ++ E+W V+ G FN ++ + Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 125 LLPMARTRWGRIVSVSSVAAVLGNRGQTNYAAAKAALHGASKSLSREMASRGIAVNVVAP 184 M R G IV+V S A + YA++KAA +K L E+A I N+V+P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 185 GVIESDM-----VGDSFAPEVIKQL-------VPAGRVGKPDEVAALVAFLCSEPAGYIN 232 G E+DM ++ A +VIK +P ++ KP ++A V FL S AG+I Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 233 GQVIGVNGG 241 + V+GG Sbjct: 248 MHNLCVDGG 256
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 47.5 bits (113), Expect = 2e-07 Identities = 31/155 (20%), Positives = 65/155 (41%), Gaps = 18/155 (11%) Query: 638 VLGALVLAALLLAVTVAIALRSPRRIVRVLLPMALTTVLILAILRGTGVELNLFHLIALI 697 V+ L A +L+ + + + L++ R + + + + + AIL G +N + ++ Sbjct: 340 VVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMV 399 Query: 698 LAAGLGLDYAL-----FFDHAGDDHADQLRTLH--------ALIVCSLMTLLVF---ALL 741 LA GL +D A+ +D AL+ +++ VF A Sbjct: 400 LAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFF 459 Query: 742 AASSIPVLRAIGSTVALGVLFNFILALLVSREPAL 776 S+ + R T+ + + ++AL+++ PAL Sbjct: 460 GGSTGAIYRQFSITIVSAMALSVLVALILT--PAL 492 Score = 41.0 bits (96), Expect = 2e-05 Identities = 31/163 (19%), Positives = 59/163 (36%), Gaps = 20/163 (12%) Query: 246 ARTQGEAQWIGTLDTVGLVLLLLVAYRSWKIPVLGVLPLASAGLAGLGAVALLFDGVHGI 305 + +A + + V + L L Y SW IPV +L + + L A LF+ + + Sbjct: 866 RLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAA-TLFNQKNDV 924 Query: 306 TVAFGF-TLIGVVQ-------DYPIHLFSHQRPGLDPRENARH-----LWPTLATGVVST 352 G T IG+ ++ L ++ G E L P L T ++ Sbjct: 925 YFMVGLLTTIGLSAKNAILIVEFAKDL--MEKEGKGVVEATLMAVRMRLRPILMT-SLAF 981 Query: 353 CIAYVTFLFSGVDG---LRQLAVFTIAGLATAAVTTRWLLPAL 392 + + S G + + + G+ +A + + +P Sbjct: 982 ILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVF 1024
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 35.3 bits (81), Expect = 5e-04 Identities = 22/106 (20%), Positives = 37/106 (34%), Gaps = 26/106 (24%) Query: 237 SLSAPISAQAAQWGIAPATPPDAAPVPPPVRLKQQLSAQERAGLLRVDEQADGQTRVRLS 296 LS +S + Q AP P AP P L Sbjct: 182 MLSLGVSYRFGQGEAAPVVAPAPAPAPEVQT-----------------------KHFTLK 218 Query: 297 SAAMFASGGVEVEPQQRGLIAQIAAAIEQL---PGRVIVVGHTDDV 339 S +F ++P+ + + Q+ + + L G V+V+G+TD + Sbjct: 219 SDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI 264
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 37.8 bits (87), Expect = 2e-04 Identities = 34/149 (22%), Positives = 62/149 (41%), Gaps = 31/149 (20%) Query: 149 HPAIAQIHDVGTDAHG---QPYLVMEYLRGEPITWWCDEHRLSL-----HARV------- 193 HP +A +H + +G + L+M+ + G W C + +L ++ Sbjct: 190 HPNLANVHGMAVVPYGNRKEEALLMDEVDG----WRCSDTLRTLADSWKQGKINSEAYWG 245 Query: 194 ---LLMLRVSEAVQHAHQKGVIHRDLKPSNVLVSEIDGRPMPGVIDFGIAIDATNPGTTC 250 + R+ + H + GV+H D+KP NV+ G P+ VID G+ + Sbjct: 246 TIKFIAHRLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPV--VIDLGLH------SRSG 297 Query: 251 AHDRG-TPGYMSPEQARGAQDVDARSDIY 278 +G T + +PE G +SD++ Sbjct: 298 EQPKGFTESFKAPELGVGNLGASEKSDVF 326
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 29.4 bits (66), Expect = 0.026 Identities = 19/73 (26%), Positives = 31/73 (42%), Gaps = 4/73 (5%) Query: 178 RLQLLQGQASFDVAADRRRLQVRALGLRVEDIGTAFDIALHGQQARVDVSAGRVH--VWR 235 R L+ A F + D+ + Q ALG+ + DI AL G + GRV + Sbjct: 716 RPNGLEDTAQFKLEVDQEKAQ--ALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773 Query: 236 ADHPAQPMLANLG 248 AD + + ++ Sbjct: 774 ADAKFRMLPEDVD 786
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 55.0 bits (132), Expect = 7e-12 Identities = 26/155 (16%), Positives = 53/155 (34%), Gaps = 8/155 (5%) Query: 2 AATRIAQAHGYSGLNVRSLAEDVGIKAASLYHHFPSKADLAAAVAKRYWEDSAATLDALS 61 A R+ G S ++ +A+ G+ ++Y HF K+DL + + + + Sbjct: 19 VALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQ 78 Query: 62 AET-KEPMKALRRYPETFRRSLENGNRICL---CSFMAAEYDDLPDIVKDEVQAFADVNI 117 A+ +P+ LR S R L F E+ +V+ + + Sbjct: 79 AKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESY 138 Query: 118 AWLSKMLVAA----EVVGAKDAKKRARTIFAAIGG 148 + + L + ++ A + I G Sbjct: 139 DRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISG 173
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 757 bits (1956), Expect = 0.0 Identities = 244/1074 (22%), Positives = 416/1074 (38%), Gaps = 70/1074 (6%) Query: 5 IIRFAIAQRWLMLALTGVLIAIGAWSFSRLPIDATPDITNVQVQVNTAAPGYSPLESEQR 64 + F I + L +L+ GA + +LP+ P I V V+ PG + Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 65 VTFPLETVLAGLPGLESTRSLS-RYGLSQVTAVFADGTDLYFARQQVAERLQQVKSQLPA 123 VT +E + G+ L S S G +T F GTD A+ QV +LQ LP Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 124 DLEPQLGPIATGLGEIFMYTVEAKPNARKPDGSAWTATDLRTLQDWVVRPQLRNVPGVTE 183 +++ Q + M D T D+ V+ L + GV + Sbjct: 121 EVQQQGISVEKSSSSYLMVA------GFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGD 174 Query: 184 VNTIGGYARQIHITPDPARLVALGFTLDEVAQAVESNNRNIGAGYI------ERNGQQFL 237 V G + I D L T +V ++ N I AG + Sbjct: 175 VQLFGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233 Query: 238 VRVPGQVDDIAQIGAIVLD-RRAGVPIRVRDVAQVGEGRELRTGAATQDGSEVVLGTVFM 296 + + + + G + L G +R++DVA+V G E A +G + + Sbjct: 234 IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKL 293 Query: 297 LVGANSRTVAQAAAQRLEVANASLPAGVQAVPVYDRTALVDRTIVTVAKNLIEGALLVIV 356 GAN+ A+A +L P G++ + YD T V +I V K L E +LV + Sbjct: 294 ATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFL 353 Query: 357 VLFLLLGNVRAALITAAVIPLAMLFTLTGMVRGGVSGNLMSLG--ALDFGLIVDGAVIIV 414 V++L L N+RA LI +P+ +L T + G S N +++ L GL+VD A+++V Sbjct: 354 VMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVV 413 Query: 415 ENCLRRFGEAQLRLGRVLERDERFELTAEASAEVIRPSLFGLGIITAVYLPVFALTGIEG 474 EN R E +L E T ++ +++ + +++AV++P+ G G Sbjct: 414 ENVERVMMEDKLP---------PKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 475 KMFHPMAITVVLALTGAMLLSLTFVPAAIALLLGGKVAEHE----------NRAMRWARG 524 ++ +IT+V A+ ++L++L PA A LL AEH N + Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVN 524 Query: 525 VYAPLLDRALRHSRWVGIAALATVALCAVLATRLGSEFIPNLDEGDIALHALRIPGTSLE 584 Y + + L + + VA VL RL S F+P D+G G + E Sbjct: 525 HYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQE 584 Query: 585 --QAITMQSTLEKRIKQFPEVAHVFGKLGTAEVATDPMPPSVADTFLIMHPRAQWPDPRK 642 Q + Q T + V VF G + + F+ + P + Sbjct: 585 RTQKVLDQVTDYYLKNEKANVESVFTVNG---FSFSGQAQNAGMAFVSLKPWEERNGDEN 641 Query: 643 PKAQLLAEIEEAVKQLPGNNYEFTQPIQM-RMNELISGVRADVA-IKVYGDDLDTLVTLG 700 ++ + + ++ F P M + EL + D I G D L Sbjct: 642 SAEAVIHRAKMELGKIRDG---FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQAR 698 Query: 701 QRVQEIASAVPGA-ADVSLEQATGLPMLAVVPDRAALAGYGLNPGVVQDTVAAAVGGQEA 759 ++ +A+ P + V + D+ G++ + T++ A+GG Sbjct: 699 NQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYV 758 Query: 760 GQLFEGDRRFDIVVRLPEALRQDPTALADLPIPLRGDGERADADESSRAAGWRSGEPSTV 819 + R + V+ R P + L + +GE V Sbjct: 759 NDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSA-NGE-------------------MV 798 Query: 820 PLREVAKVDTVLGPNQINREDGKRRIVITANVRDRDLGGFVAEVQQRVKAQVKLPTGYWI 879 P V G ++ R +G + I G + + + KLP G Sbjct: 799 PFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLA--SKLPAGIGY 856 Query: 880 GYGGTFEQLISASQRLAWVVPGTLLLIFALLYWSFGSLRDALVVFSGVPLALTGGVVALA 939 + G Q + + +V + +++F L + S + V VPL + G ++A Sbjct: 857 DWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAAT 916 Query: 940 LRGLALSISAGVGFIALSGVAVLNGLVMIAFVRSL-RAEGMPLEQALREGALARLRPVLM 998 L + VG + G++ N ++++ F + L EG + +A RLRP+LM Sbjct: 917 LFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILM 976 Query: 999 TALVAALGFVPMAFNVGAGAEVQRPLATVVIGGIVSSTLLTLLVLPVLYRWLHR 1052 T+L LG +P+A + GAG+ Q + V+GG+VS+TLL + +PV + + R Sbjct: 977 TSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030 Score = 79.5 bits (196), Expect = 4e-17 Identities = 75/361 (20%), Positives = 137/361 (37%), Gaps = 35/361 (9%) Query: 708 SAVPGAADVSLEQATGLPMLAVVPDRAALAGYGLNPGVVQDTVAAAVGGQEAGQLFEGDR 767 S + G DV L + + D L Y L P V + + AGQL Sbjct: 167 SRLNGVGDVQL--FGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQL----- 219 Query: 768 RFDIVVRLPEALRQDPTALADLPIPLRGDGERADADESSRAAGWRSGEPSTVPLREVAKV 827 P Q A + + +E + + + S V L++VA+V Sbjct: 220 -----GGTPALPGQQLNAS------IIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARV 268 Query: 828 -DTVLGPNQINREDGKRRIVITANVRDRDLGGFVAEVQQRVKAQVK-----LPTGYWIGY 881 N I R +GK + + G + + +KA++ P G + Y Sbjct: 269 ELGGENYNVIARINGKPAAGLGIKLAT---GANALDTAKAIKAKLAELQPFFPQGMKVLY 325 Query: 882 GGTFEQLISASQRLAWVVPGTLL----LIFALLYWSFGSLRDALVVFSGVPLALTGGVVA 937 + S V TL L+F ++Y ++R L+ VP+ L G Sbjct: 326 PYDTTPFVQLSIH---EVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAI 382 Query: 938 LALRGLALSISAGVGFIALSGVAVLNGLVMIAFV-RSLRAEGMPLEQALREGALARLRPV 996 LA G +++ G + G+ V + +V++ V R + + +P ++A + + Sbjct: 383 LAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGAL 442 Query: 997 LMTALVAALGFVPMAFNVGAGAEVQRPLATVVIGGIVSSTLLTLLVLPVLYRWLHRERAP 1056 + A+V + F+PMAF G+ + R + ++ + S L+ L++ P L L + + Sbjct: 443 VGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSA 502 Query: 1057 R 1057 Sbjct: 503 E 503 Score = 76.4 bits (188), Expect = 4e-16 Identities = 85/523 (16%), Positives = 159/523 (30%), Gaps = 40/523 (7%) Query: 3 TNIIRFAIAQRWLMLALTGVLIAIGAWSFSRLPIDATPDITNVQVQVNTAAPGYSPLESE 62 TN + + L + +++A F RLP P+ P + E Sbjct: 527 TNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERT 586 Query: 63 QRVTFPLETVLAGLPGLESTRSLSRYGLSQVTAVFADGTDLYFARQQVAERLQQVKSQ-- 120 Q+V + + G S G + + + ER S Sbjct: 587 QKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAG-MAFVSLKPWEERNGDENSAEA 645 Query: 121 LPADLEPQLGPIATGLGEIFMYTVEAKPNARKPDGSAWT---ATDLRTLQDWVVRPQL-- 175 + + +LG I G F + L R QL Sbjct: 646 VIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQA--RNQLLG 703 Query: 176 ---RNVPGVTEVNTIGGY-ARQIHITPDPARLVALGFTLDEVAQAVESNNRNIGAGYIER 231 ++ + V G Q + D + ALG +L ++ Q + + Sbjct: 704 MAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFID 763 Query: 232 NGQQFLVRVPGQVD---DIAQIGAIVLDRRAGVPIRVRDVAQVGEGRELRTGAATQDGSE 288 G+ + V + + + G + + + Sbjct: 764 RGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVY-----GSPRLERY 818 Query: 289 VVLGTVFMLVGANSRTVAQAAAQRLEVANASLPAGVQAVPVYDRTALVDRTIVTVAKNLI 348 L ++ + A T + A +E + LPAG+ YD T + + ++ + Sbjct: 819 NGLPSMEIQGEAAPGTSSGDAMALMENLASKLPAGIG----YDWTGMSYQERLSGNQAPA 874 Query: 349 EGAL---LVIVVLFLLLGNVRAALITAAVIPLAMLFTLTGMVRGGVSGNLMSLGAL--DF 403 A+ +V + L L + + V+PL ++ L ++ + L Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934 Query: 404 GLIVDGAVIIVENCLRRFGEAQLRLGRVLERDERFELTAEASAEVIRPSLFGLGIITAVY 463 GL A++IVE + + G+ E T A +RP L Sbjct: 935 GLSAKNAILIVE----FAKDLMEKEGK-----GVVEATLMAVRMRLRPILMTSLAFILGV 985 Query: 464 LPVFALTGIEGKMFHPMAITVVLALTGAMLLSLTFVPAAIALL 506 LP+ G + + I V+ + A LL++ FVP ++ Sbjct: 986 LPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVI 1028
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 34.4 bits (79), Expect = 7e-04 Identities = 22/125 (17%), Positives = 38/125 (30%), Gaps = 8/125 (6%) Query: 171 AVGAGSIADQHEVQGLLTPAEGAQAQATARFPGPVRSLRVNVGDQVRA-GQVLATVESNL 229 V + +++ + A+ T F + D + LA E Sbjct: 269 RVYKSQLE---QIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQ 325 Query: 230 SLTTYSVSAPISGTVLARNA-SLGSNAGEGQALFEIA-DLSTLWVDLHIFGADAGHITAG 287 + + AP+S V + G + L I + TL V + D G I G Sbjct: 326 QASV--IRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVG 383 Query: 288 APVTV 292 + Sbjct: 384 QNAII 388
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 29.2 bits (65), Expect = 0.010 Identities = 21/71 (29%), Positives = 34/71 (47%), Gaps = 2/71 (2%) Query: 68 SHSGEYLLVGLGQGVRLGVDLERIRARPRVLEIAQRFFHPDEIALLAALAPDAQHALFFR 127 SH L + + R+G+D+E+I ++ E+A DE +L A AL Sbjct: 89 SHCATTALAVISRQ-RIGIDIEKIMSQHTATELAPSIIDSDERQILQASLLPFPLALTL- 146 Query: 128 LWCAKEALLKA 138 + AKE++ KA Sbjct: 147 AFSAKESVYKA 157
>SECYTRNLCASE#Preprotein translocase SecY subunit signature. Length = 437 Score = 25.9 bits (57), Expect = 0.018 Identities = 16/83 (19%), Positives = 33/83 (39%), Gaps = 2/83 (2%) Query: 3 IIIWLIVGG-IVGWLASIIMRRDAQQGIILNVVVGIVGALIAGFL-FGGGINQAITLWTF 60 ++I + G +V WL +I R G+ + + + I + A F Sbjct: 163 MVICMTAGTCVVMWLGELITDRGIGNGMSILMFISIAATFPSALWAIKKQGTLAGGWIEF 222 Query: 61 VWSLVGAVILLAIVNLVTRGRLR 83 + +I++A+V V + + R Sbjct: 223 GTVIAVGLIMVALVVFVEQAQRR 245
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 27.0 bits (59), Expect = 0.050 Identities = 21/74 (28%), Positives = 31/74 (41%), Gaps = 1/74 (1%) Query: 8 RTAAARGDAAAQRYLLAQRAADLMQRAVAAAPAGTQPTLSPDAEREVAVIVSELEALALA 67 A A+ A A R L QR D++ A+ A P+ + A A + +E E L LA Sbjct: 75 AAAEAQAKAKANRDALTQRLKDIVNEALRHN-ASRTPSATELAHANNAAMQAEDERLRLA 133 Query: 68 GHRDAIDTLAQVVE 81 + A+ E Sbjct: 134 KAEEKARKEAEAAE 147
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 36.7 bits (85), Expect = 1e-04 Identities = 34/164 (20%), Positives = 60/164 (36%), Gaps = 14/164 (8%) Query: 9 LAAQRAQLGTLRAAL--AQAVVGQDAVVEQLL--IGLLAGG--HCLLEGAPGLGKTLLVR 62 LA + + L +VG+ A ++++ + L ++ G G GK L+ R Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178 Query: 63 SLGQA---LELQFRRVQ---FTPDLMPSDILGTELLEEDHGTGHRHFRFQQGPIFTNLLL 116 +L F + DL+ S++ G E RF+Q T L Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGT--LF 236 Query: 117 ADELNRTPPKTQAALLEAMSERTVSYAGTTYALPAPFFVLATQN 160 DE+ P Q LL + + + G + + ++A N Sbjct: 237 LDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATN 280
>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature. Length = 544 Score = 286 bits (732), Expect = 1e-94 Identities = 118/288 (40%), Positives = 164/288 (56%), Gaps = 23/288 (7%) Query: 76 YDAEQGTALPGTLVRD--EGAPATQDVAVTEAYDYLGATHDFFQTVYGRNSIDAAGMPLI 133 YD T LPG+L D A+ D A +A+ Y G +D+++ V+GR S D + + Sbjct: 270 YDGRNRTVLPGSLWADGDNQFFASYDAAAVDAHYYAGVVYDYYKNVHGRLSYDGSNAAIR 329 Query: 134 GTVHYERGYDNAFWNGEQMVFGDGDGEVFNRFTIALDVVGHELTHGVTERTANLIYQGQS 193 TVHY RGY+NAFWNG QMV+GDGDG+ F F+ +DVVGHELTH VT+ TA L+YQ +S Sbjct: 330 STVHYGRGYNNAFWNGSQMVYGDGDGQTFLPFSGGIDVVGHELTHAVTDYTAGLVYQNES 389 Query: 194 GALNESISDVFGVLIKQYTLRQSADQADWIIGAGLLMPGIQGVGLRSMQAPGSAYDDPAL 253 GA+NE++SD+FG L++ Y + DW IG + PG+ G LRSM P Sbjct: 390 GAINEAMSDIFGTLVEFY----ANRNPDWEIGEDIYTPGVAGDALRSMSDP--------- 436 Query: 254 GKDPQPATMAGYVDTQEDDGGVHYNSGIPNHAFYRAA-------VAIGGAAWEKTGRIWY 306 K P + +D+GGVH NSGI N A Y + V++ G +K G+I+Y Sbjct: 437 AKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQGGVHYGVSVTGIGRDKMGKIFY 496 Query: 307 RALTGGELAAGADFATFADLTASVASADYGANSREAVAVRQAWRDVGV 354 RAL L ++F+ A+ YG+ S+E +V+QA+ VGV Sbjct: 497 RALV-YYLTPTSNFSQLRAACVQAAADLYGSTSQEVNSVKQAFNAVGV 543
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 104 bits (261), Expect = 3e-29 Identities = 81/249 (32%), Positives = 123/249 (49%), Gaps = 12/249 (4%) Query: 6 KVALVTGASRGIGAAIAQRLAGDGFAVVLNYAGHADEADQQVRSIEAAGGRAIGVQADVS 65 K+A +TGA++GIG A+A+ LA G A + + ++ ++ V S++A A ADV Sbjct: 9 KIAFITGAAQGIGEAVARTLASQG-AHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 66 DPAAVERLFAAAEAAFGGVDVLVNNAGIMQLATLADSDDGLFDKHIAINLKGTFNTLRQA 125 D AA++ + A E G +D+LVN AG+++ + D ++ ++N G FN R Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 126 AR--RLRNGGRIVNLSTSVVGLKLETYGVYAATKAAVETLTAILSKELRGRAITVNAVAP 183 ++ R G IV + ++ G+ + YA++KAA T L EL I N V+P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 184 GPTGTA----LFLDGKSPELI-----ERLSKANPLERLGCPDDIAAAVAFLVGPDGGWIN 234 G T T L+ D E + E PL++L P DIA AV FLV G I Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 235 GQVLRANGG 243 L +GG Sbjct: 248 MHNLCVDGG 256
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 74.3 bits (182), Expect = 1e-17 Identities = 47/188 (25%), Positives = 82/188 (43%), Gaps = 8/188 (4%) Query: 3 KRILVTGASSGFGRLAAQALAAAGHTVYASMRDTAGRNAGVAQAMAELADKQQLALHTVE 62 K +TGA+ G G A+ LA+ G + A + V+ AE + Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF-----P 63 Query: 63 LDVQSQASADAAVASIVAQAGGLDVVVHNAGHMVFGPAEAFTAEQLAQVYDINVLGTQRV 122 DV+ A+ D A I + G +D++V+ AG + G + + E+ + +N G Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 123 NRAALPQLRAQQQGLLVWVSSSSSAGGTPPY-LGPYFAAKAAMDALAVQYARELARWGIE 181 +R+ + ++ G +V V S+ G P + Y ++KAA ELA + I Sbjct: 124 SRSVSKYMMDRRSGSIVTV--GSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181 Query: 182 TSIIVPGA 189 +I+ PG+ Sbjct: 182 CNIVSPGS 189
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 30.9 bits (70), Expect = 0.017 Identities = 6/33 (18%), Positives = 10/33 (30%) Query: 418 YYTGGFDQFLSNLYKHYQINPLHSQDAPRRAAL 450 G +S L + P+ + A L Sbjct: 138 LTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFL 170
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 52.0 bits (124), Expect = 5e-09 Identities = 27/144 (18%), Positives = 57/144 (39%), Gaps = 4/144 (2%) Query: 439 AVHDALAADNHDLELQQDRMQAAQEALQDAHEELASLGPELAQAKQEAQQQAREAEQQIR 498 A +A LE ++ + A + L+ A E + + + + + E + Sbjct: 204 NFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQA 263 Query: 499 EMTQQHRQAQYAYAAAVRQADALSRRQVEM-AKQAALQGRAEAERGQREAAQAQRQA--- 554 E+ + A A + L + + A++A L+ +++ R++ + A Sbjct: 264 ELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASRE 323 Query: 555 AQAQVEAEKARAEADRAQAEAERA 578 A+ Q+EAE + E +EA R Sbjct: 324 AKKQLEAEHQKLEEQNKISEASRQ 347 Score = 50.1 bits (119), Expect = 2e-08 Identities = 31/153 (20%), Positives = 57/153 (37%), Gaps = 6/153 (3%) Query: 431 DASRDAQAAVHDALAADNHDLELQQDRMQAAQEALQDAHEELASLGPELAQAKQEAQQQA 490 + + + A +A LE ++ ++A Q L+ A E + AK + + Sbjct: 231 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFS-TADSAKIKTLEAE 289 Query: 491 REAEQQIREMTQQHRQAQYAYAAAVRQADALSRR-----QVEMAKQAALQGRAEAERGQR 545 + A + + + Q A ++R+ SR + E K +EA R Sbjct: 290 KAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSL 349 Query: 546 EAAQAQRQAAQAQVEAEKARAEADRAQAEAERA 578 + A+ Q+EAE + E +EA R Sbjct: 350 RRDLDASREAKKQLEAEHQKLEEQNKISEASRQ 382 Score = 32.0 bits (72), Expect = 0.008 Identities = 21/66 (31%), Positives = 36/66 (54%), Gaps = 1/66 (1%) Query: 431 DASRDAQAAVHDALAADNHDLELQQDRMQAAQEALQDAHEELASLGPEL-AQAKQEAQQQ 489 DASR+A+ V AL N L + + +E+ + +E A L +L A+AK ++ Sbjct: 389 DASREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEKL 448 Query: 490 AREAEQ 495 A++AE+ Sbjct: 449 AKQAEE 454
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 35.1 bits (81), Expect = 2e-04 Identities = 28/128 (21%), Positives = 42/128 (32%), Gaps = 32/128 (25%) Query: 1 MSILVTGATGTVGSLVTQGLADAGAQV--------------KALVRQQGKRPFPAGVTEV 46 M LVTGA G +G V++ L +AG QV K + +P G Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQP---GFQFH 57 Query: 47 VADLTDVASMRAALA--PVLTLFLLNAVT--------PDEVTQALIA-----LNLAKEAG 91 DL D M A +F+ P + + L + Sbjct: 58 KIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK 117 Query: 92 IERIVYLS 99 I+ ++Y S Sbjct: 118 IQHLLYAS 125
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 35.3 bits (81), Expect = 3e-04 Identities = 19/102 (18%), Positives = 31/102 (30%), Gaps = 3/102 (2%) Query: 96 DPNQFNAYVMQAHLAVARGDLDEAERLSRTAARLAPEHPQLLAVDGVVEMRRGHSDRALA 155 Q + + + G ++A ++ + L + G G D A+ Sbjct: 35 TLEQLYSLAFNQYQS---GKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIH 91 Query: 156 LLTRAAEQLPDDARVLFSLGFAYLQKEHFAFAERAFERVIEL 197 + A + R F LQK A AE EL Sbjct: 92 SYSYGAIMDIKEPRFPFHAAECLLQKGELAEAESGLFLAQEL 133
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 495 bits (1276), Expect = e-175 Identities = 206/471 (43%), Positives = 283/471 (60%), Gaps = 14/471 (2%) Query: 8 SHIWVVDDDRSVRFVLSTALRDAGYAVDGFDSAAAALQALAMRPTPDLLFTDVRMPGEDG 67 + I V DDD ++R VL+ AL AGY V +AA + +A DL+ TDV MP E+ Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA-AGDGDLVVTDVVMPDENA 62 Query: 68 LTLLDKLKSKHPQLPVIVMSAYTDVASTAGAFRGGAHEFLSKPFDLDDAVALAARALPDA 127 LL ++K P LPV+VMSA + A GA+++L KPFDL + + + RAL + Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122 Query: 128 DAGVEEILATRLAEGSASLIGDTPAMQALFRAIGRLAQAPLSVLINGETGTGKELVARAL 187 ++ ++ L+G + AMQ ++R + RL Q L+++I GE+GTGKELVARAL Sbjct: 123 KRRPSKLEDD--SQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARAL 180 Query: 188 HNESPRARKPFVALNTAAIPAELLESELFGHETGAFTGATKRHIGRFEQADGGTLFLDEI 247 H+ R PFVA+N AAIP +L+ESELFGHE GAFTGA R GRFEQA+GGTLFLDEI Sbjct: 181 HDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEI 240 Query: 248 GDMPLPLQTRLLRVLAENEFFRVGGRELIRVDVRVIAATHQDLEALVEQGRFRADLLHRL 307 GDMP+ QTRLLRVL + E+ VGGR IR DVR++AAT++DL+ + QG FR DL +RL Sbjct: 241 GDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300 Query: 308 DVVRLQLPPLRERRGDIAQLAENFLAMAGRKLDMLPKRLSSAALEDLRQYDWPGNVRELE 367 +VV L+LPPLR+R DI L +F+ A K + KR ALE ++ + WPGNVRELE Sbjct: 301 NVVPLRLPPLRDRAEDIPDLVRHFVQQA-EKEGLDVKRFDQEALELMKAHPWPGNVRELE 359 Query: 368 NVCWRLAALATSDIIDVVDV---------DAALARGGRRHRSGRSDGQWDDMLSSWAAQR 418 N+ RL AL D+I + D+ + + R S ++ + + A Sbjct: 360 NLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASF 419 Query: 419 LSE-GAQGLHAEARERLDRTLLEAALQLTQGRRAEAAARLGLGRNTVTRKL 468 GL+ ++ L+ AAL T+G + +AA LGL RNT+ +K+ Sbjct: 420 GDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKI 470
>SECBCHAPRONE#Bacterial protein-transport SecB chaperone protein signature. Length = 170 Score = 195 bits (498), Expect = 5e-67 Identities = 64/160 (40%), Positives = 99/160 (61%), Gaps = 3/160 (1%) Query: 1 MSDEILNGAAAPADAAAGPAFTIEKIYVKDVSFESPNAPAVFNDANQPELQLNLNQKVQR 60 MS+E AA A P I++IYVKDVSFE+PN P +F +P+L +L+ + ++ Sbjct: 1 MSEENQVNAAD-TQATQQPVLQIQRIYVKDVSFEAPNLPHIFQQDWEPKLSFDLSTEAKQ 59 Query: 61 LNDNAFEVVLAVTLTCTA--GGKTAYVAEVQQAGVFGLVGLDPQAIDVLLGTQCPNILFP 118 + D+ +EV L +++ T G A++ EV+QAGVF + GL+ + L +QCPN+LFP Sbjct: 60 VGDDLYEVCLNISVETTMESSGDVAFICEVKQAGVFTISGLEEMQMAHCLTSQCPNMLFP 119 Query: 119 YVRTLVSDLIQAGGFPPFYLQPINFEALYAETLRQRQNEG 158 Y R LVS L+ G FP L P+NF+AL+ + L++++ Sbjct: 120 YARELVSSLVNRGTFPALNLSPVNFDALFMDYLQRQEQAE 159
>OUTRMMBRANEA#Outer membrane protein A signature. Length = 346 Score = 27.6 bits (61), Expect = 0.034 Identities = 20/94 (21%), Positives = 32/94 (34%), Gaps = 10/94 (10%) Query: 49 KASYAIAPNFHVFGDYSKQ--NADDNNNVFENTDSDFQQWGV-GVGFNHEIATSTDFVAR 105 K Y I + ++ AD +NV + D V G E A + + R Sbjct: 103 KLGYPITDDLDIYTRLGGMVWRADTKSNV-YGKNHDTGVSPVFAGGV--EYAITPEIATR 159 Query: 106 VAYRKL----DLDTPNINFDGYSVEAGLRNAFGE 135 + Y+ D T D + G+ FG+ Sbjct: 160 LEYQWTNNIGDAHTIGTRPDNGMLSLGVSYRFGQ 193
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 463 bits (1193), Expect = e-163 Identities = 178/478 (37%), Positives = 262/478 (54%), Gaps = 37/478 (7%) Query: 2 ARILIIDDDAAFRTTLQVTLRSLGHAVVAAENGPDGLARLSEGGIDMAFVDFRMPGMDGI 61 A IL+ DDDAA RT L L G+ V N ++ G D+ D MP + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 AVLRARLDDAQARQVPLVMLTAHVSSGNTIEAMTLGAFDHLVKPVGRADIVEVVERALLS 121 +L R+ A+ +P+++++A + I+A GA+D+L KP +++ ++ RAL Sbjct: 64 DLLP-RIKKARPD-LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 122 RADAQAAAADSSPAPVEDDDALVGHSPAMRTVHKRIGLAAASDLPVLITGETGTGKELAA 181 + +D LVG S AM+ +++ + +DL ++ITGE+GTGKEL A Sbjct: 122 PKRRPSKL----EDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVA 177 Query: 182 RALHRASPRASAPFVAVNCAAIPLELMESELFGHRKGAFSGASSDRRGLIREADGGTLFL 241 RALH R + PFVA+N AAIP +L+ESELFGH KGAF+GA + G +A+GGTLFL Sbjct: 178 RALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFL 237 Query: 242 DEIGDMPLPMQAKLLRFLQEGEVTPLGGSGPQKVDVRVLAATHRDLAACVADGRFRSDLR 301 DEIGDMP+ Q +LLR LQ+GE T +GG P + DVR++AAT++DL + G FR DL Sbjct: 238 DEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLY 297 Query: 302 YRLNVVPIELPPLPERGQDILLLAQHFL---SADAARAQSLSPAAQERLLAHRWPGNVRE 358 YRLNVVP+ LPPL +R +DI L +HF+ + + A E + AH WPGNVRE Sbjct: 298 YRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVRE 357 Query: 359 LRNVMQRSQVLVRGASIDAADLDD---------------------ALGEAGELPPPQPSA 397 L N+++R L I +++ ++ +A E Q A Sbjct: 358 LENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFA 417 Query: 398 -------VTGTLPEAVARLETQMIRSALEQSQGNRAEAARRLGIHRQLLYRKLEEYGL 448 +G +A +E +I +AL ++GN+ +AA LG++R L +K+ E G+ Sbjct: 418 SFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 33.6 bits (77), Expect = 0.001 Identities = 64/375 (17%), Positives = 121/375 (32%), Gaps = 12/375 (3%) Query: 30 PFLSVFLQSKGWSVAAIGTVMSVGGIAGMLATTPAGALVDATRRKRAVVVIGCLAILLAT 89 P L L A G ++++ + GAL D R R V+++ + Sbjct: 29 PGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGR-RPVLLVSLAGAAVDY 87 Query: 90 ALIWLQPTSSGVVAAQIASALAAAGIGPALTGITLGLVHAHGFDHQLARNQVANHAGNVL 149 A++ P + +I + + A G + G V Sbjct: 88 AIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDGDERARHFGFMSACFGFGMVA 146 Query: 150 AAVLAGWLGWRYGFAAVFLLTAFFGALALVAVLAIPAATIDHRAARGLATTNGGDALSGW 209 VL G +G + A F A L + + + H+ R + L+ + Sbjct: 147 GPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPES--HKGERRPLRREALNPLASF 203 Query: 210 RVLLTCRPLALLAVTLGLFHLGNAAMLPLYGMAIVAAHAGDPSALTATTIVVAQATMVVV 269 R +A L + L L+ + D + + + + Sbjct: 204 RWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQ 263 Query: 270 ALLAMRWIRVHGHWWVLLVAFMALPLRALVAASVIHGWGVFPVQILDGLGAGLQSVVVPA 329 A++ G L++ +A ++ A GW FP+ +L G + +PA Sbjct: 264 AMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGG----IGMPA 319 Query: 330 LVARLLQGTGRVNVG--QGAVMTVQGVGAALSPAFGGWL-AHAFGYRVAFLTLGAIALLA 386 L A L + G QG++ + + + + P + A + + + AL Sbjct: 320 LQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYL 379 Query: 387 VALWAGCRGMLQAAA 401 + L A RG+ A Sbjct: 380 LCLPALRRGLWSGAG 394
>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature. Length = 591 Score = 32.0 bits (72), Expect = 0.006 Identities = 12/30 (40%), Positives = 18/30 (60%), Gaps = 3/30 (10%) Query: 56 GDGFWLIHEPDGRVHAIDACGRAAQAATLD 85 GD FWL+H+ +G +H + G+ A A D Sbjct: 155 GDDFWLLHDSNGILHLL---GKTAAARLSD 181
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 57.9 bits (140), Expect = 2e-11 Identities = 95/380 (25%), Positives = 140/380 (36%), Gaps = 47/380 (12%) Query: 47 VQPLLPEFAHAF-KVDAATASLPLSLATGALALAIFC--AGAVSENLGRRGLMFVSIALA 103 + P+LP + TA + LA AL GA+S+ GRR ++ VS+A A Sbjct: 24 IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGA 83 Query: 104 AVLNLIAAFLPHWGALVVVRTLSGIALGGVPAVAMVYLGEELPASK-------MGAATGL 156 AV I A P L + R ++GI G AVA Y+ + + M A G Sbjct: 84 AVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGF 142 Query: 157 -YVAGNAFGGMSGRIVMSVLTDHTDWRTALAVLSVFDLLCALAFFWLLPPS----RNFVR 211 VAG GG+ G + + L L +LLP S R +R Sbjct: 143 GMVAGPVLGGLMGGFSP---------HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLR 193 Query: 212 RHGINLRFHLRAWAGHLRDRNLPFLFALPFLLM---GVFVCLYNYAGFRLGGPEFGLSQS 268 R +N R WA + + L A+ F++ V L+ G F + Sbjct: 194 REALNPLASFR-WARGMT--VVAALMAVFFIMQLVGQVPAALWVI----FGEDRFHWDAT 246 Query: 269 QIGMIFSAYVFGIVSS----SVAGAASDRFGRGPVVTTGIVLCVLGMALTLAHVLALVVA 324 IG+ + FGI+ S + G + R G + G++ G L + Sbjct: 247 TIGISLA--AFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAF 304 Query: 325 GIVVVTIGFFIAHSAASAWVSRLGGAHRSHAASLYLLAYYAGSSVIGALGGWFW------ 378 I+V+ I A A +SR R L A + +S++G L Sbjct: 305 PIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASIT 364 Query: 379 QHGGWAALVGLCLTLLMLAL 398 GWA + G L LL L Sbjct: 365 TWNGWAWIAGAALYLLCLPA 384
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 37.5 bits (87), Expect = 4e-05 Identities = 26/130 (20%), Positives = 42/130 (32%), Gaps = 35/130 (26%) Query: 8 ILVTGASGQLGALVVEALLGHLPANRIVA---------TARDTASLAEFAKRDIAVRQAD 58 LVTGA+G +G V + LL +++V + A L A+ + D Sbjct: 3 YLVTGAAGFIGFHVSKRLLEA--GHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 59 YANPHSLD--------------AAFAGVGRVL-----LVSSNAVGQRVPQHRNVIEAAKR 99 A+ + V L SN G N++E + Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTG-----FLNILEGCRH 115 Query: 100 AGVELLAYTS 109 ++ L Y S Sbjct: 116 NKIQHLLYAS 125
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 39.5 bits (92), Expect = 2e-05 Identities = 30/172 (17%), Positives = 70/172 (40%), Gaps = 2/172 (1%) Query: 29 LLTMLDGFDVMAMAFTAPHVSADWQLSGKQLGMLFSAGLIGMALGALGLAPLADRIGRRA 88 +L+ + M + + P ++ D+ + +A ++ ++G L+D++G + Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80 Query: 89 LTLACLAILTVGMGLSALASTAWQ-LGALRLLTGLGIGGMLACVAVTAGEFSSPRWRNTA 147 L L + I G + + + + L R + G G A V V + R A Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKA 140 Query: 148 IVLQVTGYPVGATLGGTIAELLMQQWSWPAVFVLGAVASLLCVPLVLAFLPE 199 L + +G +G I ++ W + ++ + +++ VP ++ L + Sbjct: 141 FGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLI-PMITIITVPFLMKLLKK 191
>cdtoxina#Cytolethal distending toxin A signature. Length = 258 Score = 29.7 bits (66), Expect = 0.018 Identities = 15/40 (37%), Positives = 15/40 (37%), Gaps = 9/40 (22%) Query: 63 APTSSGASAAPAVPPSPAPAPAPAAPE-----PPEPAAAP 97 PT P P P P P PA P PEP AP Sbjct: 43 GPTVPS----PDEPGLPLPGPGPALPTNGAIPIPEPGTAP 78
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 26.7 bits (59), Expect = 0.029 Identities = 6/27 (22%), Positives = 16/27 (59%) Query: 55 LPAHTRSLITLSMMIALGHDEEFKLHV 81 +PA R + +++++ +F++HV Sbjct: 138 MPARYRKNVLGAVLLSPSQSSDFEIHV 164
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 61.7 bits (150), Expect = 2e-16 Identities = 24/78 (30%), Positives = 43/78 (55%) Query: 4 DDLVRFTSEALLLCLKVSLPVVGVAALAGLLIAFVQAVMSLQDASISFALKLVVVVAAIA 63 DDLV ++AL L L +S VA + GLL+ Q V LQ+ ++ F +KL+ V + Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 VTAPWGASAIMQFGQALM 81 + + W ++ +G+ ++ Sbjct: 62 LLSGWYGEVLLSYGRQVI 79
>TYPE3IMPPROT#Type III secretion system inner membrane P protein family signature. Length = 224 Score = 246 bits (630), Expect = 2e-85 Identities = 80/219 (36%), Positives = 130/219 (59%), Gaps = 8/219 (3%) Query: 3 MPDVGSLLLVVIMLGLLPFAAMVVTSYTKIVVVLGLLRNAIGVQQVPPNMVLNGVALLVS 62 M + SL+ ++ LLPF T + K +V ++RNA+G+QQ+P NM LNGVALL+S Sbjct: 1 MGNDISLIALLAFSTLLPFIIASGTCFVKFSIVFVMVRNALGLQQIPSNMTLNGVALLLS 60 Query: 63 CFVMAPVGMEAFKA-AQNYGAGSDNSRVVVLLDACREPFRQFLLKHTREREKAFFMRSAQ 121 FVM P+ +A+ +D S + +D + +R +L+K++ FF + Sbjct: 61 MFVMWPIMHDAYVYFEDEDVTFNDISSLSKHVDEGLDGYRDYLIKYSDRELVQFFENAQL 120 Query: 122 QIWPKDKAAT-------LKSDDLLVLAPAFTLSELTEAFRIGFLLYLVFIVIDLVVANAL 174 + ++ T ++ + L PA+ LSE+ AF+IGF LYL F+V+DLVV++ L Sbjct: 121 KRQYGEETETVKRDKDEIEKPSIFALLPAYALSEIKSAFKIGFYLYLPFVVVDLVVSSVL 180 Query: 175 MAMGLSQVTPTNVAIPFKLLLFVAMDGWSMLIHGLVLSY 213 +A+G+ ++P ++ P KL+LFVA+DGW++L GL+L Y Sbjct: 181 LALGMMMMSPVTISTPIKLVLFVALDGWTLLSKGLILQY 219
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 63.9 bits (155), Expect = 9e-14 Identities = 40/177 (22%), Positives = 74/177 (41%), Gaps = 15/177 (8%) Query: 144 PTQLPAWLAALRVNTRLRIGGRTASAALLQSLRPGDVLLHCTASAAVTSGELLWGIAGGA 203 P LR R IG +LL + GDVLL T+ A V G Sbjct: 138 PAVGGGRPKMLRWPLRFVIGSSDTQRSLLGRIGIGDVLLIRTSRAEVYCYAKKLG----- 192 Query: 204 VLRAPVRLNLQQMILEATPTMQHDTFE---PDVAPSTSNVAELELPVQLEVDQLALSLST 260 ++ I+ T +QH E + A + + +L + ++ + + ++L+ Sbjct: 193 -----HFNRVEGGIIVETLDIQHIEEENNTTETAETLPGLNQLPVKLEFVLYRKNVTLAE 247 Query: 261 LSGLQPGQILELSVPVDQADIRLVVYGQTIGTGRLLAVGEHLGVQILS-MSESTHAD 316 L + Q+L L + ++ ++ G +G G L+ + + LGV+I +SES + + Sbjct: 248 LEAMGQQQLLSLPTNAEL-NVEIMANGVLLGNGELVQMNDTLGVEIHEWLSESGNGE 303
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 332 bits (854), Expect = e-115 Identities = 115/345 (33%), Positives = 191/345 (55%), Gaps = 2/345 (0%) Query: 1 MSEEKTEKPTEKKLRDARKDGEVPVSPDVTAAAVLFGALLVMKSAGDYFADHVRALMTIG 60 MS EKTE+PT KK+RDARK G+V S +V + A++ ++ DY+ +H LM I Sbjct: 1 MSGEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLMLIP 60 Query: 61 FDFPENTRDAAAINRALGHLGIQGLLLMLPFLAACLIAGVAGGAFQTGLNASLKPVAPKF 120 + + A++ + ++ ++ L P L + +A Q G S + + P Sbjct: 61 AE-QSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDI 119 Query: 121 DSLNPAAGVKKLFSLRSLINLLKLIIKAILIGVVLWVGIRALMPMIIGLAYETPLDIAQI 180 +NP G K++FS++SL+ LK I+K +L+ +++W+ I+ + ++ L I + Sbjct: 120 KKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPL 179 Query: 181 AWHTLGMLFALGVLLFVLVGAADWSVQHWLFIRDKRMSKDEQKREFKESEGDPEIKGKRK 240 L L + + FV++ AD++ +++ +I++ +MSKDE KRE+KE EG PEIK KR+ Sbjct: 180 LGQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRR 239 Query: 241 EFAKELVFGDPRERVAKAKVMVVNPTHYAVALAYEPDDFGLPQVVAKGVDDGALELRALA 300 +F +E+ + RE V ++ V+V NPTH A+ + Y+ + LP V K D +R +A Sbjct: 240 QFHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIA 299 Query: 301 HNQGIPIVANPPLARALY-QVELGDAIPEPLFETVAVVLRWVDEL 344 +G+PI+ PLARALY + IP E A VLRW++ Sbjct: 300 EEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQ 344
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 79.2 bits (195), Expect = 6e-19 Identities = 43/188 (22%), Positives = 81/188 (43%), Gaps = 11/188 (5%) Query: 3 ALRCLVVLLVALLLSACSQQ---LYSGLTENDANDMLEVLLHAGVDASKVTPDDGKTWAV 59 A V ++VA++L A + L+S L++ D ++ L + + G A+ Sbjct: 30 AGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIPY-RFANGSG---AI 85 Query: 60 NAPHDQVSYSLEVLRAHGLPHEQHANLG-EMFKKDGLISTPTEERVRFIYGVSQQLSQTL 118 P D+V L GLP + +G E+ ++ + E+V + + +L++T+ Sbjct: 86 EVPADKVHELRLRLAQQGLP--KGGAVGFELLDQEKFGISQFSEQVNYQRALEGELARTI 143 Query: 119 SNIDGVISADVEIVLPNNDPLSTSVKPSSAAVFIKFRVGSDLT-SLVPNIKTMVMHSVEG 177 + V SA V + +P K SA+V + G L + + +V +V G Sbjct: 144 ETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVHLVSSAVAG 203 Query: 178 LTYENVSV 185 L NV++ Sbjct: 204 LPPGNVTL 211
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 29.3 bits (65), Expect = 0.009 Identities = 13/75 (17%), Positives = 24/75 (32%), Gaps = 13/75 (17%) Query: 93 AEQAQAAADQSLQSARDELASVQQALSKLQAQAQV-------------YADKAASARRAR 139 +E + A+ S Q ++ + Q A +V + A S + Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETK 1093 Query: 140 QAQRDAAEEEDAVEA 154 + Q +E VE Sbjct: 1094 ETQTTETKETATVEK 1108
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 177 bits (451), Expect = 6e-57 Identities = 52/238 (21%), Positives = 105/238 (44%), Gaps = 3/238 (1%) Query: 8 LLAISSQGVSLLALLALCGVRVFVMFIVLPATAQDSLPGIARNGVIYVLSSFIAYGQPAD 67 L S Q +S L L +RV + P ++ S+P + G+ +++ IA PA+ Sbjct: 2 LQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPAN 61 Query: 68 ALAKIQTVGLVGVVFKEAFIGLLIGFAASTVFWIAESVGLLIDDLAGYNNVQMTNPLSGQ 127 + L + ++ IG+ +GF F + G +I G + +P S Sbjct: 62 DVPVFSFFAL-WLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHL 120 Query: 128 QSTPVSTVLLQLAIVSFYALGGMLLLLGALFESFRWWPLTQLGPNMGSVAESFVIQQSDS 187 ++ ++ LA++ F G L L+ L ++F P+ + S A + + Sbjct: 121 NMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGG--EPLNSNAFLALTKAGSL 178 Query: 188 MMTAVVKLSAPVMLVLVLVDLAIGLVARAADKLEPSNLSQPIRGVLALLLLALLTSVF 245 + + L+ P++ +L+ ++LA+GL+ R A +L + P+ + + L+A L + Sbjct: 179 IFLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLI 236
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 334 bits (857), Expect = e-109 Identities = 101/288 (35%), Positives = 155/288 (53%), Gaps = 13/288 (4%) Query: 320 DVGGGAELASDAPVIEADPRTNAILIRDRPERMQSYGTLIQQLDNRPKLLQIDATIIEIR 379 + A AS +EADP NAI++RD PERM Y LI LD +++ +I++I Sbjct: 233 RIPQAATRASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDIN 292 Query: 380 DGAMQDLGVDWRFHSQHTDIQTGNGSGSQLGFNGALSGAATDGATTPAGGTLTAVLGDAG 439 + +LGVDWR I+TGN + G S A++GA G+L G Sbjct: 293 ADQLTELGVDWR-----VGIRTGNNHQVVIKTTGDQSNIASNGAL----GSLVDARGL-- 341 Query: 440 RYLMTRVSALETTNKAKIVSSPQVATLDNVEAVMDHKQQAFVRVSGYASADLYNLSAGVS 499 YL+ RV+ LE A++VS P + T +N +AV+DH + +V+V+G A+L ++ G Sbjct: 342 DYLLARVNLLENEGSAQVVSRPTLLTQENAQAVIDHSETYYVKVTGKEVAELKGITYGTM 401 Query: 500 LRVLPSVVPGSPNGQMRLDVRIEDGQLGSNT--VDGIPVITSSEITTQAFVNEGQSLLIA 557 LR+ P V+ ++ L++ IEDG N+ ++GIP I+ + + T A V GQSL+I Sbjct: 402 LRMTPRVLTQGDKSEISLNLHIEDGNQKPNSSGIEGIPTISRTVVDTVARVGHGQSLIIG 461 Query: 558 GYAYDADETDLNAVPGLSKIPLLGNLFKHRQKSGSRMQRLFLLTPHVV 605 G D L+ VP L IP +G LF+ + + R RLF++ P ++ Sbjct: 462 GIYRDELSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRII 509 Score = 250 bits (639), Expect = 5e-77 Identities = 72/230 (31%), Positives = 115/230 (50%), Gaps = 6/230 (2%) Query: 15 MAAVLMLSLLPLLSPHADAAQVPWHSRTFKYVADNKDLKEVLRDLSASQSIATWISPEVT 74 VL +LL LLS ++ A ++ W + YVA + L+++L D A+ +S ++ Sbjct: 9 FKRVLTGTLL-LLSSYSWAQELDWLPIPYVYVAKGESLRDLLTDFGANYDATVVVSDKIN 67 Query: 75 GTLSGKFE-TSPQKFLDDLAATYGFVWYYDGAVLRIWGANESKSATLSLGTASTKSLRDA 133 +SG+FE +PQ FL +A+ Y VWYYDG VL I+ +E S + L + L+ A Sbjct: 68 DKVSGQFEHDNPQDFLQHIASLYNLVWYYDGNVLYIFKNSEVASRLIRLQESEAAELKQA 127 Query: 134 LARMRLDDPRFPVRYDEAAHVAVVSGPPGYVDTVSAIAKQVEQGVRQR----DATEVQVF 189 L R + +PRF R D + + VSGPP Y++ V A +EQ + R A +++F Sbjct: 128 LQRSGIWEPRFGWRPDASNRLVYVSGPPRYLELVEQTAAALEQQTQIRSEKTGALAIEIF 187 Query: 190 QLHYAQAADHTTRIGGQDVQIPGMASLLRSMYGARGAPVAAIAGPSANFG 239 L YA A+D T +V PG+A++L+ + + Sbjct: 188 PLKYASASDRTIHYRDDEVAAPGVATILQRVLSDATIQQVTVDNQRIPQA 237
>THERMOLYSIN#Thermolysin metalloprotease (M4) family signature. Length = 544 Score = 311 bits (799), Expect = e-102 Identities = 170/504 (33%), Positives = 235/504 (46%), Gaps = 54/504 (10%) Query: 50 PADAFIARDSIVDADGTEHVRFDRTYQGLPVIGGDVVVHSRRGVMRELSQTMDTTV---R 106 + + +D G +RF++ +G +V H G + LS T+ + Sbjct: 72 ARERLSLIGNKLDELGHTVMRFEQAIAASLCMGAVLVAHVNDGELSSLSGTLIPNLDKRT 131 Query: 107 PSLVPGIDAATALRVAGSQFDVAQDAA-------PRASLALYAGQGAPRLVYEVIYSGVK 159 I A +A L +Y + PRL YEV + Sbjct: 132 LKTEAAISIQQAEMIAKQDVADRVTKERPAAEEGKPTRLVIYPDEETPRLAYEVNVRFLT 191 Query: 160 PDQTPTEMHYIVDAVNQRILESWDTVHTACSGGT------STAGTGRSLYAGSVTVNTTR 213 P P Y++DA + ++L W+ + A GG ST G GR + +NTT Sbjct: 192 PV--PGNWIYMIDAADGKVLNKWNQMDEAKPGGAQPVAGTSTVGVGRGVLGDQKYINTTY 249 Query: 214 CSSTS-YEMTDLSRGSGA-TYNMRNSTSGNGTLVTDDDNAWGSGTTGDTVTAAVDAHYGV 271 S Y + D +RGSG TY+ RN T G+L D DN + AAVDAHY Sbjct: 250 SSYYGYYYLQDNTRGSGIFTYDGRNRTVLPGSLWADGDNQF----FASYDAAAVDAHYYA 305 Query: 272 ALTWDYYRTMHSRTGIANDGAGARSRVHYGSRYNNAFWQDSCFCMTFGDGDGSTFTPLV- 330 + +DYY+ +H R A RS VHYG YNNAFW S M +GDGDG TF P Sbjct: 306 GVVYDYYKNVHGRLSYDGSNAAIRSTVHYGRGYNNAFWNGSQ--MVYGDGDGQTFLPFSG 363 Query: 331 SVDVAGHEMTHGVTSRTAALTYSGESGGLNEATSDIMGTMVEYSAANSAEPGNYLIGEKI 390 +DV GHE+TH VT TA L Y ESG +NEA SDI GT+VE+ A + ++ IGE I Sbjct: 364 GIDVVGHELTHAVTDYTAGLVYQNESGAINEAMSDIFGTLVEFYANRNP---DWEIGEDI 420 Query: 391 IPNNSTGTLALRYMFKPSLDGKS---PDCYSSSLGSLNVHYSSGVANHFYYLLAEGAVVP 447 G ALR M P+ G Y+ + + VH +SG+ N YLL++G Sbjct: 421 YTPGVAGD-ALRSMSDPAKYGDPDHYSKRYTGTQDNGGVHTNSGIINKAAYLLSQG---- 475 Query: 448 SGFGSGTSYNLTPTSLVCSGSTALTAIGRAAASRIWYRALTVYMTSSTNYAAARRATLSA 507 G Y ++ +T IGR +I+YRAL Y+T ++N++ R A + A Sbjct: 476 -----GVHYGVS-----------VTGIGRDKMGKIFYRALVYYLTPTSNFSQLRAACVQA 519 Query: 508 ATDLYGSTSTQYRAVAAAWSAVSV 531 A DLYGSTS + +V A++AV V Sbjct: 520 AADLYGSTSQEVNSVKQAFNAVGV 543
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 29.5 bits (66), Expect = 0.004 Identities = 19/103 (18%), Positives = 41/103 (39%), Gaps = 10/103 (9%) Query: 44 EYKESIDEMKHADKLSDRILFLEGLPNF---QALGKLRIGENP-----TEMFRCDLALER 95 E + E D +++R+L + G P + I + +EM + + + Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109 Query: 96 EAVVVLREAVAYAETVKDYVSRQLLVDILESEEEHIDWLETQL 138 + + + AE +D + L V ++E E+ + L + L Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 66.0 bits (161), Expect = 2e-13 Identities = 30/148 (20%), Positives = 61/148 (41%), Gaps = 11/148 (7%) Query: 115 RVLIVEDDRSQALFAQSVLHGAGMHAQVEMTAASVPQAIQDYHPDLILMDLHMPELDGIR 174 +L+ +DD + L AG ++ AA++ + I DL++ D+ MP+ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 175 LTTLIRQQPGQQLLPIVFLTGDPDPERQFEVLDSGADDFLTKPIRPRHLIAAVSNRIRRA 234 L I++ + LP++ ++ + + GA D+L KP + Sbjct: 65 LLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDL--------TELIGI 114 Query: 235 RQQALQQVGEQVSVRS-NPETGLPTRGH 261 +AL + + S + + G+P G Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVGR 142
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 39.3 bits (91), Expect = 1e-05 Identities = 21/79 (26%), Positives = 29/79 (36%), Gaps = 1/79 (1%) Query: 66 EAALQQARRSQAQQRRQIEQLQQRQVNLAMSDKISRAANTEVQASLAERDEQIAALRADV 125 A Q RR R +QL+ L +KIS A+ ++ L E L A+ Sbjct: 308 NANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQLEAEH 367 Query: 126 AFYERLVG-STAQRKGLNA 143 E S A R+ L Sbjct: 368 QKLEEQNKISEASRQSLRR 386
>cloacin#Cloacin signature. Length = 551 Score = 37.0 bits (85), Expect = 2e-04 Identities = 20/55 (36%), Positives = 25/55 (45%), Gaps = 3/55 (5%) Query: 441 GTAALIGTPWADYHSLRAPHGHAGGSGSSCGGGGGDSGGDGGSSDGGGCGGCGGG 495 G A G+ W+ ++ P G GSG GGG G G G + GGG G G Sbjct: 30 GGGASDGSGWSSENN---PWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81
>DNABINDNGFIS#DNA-binding protein FIS signature. Length = 98 Score = 113 bits (284), Expect = 9e-37 Identities = 37/74 (50%), Positives = 55/74 (74%) Query: 16 KSPLREHVAQSVRRYLRDLDGSDADDVYEIVLREMEIPLFVEVLNHCEGNQSRAAALLGI 75 + PLR+ V Q+++ Y L+G D +D+YE+VL E+E PL V+ + GNQ+RAA ++GI Sbjct: 24 QKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQPLLDMVMQYTRGNQTRAALMMGI 83 Query: 76 HRATLRKKLKEYGL 89 +R TLRKKLK+YG+ Sbjct: 84 NRGTLRKKLKKYGM 97
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 31.2 bits (70), Expect = 0.023 Identities = 21/119 (17%), Positives = 35/119 (29%), Gaps = 14/119 (11%) Query: 131 AMRAPAAMQAPRAAVAAAKGIAETPAASANAGTSTATPNVEHTATPPPAQSLRITA-TTN 189 ++ A K P + N T P E ++ + T T N Sbjct: 1138 TVQPQAEPARENDPTVNIK----EPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGN 1193 Query: 190 ATSATPTNRTP---------ESAKRADAQARTPRSSTAWTLQFDRIVAEQVQAVSLRQL 239 + P N TP ES+ + + R S ++ + V+L L Sbjct: 1194 SVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDL 1252
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 41.0 bits (96), Expect = 1e-05 Identities = 47/235 (20%), Positives = 71/235 (30%), Gaps = 52/235 (22%) Query: 200 YNDLKQAY---GYPSYQTMIGAPGKQQRLDGSGSTIAVLIGSDVLDTDIAAMFDHERFSR 256 Y +KQ P MI AP + G G +A VLDT DH Sbjct: 10 YQVIKQEQQVNEIPRGVEMIQAPAVWNQTRGRGVKVA------VLDTGC--DADHPDLK- 60 Query: 257 YAGNHANPTLYARRYVAGAKPGVQDGNR-----AAAREATLDVDMALGGAPGAHVLLYVI 311 G +D N A AT + + +G AP A +L+ + Sbjct: 61 ---ARIIGGRNFTDDDEGDPEIFKDYNGHGTHVAGTIAATENENGVVGVAPEADLLIIKV 117 Query: 312 PDL----SIDSILAGYRQIVQDNEADVVSSSFGFCEQVFTAAYNGKDATSILGVFDSVFK 367 + D I+ G ++ D++S S G +D K Sbjct: 118 LNKQGSGQYDWIIQGIYYAIEQK-VDIISMSLG----------GPEDVPE----LHEAVK 162 Query: 368 QGNAQGISFVAPSGDNAGLDCPDTQYLVEGKNGRYVPSVHWPAADAYVTAVGGGN 422 + A I + +G+ EG + +P V +VG N Sbjct: 163 KAVASQILVMCAAGN-------------EGDGDDRTDELGYPGCYNEVISVGAIN 204
>PF06776#Invasion associated locus B Length = 214 Score = 30.7 bits (69), Expect = 0.004 Identities = 13/51 (25%), Positives = 20/51 (39%) Query: 8 LLPLALTLAIAACSKPADNAAAPAAETPAAATAPADAAAAPAPAPAAAAST 58 + P L+ +A+C + A A A A A + + A A A S Sbjct: 29 MGPAELSPMLASCRRLARRNGARLMLAGAMAIALSFGWSDRADAQGAVRSV 79
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 29.3 bits (65), Expect = 0.024 Identities = 20/57 (35%), Positives = 27/57 (47%), Gaps = 7/57 (12%) Query: 124 GTGGLRGHGVGGGSGGLRSRRTDFYSVAAL--VSRWTKARTAGVDADQTIRAGANQA 178 GT G G GGG GG +S S AA+ ++W+ A+ A+Q RA A Sbjct: 27 GTPDGSGSGGGGGKGGSKSE-----SSAAIHATAKWSTAQLKKTQAEQAARAKAAAE 78
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 82.0 bits (202), Expect = 8e-21 Identities = 54/222 (24%), Positives = 96/222 (43%), Gaps = 5/222 (2%) Query: 3 IENKVVVITGAGSGMGRATALHLAALGAKVVLGARREARIAEVARQITLSGGQAVYRPTD 62 IE K+ ITGA G+G A A LA+ GA + ++ +V + A P D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 63 VTVHEEVLALADLACSQFGRLDVMVNNAGISPLSRFDALQVEAWNAMIDVNLRGVLHGIA 122 V + + + G +D++VN AG+ +L E W A VN GV + Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 123 AALPIFGRQQSSHVINVVSTAGLRIVPTMGVYAATKNAVRTISEALRQESGPH-IRVTEI 181 + ++S ++ V S +M YA++K A ++ L E + IR + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 182 SPGMVQSELLDT--VSDPALRQTLQAQSEA--SGMPAEAIAR 219 SPG ++++ + + Q ++ E +G+P + +A+ Sbjct: 186 SPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAK 227
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 50.7 bits (121), Expect = 8e-11 Identities = 20/79 (25%), Positives = 38/79 (48%), Gaps = 5/79 (6%) Query: 9 RGFTLLEMLAVLVIAALASTLVVMTLPDTRRDLHDHADTLAS---ALIHARDEAILSLRM 65 RGFTLLEM+ +L++ +++ +V++ P +R D A TLA L + + + + Sbjct: 4 RGFTLLEMMLILLLMGVSAGMVLLAFPASRDD--SAAQTLARFEAQLRFVQQRGLQTGQF 61 Query: 66 VEVGIDAGGYGFRRQAQQQ 84 V + + F + Sbjct: 62 FGVSVHPDRWQFLVLEARD 80
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 35.2 bits (81), Expect = 4e-05 Identities = 12/48 (25%), Positives = 26/48 (54%) Query: 1 MIRKQRTRGFTLIELLVALAVFALVAAAAVMVMRQSIDQRDAVRARLQ 48 M + RGFTL+E++V + + ++A+ V + + ++ D +A Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSD 48
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 96.4 bits (240), Expect = 4e-25 Identities = 34/118 (28%), Positives = 61/118 (51%), Gaps = 1/118 (0%) Query: 11 ARVLIVDDEPQIRRFLDISLRAQGYRVLQAGTGEEGLALLAGQGAELVVLDIGLPDRDGH 70 A +L+ DD+ IR L+ +L GY V +A +LVV D+ +PD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 71 EVLREIRQ-WSNVPVIMLTVRAGETEKVAALDAGANDYVTKPFGVQELMARIRALLRQ 127 ++L I++ ++PV++++ + + A + GA DY+ KPF + EL+ I L + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 203 bits (518), Expect = 1e-62 Identities = 97/331 (29%), Positives = 143/331 (43%), Gaps = 53/331 (16%) Query: 147 QWAFGTTNAGL---NIRPAWDKSTGANVVVAVIDTGI-VSHPDLDANILPGYDFISDATA 202 + G+ W+++ G V VAV+DTG HPDL A I+ G +F Sbjct: 16 EQQVNEIPRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNFT----- 70 Query: 203 ARDGNGRDNNPADEGDWNSTSGCTTSNSSWHGTHVAGTVAAVTNNTTGVAGTAFNAKVVP 262 + + +P D+N HGTHVAGT+AA T N GV G A A ++ Sbjct: 71 ----DDDEGDPEIFKDYN-----------GHGTHVAGTIAA-TENENGVVGVAPEADLLI 114 Query: 263 VRVLGRCG-GSLSDIADAIIWASGGTVSGVPANPNAAEVINMSLGGGGTCSSTMQSAING 321 ++VL + G G I I +A ++I+MSLGG + A+ Sbjct: 115 IKVLNKQGSGQYDWIIQGIYYA----------IEQKVDIISMSLGGPED-VPELHEAVKK 163 Query: 322 AVSRGTTVVVAAGNSAANVSG----SLPANCANVIAVAATTSAGAKASYSNYGSGIDVSA 377 AV+ V+ AAGN P VI+V A + +SN + +D+ A Sbjct: 164 AVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDRHASEFSNSNNEVDLVA 223 Query: 378 PGSGILSTLNSGTTTPGNASYASYNGTSMAAPHVAGVVALVQSVAPTT----LTPAAVET 433 PG ILST+ G YA+++GTSMA PHVAG +AL++ +A + LT + Sbjct: 224 PGEDILSTVPGGK-------YATFSGTSMATPHVAGALALIKQLANASFERDLTEPELYA 276 Query: 434 LLKNTARALPGACSGGCGAGIVDADAAVTAA 464 L L + G G++ A + Sbjct: 277 QLIKRTIPLGNS-PKMEGNGLLYLTAVEELS 306
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 203 bits (519), Expect = 5e-63 Identities = 97/339 (28%), Positives = 142/339 (41%), Gaps = 57/339 (16%) Query: 134 DPGVPQQWAMGATAASL---NIRPAWDRSTGKGIVVAVIDTGI-TNHPDLAANVLPGYDF 189 + Q+ + + W+++ G+G+ VAV+DTG +HPDL A ++ G +F Sbjct: 10 YQVIKQEQQVNEIPRGVEMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIGGRNF 69 Query: 190 IVDPATARDGTARDANAADQGDWAAANECGPGASASNSSWHGTHVAGIVAAVGNNAVGVV 249 ++ G + + HGTHVAG +AA N GVV Sbjct: 70 TD------------------------DDEGDPEIFKDYNGHGTHVAGTIAATENE-NGVV 104 Query: 250 GTAFNAKILPLRVLGRCG-GYMSDIADAIVWASGGKVSGVPTNPNPATVINLSLGGAGTC 308 G A A +L ++VL + G G I I +A +I++SLGG Sbjct: 105 GVAPEADLLIIKVLNKQGSGQYDWIIQGIYYA----------IEQKVDIISMSLGGPED- 153 Query: 309 SATLNNAIAAAVTRGSAVVVAAGNSNLDVST----SVPANCANVIAVAATTSAGAKASFS 364 L+ A+ AV V+ AAGN P VI+V A + FS Sbjct: 154 VPELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDRHASEFS 213 Query: 365 NFGKGVDIAAPGQSIVSTLNTGTTAPGNAAYAVYSGTSMAAPHVAGVVALMQSVALN--- 421 N VD+ APG+ I+ST+ G YA +SGTSMA PHVAG +AL++ +A Sbjct: 214 NSNNEVDLVAPGEDILSTVPGGK-------YATFSGTSMATPHVAGALALIKQLANASFE 266 Query: 422 -PLTPATVKALLKASARPMPVACTQGCGAGLVNADGAVA 459 LT + A L P+ + G GL+ Sbjct: 267 RDLTEPELYAQLIKRTIPLGNSPKM-EGNGLLYLTAVEE 304
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 36.8 bits (85), Expect = 8e-05 Identities = 14/81 (17%), Positives = 28/81 (34%) Query: 46 VQRALALHPGHPEAVARLGRVRWAQQRHAEAATLLQQASDLVPQHPGIALWLGHALEDAG 105 + + E + L ++ ++ +A + Q L L LG + G Sbjct: 25 IAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMG 84 Query: 106 QPEQAAAAYTRAHRLLPDEPY 126 Q + A +Y+ + EP Sbjct: 85 QYDLAIHSYSYGAIMDIKEPR 105
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 33.3 bits (76), Expect = 0.002 Identities = 15/85 (17%), Positives = 32/85 (37%), Gaps = 10/85 (11%) Query: 60 QSARSSLPKPREILEVLDQY----VIGQLRAKRTLAVAVYNHYKRIESRSKNDDVELAK- 114 + A LPKP ++ E++ + R + + S + + + Sbjct: 96 KGAYDYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155 Query: 115 -----SNILLVGPTGSGKTLLAETL 134 +++ G +G+GK L+A L Sbjct: 156 LMQTDLTLMITGESGTGKELVARAL 180
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 88.3 bits (219), Expect = 2e-22 Identities = 32/119 (26%), Positives = 60/119 (50%), Gaps = 2/119 (1%) Query: 2 RILVIEDNSDIAANLGDYLEDRGHTVDFAADGVTGLHLAVVHEFDAIVLDLNLPGMDGIE 61 ILV +D++ I L L G+ V ++ T + D +V D+ +P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VCRKLRNEARKQTPVLMLTARDSLDNKLAGFDSGADDYLIKPFALQE-VEVRLNALSRR 119 + +++ +AR PVL+++A+++ + + GA DYL KPF L E + + AL+ Sbjct: 65 LLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 330 bits (848), Expect = e-116 Identities = 129/282 (45%), Positives = 176/282 (62%), Gaps = 1/282 (0%) Query: 1 MAFLDQHPGLGFPAAAGLGLLIGSFLNVVILRLPKRMEWQWRRDAREILELPDI-YEPPP 59 + P L F L+IGSFLNVVI RLP +E +W+ + R D + PP Sbjct: 5 LELAHGLPWLYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPP 64 Query: 60 PGIVVEPSHDPVTGDKLKWWENIPVLSWAMLRGKSRYSGKPISIQYPLVELLTSILCVAS 119 ++V S P + ENIP+LSW LRG+ R PIS +YPLVELLT++L VA Sbjct: 65 YNLMVPRSCCPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAV 124 Query: 120 VWRFGFGWQGFGAIVLSCFLVAMSGIDLRHKLLPDQLTLPLMWLGLVGSMDNLYMPAKPA 179 GW A++L+ LVA++ IDL LLPDQLTLPL+W GL+ ++ ++ A Sbjct: 125 AMTLAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDA 184 Query: 180 LLGAAVGYVSLWTVWWLFKQLTGKEGMGHGDFKLLAALGAWCGLKGILPIILISSLVGAI 239 ++GA GY+ LW+++W FK LTGKEGMG+GDFKLLAALGAW G + + ++L+SSLVGA Sbjct: 185 VIGAMAGYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAF 244 Query: 240 LGSIWLVAKGRDRATPIPFGPYLAIAGWVVFFWGNDLVDGYL 281 +G ++ + ++ PIPFGPYLAIAGW+ WG+ + YL Sbjct: 245 MGIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSITRWYL 286
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 382 bits (982), Expect = e-133 Identities = 113/405 (27%), Positives = 211/405 (52%), Gaps = 9/405 (2%) Query: 23 LFLWEGTDKRGIKMKGEQTARNMNMLRAELRRQGINPSIVKLK--------PKPLFGAAG 74 + ++ D +G K +G Q A + R LR +G+ P V L Sbjct: 3 QYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLRRK 62 Query: 75 KKITPKDIAFFSRQMATMMKSGVPIVGSLEIIGEGHKNPRMKKMVGQVRTDIEGGSSLYE 134 +++ D+A +RQ+AT++ + +P+ +L+ + + + P + +++ VR+ + G SL + Sbjct: 63 IRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSLAD 122 Query: 135 SISRHPVQFDELYRNLVRAGEGAGVLETVLDTVATYKENIEALKGKIKKALFYPAMVIAV 194 ++ P F+ LY +V AGE +G L+ VL+ +A Y E + ++ +I++A+ YP ++ V Sbjct: 123 AMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLTVV 182 Query: 195 ALIVSAILLIFVVPQFEEVFKGFGAELPAFTQMIVGASRFMVSYWWIMFFVIAGAIVGFV 254 A+ V +ILL VVP+ E F LP T++++G S + ++ M + + F Sbjct: 183 AIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMAFR 242 Query: 255 FAYKRSPSMQHTMDRLILRVPVIGQIMHNSSIARFARTTAVTFKAGVPLVEALGIVAGAT 314 R + + R +L +P+IG+I + AR+ART ++ + VPL++A+ I Sbjct: 243 VML-RQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDVM 301 Query: 315 GNRVYEDAVLRMRDDVSVGYPVNMAMKQVNLFPHMVIQMTAIGEEAGALDSMLFKVAEYF 374 N + D V G ++ A++Q LFP M+ M A GE +G LDSML + A+ Sbjct: 302 SNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADNQ 361 Query: 375 EQEVNNAVDALSSLLEPMIMVFIGVVVGGMVIGMYLPIFKLGAVV 419 ++E ++ + L EP+++V + VV +V+ + PI +L ++ Sbjct: 362 DREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 46.8 bits (111), Expect = 1e-09 Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 5/71 (7%) Query: 1 MKKQNGFTLIELMIVVAIIAILAAIALPAYQDYTVRGRVSEAMVAASAAKTVVAENAANG 60 KQ GFTL+E+M+V+ II +LA++ +P + G +A + + V ENA + Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVP-----NLMGNKEKADKQKAVSDIVALENALDM 58 Query: 61 SALNSGWTPPT 71 L++ P T Sbjct: 59 YKLDNHHYPTT 69
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 39.8 bits (93), Expect = 1e-05 Identities = 47/224 (20%), Positives = 78/224 (34%), Gaps = 32/224 (14%) Query: 31 DLAALHTHAAWMRAQLPAQCELFYAAKANA----EPPVLRTLATHVDGFEAASGGELAWL 86 DL AL + + +R Q ++ KANA + + DGF + E L Sbjct: 10 DLQALKQNLSIVR-QAATHARVWSVVKANAYGHGIERIWSAIGA-TDGFALLNLEEAITL 67 Query: 87 HAQQPQAPLLFGGPGKLDTELAQAAALPDCTVHVESLRELERLAAIATHGGRCVPVFLRM 146 + + P+L G + + T V S +L+ L + ++L++ Sbjct: 68 RERGWKGPILMLE-GFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARL--KAPLDIYLKV 124 Query: 147 NIAVPGAQSTRLMMGGQPSPFGLDPCDLDAAMQRLQASPSLRLEGFHFHLMSHQRNATAQ 206 N + RL G P + Q+L+A ++ LMSH A Sbjct: 125 NSGM-----NRL---------GFQPDRVLTVWQQLRAMANVGEMT----LMSHFAEAEHP 166 Query: 207 LHLVAAYLRTVQQWRQTYALGPLRVNAGGGFGVDYLAPEASFDW 250 + A R ++Q + N+ PEA FDW Sbjct: 167 DGISGAMAR-IEQAAEGLECRRSLSNSAATL----WHPEAHFDW 205
>PF04183#IucA / IucC family Length = 580 Score = 293 bits (751), Expect = 7e-94 Identities = 94/468 (20%), Positives = 165/468 (35%), Gaps = 46/468 (9%) Query: 100 DAQALARCLLQALASTQAINPELLAQSANSVAVT------AAFLRQAQLTAATGEAMIDA 153 D LA+ LL L +++ +A+ + T R+ + D Sbjct: 69 DEPVLAQTLLMQLKQVLSMSDATVAEHMQDLYATLLGDLQLLKARRGLSASDLINLNADR 128 Query: 154 EQSMLWGHALHPTPKSREGVDLDQVLACAPEARASFQLFWF-------------RIDPRL 200 Q +L GH K R G + + APE +F+L W +D Sbjct: 129 LQCLLSGHPKFVFNKGRRGWGKEALERYAPEYANTFRLHWLAVKREHMIWRCDNEMDIHQ 188 Query: 201 LRIQGRDVRATLR-----QLSGSDDLY---PCHPWEAQRLLDAPLLRTMQARGLITPIGP 252 L D + R Q +G D + P HPW+ Q+ + + A G + +G Sbjct: 189 LLTAAMDPQEFARFSQVWQENGLDHNWLPLPVHPWQWQQKIATDFIADF-AEGRMVSLGE 247 Query: 253 LGDALRPTSSVRTLYHPE--LAYFLKCSVHVRLTNCVRKNAWYELESAVALSELLAPSWR 310 GD S+RTL + +K + + T+C R + + S L + Sbjct: 248 FGDQWLAQQSLRTLTNASRRGGLDIKLPLTIYNTSCYRGIPGRYIAAGPLASRWLQQVFA 307 Query: 311 ALAMQV-PGFDVMLEPAATSLDVALVDPALHAADPLAARTLSESFGILYRQGIPAAQRAR 369 A V G ++ EPAA V +AA A E G+++R+ + Sbjct: 308 TDATLVQSGAVILGEPAA-----GYVSHEGYAALARAPYRYQEMLGVIWRENPCRWLKPD 362 Query: 370 WQPQVAAALFTCDAQGNSVCAARLRALGSAQMNRRTATLLWFGAYAGLLLDGVWSALFQH 429 P + A L CD + A + G W +++ ++ L ++ Sbjct: 363 ESPVLMATLMECDENNQPLAGAYIDRSGLDAET-------WLTQLFRVVVVPLYHLLCRY 415 Query: 430 GIALEPHLQNTVIGFADGWPTRVWIRDLEGT-KLLAHHWPETRLRGVGERARQSLYYTPE 488 G+AL H QN + +G P RV ++D +G +L+ +PE + + + R Sbjct: 416 GVALIAHGQNITLAMKEGVPQRVLLKDFQGDMRLVKEEFPE--MDSLPQEVRDVTSRLSA 473 Query: 489 QGWNRVAYCALVNNLAEAIFHLSQGDAALETQLWQCVGEIALRWQQRH 536 + I L E + +Q + + + ++H Sbjct: 474 DYLIHDLQTGHFVTVLRFISPLMVRLGVPERRFYQLLAAVLSDYMKKH 521
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 60.6 bits (147), Expect = 3e-12 Identities = 50/156 (32%), Positives = 67/156 (42%), Gaps = 3/156 (1%) Query: 20 LGMPLFLPQVLAELAPAA-AVGWSGVLYVLPTLCTALTASSWGRWADRHGRKRSLLRAQL 78 L MP+ LP +L +L + G+L L L A G +DR GR+ LL + Sbjct: 23 LIMPV-LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLA 81 Query: 79 GLALGFAIAGFAPSLSWLVIGLVVQGTCGGSLAAANAYLASQPQAGPLARALDWTQYSAR 138 G A+ +AI AP L L IG +V G G + A A AY+A AR + Sbjct: 82 GAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFG 141 Query: 139 LAMVSAPALLGLALALGPAQSLYRALALLPLIAFAL 174 MV+ P L GL P + A A L + F Sbjct: 142 FGMVAGPVLGGLMGGFSPHAPFFAA-AALNGLNFLT 176 Score = 35.6 bits (82), Expect = 3e-04 Identities = 35/110 (31%), Positives = 42/110 (38%), Gaps = 4/110 (3%) Query: 278 LLPGLALFAVACVWQALLHDALALAVARLLFGL-GMLFALRGLNRSLAHIASGHGAGRLF 336 LL LA AV A L + R++ G+ G A+ G +A I G R F Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG--AYIADITDGDERARHF 133 Query: 337 GRFDACGKWAGVFAGAAAGALAQASGPATPFLAAALAAAAAALTVVVRFP 386 G AC G+ AG G L P PF AAA LT P Sbjct: 134 GFMSACFG-FGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLP 182
>PF04183#IucA / IucC family Length = 580 Score = 148 bits (376), Expect = 2e-40 Identities = 95/437 (21%), Positives = 145/437 (33%), Gaps = 62/437 (14%) Query: 81 DSWIVRSDDGVHV---ERGAHAWLH-------RISAELDAQT--QQL-HRAYADEADCAA 127 D + + ERG WL + AQT QL +A A Sbjct: 35 DRYCINLPGAQWRFIAERGIWGWLWIDAQTLRCADEPVLAQTLLMQLKQVLSMSDATVAE 94 Query: 128 AHRGLARQAYHAQAPALRTALHHPDAAERAYRCDQLASYRD-HPFYPTARAKAGLDAAEL 186 + L L+ + D+L HP + + + G L Sbjct: 95 HMQDLYATLLGDLQ-LLKARRGLSASDLINLNADRLQCLLSGHPKFVFNKGRRGWGKEAL 153 Query: 187 RHYAPEFAPTFALHWLAIPQALAQCTSAAP------------AELWPDFASLGLPPELAA 234 YAPE+A TF LHWLA+ + + + F+ + L Sbjct: 154 ERYAPEYANTFRLHWLAVKREHMIWRCDNEMDIHQLLTAAMDPQEFARFSQVWQENGLDH 213 Query: 235 THLPWPVHPLMWERLEQEGFA--LPEDVLR----APNAWLDVRPSLSVRTLVPPQHPQ-L 287 LP PVHP W++ F E + + W S+RTL L Sbjct: 214 NWLPLPVHPWQWQQKIATDFIADFAEGRMVSLGEFGDQW---LAQQSLRTLTNASRRGGL 270 Query: 288 HLKLPIPMRTLGALNLRLIKPSTLYDGHWMERALRHIDALDPALQGRCVFV-DESHGGHV 346 +KLP+ + R I + G R L+ + A D L + E G+V Sbjct: 271 DIKLPLTIYNTSC--YRGIPGRYIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYV 328 Query: 347 -------------GQTRHLAYLVRRYPAL---DDATLVPVAALCAPMPDGRPMAIHLAER 390 L + R P D + V +A L + +P+A +R Sbjct: 329 SHEGYAALARAPYRYQEMLGVIWRENPCRWLKPDESPVLMATLMECDENNQPLAGAYIDR 388 Query: 391 FAHGDVLRWWRDYTELLLAVHLRLWLGYGIALEANQQNSVLVYSDGQATRLLMKDN-DAA 449 D W +++ L YG+AL A+ QN L +G R+L+KD Sbjct: 389 SGL-DAETWLTQLFRVVVVPLYHLLCRYGVALIAHGQNITLAMKEGVPQRVLLKDFQGDM 447 Query: 450 RIALPQLRAALPELDAL 466 R+ + PE+D+L Sbjct: 448 RLVKEE----FPEMDSL 460
>BACINVASINC#Salmonella/Shigella invasin protein C signature. Length = 409 Score = 29.5 bits (65), Expect = 0.006 Identities = 25/97 (25%), Positives = 40/97 (41%), Gaps = 6/97 (6%) Query: 69 RDTAKSKRQAGDLAGAAAALDQALGLVSGDPAILQERAEVSVLQADWPAAERFAKQAIDL 128 R A+ + GDL + + S A QER+E + Q + A + +A + Sbjct: 315 RIDARKMQMTGDLIMKNSVTVGGIAGASRQYAATQERSEQQISQVNNRVASTASDEARES 374 Query: 129 GSKTGPLCRRHWATIEQSRLARGEKENAASAKAQIAG 165 K+ L + T+E ++ ASA A IAG Sbjct: 375 SRKSTSLIQEMLKTMESI------NQSKASALAAIAG 405
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 132 bits (332), Expect = 3e-39 Identities = 42/262 (16%), Positives = 88/262 (33%), Gaps = 37/262 (14%) Query: 11 MDDGRRLMMTLVISLLLHGVLILGVGFAVSEDAPLVPTLDVIFSQTSAPLTPKQADFLAQ 70 +D RR ++S+ +HG ++ G+ + +P AP P +A Sbjct: 8 LDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELP----------APAQPISVTMVAP 57 Query: 71 ANQQGGGDHDTAQRPRDSQPGVVPQDRTGLAPQAQRATSVNAPEPTQTRVVTSRRGEQAV 130 A D P P+ P+ + P + Sbjct: 58 A--------DLEPPQAVQPP---PEPVVEPEPEPEPIPEPPKEAPVV------------I 94 Query: 131 PTPQPNPQTDPLTPAEAQRIQRDAEMARLAAEVHLRSEQYAKRPNRKFVSASTREYAYAN 190 P+P P+ P + ++ +RD + + A+ + +A+++ Sbjct: 95 EKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVA 154 Query: 191 YLRAWVDRAERVGNLNYPDDARRRRLGGKVVISVGVRRDGSVESSRVLVSSGVPLLDDAA 250 + R YP A+ R+ G+V + V DG V++ ++L + + + Sbjct: 155 SGPRALSRN----QPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREV 210 Query: 251 LRVVQLAQPFPPLPKTKDDVDI 272 ++ + P P + V+I Sbjct: 211 KNAMRRWRYEPGKPGSGIVVNI 232
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 73.3 bits (180), Expect = 2e-18 Identities = 28/115 (24%), Positives = 49/115 (42%), Gaps = 2/115 (1%) Query: 15 KVMVIDDSKTIRRTAETLLKREGCEVVTATDGFEALAKIADQQPQIIFVDIMMPRLDGYQ 74 ++V DD IR L R G +V ++ IA ++ D++MP + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 75 TCALIKGNQLFKSTPVIMLSSKDGLFDKARGRIVGSEQYLTKPFTREELLSAIRT 129 IK + PV+++S+++ + G+ YL KPF EL+ I Sbjct: 65 LLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 88.0 bits (218), Expect = 1e-23 Identities = 35/116 (30%), Positives = 57/116 (49%), Gaps = 2/116 (1%) Query: 2 ARIILIEDSPTDRAVFSQWLEKAGHTVVATDNAEEGLELVRSQAPDLVLMDVVLPGMSGF 61 A I++ +D R V +Q L +AG+ V T NA + + DLV+ DVV+P + F Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 QATRALARDQATKDIPVLLVSTKGMETDRAWGLRQGASDYIVKPPREDDLIARIRQ 117 + + + D+PVL++S + +GA DY+ KP +LI I + Sbjct: 64 DLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 67.5 bits (165), Expect = 4e-13 Identities = 24/116 (20%), Positives = 53/116 (45%), Gaps = 2/116 (1%) Query: 2276 QVPLVMVVDDSLTMRKVTSRVLERHNLDVTTARDGVEALELLEERVPDLMLLDIEMPRMD 2335 ++V DD +R V ++ L R DV + + DL++ D+ MP + Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 2336 GYELATAMRADPRFKAVPIVMITSRSGEKHRQRAFEIGVQRYLGKPYQELDLMRNV 2391 ++L ++ +P++++++++ +A E G YL KP+ +L+ + Sbjct: 62 AFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGII 115
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 38.7 bits (90), Expect = 4e-05 Identities = 56/312 (17%), Positives = 107/312 (34%), Gaps = 41/312 (13%) Query: 76 FTLQVLFTCTFLIMVLLQPVYGALVSRYPRR-VFLPGVYGFFIATLLL-----FYVLFDS 129 +L L+ PV GAL R+ RR V L + G + ++ +VL+ Sbjct: 43 AHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIG 102 Query: 130 GVPG--RGMAFFLWVTVFNLFAVAVFWSFMADVFSNAQARSYYGYIGAAGTLGAFLGPVL 187 + G AV +++AD+ + ++G++ A G GPVL Sbjct: 103 RIVAGITGATG------------AVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVL 150 Query: 188 TRVLVERIGIAHLMLVSAGFLAVCVVCVLRLRLWAVAREQEGQLSSGEVPMGGDVLGGLK 247 ++ H +A L L E + L + Sbjct: 151 GGLMGG-FS-PHAPFFAAAALNGLNFLTGCFLL----PESHKGERRPLRREALNPLASFR 204 Query: 248 LIVREPLLRWLAFMVLFGVGVGTLLYNEQAALVRRLYTDAAAATAYYSSIDLAIN----- 302 ++ L V L + A + ++ + ++ + + I+ Sbjct: 205 WARGMTVVAALMA-----VFFIMQLVGQVPAALWVIFGE---DRFHWDATTIGISLAAFG 256 Query: 303 ALALVLQLLVTRALLSRFGIAPALLIPGVAIMLGYAALAASPLPMMIAIVQVITRSSEFA 362 L + Q ++T + +R G AL++ +A GY LA + M + V+ S Sbjct: 257 ILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLAS--GG 314 Query: 363 LAKPARETLYTR 374 + PA + + +R Sbjct: 315 IGMPALQAMLSR 326
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 62.3 bits (151), Expect = 3e-14 Identities = 32/187 (17%), Positives = 66/187 (35%), Gaps = 14/187 (7%) Query: 29 QAALDLIAEQGVGAVAVEPLARRLGVTKGSFYWHFPSRDALLQAALERWEIFEQKEVFGS 88 AL L ++QGV + ++ +A+ GVT+G+ YWHF + L E E + Sbjct: 18 DVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEY 77 Query: 89 LEDVP-DPSARLRA----LFQLVAHEVKPHVIYSELLKALDHPAVRPVIDRVSQRRLDYL 143 P DP + LR + + E + ++ + + V+ + + Sbjct: 78 QAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLES 137 Query: 144 IASFRQ---AGLSR------TDAQHRARLAYAAYVGFLQLSLQLQQPKPAREDFEAYVEH 194 Q + + A + G ++ L Q +++ YV Sbjct: 138 YDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAI 197 Query: 195 VIQTLIP 201 +++ + Sbjct: 198 LLEMYLL 204
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 50.3 bits (120), Expect = 7e-11 Identities = 25/106 (23%), Positives = 49/106 (46%), Gaps = 18/106 (16%) Query: 1 MKRTAAQVRGFTLIELMIVVAVVAILSAIAYPSYTEHVRKSRRAQAKVDLVEYGQLAERF 60 M+ T Q RGFTL+E+M+V+ ++ +L+++ P+ + K+ + +A D+V Sbjct: 1 MRATDKQ-RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIV--------- 50 Query: 61 HTVQNTYSGFTLPTNVSPR-EGGTAAYTLALTQQ------TQSGYV 99 ++N + L + P G + A T + GY+ Sbjct: 51 -ALENALDMYKLDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYI 95
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 29.1 bits (65), Expect = 0.006 Identities = 16/107 (14%), Positives = 40/107 (37%), Gaps = 13/107 (12%) Query: 7 LSARGYTAVQLLIVMAVIGIGAAIGVPSFKSLIEWQRATTRVHLLTAHLAMARSLAVTQG 66 + RG+T +++++++ ++G+ A + + +F + + A + A L + + G Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRD-DSAAQTLARFEAQLRFVQQRGLQTG 59 Query: 67 EPVSLCPSTDGTRCRTDRIWSQGWILFKDPGRGGQPPTSASVIRAEY 113 + + + W R G P A + Y Sbjct: 60 QFFGV------------SVHPDRWQFLVLEARDGADPAPADDGWSGY 94
>PF04335#VirB8 type IV secretion protein Length = 227 Score = 212 bits (542), Expect = 3e-70 Identities = 52/230 (22%), Positives = 104/230 (45%), Gaps = 12/230 (5%) Query: 14 QVGAAVQKAVNYEVSIADLARRSEKRAWIVATLSMLVTVMTAGGYYYMLPLKEKVPYLVM 73 ++ A ++A ++E A RS+K AW+VA ++ + + PLK PY++ Sbjct: 9 ELKAYFEEAASWERDKLAAAERSKKLAWVVAGVAGALATAGVVAVAALTPLKTVEPYVIT 68 Query: 74 ADAYSGTSTIAKLEPNFGGRAISTSEALARSNIARFIIARESYDASNISDRDWNTVVAMA 133 D +G ++I G I+ EA+ + +A ++ RE + A+ + ++ V+ M+ Sbjct: 69 VDRNTGEASI--AAKLHGDATITYDEAVRKYFLATYVRYREGWIAAAREE-YFDAVMVMS 125 Query: 134 TTGVLAEYRALHAANNAARPFNVYGRNRAIRISILSITLIGGKGKPFTGATVRFQRSLYD 193 + + +N P N+ + + I ++ +GG A V F + Sbjct: 126 ARPEQDRWSRFYKTDNPQSPQNILANRTDVFVEIKRVSFLGGN-----VAQVYFTKESVT 180 Query: 194 KSSTVSTLLDNKIATMEFAYQDNLQMSDDLRVENPLGFRVSDYRVDNDYS 243 S++ T + +AT+++ D + R +NPLG++V YR D + Sbjct: 181 GSNSTKT---DAVATIKYKV-DGTPSKEVDRFKNPLGYQVESYRADVEVP 226
>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature. Length = 522 Score = 35.9 bits (82), Expect = 1e-04 Identities = 26/89 (29%), Positives = 42/89 (47%), Gaps = 10/89 (11%) Query: 44 TGLGITTQVELSPNEKILDYSTGFTGGWELTRRENVFYLKPKNVDVD-------TNMMIR 96 T L T ++L +E I +TGF GW + N +++PK+V + N + Sbjct: 59 TSLDNVTVIQLEKDETISYITTGFNKGWSIVPNSNHIFIQPKSVKSNLMFEKEAVNFALM 118 Query: 97 TATHSYILELK---VVATDWQRLEQAKQA 122 T + L+ K V A D + LE+ K+A Sbjct: 119 TRDYQEFLKTKKLIVDAPDPKELEEQKKA 147 Score = 28.6 bits (63), Expect = 0.027 Identities = 10/27 (37%), Positives = 17/27 (62%) Query: 165 YDYDYSTRTKKSWLVPSRVYDDGKFTY 191 Y+Y + + ++PS ++DDG FTY Sbjct: 401 YNYYQAPEKRSKHIMPSEIFDDGTFTY 427
>MYCMG045#Hypothetical mycoplasma lipoprotein (MG045) signature. Length = 483 Score = 31.6 bits (71), Expect = 0.004 Identities = 17/37 (45%), Positives = 23/37 (62%), Gaps = 3/37 (8%) Query: 187 TTFMKALVNHIP--NEERLVTIEDARELFISQPNSVH 221 T +KA+V H N+ RLV I+DAR +F S N V+ Sbjct: 171 TDVIKAIVKHKDRFNDNRLVFIDDARTIF-SLANIVN 206
>LPSBIOSNTHSS#Lipopolysaccharide core biosynthesis protein signature. Length = 166 Score = 210 bits (535), Expect = 1e-72 Identities = 72/152 (47%), Positives = 105/152 (69%) Query: 9 AVYPGTFDPITNGHIDLVNRAAPLFERVVVGVAYSPSKGPALSLERRVALAQEALAAHTN 68 A+YPG+FDPIT GH+D++ R LF++V V V +P+K P S++ R+ +A+A N Sbjct: 3 AIYPGSFDPITFGHLDIIERGCRLFDQVYVAVLRNPNKQPMFSVQERLEQIAKAIAHLPN 62 Query: 69 VEVRGFDTLLAHFVREMGAGVLLRGLRAVSDFEYEFQMASMNRHLIPEVETLFLTPAEQY 128 +V F+ L ++ R+ AG +LRGLR +SDFE E QMA+ N+ L ++ET+FLT + +Y Sbjct: 63 AQVDSFEGLTVNYARQRQAGAILRGLRVLSDFELELQMANTNKTLASDLETVFLTTSTEY 122 Query: 129 SFISSSLVREIARLGGDVSGFVPASVVEALRQ 160 SF+SSSLV+E+AR GG+V FVP+ V AL Sbjct: 123 SFLSSSLVKEVARFGGNVEHFVPSHVAAALYD 154
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 53.5 bits (128), Expect = 6e-09 Identities = 37/238 (15%), Positives = 78/238 (32%), Gaps = 20/238 (8%) Query: 725 QARMQASVAAQARQEREQQERVAQEQHVAQVREHLQQAQPEHE-DRSQSEQAVQAQAVLE 783 A + Q + E+ E+ A E AQ RE ++A+ + + +E A E Sbjct: 1036 TTETVAENSKQESKTVEKNEQDATET-TAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094 Query: 784 GQRQAEQQRELEERQVQERQADNQQREQQDRQAQETRQVEAQEGQARQAQDQQQQTQALE 843 Q ++ E++ + + + E+ + T QV ++ Q+ Q Q + + + Sbjct: 1095 TQTTETKETATVEKEEKAK----VETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREND 1150 Query: 844 PTQDQRQQASQQPDTQLHAPELALTQQTTLPQSQEDACSRLETQNQPANERLAPDAHDSL 903 PT + ++ SQ T + + T ++ + + + Sbjct: 1151 PTVNIKEPQSQTNTT----ADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPAT 1206 Query: 904 KQTSEAGDAQSHLAQGAERALESQAVQSRDTARIQVPLSEGRESGNPPLQSAQADAVS 961 Q + ++ + R++ S S N A D S Sbjct: 1207 TQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT----------TSSNDRSTVALCDLTS 1254
>FLAGELLIN#Flagellin signature. Length = 507 Score = 29.2 bits (65), Expect = 0.012 Identities = 19/75 (25%), Positives = 34/75 (45%), Gaps = 7/75 (9%) Query: 7 AAMAEMMATLNASNTSLQETITVLTTLVASMQQREQRLRDV-VAEQ------LQVLQRAA 59 + + + ++L A IT L V ++ R+ D A + Q+LQ+A Sbjct: 429 SKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAG 488 Query: 60 SSADAKVNRVLENAL 74 +S A+ N+V +N L Sbjct: 489 TSVLAQANQVPQNVL 503
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 577 bits (1488), Expect = 0.0 Identities = 208/568 (36%), Positives = 321/568 (56%), Gaps = 11/568 (1%) Query: 274 AIVGIGASPGVAIGIVHRLRAAQTEVADQPV-GLGDGGAQLHDALTRTRQQLAAIQDDTQ 332 I GI AS GVAI ++ + + +L AL +++++L AI+D T+ Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTE 63 Query: 333 RRLGASDAAIFKAQAELLNDTDLITR-TCQLMVEGHGVAWSWHQAVEQMASGLAALGNPV 391 +GA A IF A +L+D +L+ ++ E ++ + + S ++ N Sbjct: 64 ASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDNEY 123 Query: 392 LAGRAADLRDVGRRVLAQLDPAAAGAGLTDLPEQPCILLASDLSPSDTANLDTARVLGLA 451 + RAAD+RDV +RVL L G+ L + E +++A DL+PSDTA L+ V G A Sbjct: 124 MKERAADIRDVSKRVLGHLIGVETGS-LATIAE-ETVIIAEDLTPSDTAQLNKQFVKGFA 181 Query: 452 TAQGGPTSHTAILSRTLGLPALVAAGGQLLDIEDGVTAIIDGSSGRLYIDPSAQDLDAAR 511 T GG TSH+AI+SR+L +PA+V I+ G I+DG G + ++P+ +++ A Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241 Query: 512 THIAEQQAIREREAAQRALPAETSDGHHIDIGANVNLPDQVAMALTQGAEGVGLMRTEFL 571 A + ++ A P+ T DG H+++ AN+ P V L G EG+GL RTEFL Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301 Query: 572 FLESGRTPSEDEQHATYLAMAQALDGRPLIVRALDIGGDKQVAHLELPHEENPFLGVRGA 631 +++ + P+E+EQ Y + Q +DG+P+++R LDIGGDK++++L+LP E NPFLG R Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361 Query: 632 RLLLRRPDLLEPQLRALYRAAKDGARLSIMFPMITSVPELVALRAICARIRVDLDA---- 687 RL L + D+ QLRAL RA+ G L +MFPMI ++ EL +AI + L + Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYG-NLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVD 420 Query: 688 --PEVPIGIMIEVPAAAAQADVLARHADFFSIGTNDLTQYVLAIDRQNPELAAEADSLHP 745 + +GIM+E+P+ A A++ A+ DFFSIGTNDL QY +A DR N ++ HP Sbjct: 421 VSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHP 480 Query: 746 AVLRMIRSTIDGARKHERWVGVCGGLAGDAFGASLLAGLGVQELSMTPNDIPAVKARLRG 805 A+LR++ I A +WVG+CG +AGD LL GLG+ E SM+ I +++L Sbjct: 481 AILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLK 540 Query: 806 AALSQLQQLAEQALACETAEQVRALEAK 833 + +L+ A++AL +TAE+V L K Sbjct: 541 LSKEELKPFAQKALMLDTAEEVEQLVKK 568
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1081 bits (2796), Expect = 0.0 Identities = 518/1038 (49%), Positives = 706/1038 (68%), Gaps = 17/1038 (1%) Query: 1 MPKFFIEHPVFAWVVAILISLAGVISILNLGIESYPTIAPPQVTVTANFPGASADTAEKA 60 M FFI P+FAWV+AI++ +AG ++IL L + YPTIAPP V+V+AN+PGA A T + Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQQLTGIDHLLYFNSSSAANGRVTITLTFETGTDADIAQVQVQNKVSLATPRLPS 120 VTQVIEQ + GID+L+Y +S+S + G VTITLTF++GTD DIAQVQVQNK+ LATP LP Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVTQQGVVVAKANAGFLMVAALRSDNPSINRDALNDIVGSRVLEQISRVPGVGSTNQFGA 180 EV QQG+ V K+++ +LMVA SDNP +D ++D V S V + +SR+ GVG FGA Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 EYAMNIWLNPEKLQGYNLSATQVLTAVRNQNVQFAAGSVGADPTPEGISFTATVSAEGRF 240 +YAM IWL+ + L Y L+ V+ ++ QN Q AAG +G P G A++ A+ RF Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 SSPEQFENIILRTDNNGATVRLKDVARVTVGPSNYGFDTQYNGKPTGAFGIQLLPGANAL 300 +PE+F + LR +++G+ VRLKDVARV +G NY + NGKP GI+L GANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 NVSEAVGAKLDELQPTFPQGVTWFAPYESTTFVRISIEEVIHTLVEAIVLVFLVMLLFLQ 360 + ++A+ AKL ELQP FPQG+ PY++T FV++SI EV+ TL EAI+LVFLVM LFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATVIPTLVIPVALLGTFFGMYMIGFTINQLTLFAMVLAIGIVVDDAIVVIENVERIM 420 N RAT+IPT+ +PV LLGTF + G++IN LT+F MVLAIG++VDDAIVV+ENVER+M Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 SEEHLEPKAATQKAMTQITGAVVAITVVLAAVFIPSSLQPGASGAIYKQFALTIAMSMGF 480 E+ L PK AT+K+M+QI GA+V I +VL+AVFIP + G++GAIY+QF++TI +M Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SAFLALSFTPALCAAFLK---STHSTKKNWVYRTFDKYYDKLAHRYVGVVGHTLKRSPPW 537 S +AL TPALCA LK + H K + F+ +D + Y VG L + + Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 538 MIAFVALVVLCGFLFTRMPGSFLPEEDQGFAVAIVQLPPGATKIRTNEAFAQMRAVLEKQ 597 ++ + +V LF R+P SFLPEEDQG + ++QLP GAT+ RT + Q+ K Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 598 PA--VEGMLQIAGFSFLGSGENVGMGFIRLKPWEERDV---TAEQLIQQLNGAFYGIKGA 652 VE + + GFSF G +N GM F+ LKPWEER+ +AE +I + I+ Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 653 QIFVVNLPTVQGLGQFGGFDMWLQDRSGAGQEALINARNIVLGKAAEKQDALVGVRPNGL 712 + N+P + LG GFD L D++G G +AL ARN +LG AA+ +LV VRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 713 ENSPQLQLHVDRVQAQSMGLDVSDIYSSIQLMLAPVYVNDYFAEGRIKRVNMRADDQFRA 772 E++ Q +L VD+ +AQ++G+ +SDI +I L YVND+ GR+K++ ++AD +FR Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 773 GPESLRDFFTPSATATGADGQPAMIPLSNVVKAEWNYASPALNRYNGYSAVNIVGNPAPG 832 PE + + S A+G+ M+P S + W Y SP L RYNG ++ I G APG Sbjct: 781 LPEDVDKLYVRS-----ANGE--MVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPG 833 Query: 833 GSSGQAMSAMEDIVNNDLPPGFGFDWSGMSYQEIIAGNAATLLLALSVVVVFLCLAALYE 892 SSG AM+ ME++ + LP G G+DW+GMSYQE ++GN A L+A+S VVVFLCLAALYE Sbjct: 834 TSSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYE 892 Query: 893 SWSIPVAVLLVVPIGVLGAITFSMLRGLPNDLYFKIGMITVIGLAAKNAILIVEFAVE-Q 951 SWSIPV+V+LVVP+G++G + + L ND+YF +G++T IGL+AKNAILIVEFA + Sbjct: 893 SWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLM 952 Query: 952 RAAGKTLREATLEAAHLRFRPILMTSFAFILGVLPLAISTGAGANSRHSIGTGVIGGMVF 1011 GK + EATL A +R RPILMTS AFILGVLPLAIS GAG+ +++++G GV+GGMV Sbjct: 953 EKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVS 1012 Query: 1012 ATVLGVIFIPLFFVVVRR 1029 AT+L + F+P+FFVV+RR Sbjct: 1013 ATLLAIFFVPVFFVVIRR 1030
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 40.2 bits (94), Expect = 1e-05 Identities = 17/108 (15%), Positives = 37/108 (34%) Query: 59 RSADVRARVDGVVLKRLYTEGANVTEGQPLFQIDPSQLKATLLQAQGQLAAAEATYTNAK 118 RS +++ + +V + + EG +V +G L ++ +A L+ Q L A T + Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154 Query: 119 IAATRARSLAPQQYVSRADIDTAEANERSSGANVQQARGAVEAARIQL 166 I + + + +E + + Q Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQK 202 Score = 32.1 bits (73), Expect = 0.004 Identities = 13/51 (25%), Positives = 24/51 (47%), Gaps = 4/51 (7%) Query: 59 RSADVRARVDGVVLK-RLYTEGANVTEGQPLFQIDPSQLKATLLQAQGQLA 108 +++ +RA V V + +++TEG VT + L I P L+ + Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPED---DTLEVTALVQ 373
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 93.0 bits (231), Expect = 2e-25 Identities = 31/105 (29%), Positives = 50/105 (47%), Gaps = 3/105 (2%) Query: 6 RILIVDDFSTMRRIVKNLLGDLGFTNTAEAEDGNSALAALRAGPFDFVVTDWNMPGMTGI 65 IL+ DD + +R ++ L G+ + + + AG D VVTD MP Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 66 DLLRNIRADAKLKHLPVMMVTAEAKREQIIEAAQCGVNGYIIKPF 110 DLL I+ LPV++++A+ I+A++ G Y+ KPF Sbjct: 64 DLLPRIK--KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 347 bits (891), Expect = e-120 Identities = 104/344 (30%), Positives = 182/344 (52%), Gaps = 2/344 (0%) Query: 8 GERTELPTEKRLREAREQGNIPQSRELSTAAVFGAGVFALMVLARGIGDGAAVWMKTALS 67 GE+TE PT K++R+AR++G + +S+E+ + A+ A LM L+ + + M + Sbjct: 3 GEKTEQPTPKKIRDARKKGQVAKSKEVVSTALIVALSAMLMGLSDYYFEHFSKLM--LIP 60 Query: 68 PDPKMRENPMALFGHFGDLLLQLLWVMLPLIGICLAAGLAGPLMMSGLRFSGKAIMPDLS 127 + AL ++LL+ ++ PL+ + +A ++ G SG+AI PD+ Sbjct: 61 AEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIK 120 Query: 128 KLNPANGIKRMWGSNSLAELIKSVLRLLFVGLAASFCISKGLHGLRSLVNQPLEQAIGNG 187 K+NP G KR++ SL E +KS+L+++ + + I L L L +E Sbjct: 121 KINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLL 180 Query: 188 LDFTKSLLFYTAGALVLLAAIDAPYQKWNWLRKLKMTREEIKREMKESEGSPEVKGRIRQ 247 + L+ V+++ D ++ + ++++LKM+++EIKRE KE EGSPE+K + RQ Sbjct: 181 GQILRQLMVICTVGFVVISIADYAFEYYQYIKELKMSKDEIKREYKEMEGSPEIKSKRRQ 240 Query: 248 MQMQMSQRQMMEAVPKADVVLMNPTHYAVALKYEGGKMRAPIVVAKGVDEMAFRIREAGE 307 ++ R M E V ++ VV+ NPTH A+ + Y+ G+ P+V K D +R+ E Sbjct: 241 FHQEIQSRNMRENVKRSSVVVANPTHIAIGILYKRGETPLPLVTFKYTDAQVQTVRKIAE 300 Query: 308 QHRVAIVTAPPLARALHREAQIGKEIPVRLYSVVAQVLSYVYQL 351 + V I+ PLARAL+ +A + IP A+VL ++ + Sbjct: 301 EEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLRWLERQ 344
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 42.9 bits (100), Expect = 3e-07 Identities = 36/159 (22%), Positives = 76/159 (47%), Gaps = 7/159 (4%) Query: 51 HEGFARGHAEGFAQGQSEVRRLTAQIDGILDNFTRPLARLENEVVGALGELAVRIAGQLV 110 EG A+G +G A+ +S+ + A++ ++ F L L++ + L ++A+ A Q++ Sbjct: 73 QEGLAQGLEQGLAEAKSQQAPIHARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVI 132 Query: 111 GRVYQADPQLLADLVGEAVDAVGGAGREVEVRLHPDDITALLPHLAPSSTT---RVAPDM 167 G+ D L + + + + ++R+HPDD+ + L + + R+ D Sbjct: 133 GQTPTVDNSALIKQIQQLLQQEPLFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDP 192 Query: 168 SLSRGDLRVHAESVRIDGTLDARLRAALETVMRKSGAGL 206 +L G +V A+ +G LDA + + + R + G+ Sbjct: 193 TLHPGGCKVSAD----EGDLDASVATRWQELCRLAAPGV 227
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 308 bits (791), Expect = e-106 Identities = 106/329 (32%), Positives = 200/329 (60%) Query: 1 MTGVQRAAVLLLSLGESDAAEVLKHMDPKEVQKIGIAMATMTGISRDQVEKVMDEFNGEL 60 +TG Q+AA+LL+S+G +++V K++ +E++ + +A + I+ + + V+ EF + Sbjct: 15 LTGKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELM 74 Query: 61 AGKTSLGVGADDYIRNVLIQALGADKAGGLIDRILLGRNTTGLDTLKWMDPRAVADLVRN 120 + + G DY R +L ++LG KA +I+ + + + ++ DP + + ++ Sbjct: 75 MAQEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQ 134 Query: 121 EHPQIIAIVMAHLDSDQAAEALKLLPERTRADVLLRIATLDGIPPNALSELNDIMERQFA 180 EHPQ IA+++++LD +A+ L LP + +V RIA +D P + E+ ++E++ A Sbjct: 135 EHPQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLA 194 Query: 181 GNQNLKSSNVGGIKVAANILNFLDTGSDQGVLGEIGKIDADLAGKIQDLMFVFDNLVDLD 240 + ++ GG+ I+N D +++ ++ + + D +LA +I+ MFVF+++V LD Sbjct: 195 SLSSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLD 254 Query: 241 DRGLQTLLREVSGERLGLALRGADVKVREKITRNMSQRAAEILLEDMEARGPVRLADVEA 300 DR +Q +LRE+ G+ L AL+ D+ V+EKI +NMS+RAA +L EDME GP R DVE Sbjct: 255 DRSIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEE 314 Query: 301 AQKEILTIVRRLADEGAISLGGAGAEAMV 329 +Q++I++++R+L ++G I + G E ++ Sbjct: 315 SQQKIVSLIRKLEEQGEIVISRGGEEDVL 343
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 351 bits (902), Expect = e-116 Identities = 187/575 (32%), Positives = 300/575 (52%), Gaps = 45/575 (7%) Query: 16 KAGQWFDRVRSLQITRKLTMMAMIALAVAAGLAVFFWSQKPGYQSLYTGLDEKGNAEAAD 75 K +W +R+R+ ++ ++ + AVA +A+ W++ P Y++L++ L ++ Sbjct: 11 KPLEWLNRLRANP---RIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVA 67 Query: 76 LLRTAQIPYKIDQGTGAISVPQDRLYDARLKLAGSGLTGKETGGGFELMEKDPGFGVSQF 135 L IPY+ G+GAI VP D++++ RL+LA GL K GFEL++++ FG+SQF Sbjct: 68 QLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQGLP-KGGAVGFELLDQEK-FGISQF 125 Query: 136 VESARYQHALETELSRTIGTLRPVREARVHLAIPKPSAFTRQRDVASASVVLELRGGQGL 195 E YQ ALE EL+RTI TL PV+ ARVHLA+PKPS F R++ SASV + L G+ L Sbjct: 126 SEQVNYQRALEGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRAL 185 Query: 196 ERNQVDAIVNLVASSIPDMTPERVTVVDQSGRMLSIADPNSDAAQHAAQFEQVRRQESSY 255 + Q+ A+V+LV+S++ + P VT+VDQSG +L+ ++ + AQ + ES Sbjct: 186 DEGQISAVVHLVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLN-DAQLKFANDVESRI 244 Query: 256 NQRIRELLEPMTGPGRVNPETSVDMDFSVVEEARELYN----GEPAKLRSEQVSD-TSTS 310 +RI +L P+ G G V+ + + +DF+ E+ E Y+ A LRS Q++ Sbjct: 245 QRRIEAILSPIVGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVG 304 Query: 311 ATGPQGPPGATSNSPGQPPAPAVAGAPGT--------PAAANGQAAAPATPTESSKSATR 362 A P G PGA SN P PP A P T P + + A P + ++ T Sbjct: 305 AGYPGGVPGALSNQP-APPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETS 363 Query: 363 NYELDRTLQHTRQPAGRIKRVSVAVLLDNVPRPGAKGKMVEQPLTAAELTRIEGLVKQAV 422 NYE+DRT++HT+ G I+R+SVAV+++ K PLTA ++ +IE L ++A+ Sbjct: 364 NYEVDRTIRHTKMNVGDIERLSVAVVVNYKTLADGKP----LPLTADQMKQIEDLTREAM 419 Query: 423 GFDAARGDTVSVMNAPFVREAVAGEEGPKWWEDPRVQNGLRLLVGAVVVLALLF----GV 478 GF RGDT++V+N+PF G E P W + + L ++VL + + Sbjct: 420 GFSDKRGDTLNVVNSPFSAVDNTGGELPFWQQQSFIDQLLAAG-RWLLVLVVAWILWRKA 478 Query: 479 VRPTLRQLTGVTAVKDKQGKAGKDGTPQSADVRMVEDDDDLMPRLEEDTAQIGQDKKTPI 538 VRP L + +Q + ++ + A + D+ L Q ++ Sbjct: 479 VRPQLTRRVEEAKAAQEQAQVRQET--EEAVEVRLSKDEQL------------QQRRANQ 524 Query: 539 ALPDAYEERMRLAREAVKADSKRVAQVVKGWVASE 573 L E + RE D + VA V++ W++++ Sbjct: 525 RLG--AEVMSQRIREMSDNDPRVVALVIRQWMSND 557
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 60.1 bits (145), Expect = 3e-15 Identities = 28/84 (33%), Positives = 48/84 (57%) Query: 40 AGAQGTPATQAPSFSETLRGAIGGVNEAQQKAGALSKAFEMGDPNADLARVMVASQQSQV 99 A AQ + SF+ L A+ +++ Q A ++ F +G+P L VM Q++ V Sbjct: 20 ARAQESLPQPTISFAGQLHAALDRISDTQTAARTQAEKFTLGEPGVALNDVMTDMQKASV 79 Query: 100 AFRATVEVRNRLVQAYQDVMNMPL 123 + + ++VRN+LV AYQ+VM+M + Sbjct: 80 SMQMGIQVRNKLVAAYQEVMSMQV 103
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 109 bits (273), Expect = 7e-31 Identities = 68/260 (26%), Positives = 127/260 (48%), Gaps = 13/260 (5%) Query: 7 FNPFSLADKRILVSGASSGLGRAIALGCARMGGELIVSGRDPQRLDATLADLRAISERPH 66 N + K ++GA+ G+G A+A A G + +P++L+ ++ L+A + Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60 Query: 67 QALRADLTVATERASLVAALS---APLHGVVHSAGISRLCPARMVGEAHLREVQATNVDA 123 AD+ + + A + P+ +V+ AG+ R + + + N Sbjct: 61 A-FPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTG 119 Query: 124 PILLTQGLLKRNLIAADGAIVFIASIAAHIGVAGVGAYSASKAALIAYARCLAMEVVKRH 183 ++ + K + G+IV + S A + + AY++SKAA + + +CL +E+ + + Sbjct: 120 VFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYN 179 Query: 184 IRVNCLSPALVDTPLL-------DATAQVV-GSLETERSNYPLG-FGRPDDVANAAIFLL 234 IR N +SP +T + + QV+ GSLET ++ PL +P D+A+A +FL+ Sbjct: 180 IRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239 Query: 235 SGASRWITGTSLVMDGGLTI 254 SG + IT +L +DGG T+ Sbjct: 240 SGQAGHITMHNLCVDGGATL 259
>PF04183#IucA / IucC family Length = 580 Score = 29.1 bits (65), Expect = 0.028 Identities = 16/45 (35%), Positives = 22/45 (48%), Gaps = 4/45 (8%) Query: 71 ERLQWKREEIDALIVVTQSPDYPIPATAII--LQDRLGLSHATVA 113 ER W IDA + D P+ A ++ L+ L +S ATVA Sbjct: 51 ERGIWGWLWIDAQTLRCA--DEPVLAQTLLMQLKQVLSMSDATVA 93
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 437 bits (1125), Expect = e-152 Identities = 173/489 (35%), Positives = 249/489 (50%), Gaps = 16/489 (3%) Query: 1 MSESRILLIDSDAVRAERTVSLLEFMDFNPRWVTDGADINPGRHRHDEWMAVMVGSAQDA 60 M+ + IL+ D DA L ++ R ++ A + R ++V Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLW--RWIAAGDGDLVVTDVVMP 58 Query: 61 -AQADKFFDWLADAKLPPPVLLMEGSPSAFAQTHGLHEANVWALDTPLRHAQLEALLRRA 119 A + A+ PVL+M + + L P +L ++ RA Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 120 S--LKRLDAEHQAGVQQDSGPTGNSEAVTRLRRLIDQVAAFDTTVLVLGESGTGKEVVAR 177 KR ++ + Q G S A+ + R++ ++ D T+++ GESGTGKE+VAR Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVAR 178 Query: 178 AIHQHSPRRDGPFVAINCGAIPPDLLESELFGHEKGAFTGALTTRKGRFEMAEGGTLLLD 237 A+H + RR+GPFVAIN AIP DL+ESELFGHEKGAFTGA T GRFE AEGGTL LD Sbjct: 179 ALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLD 238 Query: 238 EIGDMSLPMQVKLLRVLQERSFERVGGGQTIRCNVRVIAATHRNLESRISDGQFREDLFY 297 EIGDM + Q +LLRVLQ+ + VGG IR +VR++AAT+++L+ I+ G FREDL+Y Sbjct: 239 EIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYY 298 Query: 298 RLNVFPIEMPALRERVDDLAMLVQTIAGQLARTGRGEVRFADEALQALRGYDWPGNVREL 357 RLNV P+ +P LR+R +D+ LV+ Q + G RF EAL+ ++ + WPGNVREL Sbjct: 299 RLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWPGNVREL 358 Query: 358 TNLVERLAVLHPGGLVRVQDLPARYRGDFASAIPVELPPEPELVTAPVEVSALPSNVVTL 417 NLV RL L+P ++ + + R + + + ++ V Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418 Query: 418 QPKTADAEPSATSSLPDDGIDLRGHMANIELALINEALERTQGVVAHAAQLLGLRRTTLV 477 +A +E LI AL T+G AA LLGL R TL Sbjct: 419 FG-----------DALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLR 467 Query: 478 EKLRKYGID 486 +K+R+ G+ Sbjct: 468 KKIRELGVS 476
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 55.2 bits (133), Expect = 3e-12 Identities = 20/118 (16%), Positives = 44/118 (37%), Gaps = 2/118 (1%) Query: 1 MSKLTVLLVDDHEGFINAAMRHFRKVEWLDIVGSAANGLEAIERSESLRPNVVLMDLAMP 60 M+ T+L+ DD + + + V +N + ++V+ D+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 EMGGLQATRLIKTQDDPPYIVIASHFDDAEHREHALRAGADNFVSKLSYIQEVMPILE 118 + IK +++ S + A GA +++ K + E++ I+ Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 71.8 bits (176), Expect = 6e-17 Identities = 35/160 (21%), Positives = 66/160 (41%), Gaps = 9/160 (5%) Query: 2 RVIIVDDHTLVRAGLSRLLQTFAGIDVVGEASNAQQALDMTSLHRPDLVLMDLSLPGRSG 61 +++ DD +R L++ L + AG DV SNA + DLV+ D+ +P + Sbjct: 5 TILVADDDAAIRTVLNQAL-SRAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 62 LDAMTDVLRAAPRTHVVMMSMHDDPVHVRDALDRGAVGFVVKDAAPLELELALRAAAAGQ 121 D + + +A P V++MS + + A ++GA ++ K EL + A Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA---- 118 Query: 122 VFLSPQISSKMIAPMLGREKPVGIAALSPRQREILREIGR 161 + + + + + S +EI R + R Sbjct: 119 ---LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLAR 155
>FLAGELLIN#Flagellin signature. Length = 507 Score = 135 bits (341), Expect = 2e-37 Identities = 124/360 (34%), Positives = 181/360 (50%), Gaps = 10/360 (2%) Query: 2 AQVINTNVMSLNAQRNLNTSSASMSTSIQRLSSGLRINSAKDDAAGLAISERFTTQIRGL 61 AQVINTN +SL Q NLN S +S+S++I+RLSSGLRINSAKDDAAG AI+ RFT+ I+GL Sbjct: 1 AQVINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGL 60 Query: 62 DVASRNANDGISLAQTAEGAMVEIGSNLQRIRELSVQSSNATNSSTDRDALNSEVKQLTA 121 ASRNANDGIS+AQT EGA+ EI +NLQR+RELSVQ++N TNS +D ++ E++Q Sbjct: 61 TQASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLE 120 Query: 122 EIDRVANQTNFNGTKLLDGSFSGALFQVGADAGQTIGINSIVDANVDSLGKANFAAAVSG 181 EIDRV+NQT FNG K+L QVGA+ G+TI I + +V SLG F Sbjct: 121 EIDRVSNQTQFNGVKVLSQDNQ-MKIQVGANDGETITI-DLQKIDVKSLGLDGFNVNGPK 178 Query: 182 AGVTGTATASGSVSGISLSFNDASGSAKSVTIADVKIAAGDTAADVNKKVASAINDKLDQ 241 G +S ++ + + + + +K +A N +L Sbjct: 179 EATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTT 238 Query: 242 TGMYASIKSDGSLQIESLKAGQDFTSLSAG--------TSSAAGITVGAGITTASAASGS 293 + D +S + +++ T G+T T + +G Sbjct: 239 DDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGK 298 Query: 294 TASTLSSLDISTFSGSQKALEIVDKALTAVNSSRADMGAVQNRFTSTIANLSATSENLSA 353 ++T++ ++ A A T +S V +FT + +++ Sbjct: 299 VSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDL 358 Score = 97.4 bits (242), Expect = 4e-24 Identities = 74/340 (21%), Positives = 133/340 (39%), Gaps = 3/340 (0%) Query: 60 GLDVASRNANDGISLAQTAEGAMVEIGSNLQRIRELSVQSSNATNSSTDRDALNSEVKQL 119 G +V L + + + + +S A + T + +V Sbjct: 171 GFNVNGPKEATVGDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVN 230 Query: 120 TAEIDRVANQTNFNGTKLLDGSFSGALFQVGADAGQTIGINSIVDANVDSLGKANFAAAV 179 A + N L + A A D G Sbjct: 231 AANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTK 290 Query: 180 SGAGVTGTATASGSVSGISLSFNDASGSAKSVTIADVKIAAGDTAADVNKKVASAINDKL 239 +G G + + + ++L+ D + A +V A ++ + + VN + K Sbjct: 291 TGNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKN 350 Query: 240 DQTGMYASIKSDGSLQIESLKAGQDFTSLSAGTSSAAGITVGAGITTASAASGSTASTLS 299 + + ++ + + ++ +T+ + ++ ++ Sbjct: 351 ESAKLSDLEANNAVKGESKITVN---GAEYTANAAGDKVTLAGKTMFIDKTASGVSTLIN 407 Query: 300 SLDISTFSGSQKALEIVDKALTAVNSSRADMGAVQNRFTSTIANLSATSENLSASRSRIR 359 + + L +D AL+ V++ R+ +GA+QNRF S I NL T NL+++RSRI Sbjct: 408 EDAAAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIE 467 Query: 360 DTDYAKETAELTRTQILQQAGTAMLAQAKSVPQNVLSLLQ 399 D DYA E + +++ QILQQAGT++LAQA VPQNVLSLL+ Sbjct: 468 DADYATEVSNMSKAQILQQAGTSVLAQANQVPQNVLSLLR 507
>FLAGELLIN#Flagellin signature. Length = 507 Score = 58.5 bits (141), Expect = 2e-11 Identities = 62/349 (17%), Positives = 111/349 (31%), Gaps = 6/349 (1%) Query: 4 RISTSMMYSQSVASMGAKQSRLNQFESQLSSGQRLVTAKDDPVAAGTAVGLDRALAAITR 63 I+T+ + + ++ QS L+ +LSSG R+ +AKDD A + +T+ Sbjct: 3 VINTNSLSLLTQNNLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQ 62 Query: 64 FGENANNVQNRLGLQENALSQAGDKMARVTELAVQASNSSLSPDDRKAIASELTALRESM 123 NAN+ + E AL++ + + RV EL+VQA+N + S D K+I E+ E + Sbjct: 63 ASRNANDGISIAQTTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEI 122 Query: 124 VSLANSTDGTGRYLFGGTADGSAPFIKSNG---SVTYNGDQTQKQVEVAPDTFVSDTLPG 180 ++N T G + ++G ++ + + Sbjct: 123 DRVSNQTQFNGVKVLSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATV 182 Query: 181 SEIFMRIRTGDGTVDAHANAGNTGTGLLLDFSRDASTGSWNGGSYSVQFTAADTYEVRDS 240 ++ + G A + +T V Sbjct: 183 GDLKSSFKNVTGYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAE 242 Query: 241 TNTVVGTGTYKEG--EDINAAGVRMRISGAPAVGDSFQIGASGTKDVFSTID-DLVGALN 297 NT V + A + I G G + T D + D + + Sbjct: 243 NNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTT 302 Query: 298 SDTLTAPQKAAMINTLQTSMRDITQASSKMIDARTSGGAQLSAIDNANA 346 + A I ++ T SSK + G N Sbjct: 303 INGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNE 351 Score = 36.6 bits (84), Expect = 1e-04 Identities = 50/269 (18%), Positives = 83/269 (30%), Gaps = 1/269 (0%) Query: 127 ANSTDGTGRYLFGGTADGSAPFIKSNGSVTYNGDQTQKQVEVAPDTFVSDTLPGSEIFMR 186 AN T D + G+ + DTF + + Sbjct: 232 ANGQLTTDDAENNTAVDLFKTTKSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKT 291 Query: 187 IRTGDGTVDAHANAGNTGTGLLLDFSRDASTGSWNGGSYSVQFTAADTYEVRDSTNTVVG 246 G+G V N + + A+ + S +T+ + T Sbjct: 292 GNDGNGKVSTTINGEKVTLTVADITAGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNE 351 Query: 247 TGTYKEGEDINAAGVRMRISGAPAVGDSFQIGASGTKDVFSTIDDLVGALNSDTLTAPQK 306 + + E NA +I+ A + G T + D A TL Sbjct: 352 SAKLSDLEANNAVKGESKITVNGAEYTANAAGDKVTLAGKTMFID-KTASGVSTLINEDA 410 Query: 307 AAMINTLQTSMRDITQASSKMIDARTSGGAQLSAIDNANALLESNEVTLKTSLSSIRDLD 366 AA + + I A SK+ R+S GA + D+A L + L ++ S I D D Sbjct: 411 AAAKKSTANPLASIDSALSKVDAVRSSLGAIQNRFDSAITNLGNTVTNLNSARSRIEDAD 470 Query: 367 YASAIGQYQLEKASLQAAQTIFQQMQSSS 395 YA+ + + QA ++ Q Sbjct: 471 YATEVSNMSKAQILQQAGTSVLAQANQVP 499
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 227 bits (580), Expect = 7e-69 Identities = 141/437 (32%), Positives = 220/437 (50%), Gaps = 8/437 (1%) Query: 2 SIMSTGTSALIAFQRALSTVSHNVANINTEGYSRQRVEFATRTPTDMGYAFVGNGAKITD 61 S+++ S L A Q AL+T S+N+++ N GY+RQ A T +VGNG ++ Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSG 61 Query: 62 VGRVADQLAISRLLDSGGELSRLQQLSSLSNRVDALYSNTATNVAGLWSNFFDSTSAVSS 121 V R D ++L + + S L +++D + S + +++A +FF S + S Sbjct: 62 VQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLVS 121 Query: 122 NASSTAERQSMLDSGNSLATRFKQLNGQMDSLSNEVNSGLTSSVDEVNRLTQQIAKLNGT 181 NA A RQ+++ L +FK + + +VN + +SVD++N +QIA LN Sbjct: 122 NAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQ 181 Query: 182 I----GSSAQAAAPDMLDQRDALVSKLVGFTGGTAVIQDGGFMNVFTAGGQPLVVGTTSS 237 I G A A+ ++LDQRD LVS+L G +QDGG N+ A G LV G+T+ Sbjct: 182 ISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTAR 241 Query: 238 KLVTAADPYEPTKLQVAMQTQGQNVSLSASSL--GGQIGGLLEFRSSVLEPTQAELGRLA 295 +L +P++ VA L G +GG+L FRS L+ T+ LG+LA Sbjct: 242 QLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILTFRSQDLDQTRNTLGQLA 301 Query: 296 VGMASTFNAGHSQGMDLYGAMGGNFFNIGSPAVAANPSNTGSASLSASFSNVSAVDGQNV 355 + A FN H G D G G +FF IG PAV N N G ++ A+ ++ SAV + Sbjct: 302 LAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATDY 361 Query: 356 TLSFDGTNWKAINASTGSAVPMTGTGTAADPLVLNGVSMVVGGTPASGDKFLLQPTAGLA 415 +SFD W+ ++ + T T A + +G+ + GTPA D F L+P + Sbjct: 362 KISFDNNQWQVTRLASNTT--FTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSDAI 419 Query: 416 GSLSVAITDPSRIAAAT 432 ++ V ITD ++IA A+ Sbjct: 420 VNMDVLITDEAKIAMAS 436 Score = 82.3 bits (203), Expect = 1e-18 Identities = 38/105 (36%), Positives = 56/105 (53%) Query: 517 AGSSDNGNAKLLANIDDAKALSGGTVTLNGALSGLTTSVGSAARAASYSADAQKVINDQA 576 AG SDN N + L ++ GG + N A + L + +G+ S+ Q + Q Sbjct: 440 AGDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQL 499 Query: 577 QASRDSISGVNLDEEAANMLKLQQAYQAAAQMISTADTIFQAILG 621 + SISGVNLDEE N+ + QQ Y A AQ++ TA+ IF A++ Sbjct: 500 SNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 130 bits (327), Expect = 5e-37 Identities = 63/140 (45%), Positives = 82/140 (58%), Gaps = 4/140 (2%) Query: 218 FVAKIWTHAQKAARELGVDPRALVAQAALETGWGRRGI--GNGGDSNNLFGIKATG-WSG 274 F+A++ AQ A+++ GV ++AQAALE+GWG+R I NG S NLFG+KA+G W G Sbjct: 152 FLAQLSLPAQLASQQSGVPHHLILAQAALESGWGQRQIRRENGEPSYNLFGVKASGNWKG 211 Query: 275 DKVTTGTHEYVNGVKTTETADFRAYGSAEESFADYVRLLKNNSRYQTALQAGTDIKGFAR 334 T EY NG A FR Y S E+ +DYV LL N RY A+ + A+ Sbjct: 212 PVTEITTTEYENGEAKKVKAKFRVYSSYLEALSDYVGLLTRNPRY-AAVTTAASAEQGAQ 270 Query: 335 GLQQAGYATDPGYAAKIAAI 354 LQ AGYATDP YA K+ + Sbjct: 271 ALQDAGYATDPHYARKLTNM 290 Score = 71.3 bits (174), Expect = 5e-16 Identities = 57/178 (32%), Positives = 85/178 (47%), Gaps = 22/178 (12%) Query: 4 AASPIDLNPSTKADPA-KIDKVSRQLEGQFAQMLVKSMRDASGGDPMFPGQNQ-MFREMY 61 A S +L DPA I V+RQ+EG F QM++KSMRDA D +F ++ ++ MY Sbjct: 15 AQSLNELKAKAGEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLYTSMY 74 Query: 62 DQQMAKALTDGKGLGLSAMISKQLSGDTGGPALNTSL--------------NTAEAAKAY 107 DQQ+A+ +T GKGLGL+ M+ KQ++ + P +T N A + Sbjct: 75 DQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQALSQLVQ 134 Query: 108 ALVAGKRDASLPLPARDGAATGVTTSSVAKAALGAGNLSGIGMSQVLDLIAGRTGAGE 165 V D SLP ++ A ++ A A SG+ +L A +G G+ Sbjct: 135 KAVPRNYDDSLPGDSKAFLA------QLSLPAQLASQQSGVPHHLILAQAALESGWGQ 186
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 93.7 bits (233), Expect = 1e-24 Identities = 25/131 (19%), Positives = 52/131 (39%) Query: 5 APVVYLIDDDASMRAALEDLFASVGLQVYAFGSTDQFLAHRLHEVPACLVLDIRMPGQSG 64 + + DDDA++R L + G V + +V D+ MP ++ Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 65 MEFHRRMVESGVALPTIFITGHGDIAMSVEAMKNGAIEFLTKPFRDQALLDAIQDGIRRD 124 + R+ ++ LP + ++ +++A + GA ++L KPF L+ I + Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122 Query: 125 RARRQSEAVAA 135 + R + Sbjct: 123 KRRPSKLEDDS 133
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 645 bits (1665), Expect = 0.0 Identities = 233/1034 (22%), Positives = 426/1034 (41%), Gaps = 43/1034 (4%) Query: 11 QRRGIVWLVFVLIALYGTWSWTQLPVEAYPDIADVTSQVVTQVPGLGAEEVEQQITVPLE 70 +R W++ +++ + G + QLPV YP IA V PG A+ V+ +T +E Sbjct: 7 RRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIE 66 Query: 71 RALMGTPGLHVLRSRSLFA-LSLITLVFDDGTEGYFARQRVLERIQAVT--LPYGA-IPG 126 + + G L + S S A ITL F GT+ A+ +V ++Q T LP G Sbjct: 67 QNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQQG 126 Query: 127 LDPYTSPTGEIYRYTLES--KTRSLRELSDLQFWTVIPRLQKVPGVADVTNFGGLTTQFS 184 + S + + S + ++SD V L ++ GV DV FG Sbjct: 127 ISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA-QYAMR 185 Query: 185 LALEPDRLTRYGVSLQQVKSAITSNNAD------GGGSVMDRGEQSYVIRGIGLLHSLQD 238 + L+ D L +Y ++ V + + N GG + + + I + ++ Sbjct: 186 IWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPEE 245 Query: 239 IGNVVVSSS-NGVPVLVKDLGEVRYDNVERRGILGKDGNPDTIEGIALLLKDSNPSVALQ 297 G V + + +G V +KD+ V I +G P L +N + Sbjct: 246 FGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKP-AAGLGIKLATGANALDTAK 304 Query: 298 GIHSAVEELNNSVLPKDVKVVPYLDRTALIDATLHTVSATLTEGMLLVCVVLLIFLGSPR 357 I + + EL P+ +KV+ D T + ++H V TL E ++LV +V+ +FL + R Sbjct: 305 AIKAKLAELQPF-FPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMR 363 Query: 358 AAAIVSLTIPLSLLIAFIFMHHLKIPANLLSLG--AIDFGILVDGAVVLVENVLRLREEN 415 A I ++ +P+ LL F + N L++ + G+LVD A+V+VENV R+ E+ Sbjct: 364 ATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMED 423 Query: 416 SQRALTARDAIDATLQVARPIFFGMAVIGCAYLPLLAFERIEYKLFSPMAYAVGAALIGA 475 A + Q+ + V+ ++P+ F ++ + + +A+ + Sbjct: 424 KLPPKEA--TEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 476 LLVALTLIPGLAWLAFRKPRKMLH-----------NRALETLGQRYRAVLERSVGRRGWL 524 +LVAL L P L KP H N + Y + + +G G Sbjct: 482 VLVALILTPALCAT-LLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 525 LACAALALCVLAVLGGSIGRDFLPYIDEGSLWLQVQMPPGITLDKAATMANALRKATL-- 582 L AL + + VL + FLP D+G +Q+P G T ++ + + + L Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 583 EFPEVSYVVTQTGRNDDGTDYWTPSHTEASVGLRPYKDWP-AGMDKQALIAALGARYAQM 641 E V V T G + G + A V L+P+++ +A+I ++ Sbjct: 601 EKANVESVFTVNGFSFSGQ---AQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKI 657 Query: 642 PGYTVSMMQPMIDGVQDKLSGAHSDLTVKVFGDDLQQVRGVADQVAAALHKVPGA-ADIA 700 V +G +L + G + +Q+ + P + + Sbjct: 658 RDGFVIPFNMPAIVELGTATGFDFEL-IDQAGLGHDALTQARNQLLGMAAQHPASLVSVR 716 Query: 701 VDVEPPLPNLQVRFDRAAAARYGINAADVSDLISTGIGGSPIGQMYLGEKSYDLTVRFPQ 760 + ++ D+ A G++ +D++ IST +GG+ + + L V+ Sbjct: 717 PNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADA 776 Query: 761 RYRNDPQAIGALRLRTAAGAEIPLSAVASITTTSGRSVIVREMGRRNIIVRLNVRGRDLS 820 ++R P+ + L +R+A G +P SA + G + R G ++ ++ Sbjct: 777 KFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP---G 833 Query: 821 SFLSDAQATLARQVRVDPQHMQLVWGGQFENLQRAQARLLVVLPTTLCIMFVLLFGAFGN 880 + DA A + P + W G + + + ++ + ++F+ L + + Sbjct: 834 TSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYES 893 Query: 881 LRQPTLVLAAVPLAMIGGLAALHLRGMTLNVSSAVGFVALFGVAVLNAVLMLAQIHRLRH 940 P V+ VPL ++G L A L +V VG + G++ NA+L++ L Sbjct: 894 WSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLME 953 Query: 941 DVGMPLREAVVAGAVSRMRPVLMTATVAALGLAPAMLATGLGSDVQRPLATVVVGGLVTA 1000 G + EA + R+RP+LMT+ LG+ P ++ G GS Q + V+GG+V+A Sbjct: 954 KEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSA 1013 Query: 1001 TVLTLVLLPSLYYL 1014 T+L + +P + + Sbjct: 1014 TLLAIFFVPVFFVV 1027 Score = 92.6 bits (230), Expect = 4e-21 Identities = 67/344 (19%), Positives = 137/344 (39%), Gaps = 15/344 (4%) Query: 682 VADQVAAALHKVPGAADIAVDVEPPLPNLQVRFDRAAAARYGINAADVSDLISTG----I 737 VA V L ++ G D+ + +++ D +Y + DV + + Sbjct: 158 VASNVKDTLSRLNGVGDVQLFGAQYA--MRIWLDADLLNKYKLTPVDVINQLKVQNDQIA 215 Query: 738 GGSPIGQMYLGEKSYDLTVRFPQRYRNDPQAIGALRLRTAA-GAEIPLSAVASITTTS-G 795 G G L + + ++ R++N P+ G + LR + G+ + L VA + Sbjct: 216 AGQLGGTPALPGQQLNASIIAQTRFKN-PEEFGKVTLRVNSDGSVVRLKDVARVELGGEN 274 Query: 796 RSVIVREMGRRNIIVRLNVR-GRDLSSFLSDAQATLARQVRVDPQHMQLVWGGQFENLQR 854 +VI R G+ + + + G + +A LA PQ M++++ + Sbjct: 275 YNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQ 334 Query: 855 AQARLLV---VLPTTLCIMFVLLFGAFGNLRQPTLVLAAVPLAMIGGLAALHLRGMTLNV 911 +V L + + LF N+R + AVP+ ++G A L G ++N Sbjct: 335 LSIHEVVKTLFEAIMLVFLVMYLF--LQNMRATLIPTIAVPVVLLGTFAILAAFGYSINT 392 Query: 912 SSAVGFVALFGVAVLNAVLMLAQIHRLRHDVGMPLREAVVAGAVSRMRPVLMTATVAALG 971 + G V G+ V +A++++ + R+ + +P +EA ++ A V + Sbjct: 393 LTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAV 452 Query: 972 LAPAMLATGLGSDVQRPLATVVVGGLVTATVLTLVLLPSLYYLM 1015 P G + R + +V + + ++ L+L P+L + Sbjct: 453 FIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATL 496
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 55.2 bits (133), Expect = 1e-10 Identities = 38/238 (15%), Positives = 76/238 (31%), Gaps = 52/238 (21%) Query: 115 AELANAYSEAGKARATLEQARLELARQKTLAADSISAARDLQAAQQAFDSAGNDARAASD 174 AE + + + L +L A + + + A N+ R Sbjct: 214 AERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKS 273 Query: 175 RLAQLGVAAQASSHR--------------------------------------RYVLRAP 196 +L Q+ ++ V+RAP Sbjct: 274 QLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAP 333 Query: 197 IAGRVVDLSA-ALGGFWNDTSASLMTVADISQVWLTASVPEREVGQVFEGQPVTASLDAY 255 ++ +V L GG ++ V + + +TA V +++G + GQ ++A+ Sbjct: 334 VSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF 393 Query: 256 PGQRF---VGHVQHV--DDLLDPAT-------RTLKVRVALTNRDGL-LKPGMFARAQ 300 P R+ VG V+++ D + D +++ T + L GM A+ Sbjct: 394 PYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAE 451 Score = 36.3 bits (84), Expect = 2e-04 Identities = 26/132 (19%), Positives = 46/132 (34%), Gaps = 9/132 (6%) Query: 80 RLVRVVPPLAGRVVALPKTLGDTVHAGDVLCVLDSAELANAYSEAGKARATLEQARLELA 139 R + P V + G++V GDVL L + A ++ K +++L QARLE Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTA---LGAEADTLKTQSSLLQARLEQT 151 Query: 140 RQKTLAADSISAARDLQAAQQAFDSAGNDARAASDRLAQLGVAAQASSHRR---YVLRAP 196 R S S + + D + + L + + S + Y Sbjct: 152 R---YQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELN 208 Query: 197 IAGRVVDLSAAL 208 + + + L Sbjct: 209 LDKKRAERLTVL 220
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 121 bits (304), Expect = 5e-32 Identities = 72/325 (22%), Positives = 117/325 (36%), Gaps = 43/325 (13%) Query: 78 NADLAQQAGARGQGVKLAVLDDNLVPSYAPISGKVDSFNDYTASPGTPESSANALRGHGT 137 A G+GVK+AVLD + + ++ ++T GHGT Sbjct: 30 QAPAVWNQTR-GRGVKVAVLDTGCDADHPDLKARIIGGRNFTDDDEGDPEIFKDYNGHGT 88 Query: 138 IVSALVLGSAQDGFAGGVAPDADLFYARICAENSCGTQQTRRAAVDLAAA-GVRIANLSI 196 V+ + + + GVAP+ADL ++ + G + A V I ++S+ Sbjct: 89 HVAGTIAATENENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSL 148 Query: 197 GASYPDATASANAALAWKYALTPLVQADALIVASTGNEGAAEAS-----YPAATPVQEAS 251 G A K A V + L++ + GNEG + YP Sbjct: 149 GGPEDVPELHE----AVKKA----VASQILVMCAAGNEGDGDDRTDELGYPGCYN----- 195 Query: 252 VRNNWLAVGAINIDSAGNAAGLTSYSNHCGAAAQWCLVAPGTYTAPALAGTELGGQIAGT 311 ++VGAIN D + +SN + LVAPG + G + +GT Sbjct: 196 ---EVISVGAINFDR-----HASEFSNSN---NEVDLVAPGEDILSTVPGGKY-ATFSGT 243 Query: 312 SFSTAAVSGVAAQVLGVYPW-----MSASNLQQTLLTTATDLGDPGVDALYGWGLVNAAK 366 S +T V+G A + + ++ L L+ LG + G GL+ Sbjct: 244 SMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLG--NSPKMEGNGLLYLTA 301 Query: 367 AIKGPGQFASNWAANVTAGYDSTFS 391 + + + AG ST S Sbjct: 302 V----EELSRIFDTQRVAGILSTAS 322
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 83.9 bits (207), Expect = 1e-21 Identities = 65/257 (25%), Positives = 109/257 (42%), Gaps = 19/257 (7%) Query: 3 VIVITGGSRGIGAGTALECAKRGMGVILTYQSQAEAAAAVVEEIKAKGGRAVALGLDVGD 62 + ITG ++GIG A A +G + E VV +KA+ A A DV D Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 63 VATFGEFKDAVARQLEDGWQVKKLSGLVNNAGHGLFNAIETVTEQQFDALCDVHLKGPFF 122 A E + R++ + LVN AG I ++++++++A V+ G F Sbjct: 69 SAAIDEITARIEREMG------PIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122 Query: 123 LTQALLPLF--ERGASIVNLTSATTRSATAGVAPYAACKGGLEVLTRYMAKEFGERGIRI 180 ++++ R SIV + S +A YA+ K + T+ + E E IR Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 181 NAVSPGAIRTELGGGM---DEAFEAVLSSQTA-------LGRIGEPEDVAHVIAMLLSQD 230 N VSPG+ T++ + + E V+ L ++ +P D+A + L+S Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 231 GQWINGQSIDVSGGYNL 247 I ++ V GG L Sbjct: 243 AGHITMHNLCVDGGATL 259
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 34.4 bits (79), Expect = 0.002 Identities = 27/121 (22%), Positives = 48/121 (39%), Gaps = 4/121 (3%) Query: 688 ASLLLLCDDAAELDRLEEMLAALGHEPVGMLELPAAVAMATADPMRFDGVLLK-RDRAGD 746 A++L+ DDAA L + L+ G++ A D V+ + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD--GDLVVTDVVMPDEN 61 Query: 747 AERAIGALHAAAPTLPLILATRATSLATR-KGLGGAITEIIAQPFDLGALAMALERALSR 805 A + + A P LP+++ + + T K + + +PFDL L + RAL+ Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 806 R 806 Sbjct: 122 P 122
>adhesinmafb#Neisseria meningitidis: adhesin MafB signature. Length = 467 Score = 26.6 bits (58), Expect = 0.029 Identities = 7/26 (26%), Positives = 13/26 (50%) Query: 67 VPSPPHRSNAANVIYLRDVIQRRHEE 92 + P ++ A ++ D QR+H E Sbjct: 21 LIQPALAADLAQDPFITDNAQRQHYE 46
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 115 bits (290), Expect = 3e-33 Identities = 76/263 (28%), Positives = 111/263 (42%), Gaps = 7/263 (2%) Query: 4 GIKQRIALISGGDSGMGKETARQLLEAGVRVAITDLPNGTLDQAVAELSGLGEII-AIEG 62 GI+ +IA I+G G+G+ AR L G +A D L++ V+ L A Sbjct: 5 GIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA 64 Query: 63 DVTQEQDVTRIWTQVRAQLGEPDIYVNAAGVTGATGDFLEVSDAGWLETLDINLMGAVRM 122 DV + I ++ ++G DI VN AGV G +SD W T +N G Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVL-RPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 123 CRQAIPAMRRKQWGRIVLFASEDAVQPYVDELAYCASKAGILSLAKGLSKAYGADNVLVN 182 R M ++ G IV S A P AY +SKA + K L N+ N Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 183 TVSPAFIATPMTDKMMQKRAQENGTSVEEAIASFLDEERPGMALKRRGRPEEVASVVAFL 242 VSP T+ MQ + E+ I L+ + G+ LK+ +P ++A V FL Sbjct: 184 IVSPG-----STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238 Query: 243 CSERASFINGAGVRVDSGSVFTI 265 S +A I + VD G+ + Sbjct: 239 VSGQAGHITMHNLCVDGGATLGV 261
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.3 bits (71), Expect = 0.008 Identities = 16/77 (20%), Positives = 29/77 (37%), Gaps = 8/77 (10%) Query: 219 VGAAVGVGGDTEQRIELLAAAGVDVVIVDTAHGHSQGVIDRVAWVKKTYPQLQVIGGNIV 278 G V + + +AA D+V+ D D + +KK P L V+ ++ Sbjct: 26 AGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA-FDLLPRIKKARPDLPVL---VM 81 Query: 279 TG----DAALALMDAGA 291 + A+ + GA Sbjct: 82 SAQNTFMTAIKASEKGA 98
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 81.4 bits (201), Expect = 7e-19 Identities = 61/342 (17%), Positives = 113/342 (33%), Gaps = 71/342 (20%) Query: 286 TVMVTGAGGSIGSEVCRQCARHGARRI----------VLLEIDELALLTIDSDLRRLFPD 335 +VTGA G IG V ++ G + + V L+ L LL P Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLA--------QPG 53 Query: 336 IEVVRVLGDCGDPAVVAHALNTATPDAVFHAAAYKQVPLLEEQLREAVRNNVLATENVAR 395 + + D D + + + VF + V E +N+ N+ Sbjct: 54 FQFHK--IDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILE 111 Query: 396 ACQRARIETFVFIST---------------DKAVEPVNVLGASKRYAEMICQSLDA-RDA 439 C+ +I+ ++ S+ D PV++ A+K+ E++ + Sbjct: 112 GCRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171 Query: 440 PTRFITVRFGNVLDSAGS---VVPLFREQIRQGGPVTV-THPDVTRYFMTIPEACQLVIQ 495 P +RF V G + F + + +G + V + + R F I + + +I+ Sbjct: 172 PA--TGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIR 229 Query: 496 A------------------AASASHGAIYTLDMGEPVPIRLLAEQMIRLAGKQPGKDVAI 537 AAS + +Y + PV + I+ G + Sbjct: 230 LQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVEL----MDYIQALEDALGIEAKK 285 Query: 538 LYTGLRPGEKLHE----TLFYSDEDYRPTAHPKILEAGVREF 575 L+PG+ L Y + P ++ GV+ F Sbjct: 286 NMLPLQPGDVLETSADTKALYEVIGFTPETT---VKDGVKNF 324
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 117 bits (294), Expect = 5e-38 Identities = 31/89 (34%), Positives = 49/89 (55%), Gaps = 1/89 (1%) Query: 2 TKSELIEILARRQAHLKSDDVDLAVKSLLEMMGQALSDGDRIEIRGFGSFSLHYRPPRLG 61 K +LI +A L D AV ++ + L+ G+++++ GFG+F + R R G Sbjct: 3 NKQDLIAKVAE-ATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61 Query: 62 RNPKTGESVALPGKHVPHFKPGKELRERV 90 RNP+TGE + + VP FK GK L++ V Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDAV 90
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 27.1 bits (60), Expect = 0.022 Identities = 11/37 (29%), Positives = 18/37 (48%) Query: 12 AQAKAKLLDELQKLEEQEKTERASEASSAHATIVSLL 48 Q + LLD L ++ + + R S+ H I+S L Sbjct: 355 PQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFL 391
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 29.0 bits (65), Expect = 0.025 Identities = 15/74 (20%), Positives = 26/74 (35%), Gaps = 13/74 (17%) Query: 223 KQVSFFGAPLPALVAPDGDLGDTLGTWHLNAAWALLALVLLHIGAAL----------WHH 272 +Q LP + D + T+ W LLAL+ + + +H Sbjct: 200 EQFIHMKQALPLSTRVLMGMSDAVRTF---GPWMLLALLAGFMAFRVMLRQEKRRVSFHR 256 Query: 273 LVLRDGLLRRVLPG 286 +L L+ R+ G Sbjct: 257 RLLHLPLIGRIARG 270
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 49.8 bits (119), Expect = 2e-09 Identities = 33/164 (20%), Positives = 58/164 (35%), Gaps = 20/164 (12%) Query: 1 MMKTRIVVAADRTILVEGMVALLQKVPGIEVVGHAEDGLACLQIAAREQPDIVLVDVLLP 60 M I+VA D + + L + G +V + + A D+V+ DV++P Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRA-GYDVR-ITSNAATLWRWIAAGDGDLVVTDVVMP 58 Query: 61 GLNGIDLTRRLMQRSPNSRAICIAPSDACTQASAVFEAGAKAYLARTSRFAELLRAIQCV 120 N DL R+ + P+ + ++ + A E GA YL + EL+ I Sbjct: 59 DENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 121 IQDQTY-----------------ISPQMSRSLIAGLRRAAKADS 147 + + S M + + L R + D Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAM-QEIYRVLARLMQTDL 161
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 31.3 bits (71), Expect = 0.002 Identities = 11/40 (27%), Positives = 16/40 (40%), Gaps = 1/40 (2%) Query: 134 PLKNIETDFPPVFDRFYRSLALRTCSQCGHLHPAPERYAT 173 L + FP F+R Y ++ + GHL R A Sbjct: 119 SLADAMKCFPGSFERLYCAM-VAAGETSGHLDAVLNRLAD 157
>PilS_PF08805#PilS N terminal Length = 185 Score = 58.0 bits (140), Expect = 5e-12 Identities = 42/161 (26%), Positives = 69/161 (42%), Gaps = 16/161 (9%) Query: 56 GYTLVEVLLVLGVSSAMAAAGWLLFGPTSVAADVKQTQMDLSETANAIDRSLGIVGGYSG 115 G TL+EVLLV+GV +AA+ + L+ Q ++ + +SL G Y+ Sbjct: 27 GATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANM-KSLKFQGRYTD 85 Query: 116 --LSTSLVLSDGLAAQRLRQNDG--LRNAWGGSVSFWPNTVKRGNDSFLVETRDVPKAAC 171 +L L + + G +N WGGSV+ ++ SF V +VP+ C Sbjct: 86 SNYIKTLYAQGLLPSDMIADTTGASAKNPWGGSVTITTSS---DKYSFNVVEANVPQKNC 142 Query: 172 AKLIAAMAGDPAVADARVNGESVYLDEKYDPASAAVACERD 212 ++ A+ + A +++N S SAA C D Sbjct: 143 MAMVNALRS--SSAISKINNTST------STVSAATVCASD 175
>STREPKINASE#Streptococcus streptokinase protein signature. Length = 440 Score = 28.5 bits (63), Expect = 0.004 Identities = 16/45 (35%), Positives = 21/45 (46%) Query: 18 FWLSESAMPTREELATRLDALQEQLPKLSADEDADFDYLDFQARA 62 F AM + E A L A+QEQL D F+ +DF + A Sbjct: 89 FATDSGAMSHKLEKADLLKAIQEQLIANVHSNDDYFEVIDFASDA 133
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 34.4 bits (79), Expect = 2e-04 Identities = 17/70 (24%), Positives = 25/70 (35%), Gaps = 14/70 (20%) Query: 113 DFIRRRAIRHL-EKGRIAIFAAGTGNPFFTTDSG-------------AALRAIEIGADLL 158 + I+ L E+G I I + G G P D A E+ AD+ Sbjct: 172 GHVEAETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIF 231 Query: 159 LKATKVDGVY 168 + T V+G Sbjct: 232 MILTDVNGAA 241
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 29.0 bits (65), Expect = 0.022 Identities = 13/54 (24%), Positives = 22/54 (40%), Gaps = 1/54 (1%) Query: 193 GRPRGINSEGLKR-RGFDAERITAIKRAYRTLYVAGLPLADAKAQLAEQAESSE 245 + G L+R + + R TL A +PL +A +A+Q+E Sbjct: 49 QQKSGSTGLSLRRKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPH 102
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 83.3 bits (206), Expect = 1e-20 Identities = 31/136 (22%), Positives = 57/136 (41%), Gaps = 4/136 (2%) Query: 2 RLLVIEDNRNMVANLFDYFEARGHTLDAAPDGVTGLHLATTQQYDALILDWMMPRMDGPE 61 +LV +D+ + L G+ + + T D ++ D +MP + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VLRRLREQHQSELPVIMLTARDELPDKIAGFRAGADDYLTKPFALPE---LEVRIEALLA 118 +L R+++ + +LPV++++A++ I GA DYL KPF L E + R A Sbjct: 65 LLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 119 RAHGRRRGKLLQVADL 134 R + L Sbjct: 124 RRPSKLEDDSQDGMPL 139
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 28.8 bits (64), Expect = 0.003 Identities = 20/61 (32%), Positives = 26/61 (42%), Gaps = 6/61 (9%) Query: 36 QIFNQNMQQQISLSQQQAMNQVQMAAAAKCVAMIERTSECKNQQSIDQMVKDIEKLIKDM 95 + Q QQQ QQQA Q A AA V ++ I Q+ KD+ KL + Sbjct: 335 VMPPQAQQQQGQGQQQQAQATAQEAVAAAAVRLL------NGSDQIAQLYKDLVKLQRHA 388 Query: 96 G 96 G Sbjct: 389 G 389
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 52.9 bits (127), Expect = 1e-09 Identities = 83/340 (24%), Positives = 127/340 (37%), Gaps = 44/340 (12%) Query: 47 IQQTISVYLLAYGLMSIAHGP----LSDAWGRKRVILGGLALFVAGSIGCALSQDLPTLL 102 + + L Y LM A P LSD +GR+ V+L LA A + L L Sbjct: 41 VTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLY 100 Query: 103 AFRALQGLSAGVGMIVGRAVIRDLFHGPDAQRLMSQVSMIFGIAPAIAPIIGGWILLSGA 162 R + G++ G + G A I D+ G + R +S FG P++GG L+ G Sbjct: 101 IGRIVAGITGATGAVAG-AYIADITDGDERARHFGFMSACFGFGMVAGPVLGG--LMGGF 157 Query: 163 GWPLIFWFLVVFGLVLLIATLTWLPETHPVEARTPLQFKRLMQDYVRIGFNPRFQRLAAA 222 F+ + + LPE+H E R PL+ + L F A Sbjct: 158 SPHAPFFAAAALNGLNFLTGCFLLPESHKGE-RRPLRREALNP---LASFRWARGMTVVA 213 Query: 223 GSFNFAGIFLYIASAPVLIMQHLKLGEGDFAWLFIPTIGGMTLGSF----------LSGR 272 I + P + + GE F W T G++L +F ++G Sbjct: 214 ALMAVFFIMQLVGQVPAALW--VIFGEDRFHW--DATTIGISLAAFGILHSLAQAMITGP 269 Query: 273 MAGRMQPVRQIRIGFICCGVAALANLAYTFAVAQIALPWAVLPIFL----AGMGMALIFP 328 +A R+ R + +G I G + T W PI + G+GM + Sbjct: 270 VAARLGERRALMLGMIADGTGYILLAFATRG-------WMAFPIMVLLASGGIGMPALQA 322 Query: 329 ILALAVLDMYPQQRGLASSLQAFTQLMTNTVVAGVLSPLL 368 +L+ V + +Q L SL A T L ++ PLL Sbjct: 323 MLSRQVDE--ERQGQLQGSLAALTSL------TSIVGPLL 354
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 114 bits (287), Expect = 7e-30 Identities = 69/428 (16%), Positives = 142/428 (33%), Gaps = 51/428 (11%) Query: 65 RWCVGLLMTTVILLLVGFFRLGFARSE---TLYGTVVPAGGLIAVTTPQSGVVVQVGAAQ 121 R + + L++ F + E T G + +G + ++ +V ++ + Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKE 114 Query: 122 GQRVAAGQLLFVLSA-EHRDDRGRPTQRAAAVLAEQQRLAVEAMAQ-------------- 166 G+ V G +L L+A D + EQ R + + + Sbjct: 115 GESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEP 174 Query: 167 ------------LRAQGRVQQQAAARALAGLRDRLQQIDAELD----LLRHWQQLTQSIE 210 L + + Q L + AE + ++ L++ + Sbjct: 175 YFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEK 234 Query: 211 QR---YRTALTRGLVSQQFVDEKQADVLDQRAHTLELQRERMALADALAQAQAEVQQLPL 267 R + + L + +++ V E++ ++ + + + + A+ E Q + Sbjct: 235 SRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQ 294 Query: 268 ----SVHQQLAMAGAGLQE-DRRAAIEQAAASRWEVRAPRAGRVA-LRPLQRGQAVAQGQ 321 + +L + A + +RAP + +V L+ G V + Sbjct: 295 LFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354 Query: 322 RLADLLPTSMATEVVLYAPSRAAGLIGPGMPVQLRFDALPYQHYGQFAGQVVEIAA-APE 380 L ++P EV ++ G I G ++ +A PY YG G+V I A E Sbjct: 355 TLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIE 414 Query: 381 PPRVDSTSASEPLYRVRVRLAGDAALRAGRTAVLRPGMRVQGTLALEWRRFSQWAFEPLS 440 R ++ V + + + + L GM V + R + PL Sbjct: 415 DQR------LGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPLE 468 Query: 441 -SLHGTLR 447 S+ +LR Sbjct: 469 ESVTESLR 476
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 50.0 bits (119), Expect = 7e-10 Identities = 18/123 (14%), Positives = 41/123 (33%), Gaps = 4/123 (3%) Query: 6 SRARGRPRAFDAEQAVATAQRLFHASGYDALSVADLTAALGINPPSFYAAFGSKAGLYAR 65 ++ + + + A RLF G + S+ ++ A G+ + Y F K+ L++ Sbjct: 5 TKQEAQETR---QHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61 Query: 66 ILDR-YAQTGAIPLPQLLDADRPLADALADVLEHAARCYAADPAATGCLVLEGTRSNDAQ 124 I + + G + L L ++L H + + + + Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121 Query: 125 ARE 127 Sbjct: 122 EMA 124
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 75.9 bits (186), Expect = 1e-18 Identities = 60/251 (23%), Positives = 108/251 (43%), Gaps = 24/251 (9%) Query: 5 KNKSVLVLGGSRGIGAAIVRRFVAEGARVT-----FTYAGSAEAAQRLAGETGST--AVL 57 + K + G ++GIG A+ R ++GA + ++ + A + Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66 Query: 58 ADSADRDAVIATV-RRSGPLDVLVVNSGIALFGDALDQDPDA-VDRLFRINVHAPYHAAV 115 DSA D + A + R GP+D+LV +G+ G + D + F +N ++A+ Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPG-LIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 116 EAARQMPS--GGRIIVIGSVNGDRMPLPGMASYALSKSALQGLARGLARDFGPRGITINV 173 ++ M G I+ +GS N +P MA+YA SK+A + L + I N+ Sbjct: 126 SVSKYMMDRRSGSIVTVGS-NPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184 Query: 174 VQPGPIDTDA--------NPENGPMKDLMHSF---MAIKRHGRAEEVAGMVAWLAGPEAS 222 V PG +TD N +K + +F + +K+ + ++A V +L +A Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244 Query: 223 FVTGAMHTIDG 233 +T +DG Sbjct: 245 HITMHNLCVDG 255
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 82.6 bits (204), Expect = 1e-21 Identities = 29/116 (25%), Positives = 56/116 (48%), Gaps = 2/116 (1%) Query: 2 ARILIVDDSPSQLLGIQRIVEKLGHETITATDGAAGVEAAKESLPDLVLMDVVMPNLNGF 61 A IL+ DD + + + + + G++ ++ A DLV+ DVVMP+ N F Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 62 QATRTLKREPTTQHIPVILVTTKDQDTDRMWGMRQGARAYITKPFSEDELLEVMER 117 +K+ +PV++++ ++ + +GA Y+ KPF EL+ ++ R Sbjct: 64 DLLPRIKKA--RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 40.6 bits (95), Expect = 7e-07 Identities = 27/143 (18%), Positives = 59/143 (41%), Gaps = 2/143 (1%) Query: 33 TESYHADREKVIELLNTALATEYVCTLRYYRHYFMAKGMLADAVKGEFLEHAQQEQEHAH 92 TE+ ++ V LNT L+ ++ + +R ++ KG + +F E E Sbjct: 3 TENAKTNQTLVENSLNTQLSNWFLLYSKLHRFHWYVKGPHFFTLHEKFEELYDHAAETVD 62 Query: 93 KLAERIVQLGGEP-DLNPDTLTKRSHAEYKEGTDLRDMVKENLIAERIAIDSYREMIDFI 151 +AER++ +GG+P + S + T +MV+ + + + +I Sbjct: 63 TIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYKQISSESKFVIGLA 122 Query: 152 GD-KDTTTKRILESILAQEEEHA 173 + +D T + ++ + E+ Sbjct: 123 EENQDNATADLFVGLIEEVEKQV 145
>adhesinb#Adhesin B signature. Length = 310 Score = 28.3 bits (63), Expect = 0.005 Identities = 10/32 (31%), Positives = 16/32 (50%) Query: 1 MKNARIALVVLTMALGLTACGGKPSSDNAKEA 32 MK R +++L +GL AC + SS + Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSS 32
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 31.8 bits (72), Expect = 0.008 Identities = 26/124 (20%), Positives = 51/124 (41%), Gaps = 10/124 (8%) Query: 367 FGGFAGFGRM-DADFGNRNGSFKQDDTTLGGFFGWYTGPVWVNAQVSYGWLSYDVDREVQ 425 G G+ + D F N NG ++ G F G+ P +V ++ Y WL + Sbjct: 30 TGAKLGWSQYHDTGFINNNGPTHENQLGAGAFGGYQVNP-YVGFEMGYDWLG-------R 81 Query: 426 LGPATRVHSGSPDGSNLTAALNAGYSLGEGNLKYGPVAGLTWQK-IKLDGYTESNDSATA 484 + V +G+ + GY + + Y + G+ W+ K + Y +++D+ + Sbjct: 82 MPYKGSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGMVWRADTKSNVYGKNHDTGVS 141 Query: 485 LGYA 488 +A Sbjct: 142 PVFA 145
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.3 bits (71), Expect = 0.003 Identities = 26/162 (16%), Positives = 50/162 (30%), Gaps = 26/162 (16%) Query: 105 TKLEVLGDERTLYPDVVQTLKAAEQLVADGFEVMVYTSDDPILAKRLEEIGCVAVMPLAA 164 + V D+ + + Q L A G++V + ++ + G + V + Sbjct: 4 ATILVADDDAAIRTVLNQALSRA------GYDVRITSNAATLWRWIAAGDGDLVVTDVVM 57 Query: 165 PIGSGLGIQNKYNLLEII--ENAKVPIIVDAGVGTASDAAIAMELGCDGVLMNTAIAGAR 222 P + LL I +P++V + T A A E GA Sbjct: 58 PDENAFD------LLPRIKKARPDLPVLVMSAQNTFMTAIKASE------------KGAY 99 Query: 223 DPILMASAMRKAIEAGREAFLAGRIPRKRYASASSPVDGVIG 264 D + + + I A + + S ++G Sbjct: 100 DYLPKPFDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVG 141
>MECHCHANNEL#Bacterial mechano-sensitive ion channel signature. Length = 136 Score = 146 bits (370), Expect = 2e-48 Identities = 75/136 (55%), Positives = 97/136 (71%), Gaps = 7/136 (5%) Query: 1 MGMVSEFKQFAMRGNVIDLAVGVVIGAAFGKIVTALVEKIIMPPIGWAIGNVDFSRLAWV 60 M ++ EF++FAMRGNV+DLAVGV+IGAAFGKIV++LV IIMPP+G IG +DF + A Sbjct: 1 MSIIKEFREFAMRGNVVDLAVGVIIGAAFGKIVSSLVADIIMPPLGLLIGGIDFKQFAVT 60 Query: 61 LKPAGVDATGKEIPAVAIGYGDFINTVVQFLIIAFAIFLVVKLINRVTHRK--PDAPKGP 118 L+ A D IPAV + YG FI V FLI+AFAIF+ +KLIN++ +K P A P Sbjct: 61 LRDAQGD-----IPAVVMHYGVFIQNVFDFLIVAFAIFMAIKLINKLNRKKEEPAAAPAP 115 Query: 119 SEEVLLLREIRDALKN 134 ++E +LL EIRD LK Sbjct: 116 TKEEVLLTEIRDLLKE 131
>PHPHLIPASEA1#Bacterial phospholipase A1 protein signature. Length = 289 Score = 31.5 bits (71), Expect = 0.015 Identities = 18/60 (30%), Positives = 24/60 (40%), Gaps = 10/60 (16%) Query: 917 WNDDVGYLDASLSYDVNDHLTLYAQATNLTGESERRYAQWTNHYFDQNIFERRYYAGLRL 976 WN G + LSY + H+ LY Q + GES D N + R G+ L Sbjct: 236 WNTGYGGAELGLSYPITKHVRLYTQVYSGYGES----------LIDYNFNQTRVGVGVML 285
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.2 bits (68), Expect = 0.025 Identities = 18/121 (14%), Positives = 40/121 (33%), Gaps = 8/121 (6%) Query: 18 LIAAPAAAQSLRVQSPDARTQVEFTLRADG-VPSYRVL-YRNTLVLGDAPLGLDLGRGNK 75 + ++ + + D + L + + VL N V L + + + Sbjct: 223 INRYENLSRVEKSRLDDFSS-----LLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQ 277 Query: 76 LGRDMTLQSSTTELHDSRFTLPV-GKTRQARDHYRALRVQLTDTQHRKLGIELRAYDDGV 134 + ++ +L F + K RQ D+ L ++L + R+ +RA Sbjct: 278 IESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVK 337 Query: 135 A 135 Sbjct: 338 V 338
>SECA#SecA protein signature. Length = 901 Score = 32.9 bits (75), Expect = 2e-04 Identities = 10/16 (62%), Positives = 10/16 (62%) Query: 8 DPCPCGRPANYAQCCG 23 DPCPCG Y QC G Sbjct: 883 DPCPCGSGKKYKQCHG 898
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 34.3 bits (78), Expect = 0.004 Identities = 38/240 (15%), Positives = 74/240 (30%), Gaps = 32/240 (13%) Query: 1116 RLKLPERARGDEPAVADAVDAAPSIEAAGEPAGVQGAVSADGMAI---DGAAVPTASPAT 1172 L PE + ++ + +I+A +V ++ I D A VP +PAT Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQAD------VPSVPSNNEEIARVDEAPVPPPAPAT 1032 Query: 1173 --DETLSAAPETQTGQRTPAV-----ATANKQNR------------NAKTAKTASSTRAA 1213 + T + A ++ +T QNR N +T + A S Sbjct: 1033 PSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSET 1092 Query: 1214 AQSQAMQTKKSTLRTPALASKAASDKRASGASAVPSAAASRSSVGKTTRSSKTPGKPIAA 1273 ++Q +TK++ +K ++K + + ++ + Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152 Query: 1274 TNASSATSGKGATAS----AAVKTAAAKPRATAASTGQPVRGAGKTSSKRAATTASPAKT 1329 N S TA A ++ + T ++T + T P Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212
>INTIMIN#Intimin signature. Length = 939 Score = 27.7 bits (61), Expect = 0.027 Identities = 10/53 (18%), Positives = 16/53 (30%) Query: 98 TATATATATATATATATATATATATARVCISNTMLKAAKQQSSKAAKQQSSKA 150 T A T T+T + +ARV +KA + + Sbjct: 709 TEKTDTNGYAKVTLTSTTPGKSLVSARVSDVAVDVKAPEVEFFTTLTIDDGNI 761
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 31.7 bits (72), Expect = 0.008 Identities = 37/224 (16%), Positives = 74/224 (33%), Gaps = 44/224 (19%) Query: 48 AVLHERLQRQLDALRADELSRELPRTAQAYLAHWLAQGWLERRLPEGATEEEYELSRATT 107 A +H ++ ++S E+ + A ++ + ++ E+ A Sbjct: 19 AFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAAHL 78 Query: 108 QAI-------RFIAGLRESSSSATESRLSLVIQQLVQLAGQTEADPEL--RLAALRDERA 158 + + +A E L V V + + + + R A +RD Sbjct: 79 LVLDDPELVDGIKGKIENEQMNA-EYALKEVSDMFVSMFESMD-NEYMKERAADIRDVSK 136 Query: 159 RIDAEIERVASGRVAALDGKRALERARDLIHLSDELAEDFHRVRDDFEQLNRQFRERIID 218 R+ + V +G +A + + + ++++L D QLN+QF + Sbjct: 137 RVLGHLIGVETGSLATIA--------EETVIIAEDLTPS------DTAQLNKQFVKGFAT 182 Query: 219 DEGAR-------------------GDVLEQLFDGVDVIADSEAG 243 D G R +V E++ G VI D G Sbjct: 183 DIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEG 226
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 69.3 bits (169), Expect = 5e-17 Identities = 35/190 (18%), Positives = 69/190 (36%), Gaps = 13/190 (6%) Query: 7 DTQQKILATAEALIYQHGIHATGMDLLVKTSGVARKSIYRHFDNKDEVAAAALNARDVRW 66 +T+Q IL A L Q G+ +T + + K +GV R +IY HF +K ++ + + Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70 Query: 67 LAWFRQQCDK-----ADRPEARILRMFTVLKEWFQSEGYRGCAF--INTAGEVGDPDDPV 119 + K ++ + + F GE+ Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130 Query: 120 RKIARHHKQKLLDYTLELTGQLGITQPDALARQLLLLMEGAIT---VSRVMGDE--DAAD 174 R + ++ L+ + + D + R+ ++M G I+ + + + D Sbjct: 131 RNLCLESYDRIEQT-LKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKK 189 Query: 175 TARDIAQLLL 184 ARD +LL Sbjct: 190 EARDYVAILL 199
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 56.9 bits (137), Expect = 2e-12 Identities = 23/93 (24%), Positives = 44/93 (47%), Gaps = 3/93 (3%) Query: 6 PLRADAQRNRERLLAAAEQVFLERGAEA-SMEDVAKRAGVGIGTLYRRFPTRESLFAAAY 64 + +AQ R+ +L A ++F ++G + S+ ++AK AGV G +Y F + LF+ + Sbjct: 4 KTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIW 63 Query: 65 SGRFLSLAAASHARASSL--DALAALRAYLEDL 95 ++ + D L+ LR L + Sbjct: 64 ELSESNIGELELEYQAKFPGDPLSVLREILIHV 96
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 104 bits (261), Expect = 3e-29 Identities = 65/254 (25%), Positives = 111/254 (43%), Gaps = 12/254 (4%) Query: 2 GKRFGGKVVVVTGGTDGIGLVTAKAFSAEGAQVY---ITGRRQDRLDAAVAEIGGGAVGV 58 K GK+ +TG GIG A+ +++GA + + +++ +++ A Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62 Query: 59 QGDVGVPEDMDRLYACIQQEHGRLDVVFANAGVSESAALGEIDIAHLERLLATNIKGTVF 118 DV +D + A I++E G +D++ AGV + + E + N G Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122 Query: 119 TVQNALPLMAS--GGAVILAGSVAGSKGIGALSVYSATKAAIRSFARTWTSDLKRRGIRV 176 ++ M G+++ GS +++ Y+++KAA F + +L IR Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 177 NVMSPGMVHTPAMQTYLDANAGAE-------DAFKQMIPFGRLGDAEEIAEAVLFLASDA 229 N++SPG T + GAE + FK IP +L +IA+AVLFL S Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 230 SSFIAGHELFIDGG 243 + I H L +DGG Sbjct: 243 AGHITMHNLCVDGG 256
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 26.9 bits (59), Expect = 0.044 Identities = 18/59 (30%), Positives = 25/59 (42%) Query: 97 SEPGSERTAAGQSIPSQASELSGTWTNNGGDNLAPMVAHMQRLGTVSDAGAAGAGGTIT 155 S+PG RTA+G +I + G N L + G +SD G GT+T Sbjct: 56 SDPGGVRTASGTTIKVSGRQAQGILLENPAAELQFRNGSVTSSGQLSDDGIRRFLGTVT 114
>PF05616#Neisseria meningitidis TspB protein Length = 501 Score = 34.7 bits (79), Expect = 0.002 Identities = 27/95 (28%), Positives = 37/95 (38%), Gaps = 10/95 (10%) Query: 299 PTPEMSPLPSTAPHPDAFPLPPAGEGARRAGEGSPPTDFPSTAPDPDAFPLLPAGEGARR 358 P P+++P +A P+A PLP A P + P T P+P+ P L Sbjct: 311 PRPDLTP--GSAEAPNAQPLPEVSP-AENPANNPAPNENPGTRPNPEPDPDLNPDANPDT 367 Query: 359 AGEGSAPTDLPSTAPDPGVFPLPPAGEGARRAGEG 393 G+ P T PD P P G + EG Sbjct: 368 DGQ-------PGTRPDSPAVPDRPNGRHRKERKEG 395 Score = 30.9 bits (69), Expect = 0.029 Identities = 32/100 (32%), Positives = 37/100 (37%), Gaps = 15/100 (15%) Query: 290 QGLPPPCDTP----TPEMSPLPSTAP--HPDAFPLPPAGEGARRAGE--------GSPPT 335 Q +P P TP P PLP +P +P P P G R E +P T Sbjct: 308 QVIPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEPDPDLNPDANPDT 367 Query: 336 DF-PSTAPDPDAFPLLPAGEGARRAGEGSAPTDLPSTAPD 374 D P T PD A P P G + EG L PD Sbjct: 368 DGQPGTRPDSPAVPDRPNGRHRKERKEGEDGGLLCKFFPD 407
>INTIMIN#Intimin signature. Length = 939 Score = 41.2 bits (96), Expect = 4e-05 Identities = 71/400 (17%), Positives = 114/400 (28%), Gaps = 50/400 (12%) Query: 433 NVSNVTGATVSDGQGLGTIVNDDAQPALSIDDVSVNEGNSGTTTATFTVSLSAASGQTVT 492 N SN T++ G +V+ + D S + T T TV + + V Sbjct: 537 NSSNNVLLTITVLSN-GQVVDQVGVTDFTADKTSAKADGTEAITYTATVKKNGVAQANVP 595 Query: 493 VNYATADGTATAG-------SDYVARSGTLSFAPGVTAQGVAVTVNGDTAVEPNETFSVG 545 V++ GTA A S PG A T +A+ N V Sbjct: 596 VSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVV-SAKTAEMTSALNANAVIFVD 654 Query: 546 LSGASNATIARATGTGTILNDDAVVTISPTSLPAATAGTAYSQTLTASGGTPGYSFVI-- 603 + AS I A T + N +T + + + T T + G S Sbjct: 655 QTKASITEIK-ADKTTAVANGQDAITYTVKVMKGDKPVSNQEVTFTTTLGKLSNSTEKTD 713 Query: 604 SAGTLPAGMTLNAAGVLSGTPTASGSFNFTV---TATDSGVPTSGSRAYTLTVAGANVTL 660 + G +T G + S V T + G L Sbjct: 714 TNGYAKVTLTSTTPGKSLVSARVSDV-AVDVKAPEVEFFTTLTIDDGNIEIVGTGVKGKL 772 Query: 661 PATTLPAGTAGQAYSSAITPATGGIAPYSYALIAGALPAGITLNSSSGTLTGTTTSVGSF 720 P L G A+GG Y++ A+ A + +S TL T+ Sbjct: 773 PTVWLQYGQVNL-------KASGGNGKYTWRSANPAI-ASVDASSGQVTLKEKGTTT--- 821 Query: 721 NFSVTATDSTSGTPSQGTRGYTLNIAAPTIALAPATVPTATRGTAYSQTLTAS------- 773 ++ S + T + YT+ I + T + Sbjct: 822 ---ISVISSDNQTAT-----YTIATPNSLIVPNMSKRVTYNDAVNTCKNFGGKLPSSQNE 873 Query: 774 --------GGTAAYTYAITSGALPAGITLASNGTLSGTAT 805 G Y Y +S + + + + SG A+ Sbjct: 874 LENVFKAWGAANKYEYYKSSQTIISWVQQTAQDAKSGVAS 913
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 42.6 bits (100), Expect = 1e-07 Identities = 29/92 (31%), Positives = 46/92 (50%), Gaps = 3/92 (3%) Query: 74 YRQQFADADFLIVQANGLSIGRLYLHRAAAHHTLV-DISLLPDWRGKGIGSHLIAHAQAC 132 Y ++ A FL N IGR+ + + L+ DI++ D+R KG+G+ L+ A Sbjct: 59 YVEEEGKAAFLYYLENNC-IGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEW 117 Query: 133 ARDAG-CVLSLHVLHANPAARRLYARHEFVAG 163 A++ C L L N +A YA+H F+ G Sbjct: 118 AKENHFCGLMLETQDINISACHFYAKHHFIIG 149
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 351 bits (901), Expect = e-113 Identities = 157/684 (22%), Positives = 277/684 (40%), Gaps = 106/684 (15%) Query: 91 ASSGSATFNFEGESVQAVVKAILGDMLGQNYVIAPGVQGTVTLATPNPVSPAQALNLLEM 150 A++ + +F+G +Q + + + L + +I P V+GT+T+ + + ++ Q Sbjct: 25 AAAEEFSASFKGTDIQEFINTVSKN-LNKTVIIDPSVRGTITVRSYDMLNEEQYYQFFLS 83 Query: 151 VLG-WNNARMVFSGGRYNIVPA-DQALAGTVAPSTASPSAARGFEVRVVPLKFISASEMK 208 VL + A + + G +V + D A S A+P RVVPL ++A ++ Sbjct: 84 VLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVVPLTNVAARDLA 143 Query: 209 KVLEPYARPNAIVGTD---PARNVITLGGTRAELENYLRTVQIFDVDWLSGMSVGVFPIQ 265 +L NA VG+ NV+ + G A ++ L V+ VD SV P+ Sbjct: 144 PLLRQL-NDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVE--RVDNAGDRSVVTVPLS 200 Query: 266 SGKAEKVSADLEKVFGEQSKT--PSAGMFRFMPLENANAVLVI---TPQPRYLDQIQQWL 320 A V + ++ + SK+ P + + + E NAVLV + R + I+Q L Sbjct: 201 WASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQ-L 259 Query: 321 DRIDSAGGGVRLFSYELKYIKAKDLADRLSEVFGGGRGNSGGSPSLVPGGVVNMLGNNSG 380 DR + G ++ LKY KA DL + L+ + + Sbjct: 260 DRQQATQGNTKVIY--LKYAKASDLVEVLTGISSTMQSEKQA-------------AKPVA 304 Query: 381 SADRDESLGSSSGATGGSIGGASDGSSQSGTSGSFGGSNGSGMLQLQPSTNQNGSV---T 437 + D++ + G + ++ P + Sbjct: 305 ALDKNIII-------------------------KAHGQTNALIVTAAPDVMNDLERVIAQ 339 Query: 438 LDVEGGKVGVSAVAETNTLIVRATAQAWSSIRDVIEKLDVMPMQVHIEAQIAEVTLTGDL 497 LD+ +V V A+ + E D + + I+ +T Sbjct: 340 LDIRRPQVLVEAI--------------------IAEVQDADGLNLGIQWANKNAGMTQFT 379 Query: 498 QYGVNWYFENAVTNPFNSDG---SGGPALPSAAGRRIWGDISGSITNNGVAWTFLGKNAA 554 G+ A N +N DG S + S+ G G N A Sbjct: 380 NSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQG--------------NWA 425 Query: 555 AIISALDQVTNLRLLQTPSVFVRNNAEATLNVGSRIPINSTSINTGLGTDASYSSVQYID 614 +++AL T +L TPS+ +N EAT NVG +P+ + S T D +++V+ Sbjct: 426 MLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTT--SGDNIFNTVERKT 483 Query: 615 TGVILKVRPRVTKDGMVFLDIVQEVSTPGARPAACTAATATTINSAACNVDINTRRVKTE 674 G+ LKV+P++ + V L+I QEVS+ A A + S+ NTR V Sbjct: 484 VGIKLKVKPQINEGDSVLLEIEQEVSSV---------ADAASSTSSDLGATFNTRTVNNA 534 Query: 675 AAVQNGDTIMLAGLIDDNTSDGSNGIPFLSKLPVVGALFGRKTQNSSRREVIVLITPSIV 734 V +G+T+++ GL+D + SD ++ +P L +PV+GALF ++ S+R +++ I P+++ Sbjct: 535 VLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVI 594 Query: 735 RNPQEARDLTDEYGAKFNAMKPLS 758 R+ E R + FN + Sbjct: 595 RDRDEYRQASSGQYTAFNDAQSKQ 618
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 32.3 bits (73), Expect = 0.002 Identities = 25/134 (18%), Positives = 40/134 (29%), Gaps = 28/134 (20%) Query: 160 NGQGGQPPTANAAARGAATGAQPVPPP---------DAAALVPPQPPQPQPV-------A 203 N Q P + A PVPPP + A Q + Sbjct: 1002 NIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATET 1061 Query: 204 PGQQQQPGGQAPPTVP--PQRSDGAQEAPRPSDDQMRAIRE----------RIEARRRQL 251 Q ++ +A V Q ++ AQ + Q +E ++E + Q Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQE 1121 Query: 252 QQQRQSGSTPGQTQ 265 + S +P Q Q Sbjct: 1122 VPKVTSQVSPKQEQ 1135
>PilS_PF08805#PilS N terminal Length = 185 Score = 34.9 bits (80), Expect = 4e-05 Identities = 8/55 (14%), Positives = 24/55 (43%), Gaps = 4/55 (7%) Query: 1 MRHQRGYTLIEVIVAFALLALALSLLLGSLSGAARQVRAADESTRATLHAQSLLA 55 +G TL+EV++ ++ + + S ++ +S+ + +++A Sbjct: 22 KEQDKGATLMEVLLVVGVIVVLAASAYKLYSMV----QSNIQSSNEQNNVLTVIA 72
>PilS_PF08805#PilS N terminal Length = 185 Score = 34.9 bits (80), Expect = 7e-05 Identities = 12/61 (19%), Positives = 26/61 (42%), Gaps = 3/61 (4%) Query: 23 RGTSLLEMLLVIALIAIAGVLAAAALNG---GIDGMRLRTAGKAIAAQLRYTRTQAIATG 79 +G +L+E+LLV+ +I + A + I + + A ++ + Q T Sbjct: 26 KGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANMKSLKFQGRYTD 85 Query: 80 T 80 + Sbjct: 86 S 86
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 136 bits (345), Expect = 2e-44 Identities = 40/132 (30%), Positives = 60/132 (45%), Gaps = 18/132 (13%) Query: 15 QAGMSLLEIIIVIVLIGAVLTLVGSRVLGGADRGKANLAKSQIQTLAGKIENFQLDTGKL 74 Q G +LLEI++VIV+IG + +LV ++G ++ A S I L ++ ++LD Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66 Query: 75 PSKLDDLVTQPGDSSGWLGPYAKPAELN------------DPWGHAIEYRAPGDGQPFDL 122 P+ T G S P P N DPWG+ PG+ +DL Sbjct: 67 PT------TNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120 Query: 123 ISLGKDGKPGGS 134 +S G DG+ G Sbjct: 121 LSAGPDGEMGTE 132
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 430 bits (1107), Expect = e-152 Identities = 133/411 (32%), Positives = 213/411 (51%), Gaps = 12/411 (2%) Query: 1 MPLYRYKALDAHGEMLDGQMEAASDAEVALRLQEQGHLPV---ETRLATGENDSPSLRML 57 M Y Y+ALDA G+ G EA S + L+E+G +P+ E R ++ S L L Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLS-L 59 Query: 58 LREKPFDNAALVQFTQQLATLIGAGQPLDRALSILMDLPEDEKSRRVIGDVRDTVRGGAP 117 R+ + L T+QLATL+ A PL+ AL + E +++ VR V G Sbjct: 60 RRKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHS 119 Query: 118 LSSALERQHGLFSKLYINMVRAGEAGGSMQDTLQRLADYLERSRALRGKVINALIYPAIL 177 L+ A++ G F +LY MV AGE G + L RLADY E+ + +R ++ A+IYP +L Sbjct: 120 LADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVL 179 Query: 178 LAVVGCALLFLLGYVVPQFAQMYESLDVALPWFTQAVLSVGLLVRDW--WIVLVVVPGVL 235 V + LL VVP+ + + + ALP T+ ++ + VR + W++L ++ G + Sbjct: 180 TVVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFM 239 Query: 236 G--LWLDRKRRNAAFRASLDQWLLRQKVVGSLIARLETARLTRTLGTLLRNGVPLLAAIG 293 + L +++R +F + LL ++G + L TAR RTL L + VPLL A+ Sbjct: 240 AFRVMLRQEKRRVSF----HRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMR 295 Query: 294 IARNVMSNLALVEDVANAADDVKNGHGLSMSLARGKRFPRLALQMIQVGEESGALDTMLL 353 I+ +VMSN ++ A D V+ G L +L + FP + MI GE SG LD+ML Sbjct: 296 ISGDVMSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLE 355 Query: 354 KTADTFELETAQAIDRALAALVPFITLVLASVVGLVIISVLVPLYDLTNAI 404 + AD + E + + AL P + + +A+VV +++++L P+ L + Sbjct: 356 RAADNQDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 205 bits (524), Expect = 4e-63 Identities = 104/359 (28%), Positives = 148/359 (41%), Gaps = 69/359 (19%) Query: 156 PQLVPNDPFYAQYQWHLSNPNGGINAPGAWDLSQGAGVVVAVLDTGILPDHPDFAGNLLQ 215 Q++ + + + I AP W+ ++G GV VAVLDTG DHPD ++ Sbjct: 10 YQVIKQEQQVNEIPRGV----EMIQAPAVWNQTRGRGVKVAVLDTGCDADHPDLKARIIG 65 Query: 216 GYDFITDAEVSRRPTDARVPGALDYGDWEEADNVCYAGSQAQESSWHGTHVSGTVAEATN 275 G +F D E + + HGTHV+GT+A AT Sbjct: 66 GRNFTDDDEGDPEIFK--------------------------DYNGHGTHVAGTIA-ATE 98 Query: 276 NGVGMAGVAPKATILPVRVLGRCG-GYTSDIADAIVWASGGSVDGVPTNTNPAEVINMSL 334 N G+ GVAP+A +L ++VL + G G I I +A VD +I+MSL Sbjct: 99 NENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVD----------IISMSL 148 Query: 335 GGGEPCDSATQLAINGAVSRGTTVVVAAGNSSEDASN----HSPASCNNTITVGATRITG 390 GG E A+ AV+ V+ AAGN + P N I+VGA Sbjct: 149 GGPEDVP-ELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNEVISVGAINFDR 207 Query: 391 GIAYYSNYGSKVDLSGPGGGGSVDGNPGGYVWQAGYTGATTPTSGSYTYMGLGGTSMASP 450 + +SN ++VDL PG T Y GTSMA+P Sbjct: 208 HASEFSNSNNEVDLVAPGED-------------------ILSTVPGGKYATFSGTSMATP 248 Query: 451 HVAGVVALVQSAAIGLGEGPLTPAAVEALLKQTSRPFPVTPPASTPIGSGIVDAKAALE 509 HVAG +AL++ A E LT + A L + + P +P G+G++ A E Sbjct: 249 HVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNSPKME---GNGLLYLTAVEE 304
>OMADHESIN#Yersinia outer membrane adhesin signature. Length = 455 Score = 52.6 bits (125), Expect = 1e-08 Identities = 55/154 (35%), Positives = 78/154 (50%), Gaps = 21/154 (13%) Query: 71 GRGAAAPASKATAIGANSHASATGAVATGANSSASGVNSSAIGRQTNAIGENAVAIGYNS 130 G A+A + AIGA + A+ AVA GA S A+GVNS AIG + A+G++AV G S Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121 Query: 131 FVRQSG----------ENGVALGANAGVTGANSVALGAGSRTHEDDVVSVGSGNGRGG-- 178 ++ G + GVA+G N+ NSVA+G S + S+ G+ Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181 Query: 179 ---------PATRRITNVTAGVNATDAVNVAQLR 203 R++T++ AG TDAVNVAQL+ Sbjct: 182 ENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLK 215 Score = 52.2 bits (124), Expect = 2e-08 Identities = 64/200 (32%), Positives = 96/200 (48%), Gaps = 13/200 (6%) Query: 2133 AAAVGSITPAATSTAVGTAAVANHVTGTAIGGSAYAHGPNDTAIGSNARVNADGSTAVGA 2192 A + SI AT+ A AAVA A G ++ A GP A+G +A STA Sbjct: 67 AKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKD 126 Query: 2193 NTQIAAVATNA---VAMGEGAQVTAASGTAIGQGARATAQG--AVALGQGSVADRANTVS 2247 I A A+ + VA+G ++ A + AIG + A ++A+G S DR N+VS Sbjct: 127 GVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVS 186 Query: 2248 VGSVGGERQVANVAAGTRATDAVNKGQLDSGVAAANSYTDSRYNAMADSFESYQGDIEDR 2307 +G RQ+ ++AAGT+ TDAVN QL + T+ R + + +Y + Sbjct: 187 IGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYADN---- 242 Query: 2308 LNKGQLDSGVAAANSYTDSR 2327 + S + AN+YTDS+ Sbjct: 243 ----KSSSVLGIANNYTDSK 258 Score = 51.8 bits (123), Expect = 3e-08 Identities = 60/172 (34%), Positives = 88/172 (51%), Gaps = 19/172 (11%) Query: 342 GVGAYAAGTQSSAFGAVANAAGDYATAIGTQTSASGTSSTAVGGPVDYIPGLGFFVQTQA 401 G+ A A G S A GA A AA A A+G + A+G +S A+G ++A Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGP------------LSKA 109 Query: 402 SGEASTALGAGATASGTYTTAVGTLSEASGTEATAVGYFAYAPGEGATAVGPESWASGEL 461 G+++ GA +TA A+G + S T AVG+ + A + + A+G S + Sbjct: 110 LGDSAVTYGAASTAQKD-GVAIGARASTSDT-GVAVGFNSKADAKNSVAIGHSSHVAANH 167 Query: 462 STALGYYSTARGANSVATRANTVSVGADGAERQITNVAAGTEGTDAVNLDQL 513 YS A G S R N+VS+G + RQ+T++AAGT+ TDAVN+ QL Sbjct: 168 G-----YSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQL 214 Score = 51.1 bits (121), Expect = 4e-08 Identities = 65/209 (31%), Positives = 101/209 (48%), Gaps = 8/209 (3%) Query: 1276 GGYSSASGFNSTALGNFSTASGSNTVAVGGDATATGAYSIAAGQGSVASGYNSVSVGGAL 1335 G +SA G +S A+G + A+ VAVG + ATG S+A G S A G ++V+ G A Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121 Query: 1336 ------LGLLPTEASGDYSTAVGGAAWAPGLNSTALGNFAGSTGEG--SVALGAGSVADR 1387 + + ++ D AVG + A NS A+G+ + S+A+G S DR Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181 Query: 1388 DFAVSVGSAGNERQITNVAAGTQGTDAVNLDQLNAVAEAGAATSKYFQASGSADSDAGAY 1447 + +VS+G RQ+T++AAGT+ TDAVN+ QL E + A A+++A A Sbjct: 182 ENSVSIGHESLNRQLTHLAAGTKDTDAVNVAQLKKEIEKTQENTNKRSAELLANANAYAD 241 Query: 1448 VDGDNALAAGEGANATGTGTTALGAGAQA 1476 + L + + T A +A Sbjct: 242 NKSSSVLGIANNYTDSKSAETLENARKEA 270 Score = 48.7 bits (115), Expect = 2e-07 Identities = 60/182 (32%), Positives = 87/182 (47%), Gaps = 4/182 (2%) Query: 1125 ATADGDYSSAFGSSSQATAIGAVAIGSGASATAQYANAAGYNAAASGFGSVSNGAFSQAS 1184 A AD ++ Q + A+G A G NA+A G S++ GA ++A+ Sbjct: 23 AFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAA 82 Query: 1185 GDYAVAVGGESEAAGAQSTALGAAAGAYGDGSLAVGALSEAQGSESTAMGYFASASGESA 1244 AVAVG S A G S A+G + A GD ++ GA S AQ + A+G AS S ++ Sbjct: 83 KGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQ-KDGVAIGARASTS-DTG 140 Query: 1245 TAVGAESVADGTSAAAFGFGAEATSN--YSTALGGYSSASGFNSTALGNFSTASGSNTVA 1302 AVG S AD ++ A G + +N YS A+G S NS ++G+ S +A Sbjct: 141 VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLA 200 Query: 1303 VG 1304 G Sbjct: 201 AG 202 Score = 47.6 bits (112), Expect = 5e-07 Identities = 61/197 (30%), Positives = 90/197 (45%), Gaps = 46/197 (23%) Query: 895 GGYASASGFFATAVGNNSRAVDYYATALGGDSMASGYFSTAVGGSSVASGRGATAMGVDS 954 G ASA G + A+G + A A A+G S+A+G S A+G S A G A G S Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121 Query: 955 AARSDRDTAVGTESVADGGDSTALGANARADNYYSVALGTYALATGTSATSIGGQSYAPG 1014 A+ D VA+G A + T Sbjct: 122 TAQKD-----------------------------GVAIGARASTSDT------------- 139 Query: 1015 TESVALGWQSNASGEQSISLGSGAYTPADN--SVALGAGSLADRANTVSVGAAGTERQIA 1072 VA+G+ S A + S+++G ++ A++ S+A+G S DR N+VS+G RQ+ Sbjct: 140 --GVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLT 197 Query: 1073 NVAAGTEGTDAVNLDQL 1089 ++AAGT+ TDAVN+ QL Sbjct: 198 HLAAGTKDTDAVNVAQL 214 Score = 46.8 bits (110), Expect = 9e-07 Identities = 66/250 (26%), Positives = 106/250 (42%), Gaps = 23/250 (9%) Query: 1631 GFIPARASGTGAAAFGAGAWATADYTTAIGWNSYADGVNASALGQSAAALADNTLALGGG 1690 G + A A G + A GA A A A+G S A GVN+ A+G + AL D+ + G Sbjct: 61 GGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAA 120 Query: 1691 SRADAVGASAVGVDASATGINSTGVGRQVNAIGENAVSVGYNSFVRQSAVNGVALGANAG 1750 S A G A+G AS + + V+VG+NS + ++ Sbjct: 121 STAQKDGV-AIGARASTS---------------DTGVAVGFNSKADAKNSVAIGHSSHVA 164 Query: 1751 ATGADSVALGSGSRTYEADTVSIGSGNGRGGPATRRIVNVSAGQAATDAVNKGQLDALAA 1810 A S+A+G S+T ++VSIG + R++ +++AG TDAVN QL Sbjct: 165 ANHGYSIAIGDRSKTDRENSVSIGHES-----LNRQLTHLAAGTKDTDAVNVAQLKKEIE 219 Query: 1811 DVQTTSGMLKTTGDGVASATGDRATAA--GAGATASGARSVAVASGSRASATGASAMGVD 1868 Q + A+A D +++ G + ++S +R A S ++ Sbjct: 220 KTQENTNKRSAELLANANAYADNKSSSVLGIANNYTDSKSAETLENARKEAFAQSKDVLN 279 Query: 1869 SSASGVNSTA 1878 + + NS A Sbjct: 280 MAKAHSNSVA 289 Score = 46.0 bits (108), Expect = 1e-06 Identities = 50/144 (34%), Positives = 73/144 (50%), Gaps = 3/144 (2%) Query: 826 GANAAAADTDSIAVGTYANAYGPRAISLGGQSRATGDDSIALGWGAQAEGEQGIALGAGG 885 G NA+A SIA+G A A A+++G S ATG +S+A+G ++A G+ + GA Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121 Query: 886 QADAYSTAIGGYASASGFFATAVGNNSRAVDYYATALGGDS--MASGYFSTAVGGSSVAS 943 A AIG AS S AVG NS+A + A+G S A+ +S A+G S Sbjct: 122 TAQKDGVAIGARASTSD-TGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTD 180 Query: 944 GRGATAMGVDSAARSDRDTAVGTE 967 + ++G +S R A GT+ Sbjct: 181 RENSVSIGHESLNRQLTHLAAGTK 204 Score = 44.9 bits (105), Expect = 3e-06 Identities = 56/170 (32%), Positives = 78/170 (45%), Gaps = 4/170 (2%) Query: 607 ASGDASTAVGSASQATANGATALGYESIANGADSTALGVGSVAFGDTSTAVGGASVAFGT 666 A +A Q + N ALG E A G+ + A G S A+G + A Sbjct: 25 ADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKG 84 Query: 667 DSAAFGANAAAGGTASTAIGANSSAFGERTVALGGASNASGDDSIALGASSQASALGTTA 726 + A GA + A G S AIG S A G+ V G AS A D +A+GA + S G A Sbjct: 85 AAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQ-KDGVAIGARASTSDTG-VA 142 Query: 727 VGSNANASIANATAVGFNS--SAGDDYATALGGDSNASGYFSTAVGGTSI 774 VG N+ A N+ A+G +S +A Y+ A+G S S ++G S+ Sbjct: 143 VGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESL 192 Score = 42.6 bits (99), Expect = 2e-05 Identities = 39/141 (27%), Positives = 71/141 (50%) Query: 1178 GAFSQASGDYAVAVGGESEAAGAQSTALGAAAGAYGDGSLAVGALSEAQGSESTAMGYFA 1237 G + A G +++A+G +EAA + A+GA + A G S+A+G LS+A G + G + Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121 Query: 1238 SASGESATAVGAESVADGTSAAAFGFGAEATSNYSTALGGYSSASGFNSTALGNFSTASG 1297 +A + S +D A F A+A ++ + + +A+ S A+G+ S Sbjct: 122 TAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDR 181 Query: 1298 SNTVAVGGDATATGAYSIAAG 1318 N+V++G ++ +AAG Sbjct: 182 ENSVSIGHESLNRQLTHLAAG 202 Score = 41.0 bits (95), Expect = 5e-05 Identities = 45/139 (32%), Positives = 78/139 (56%), Gaps = 11/139 (7%) Query: 1566 AAFGGYSESTGRLSSALGYGAVASSDYSTAVGAVALASGASAVAVGEFSEAIGDESVAVG 1625 A G + + G S A+G A A+ + AVGA ++A+G ++VA+G S+A+GD +V G Sbjct: 59 GAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG 118 Query: 1626 GSTFFGFIPARASGTGAAAFGAGAWATADYTTAIGWNSYADGVNASALGQSAAALADN-- 1683 ++ + A GA A +T+D A+G+NS AD N+ A+G S+ A++ Sbjct: 119 AAS--------TAQKDGVAIGARA-STSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGY 169 Query: 1684 TLALGGGSRADAVGASAVG 1702 ++A+G S+ D + ++G Sbjct: 170 SIAIGDRSKTDRENSVSIG 188 Score = 41.0 bits (95), Expect = 6e-05 Identities = 42/129 (32%), Positives = 67/129 (51%), Gaps = 4/129 (3%) Query: 756 GGDSNASGYFSTAVGGTSIANGRGATAIGYETIGNGTASTALGFASVAWGDGGTAIGTES 815 G +++A G S A+G T+ A A A+G +I G S A+G S A GD G S Sbjct: 62 GLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAAS 121 Query: 816 LAYGDNSTAVGANAAAADTDSIAVGTYANAYGPRAISLGGQSRATGDD--SIALGWGAQA 873 A D A+GA A+ +DT +AVG + A ++++G S + SIA+G ++ Sbjct: 122 TAQKD-GVAIGARASTSDT-GVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKT 179 Query: 874 EGEQGIALG 882 + E +++G Sbjct: 180 DRENSVSIG 188 Score = 40.7 bits (94), Expect = 7e-05 Identities = 48/146 (32%), Positives = 72/146 (49%), Gaps = 10/146 (6%) Query: 1827 ASATGDRATAAGAGATASGARSVAVASGSRASATGASAMGVDSSASGVNSTAMGRQTNSI 1886 A A A A GAG+ A+G SVA+ S+A A G S+A R + S Sbjct: 79 AEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARASTS- 137 Query: 1887 GENGVALGYNSFVRESGSNAVALGANAGASGADSVALGSGSRTYEANTVSVGSGNGRGGP 1946 + GVA+G+NS S A+ ++ A+ S+A+G S+T N+VS+G + Sbjct: 138 -DTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHES----- 191 Query: 1947 ATRRIVNVGAGTIASASTDAINGGQL 1972 R++ ++ AGT TDA+N QL Sbjct: 192 LNRQLTHLAAGT---KDTDAVNVAQL 214 Score = 40.7 bits (94), Expect = 7e-05 Identities = 48/149 (32%), Positives = 71/149 (47%), Gaps = 11/149 (7%) Query: 544 AAGSNALADSDYSTALGSSSAASAQGATAVGSGANATTDNATAVGFNSTAVAQNTTALGG 603 A G NA A +S A+G+++ A+ A AVG+G+ AT N+ A+G S A+ + G Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119 Query: 604 NSSASGDASTAVGSASQATANGATALGYESIANGADSTALGVG---------SVAFGDTS 654 S+A D AS T++ A+G+ S A+ +S A+G S+A GD S Sbjct: 120 ASTAQKDGVAIGARAS--TSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRS 177 Query: 655 TAVGGASVAFGTDSAAFGANAAAGGTAST 683 SV+ G +S A GT T Sbjct: 178 KTDRENSVSIGHESLNRQLTHLAAGTKDT 206 Score = 40.7 bits (94), Expect = 8e-05 Identities = 37/103 (35%), Positives = 59/103 (57%), Gaps = 4/103 (3%) Query: 1836 AAGAGATASGARSVAVASGSRASATGASAMGVDSSASGVNSTAMGRQTNSIGENGVALGY 1895 A G A+A G S+A+ + + A+ A A+G S A+GVNS A+G + ++G++ V G Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119 Query: 1896 NSFVRESGSNAVALGANAGASGADSVALGSGSRTYEANTVSVG 1938 S ++ G VA+GA A S VA+G S+ N+V++G Sbjct: 120 ASTAQKDG---VAIGARASTSDT-GVAVGFNSKADAKNSVAIG 158 Score = 37.6 bits (86), Expect = 7e-04 Identities = 53/182 (29%), Positives = 83/182 (45%), Gaps = 18/182 (9%) Query: 549 ALADSDYSTALGSSSAASAQGATAVGSGANATTDNATAVGFNSTAVAQNTTALGGNSSAS 608 A AD ++ S A+G A G N++A ++ A+G + A+ Sbjct: 23 AFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAA 82 Query: 609 GDASTAVGSASQATANGATALGYESIANGADSTALGVGSVAFGDTSTAVGGASVAFGTDS 668 A+ AVG+ SIA G +S A+G S A GD++ G AS A D Sbjct: 83 KGAAVAVGAG--------------SIATGVNSVAIGPLSKALGDSAVTYGAASTA-QKDG 127 Query: 669 AAFGANAAAGGTASTAIGANSSAFGERTVALGGASNASGDD--SIALGASSQASALGTTA 726 A GA A+ T A+G NS A + +VA+G +S+ + + SIA+G S+ + + Sbjct: 128 VAIGARASTSDTG-VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVS 186 Query: 727 VG 728 +G Sbjct: 187 IG 188 Score = 37.2 bits (85), Expect = 0.001 Identities = 36/130 (27%), Positives = 64/130 (49%) Query: 1455 AAGEGANATGTGTTALGAGAQAVVDNATAVGVGALAGGTGAAALGSNAQAVGENSSAVGS 1514 A G A+A G + A+GA A+A A AVG G++A G + A+G ++A+G+++ G+ Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGA 119 Query: 1515 NALASDIGATANGAGAQAISTYTTALGSEAVASDNQAIAAGFRSTASSVGSAAFGGYSES 1574 + A G + + + S+A A ++ AI A+ S A G S++ Sbjct: 120 ASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKT 179 Query: 1575 TGRLSSALGY 1584 S ++G+ Sbjct: 180 DRENSVSIGH 189 Score = 36.4 bits (83), Expect = 0.002 Identities = 52/180 (28%), Positives = 78/180 (43%), Gaps = 2/180 (1%) Query: 705 ASGDDSIALGASSQASALGTTAVGSNANASIANATAVGFNSSAGDDYATALGGDSNASGY 764 A D I + Q S A+G A G N+SA ++ A+G + A+ Sbjct: 25 ADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAAKG 84 Query: 765 FSTAVGGTSIANGRGATAIGYETIGNGTASTALGFASVAWGDGGTAIGTESLAYGDNSTA 824 + AVG SIA G + AIG + G ++ G AS A DG AIG + D A Sbjct: 85 AAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDG-VAIGARAST-SDTGVA 142 Query: 825 VGANAAAADTDSIAVGTYANAYGPRAISLGGQSRATGDDSIALGWGAQAEGEQGIALGAG 884 VG N+ A +S+A+G ++ S+ R+ D ++ G ++ Q L AG Sbjct: 143 VGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQLTHLAAG 202 Score = 35.6 bits (81), Expect = 0.002 Identities = 45/142 (31%), Positives = 73/142 (51%), Gaps = 4/142 (2%) Query: 1113 AQGEDATAAGSNATADGDYSSAFGSSSQATAIGAVAIGSGASATAQYANAAGYNAAASGF 1172 + A G NA+A G +S A G++++A AVA+G+G+ AT + A G + A G Sbjct: 53 VRPPVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGD 112 Query: 1173 GSVSNGAFSQASGDYAVAVGGESEAAGAQSTALGAAAGAYGDGSLAVGALSE--AQGSES 1230 +V+ GA S A D VA+G + + A+G + A S+A+G S A S Sbjct: 113 SAVTYGAASTAQKD-GVAIGARASTSDT-GVAVGFNSKADAKNSVAIGHSSHVAANHGYS 170 Query: 1231 TAMGYFASASGESATAVGAESV 1252 A+G + E++ ++G ES+ Sbjct: 171 IAIGDRSKTDRENSVSIGHESL 192 Score = 32.9 bits (74), Expect = 0.016 Identities = 44/149 (29%), Positives = 68/149 (45%), Gaps = 20/149 (13%) Query: 236 AAGDGANATGTATTALGTGANAVANNATAVGANALASGQNSAAFGHNAQANGPASVAVGG 295 A G A+A G + A+G A A A AVGA ++A+G NS A GP S A+G Sbjct: 60 AGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAI-------GPLSKALGD 112 Query: 296 AAVDEDGEPLVTNGGVPVTTGATSAGVGGTAVGASANADGFAASSFGVGAYAAGTQSSAF 355 +AV GV + A+++ G AVG ++ AD + + G ++ A Sbjct: 113 SAVTYGAASTAQKDGVAIGARASTSDT-GVAVGFNSKADAKNSVAIGHSSHVA------- 164 Query: 356 GAVANAAGDYATAIGTQTSASGTSSTAVG 384 A Y+ AIG ++ +S ++G Sbjct: 165 -----ANHGYSIAIGDRSKTDRENSVSIG 188 Score = 32.6 bits (73), Expect = 0.021 Identities = 44/133 (33%), Positives = 65/133 (48%), Gaps = 4/133 (3%) Query: 1105 GTGTGTADAQGEDATAAGSNATADGDYSSAFGSSSQATAIGAVAIGSGASATAQYANAAG 1164 G G A A+G + A G+ A A + A G+ S AT + +VAIG + A A G Sbjct: 59 GAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYG 118 Query: 1165 YNAAASGFGSVSNGAFSQASGDYAVAVGGESEAAGAQSTALGAAA--GAYGDGSLAVGAL 1222 + A G V+ GA + S D VAVG S+A S A+G ++ A S+A+G Sbjct: 119 AASTAQKDG-VAIGARASTS-DTGVAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDR 176 Query: 1223 SEAQGSESTAMGY 1235 S+ S ++G+ Sbjct: 177 SKTDRENSVSIGH 189 Score = 32.2 bits (72), Expect = 0.026 Identities = 51/171 (29%), Positives = 72/171 (42%), Gaps = 4/171 (2%) Query: 535 AIAQGVDSVAAGSNALADSDYSTALGSSSAASAQGATAVGSGANATTDNATAVGFNSTAV 594 A A D + + + ALG A G A+A ++ A+G + A Sbjct: 23 AFADDYDGIPNLTAVQISPNADPALGLEYPVRPPVPGAGGLNASAKGIHSIAIGATAEAA 82 Query: 595 AQNTTALGGNSSASGDASTAVGSASQATANGATALGYESIANGADSTALGVGSVAFGDTS 654 A+G S A+G S A+G S+A + A G S A D A+G + DT Sbjct: 83 KGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQ-KDGVAIGARAST-SDTG 140 Query: 655 TAVGGASVAFGTDSAAFG--ANAAAGGTASTAIGANSSAFGERTVALGGAS 703 AVG S A +S A G ++ AA S AIG S E +V++G S Sbjct: 141 VAVGFNSKADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHES 191
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 204 bits (520), Expect = 1e-62 Identities = 102/379 (26%), Positives = 146/379 (38%), Gaps = 80/379 (21%) Query: 139 QVDLRMYPLQASGALPNDPLLQTNQWHLIDPVGGINVAQAWKTTQGEGVVVAVLDTGILP 198 +V + Y + + + + + I W T+G GV VAVLDTG Sbjct: 4 KVHIIPYQV-----IKQEQQVNEIPRGV----EMIQAPAVWNQTRGRGVKVAVLDTGCDA 54 Query: 199 DHPDLAGNLLAGYDFITDPFFSRRATAERVPGALDLGDWIAEDGDCGLFSVASDSSWHGT 258 DHPDL ++ G +F D D G + D + HGT Sbjct: 55 DHPDLKARIIGGRNFT--------------------------DDDEGDPEIFKDYNGHGT 88 Query: 259 HVAGTVAEATNNGIGGAGVAYRAKVLPVRVLGHCG-GRLSDISDAIVWASGGHVDGVPDN 317 HVAGT+A AT N G GVA A +L ++VL G G+ I I +A VD Sbjct: 89 HVAGTIA-ATENENGVVGVAPEADLLIIKVLNKQGSGQYDWIIQGIYYAIEQKVD----- 142 Query: 318 RDPAEVINLSLGGGGACGSTMQAAIDGAVARGTAVVVAAGNSTADVSTMA----PANCAN 373 +I++SLGG + A+ AVA V+ AAGN P Sbjct: 143 -----IISMSLGGPED-VPELHEAVKKAVASQILVMCAAGNEGDGDDRTDELGYPGCYNE 196 Query: 374 VIAVAATRATGGLADYSNFGRQIDLAGPGGSSMSFVTNDGPIRSFVWQTLYTGKTTPTSG 433 VI+V A +++SN ++DL PG + T+ GK S Sbjct: 197 VISVGAINFDRHASEFSNSNNEVDLVAPG--------------EDILSTVPGGKYATFS- 241 Query: 434 QFTYGGTHYAGTSMASPHVAGTAALVQSALIADGKPPLSPAAMESLLKRTARPFPVSIPV 493 GTSMA+PHVAG AL++ A + L+ + + L + P S Sbjct: 242 ----------GTSMATPHVAGALALIKQLANASFERDLTEPELYAQLIKRTIPLGNS--- 288 Query: 494 ATPAGAGIVDAGAAVARAL 512 G G++ A + Sbjct: 289 PKMEGNGLLYLTAVEELSR 307
>OMADHESIN#Yersinia outer membrane adhesin signature. Length = 455 Score = 30.6 bits (68), Expect = 0.044 Identities = 45/176 (25%), Positives = 66/176 (37%), Gaps = 27/176 (15%) Query: 632 PKMHRDAAHPAAPQWPVLQTASLDLQQAGLRVLA--HPTVASKSFLVTIGDRSVGGLTAR 689 P + +P P PV L+ G+ +A A+K V +G S+ Sbjct: 45 PALG--LEYPVRP--PVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIA-TGVN 99 Query: 690 EQMIGPWQLPLADCAITLAGFDTFEGEAMSIGERTPLALLNAAASARMAVGEAITNLCAA 749 IGP L D A+T T + + ++IG R + + +AVG Sbjct: 100 SVAIGPLSKALGDSAVTYGAASTAQKDGVAIGARA------STSDTGVAVG--------- 144 Query: 750 PVQRLDSIKLSANWMAAAGHSGEDALLYDAVRAIGMELCPALELSVPVGKDSLSMQ 805 +S K A A GHS A + AIG E SV +G +SL+ Q Sbjct: 145 ----FNS-KADAKNSVAIGHSSHVAANHGYSIAIGDRSKTDRENSVSIGHESLNRQ 195
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 57.9 bits (140), Expect = 1e-11 Identities = 31/115 (26%), Positives = 46/115 (40%), Gaps = 12/115 (10%) Query: 140 GATVLYIEDSRVVAEATKRMLERQSLKVVHVLTAEDAFALLTAESLGRTERRIDVVLTDV 199 GAT+L +D + + L R V A + + A D+V+TDV Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-------GDLVVTDV 55 Query: 200 TLKGELNGRDVVGRIRIDFAYGKRRLPVLVMTGDTNPRNQSELLRAGANDLVQKP 254 + E N D++ RI+ LPVLVM+ + GA D + KP Sbjct: 56 VMPDE-NAFDLLPRIKKARP----DLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105 Score = 51.0 bits (122), Expect = 2e-09 Identities = 22/88 (25%), Positives = 39/88 (44%), Gaps = 4/88 (4%) Query: 12 DAPRVMVVDGSKLVRKLIADVLKRDLPNVQVIGCSNIAEARQALEAGAVDLVTTSLSLPD 71 ++V D +R ++ L R V SN A + + AG DLV T + +PD Sbjct: 2 TGATILVADDDAAIRTVLNQALSRA--GYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59 Query: 72 GDGLTLARSVRETAGQAYVPVIVVSGDA 99 + L +++ + +PV+V+S Sbjct: 60 ENAFDLLPRIKKA--RPDLPVLVMSAQN 85
>PF01540#Adhesin lipoprotein Length = 475 Score = 37.4 bits (86), Expect = 5e-05 Identities = 33/102 (32%), Positives = 49/102 (48%), Gaps = 13/102 (12%) Query: 34 MRKPWATLLTIVVMALALALPLGLSIALDNVKLLAGSVQQSREINLFLKVDVAADAAQAL 93 M+K +T+ +A LP+ +I+ ++ KL E N K D A A AL Sbjct: 1 MKKSKKIFITLCGIAATAVLPIA-TISCNDDKL--------AEKNGKEKADAALKQANAL 51 Query: 94 AGELRARPDVAKVTLRTPEQGLAELRESAKLDEAADALGDNP 135 A EL+ PD +K+ L T + +AE +S K A + GD P Sbjct: 52 AEELKKNPDYSKI-LETLNKEIAEATKSFK---EAGSYGDYP 89
>cloacin#Cloacin signature. Length = 551 Score = 30.1 bits (67), Expect = 0.026 Identities = 23/66 (34%), Positives = 26/66 (39%), Gaps = 6/66 (9%) Query: 433 GGGRGGPGGGSRSGSGGGRRDGAGADGKPRPRRKPRVEGQAPATSAPSA-TPVVAAAAVE 491 GGG G GG SGGG G P V PA S P A V+ +A Sbjct: 57 GGGSGHGNGGGNGNSGGGSGTGGNLSAVAAP-----VAFGFPALSTPGAGGLAVSISAGA 111 Query: 492 ASSTIA 497 S+ IA Sbjct: 112 LSAAIA 117
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 31.3 bits (70), Expect = 0.008 Identities = 19/59 (32%), Positives = 29/59 (49%), Gaps = 3/59 (5%) Query: 96 EGGQSQQQRFNQAQQQQNQTQGQNQG--QGQQQGQNQNQNAGQNQQGQGGQGQGQNQQG 152 E S +Q+ Q Q Q ++ QG G +G+QQG Q G Q + G + ++QQ Sbjct: 35 EAEPSLEQQLAQLQMQAHE-QGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQA 92
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 38.0 bits (88), Expect = 2e-05 Identities = 18/92 (19%), Positives = 30/92 (32%), Gaps = 19/92 (20%) Query: 136 PPRYPEAAFRAGATGVVYLMLKIGRDGKVADLIAEQVNLTSLVPESKRARLRQVFADAAS 195 P+YP A G V + + DG+V ++ + A+ +F Sbjct: 164 QPQYPARAQALRIEGQVKVKFDVTPDGRVDNV------------QILSAKPANMFEREVK 211 Query: 196 KKARTWTFLPPTEGPEVEAPYWVMRVPVSFDI 227 R W + P G + V + F I Sbjct: 212 NAMRRWRYEPGKPGSGI-------VVNILFKI 236
>PF09025#YopR Core Length = 143 Score = 28.8 bits (64), Expect = 0.016 Identities = 19/68 (27%), Positives = 27/68 (39%), Gaps = 12/68 (17%) Query: 38 QMRALEQRLG--YPLLQRHARGVTATPQGQQLLDRIAPHLDAIA----------EAFEPF 85 Q+ A EQ LG P R G+ G++LL R A L + A P Sbjct: 28 QVLAFEQALGGEPPAAGRRLAGLENGALGERLLQRFAQPLQGLEADRLELKAMLRAELPL 87 Query: 86 GARREDTL 93 G +++ L Sbjct: 88 GRQQQTFL 95
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 53.7 bits (129), Expect = 7e-10 Identities = 30/149 (20%), Positives = 57/149 (38%), Gaps = 22/149 (14%) Query: 64 ASALGTVTAL-NTVTVSPQVGGQLMSLNFKEGQEVKKGDLLAQIDPRT-------LQASY 115 A+A G +T + + P + + KEG+ V+KGD+L ++ Q+S Sbjct: 84 ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSL 143 Query: 116 DQALAAKRQNQALLA---TSRVNYQRSNDPAYKQYVS-----------RTDLDTQRNQVA 161 QA + + Q L +++ + D Y Q VS + T +NQ Sbjct: 144 LQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKY 203 Query: 162 QYEAAVAANDAQMRSAQVQLQFTRVTAPI 190 Q E + A+ + ++ + + Sbjct: 204 QKELNLDKKRAERLTVLARINRYENLSRV 232 Score = 35.2 bits (81), Expect = 4e-04 Identities = 22/177 (12%), Positives = 64/177 (36%), Gaps = 29/177 (16%) Query: 93 EGQEVKKGDLLAQIDPRTLQASYDQ-------ALAAKR----------QNQALLATSRVN 135 + + ++ +LA+I+ + ++ +L K+ +N+ + A + + Sbjct: 210 DKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELR 269 Query: 136 YQRSNDPAYKQYVSRTDLD-TQRNQVAQYEAAVAANDAQMRSAQV---------QLQFTR 185 +S + + + Q+ + E + + Q + Sbjct: 270 VYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASV 329 Query: 186 VTAPIDGIAGIRGV-DVGNIVTSSSTIVTLT-QIRPIYVSFNLPERELQAVRTGQTA 240 + AP+ V G +VT++ T++ + + + V+ + +++ + GQ A Sbjct: 330 IRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNA 386
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 734 bits (1896), Expect = 0.0 Identities = 298/1072 (27%), Positives = 497/1072 (46%), Gaps = 65/1072 (6%) Query: 4 STIFIRRPIATSLLMAGVLLLGILGYRQLPVSALPEIDAPSLVVTTQYPGANATTMASLV 63 + FIRRPI +L +++ G L QLPV+ P I P++ V+ YPGA+A T+ V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 64 TTPLERQFGQISGLQMMTSDS-SAGLSTIILQFSMERDIDIASQDVQAAIRQAT--LPSS 120 T +E+ I L M+S S SAG TI L F D DIA VQ ++ AT LP Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 121 LPYQPVYNRVNPADAAILTLKLTSDS--LPLREVNRYADAILAQRLSQVPGVGLVSIAGN 178 + Q + + + ++ SD+ +++ Y + + LS++ GVG V + G Sbjct: 122 VQQQGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 179 VRPAVRIQVNPAQLSNMGLTMESLRSALTQTNVSAPKGSLN------GKTQSYSIGTNDQ 232 A+RI ++ L+ LT + + L N G L G+ + SI + Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 233 LTDAAQYRETII-SYKDGRPVRLADVANVVDGVENDQLAAWADGKQAVLLEIRRQPGANI 291 + ++ + + DG VRL DVA V G EN + A +GK A L I+ GAN Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 292 VQTVEQIRNILPQLRSVLPADVHLEVFSDRTETIRASVHEVKFTLVLTIALVVAVIFVFL 351 + T + I+ L +L+ P + + D T ++ S+HEV TL I LV V+++FL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 352 RRLWATIIPSVAVPLSLAGTFGVMAFAGMSLDNLSLMALVVATGFVVDDAIVMIENIVRY 411 + + AT+IP++AVP+ L GTF ++A G S++ L++ +V+A G +VDDAIV++EN+ R Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 412 IEQGKSGP-EAAEIGAKQIGFTVLSLTVSLVAVFLPLLLMPGVTGRLFHEFAWVLSIAVV 470 + + K P EA E QI ++ + + L AVF+P+ G TG ++ +F+ + A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 471 ISMLVSLTLTPMMCAYLLKPDALPEGEDAHERATAAGKTNLWTRTVGAYERSLDWVLAHQ 530 +S+LV+L LTP +CA LLKP + E ++ + +V Y S+ +L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHE--NKGGFFGWFNTTFDHSVNHYTNSVGKILGST 537 Query: 531 PLTLAVAIGAVALTVVLYVAIPKGLLPEQDTGLITGVVQADQNVAFPQMEQRTQAVAAAL 590 L + VA VVL++ +P LPE+D G+ ++Q + ++ V Sbjct: 538 GRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYY 597 Query: 591 RKDPA--VTGVAAFIGAGTMNPTLNQGQLSIVLKTRGEREG----LDEVLPRLQKAVAGI 644 K+ V V G N G + LK ER G + V+ R + + I Sbjct: 598 LKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKI 657 Query: 645 PGVALFLKPVQDV-TLDTRVAATEYQYSISDVDSSELATWAGRMTESMRKLP-ELADVDN 702 + + + L T + + L ++ + P L V Sbjct: 658 RDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 703 NLANQGRALELSIDRDKASMLGVPMQTIDDTLYDSFGQRQISTIFTELNQYRVVLEVAPE 762 N +L +D++KA LGV + I+ T+ + G ++ ++ ++ + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 763 FRTSTALMNQLAVASNGSGALTGTNATSFGQVTSSNSSTATGVGAQNTGIVVGAGSIIPL 822 FR +++L V S G ++P Sbjct: 778 FRMLPEDVDKLYVRSA-------------------------------------NGEMVPF 800 Query: 823 AALAEAKVTNTPLVVSHQQQLPAVTISFNLAPGHSLSQAVAAIEKAREELKMPTQVHAQF 882 +A + + LP++ I APG S A+A +E K+P + + Sbjct: 801 SAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLAS--KLPAGIGYDW 858 Query: 883 VGKAAEFTGSQTDIVWLLLASIVVIYIVLGVLYESYIHPLTIISTLPPAGVGALLALMLC 942 G + + S L+ S VV+++ L LYES+ P++++ +P VG LLA L Sbjct: 859 TGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLF 918 Query: 943 GLSLSVDGIVGIVLLIGIVKKNAIMMIDFAIDA-RREGASAHDAIRRACLLRFRPIMMTT 1001 V +VG++ IG+ KNAI++++FA D +EG +A A +R RPI+MT+ Sbjct: 919 NQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTS 978 Query: 1002 AAAMLGALPLALGTGIGSELRRPLGIAIVGGLLLSQLVTLYTTPVIYLYMER 1053 A +LG LPLA+ G GS + +GI ++GG++ + L+ ++ PV ++ + R Sbjct: 979 LAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030 Score = 76.4 bits (188), Expect = 4e-16 Identities = 58/319 (18%), Positives = 117/319 (36%), Gaps = 14/319 (4%) Query: 747 FTELNQYRVVLEVAPEFRTSTALMNQLAVASNGS-GALTGTNATSFGQVTSSNSSTATGV 805 LN+Y++ L Q + G G + + Sbjct: 190 ADLLNKYKLTPV-----DVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNPE 244 Query: 806 GAQNTGIVVGA-GSIIPLAALAEAKVT--NTPLVVSHQQQLPAVTISFNLAPGHSLSQAV 862 + V + GS++ L +A ++ N ++ + PA + LA G + Sbjct: 245 EFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLATGANALDTA 303 Query: 863 AAIEKAREELK--MPTQVHAQFVGKAAEF-TGSQTDIVWLLLASIVVIYIVLGVLYESYI 919 AI+ EL+ P + + F S ++V L +I+++++V+ + ++ Sbjct: 304 KAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMR 363 Query: 920 HPLTIISTLPPAGVGALLALMLCGLSLSVDGIVGIVLLIGIVKKNAIMMIDFAIDARRE- 978 L +P +G L G S++ + G+VL IG++ +AI++++ E Sbjct: 364 ATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMED 423 Query: 979 GASAHDAIRRACLLRFRPIMMTTAAAMLGALPLALGTGIGSELRRPLGIAIVGGLLLSQL 1038 +A ++ ++ +P+A G + R I IV + LS L Sbjct: 424 KLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVL 483 Query: 1039 VTLYTTPVIYLYMERAGER 1057 V L TP + + + Sbjct: 484 VALILTPALCATLLKPVSA 502
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 751 bits (1941), Expect = 0.0 Identities = 287/1030 (27%), Positives = 489/1030 (47%), Gaps = 26/1030 (2%) Query: 7 FIKRPIGTSLLAIGLFVIGLMCYLRLGVAALPNIQIPIIFVHATQSGADASTMASTVTAP 66 FI+RPI +LAI L + G + L+L VA P I P + V A GADA T+ TVT Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64 Query: 67 LERHLGQLPGIDRMRSSS-SESSSLVVLVFQSSRNIDSAAQDIQTAINASQSDLPSGLGT 125 +E+++ + + M S+S S S + L FQS + D A +Q + + LP + Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124 Query: 126 PMYSKANPNDDPVIAIALTSET--QSADELYNVADSLLAQRLRQITGISSVDIAGASTPA 183 S + ++ S+ + D++ + S + L ++ G+ V + GA A Sbjct: 125 QGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY-A 183 Query: 184 VRVDVDLRALNALGLTPDDLRNAVRAANVTSPTGFL------SDGNTTMAIISNDSVSKA 237 +R+ +D LN LTP D+ N ++ N G L +II+ Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKNP 243 Query: 238 ADFAQLAISTQSNGRIVRLGDVATVYDGQQDAYQAAWFNGKPAVVMYAFTRAGANIVETV 297 +F ++ + S+G +VRL DVA V G ++ A NGKPA + GAN ++T Sbjct: 244 EEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTA 303 Query: 298 DQVKAQIPELRSYLQPGTTLTPYFDRTPTIRASLHEVQATLMISLAMVILTMALFLRRLA 357 +KA++ EL+ + G + +D TP ++ S+HEV TL ++ +V L M LFL+ + Sbjct: 304 KAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMR 363 Query: 358 PTLIAAVTVPLSLAGSALVMYVLGFTLNNLSLLALVIAIGFVVDDAIVVIENVMRHL-DE 416 TLI + VP+ L G+ ++ G+++N L++ +V+AIG +VDDAIVV+ENV R + ++ Sbjct: 364 ATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMED 423 Query: 417 GMSRLDAALAGAREIGFTIVSITASLVAVFIPMLFASGMIGAFFREFTVTLVAAIVVSML 476 + +A +I +V I L AVFIPM F G GA +R+F++T+V+A+ +S+L Sbjct: 424 KLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVL 483 Query: 477 VSLTLTPALCSRFLSPHTEP--EKPGRFGAWLDRMHERMLRVYTVALDFSLRHALLLSLT 534 V+L LTPALC+ L P + E G F W + + + YT ++ L L Sbjct: 484 VALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLI 543 Query: 535 PLLLIAATIFLGSAVKKGSFPAQDTGLIWGRANSSATVSFADMVSRQRRITDMLMADP-- 592 L++A + L + P +D G+ A + ++TD + + Sbjct: 544 YALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKA 603 Query: 593 ---AVKTVGARLGSGRQGSSASFNIELKKRDE--GRRDTTAEVVARLSAKADRYPDLDLR 647 +V TV SG+ ++ + LK +E G ++ V+ R + + D Sbjct: 604 NVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIR--DGF 661 Query: 648 LRAIQDLPSDGGGGTSQGAQYRVSLQGNDLAQLQEWLPKLQAALKKNP-HLRDVGTDVDT 706 + G + + G L + +L ++P L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721 Query: 707 AGLRQNIVIDRAKAARLGISVGAIDGALYGAFGQRSISTIYSDLNQYSVVVNALPSQTAT 766 + + +D+ KA LG+S+ I+ + A G ++ + V A Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781 Query: 767 PKALDQIFVPNRAGQMVPITAVATQAPGLAPPQIIHENQYTTMDLSYNLAPGVSTGEADL 826 P+ +D+++V + G+MVP +A T P++ N +M++ APG S+G+A Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841 Query: 827 IIKSTVQGLRMPDGIRLS-GDDSFNVQHSPNSMGILLLAAVLTVYIVLGMLYESLIHPVT 885 ++++ ++P GI S+ + S N L+ + + V++ L LYES PV+ Sbjct: 842 LMENLAS--KLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 886 ILSTLPAAGVGALLALFITNTELSVISMIALVLLIGIVKKNAIMMIDFALVAQRVHGMDA 945 ++ +P VG LLA + N + V M+ L+ IG+ KNAI++++FA G Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 946 RAAAREASIVRFRPIMMTTMVAILAAVPLAVGLGEGSELRRPLGIAMIGGLVFSQSLTLL 1005 A A +R RPI+MT++ IL +PLA+ G GS + +GI ++GG+V + L + Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1006 STPALYVIFS 1015 P +V+ Sbjct: 1020 FVPVFFVVIR 1029 Score = 109 bits (274), Expect = 3e-26 Identities = 81/506 (16%), Positives = 165/506 (32%), Gaps = 31/506 (6%) Query: 2 NISAPFIKRPIGTSLLAIGLFVIGLMCYLRLGVAALPNIQIPIIFVHA-TQSGADASTMA 60 N + L+ + ++ +LRL + LP + +GA Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587 Query: 61 STVT----------APLERHLGQLPGIDRMRSSSSESSSLVVLVFQSSRNIDS-AAQDIQ 109 + + + G + + + V L RN D +A+ + Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647 Query: 110 TAINASQSDLPSGLGTPMYSKANPNDDPVIAIALTSETQSA-----DELYNVADSLLAQR 164 + G P + A E D L + LL Sbjct: 648 HRAKMELGKIRDGFVIPF--NMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMA 705 Query: 165 LRQITGISSVDIAG-ASTPAVRVDVDLRALNALGLTPDDLRNAVRAANVTSPTGFLSDGN 223 + + SV G T +++VD ALG++ D+ + A + D Sbjct: 706 AQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRG 765 Query: 224 TTMAIIS---NDSVSKAADFAQLAISTQSNGRIVRLGDVATVYDGQQDAYQAAWFNGKPA 280 + D +L + + +NG +V T + + + +NG P+ Sbjct: 766 RVKKLYVQADAKFRMLPEDVDKLYVRS-ANGEMVPFSAFTTSHWVYG-SPRLERYNGLPS 823 Query: 281 VVMYAFTRAGANIVETVDQVKAQIPELRSYLQPGTTLTPYFDRTPTIRASLHEVQATLMI 340 + + G + A + L S L G + + R S ++ A + I Sbjct: 824 MEIQGEAAPGTS----SGDAMALMENLASKLPAGIGYD-WTGMSYQERLSGNQAPALVAI 878 Query: 341 SLAMVILTMALFLRRLAPTLIAAVTVPLSLAGSALVMYVLGFTLNNLSLLALVIAIGFVV 400 S +V L +A + + + VPL + G L + + ++ L+ IG Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938 Query: 401 DDAIVVIENVM-RHLDEGMSRLDAALAGAREIGFTIVSITASLVAVFIPMLFASGMIGAF 459 +AI+++E EG ++A L R I+ + + + +P+ ++G Sbjct: 939 KNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGA 998 Query: 460 FREFTVTLVAAIVVSMLVSLTLTPAL 485 + ++ +V + L+++ P Sbjct: 999 QNAVGIGVMGGMVSATLLAIFFVPVF 1024
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 30.1 bits (67), Expect = 0.024 Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 4/75 (5%) Query: 117 DEAALSANPFRVFTSLLRLELIEDAALRAQAEQILQQRQIFTAGALQLIERHERQGGLDA 176 D ++ R + LLR L A + +L L +++ ER+GG+D Sbjct: 436 DVRRITPKKLRELSDLLRTHLSSAATKQLDMGGVLSDLDTM----LVALDKAEREGGVDK 491 Query: 177 DQARQFVAEALETFR 191 DQ + F + L+T+R Sbjct: 492 DQLKSFNSLILKTYR 506
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 32.9 bits (75), Expect = 0.003 Identities = 43/246 (17%), Positives = 70/246 (28%), Gaps = 57/246 (23%) Query: 127 LAAAQAGRRLIVPLANGAEAAIAGHVEAFTARTL------LDVCATLNGSQKAPAAELAA 180 L + R + L A+ ++A D+ + +A A Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125 Query: 181 QALGARALPDMADVRGQP----HARRALEIAAAGGHHLLLVGSPGCGKTLLASRLPGLLP 236 + D + G+ R L L++ G G GK L+A L Sbjct: 126 PSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALH---- 181 Query: 237 EASEAEALETAAITSISGRGLDLARWRQRPYRAPHHTASPVALVG------------GGT 284 D + R P+ A + A P L+ G Sbjct: 182 ---------------------DYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQ 220 Query: 285 HPRPGEISLSHNGVLFLDEL----PEWQRQTLEVLREPLESGVVTIARASRSVDFPARFQ 340 G + G LFLDE+ + Q + L VL++ + + Sbjct: 221 TRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQG------EYTTVGGRTPIRSDVR 274 Query: 341 LVAAMN 346 +VAA N Sbjct: 275 IVAATN 280
>BCTLIPOCALIN#Bacterial lipocalin signature. Length = 171 Score = 110 bits (276), Expect = 2e-33 Identities = 59/155 (38%), Positives = 88/155 (56%), Gaps = 12/155 (7%) Query: 5 PELATVPS-LDLNRYLGTWYEIARLPTRFEDADCTDVSAHYTLEDDGSVRVQNRCFTAE- 62 PE S +LN YLG WYE+ARL FE + V+A Y + +DG + V NR ++ E Sbjct: 20 PESVKPVSDFELNNYLGKWYEVARLDHSFERG-LSQVTAEYRVRNDGGISVLNRGYSEEK 78 Query: 63 GELEEAVGQARAIDD-THSRLEVTFLPEGLRWIPFTKGDYWVMRIDAD-YTAALVGSPDR 120 GE +EA G+A ++ T L+V+F PF G Y V +D + Y+ A V P+ Sbjct: 79 GEWKEAEGKAYFVNGSTDGYLKVSFFG------PFY-GSYVVFELDRENYSYAFVSGPNT 131 Query: 121 KYLWLLARLPQLDENIAQAYLAHAREQGFDLSPLI 155 +YLWLL+R P ++ I ++ ++E+GFD + LI Sbjct: 132 EYLWLLSRTPTVERGILDKFIEMSKERGFDTNRLI 166
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 50.8 bits (121), Expect = 5e-10 Identities = 20/102 (19%), Positives = 49/102 (48%), Gaps = 1/102 (0%) Query: 12 PPSRKPAISREDLIAAALSLIGPHRSLSTVSLREVAREAGIAPNSFYRQFRDMDELAVAL 71 ++ +R+ ++ AL L + +S+ SL E+A+ AG+ + Y F+D +L + Sbjct: 4 KTKQEAQETRQHILDVALRLFS-QQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62 Query: 72 IDLAGRSLRTIIGQARQRATSTDRSVIRVSVEAFMEQLRADD 113 +L+ ++ + + + + SV+R + +E ++ Sbjct: 63 WELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEE 104
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 31.4 bits (71), Expect = 0.007 Identities = 23/88 (26%), Positives = 37/88 (42%), Gaps = 16/88 (18%) Query: 23 VRGVPLEEQAHAQL--RNIAAVPFVGPW----VAVMP-----DVHLGKGATVGSVIPTRG 71 + +E Q + R A +P+ + VA+ +V L V +V+PTRG Sbjct: 726 AKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLD--NAVANVVPTRG 783 Query: 72 AIIPAAVGVDIGCGMAAVRTTLRANDLP 99 AI+ A +G + TL N+ P Sbjct: 784 AIVRAEFKARVG---IKLLMTLTHNNKP 808
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 39.4 bits (92), Expect = 1e-05 Identities = 15/27 (55%), Positives = 18/27 (66%) Query: 1 MHLLITGGTGFIGQALCPALLQAGHQV 27 M L+TG GFIG + LL+AGHQV Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQV 27
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 85.7 bits (212), Expect = 1e-21 Identities = 26/155 (16%), Positives = 60/155 (38%), Gaps = 5/155 (3%) Query: 2 HLLLVEDDTMLASAICDGVRQQSWTVDHVGHANAAKTVLVDHRYSAVLLDIGLPGESGLS 61 +L+ +DD + + + + + + V +A + V+ D+ +P E+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VIRFMRSHYDATPVIALTARGQLTDRIRGLDAGADDYLVKPFQFDELMARLRAVTRRSQG 121 ++ ++ PV+ ++A+ I+ + GA DYL KPF EL+ + + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 RVVPLLSHGD-----VCLDPGSRKVTKDGKWVALS 151 R L V +++ + + + Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQT 159
>PF06580#Sensor histidine kinase Length = 349 Score = 32.9 bits (75), Expect = 0.002 Identities = 29/156 (18%), Positives = 61/156 (39%), Gaps = 31/156 (19%) Query: 209 LETARRSNRLAEQLLDLARLDAGISSAAYHQVEMGELISHVLDEFSVQADAR---QMQLQ 265 LE ++ + L +L R S+A QV + + ++ V + + ++Q + Sbjct: 187 LEDPTKAREMLTSLSELMRYSLRYSNA--RQVSLADELTVVDSYLQLA-SIQFEDRLQFE 243 Query: 266 VEASPCLLRCDVDAVGILIRNLVDNAIRYG----RLHGKVEVSCGYCVRADVLHPFLQVS 321 + +P ++ V +L++ LV+N I++G GK+ + D L+V Sbjct: 244 NQINPAIMDVQVPP--MLVQTLVENGIKHGIAQLPQGGKILLK----GTKDNGTVTLEVE 297 Query: 322 DDGPGVPEGAQTTIFERFYRVPGSAVQGSGIGLSLV 357 + G + + + +G GL V Sbjct: 298 NTGSLALKNTK---------------ESTGTGLQNV 318
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 35.8 bits (82), Expect = 2e-04 Identities = 20/145 (13%), Positives = 40/145 (27%) Query: 111 QKLTATKDAAKQTLTSTTQAAKQKLSSTSAAAKKKITDTKATTKRKLETAKANAKAEAAA 170 +K T D T + QA + S + + + AE + Sbjct: 986 EKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSK 1045 Query: 171 LSAKTAAKSAARKTAVATVNARTAAKKAAKKAVAKSAAAKKPLVKPAAKKAPVAKQTATR 230 +KT K+ T N A + + + + Sbjct: 1046 QESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETAT 1105 Query: 231 QAAVKKAPLKKAVTKTALKKAAKVT 255 +KA ++ T+ K ++V+ Sbjct: 1106 VEKEEKAKVETEKTQEVPKVTSQVS 1130
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 72.2 bits (177), Expect = 6e-17 Identities = 34/131 (25%), Positives = 59/131 (45%), Gaps = 5/131 (3%) Query: 2 TTLLIADDHPLFREALRGAVQRVMPGVELFEADNV-DALYTLADAQPDADLLLMDLNMPG 60 T+L+ADD R L A+ R G ++ N +A D L++ D+ MP Sbjct: 4 ATILVADDDAAIRTVLNQALSRA--GYDVRITSNAATLWRWIAAGDGD--LVVTDVVMPD 59 Query: 61 AQGFSALVHMRSLHPQLPVVVVSAREEPTVMRRAIDHGAFGFIPKSADSDTIGRALATVL 120 F L ++ P LPV+V+SA+ +A + GA+ ++PK D + + L Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119 Query: 121 DGERWIPAEAQ 131 + P++ + Sbjct: 120 AEPKRRPSKLE 130
>PF04335#VirB8 type IV secretion protein Length = 227 Score = 31.0 bits (70), Expect = 0.005 Identities = 13/70 (18%), Positives = 28/70 (40%), Gaps = 11/70 (15%) Query: 168 LLWLLLTIATF--AAMTLALFVM-------PPQVMFDRSTGGHALRESLRASLHNLP--A 216 L W++ +A A +A+ + P + DR+TG ++ L A Sbjct: 34 LAWVVAGVAGALATAGVVAVAALTPLKTVEPYVITVDRNTGEASIAAKLHGDATITYDEA 93 Query: 217 MLVFFVLAFI 226 + +F+ ++ Sbjct: 94 VRKYFLATYV 103
>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature. Length = 171 Score = 83.9 bits (207), Expect = 1e-22 Identities = 47/171 (27%), Positives = 74/171 (43%), Gaps = 9/171 (5%) Query: 1 MFDIGVGELTLIAVVALVVLGPERLPKAARFAGLWVRRARMQWDSVKQELERELEAEELK 60 MFDIG EL L+ ++ LVVLGP+RLP A + W+R R +V+ EL +EL+ +E + Sbjct: 1 MFDIGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRSLATTVQNELTQELKLQEFQ 60 Query: 61 RSLQDVQ-ASLREAEDQLRTTQQQVEQGARALHEDVGRDIDIRASATPVATPLELAHADL 119 SL+ V+ ASL +L+ + ++ Q A + A + AH Sbjct: 61 DSLKKVEKASLTNLTPELKASMDELRQAA--------ESMKRSYVANDPEKASDEAHTIH 112 Query: 120 SASPNVDTAAGATEAAGTAHTAPVIAQAQPIAPAPQQPLVPAPHDTRVPAP 170 + + AA A T + +P A + + AP Sbjct: 113 NPVVKDNEAAHEGVTPAAAQTQASSPEQKPETTPEPVVKPAADAEPKTAAP 163
>TATBPROTEIN#Bacterial sec-independent translocation TatB protein signature. Length = 171 Score = 31.1 bits (70), Expect = 2e-04 Identities = 10/41 (24%), Positives = 18/41 (43%) Query: 1 MGGFSIWHWLIVLVIVLLVFGTKRLTSGAKDLGSAVKEFKK 41 M L+V +I L+V G +RL K + ++ + Sbjct: 1 MFDIGFSELLLVFIIGLVVLGPQRLPVAVKTVAGWIRALRS 41
>PERTACTIN#Pertactin signature. Length = 922 Score = 28.9 bits (64), Expect = 0.032 Identities = 21/81 (25%), Positives = 26/81 (32%), Gaps = 5/81 (6%) Query: 207 NERPSTDVIAFRDRLEEATYTARANRSTDAAADGAPPVPRPQTPPPAQAQQPANVPPPAS 266 N+ D+ +R RL A N APP P+P P Q PP Sbjct: 538 NKDGKVDIGTYRYRL-----AANGNGQWSLVGAKAPPAPKPAPQPGPQPGPQPPQPPQPP 592 Query: 267 EASTVPMQPSTTPPAQQGFQP 287 + P P P A P Sbjct: 593 QPPQPPQPPQRQPEAPAPQPP 613
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 33.3 bits (76), Expect = 5e-04 Identities = 31/177 (17%), Positives = 56/177 (31%), Gaps = 22/177 (12%) Query: 6 PLRVVAVSGGMQRPSKAVALAEHLLELIADQVPCERHLVEIGALAPHFAGALWRTQVPGA 65 PLR R L H ++ + +++ + PG Sbjct: 309 PLR--------DRAEDIPDLVRHFVQQAE------KEGLDVKRFDQEALELMKAHPWPGN 354 Query: 66 VEQALCLVEQADILVVATPVYRGSFTGLFKHFFDFIDQDALIDTPVLLAATGGSDRHALV 125 V + LV + L + R + + + + AA GS + Sbjct: 355 VRELENLVRRLTALYPQDVITR-------EIIENELRSEIPDSPIEKAAARSGSLSISQA 407 Query: 126 IDHQLRPLFSFFQARTLPLGVYATDRDFLDYRVHNEALAERARLAVQRALPLIELTR 182 ++ +R F+ F P G+Y ++Y + AL R +A L+ L R Sbjct: 408 VEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAAL-TATRGNQIKAADLLGLNR 463
>INTIMIN#Intimin signature. Length = 939 Score = 28.5 bits (63), Expect = 0.047 Identities = 19/83 (22%), Positives = 33/83 (39%), Gaps = 2/83 (2%) Query: 218 TIARTGASGVLLGVAVIAITGLPLLLADRWIGGGNGTAGVAASSTAGAAVATPALIAGMA 277 T+ + G + + V+ ++G +L A+ G+G A V S V A A M Sbjct: 583 TVKKNGVAQANVPVSFNIVSGTAVLSANSANTNGSGKATVTLKSDKPGQVVVSAKTAEM- 641 Query: 278 PQFAPAAPAATALVASAVIVTSL 300 A A A + + +T + Sbjct: 642 -TSALNANAVIFVDQTKASITEI 663
>PF05616#Neisseria meningitidis TspB protein Length = 501 Score = 32.0 bits (72), Expect = 0.006 Identities = 23/68 (33%), Positives = 26/68 (38%), Gaps = 3/68 (4%) Query: 44 GRDKLGTF---VQVDENGKLPASAMPATPAQPLPPAPGATTPADTAVAQAAPAPAPVATP 100 GRD G VQV L + A AQPLP A PA+ P P P Sbjct: 296 GRDSQGNTTVDVQVIPRPDLTPGSAEAPNAQPLPEVSPAENPANNPAPNENPGTRPNPEP 355 Query: 101 APAKSGDA 108 P + DA Sbjct: 356 DPDLNPDA 363
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 26.8 bits (59), Expect = 0.024 Identities = 13/50 (26%), Positives = 17/50 (34%) Query: 63 QQAGQSNGSPSQYTQMLMNIVGDILQAQNGGGFGGGAGGDFGGGLGVSLA 112 Q +S + GD ++ NG GFG D G G A Sbjct: 173 QGKNESQSADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAA 222
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 117 bits (295), Expect = 3e-34 Identities = 79/261 (30%), Positives = 114/261 (43%), Gaps = 9/261 (3%) Query: 1 MPTPAIRPQRVLIAGGSRGIGLAIAEGFVRGGAHVSICARNAAGLAQAADALAAHGTPVH 60 M I + I G ++GIG A+A GAH++ N L + +L A Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60 Query: 61 TLACDLADAAQIDAYVQAAAQALGGLDVVINNAS----GFGHGNDDASWQAGLDVDLMAA 116 D+ D+A ID + +G +D+++N A G H D W+A V+ Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120 Query: 117 VRCNRAALPYLRLSDAAVILNISSINAQRPTPRAIAYSTAKAALDYYTTTLAAELARERI 176 +R+ Y+ + I+ + S A P AY+++KAA +T L ELA I Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180 Query: 177 RVNAISPGSIE--FPDGLWDTRSREEPELY---ARIRDSIPFGGFGQVQHVADAALFLAS 231 R N +SPGS E LW + E + + IP + +ADA LFL S Sbjct: 181 RCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVS 240 Query: 232 PQASWITGQVLAVDGGQSLGV 252 QA IT L VDGG +LGV Sbjct: 241 GQAGHITMHNLCVDGGATLGV 261
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 58.5 bits (141), Expect = 7e-13 Identities = 27/194 (13%), Positives = 56/194 (28%), Gaps = 6/194 (3%) Query: 18 DVRDQIVVAATEHFSRYGYEKTAVSDLAREIGFSKAYIYKFFESKQAIGEMICSHCLGEI 77 + R I+ A FS+ G T++ ++A+ G ++ IY F+ K + I I Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNI 70 Query: 78 -EAEVLAAVSAAASPPEKLRSLFKTIIEASLRLYSRERKLYEIATSA-ATERWPPVI--- 132 E E+ P LR + ++E+++ R + I V Sbjct: 71 GELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQAQ 130 Query: 133 -AYEGHIQALLQEILVQGRQNGDFERKTPLDELTQATYLVMRPYINPVLLQHSLDHAGDV 191 +++ L + + + L Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFAPQSFDLKKE 190 Query: 192 PLLLSSLVLRSLSP 205 +++L Sbjct: 191 ARDYVAILLEMYLL 204
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 43.3 bits (102), Expect = 1e-06 Identities = 16/102 (15%), Positives = 33/102 (32%), Gaps = 7/102 (6%) Query: 70 GKVSERLVDAGQRVKRGQALMRIDPVDLQLAARAQQDAVAAARARAQ-------QTAEDE 122 V E +V G+ V++G L+++ + + Q ++ AR ++ Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164 Query: 123 ARYRDLRGTGAISASAYDQIKAAADAAKAQLSAAQAQADVAR 164 L + +++ K Q S Q Q Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKE 206 Score = 32.5 bits (74), Expect = 0.003 Identities = 11/102 (10%), Positives = 29/102 (28%), Gaps = 5/102 (4%) Query: 98 QLAARAQQDAVAAARARAQQTAEDEARYRDLRGTGAISASAYDQIKAAADAAKAQLSAAQ 157 + + V ++ ++ A+ T D+++ + Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQT----TDNIGLLT 315 Query: 158 AQADVARNANRYTDLLADADGVVMDTLV-EPGQVVAAGQTVV 198 + + + + A V V G VV +T++ Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLM 357
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 434 bits (1118), Expect = e-137 Identities = 227/1048 (21%), Positives = 429/1048 (40%), Gaps = 65/1048 (6%) Query: 8 LSALAVRERSITLFLIFLISLAGLVAFLKLGRAEDPAFTIKVMTIVTAWPGATPQEIQDQ 67 ++ +R L ++ +AG +A L+L A+ P +++ +PGA Q +QD Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 68 VAEKLEKRMQELRWYDRTETYT-RPGLAFTTLTLMDSTPP----GEVQEQFYQARKKAGD 122 V + +E+ M + + + G TLT T P +VQ + A Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPL--- 117 Query: 123 EVANLPAGVIGPLINDEYADVTFAL---FALKAKGEPQRLLARDAE-TLRQRILHVPGVK 178 LP V I+ E + ++ + F G Q ++ ++ + + GV Sbjct: 118 ----LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVG 173 Query: 179 KVNIIGEQPERIFVEFSHERLATLGVSPQDVFAALNAQNALNAAGSVETRGP------QV 232 V + G Q + + + L ++P DV L QN AAG + Sbjct: 174 DVQLFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNA 232 Query: 233 FIRLDGALDSLQKIRDTPLVVQ--GRTLKLSDIATVERGYEDPSTFMIRSGGEPALLLGI 290 I + ++ L V G ++L D+A VE G E+ + R G+PA LGI Sbjct: 233 SIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVI-ARINGKPAAGLGI 291 Query: 291 IMRDGWNGLDLGKSLDAEVGAINAELPLGMRLSKVTDQAVNIDASVGEFMTKFFVALLVV 350 + G N LD K++ A++ + P GM++ D + S+ E + F A+++V Sbjct: 292 KLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLV 351 Query: 351 MLVCFVSMG-WRVGIVVAAAVPLTLAAVFVVMLATGKNFDRITLGSLILALGLLVDDAII 409 LV ++ + R ++ AVP+ L F ++ A G + + +T+ ++LA+GLLVDDAI+ Sbjct: 352 FLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIV 411 Query: 410 AIEMMV-VKMEEGYSRVAASAYAWSHTAAPMLSGTLVTAVGFMPNGFAASTAGEYTSNMF 468 +E + V ME+ A+ + S ++ +V + F+P F + G Sbjct: 412 VVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFS 471 Query: 469 WIVGIALIVSWVVAVVFTPYLGVKMLPDLKKVEGGHAA--------MYDTPRYNRFRDAL 520 + A+ +S +VA++ TP L +L + + +D N + +++ Sbjct: 472 ITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFD-HSVNHYTNSV 530 Query: 521 GRVIASKWLVAGSVVGLFVLAVVGMGIVKKQFFPISDRPEVLVEVQLPYGTSINQTSAAA 580 G+++ S + VV + F P D+ L +QLP G + +T Sbjct: 531 GKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVL 590 Query: 581 AKVEAWLSKQKEAKIVTAYIGQGAPRFFLAMGPELPDPSFAKIVV-----RTDDQHERDA 635 +V + K ++A + + + G + + + A + + R D++ +A Sbjct: 591 DQVTDYYLKNEKANVESVFTVNG-----FSFSGQAQNAGMAFVSLKPWEERNGDENSAEA 645 Query: 636 LKLRLREAIAQ-----GLASEARVRVTQLTFGPYSKFPVAYRVSGPDPTVLRGIAAQVMQ 690 + R + + + + V T + + +G L Q++ Sbjct: 646 VIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELID--QAGLGHDALTQARNQLLG 703 Query: 691 VMQDSP-MLRTVNTDWGVRTPTLHFSLDQDRLQAVGLTSTAVAQQLQFLLTGVPITLVRE 749 + P L +V + T +DQ++ QA+G++ + + Q + L G + + Sbjct: 704 MAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFID 763 Query: 750 DIRSVQVVARSAGDTRLDPARIADFTLAGGNGQRVPLSQVGKVDVRMEEPVMRRRDRVPT 809 R ++ ++ R+ P + + NG+ VP S P + R + +P+ Sbjct: 764 RGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPS 823 Query: 810 ITVGGDVDDQLQPPDVSAAITRQLQPIIDKLPGGYQIREAGSIEESGKATTAMLPLFPIM 869 + + G+ P S ++ + KLP G G + + L I Sbjct: 824 MEIQGEA----APGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAIS 879 Query: 870 LAATLLIIILQVRSISAMVMVFLTSPLGLIGVVPTLILFQQPFGINALVGLIALSGILMR 929 L + S S V V L PLG++GV+ LF Q + +VGL+ G+ + Sbjct: 880 FVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAK 939 Query: 930 NTLILIGQIHH-NEAEGLDPFHALVEATVQRTRPVILTALAAILAFIPLTHSVFWGT--- 985 N ++++ E EG A + A R RP+++T+LA IL +PL S G+ Sbjct: 940 NAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQ 999 Query: 986 --LAYTLIGGTLAGTVLTLVFLPAMYSI 1011 + ++GG ++ T+L + F+P + + Sbjct: 1000 NAVGIGVMGGMVSATLLAIFFVPVFFVV 1027 Score = 70.3 bits (172), Expect = 3e-14 Identities = 59/330 (17%), Positives = 122/330 (36%), Gaps = 24/330 (7%) Query: 712 LHFSLDQDRLQAVGLT----STAVAQQLQFLLTGVPITLVREDIRSVQVVARSAGDTRLD 767 + LD D L LT + Q + G + + + + + Sbjct: 184 MRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK-N 242 Query: 768 PARIADFTL-AGGNGQRVPLSQVGKVDVRMEE-PVMRRRDRVPTITVGGDVDDQLQPPDV 825 P TL +G V L V +V++ E V+ R + P +G + D Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302 Query: 826 SAAITRQLQPIIDKLPGGYQIREA----GSIEESGKATTAMLPLFPIMLAATLLIIILQV 881 + AI +L + P G ++ ++ S L IML L++ L + Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTL-FEAIMLVF--LVMYLFL 359 Query: 882 RSISAMVMVFLTSPLGLIGVVPTLILFQQPFGINA--LVGLIALSGILMRNTLILIGQIH 939 +++ A ++ + P+ L+G IL + IN + G++ G+L+ + ++++ + Sbjct: 360 QNMRATLIPTIAVPVVLLGTF--AILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVE 417 Query: 940 -HNEAEGLDPFHALVEATVQRTRPVILTALAAILAFIPL-----THSVFWGTLAYTLIGG 993 + L P A ++ Q ++ A+ FIP+ + + + T++ Sbjct: 418 RVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSA 477 Query: 994 TLAGTVLTLVFLPAMYSIWFKIRPDPGSGN 1023 ++ L+ PA+ + K N Sbjct: 478 MALSVLVALILTPALCATLLKPVSAEHHEN 507
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 94.0 bits (233), Expect = 1e-24 Identities = 56/191 (29%), Positives = 82/191 (42%), Gaps = 14/191 (7%) Query: 45 VVLITGVSSGIGRAAAEHFARTGCIVYGSVRHLAGATPLTAVELVE--------MDIRDA 96 + ITG + GIG A A A G + + + + E D+RD+ Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69 Query: 97 ASVQRAVDGIIARAGRIDVLVNNAGTNLVGAIEETSVDEAAALFDINVLGILRTVQAVQA 156 A++ I G ID+LVN AG G I S +E A F +N G+ A Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF------NA 123 Query: 157 VQAVLPHMRARGQGRIVNVSSVLGFLPAPYMGVYAASKHAVEGLSETLDHELRQFGISVT 216 ++V +M R G IV V S +P M YA+SK A ++ L EL ++ I Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 217 LVEPAYTKTSL 227 +V P T+T + Sbjct: 184 IVSPGSTETDM 194
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 30.2 bits (68), Expect = 0.010 Identities = 17/82 (20%), Positives = 31/82 (37%), Gaps = 9/82 (10%) Query: 4 TLVAV-VVALTLGHLVPAQVAKLRNFAWFGQWLRRLDSYAAGRGAWQGRYGVLLAVLPAL 62 T+VA+ VV++ L +VP +V + F Q L G +G + + Sbjct: 180 TVVAIAVVSILLSVVVP-KVVEQ--FIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLA 236 Query: 63 LVLLVQWLLD-----DVWHGFL 79 + + +L +H L Sbjct: 237 GFMAFRVMLRQEKRRVSFHRRL 258
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 53.7 bits (129), Expect = 3e-11 Identities = 24/124 (19%), Positives = 51/124 (41%), Gaps = 14/124 (11%) Query: 1 MTAIRTILLAEDSPADAEMAVDALREARLANPIVHVEDGVEAMDYLLRRGVFADREEGLP 60 MT IL+A+D A + AL R + + ++ G Sbjct: 1 MTGAT-ILVADDDAAIRTVLNQALS--RAGYDVRITSNAATLWRWI---------AAGDG 48 Query: 61 AVLLLDIKMPRLDGLEVLKQVRSDETLKRLPVVILSSSREESDLARSWDLGVNAYVVKPV 120 +++ D+ MP + ++L +++ LPV+++S+ ++ + G Y+ KP Sbjct: 49 DLVVTDVVMPDENAFDLLPRIKKAR--PDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF 106 Query: 121 DVDQ 124 D+ + Sbjct: 107 DLTE 110
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 68.7 bits (168), Expect = 2e-14 Identities = 35/147 (23%), Positives = 62/147 (42%), Gaps = 4/147 (2%) Query: 12 KILLVEDSPEDAELLSDQLLEAGLDAAFERVDSEPSLRAALDEFQPDIVLSDLSMPGFSG 71 IL+ +D +L+ L AG D + +L + D+V++D+ MP + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDV--RITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 72 HQALRLVRQNGA-TPFIFVSGTMGEETAVKALQDGANDYIIKH-NPTRLPSAVIRAIREA 129 L +++ P + +S TA+KA + GA DY+ K + T L + RA+ E Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122 Query: 130 RADLERQRVESELMRAQRLESLAMLAA 156 + + +S+ S AM Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEI 149 Score = 41.4 bits (97), Expect = 7e-06 Identities = 28/126 (22%), Positives = 53/126 (42%), Gaps = 15/126 (11%) Query: 380 GQRILLVDGEATRLSLLGNALSSQGYQPQLASDGAAALQLVQQHAMPDLVIIDSDIILLS 439 G IL+ D +A ++L ALS GY ++ S+ A + + DLV+ D + + Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDEN 61 Query: 440 AVSVLLSMQELGYQGPAIVLED-------VGAPLQRAHFPADLPVHVLRKPLEMRRVFRA 492 A +L +++ P +V+ + A + A+ L KP ++ + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAY-------DYLPKPFDLTELIGI 114 Query: 493 VSHALE 498 + AL Sbjct: 115 IGRALA 120
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 117 bits (293), Expect = 5e-34 Identities = 80/253 (31%), Positives = 126/253 (49%), Gaps = 16/253 (6%) Query: 6 KVVLITGAARRIGAQIATTLHAAGYRVALHAHRSADALDARVAELCAQRAGSAHALHADL 65 K+ ITGAA+ IG +A TL + G +A + + L+ V+ L A+ A A A AD+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAE-ARHAEAFPADV 66 Query: 66 RLPDAPAQLVADCLAAFGRLDGVVNNASAFYPTPVGAATAAQWDELFAVNARAPFFIAQA 125 R A ++ A G +D +VN A P + + + +W+ F+VN+ F +++ Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 126 AAAQLRQRR-GAIVNLTDLHAQQPMRNHPLYGASKSALEMLTRSLALELAPQ-VRVNAVA 183 + + RR G+IV + A P + Y +SK+A M T+ L LELA +R N V+ Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 184 PGAI-------LWPEEGKSADAKQALLAR----TPLARIGTPEEIAEAVRWLLDD-ASFV 231 PG+ LW +E + + L PL ++ P +IA+AV +L+ A + Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 232 TGHTLHVDGGRQL 244 T H L VDGG L Sbjct: 247 TMHNLCVDGGATL 259
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 30.6 bits (69), Expect = 0.006 Identities = 11/61 (18%), Positives = 24/61 (39%), Gaps = 3/61 (4%) Query: 25 EQALQPLLDQGWNEQDAIDAVEALVRAHIQQHAQANGLPMPVRV---PALQQDTDASLLA 81 A QP ++ + Q+A++ +R + + + L + R+ LQ + Sbjct: 110 VDAYQPFSEEKISMQEALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRI 169 Query: 82 L 82 L Sbjct: 170 L 170
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 30.4 bits (68), Expect = 0.018 Identities = 21/45 (46%), Positives = 22/45 (48%), Gaps = 2/45 (4%) Query: 39 PPAPAPAPTPAPTPAPTPAPAPSGPAADCPSG--FSNVGTIASNT 81 PPAP PAP P P P P P P PA P+G S A NT Sbjct: 572 PPAPKPAPQPGPQPPQPPQPQPEAPAPQPPAGRELSAAANAAVNT 616
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 43.4 bits (102), Expect = 4e-07 Identities = 41/185 (22%), Positives = 68/185 (36%), Gaps = 20/185 (10%) Query: 83 IVLHGVRAGG-AQAAAYLSGSDGRQGVYRVGDTV-ATGVVVQAIAADHVLLRAGGSVRRI 140 + L GV AG + + D Q V + V + +I D V+L+ G + Sbjct: 95 LSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEVPGYNAKIVSIRPDRVVLQYQGRYEVL 154 Query: 141 ALGESGAAAAALPPAATGATASAAVAATVQSNVSAAAGTSAATAVDPQQLLASAGLRASA 200 L + + P A V + A T+ + V ++ L+ Sbjct: 155 GLYSQEDSGSDGVPGAQ-----------VNEQLQQRASTTMSDYVSFSPIMNDNKLQ--- 200 Query: 201 DGGGFTIMPRGDGALLRQAGLAPGDVLTQINGRTL-DAEHLRELQDELRDGQSATLTCRR 259 G+ + P + GL D+ +NG L DAE ++ + + D + TLT R Sbjct: 201 ---GYRLNPGPKSDSFYRVGLQDNDMAVALNGLDLRDAEQAKKAMERMADVHNFTLTVER 257 Query: 260 DGQTH 264 DGQ Sbjct: 258 DGQRQ 262
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 368 bits (946), Expect = e-120 Identities = 209/679 (30%), Positives = 334/679 (49%), Gaps = 60/679 (8%) Query: 8 WLLSAALLFALPAVPMTALHAADAPAVRLQDVDLRAFIQDVSRATGITFIVDTRVQGSVN 67 + L+ + AL P AA+ + + D++ FI VS+ T I+D V+G++ Sbjct: 10 FSLTLLIFAALLFRPA----AAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTIT 65 Query: 68 VARAQAMSEADLLGMLLAVLRANGLIAVSSGPSTYRIIPDDTAAQQPG-----SAANGNL 122 V ++E L+VL G ++ +++ A +A Sbjct: 66 VRSYDMLNEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGD 125 Query: 123 GFATQVFTLQRVDARSAAEILKPLIGRGGVIMAM--PQGNSLLIADYADNLRRIRTLVAQ 180 T+V L V AR A +L+ L GV + N LL+ A ++R+ T+V + Sbjct: 126 EVVTRVVPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVER 185 Query: 181 IDTDR-AAIDTVTLRNSSAQELARTLTSLF----GQGGERSNVLSVLPVESSNSLIVRGD 235 +D ++ TV L +SA ++ + +T L S V +V+ E +N+++V G+ Sbjct: 186 VDNAGDRSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGE 245 Query: 236 PALVQRVVRTAVDLDGRAERRGDVSVVRLQHASAEQLLPVLQQLVGQAPGNEAQAGQDTR 295 P QR++ LD + +G+ V+ L++A A L+ VL + +E QA + Sbjct: 246 PNSRQRIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTG-ISSTMQSEKQAAKPV- 303 Query: 296 TNAVDVAAASGAAQTQVIAPAAGKRPVIVRY-PGSNALIINADPETQRALMDVIRQLDVH 354 AA + +I++ +NALI+ A P+ L VI QLD+ Sbjct: 304 --------------------AALDKNIIIKAHGQTNALIVTAAPDVMNDLERVIAQLDIR 343 Query: 355 REQVLVEAIVVEISDTAAKRLGVQLLLAGRNGTVPLLATQYSGAAPGIVPLAAAAAGTRS 414 R QVLVEAI+ E+ D LG+Q A +N + TQ++ + +P++ A AG Sbjct: 344 RPQVLVEAIIAEVQDADGLNLGIQW--ANKNAGM----TQFTNSG---LPISTAIAGANQ 394 Query: 415 GNADDDSVLEQARNVAAQSLLGLSGGLIGLAGQSDDAVFGMIIDAVKSDTGSNLLSTPSI 474 N D + SL G+A + M++ A+ S T +++L+TPSI Sbjct: 395 YNKDGTV---------SSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSI 445 Query: 475 MTLDNEQARILVGQEVPITTGEVLGAANDNPFRTIQRQDVGVELEVRPQINTAGGITLAI 534 +TLDN +A VGQEVP+ TG + DN F T++R+ VG++L+V+PQIN + L I Sbjct: 446 VTLDNMEATFNVGQEVPVLTGSQTTS-GDNIFNTVERKTVGIKLKVKPQINEGDSVLLEI 504 Query: 535 KQEVSAIAGPVSAQSSEL--VFNKRQIETRVVVENGAIVALGGLLDQNDRQTVEKVPLLG 592 +QEVS++A S+ SS+L FN R + V+V +G V +GGLLD++ T +KVPLLG Sbjct: 505 EQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLG 564 Query: 593 DVPGLGALFRHRSRNRDKTNLMVFIRPTIIRDAADAQRMTAPRYNYLRERQLADGDPEAA 652 D+P +GALFR S+ K NLM+FIRPT+IRD + ++ ++ +Y + Q E Sbjct: 565 DIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFNDAQSKQRGKENN 624 Query: 653 LDALVRDYLRAQPPQLPAG 671 L +D L P Q A Sbjct: 625 DAMLNQDLLEIYPRQDTAA 643
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 341 bits (877), Expect = e-117 Identities = 173/405 (42%), Positives = 242/405 (59%), Gaps = 8/405 (1%) Query: 1 MPQFDYTVLDLHGRNRHGVISADSVHGARAQLEQRQWVPVRVEAAAATASTSG------- 53 M Q+ Y LD G+ G ADS AR L +R VP+ V+ SG Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60 Query: 54 RAARFSGKDLVLFTRQLATLVETA-PLEEALRTIGTQSERRGVRRVTSRTHALVVEGFRL 112 R R S DL L TRQLATLV + PLEEAL + QSE+ + ++ + + V+EG L Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120 Query: 113 SDAMARQGKAFPALYRAMVAAGESAGALPQVLERLADLLERQAQVRSKLQSALVYPAALA 172 +DAM +F LY AMVAAGE++G L VL RLAD E++ Q+RS++Q A++YP L Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180 Query: 173 VTAGAVVIVLMTFVVPKVVDQFDSMGRALPWLTRAVIGVSQFLLHAGIPLLVALVVALVA 232 V A AVV +L++ VVPKVV+QF M +ALP TR ++G+S + G +L+AL+ +A Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240 Query: 233 TARLLQRPALRLAADRALLRAPLLGRLIRDLHAARMARTLAIMVNSGLPLMEGLMIAART 292 +L++ R++ R LL PL+GR+ R L+ AR ARTL+I+ S +PL++ + I+ Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300 Query: 293 VDNRALRLATDSMVTAIREGGSLAAAMKRAGVFPPTLLYMASSGENSGRLAPMLERAADY 352 + N R A+REG SL A+++ +FPP + +M +SGE SG L MLERAAD Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360 Query: 353 LEREFEAFTAAAMSLLEPAIIVLLGGVVAVIVLSILLPILQFNTL 397 +REF + A+ L EP ++V + VV IVL+IL PILQ NTL Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTL 405
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 183 bits (466), Expect = 1e-62 Identities = 65/141 (46%), Positives = 94/141 (66%), Gaps = 3/141 (2%) Query: 13 LTARRRTRGFTLVELMVVIVIIGLLATVVMINVMPSQDRAMVEKARADVAVLEQALETYR 72 + A + RGFTL+E+MVVIVIIG+LA++V+ N+M ++++A +KA +D+ LE AL+ Y+ Sbjct: 1 MRATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYK 60 Query: 73 LDNLSYPSTEQGLQALLNPPSGLTRPERYRQGGYIRRLPEDPWGHAYQYRRPGRQGGFDV 132 LDN YP+T QGL++L+ P+ Y + GYI+RLP DPWG+ Y PG G +D+ Sbjct: 61 LDNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDL 120 Query: 133 YSLGADGAEGGDADNADIGNW 153 S G DG G + DI NW Sbjct: 121 LSAGPDGEMGTE---DDITNW 138
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 515 bits (1328), Expect = 0.0 Identities = 169/474 (35%), Positives = 256/474 (54%), Gaps = 17/474 (3%) Query: 6 SALVVDDERDIRELLVLTLGRMGLRISTAANLAEARELLANNPYDLCLTDMRLPDGNGIE 65 + LV DD+ IR +L L R G + +N A +A DL +TD+ +PD N + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 66 LVTEIAKHYPQTPVAMITAFGSMDLAVEALKAGAFDFVSKPVDIGVLRGLVKHALELNNR 125 L+ I K P PV +++A + A++A + GA+D++ KP D+ L G++ AL R Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 126 DRPAPPPPPPEQASRLLGDSSAMESLRATISKVARSQAPVYIVGESGVGKELVARTIHEQ 185 RP+ + L+G S+AM+ + ++++ ++ + I GESG GKELVAR +H+ Sbjct: 125 -RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183 Query: 186 GARAAGPFVPVNCGAIPAELMESEFFGHKKGSFTGAHADKPGLFQAAHGGTLFLDEVAEL 245 G R GPFV +N AIP +L+ESE FGH+KG+FTGA G F+ A GGTLFLDE+ ++ Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243 Query: 246 PLQMQVKLLRAIQEKSVRPVGASSESLVDVRILSATHKDLGDLVSDGRFRHDLYYRINVI 305 P+ Q +LLR +Q+ VG + DVRI++AT+KDL ++ G FR DLYYR+NV+ Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303 Query: 306 ELRVPPLRERGGDLPQLAAAIIARLAHSHGRPIPLLTQSALDALNHYGFPGNVRELENIL 365 LR+PPLR+R D+P L + + A G + Q AL+ + + +PGNVRELEN++ Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQ-AEKEGLDVKRFDQEALELMKAHPWPGNVRELENLV 362 Query: 366 ERALALAEDDQISATDLRLPAH---------------GGHRLAAPPGGAAAEPREAVVDI 410 R AL D I+ + G ++ + + D Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422 Query: 411 DPASAALPSYIEQLERAAIQKALEENRWNKTKTAAQLGITFRALRYKLKKLGME 464 P S + ++E I AL R N+ K A LG+ LR K+++LG+ Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 32.6 bits (74), Expect = 0.003 Identities = 19/91 (20%), Positives = 35/91 (38%), Gaps = 9/91 (9%) Query: 210 TGVLVVDTHNHISLANEAALSLLG-DG---DQRTPSTDLSLVALTPELARRLQRWRSGWR 265 VL VD S+ L G G + TD+ + + R L R W Sbjct: 114 NAVLEVDP----SITFSIIDRLFGGTGQAAKVQRDLTDIENSVMEGVIVRILANVRESW- 168 Query: 266 EEEAPLQLGADRPEVQPRFVRLLADSDLALV 296 + L+ + E P+F +++ S++ ++ Sbjct: 169 TQVIDLRPRLGQIETNPQFAQIVPPSEMVVL 199
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 28.4 bits (63), Expect = 0.046 Identities = 20/72 (27%), Positives = 28/72 (38%), Gaps = 17/72 (23%) Query: 17 AADASIRPKRLADYLGQQPVRE----QMEIYIQAAKAR-----------GEAMD--HVLI 59 A A +AD L Q E Q E +I++ K R +D H+L+ Sbjct: 131 LAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLV 190 Query: 60 FGPPGLGKTTLS 71 FGP L + L Sbjct: 191 FGPNSLFQEILD 202
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 61.2 bits (148), Expect = 3e-12 Identities = 46/280 (16%), Positives = 86/280 (30%), Gaps = 35/280 (12%) Query: 39 LWSPE-----RSVEPAAGDPSMEASLDVSAAEARVARQALKATPVETPPPPAPLPEPAPE 93 L++PE ++V+ DV + + A PP PA E Sbjct: 980 LYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039 Query: 94 DSVPPPQ--PIPEPRPQDA--PTPQQAQAQERVAQPDKVDQERVDALAISAEKAKQEQEA 149 + Q E QDA T Q + K + V A + E A+ E Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREV-------AKEAKSNVKANTQTNEVAQSGSET 1092 Query: 150 KRRQEQIDLTERKRQEEAEQKLRLAKQQEEAD------AKKKQAAAQQAAEEAERQKKIA 203 K Q ++E + K+ K QE K++Q+ Q E R+ Sbjct: 1093 KETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPT 1152 Query: 204 EIRRQRAQADKEMALAEQKLRQVAAARAQQASAAAATSAQPTAGQGGTSTDLSAKYAAAI 263 ++ A EQ ++ ++ Q + + + + + +T + Sbjct: 1153 VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212 Query: 264 QQ-------------KVLAQWVRPPSVPPGQKCTINIRQL 290 + + + V P + + T+ + L Sbjct: 1213 SESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDL 1252 Score = 37.0 bits (85), Expect = 1e-04 Identities = 33/217 (15%), Positives = 62/217 (28%), Gaps = 16/217 (7%) Query: 47 EPAAGDPSMEASLDVSAAE-----ARVARQALKATPVETPPPPAPLPEPAPEDSVPPPQP 101 + + EA +V A A+ + + ET E + V + Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATV--EKEEKAKVETEKT 1119 Query: 102 IPEPRPQDAPTPQQAQAQERVAQPDKVDQERVDALAISAEKAKQEQEAKRRQEQIDLTER 161 P+ +P+Q Q++ Q + +E + I +++ A Q + + Sbjct: 1120 QEVPKVTSQVSPKQEQSETVQPQAEP-ARENDPTVNIKEPQSQTNTTADTEQPAKETSSN 1178 Query: 162 KRQEEAEQKLRLAKQQEEADAKKKQAAAQQAAEEAERQKKIAEIRRQRAQADKEMALAEQ 221 Q E + + A Q +E K R+ + + Sbjct: 1179 VEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVR-------SVP 1231 Query: 222 KLRQVAAARAQQASAAAATSAQPTAGQGGTSTDLSAK 258 + A + S A T S D AK Sbjct: 1232 HNVEPATTSSNDRSTVALCDLTSTNTNAVLS-DARAK 1267
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 106 bits (266), Expect = 3e-30 Identities = 35/112 (31%), Positives = 51/112 (45%), Gaps = 11/112 (9%) Query: 67 VYFDLDQDSLKPEFQAIMACHAKYLR--DRPSSRITLQGNADERGSREYNMGLGERRGNA 124 V F+ ++ +LKPE QA + L D + + G D GS YN GL ERR + Sbjct: 221 VLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQS 280 Query: 125 VSSSLQAAGGSASQLTVVSYGEERPVCTESNE---------SCWSQNRRVEI 167 V L + G A +++ GE PV + + C + +RRVEI Sbjct: 281 VVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEI 332
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 33.6 bits (77), Expect = 7e-04 Identities = 23/88 (26%), Positives = 40/88 (45%), Gaps = 7/88 (7%) Query: 30 RVAVLEQQQANSQANNDL---LNQLQQARSDLQALRSTVEQLQHD--NEQLKQ--QSKDQ 82 + AVLEQ+ +A N+L +QL+Q S++ + + + + NE L + Q+ D Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310 Query: 83 YLDLDGRLNRLEGAGGATPPLPPATGSV 110 L L + E A+ P + V Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKV 338
>PF06580#Sensor histidine kinase Length = 349 Score = 36.0 bits (83), Expect = 2e-04 Identities = 40/209 (19%), Positives = 70/209 (33%), Gaps = 24/209 (11%) Query: 146 EHKQHEQHLQLLINELN-HRVKNSLVMVQSLARQSFTNAGGLADAQEKLDARLLALSRAH 204 E L L ++N H + N+L +++L + ++ L L R Sbjct: 155 ASMAQEAQLMALKAQINPHFMFNALNNIRALILED-------PTKAREMLTSLSELMRYS 207 Query: 205 DTLTRENWVS-ADVLELTRDAAALYESHDSQRFTLQGDSCRLDP--RRALALSMALHELC 261 + VS AD L + L R + ++P M + L Sbjct: 208 LRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQ---INPAIMDVQVPPMLVQTLV 264 Query: 262 TNALKHG-ALSLPAGNVMVSWERSTRGEQERLELIWRESGGPPVQP-PTHKGFGTRLLER 319 N +KHG A G +++ + + L +G ++ G G + + Sbjct: 265 ENGIKHGIAQLPQGGKILL----KGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRE 320 Query: 320 GLKHDLKGE---VELSFDPAGVCFRVSIP 345 L+ L G ++LS V V IP Sbjct: 321 RLQM-LYGTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 47.5 bits (113), Expect = 1e-09 Identities = 13/82 (15%), Positives = 37/82 (45%), Gaps = 3/82 (3%) Query: 4 RVLLVEDESLVAMLLEDCLAELGYEVAATVADVDAALQAVQEGNLDLALLDINLGGTLSF 63 +L+ +D++ + +L L+ GY+V ++ + + G+ DL + D+ + +F Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDV-RITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 64 PIAEELDAR--GVPYIFVTGYA 83 + + +P + ++ Sbjct: 64 DLLPRIKKARPDLPVLVMSAQN 85
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 70.6 bits (173), Expect = 2e-14 Identities = 24/126 (19%), Positives = 51/126 (40%) Query: 1063 LLLVEDDATVAQVIVGLLQTRGHHVTHVLHGLAALAEVSTRNFDAGLCDLDLPGLDGAAL 1122 +L+ +DDA + V+ L G+ V + ++ + D + D+ +P + L Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 1123 VAQLRARGVRFPIVAVTARADTDAEPQAMAAGCNGFLRKPVTGDLLAQALARVLADADDG 1182 + +++ P++ ++A+ +A G +L KP L + R LA+ Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125 Query: 1183 QRDREA 1188 E Sbjct: 126 PSKLED 131
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 72.2 bits (177), Expect = 4e-15 Identities = 21/101 (20%), Positives = 46/101 (45%) Query: 1052 RILLVEDDPTVAEVISGLLTNRGHRVVHAAHGLAALSETVDGGFDIALLDLDLPGLDGFA 1111 IL+ +DD + V++ L+ G+ V ++ G D+ + D+ +P + F Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 1112 LASQLRQLGHRFPLLAVTARADSAAQTQAKAAGFDGFMRKP 1152 L ++++ P+L ++A+ +A G ++ KP Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKP 105
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 75.6 bits (186), Expect = 4e-16 Identities = 29/121 (23%), Positives = 51/121 (42%) Query: 1058 RILLVEDDPTIAEVIIGLLRAQGHSVVHAPHGLAALTEAADNTFDLALLDLDLPGLDGFA 1117 IL+ +DD I V+ L G+ V + A DL + D+ +P + F Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 1118 LARQLRVFGYEMPLIAVTARSDEAAEPSAQEAGFDRFLRKPLTGEMLATTIAEALRHARA 1177 L +++ ++P++ ++A++ A E G +L KP L I AL + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 1178 R 1178 R Sbjct: 125 R 125
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 126 bits (319), Expect = 4e-34 Identities = 85/408 (20%), Positives = 176/408 (43%), Gaps = 17/408 (4%) Query: 17 LLWLVSLAIFMQMLDATIVNTALPSMARSLRESPLQMQSVVFSYALAVAMFIPASGWIAD 76 L+WL L+ F +L+ ++N +LP +A + P V ++ L ++ G ++D Sbjct: 16 LIWLCILS-FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSD 74 Query: 77 RFGTRRTFLAAIIVFTLGSLLCAAAQ-HLPQLVAARVVQGIGGAMLLPVGRLAVLKTVAR 135 + G +R L II+ GS++ L+ AR +QG G A + + V + + + Sbjct: 75 QLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK 134 Query: 136 ADFLRAMSFIAIPALIGPLIGPTLGGWLVEVASWHWVFLINLP-IGVLGFIAALKIMPDH 194 + +A I +G +GP +GG + HW +L+ +P I ++ +K++ Sbjct: 135 ENRGKAFGLIGSIVAMGEGVGPAIGGMIAH--YIHWSYLLLIPMITIITVPFLMKLLKKE 192 Query: 195 YGDARKRFDLVGYLMLAFGMVALSLALDGISELGLRHAFVMLLAIGGLAALAGYWLHAGN 254 + FD+ G ++++ G+V L S F+++ + + + H Sbjct: 193 -VRIKGHFDIKGIILMSVGIVFFMLFTTSYSIS-----FLIVSVL----SFLIFVKHIRK 242 Query: 255 TPNALFPLALFKVASYRIGILGNLFARVGSGSMPFLIPLLLQVGLGMSPMNAG-LMMVPV 313 + L K + IG+L ++P +++ +S G +++ P Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302 Query: 314 ALAGMAAKRAAVKLVGRFGYRRVLMLNTVLVGVAMASFALIDIGQPLWLRLVQLACFGAV 373 ++ + LV R G VL + + V+ + + + ++ ++ + G + Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL 362 Query: 374 NSLQFTVMNTVTLRDLDRDQASPGNSLLSMVMMLATGFGAAAAGSLLA 421 + + TV++T+ L + +A G SLL+ L+ G G A G LL+ Sbjct: 363 SFTK-TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 31.5 bits (71), Expect = 0.005 Identities = 33/126 (26%), Positives = 45/126 (35%), Gaps = 15/126 (11%) Query: 174 ELLHHLLGTVTDAVIAYLAAQRAAGAQALQVFDTWGGVLSPAMYREFSLPYLTRIARELE 233 EL +LG V + Y+ AQRA AQ L G+++ A+ S IA + + Sbjct: 274 ELTTKVLGNVGKGISQYIIAQRA--AQGLSTSAAAAGLIASAVTLAISPLSFLSIADKFK 331 Query: 234 R----GTGAERTP---------LVLFGKGNGAYVADLAASGAEAVGVDWTISLADAAQRA 280 R ++R L F K GA A L V IS A Sbjct: 332 RANKIEEYSQRFKKLGYDGDSLLAAFHKETGAIDASLTTISTVLASVSSGISAAATTSLV 391 Query: 281 GGRVAL 286 G V+ Sbjct: 392 GAPVSA 397
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 27.1 bits (60), Expect = 0.035 Identities = 10/26 (38%), Positives = 17/26 (65%) Query: 60 RQHETETLQALLEQDNKLISTGGGAV 85 E ET++ L+E+ +I++GGG V Sbjct: 172 GHVEAETIKKLVERGVIVIASGGGGV 197
>PF06580#Sensor histidine kinase Length = 349 Score = 36.4 bits (84), Expect = 3e-04 Identities = 12/52 (23%), Positives = 23/52 (44%), Gaps = 8/52 (15%) Query: 400 LVRNAMDHGIEPADVRVARGKPARGTVGLNAYHDSGSIVIQITDDGGGLNRD 451 LV N + HGI P G + L D+G++ +++ + G ++ Sbjct: 263 LVENGIKHGIAQ--------LPQGGKILLKGTKDNGTVTLEVENTGSLALKN 306
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.9 bits (80), Expect = 6e-05 Identities = 11/53 (20%), Positives = 21/53 (39%) Query: 109 ILVSSFVAGQGLGRQLMRKLVKWARRKYLDCLFGDVLQSNVPMLQLAESLGFK 161 I V+ +G+G L+ K ++WA+ + L + N+ F Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 83.0 bits (205), Expect = 1e-20 Identities = 36/139 (25%), Positives = 62/139 (44%), Gaps = 1/139 (0%) Query: 4 SAARVLVVEDEAAIADTVLYALRSEGYAPEHCLLGRDALTRLRADPADVVILDVGLPDIN 63 + A +LV +D+AAI + AL GY + A D+V+ DV +PD N Sbjct: 2 TGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 64 GFEVCRTLRS-FSEVPVIFLTARNDEIDRVLGLELGADDYMAKPFSPRELVARVRARLRR 122 F++ ++ ++PV+ ++A+N + + E GA DY+ KPF EL+ + L Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 123 RHAGAAAESGWQPHGAFAI 141 + G + Sbjct: 122 PKRRPSKLEDDSQDGMPLV 140
>PF06580#Sensor histidine kinase Length = 349 Score = 35.6 bits (82), Expect = 3e-04 Identities = 25/103 (24%), Positives = 40/103 (38%), Gaps = 23/103 (22%) Query: 384 LLENA----IAFSKQDSHVRLHARLRDGRWELVVEDRGSGVPDYALERVFERFYSLARPQ 439 L+EN IA Q + L +G L VE+ GS + Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLA-----------------LK 305 Query: 440 TGQRSSGLGLPFVRE-VARLHGGDVMLG-NRHGGGARAVLRLP 480 + S+G GL VRE + L+G + + + G A++ +P Sbjct: 306 NTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 73.1 bits (179), Expect = 4e-18 Identities = 24/180 (13%), Positives = 53/180 (29%), Gaps = 10/180 (5%) Query: 5 ENPMRVRTEEKREAIVQAASEVFLELGFEGASMSQIAARVGGSKRTLYGYFPSKEELFVA 64 + +E R+ I+ A +F + G S+ +IA G ++ +Y +F K +LF Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61 Query: 65 VAKDMSDRYFDPLLHALSQSSGPVDEAL-QRFGEDVLRFLCAPPNITSWQTIIGVSGRSA 123 + + + + E ++ L + + ++ + Sbjct: 62 IW----ELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKC 117 Query: 124 VGALYFSAGQEEGIQRFAEYLQAQVDCGLLHCEDTLLAAHQYAALLESETLMPCLFGALK 183 + QR L + A A L + + G + Sbjct: 118 EFV--GEMAVVQQAQRNLCLESYDRIEQTL---KHCIEAKMLPADLMTRRAAIIMRGYIS 172
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 42.1 bits (99), Expect = 3e-06 Identities = 20/112 (17%), Positives = 37/112 (33%), Gaps = 12/112 (10%) Query: 68 EIRPQVGGIIQSRQFTEGGDVKAGQTLYQIDPAQYRASYASAQASLAKAEATLRTAQLKA 127 EI+P I++ EG V+ G L ++ A+A K +++L A+L+ Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALG-------AEADTLKTQSSLLQARLEQ 150 Query: 128 ERYKELAQIKAISQQEGDDTDAALGQAKADVAAGKASVETARINLAFARLDA 179 RY+ L E + + +L + Sbjct: 151 TRYQIL-----SRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFST 197 Score = 29.8 bits (67), Expect = 0.020 Identities = 14/36 (38%), Positives = 17/36 (47%), Gaps = 1/36 (2%) Query: 67 SEIRPQVGGIIQSRQ-FTEGGDVKAGQTLYQIDPAQ 101 S IR V +Q + TEGG V +TL I P Sbjct: 328 SVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPED 363
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1220 bits (3157), Expect = 0.0 Identities = 666/1034 (64%), Positives = 809/1034 (78%), Gaps = 3/1034 (0%) Query: 1 MARFFIDRPIFAWVLAIIVMLAGILSIATLPIAQYPSIAPPAVAITANYPGASAQTLEDT 60 MA FFI RPIFAWVLAII+M+AG L+I LP+AQYP+IAPPAV+++ANYPGA AQT++DT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQKMKGLDHLSYMASTSESSGAVTITLTFENGTDPDTAQVQVQNKLSLATPLLPQ 120 VTQVIEQ M G+D+L YM+STS+S+G+VTITLTF++GTDPD AQVQVQNKL LATPLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGVTVTKSATNFLNVLAFTSEDGSMSDSDLSDYVAANVQETISRVEGVGDTTLFGS 180 EVQQQG++V KS++++L V F S++ + D+SDYVA+NV++T+SR+ GVGD LFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRVWMDPNKLSNFNLTPVDVRNALQAQNAQISAGQLGALPAVANQQLNATITAQTRL 240 QYAMR+W+D + L+ + LTPVDV N L+ QN QI+AGQLG PA+ QQLNA+I AQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 KTAEQFESILLRTQSDGAQVRLRDVARIELGSESYNTVGRYNGKPAAGLAIKLATGANAL 300 K E+F + LR SDG+ VRL+DVAR+ELG E+YN + R NGKPAAGL IKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTVRAIDKSLEEQEKFFPPGMKVQKPYDTTPFVRISIEQVVHTLVEAVVLVFLVMYLFLQ 360 DT +AI L E + FFP GMKV PYDTTPFV++SI +VV TL EA++LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFGVLAAFGFTINTLTMFAMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTF +LAAFG++INTLTMF MVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 SEEQLSPKDATRKSMDQISGALIGVALVLAAVFVPMAFFSGSTGVIYRQFSITIVSAMTL 480 E++L PK+AT KSM QI GAL+G+A+VL+AVF+PMAFF GSTG IYRQFSITIVSAM L Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVAMILTPALCATLLKPVHKGHGLATTGFFGWFNRLFDRGNTGYQGVVRHMLGKGWRY 540 SVLVA+ILTPALCATLLKPV H GFFGWFN FD Y V +LG RY Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 MLAYAALLALVVFGFMKLPVGFLPDEDQGTLFVLVQLPPGATNARTSDVLKQVEHHFLVD 600 +L YA ++A +V F++LP FLP+EDQG ++QLP GAT RT VL QV ++L + Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 QKDSVAGVFAVTGFSFAGSGQNVGFAFVKLRPWDERTGKGQSVTDVAAKAGAFFAGIRDA 660 +K +V VF V GFSF+G QN G AFV L+PW+ER G S V +A IRD Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 KVFAFAPPAVSELGNATGFDLMLQDRANLGHAALMQARNQLLAELSQD-KRLVAVRPNGQ 719 V F PA+ ELG ATGFD L D+A LGH AL QARNQLL +Q LV+VRPNG Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 720 EDTPEFKLEIDPHKAQAMGVSISDINDTFSSAWGSTYVNDFIDKGRVKKVMLQADAPYRM 779 EDT +FKLE+D KAQA+GVS+SDIN T S+A G TYVNDFID+GRVKK+ +QADA +RM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 780 NPQDIDHWFVRNSAGTMVPFNAFATASWQSGSPRLERYNSVPSMEILGMALPGAASSGEA 839 P+D+D +VR++ G MVPF+AF T+ W GSPRLERYN +PSMEI G A PG SSG+A Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPG-TSSGDA 839 Query: 840 MQIVEAAAAKLPPGIGFEWTGLSRQEKASSGQTGLLYSVSILIVFLCLAALYESWAIPFS 899 M ++E A+KLP GIG++WTG+S QE+ S Q L ++S ++VFLCLAALYESW+IP S Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 900 VILVVPLGVFGTLLAAMLTWKMNDVYFQVGLLTTIGLASKNAILIVEFARELHE-SGKSL 958 V+LVVPLG+ G LLAA L + NDVYF VGLLTTIGL++KNAILIVEFA++L E GK + Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 959 VAAALEAARMRLRPILMTSLAFILGVVPLVLTSGAGAGAQHALGTAVIGGMVSGTVLAIF 1018 V A L A RMRLRPILMTSLAFILGV+PL +++GAG+GAQ+A+G V+GGMVS T+LAIF Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1019 FVPLFFVLVCGLFQ 1032 FVP+FFV++ F+ Sbjct: 1020 FVPVFFVVIRRCFK 1033 Score = 56.8 bits (137), Expect = 5e-10 Identities = 45/334 (13%), Positives = 109/334 (32%), Gaps = 18/334 (5%) Query: 714 VRPNGQEDTPEFKLEIDPHKAQAMGVSISDINDTFSSA----WGSTYVNDFIDKGRVKKV 769 V+ G + ++ +D ++ D+ + G+ Sbjct: 175 VQLFGAQY--AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNA 232 Query: 770 MLQADAPYRMNPQDIDHWFVR-NSAGTMVPFNAFATASWQSGSPR-LERYNSVPSMEILG 827 + A ++ NP++ +R NS G++V A + + R N P+ + Sbjct: 233 SIIAQTRFK-NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGI 291 Query: 828 MALPGA---ASSGEAMQIVEAAAAKLPPGIGFEWTGLSRQEKASSGQTGLLYSV--SILI 882 GA ++ + P G+ + ++ ++ +I++ Sbjct: 292 KLATGANALDTAKAIKAKLAELQPFFPQGMKVLYP-YDTTPFVQLSIHEVVKTLFEAIML 350 Query: 883 VFLCLAALYESWAIPFSVILVVPLGVFGTLLAAMLTWKMNDVYFQVGLLT-TIGLASKNA 941 VFL + ++ + VP+ + GT A + + + + + IGL +A Sbjct: 351 VFLVMYLFLQNMRATLIPTIAVPVVLLGTF-AILAAFGYSINTLTMFGMVLAIGLLVDDA 409 Query: 942 ILIVE-FARELHESGKSLVAAALEAARMRLRPILMTSLAFILGVVPLVLTSGAGAGAQHA 1000 I++VE R + E A ++ ++ ++ +P+ G+ Sbjct: 410 IVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQ 469 Query: 1001 LGTAVIGGMVSGTVLAIFFVPLFFVLVCGLFQRR 1034 ++ M ++A+ P + Sbjct: 470 FSITIVSAMALSVLVALILTPALCATLLKPVSAE 503
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 34.4 bits (79), Expect = 0.001 Identities = 33/202 (16%), Positives = 59/202 (29%), Gaps = 28/202 (13%) Query: 229 SQLTLRQAQTTVETARVDVERYTA-QVAQDRNALVLLVGRSVPVELLPHALPDNASVEGN 287 ++ + Q+++ AR++ RY + + N L P LP P +V Sbjct: 132 AEADTLKTQSSLLQARLEQTRYQILSRSIELNKL--------PELKLP-DEPYFQNVSEE 182 Query: 288 VLASVPAGLPSQLLQRRPDILEAERNLRAANANIGAARAAFFPSISLTASTGSSSSSLSR 347 + + + + Q + + E NL A A +L+ S S Sbjct: 183 EVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSS 242 Query: 348 LFDAGTRAWSFVPTLTLPIFNAGRNRANLDMAKANRDIEVARYEKSIQSA---------- 397 L + + A ++ +E E I SA Sbjct: 243 LLHKQ-----AIAKHAVLEQENKYVEAVNELRVYKSQLEQI--ESEILSAKEEYQLVTQL 295 Query: 398 -FREVSDALAQRDTLGRQLQAQ 418 E+ D L Q L + Sbjct: 296 FKNEILDKLRQTTDNIGLLTLE 317
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 63.5 bits (154), Expect = 2e-14 Identities = 40/211 (18%), Positives = 67/211 (31%), Gaps = 16/211 (7%) Query: 5 APPVAPRRAPHEKRGAILAAAGVLFQQHGFDRTSMDTIAERAMVSKATVYAHFASKEVLF 64 A R IL A LF Q G TS+ IA+ A V++ +Y HF K LF Sbjct: 2 ARKTKQEAQET--RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59 Query: 65 RTTLEALAQASPNRWTALLELQGPLERRLAAVADAVLRVSASSMRDDAAYGLVRPPLLPG 124 E + N LE Q +V +L S + L+ + Sbjct: 60 ---SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHK 116 Query: 125 QMREEMWTLCFGRYDTM-------MRTLLAREVQRGALVIDNVPDASVH-FFGLMTGRPA 176 + + + L ++ L D + + G ++G Sbjct: 117 CEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLME 176 Query: 177 TAAARDDATDAQSVQLDADAYVSGAVALFLR 207 + D + +A YV+ + ++L Sbjct: 177 NWLFAPQSFDLKK---EARDYVAILLEMYLL 204
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 61.8 bits (150), Expect = 1e-12 Identities = 87/368 (23%), Positives = 134/368 (36%), Gaps = 22/368 (5%) Query: 15 ALLALTIGAFGIGTTEFVIMGLLQQVAADLGVSLSAAGLLISGYALGVFVGAPVLTLASA 74 L + + A GIG V+ GLL+ + + G+L++ YAL F APVL S Sbjct: 10 ILSTVALDAVGIGLIMPVLPGLLRDLVHS-NDVTAHYGILLALYALMQFACAPVLGALSD 68 Query: 75 RLPRKAVLVGLMLIFTVGNVACALAPDYTSLMVARVLTSLAHGTFFGVGAVVATSLVPAE 134 R R+ VL+ + V A AP L + R++ + T GA +A + + Sbjct: 69 RFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITDGD 127 Query: 135 RRASAISLMFAGLTVATLLGVPAGAWLGLQLGWRATFWAVAAIGVLATASVAVWVPAAAG 194 RA M A + G G +G A F+A AA+ L + +P + Sbjct: 128 ERARHFGFMSACFGFGMVAGPVLGGLMG-GFSPHAPFFAAAALNGLNFLTGCFLLPESHK 186 Query: 195 AATPASWRQEVAVLQRGQVLLALAITVVGYAGVFAVFTYIQ-----PLLLQVT------G 243 R+ + L A + A + AVF +Q P L V Sbjct: 187 GERRPLRREALNPL----ASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFH 242 Query: 244 FAQAAVSPVLLVFGV-GMIVGNLLGGRLADR-RPTAALLGSLAALVVVLGALGFALHSKA 301 + + L FG+ + ++ G +A R AL+ + A L FA Sbjct: 243 WDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWM 302 Query: 302 AMVAVVGLLGVAAF--ATVAPLQLRVLEHARGAGQNLASSLNIAAFNLGNALGAWLGGVV 359 A +V L A A L +V E +G Q ++L +G L + Sbjct: 303 AFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS 362 Query: 360 IATQAGLV 367 I T G Sbjct: 363 ITTWNGWA 370
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 37.9 bits (88), Expect = 4e-06 Identities = 15/49 (30%), Positives = 29/49 (59%) Query: 5 RSRGFTLIELMVTIAVLAIVVAIGYPSFQGVLRSNRVAAANNELIALLN 53 + RGFTL+E+MV I ++ ++ ++ P+ G A ++++AL N Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALEN 54
>PilS_PF08805#PilS N terminal Length = 185 Score = 34.5 bits (79), Expect = 9e-05 Identities = 28/125 (22%), Positives = 45/125 (36%), Gaps = 7/125 (5%) Query: 5 ARFNARSSRGFTLIEVLIAILVLAFGLLGFALLQTMNVRFVQSANYRTQATNLAYDLTDQ 64 AR +G TL+EVL+ + V+ L +M +QS+N + + ++ Sbjct: 18 ARRKKEQDKGATLMEVLLVVGVIVVLAASAYKLYSMVQSNIQSSNEQNNVLTVIANMKSL 77 Query: 65 MRSNRYLVTQYTAATFAAGSVTPTGACAYPTGTAVPVAQNIARWQCQVA-KALGDKAAAT 123 RY + Y +A G + T A+N W V DK + Sbjct: 78 KFQGRYTDSNYIKTLYAQGLLPSDM----IADTTGASAKNP--WGGSVTITTSSDKYSFN 131 Query: 124 VTYVN 128 V N Sbjct: 132 VVEAN 136
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 88.4 bits (219), Expect = 4e-21 Identities = 36/175 (20%), Positives = 83/175 (47%), Gaps = 3/175 (1%) Query: 439 VIGPSLGAENVERGVTAVVYSFLFTLVFFTIYYRVFGAITSV-ALLFNLLIVVAVMSLFG 497 +GP + E V V +++ + + + + + + A+ +V AL+ ++L+ V + ++ Sbjct: 142 SVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTVGLFAVLQ 201 Query: 498 ATMTLPGFAGLALSVGLSVDANVLINERIREELRL--GVPAKSAIAAGYEKAGGTILDAN 555 L A L G S++ V++ +R+RE L +P + + + + Sbjct: 202 LKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETLSRTVMTG 261 Query: 556 LTGLIVAVALYAFGTGPLKGFALTMMIGIFASMFTAITVSRALAVLIYGSRKKLK 610 +T L+ V + +G ++GF M+ G+F ++++ V++ + + I R K K Sbjct: 262 MTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRNKEK 316
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 282 bits (724), Expect = 2e-96 Identities = 98/320 (30%), Positives = 160/320 (50%), Gaps = 10/320 (3%) Query: 4 FPLHLIPNDTKIDFMSWRKPVLILMLVLAVASVGIIVGKGFNYALEFTGGTLVQTSFQKT 63 F L L+P T DF W+ +V+ +ASV + + G N+ ++F GGT ++T Sbjct: 3 FRLKLVPEKTNFDFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTA 62 Query: 64 VDVDQVREKLSKAGFENAQVQNAR------GGNEVMIRLQPHGQNNNRDDAAR---TVAE 114 +DV R L + + R + MIR+Q + + Sbjct: 63 IDVGVYRAALEPLELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVN 122 Query: 115 DVRKAVTSDENPATVQPGEFVGPQVGKDLALNGVYATVFMLVGFLIYIAFRFEWKFAVVA 174 V A+T+ + + E VGP+V +L V++ + V + YI RFEW+FA+ A Sbjct: 123 KVETALTAVDPALKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGA 182 Query: 175 SLTALFDLLVTVAFVSLTGREFDLTVLAGLLSVMGFAINDIIVVFDRVRENFRALRVEPL 234 + + D+L+TV ++ +FDLT +A LL++ G++IND +VVFDR+REN + PL Sbjct: 183 VVALVHDVLLTVGLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPL 242 Query: 235 -EVLNRSINQTLSRTVITAVMFFLSALALYIYGGESMEGLAETHMIGAVIVVISSVIVAV 293 +V+N S+N+TLSRTV+T + L+ + + I+GG+ + G + G SSV VA Sbjct: 243 RDVMNLSVNETLSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAK 302 Query: 294 PMLSIGPFAVTKQDLLPKAK 313 ++ K+ P K Sbjct: 303 NIVLFIGLDRNKEKKDPSDK 322
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 113 bits (285), Expect = 2e-29 Identities = 79/411 (19%), Positives = 162/411 (39%), Gaps = 17/411 (4%) Query: 23 LILACAI-FMEQMDATVLATALPTLARDFGVAAPAMSIAMTSYLLALAVLIPASGAIADR 81 LI C + F ++ VL +LP +A DF + + T+++L ++ G ++D+ Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75 Query: 82 FGLRRVFGASIWVFVGGSILCSLADS-LPTMVAARVLQGAGGAMMAPLGRLILLRTVERR 140 G++R+ I + GS++ + S ++ AR +QGAG A L +++ R + + Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135 Query: 141 HLVSAMAWTLVPAFIGPMLGPPLGGFFVSYLDWRWIFYINVPIGIAGFLLVRRFIPEIPS 200 + A +G +GP +GG Y+ W ++ I + I I + + + + Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPM-ITIITVPFLMKLLKKEVR 194 Query: 201 ESAPARFDLRGFLLCGTALGCLLFGLEMVSQQNGVGQASWLLAIGGSAGLG-YLWHARHH 259 FD++G +L + + S I ++ H R Sbjct: 195 IKGH--FDIKGIILMSVGIVFFMLFTTS---------YSISFLIVSVLSFLIFVKHIRKV 243 Query: 260 PAPLLDLSLLRIASFRLSVIGGALMRITQGAQPFLLPLLFQIGFGMSAAHSGRLILATAL 319 P +D L + F + V+ G ++ T ++P + + +S A G +I+ Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303 Query: 320 GALLMRS-ITPQLLRRFGYRNSLIGNGVLASLGYMVCAFFRPDWPPSVMFGLLLCCGAFM 378 ++++ I L+ R G L S+ ++ +F + ++ G + Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GL 362 Query: 379 SFQFAAYNTIAYENVPAARMSRASSLYTTLQQLMLSVGVCAGAMILNLAML 429 SF +TI ++ SL L G+ +L++ +L Sbjct: 363 SFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLL 413
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 63.7 bits (155), Expect = 2e-13 Identities = 31/145 (21%), Positives = 62/145 (42%), Gaps = 4/145 (2%) Query: 1 MPSRPLLCVDDESSNLATLRQLL-RDDFALVFAKSGAEALDAVTRHTPKLILLDVELPDM 59 M +L DD+++ L Q L R + + + A + L++ DV +PD Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 60 DGYAVARALKQQPSSNAIPILFVTSRNSEHDERLGLEAGAADYVSKPYSPALLKARIGTQ 119 + + + +K+ +P+L ++++N+ E GA DY+ KP+ L IG Sbjct: 61 NAFDLLPRIKKARPD--LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118 Query: 120 LKLAENARLAQQYREAIHLLGTAGQ 144 L R ++ ++ + G+ Sbjct: 119 LAE-PKRRPSKLEDDSQDGMPLVGR 142
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 73.7 bits (181), Expect = 2e-15 Identities = 36/142 (25%), Positives = 60/142 (42%), Gaps = 4/142 (2%) Query: 1029 LEGAHLLLVDDSDINCEVAQRILEGEGAMVTVAHDGEQAVSTLKRAPNLFHLVLMDVQMP 1088 + GA +L+ DD V + L G V + + + LV+ DV MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD--GDLVVTDVVMP 58 Query: 1089 VVDGYEATRRLRQIPALASLPVIALTAGAFRPQQEKALEAGMNGFIAKPFNVEELVTAIR 1148 + ++ R+++ A LPV+ ++A KA E G ++ KPF++ EL+ I Sbjct: 59 DENAFDLLPRIKK--ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116 Query: 1149 HFLQPGTRRIPSLPHEAQAHAG 1170 L RR L ++Q Sbjct: 117 RALAEPKRRPSKLEDDSQDGMP 138 Score = 61.4 bits (149), Expect = 1e-11 Identities = 29/138 (21%), Positives = 55/138 (39%), Gaps = 17/138 (12%) Query: 891 PRVLIADDHDAALNNLVRIATELGWRVDAVASGHAALQAIEHATEPYDIFLLDWRMPDID 950 +L+ADD A L + + G+ V ++ + I A D+ + D MPD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWI--AAGDGDLVVTDVVMPDEN 61 Query: 951 GVAIARQIRARATPGPH-PVIVM---------VTAYERRLLEQHPEQQDLDAVMTKPVTG 1000 + +I+ P PV+VM + A E+ + P+ DL ++ + G Sbjct: 62 AFDLLPRIKKA---RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIG--IIG 116 Query: 1001 AALHRLVEQLLEQRPGAR 1018 AL + + ++ Sbjct: 117 RALAEPKRRPSKLEDDSQ 134
>adhesinmafb#Neisseria meningitidis: adhesin MafB signature. Length = 467 Score = 32.0 bits (72), Expect = 0.005 Identities = 29/128 (22%), Positives = 43/128 (33%), Gaps = 21/128 (16%) Query: 17 SALRRWLKERSITEVECLVPDITGNARG--KIIPADKFSHDYGTRLPEGIFATTVTGDFP 74 A+ RW++E P+ + A K + P V+GDF Sbjct: 294 EAVDRWIQEN---------PNAAETVEAVFNVAAAAKVAKLAKAAKPG---KAAVSGDFA 341 Query: 75 DDYYALTSPSDSDMHLRPDASTVRMVPWAADPTAQVIHDCYTKDGQPHEL-APRNVLRRV 133 D Y + SDS L +A + + + D +K E+ A N Sbjct: 342 DSYKKKLALSDSARQLYQNAKYREALDIHYEDLIRRKTDGSSKFINGREIDAVTN----- 396 Query: 134 LDAYAQAK 141 DA QAK Sbjct: 397 -DALIQAK 403
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 93.4 bits (232), Expect = 7e-23 Identities = 52/371 (14%), Positives = 116/371 (31%), Gaps = 83/371 (22%) Query: 81 SVAVAPRVSGYVTKVLVSDNQIVEAGQPLLQIDDRTYQATLQQAEAAIAARQADIVAATA 140 S + P + V +++V + + V G LL++ +A + ++++ + + Sbjct: 96 SKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQI 155 Query: 141 NVSAQESALLQARTQVTAAAASLKFAQAEVKRFAPLAASGADTHEHQES-LQHDLARARA 199 + E L + ++ EV R L T ++Q+ + +L + RA Sbjct: 156 LSRSIELNKLPELK-LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA 214 Query: 200 QYDAAQAQAKAGESQIQASRAQLE------------------------QAQAGVKQATAD 235 + A+ E+ + +++L+ +A ++ + Sbjct: 215 ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQ 274 Query: 236 ADQARVAVEDTRLTSRIH------------------------------------------ 253 +Q + + ++ Sbjct: 275 LEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPV 334 Query: 254 -GRVGD-KTVQVGQFLGAGTRTMTIVPQESLYLV-ANFKETQVGLMRPGQPAEIEVDALS 310 +V K G + M IVP++ V A + +G + GQ A I+V+A Sbjct: 335 SVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFP 394 Query: 311 GVK---LHGKVESLSPGTGSQFALLPPENATGNFTKVVQRVPVRIRVLAGEEARKVLVPG 367 + L GKV++++ + G V+ + + L G Sbjct: 395 YTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGNKNIP--LSSG 445 Query: 368 MSVEVTVDTRS 378 M+V + T Sbjct: 446 MAVTAEIKTGM 456
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 103 bits (258), Expect = 1e-25 Identities = 83/407 (20%), Positives = 165/407 (40%), Gaps = 20/407 (4%) Query: 25 WLAVLAGTIGSFMATLDISIVNAALPTIQGEVGASGTEGTWISTAYLVAEIIMIPLTGWF 84 WL +L SF + L+ ++N +LP I + W++TA+++ I + G Sbjct: 18 WLCIL-----SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKL 72 Query: 85 VRTLGLRNFLLICAVMFTAFSVVCGLSTS-LSMMIIGRVGQGLAGGALIPTALTIVATRL 143 LG++ LL ++ SV+ + S S++I+ R QG A + +VA + Sbjct: 73 SDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYI 132 Query: 144 PPSQQTMGTALFGMTVIMGPVIGPLLGGWLTENVSWHYAFFINVPICVGLVALLLLGLRH 203 P + L G V MG +GP +GG + + W Y + +P+ + L+ L Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSY--LLLIPMITIITVPFLMKLLK 190 Query: 204 EKGDWAGLLNADWLGIYGLTAGLGGLTVVLEEGQRERWFESSEINTLSLMALSGFIALVI 263 ++ G D GI ++ G+ + +L F +S + ++++ F+ V Sbjct: 191 KEVRIKGHF--DIKGIILMSVGI--VFFML--------FTTSYSISFLIVSVLSFLIFVK 238 Query: 264 SQFRRRPPVIRLSLLLQRSFGAVFIMVMAVGMILFGVMYMIPQFLAVISGYNTEQAGYVL 323 + P + L F + + + G + M+P + + +T + G V+ Sbjct: 239 HIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI 298 Query: 324 LLSGLPTVLLMPMMPKLLETVDVRILVIAGLICFAAACFVNLSLTADTVGTHFVAGQLLQ 383 + G +V++ + +L + V+ + F + F+ S +T + Sbjct: 299 IFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFV 358 Query: 384 GCGLALAMMSLNQAAISSVPPELAGDASGLFNAGRNLGGSVGLALIS 430 GL+ ++ SS+ + AG L N L G+A++ Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVG 405
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 34.8 bits (80), Expect = 8e-04 Identities = 35/230 (15%), Positives = 66/230 (28%), Gaps = 33/230 (14%) Query: 65 DPLLTQLVTQALADSPNLRAA--QARLRANRALAQQRRAERLPKLNASAVYAYAEPPQTI 122 D LL A AD+ +++ QARL R R E KL + Sbjct: 122 DVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELN-KLPELKLPDEPYFQNVS 180 Query: 123 VDTLGGLQQGQPGQPPAAGSQALDLEKTEIYSAGFDASWELDFFGRRRRAAEGALAQAQA 182 + + L Q +Q E ++R LA+ Sbjct: 181 EEEVLRLTSLIKEQFSTWQNQKYQKELN---------------LDKKRAERLTVLARINR 225 Query: 183 SEAELADAQVQLA-----AEVGQV----YLNYRG----LQARLAIADANLDKIRQSLQLV 229 E + +L + L L + + L++I + Sbjct: 226 YENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA 285 Query: 230 QQRRGQGVASDLQVEQIATQVQQQQAQRLPLEMQSQEALDQLALMVGREP 279 ++ V + +I +++Q L ++ + ++ V R P Sbjct: 286 KEEYQL-VTQLFK-NEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAP 333
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 34.3 bits (78), Expect = 0.002 Identities = 20/74 (27%), Positives = 27/74 (36%), Gaps = 16/74 (21%) Query: 768 KLLRRKRELEQLVAKRT-------AELEQDKRDLEAARAEL-SLKATHDELTGLLN---- 815 L K LE A A + +RDL+A+R L+A H +L Sbjct: 285 TLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEA 344 Query: 816 -RAGI---LAALRE 825 R + L A RE Sbjct: 345 SRQSLRRDLDASRE 358
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 124 bits (313), Expect = 1e-36 Identities = 79/239 (33%), Positives = 131/239 (54%), Gaps = 2/239 (0%) Query: 23 WTMLRTGALLTAMPLIGTRAVPGRVRVMLAGTLAMVLAPILPPVPEWDGFTAQAVLSIAR 82 W +LR AL++ P++ R+VP RV++ LA + +AP LP L++ + Sbjct: 18 WPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFALWLAV-Q 76 Query: 83 ELAVGASMGFMLKLIFEAGALAGELVSQSTGLSFAQMSDPMRGVTSGVIAQWFYLGFGLL 142 ++ +G ++GF ++ F A AGE++ GLSFA DP + V+A+ + LL Sbjct: 77 QILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLALLL 136 Query: 143 FFSANGHLAVIALLVDSYKALPIGTALPDAGAFAEVAPTLFLQILRGGLTLALPMMVAML 202 F + NGHL +I+LLVD++ LPIG ++ AF + I GL LALP++ +L Sbjct: 137 FLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALT-KAGSLIFLNGLMLALPLITLLL 195 Query: 203 AVNLAFGALAKAAPALNPMQLGLPLTVLLGLFLLSSFASEFAPPVQRMFDTAFDAAREL 261 +NLA G L + AP L+ +G PLT+ +G+ L+++ AP + +F F+ ++ Sbjct: 196 TLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLADI 254
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 43.2 bits (102), Expect = 3e-09 Identities = 17/69 (24%), Positives = 32/69 (46%) Query: 13 GLVTVLWIAGPMLLAVLVVGVVIGVVQAATQLNEPTIAFVAKAVALTATLFATGSMLLGH 72 L VL ++G + ++G+++G+ Q TQL E T+ F K + + LF Sbjct: 11 ALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLFLLSGWYGEV 70 Query: 73 LVEFTIALF 81 L+ + + Sbjct: 71 LLSYGRQVI 79
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 239 bits (612), Expect = 2e-81 Identities = 123/228 (53%), Positives = 161/228 (70%), Gaps = 1/228 (0%) Query: 51 PAGSNQLPSLPNVSVGRIGDQPVSLPLQTLLLMTAITLLPSMLLVLTAFTRITIVLGLLR 110 P QLP + + + G Q SLP+QTL+ +T++T +P++LL++T+FTRI IV GLLR Sbjct: 17 PLAFAQLPGITSQPL-PGGGQSWSLPVQTLVFITSLTFIPAILLMMTSFTRIIIVFGLLR 75 Query: 111 QALGTGQTPSNQVLLGLAMFLTALVMMPVWQKMWGAGLQPYLNNQIDFSTAWTLTTQPLR 170 ALGT P NQVLLGLA+FLT +M PV K++ QP+ +I A QPLR Sbjct: 76 NALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISMQEALEKGAQPLR 135 Query: 171 AFMLAQIRETDLMTFAGMAGDSKYAGPDAVPFPVLVASFVTSELKTAFEIGFLIFIPFVI 230 FML Q RE DL FA +A GP+AVP +L+ ++VTSELKTAF+IGF IFIPF+I Sbjct: 136 EFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQIGFTIFIPFLI 195 Query: 231 IDLVVASVLMSMGMMMLSPMLISAPFKILLFILVDGWVLVVGTLAASF 278 IDLV+ASVLM++GMMM+ P I+ PFK++LF+LVDGW L+VG+LA SF Sbjct: 196 IDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQSF 243
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 113 bits (284), Expect = 7e-36 Identities = 50/90 (55%), Positives = 74/90 (82%) Query: 22 DQNAADLNLDVILDVPVTLSLEVGRARIPIRNLLQLNQGSVVELERGAGEPLDVYVNGTL 81 D + A ++D+I+D+PV L++E+GR R+ I+ LL+L QGSVV L+ AGEPLD+ +NG L Sbjct: 46 DVSGAMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYL 105 Query: 82 IAHGEVVVINDRFGIRLTDVVSPSERIRRL 111 IA GEVVV+ D++G+R+TD+++PSER+RRL Sbjct: 106 IAQGEVVVVADKYGVRITDIITPSERMRRL 135
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 259 bits (662), Expect = 1e-86 Identities = 91/327 (27%), Positives = 163/327 (49%), Gaps = 14/327 (4%) Query: 3 VSDLLSQDEIDALLHGVDSGAVNTEPEPLPGEARQ-----YDLSSQDRIIRGRMPTLEMV 57 ++++LSQDEID LL + SG + E + YD D+ + +M TL ++ Sbjct: 1 MTEVLSQDEIDQLLTAISSG--DASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLM 58 Query: 58 NERFARLWRIGLFNLIRRSADLSVRGIDLVKFNEYMHSLYVPTNLNLIRFKPLRGTGLIV 117 +E FARL L +R + V +D + + E++ S+ P+ L +I PL+G ++ Sbjct: 59 HETFARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLE 118 Query: 118 FEPTLVFTVVDNFFGGDGRFHTRIEGREFTATEMRVIQLMLKQTFADLKEAWAPVMDVDF 177 +P++ F+++D FGG G+ R+ T E V++ ++ + A+++E+W V+D+ Sbjct: 119 VDPSITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRP 176 Query: 178 EYINSEINPHFANIVTPREYVVVCRFHVELEGGGGEIHITLPYSMLEPIRELLDAG--IQ 235 E NP FA IV P E VV+ ++ G ++ +PY +EPI L + Sbjct: 177 RLGQIETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFS 236 Query: 236 SDRNDRDDSWNVMLREQLDTAEVTLSSVLASKRMSLRQLTGLKVGDIL---PIDLPAQVP 292 S R + +LR++L T ++ + + + S R+S+R + GL+VGDI+ + Sbjct: 237 SVRRSSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFV 296 Query: 293 LCVEDIPLFTGEFGVSNGNNAVKITAV 319 L + + F + GV A +I Sbjct: 297 LSIGNRKKFLCQPGVVGKKIAAQILER 323
>FLGHOOKFLIK#Flagellar hook-length control protein signature. Length = 375 Score = 47.5 bits (112), Expect = 5e-08 Identities = 40/176 (22%), Positives = 80/176 (45%), Gaps = 6/176 (3%) Query: 247 AAKALEPAADDSAAAAAPDAPAFVLPTTTAPALSRLQDPAPIFSASPTPTPELGSDTFDD 306 A+ L P ++ + A + + +P ++ Q A+P + LGS + Sbjct: 183 PAQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQ 242 Query: 307 AIGARMSWLADQKIGHAHIKVTPNEMGPVEVRLHLEGDKVNASFTAANADTRQALEQSLP 366 ++ +S Q A +++ P ++G V++ L ++ ++ + + R ALE +LP Sbjct: 243 SLSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALP 302 Query: 367 RLREMLGQNGFQLGQADV------GQQQRNSSGNRNGGNDSGNGLTLDDAPPVGIP 416 LR L ++G QLGQ+++ GQQQ S ++ + L +D + +P Sbjct: 303 VLRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQRTANHEPLAGEDDDTLPVP 358
>FLGFLIJ#Flagellar FliJ protein signature. Length = 147 Score = 27.5 bits (60), Expect = 0.016 Identities = 35/142 (24%), Positives = 59/142 (41%), Gaps = 4/142 (2%) Query: 1 MMQSKRIDPLLRRAQEQEDKVARDLAERQRVLETHQSRLEELRRYAEEYANSQMAGTSAV 60 M + + L A+++ + AR L E +R + + +L+ L Y EY N+ + SA Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 Query: 61 ALSNR----RAFLDRLDSAVLQQAQTVQSNIAKVEAERTRLLLASREKQVLEQLAASYRA 116 SNR + F+ L+ A+ Q Q + KV+ + Q + L Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120 Query: 117 QENKVIERRDQREMDDLGARRA 138 R DQ++MD+ R A Sbjct: 121 AALLAENRLDQKKMDEFAQRAA 142
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 361 bits (928), Expect = e-126 Identities = 156/364 (42%), Positives = 220/364 (60%), Gaps = 9/364 (2%) Query: 10 LLAAAVALCALAAPASAERIKDLAQVGGVRGNALVGYGLVVGLDGSGDRTSQAPFTVQSL 69 + +A L A A RIKD+A + R N L+GYGLVVGL G+GD +PFT QS+ Sbjct: 12 VFSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSM 71 Query: 70 KNLLGELGVNVPANVNPQLKNVAAVAIHAELPPFAKPGQPIDITVSSIANAVSLRGGSLL 129 + +L LG+ KN+AAV + A LPPFA PG +D+TVSS+ +A SLRGG+L+ Sbjct: 72 RAMLQNLGITTQGG-QSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLI 130 Query: 130 MAPLKGADGQVYAMAQGNLVVGGFGAQGKDGSRVSVNVPSVGRIPNGATVERALPDVFAG 189 M L GADGQ+YA+AQG L+V GF AQG D + ++ V + R+PNGA +ER LP F Sbjct: 131 MTSLSGADGQIYAVAQGALIVNGFSAQG-DAATLTQGVTTSARVPNGAIIERELPSKFKD 189 Query: 190 TGEITLNLHQNDFTTVSRMVAAIDS----SFGAGTARAVDGVTVAVRSPTDPGARIGLLS 245 + + L L DF+T R+ +++ +G A D +AV+ P L++ Sbjct: 190 SVNLVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKP-RVADLTRLMA 248 Query: 246 RLENVELSPGDAPAKVVVNARTGTVVIGQLVRVMPAAIAHGSLTVTISENTNVSQPGAFS 305 +EN+ + D PAKVV+N RTGT+VIG VR+ A+++G+LTV ++E+ V QP FS Sbjct: 249 EIENLTVET-DTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESPQVIQPAPFS 307 Query: 306 GGRTAVTQQSTITATSEGSRMFKFEGGTTLDQIVRAVNEVGAAPGDLVAILEALKQAGAL 365 G+TAV Q+ I A EGS++ E G L +V +N +G ++AIL+ +K AGAL Sbjct: 308 RGQTAVQPQTDIMAMQEGSKVAIVE-GPDLRTLVAGLNSIGLKADGIIAILQGIKSAGAL 366 Query: 366 TAEL 369 AEL Sbjct: 367 QAEL 370
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 148 bits (374), Expect = 2e-46 Identities = 79/199 (39%), Positives = 111/199 (55%), Gaps = 15/199 (7%) Query: 39 VPVVAPVAQPTAGAIYAAGPSLN-----LYGDRRARDVGDLLTVNLVESTTASSTANTSI 93 VP PVA G+I+ + +N L+ DRR R++GD LT+ L E+ +AS +++ + Sbjct: 40 VPGPTPVA---NGSIFQSAQPINYGYQPLFEDRRPRNIGDTLTIVLQENVSASKSSSANA 96 Query: 94 SKKDATTM---AAPTLLGAPLTVGGLNVLENSTSGDRSFAGKGNTAQSNRMQGSVTVTVM 150 S+ T P L +V SG +F GKG SN G++TVTV Sbjct: 97 SRDGKTNFGFDTVPRYLQGLFGNARADV---EASGGNTFNGKGGANASNTFSGTLTVTVD 153 Query: 151 QRLPNGNLVIQGQKQLRLTQGDELVQVQGIVRAADIAPDNTVPSSKVADARIAYGGRGAI 210 Q L NGNL + G+KQ+ + QG E ++ G+V I+ NTVPS++VADARI Y G G I Sbjct: 154 QVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTISGSNTVPSTQVADARIEYVGNGYI 213 Query: 211 AQSNAMGWLSRFFNSRLSP 229 ++ MGWL RFF + LSP Sbjct: 214 NEAQNMGWLQRFFLN-LSP 231
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 39.2 bits (91), Expect = 1e-05 Identities = 12/41 (29%), Positives = 20/41 (48%) Query: 219 LEGSNVNTVEELVSMIETQRAYEMNAKAISTTDSMLGYLNN 259 S VN EE ++ Q+ Y NA+ + T +++ L N Sbjct: 504 QSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALIN 544 Score = 37.6 bits (87), Expect = 3e-05 Identities = 19/82 (23%), Positives = 31/82 (37%), Gaps = 20/82 (24%) Query: 5 LWVAKTGLDAQQTRMSVISNNLANTNTTGFKRDRAAFEDLLYQQVRAPGGSTSAQTQLPT 64 + A +GL+A Q ++ SNN+++ N G+ R T T Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYTRQT-----------------TIMAQANST 46 Query: 65 ---GLQLGTGVRVVSTFKGFDQ 83 G +G GV V + +D Sbjct: 47 LGAGGWVGNGVYVSGVQREYDA 68
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 30.3 bits (68), Expect = 0.009 Identities = 9/31 (29%), Positives = 19/31 (61%) Query: 5 LYVAMTGARASLQAQSTVSHNLANVDTVGFK 35 + AM+G A+ A +T S+N+++ + G+ Sbjct: 4 INNAMSGLNAAQAALNTASNNISSYNVAGYT 34
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 46.1 bits (109), Expect = 2e-07 Identities = 25/69 (36%), Positives = 37/69 (53%), Gaps = 3/69 (4%) Query: 2 GFNTSLSGINAANADLNVTSNNIANVNTTGFKESRAEFADMFQSTSYGLSRNAVGSGVRV 61 N ++SG+NAA A LN SNNI++ N G+ A Q+ S + VG+GV V Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA---QANSTLGAGGWVGNGVYV 59 Query: 62 SNVAQQFSQ 70 S V +++ Sbjct: 60 SGVQREYDA 68 Score = 44.2 bits (104), Expect = 7e-07 Identities = 31/188 (16%), Positives = 69/188 (36%), Gaps = 16/188 (8%) Query: 232 LQFSDTGALTTPANGIIAMDPFTPSTGAGVLN-MQLNVTGSTQYGEAFALRDTRQDGYAS 290 + F + T + G + ++L TG+ ++F L+ A Sbjct: 363 ISFDNNQWQVTRLASNTTFTVTPDANGKVAFDGLELTFTGTPAVNDSFTLKPVSD---AI 419 Query: 291 GKLNEISIDTSGVVFARYSNGADKPLGQVALSSFVNPQGLQSQGNNMWA-ESY------- 342 ++ + D + + A + D + ++ G ++Y Sbjct: 420 VNMDVLITDEAKIAMASEEDAGDSDNRNGQ-ALLDLQSNSKTVGGAKSFNDAYASLVSDI 478 Query: 343 ---TSGAARTGAPDTSDLGQIESGSLEASTVDLTEQLVNMIVAQRNFQANSQMISTQDQV 399 T+ + A + + Q+ + S V+L E+ N+ Q+ + AN+Q++ T + + Sbjct: 479 GNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAI 538 Query: 400 TQTIINIR 407 +INIR Sbjct: 539 FDALINIR 546
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 38.7 bits (90), Expect = 2e-05 Identities = 15/75 (20%), Positives = 29/75 (38%), Gaps = 9/75 (12%) Query: 184 VLVVDDSRVARQQIRSVLDQLGVSATLLSDGRQALDHLLQVAASGENPADRYAMVISDIE 243 +LV DD R + L + G + S+ + A +V++D+ Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---------AGDGDLVVTDVV 56 Query: 244 MPAMDGYTLTTEIRR 258 MP + + L I++ Sbjct: 57 MPDENAFDLLPRIKK 71
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 28.2 bits (62), Expect = 0.007 Identities = 15/66 (22%), Positives = 24/66 (36%), Gaps = 2/66 (3%) Query: 35 DKLSALQALEAAMPAGEEERLRELAEANRANGALLARRRREVNWALRHLGRTESAPSYDA 94 + +S+LQ + A + A R A A+R+ E R +A +Y Sbjct: 195 EAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEAKRKAEE--QARQQAAIRAANTYAM 252 Query: 95 KGQSSV 100 SV Sbjct: 253 PANGSV 258
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 99 bits (249), Expect = 2e-24 Identities = 34/115 (29%), Positives = 56/115 (48%), Gaps = 1/115 (0%) Query: 447 TLLLLDDEENVLRSLVRLFRRDGYRILAAGNVRDAFDLLATNDVQVILSDQRMSDMSGTE 506 T+L+ DD+ + L + R GY + N + +A D ++++D M D + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 507 FLGRVKMLYPDTVRLVLSGYTDLATVTEAINRGAIYRFLTKPWNDDELREHIRQA 561 L R+K PD LV+S T +A +GA Y +L KP++ EL I +A Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGA-YDYLPKPFDLTELIGIIGRA 118
>PF06580#Sensor histidine kinase Length = 349 Score = 39.5 bits (92), Expect = 3e-05 Identities = 18/84 (21%), Positives = 29/84 (34%), Gaps = 10/84 (11%) Query: 622 NALRHA---CAGEVHLRLHSI-DADSFRLEVSDDGDGFEPEGPR--GLGLIVMRERAQTV 675 N ++H + L D + LEV + G G GL +RER Q + Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGTGLQNVRERLQML 325 Query: 676 GG---TLAIESAPGAGTRVTLRLP 696 G + + G + +P Sbjct: 326 YGTEAQIKLSEKQG-KVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 30.2 bits (68), Expect = 0.016 Identities = 16/76 (21%), Positives = 30/76 (39%) Query: 85 LNVRSPEPGALPLLLTHGWPGSILEFRDVIGPLSHPVAHGGKASDAFHLVIPSLPGFGFS 144 L+V+ + AL L+ H WPG++ E +++ L+ + + S Sbjct: 333 LDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPI 392 Query: 145 GKPTARGWGVGRTAAA 160 K AR + + A Sbjct: 393 EKAAARSGSLSISQAV 408
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 33.6 bits (77), Expect = 8e-04 Identities = 17/69 (24%), Positives = 30/69 (43%), Gaps = 6/69 (8%) Query: 9 IVVAGATGNLGYRIAAALKDQGAAVVALVRHGAG------QSRVTALEGRGVQVRRVEFD 62 +V GA G +G+ ++ L + G VV + Q+R+ L G Q +++ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDLA 62 Query: 63 DAERLRDAI 71 D E + D Sbjct: 63 DREGMTDLF 71
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 60.0 bits (145), Expect = 3e-13 Identities = 28/121 (23%), Positives = 47/121 (38%), Gaps = 2/121 (1%) Query: 12 RPPPDKAGDVDRRLLDAALQLFLERGFEHTSCEDIARLAGAGKASLYARYANKDAIFEAV 71 R +A + + +LD AL+LF ++G TS +IA+ AG + ++Y + +K +F + Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62 Query: 72 IRRDVQTQPLPAASSAPLDLEARLRLAGRAILAHALQ-PQTVAMMRLVVGTSIRAPELAA 130 R IL H L+ T RL++ E Sbjct: 63 WELSES-NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVG 121 Query: 131 E 131 E Sbjct: 122 E 122
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 35.3 bits (81), Expect = 4e-04 Identities = 16/111 (14%), Positives = 30/111 (27%), Gaps = 3/111 (2%) Query: 317 VIPERPQIAAPAARLREISPTVRMPEVAVRPAELPNVPDPAPAPVAAAPIVPATPATPDP 376 + + P + E P PE P + V P P P Sbjct: 59 DLEPPQAVQPPPEPVVEPEP---EPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPK 115 Query: 377 RPAPVAAPSAQAAAQPAPSQASPAQSERSSSAAAAASMPAKPAASSHAGPA 427 R + + + + ++++ S+ + P A S P Sbjct: 116 RDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRALSRNQPQ 166 Score = 33.0 bits (75), Expect = 0.003 Identities = 16/101 (15%), Positives = 30/101 (29%), Gaps = 1/101 (0%) Query: 345 VRPAELPNVPDPAPAPVAAAPIVPATPATPDP-RPAPVAAPSAQAAAQPAPSQASPAQSE 403 V PA+L P P P P+P + APV + +P P + Sbjct: 55 VAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQP 114 Query: 404 RSSSAAAAASMPAKPAASSHAGPAPADRSGGWDVAANADDW 444 + + + ++ A P + + + Sbjct: 115 KRDVKPVESRPASPFENTAPARPTSSTATAATSKPVTSVAS 155 Score = 30.7 bits (69), Expect = 0.013 Identities = 30/123 (24%), Positives = 40/123 (32%), Gaps = 4/123 (3%) Query: 154 PTSEASTQAAATAASSPAHAGVS--AAESEPAASPTPMPAQPATDPVERPDMAQAPENAP 211 P S A A P A EP P P+P P PV P+ P Sbjct: 46 PAQPISVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKP 105 Query: 212 EPVQAASEPITAEIPQVTVQVPPVTIESPLQVTETPVATNDFVVPPPPTITLTPRAIERA 271 +PV+ +P P + P +P T P ++ PRA+ R Sbjct: 106 KPVKKVEQPKRDVKPVESRPASPFENTAP--ARPTSSTATAATSKPVTSVASGPRALSRN 163 Query: 272 APQ 274 PQ Sbjct: 164 QPQ 166
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 54.6 bits (131), Expect = 7e-12 Identities = 15/103 (14%), Positives = 35/103 (33%), Gaps = 5/103 (4%) Query: 18 GGCGKSPQQAAAPTVAPTELAAVKTPPPEYSPQLACAGVGGTTVLRVVVGPQGSPTDVSV 77 + + T + A+ P+Y + + G ++ V P G +V + Sbjct: 138 TSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQI 197 Query: 78 AQSSGQPVLDEAAQTRVREWQFKAATRNGQAVAQTIQVPVSFK 120 + + + + +R W+++ V V + FK Sbjct: 198 LSAKPANMFEREVKNAMRRWRYEPGKPGSGIV-----VNILFK 235
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 556 bits (1435), Expect = 0.0 Identities = 232/1043 (22%), Positives = 448/1043 (42%), Gaps = 59/1043 (5%) Query: 3 VAAFSIRRPVTTIMCFVSLVVVGLIAAFRLPLEALPDISAPFLFVQLPYTGSTPDEVERN 62 +A F IRRP+ + + L++ G +A +LP+ P I+ P + V Y G+ V+ Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 63 LVRPAEEALATMTGIKRMRSTATADG-ANIFIEFSDWDRDIAIAASDARERLDAVRDDFP 121 + + E+ + + + M ST+ + G I + F D IA + +L P Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQS-GTDPDIAQVQVQNKLQLATPLLP 119 Query: 122 EDLQRFHVFKWSSSDEPVLKVRLAS---QTDLTGAYDMLDREFKRRIERIPGVAKVEISG 178 +++Q+ + SS ++ S T D + K + R+ GV V++ G Sbjct: 120 QEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG 179 Query: 179 APPNEVEIAIAPDRLTAHDLSLNDLSERLGKLNFSVSAGQI------DDNGQRIRVQPIG 232 A + I + D L + L+ D+ +L N ++AGQ+ + Sbjct: 180 AQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQT 238 Query: 233 ELRDLQELRELVLNAKG----VRLGDIAEVRLKPTRMNYGRRLDGRPAIGLDVYKERSAN 288 ++ +E ++ L VRL D+A V L N R++G+PA GL + AN Sbjct: 239 RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGAN 298 Query: 289 LVEVSKAALKEVEDIRAQ-PALRDVQVKVIDNQGKAVTSSLAELAEAGAVGLLLSITVLF 347 ++ +KA ++ +++ P ++V + V S+ E+ + ++L V++ Sbjct: 299 ALDTAKAIKAKLAELQPFFPQ--GMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMY 356 Query: 348 FFLRHWPSTLMVTLAIPICFAITLGFMYFVGVTLNILTMMGLLLAVGMLVDNAVVVVESI 407 FL++ +TL+ T+A+P+ T + G ++N LTM G++LA+G+LVD+A+VVVE++ Sbjct: 357 LFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENV 416 Query: 408 YQERERMPDQPQLAALLGTRSVAIALSAGTLCHCIVFVPNLFGETNNISIFMAQIAITIS 467 + P+ A + AL + VF+P + + Q +ITI Sbjct: 417 ERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIP-MAFFGGSTGAIYRQFSITIV 475 Query: 468 VSLLASWLVAISLIPMLSARMKTPPMVTSEHG------------VIARLQRRYAKVLAWT 515 ++ S LVA+ L P L A + P V++EH Y + Sbjct: 476 SAMALSVLVALILTPALCATLLKP--VSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533 Query: 516 LAHRG-WSVAGIILVSAISLVPMKLTKVDMFGGDGGNEAFIQYQWKGSYTREQLGEEIGR 574 L G + + ++V+ + ++ ++L + D G Q T+E+ + + + Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQG-VFLTMIQLPAGATQERTQKVLDQ 592 Query: 575 VENYLQANRAK--YHITQIYSWFSEVEGSNTVVTFDASKVKDLPPLLEKIRKELPRSARA 632 V +Y N + + + + N + F + K + E + + A+ Sbjct: 593 VTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKM 652 Query: 633 DYSIGNQG----------DGGNGNQGVQVQLV---GDSTDALKALADDVIPLLAQR-KEL 678 + G G +L+ G DAL + ++ + AQ L Sbjct: 653 ELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASL 712 Query: 679 RDVHVDTGDRTSELAIRVDRERAAAFGFSAEQVASFVGLALRGTPLREFRRGDNEVPVWV 738 V + + T++ + VD+E+A A G S + + AL GT + +F ++V Sbjct: 713 VSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYV 772 Query: 739 RFAGAEQSKPEDLASFTVRTKDGRSVPLLSLVEVQIRPAATQIGRTNRQTTLTIKANLAE 798 + + PED+ VR+ +G VP + + ++ R N ++ I+ A Sbjct: 773 QADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAP 832 Query: 799 KVTMPEARAAMEAPLKAMSFPAGYSYTFDGGDYQNDGEAMNQMVFNLVIALVMIYVVMAA 858 + +A A ME + PAG Y + G + + NQ + I+ V++++ +AA Sbjct: 833 GTSSGDAMALMENLASKL--PAGIGYDW-TGMSYQERLSGNQAPALVAISFVVVFLCLAA 889 Query: 859 VFESLLFPAAIMSGVLFSIFGVFWLFWITGTSFGIMSFIGILVLMGVVVNNGIVMIEHIN 918 ++ES P ++M V I GV + + +G+L +G+ N I+++E Sbjct: 890 LYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAK 949 Query: 919 NLRRR-GMGRTQALVEGSRERLRPIMMTMGTAILAMVPISLTSTTMFSDGPPYFPMARAI 977 +L + G G +A + R RLRPI+MT IL ++P+++++ + + Sbjct: 950 DLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGA---GSGAQNAVGIGV 1006 Query: 978 AGGLAFSTVVSLLFLPTIYAILD 1000 GG+ +T++++ F+P + ++ Sbjct: 1007 MGGMVSATLLAIFFVPVFFVVIR 1029
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 664 bits (1715), Expect = 0.0 Identities = 263/1142 (23%), Positives = 481/1142 (42%), Gaps = 137/1142 (11%) Query: 24 LVAFATRRRVTIAMITVTMLLFGLIALRSLKVNLLPDLSYPTLTVRTEYTGAAPAEIETL 83 + F RR + ++ + +++ G +A+ L V P ++ P ++V Y GA ++ Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 84 VTEPVEEAVGVVKNLRKLKSIS-RTGQSDVVLEFAWGTNMDQASLEVRDKMEAL--SLPL 140 VT+ +E+ + + NL + S S G + L F GT+ D A ++V++K++ LP Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 141 ETKPPVLLRFNPSTEPIMRLALSPKQAPASDTDAIRQLTGLRRYADEDLKKKLEPVAGVA 200 E + + S+ +M + + Y ++K L + GV Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFV-------SDNPGTTQDDISDYVASNVKDTLSRLNGVG 173 Query: 201 AVKVGGGLEDEIQVDIDQQKLAQLNLPIDNVITRLKEENVNISGGRL------EEGSQRY 254 V++ G + +++ +D L + L +VI +LK +N I+ G+L Sbjct: 174 DVQLFGA-QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNA 232 Query: 255 LVRTVNQFVDLDEIRNMLVTTQSSSGSAAEAAMQQMYAIAASTGSQAALAAAAEVQSTSS 314 + +F + +E + + S Sbjct: 233 SIIAQTRFKNPEEFGKVTLRVNSD------------------------------------ 256 Query: 315 SSSSIAGGMPVRLKDVAQVRQGYKEREAIIRLGGKEAVELAIYKEGDANTVSTAAALRKR 374 G VRLKDVA+V G + I R+ GK A L I AN + TA A++ + Sbjct: 257 -------GSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAK 309 Query: 375 LEQLKATVPGDVEITTIEDQSHFIEHAISDVKKDAVIGGVLAILIIFLFLRDGWSTFVIS 434 L +L+ P +++ D + F++ +I +V K +L L+++LFL++ +T + + Sbjct: 310 LAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPT 369 Query: 435 LSLPVSIITTFFFMGQLGLSLNVMSLGGLALATGLVVDDSIVVLESIAKA-RERGLSVLD 493 +++PV ++ TF + G S+N +++ G+ LA GL+VDD+IVV+E++ + E L + Sbjct: 370 IAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKE 429 Query: 494 AAIAGTREVSMAVMASTLTTIAVFLPLVFVEGIAGQLFRDQALTVAIAIAISLVVSMTLI 553 A ++ A++ + AVF+P+ F G G ++R ++T+ A+A+S++V++ L Sbjct: 430 ATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILT 489 Query: 554 PMLSSLKGAPPMAFPDEPSHPDWQPEQRWLKPVAAGRRGAGASVRYGFFGAAWAVVKVWR 613 P L + LKPV+A + GFFG Sbjct: 490 PALCA----------------------TLLKPVSAEHHEN----KGGFFGWFNTT----- 518 Query: 614 GLSRVVGPVMRKASDLAMAPYARAERGYLAMLPAALRRPWLVLGLAAAAFIGTVFLVPML 673 + + Y + L L + A G V L L Sbjct: 519 --------------------FDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRL 558 Query: 674 GADLIPQLAQDRFEMTVKLPSGTPLAQTDAVVRELQ--LAHDKDPGVASLYGVSGSGTRL 731 + +P+ Q F ++LP+G +T V+ ++ ++ V S++ V+G Sbjct: 559 PSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNGFSF-- 616 Query: 732 DANPTESGENIGKLTVVMAG-----GGSPAVEAAATERLRSSMVGHPGAQV-DFARPALF 785 + +N G V + G + EA R + + V F PA+ Sbjct: 617 ----SGQAQNAGMAFVSLKPWEERNGDENSAEAVI-HRAKMELGKIRDGFVIPFNMPAIV 671 Query: 786 SF--STPLEVEL---RGQDLGELERAGQKLAAMLRAN-GHYADVKSTVEEGFPEIQIRFD 839 +T + EL G L +A +L M + V+ E + ++ D Sbjct: 672 ELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVD 731 Query: 840 QERAGALGLTTRQIADVIVKKVRGDVATRYSFRDRKIDVLVRAQQADRASVDAIRQLIVN 899 QE+A ALG++ I I + G + R R + V+A R + + +L V Sbjct: 732 QEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVR 791 Query: 900 PGSSRPVRLAAVAEVLATTGPSEIHRADQTRVAIVSASL-KDIDLGGAVREVETMVRKDP 958 + V +A G + R + + G A+ +E + K P Sbjct: 792 SANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMALMENLASKLP 851 Query: 959 LAAGVGMHIGGQGEELAQSVKSLLFAFGLAIFLVYLVMASQFESLLHPFVILFTIPLAMV 1018 AG+G G + S ++ +V+L +A+ +ES P ++ +PL +V Sbjct: 852 --AGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIV 909 Query: 1019 GAVLALLMTGKPISVVVFIGLILLVGLVTKNAIILIDKVNQLRE-DGVPKREALIEGARS 1077 G +LA + + V +GL+ +GL KNAI++++ L E +G EA + R Sbjct: 910 GVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRM 969 Query: 1078 RLRPIIMTTLCTLFGFLPLAVAMGEGAEVRAPMAITVIGGLLVSTLLTLLVIPVVYDLLD 1137 RLRPI+MT+L + G LPLA++ G G+ + + I V+GG++ +TLL + +PV + ++ Sbjct: 970 RLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIR 1029 Query: 1138 RR 1139 R Sbjct: 1030 RC 1031
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 54.8 bits (132), Expect = 2e-10 Identities = 41/259 (15%), Positives = 87/259 (33%), Gaps = 34/259 (13%) Query: 33 EGEAKAAEEKKAVDAVPVEIAKAARRAVAASYTGTAALEPRAEAQVVAKTSGVALSVMVE 92 + E +++ V + + Q +AK +V+ + Sbjct: 204 QKELNLDKKRAERLTV-LARINRYENLSRVEKSRLDDFSSLLHKQAIAK-----HAVLEQ 257 Query: 93 EGQKVSAGQALVRLDPDRAHL--AVAQSEAQLRKLENSYRRATQLVGQQLVSA-ADVDQL 149 E + V A L + + ++ + + + ++ + +L ++ L Sbjct: 258 ENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN---EILDKLRQTTDNIGLL 314 Query: 150 KFDVENSRAQHRLASLELSYTTVQAPISGVIASRSIKT-GNFVQINTPIFRIV-DDSQLE 207 + + ++AP+S + + T G V + IV +D LE Sbjct: 315 -------TLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLE 367 Query: 208 ATLNVPERELATLKSGQPVTLLADALPGQQF---VGKVDRIAP--VVDSGSGT-FRVVCA 261 T V +++ + GQ + +A P ++ VGKV I + D G F V+ + Sbjct: 368 VTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIIS 427 Query: 262 FGQGAEA-------LQPGM 273 + + L GM Sbjct: 428 IEENCLSTGNKNIPLSSGM 446 Score = 43.7 bits (103), Expect = 8e-07 Identities = 16/74 (21%), Positives = 33/74 (44%), Gaps = 9/74 (12%) Query: 78 VVAKTSGVALSVMVEEGQKVSAGQALVRLDPDRAHLAVAQSEAQLRKLENSYR--RATQL 135 + + + ++V+EG+ V G L++L +EA K ++S R Q Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTA-------LGAEADTLKTQSSLLQARLEQT 151 Query: 136 VGQQLVSAADVDQL 149 Q L + ++++L Sbjct: 152 RYQILSRSIELNKL 165
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.9 bits (197), Expect = 2e-17 Identities = 32/130 (24%), Positives = 58/130 (44%), Gaps = 4/130 (3%) Query: 1002 RVLLVDDDQDSREAVMQFLMLAGAQVQAAGSVDAAEHCLANAHFDVLVSDIAMPLRDGYD 1061 +L+ DDD R + Q L AG V+ + +A D++V+D+ MP + +D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 1062 LIRTVRSGRADLPRHIPAIALTAYVREEDRDRAVVAGFDAHMGKPVEPPGLVDLIERLIL 1121 L+ ++ R DLP + ++A +A G ++ KP + L+ +I R + Sbjct: 65 LLPRIKKARPDLPV----LVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 1122 PTHAVRAALE 1131 + LE Sbjct: 121 EPKRRPSKLE 130
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 32.9 bits (75), Expect = 0.002 Identities = 26/158 (16%), Positives = 47/158 (29%), Gaps = 17/158 (10%) Query: 151 REENAPWLDMPAFGLNRN----HQSRLQKLARAQ----QEFQAQSEAYGEQLKAAIEQAF 202 P L +P +N RL L + Q Q + Q E ++ +A Sbjct: 161 ELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVL 220 Query: 203 ARFASKLSEHESSGSQLTSARALFD------LWIEAAEESYADVALSEQFRKVYGGFANA 256 AR + S+L +L + E Y + +VY Sbjct: 221 ARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA---VNELRVYKSQLEQ 277 Query: 257 HMRLRAALQEEVEQLSERFGMPTRSEMDAAHRRIAELE 294 + +EE + +++ F ++ I L Sbjct: 278 IESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLT 315
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 101 bits (252), Expect = 7e-28 Identities = 71/256 (27%), Positives = 109/256 (42%), Gaps = 13/256 (5%) Query: 2 RSILITGAGSGIGAGIASQLAADGHHLLVSDVQLAAAERTADALRQVGGSAEALALDVTD 61 + ITGA GIG +A LA+ G H+ D E+ +L+ AEA DV D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 62 ANSIAQALAHASRAPQ---VLVNNAGLQQVAALEEFPMQQWALLVDVMLTGAARLSRAVL 118 + +I + A R +LVN AG+ + + ++W V TG SR+V Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 119 PGMRAAGYGRIVNIGSIHSLVASPYKSAYVAAKHGLVGLAKVIALETADCDITVNTLCPS 178 M G IV +GS + V +AY ++K V K + LE A+ +I N + P Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188 Query: 179 YVRT----PLVERQIADQARTRGIAEDAVVRDVMLKPMPKGAFIEYDELAGTVAFLMSHA 234 T L + + +G E +P + ++A V FL+S Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKT------GIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 235 ARNITGQSIAIDGGWT 250 A +IT ++ +DGG T Sbjct: 243 AGHITMHNLCVDGGAT 258
>BACTRLTOXIN#Bacterial toxin signature. Length = 266 Score = 28.7 bits (64), Expect = 0.011 Identities = 7/30 (23%), Positives = 14/30 (46%) Query: 73 YDLCDPVTGEPDPSAYVRLYRDARQAETTH 102 YD+ + D S Y+ +Y D + ++ Sbjct: 225 YDMMPAPGDKFDQSKYLMMYNDNKTVDSKS 254
>CLENTEROTOXN#Clostridium enterotoxin signature. Length = 319 Score = 31.6 bits (71), Expect = 0.003 Identities = 14/64 (21%), Positives = 21/64 (32%), Gaps = 2/64 (3%) Query: 3 TIRPVFYVSDGTGITAETIGHSLLTQF--SGFNFVTDRMSFIDDADKARDAAMRVRAAGE 60 + V+ G T+E I S+ F + T S A +V A Sbjct: 78 SKEVSINVNFSVGFTSEFIQASVEYGFGITIGEQNTIERSVSTTAGPNEYVYYKVYATYR 137 Query: 61 RYQV 64 +YQ Sbjct: 138 KYQA 141
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 279 bits (715), Expect = 2e-86 Identities = 138/574 (24%), Positives = 234/574 (40%), Gaps = 89/574 (15%) Query: 260 KAIRMVYSDVPGERVRTEDTPAE---LRSTFSISDEDVQELSKQAL---------VIEKH 307 KA + +V E+ D E L + S E+++ + Q + H Sbjct: 18 KAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQTEASMGADKAEIFAAH 77 Query: 308 YGRPMDIEWAKDGVSGKLFIVQARPETVKSRSHATQIERFALEAKGAKILAEGRAVGAKI 367 D E + GK+ Q E + F E+ + + E RA A I Sbjct: 78 LLVLDDPELVDG-IKGKIENEQMNAEYALKEVSDMFVSMF--ESMDNEYMKE-RA--ADI 131 Query: 368 GSGVARVVRSLDDMNRVQAGD-----VLIA-DMTDPDWEPVMK-RASAIVTNRGGRTCHA 420 RV+ L + V+IA D+T D + K T+ GGRT H+ Sbjct: 132 RDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFATDIGGRTSHS 191 Query: 421 AIIARELGVPAVVGSGNATAVIKDGQEVTVSCAEG---------DTGFIYEGKLAFERTT 471 AI++R L +PAVVG+ T I+ G V V EG + E + AFE+ Sbjct: 192 AIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYEEKRAAFEKQK 251 Query: 472 TDLGNMPPAP--------LKIMMNVANPERAFDFGQLPNAGIGLARLEMIIASHIGVHPN 523 + + P +++ N+ P+ GIGL R E + + Sbjct: 252 QEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFLYMDR-----D 306 Query: 524 ALLEYDKQDADVRKKIDAKIAGYGDPVSFYVNRLAEGIATLTASVAPNTVIVRLSDFKSN 583 L ++Q ++ + G P V++R D + Sbjct: 307 QLPTEEEQFEAYKEVVQRM---DGKP-----------------------VVIRTLDIGGD 340 Query: 584 EYANLIGGSRYEPHEENPMIGFRGASRYVAPSFTKAFALECKAVLKVRNEMGLDNLWVMI 643 + + + P E NP +GFR + F + +A+L+ NL VM Sbjct: 341 KELSYL----QLPKELNPFLGFRAIRLCL--EKQDIFRTQLRALLRAS---TYGNLKVMF 391 Query: 644 PFVRTLEEGRKVIEVLEQNGLKQGENG------LKIIMMCELPSNALLADEFLEIFDGFS 697 P + TLEE R+ ++++ K G +++ +M E+PS A+ A+ F + D FS Sbjct: 392 PMIATLEELRQAKAIMQEEKDKLLSEGVDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFS 451 Query: 698 IGSNDLTQLTLGLDRDSSIVAHLFDERNLAVKKLLSLAIKSARAKGKYVGICGQGPSDHP 757 IG+NDL Q T+ DR + V++L+ + A+ +L+ + IK+A ++GK+VG+CG+ D Sbjct: 452 IGTNDLIQYTMAADRMNERVSYLYQPYHPAILRLVDMVIKAAHSEGKWVGMCGEMAGD-E 510 Query: 758 ELAEWLMQEGIESVSLNPDTVVDTWLRLAKLKSE 791 L+ G++ S++ +++ +L KL E Sbjct: 511 VAIPLLLGLGLDEFSMSATSILPARSQLLKLSKE 544
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 44.9 bits (106), Expect = 6e-07 Identities = 36/163 (22%), Positives = 68/163 (41%), Gaps = 8/163 (4%) Query: 24 LWLAILG--SNIGTWINDVAASWVMAEQTGSPLMVAAVQSATTLPVVLLALVAGTLADIV 81 +WL IL S + + +V+ + + P V +A L + V G L+D + Sbjct: 17 IWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQL 76 Query: 82 DRRRYLLLTQAWMLLVAGLLALLAHLQLLTPWVLVALTFAMGVGAAMAMPAQAAIVSELV 141 +R LL +++ +++ + +L+ F G GAA +V+ + Sbjct: 77 GIKRLLLFG----IIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYI 132 Query: 142 PRPMLASAVALNSIGMNIARSIGPAVGGLIVAQFGPPWAFLLN 184 P+ A L + + +GPA+GG+I W++LL Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIH--WSYLLL 173
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 35.5 bits (82), Expect = 6e-04 Identities = 25/84 (29%), Positives = 34/84 (40%), Gaps = 9/84 (10%) Query: 32 SSAPGKSPMVDLVVRNARITTLDPRQPTATAIAVADGRIVAVGD-------DAKIMALAR 84 S + VD V+ NA I LD I + DGRI A+G + + Sbjct: 59 SQVTREGGAVDTVITNALI--LDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGP 116 Query: 85 GVRAIDAQGRRLLPGLNDSHTHLI 108 G I +G+ + G DSH H I Sbjct: 117 GTEVIAGEGKIVTAGGMDSHIHFI 140
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 43.1 bits (101), Expect = 3e-07 Identities = 27/139 (19%), Positives = 56/139 (40%), Gaps = 5/139 (3%) Query: 53 KGFKVPVILTTVAEKSFSGPLFPELPDIFPGEPVFDRTSMNAWEDQGVIDRVNALGKQRL 112 F P + + E+ L PE + V + +A++ +++ + G+ +L Sbjct: 92 TDFWGPGLNSGPYEEKIITELAPE-----DDDLVLTKWRYSAFKRTNLLEMMRKEGRDQL 146 Query: 113 VIAGLWTSVCIVGPTLSAIEQGFQVYVITDACGDVSDEAHERAVTRMVQAGAAPMTSVQY 172 +I G++ + + A + + + + DA D S E H+ A+ A + + Sbjct: 147 IITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVMTDSL 206 Query: 173 LLELQRDWARSDTYALTTG 191 L +LQ A + TG Sbjct: 207 LDQLQNAPADVQKTSANTG 225
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 72.6 bits (178), Expect = 4e-17 Identities = 33/132 (25%), Positives = 59/132 (44%), Gaps = 5/132 (3%) Query: 8 TEVLVVDDHPLLRDGLSAMLAAE-HDMRVVGEAEDGEQAVACYTRLRPDVVLMDLQMPRV 66 +LV DD +R L+ L+ +D+R+ A + +A D+V+ D+ MP Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA---AGDGDLVVTDVVMPDE 60 Query: 67 DGVQAIQRIRQVDSAAKVIVLTTYTGDVRAVRALQAGACGYLLKSALRRELVDTI-RDVR 125 + + RI++ V+V++ + A++A + GA YL K EL+ I R + Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 126 RGQRRHVPASVA 137 +RR Sbjct: 121 EPKRRPSKLEDD 132
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 137 bits (345), Expect = 1e-41 Identities = 82/252 (32%), Positives = 123/252 (48%), Gaps = 10/252 (3%) Query: 4 RVALVTGGTGGIGTAICKRLADQGHRVASNFRNEEKARDWQQRMQAQGYEVALFRGDVAS 63 ++A +TG GIG A+ + LA QG +A+ N EK ++A+ F DV Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 64 SEHARALVEEVEASLGPIEVLVNNAGITRDTTFHRMSAEQWHEVINTNLNSVFNVTRPVI 123 S + +E +GPI++LVN AG+ R H +S E+W + N VFN +R V Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 124 EGMRKRGWGRVIQISSINGLKGQYGQANYAAAKAGMHGFTISLARENAAFGVTVNTVSPG 183 + M R G ++ + S + A YA++KA FT L E A + + N VSPG Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188 Query: 184 YVATDM--VMAVPEEVRAKIVA--------DIPTGRLGRPEEIAYAVAFLVAEEAAWITG 233 TDM + E +++ IP +L +P +IA AV FLV+ +A IT Sbjct: 189 STETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITM 248 Query: 234 SNLDINGGHHMG 245 NL ++GG +G Sbjct: 249 HNLCVDGGATLG 260
>cloacin#Cloacin signature. Length = 551 Score = 27.4 bits (60), Expect = 0.037 Identities = 14/44 (31%), Positives = 15/44 (34%), Gaps = 1/44 (2%) Query: 143 GAGFGRPGGPG-TPPNPPGASGLGSGPMGTGTHGSAGGNHGTTG 185 G G G G G + N P G GSG G G G Sbjct: 28 GVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNS 71
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 30.4 bits (68), Expect = 0.022 Identities = 27/92 (29%), Positives = 41/92 (44%), Gaps = 3/92 (3%) Query: 322 LQDALAHTRAGVTPNSIGSEGATDMGGTFGGMGSLAGSGAPRDGSTSGSGNGTYGYASWT 381 ++ A+A+ + GV + G T + GT G GS G G + GS S S + A W+ Sbjct: 1 METAVAYYKDGVPYDDKGQVIITLLNGTPDGSGS--GGGGGKGGSKSESSAAIHATAKWS 58 Query: 382 PSQTPLGLRVDEARAAYSALYAAPPSSAQQSA 413 +Q ARA +A A + A + A Sbjct: 59 TAQLKKTQAEQAARAK-AAAEAQAKAKANRDA 89
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 33.0 bits (75), Expect = 0.002 Identities = 30/139 (21%), Positives = 42/139 (30%), Gaps = 1/139 (0%) Query: 139 SAVAAAAPTAVPAPRPLNAQAEAARATAALAASAQRASSVPPPQPS-TPPPAPPVPASAM 197 + VA T+V L A A+ T A + +V PP P P P Sbjct: 22 AVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEP 81 Query: 198 PTVTQAPVPTTVATGVPTPRPATSASAPAPTGVAGNAPNRASVTNANANANVASGAGVAG 257 P + P P+P V AS A A + S A Sbjct: 82 PKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAA 141 Query: 258 SSASAAAILNGGRAPMGAP 276 +S ++ +G RA Sbjct: 142 TSKPVTSVASGPRALSRNQ 160
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 75.3 bits (185), Expect = 5e-17 Identities = 29/194 (14%), Positives = 70/194 (36%), Gaps = 18/194 (9%) Query: 16 AMRPPS-IAKAVAWMLLIGIGIAAAILALAPWVQTASGKGQVVSLDPSDRQQPVTAFVPG 74 P S + VA+ ++ + IA + L A+ G++ S R + + Sbjct: 49 IETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLT---HSGRSKEIKPIENS 105 Query: 75 RVERWYVHDGQHVSKGDPIARVGDLDPDLLTRLASERAQAQAEIAAIQQSRAVASIDVAR 134 V+ V +G+ V KGD + ++ L + + Q+ A ++Q+R Sbjct: 106 IVKEIIVKEGESVRKGDVLLKLTALG----AEADTLKTQSSLLQARLEQTRYQILSRSIE 161 Query: 135 SRQLLAEGLAGRRDYELTQIKVAEADAKLAES-----RAKLTRIDIQLNRQSAQLVRAPR 189 +L L ++ + L + + + + ++ L+++ A+ Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER----- 216 Query: 190 DGRVQQLNAASGSA 203 + ++N + Sbjct: 217 LTVLARINRYENLS 230 Score = 73.7 bits (181), Expect = 2e-16 Identities = 32/175 (18%), Positives = 63/175 (36%), Gaps = 20/175 (11%) Query: 104 LTRLASERAQAQAEIAAIQQSRAVASIDVARSRQLLAEGLAGRRDYELTQIKVAEADAKL 163 +E ++++ I+ A + QL K+ + + Sbjct: 261 YVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF---------KNEILDKLRQTTDNI 311 Query: 164 AESRAKLTRIDIQLNRQSAQLVRAPRDGRVQQLNAASGSAMVSPGTVLAVIAPERVERAV 223 +L + + + ++RAP +VQQL + +V+ L VI PE V Sbjct: 312 GLLTLELAKNEERQQAS---VIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEV 368 Query: 224 ELYIDGRDVPLIRPGRPVRLEFEGWPAIQFSGWPSVAHGMFDGRVRAIDPNAAPD 278 + +D+ I G+ I+ +P +G G+V+ I+ +A D Sbjct: 369 TALVQNKDIGFINVGQNAI--------IKVEAFPYTRYGYLVGKVKNINLDAIED 415
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 34.0 bits (78), Expect = 0.001 Identities = 25/196 (12%), Positives = 61/196 (31%), Gaps = 24/196 (12%) Query: 287 RAVLTRIDQATARLMLAQNDLKPRLDVSVEVSKDLGPPGVGGPNRSLTDAIIGFRFSVPL 346 + R++Q +++ +L ++ + + V ++I +FS Sbjct: 142 SLLQARLEQTRYQILSRSIELNKLPELKL--PDEPYFQNVSEEEVLRLTSLIKEQFST-W 198 Query: 347 ENRAARGRV--AEARAEIEALDQRSRFLRDQISIEVESIVISLNAAERLAK--------I 396 +N+ + + + RAE + R + +E L+ L + Sbjct: 199 QNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSR----LDDFSSLLHKQAIAKHAV 254 Query: 397 ADEERGLAD---RLAAAERRRFELGSG----DFFLVNQREETANDARVRLIDAQARIASA 449 ++E + L + + ++ S + N+ +L I Sbjct: 255 LEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLL 314 Query: 450 RAELAAATADRDALQL 465 ELA + A + Sbjct: 315 TLELAKNEERQQASVI 330
>PF04183#IucA / IucC family Length = 580 Score = 32.2 bits (73), Expect = 0.002 Identities = 17/95 (17%), Positives = 29/95 (30%), Gaps = 11/95 (11%) Query: 106 SLANGEAFADWLEQTLPQAPQLRYCLDPVIGDTHTGPYVEPGLERVFAERLLPHAWLVTP 165 +A G + WL+Q L ++G+ G G A P+ + Sbjct: 291 YIAAGPLASRWLQQVFATDATLVQSGAVILGEPAAGYVSHEGYA---ALARAPYRY---- 343 Query: 166 NAFELG---RLTGLPSLQQGDAIVAARALLARGPQ 197 LG R L+ ++ V L+ Sbjct: 344 -QEMLGVIWRENPCRWLKPDESPVLMATLMECDEN 377
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 137 bits (348), Expect = 5e-38 Identities = 82/392 (20%), Positives = 145/392 (36%), Gaps = 91/392 (23%) Query: 5 IGIDLGTTNSCVSIMDGGKARVIENSEGDRTTPSIVAYTKDGE------VLVGASAKRQA 58 + IDLGT N+ + + G + E PS+VA +D VG AK+ Sbjct: 13 LSIDLGTANTLIYVKGQGIV-LNE--------PSVVAIRQDRAGSPKSVAAVGHDAKQML 63 Query: 59 VTNPKNTFYAVKRLIGRKFTDGEVQKDISHVPYGILAHDNGDAWVQTSDAKRMAPQEISA 118 P N A++ + KD + + +++ Sbjct: 64 GRTPGN-IAAIRPM-----------KDGVIADFFV-------------------TEKMLQ 92 Query: 119 RVLEKMKKTAEDFLGEKVTEAVITVPAYFNDSQRQATKDAGRIAGLDVKRIINEPTAAAL 178 ++++ + ++ VP +R+A +++ + AG +I EP AAA+ Sbjct: 93 HFIKQVHS---NSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAI 149 Query: 179 AYGLDKNGGDRKIAVYDLGGGTFDVSIIEIAEVDGEKQFEVLATNGDTFLGGEDFDNRVI 238 GL + V D+GGGT +V++I + V + +GG+ FD +I Sbjct: 150 GAGLPVS-EATGSMVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEAII 199 Query: 239 EYLVDEFNKDQGIDLRKDPLALQRLKDAAERAKIELSSS----QQTEVNLPYVTADASGP 294 Y+ + G + AER K E+ S+ + E+ + P Sbjct: 200 NYVRRNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVP 246 Query: 295 KHLNIKLTRAKLEALVE------DLVKKSIEPCRTALNDAGLRASDINE--VILVGGQTR 346 + + + LEAL E V ++E C L ASDI+E ++L GG Sbjct: 247 RGFTLN-SNEILEALQEPLTGIVSAVMVALEQCPPEL------ASDISERGMVLTGGGAL 299 Query: 347 MPKVQQAVADFFGKEPRKDVNPDEAVAVGAAI 378 + + + + + G +P VA G Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGK 331
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 92.0 bits (228), Expect = 2e-24 Identities = 66/216 (30%), Positives = 93/216 (43%), Gaps = 17/216 (7%) Query: 5 KIALVTGATRGIGLETVRQLAQAGVHTLLAGRKRDDAVAAALKLQAEGLPVEAIQLDVND 64 KIA +TGA +GIG R LA G H + L+AE EA DV D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 65 DISVAAAVGTVEQRHGHLDILINNAGIMLDDLQRTPSQQ-SLEVWKRTFDTNLFAVVEVT 123 ++ +E+ G +DIL+N AG+ L+ S E W+ TF N V + Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGV----LRPGLIHSLSDEEWEATFSVNSTGVFNAS 124 Query: 124 KAFLPLLRRSLAGRIVNVSSLLGSLTLHSQPGSPIYDFKIPAYNISKSALNSWTVHLAHE 183 ++ + +G IV V S + G P + AY SK+A +T L E Sbjct: 125 RSVSKYMMDRRSGSIVTVGS--------NPAGVP--RTSMAAYASSKAAAVMFTKCLGLE 174 Query: 184 LRDTAIKVNAVHPGSVKTDMNGGGELEVEQGAASSV 219 L + I+ N V PGS +TDM L ++ A V Sbjct: 175 LAEYNIRCNIVSPGSTETDMQ--WSLWADENGAEQV 208
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1050 bits (2716), Expect = 0.0 Identities = 434/1041 (41%), Positives = 642/1041 (61%), Gaps = 20/1041 (1%) Query: 4 SRFFIDRPIFAAVLSIIIFAAGLIAMPLLPISEYPEVVPPSVQVRAVYPGANPKVIAETV 63 + FFI RPIFA VL+II+ AG +A+ LP+++YP + PP+V V A YPGA+ + + +TV Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 64 ATPLEEAINGVENMMYMKSVAGSDGVLVVTVTFKPGTDPDQAQVQVQNRVSQAQARLPED 123 +E+ +NG++N+MYM S + S G + +T+TF+ GTDPD AQVQVQN++ A LP++ Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 124 VRRQGVTTQKQSPTLTMVVHLTSPKGKYNSLYLSNYATLKVKDELSRLPGVGQIQIFGAG 183 V++QG++ +K S + MV S +S+Y VKD LSRL GVG +Q+FG Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-A 180 Query: 184 DYAMRIWLNPDKVAARGLTASDVVAAIREQNVQVSAGQLGAEPMPNKSDFLLSINAQGRL 243 YAMRIWL+ D + LT DV+ ++ QN Q++AGQLG P SI AQ R Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 244 TTEEEFGNIVIRSGNSGEIVRLSDVARLELGAGNYTLRSQLDNQNAVGMGVFQSPGANAI 303 EEFG + +R + G +VRL DVAR+ELG NY + ++++ + A G+G+ + GANA+ Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 304 ELSDAVRAKMAELEKQFPQDMAWSAAYDPTVFVRDSISAVVHTLLEAVLLVVLVVILFLQ 363 + + A++AK+AEL+ FPQ M YD T FV+ SI VV TL EA++LV LV+ LFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 364 TWRASIIPLLAVPVSVVGTFAALYLLGFSINTLSLFGLVLAIGIVVDDAIVVVENVER-N 422 RA++IP +AVPV ++GTFA L G+SINTL++FG+VLAIG++VDDAIVVVENVER Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 423 IEEGLSPLAAAHQAMREVSGPIIAIALVLCAVFVPMAFLSGVTGQFYKQFAVTIAISTVI 482 +E+ L P A ++M ++ G ++ IA+VL AVF+PMAF G TG Y+QF++TI + + Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 483 SAINSLTLSPALAAMLLKLHDAPKDGPSRLIDRLFGWLFRPFNRFFNTSSHKYQGAVSRA 542 S + +L L+PAL A LLK FGW FN F+ S + Y +V + Sbjct: 481 SVLVALILTPALCATLLK---PVSAEHHENKGGFFGW----FNTTFDHSVNHYTNSVGKI 533 Query: 543 LGKRGAVFVVYLLLLVGTGFMFKLVPGGFIPTQDKLYLIAGTKLPEGSSLERTNEVIRQI 602 LG G ++Y L++ G +F +P F+P +D+ + +LP G++ ERT +V+ Q+ Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593 Query: 603 TQIALQT--DGVDHAIAFPGLNPLQFTNTPNTGTVFLTLKPFSQRSR---TAAQINAEIN 657 T L+ V+ G + N G F++LKP+ +R+ +A + Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFS--GQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAK 651 Query: 658 ARISQIQQGFAFAFMPPPILGLGQGSGYSLYIQDRAGLGYGQLQSAVNAMSGAISQTPG- 716 + +I+ GF F P I+ LG +G+ + D+AGLG+ L A N + G +Q P Sbjct: 652 MELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPAS 711 Query: 717 MQFPIGTYQANVPQLDAKVDRDKAKAQGVPLTNLFDTLQTYLGSSYINDFNRFGRTYQVI 776 + + Q +VD++KA+A GV L+++ T+ T LG +Y+NDF GR ++ Sbjct: 712 LVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLY 771 Query: 777 AQADGQFRDSVEDIANLRTRNANGDMVPIGSMVTLGQTYGPDPVIRYNGYPAADLIGEAD 836 QAD +FR ED+ L R+ANG+MVP + T YG + RYNG P+ ++ GEA Sbjct: 772 VQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAA 831 Query: 837 PRVLSSTEAMQKLSSMAPQVLPNGMNIEWTDLSYQQSTQGNSALIVFPMAVLLAFLVLAA 896 P SS +AM + ++A + LP G+ +WT +SYQ+ GN A + ++ ++ FL LAA Sbjct: 832 PGT-SSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAA 889 Query: 897 LYESWTLPLAVILIVPMTLLSALFGVWLTGGDNNVFVQVGLVVLMGLACKNAILIVEFAR 956 LYESW++P++V+L+VP+ ++ L L N+V+ VGL+ +GL+ KNAILIVEFA+ Sbjct: 890 LYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAK 949 Query: 957 EL-EMHGKGIVEAALEACRLRLRPIVMTSIAFIAGTVPLVFGHGAGAEVRSVTGITVFAG 1015 +L E GKG+VEA L A R+RLRPI+MTS+AFI G +PL +GAG+ ++ GI V G Sbjct: 950 DLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGG 1009 Query: 1016 MLGVTLFGLFLTPVFYVALRK 1036 M+ TL +F PVF+V +R+ Sbjct: 1010 MVSATLLAIFFVPVFFVVIRR 1030 Score = 101 bits (253), Expect = 8e-24 Identities = 60/321 (18%), Positives = 124/321 (38%), Gaps = 17/321 (5%) Query: 185 YAMRIWLNPDKVAARGLTASDVVAAIREQNVQVSAGQLGAEPMPNKS-DFLLSINAQGRL 243 ++ ++ +K A G++ SD+ I + + + + + +A+ R+ Sbjct: 724 AQFKLEVDQEKAQALGVSLSDINQTI---STALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 244 TTEEEFGNIVIRSGNSGEIVRLSDVARLELGAGNYTLRSQLDNQNAVGMGVFQSPGANAI 303 E+ + +RS +GE+V S G+ +L+ N + Q A Sbjct: 781 LPED-VDKLYVRS-ANGEMVPFSAFTTSHWVYGS----PRLERYNGLPSMEIQGEAAPGT 834 Query: 304 ELSDAVRAKMAELEKQFPQD--MAWSAAYDPTVFVRDSISAVVHTLLEAVLLVVLVVILF 361 DA A M L + P W+ R S + + + ++V L + Sbjct: 835 SSGDA-MALMENLASKLPAGIGYDWTGMSYQ---ERLSGNQAPALVAISFVVVFLCLAAL 890 Query: 362 LQTWRASIIPLLAVPVSVVGTFAALYLLGFSINTLSLFGLVLAIGIVVDDAIVVVENV-E 420 ++W + +L VP+ +VG A L + + GL+ IG+ +AI++VE + Sbjct: 891 YESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKD 950 Query: 421 RNIEEGLSPLAAAHQAMREVSGPIIAIALVLCAVFVPMAFLSGVTGQFYKQFAVTIAIST 480 +EG + A A+R PI+ +L +P+A +G + + Sbjct: 951 LMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGM 1010 Query: 481 VISAINSLTLSPALAAMLLKL 501 V + + ++ P ++ + Sbjct: 1011 VSATLLAIFFVPVFFVVIRRC 1031 Score = 89.9 bits (223), Expect = 3e-20 Identities = 90/514 (17%), Positives = 182/514 (35%), Gaps = 41/514 (7%) Query: 548 AVFVVYLLLLVGTGFMFKLVPGGFIPTQDKLYLIAGTKLPEGSSLERTNEVIRQITQIAL 607 +V+ ++L++ +P PT + P + + V + I Q Sbjct: 11 FAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQVIEQNMN 70 Query: 608 QTDGVDHAIAFPGLNPLQFTNTPNTGTVFLTLKPFSQRSRTAAQINAEINARISQIQQGF 667 D + + + +++ + T+ LT + + Q+ ++ + Q Sbjct: 71 GIDNLMYMSST--------SDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE- 121 Query: 668 AFAFMPPPILGLGQGSG----YSLYIQDRAGLGYGQLQS-AVNAMSGAISQTPGMQFPIG 722 + + + + S + ++ D G + + + +S+ G +G Sbjct: 122 ----VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNG----VG 173 Query: 723 TYQANVPQLDAKV--DRDKAKAQGVPLTNLFDTLQTYL----GSSYINDFNRFGRTYQVI 776 Q Q ++ D D + ++ + L+ G+ Sbjct: 174 DVQLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNAS 233 Query: 777 AQADGQFRDSVEDIANLRTR-NANGDMVPIGSMVTLGQTYGPDPVI-RYNGYPAADLI-- 832 A +F+ + E+ + R N++G +V + + + VI R NG PAA L Sbjct: 234 IIAQTRFK-NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIK 292 Query: 833 ---GEADPRVLSSTEAMQKLSSMAPQVLPNGMNIEWT-DLSYQQSTQGNSALIVFPMAVL 888 G + KL+ + P P GM + + D + + + A++ Sbjct: 293 LATGANALDT--AKAIKAKLAELQP-FFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIM 349 Query: 889 LAFLVLAALYESWTLPLAVILIVPMTLLSALFGVWLTGGDNNVFVQVGLVVLMGLACKNA 948 L FLV+ ++ L + VP+ LL + G N G+V+ +GL +A Sbjct: 350 LVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDA 409 Query: 949 ILIVE-FARELEMHGKGIVEAALEACRLRLRPIVMTSIAFIAGTVPLVFGHGAGAEVRSV 1007 I++VE R + EA ++ +V ++ A +P+ F G+ + Sbjct: 410 IVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQ 469 Query: 1008 TGITVFAGMLGVTLFGLFLTPVFYVALRKWVTRR 1041 IT+ + M L L LTP L K V+ Sbjct: 470 FSITIVSAMALSVLVALILTPALCATLLKPVSAE 503
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 43.3 bits (102), Expect = 1e-06 Identities = 29/186 (15%), Positives = 61/186 (32%), Gaps = 44/186 (23%) Query: 8 FRFPLRTVLAGAVLAVVLAGCGSKAAETGAPPPPSVSVAPVLMKQISQWDEFSGRIEPV- 66 R ++ V+A +L+ G VA +G++ Sbjct: 57 PRLVAYFIMGFLVIAFILSVLG-----------QVEIVATA-----------NGKLTHSG 94 Query: 67 ESVELRPRVSGYIDKVNYTEGAEVKKGDVLFTIDERSYRAEFARANASLVRARTQA---- 122 S E++P + + ++ EG V+KGDVL + A+ + +SL++AR + Sbjct: 95 RSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154 Query: 123 -----------------TLARSEAARARKLSEQQAISTETWEQRRAAADQADADLQAAQA 165 + ++ ++ E + + Q + +L +A Sbjct: 155 ILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRA 214 Query: 166 AVDTAR 171 T Sbjct: 215 ERLTVL 220 Score = 38.3 bits (89), Expect = 4e-05 Identities = 18/102 (17%), Positives = 37/102 (36%), Gaps = 7/102 (6%) Query: 104 YRAEFARANASLVRARTQATLARSEAARARKLSEQ--QAISTETWEQRRAAADQADADLQ 161 ++ A L ++Q SE A++ + Q E ++ R Q ++ Sbjct: 257 QENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLR----QTTDNIG 312 Query: 162 AAQAAVDTARLNLDWTRVRAPIDGRAGRAMV-TAGNLVTAGD 202 + + +RAP+ + + V T G +VT + Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAE 354 Score = 31.7 bits (72), Expect = 0.006 Identities = 12/73 (16%), Positives = 29/73 (39%) Query: 99 IDERSYRAEFARANASLVRARTQATLARSEAARARKLSEQQAISTETWEQRRAAADQADA 158 ++ RAE A + R + + +S L +QAI+ ++ +A Sbjct: 207 LNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVN 266 Query: 159 DLQAAQAAVDTAR 171 +L+ ++ ++ Sbjct: 267 ELRVYKSQLEQIE 279
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 114 bits (287), Expect = 2e-32 Identities = 50/170 (29%), Positives = 78/170 (45%), Gaps = 22/170 (12%) Query: 68 ERRQHAMVGAGIGALSGAAIGQYQDRQERALRERTANTGIEVQRQGDNISLNLPDGITFD 127 R + M+ G+ G + + EVQ + L + F+ Sbjct: 176 TRPDNGMLSLGVSYRFGQG-------EAAPVVAPAPAPAPEVQTK----HFTLKSDVLFN 224 Query: 128 FGKSALKPQFYTALNGVASTLREYN--QTMVEVVGHTDSVGSDAVNQRLSEERASAVAQY 185 F K+ LKP+ AL+ + S L + V V+G+TD +GSDA NQ LSE RA +V Y Sbjct: 225 FNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDY 284 Query: 186 LTAQGVQRERMETMGAGKRYPIADNTTDAGR---------AKNRRVEIRL 226 L ++G+ +++ G G+ P+ NT D + A +RRVEI + Sbjct: 285 LISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 35.2 bits (80), Expect = 9e-04 Identities = 37/195 (18%), Positives = 63/195 (32%), Gaps = 13/195 (6%) Query: 391 VMSGGGSSRVDYTINGGNAVPGITPTTWPGPVIIHPSSPLQALRAALPNVQIDYVDGKDR 450 + G++ + I+ AV G + P + + +S + R A D R Sbjct: 268 IQVAQGAASLAQAISDAIAVLGRVLASAPSVMAVGFASLTYSSRTA--EQWQDQTPDSVR 325 Query: 451 NAAARVAKAADVAIVFATQW-----AAESVDLPDMRLPDNQDALIDAVA-KANPKTAVVL 504 A AA + + + A+ +VDLP MRL + ++ + +V Sbjct: 326 YALG--MDAAKLGLPPSVNLNAVAKASGTVDLP-MRLTNEARGNTTTLSVVSTDGVSVPK 382 Query: 505 ETNGPVRMPWAERVPAVLQAWYPGIGGGEAIANLLTGAVNPSGHLPVTWPVDESQLPRPS 564 PVRM + + P L +P G+ + P P Sbjct: 383 AV--PVRMAAYNATTGLYEVTVPSTTAEAPPLILTWTPASPPGNQNPSSTTPVVPKPVPV 440 Query: 565 IPGLGFKPAKPGEDT 579 G P K +T Sbjct: 441 YEGATLTPVKATPET 455
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 116 bits (293), Expect = 2e-30 Identities = 95/402 (23%), Positives = 163/402 (40%), Gaps = 30/402 (7%) Query: 33 LAMASFMQVLDTTIANVSLPTIAGNLGASSQQATWVITSFAVSTAIALPLTGWLSRRFGE 92 L + SF VL+ + NVSLP IA + WV T+F ++ +I + G LS + G Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78 Query: 93 TKLFVWSTLAFTIASLLCGLAQSM-GMLVVARALQGFVAGPMYPITQSLLVSIY-PREKR 150 +L ++ + S++ + S +L++AR +QG +P ++V+ Y P+E R Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQG-AGAAAFPALVMVVVARYIPKENR 137 Query: 151 GQALALLAMITVVAPIAGPILGGWITDNYSWEWIFLINVPLGIIASSIVGSQLRH--RPE 208 G+A L+ I + GP +GG I W +L+ +P + + I L + E Sbjct: 138 GKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIP---MITIITVPFLMKLLKKE 192 Query: 209 QLEKPRMDYIGLILLVVGVGALQLVLDLGNDEDWFSSDKIVVLACIAAVALVVFVIWELT 268 K D G+IL+ VG+ L F++ + ++ ++ ++FV Sbjct: 193 VRIKGHFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRK 242 Query: 269 DKDPIVDLKLFRHRNFRAGTLAMVVAYAAFFSVSLLIPQWLQRDMGYTAIWAGLATAPIG 328 DP VD L ++ F G L + + ++P ++ + G G Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302 Query: 329 ILPVLMT-PFVGKYALRFDLRMLATIAFIFMS---FTSFFRSNFNLQVDFGHVATIQLVM 384 + V++ G R + I F+S T+ F TI +V Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSW-----FMTIIIVF 357 Query: 385 GVGVALFFMPVLQ-ILLSDLDGREIAAGSGLATFLRTLGGSF 425 +G F V+ I+ S L +E AG L F L Sbjct: 358 VLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 76.0 bits (187), Expect = 3e-17 Identities = 49/295 (16%), Positives = 93/295 (31%), Gaps = 40/295 (13%) Query: 82 VERGQLLVQLDPADTEVALQQAEANLAKTVRQVRGLYRTVEGAQAELSAREVTLRSARSD 141 V R L++ + + Q E NL K + A ++ E R +S Sbjct: 184 VLRLTSLIKEQFSTWQNQKYQKELNLDKKRAER-------LTVLARINRYENLSRVEKSR 236 Query: 142 FARRKDLAATGAIS--------------NEELAHARDELAAAEAAVSGSRESLERNRAL- 186 L AI+ EL + +L E+ + ++E + L Sbjct: 237 LDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLF 296 Query: 187 ---VDDSAVANQPDVQTAAAQLRQAYLNHARTGVVAPVSGYVARRSAQ-VGQRVQPGSVL 242 + D ++ +L + + + APVS V + G V L Sbjct: 297 KNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL 356 Query: 243 MAVVPLEQV-WVEANFKETQLKHMRLGQEVELHSDLYGGGVDYT--GRIESLGLGTGSAF 299 M +VP + V A + + + +GQ + + + YT G + G Sbjct: 357 MVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAF----PYTRYGYLV----GK---V 405 Query: 300 SLLPAQNASGNWIKIVQRVPVRIAVDAKQLASNPLRIGLSMKVDVNLHDQQGSVL 354 + + +V V + I + + + + M V + SV+ Sbjct: 406 KNINLDAIEDQRLGLVFNVIISIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVI 460
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 32.5 bits (74), Expect = 0.004 Identities = 29/187 (15%), Positives = 63/187 (33%), Gaps = 20/187 (10%) Query: 72 AQLDALIAEGLQHSPSLAAADARLQQAQARIGSAQAERG--PSLSVSGGYTGLQLPESMV 129 +L AL AE + ARL+Q + +I S E P L + + E V Sbjct: 125 LKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEV 184 Query: 130 GEELGGSYGGSAQVVLDFRYGIDLWGGKRSAWEAAVDQAHAAEVDAQAARLNLSSAIAEG 189 L + W ++ E +D+ A + A + Sbjct: 185 ----------LRLTSL-IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVE 233 Query: 190 YAQLAYAWSLHDLANDELSRAQKTLELTRQRRSAGIDSELQVRQAQARVPAAQQQLQSAQ 249 ++L L + + LE + +++ ++R ++++ + ++ SA+ Sbjct: 234 KSRLD---DFSSLLHKQAIAKHAVLEQENKY----VEAVNELRVYKSQLEQIESEILSAK 286 Query: 250 QQIDEAR 256 ++ Sbjct: 287 EEYQLVT 293
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 62.1 bits (151), Expect = 9e-13 Identities = 89/400 (22%), Positives = 146/400 (36%), Gaps = 31/400 (7%) Query: 17 VLILLALAMGGFAIGISEFSTMGLMTQIAQGLQITEPQVGHVISAYALGVVVGAPLLAIL 76 ++IL +A+ IG+ GL+ + +T G +++ YAL AP+L L Sbjct: 8 IVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTA-HYGILLALYALMQFACAPVLGAL 66 Query: 77 GARWPRRTLLLMLMVFYALGNVASALAPSYHTMLLCRFIAGLPHGAYFGVASLVAASISP 136 R+ RR +LL+ + A+ A AP + + R +AG+ GA VA A I+ Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITD 125 Query: 137 PNQRATAVGRVLLGLSVALLVGNPLATWLGQIVSWRWAYASVSVIALGTVAAV-AILLPP 195 ++RA G + ++ G L +G +A+ AL + + L P Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAA---ALNGLNFLTGCFLLP 182 Query: 196 QPEEPRQQPLRELRAFNQPQVWLALAIGAVGFSGMFCVF------SYLAPTLTAVTGVTA 249 + + ++PLR P A G + + VF + L + G Sbjct: 183 ESHKGERRPLRREAL--NPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDR 240 Query: 250 ARIPLAMVAF--GVGGVLGSILGGWLFDR-MQFRAVPVLLLWSMVVMLT--FPLAALSDV 304 + G+L S+ + L+ M+ T LA + Sbjct: 241 FHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRG 300 Query: 305 WVFVSIVAVGTMGALA-PALQTRL-MDVAAEAQTLAAASNHAAFNTANALGPWLGG---- 358 W+ I+ + G + PALQ L V E Q S A + + +GP L Sbjct: 301 WMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360 Query: 359 --MAITAGWGWTSTGYVGAATALGGLLVYAAAVWQERHQQ 396 + GW W GAA L L +W Q+ Sbjct: 361 ASITTWNGWAWI----AGAALYLLCLPALRRGLWSGAGQR 396
>FRAGILYSIN#Fragilysin metallopeptidase (M10C) enterotoxin signature. Length = 405 Score = 27.7 bits (61), Expect = 0.009 Identities = 17/81 (20%), Positives = 32/81 (39%) Query: 19 GATSICAECGFEWSAGDAAADTTVVRDSNGNVLQAGDTVTVIKDLKVKGSSIPLKQGTVI 78 G ++ A C E + + D V + + D T + D+ G I LK Sbjct: 19 GTAALLAACSNEADSLTTSIDAPVTASIDLQSVSYTDLATQLNDVSDFGKMIILKDNGFN 78 Query: 79 RNIRLVEDDAEHIEGNSEKIK 99 R + + D I+ ++E ++ Sbjct: 79 RQVHVSMDKRTKIQLDNENVR 99
>INFPOTNTIATR#Macrophage infectivity potentiator signature. Length = 233 Score = 28.8 bits (64), Expect = 0.007 Identities = 19/64 (29%), Positives = 27/64 (42%), Gaps = 1/64 (1%) Query: 8 VATIHYTLSDDNGQVLDRSTPDTPLSYLHGAGNIVPGLEQALEGKQLGDTLTADVVPEQG 67 T+ YT + +G V D ST ++PG +AL+ G T V + Sbjct: 146 TVTVEYTGTLIDGTVFD-STEKAGKPATFQVSQVIPGWTEALQLMPAGSTWEVFVPADLA 204 Query: 68 YGPR 71 YGPR Sbjct: 205 YGPR 208
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 50.5 bits (121), Expect = 5e-09 Identities = 27/75 (36%), Positives = 37/75 (49%), Gaps = 5/75 (6%) Query: 203 AAIAAGFDNVDFLEEPAAAAMHYHVSHDSRHDTVVVDIGGGTTDIAHASVGGSAAPQVHR 262 +A AG V +EEP AAA+ + ++VVDIGGGTT++A S+ G R Sbjct: 129 SAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVR 188 Query: 263 AWGIARGGTDIDLAL 277 GG D A+ Sbjct: 189 I-----GGDRFDEAI 198
>PERTACTIN#Pertactin signature. Length = 922 Score = 27.8 bits (61), Expect = 0.004 Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 1/53 (1%) Query: 28 IAEHSPLPGDMRLEEAPVWTPAQSQLLREERLDDADWIVTIDQLNIALHTTAD 80 I S P D+ L WT A ++ + +D+A W++T + AL +D Sbjct: 401 IPGASSGPLDVALASQARWTGA-TRAVDSLSIDNATWVMTDNSNVGALRLASD 452
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 66.8 bits (163), Expect = 6e-15 Identities = 29/118 (24%), Positives = 44/118 (37%) Query: 11 PRLLLVEDDPISRGFLQAVLESLPATVDCADSLSSALDRARERRHDLWLIDVNLPDGTGS 70 +L+ +DD R L L V + ++ DL + DV +PD Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 71 GLLRALRLLHPDVPALAHTADATMSMQHSLQSDGFLEMLVKPLTSERLLQAVRRGLAR 128 LL ++ PD+P L +A T G + L KP L+ + R LA Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 31.7 bits (72), Expect = 0.004 Identities = 20/70 (28%), Positives = 34/70 (48%), Gaps = 10/70 (14%) Query: 62 LVDTPGLHREQKRAMNRVMNRAARGSLEGVDAAVLVIEAGRWDEEDT-LAFRVLSDASVP 120 ++DTPG H + + R SL +D A+L+I A + T + F L +P Sbjct: 72 IIDTPG-HMDFLAEVYR--------SLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122 Query: 121 VVLVVNKVDR 130 + +NK+D+ Sbjct: 123 TIFFINKIDQ 132
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 146 bits (371), Expect = 1e-39 Identities = 93/453 (20%), Positives = 181/453 (39%), Gaps = 85/453 (18%) Query: 3 NIRNFSIIAHVDHGKSTLADRIIQLCGG---LQAREMEAQVLDSNPIERERGITIKAQSV 59 I N ++AHVD GK+TL + ++ G L + + D+ +ER+RGITI+ Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 60 SLPYTAKDGQTYHLNFIDTPGHVDFSYEVSRSLAACEGALLVVDAAQGVEAQSVANCYTA 119 S + + +N IDTPGH+DF EV RSL+ +GA+L++ A GV+AQ+ + Sbjct: 62 SFQW-----ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHAL 116 Query: 120 VEQGLEVVPVLNK-----IDLP----------TADVDRAKA----------------EIE 148 + G+ + +NK IDL +A++ + + + Sbjct: 117 RKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWD 176 Query: 149 AVIG--------------IDAEDAVAV----------------SAKTGLNIDLVLEAIVH 178 VI ++A + SAK + ID ++E I + Sbjct: 177 TVIEGNDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITN 236 Query: 179 RIPPPKPRDTDKLQALIIDSWFDNYLGVVSLVRVMQGEIKPGSKILVMSTGRTHLVDKVG 238 + R +L + + ++ +R+ G + + + + + + Sbjct: 237 KFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKIKITEMYT 296 Query: 239 VFTPKRKELPALGAGEVGWINASIKDVHGAPVGDTLTLAGDPAPHALPGFQEMQPRVFAG 298 + ++ +GE+ + + + +GDT L + P + Sbjct: 297 SINGELCKIDKAYSGEIVILQNEFLKL-NSVLGDTKLLPQRERI------ENPLPLLQTT 349 Query: 299 LFPVDAEDYPDLREALDKLRLNDAALRFE--PESSEAMGFGFRCGFLGMLHMEIVQERLE 356 + P + L +AL ++ +D LR+ + E + FLG + ME+ L+ Sbjct: 350 VEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEII-----LSFLGKVQMEVTCALLQ 404 Query: 357 REYNLDLISTAPTVVY--EVLKTDGTIVNMDNP 387 +Y++++ PTV+Y LK ++++ P Sbjct: 405 EKYHVEIEIKEPTVIYMERPLKKAEYTIHIEVP 437 Score = 33.7 bits (77), Expect = 0.003 Identities = 21/103 (20%), Positives = 38/103 (36%), Gaps = 18/103 (17%) Query: 362 DLISTAPTVVYEVLKTDGTIVNMDNPAKLPQLNLVQEIREPIIRANVLTPEEYIGNIIKL 421 D AP V+ +VLK GT E+ EP + + P+EY+ Sbjct: 515 DFRMLAPIVLEQVLKKAGT-----------------ELLEPYLSFKIYAPQEYLSRAYTD 557 Query: 422 CEEKRGTQIGINYLGSQVQISYELPMAEVVLDFFDKLKSVSRG 464 + + ++V +S E+P + ++ L + G Sbjct: 558 APKYCANIVDTQLKNNEVILSGEIPARC-IQEYRSDLTFFTNG 599
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 72.7 bits (178), Expect = 4e-16 Identities = 33/163 (20%), Positives = 58/163 (35%), Gaps = 28/163 (17%) Query: 133 AGKSMGSGFIISADGYVLTNHHVVDGASEVTVKLTDRR-----------EFKA-KVVGSD 180 G + SG ++ +LTN HVVD L F A ++ Sbjct: 99 TGTFIASGVVV-GKDTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYS 157 Query: 181 EQYDVALLKIEA--------KGLPTVRLGDSNTLKPGQWVVAIGSPFGLDHSVTAGIVSA 232 + D+A++K + + + ++ + Q + G P V+ Sbjct: 158 GEGDLAIVKFSPNEQNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKP-------VAT 210 Query: 233 TGRSNPYADQRYVPFIQTDVAINQGNSGGPLLNTRGEVVGINS 275 S +Q D++ GNSG P+ N + EV+GI+ Sbjct: 211 MWESKGKITYLKGEAMQYDLSTTGGNSGSPVFNEKNEVIGIHW 253
>SURFACELAYER#Lactobacillus surface layer protein signature. Length = 439 Score = 31.6 bits (71), Expect = 0.004 Identities = 25/90 (27%), Positives = 30/90 (33%), Gaps = 18/90 (20%) Query: 170 AALAAAVPAAALASTRRGAATRNQQVARNAAARQQQAPTRMVAAAAPASTGAASAVAATP 229 AAL A P AA A A T N A N A+T A V TP Sbjct: 13 AALLAVAPIAATAMPVNAATTINADSAIN------------------ANTNAKYDVDVTP 54 Query: 230 SNPFTHPDTTLQARPWPRAALSGAGESSLN 259 S P +L+G+ +S N Sbjct: 55 SISAIAAVAKSDTMPAIPGSLTGSISASYN 84
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.1 bits (62), Expect = 0.025 Identities = 10/22 (45%), Positives = 14/22 (63%) Query: 25 VVALVGPSGAGKTTVLNAIAGL 46 V L G G GK+T++N + GL Sbjct: 598 SVVLEGTGGIGKSTLINTLVGL 619
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 64.6 bits (157), Expect = 6e-14 Identities = 20/79 (25%), Positives = 41/79 (51%), Gaps = 5/79 (6%) Query: 309 PPRYPPDAVAAGLAGFVELQIAVSATGAPEHIAIVRSTPTGVFDQTVLDAARHWRFTPAL 368 P+YP A A + G V+++ V+ G +++ I+ + P +F++ V +A R WR+ P Sbjct: 164 QPQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGK 223 Query: 369 EDGKAVASEVRVPVKFELD 387 + V + F+++ Sbjct: 224 PGSG-----IVVNILFKIN 237
>PF06580#Sensor histidine kinase Length = 349 Score = 30.6 bits (69), Expect = 0.011 Identities = 20/79 (25%), Positives = 36/79 (45%), Gaps = 20/79 (25%) Query: 348 SLLLRNLLENAVRY----TPPGGRILVS-THSAPSPTLVVEDSGPGIPEAARARVFHRFH 402 +L++ L+EN +++ P GG+IL+ T + TL VE++G + + Sbjct: 257 PMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK-------- 308 Query: 403 RELGTGVEGSGLGLSIVHD 421 E +G GL V + Sbjct: 309 -------ESTGTGLQNVRE 320
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 81.8 bits (202), Expect = 4e-20 Identities = 36/143 (25%), Positives = 58/143 (40%) Query: 2 RILLVEDDLSLGEGIRTALRRAAYAVDWVHDGVSALMALQEETMDLVILDLGLPRMDGIE 61 IL+ +DD ++ + AL RA Y V + + + DLV+ D+ +P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VIRTARARAVDTPILVLSARERAADRALGLDVGADDYLGKPFDTNELLARTRALLRRSAG 121 ++ + D P+LV+SA+ + GA DYL KPFD EL+ L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 RAQPAVQAGALRLDPAGMSVRWH 144 R + G S Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQ 147
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 58.5 bits (141), Expect = 4e-12 Identities = 40/187 (21%), Positives = 72/187 (38%), Gaps = 2/187 (1%) Query: 3 LHGKCVIVTGATGGIGSVLCAGLVEAGSTVVAVGRTEQTLQRLAAAHAPGRVVP--VVAD 60 + GK +TGA GIG + L G+ + AV + L+++ ++ AD Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 61 LASDSGRAVLLARTHEMRPAPSVLVLAHAQSHFGLLQDQDPADLAAVVHLNLTVPMLLVQ 120 + + + AR +LV GL+ + A +N T + Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 121 ALLPAFARQPEAAMVAVGSTFGSIGFAGFAGYSASKFGLRGLFEALAREHAGTSVRFQYL 180 ++ + ++V VGS + A Y++SK + L E A ++R + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 181 SPRATAT 187 SP +T T Sbjct: 186 SPGSTET 192
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 34.3 bits (78), Expect = 0.002 Identities = 30/191 (15%), Positives = 68/191 (35%), Gaps = 13/191 (6%) Query: 392 LYNLGNALARQGQYDAAIAAYDRALKQHPNQQDAIANRAAVDAARKRQQQNNKDGKGQSK 451 LYN Q I + P+ A VD A + Sbjct: 980 LYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTET 1039 Query: 452 DQKPSGQDGKGQQQAGQNQQDKQSGQDGQNQQDSKSQPSEAQPPQDSRSQDAQSKNGQGE 511 + S Q+ K ++ Q+ + + ++ + + Q + ++S +++K Q Sbjct: 1040 VAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSG-SETKETQTT 1098 Query: 512 QRKQDTPPQSADTKAQQQADEAQRRKMQQAMAQAGDKQ---------ADGSDKPEAAVAS 562 + K+ T + KA+ + ++ Q ++ + +Q KQ A+ + + + V Sbjct: 1099 ETKE-TATVEKEEKAKVETEKTQ--EVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNI 1155 Query: 563 ETPEQREQRQA 573 + P+ + A Sbjct: 1156 KEPQSQTNTTA 1166
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 27.5 bits (61), Expect = 0.025 Identities = 6/34 (17%), Positives = 16/34 (47%), Gaps = 3/34 (8%) Query: 25 GWWLVIAMVVLVVGSAFFWWWRRRQRQRRWLAAF 58 G W+++A++ + F R++++R Sbjct: 227 GPWMLLALLAGFMA---FRVMLRQEKRRVSFHRR 257
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 32.5 bits (74), Expect = 0.002 Identities = 38/158 (24%), Positives = 59/158 (37%), Gaps = 24/158 (15%) Query: 35 IVGQA----ALVERLLIALLADGHLLVEGAPGLAKTT---AIRALASRLEADFARVQ--- 84 +VG++ + L + D L++ G G K A+ R F + Sbjct: 139 LVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAA 198 Query: 85 FTPDLLPADLTG------TEIWRPQDSRFEFMPGPIFHPILLADEINRAPAKVQSALLEA 138 DL+ ++L G T RFE G L DEI P Q+ LL Sbjct: 199 IPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGT----LFLDEIGDMPMDAQTRLLRV 254 Query: 139 MGERQVT-VGRHTYALPQLFLVMATQNPIEQ---EGTF 172 + + + T VG T + +V AT ++Q +G F Sbjct: 255 LQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLF 292
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 222 bits (567), Expect = 7e-66 Identities = 94/430 (21%), Positives = 166/430 (38%), Gaps = 49/430 (11%) Query: 230 VPWDQALDIVLRAKGLDKRRDGGVVWVAPQPELAKFEQDKEDARIAIENREDLITDYVQ- 288 + W A D+V L+K + + + E+ N I ++ Sbjct: 199 LSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRIIAMIKQ 258 Query: 289 ---------------INYHNAAVIFKALTEAKGIGGGGGGQGGQGGQGGAGQQDNGFLSP 333 + Y A+ + + LT G Q + D + Sbjct: 259 LDRQQATQGNTKVIYLKYAKASDLVEVLT-----GISSTMQSEKQAAKPVAALDKNII-- 311 Query: 334 RGRLVADERTNTLMISDIPKKVAQMRELISHIDRPVDQVLIESRIVIATDTFARDLGARF 393 + A +TN L+++ P + + +I+ +D QVL+E+ I D +LG ++ Sbjct: 312 ---IKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQW 368 Query: 394 GITGATGRGILSGALESNVNFQNTAAQRANEIANTGTSTTLASHLFPSGLNVDLGASGFT 453 A + L + + ++ ++ L Sbjct: 369 ANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASAL------------------- 409 Query: 454 NSRAAGLAYTLLGSNFNLDIELSAMQEEGRGEVVSNPRIVTANQREGVIKQGREIGYVTI 513 S G+A N+ + L+A+ + ++++ P IVT + E G+E+ +T Sbjct: 410 -SSFNGIAAGFYQGNWAM--LLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466 Query: 514 SGGGAAGSAAQANVQFKEVLLELKVTPTITNDNRVFLNMNVKKDEVARFIILEGYGTVPE 573 S + + V+ K V ++LKV P I + V L + + VA Sbjct: 467 SQTTSGDNIFN-TVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAASSTSSDLGAT 525 Query: 574 INRREVNTAVLVGDGETVVIGGVYEFTDRESVSKVPFLGDIPFLGNLFKKRGRSKEKAEL 633 N R VN AVLVG GETVV+GG+ + + ++ KVP LGDIP +G LF+ + K L Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585 Query: 634 LVFVTPKVLR 643 ++F+ P V+R Sbjct: 586 MLFIRPTVIR 595 Score = 50.7 bits (121), Expect = 1e-08 Identities = 31/209 (14%), Positives = 75/209 (35%), Gaps = 30/209 (14%) Query: 175 AAAQIAARGYSGRPVTFNFQDVPVRTVLQLIAEESNLNIVASDTVQGNVTLR----LMNV 230 A + R + + +F+ ++ + +++ N ++ +V+G +T+R L Sbjct: 16 IFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDMLNEE 75 Query: 231 PWDQALDIVLRAKGLDK-RRDGGVVWVAPQPELAKFEQDKEDARIAIENREDLITDYVQI 289 + Q VL G + GV+ V + AK + A ++++T V + Sbjct: 76 QYYQFFLSVLDVYGFAVINMNNGVLKVVRSKD-AKTAAVPVASDAAPGIGDEVVTRVVPL 134 Query: 290 NYHNAAVIFKALTEAKGIGGGGGGQGGQGGQGGAGQQDNGFLSPRGRLVADERTNTLMIS 349 A + L + G G +V E +N L+++ Sbjct: 135 TNVAARDLAPLLRQLNDNAGV------------------------GSVVHYEPSNVLLMT 170 Query: 350 DIPKKVAQMRELISHIDRPVDQVLIESRI 378 + ++ ++ +D D+ ++ + Sbjct: 171 GRAAVIKRLLTIVERVDNAGDRSVVTVPL 199
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.1 bits (62), Expect = 0.039 Identities = 14/53 (26%), Positives = 17/53 (32%), Gaps = 1/53 (1%) Query: 199 GTAPGAVDPAAPGTAAPGAAPAGATPAAPAAAPAPATPPAAAPAPTQAAPAPA 251 GTA + + TAA G A G P + T P P P Sbjct: 378 GTARALLADVSSPTAAAGGAGGGEPPKKRDPSAGAGTDP-GGPGGGDDGEDPF 429
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 34.0 bits (78), Expect = 7e-04 Identities = 52/210 (24%), Positives = 82/210 (39%), Gaps = 45/210 (21%) Query: 153 RQSALELGGLTAKVMDVEAFAVENAFALVASELPVAADAVVALVDIGATMTTLSVLRSGR 212 R+SA G +++ E A A + + LPV+ +VDIG T ++V+ Sbjct: 127 RESAQGAGAREVFLIE-EPMA-----AAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNG 180 Query: 213 SLYSREQVFGGKQLTDEVM----RRYGL-----TYEEA----GLAKRQG----------- 248 +YS GG + + ++ R YG T E G A Sbjct: 181 VVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRN 240 Query: 249 ---GLPESYEV---EVLEPFKE---ATVQQISRLLQFF---YAGSEFNRVDCIVLAGGCA 296 G+P + + E+LE +E V + L+ A R +VL GG A Sbjct: 241 LAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISER--GMVLTGGGA 298 Query: 297 ALARLPEMVEEQLGVTTVVA-NPLAQMTLG 325 L L ++ E+ G+ VVA +PL + G Sbjct: 299 LLRNLDRLLMEETGIPVVVAEDPLTCVARG 328
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 251 bits (642), Expect = 6e-82 Identities = 156/398 (39%), Positives = 223/398 (56%), Gaps = 7/398 (1%) Query: 17 ALIFIFITVLIDVLSFGVIIPVLPDLVRHFTGGDYVVAAGWIGWFGFLFAAIQFVCSPLQ 76 LI I TV +D + G+I+PVLP L+R + V A G L+A +QF C+P+ Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAH--YGILLALYALMQFACAPVL 63 Query: 77 GALSDRFGRRPVILLSCLGLGLDFVLMAIAHSLPMLLLARIISGVCSASFSTANAYIADV 136 GALSDRFGRRPV+L+S G +D+ +MA A L +L + RI++G+ A+ + A AYIAD+ Sbjct: 64 GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADI 123 Query: 137 TPPDKRAGAFGMLGAAFGIGFVAGPLIGGWLGSIGLRWPFWFAAGLALLNVLYGWFVLPE 196 T D+RA FG + A FG G VAGP++GG +G PF+ AA L LN L G F+LPE Sbjct: 124 TDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183 Query: 197 SLPAERRTARLDWSHANPLGALKLLRRYPQVFGLASVVFLANLAHYVYPSTFVLFAGYQY 256 S ERR R + NPL + + R V L +V F+ L V + +V+F ++ Sbjct: 184 SHKGERRPLRREAL--NPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241 Query: 257 HWGPREVSWVLAGVGVCNIIVNALLVGRLVRRLGERRALLLGLGCGVIGFIIYGLADSGT 316 HW + LA G+ + + A++ G + RLGERRAL+LG+ G+I+ A G Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 317 AFLVGVPISALWAIAAPSAQALITREVGADAQGRVQGALTGLVSLAGIAGPLLFANVFAW 376 + + A I P+ QA+++R+V + QG++QG+L L SL I GPLLF ++A Sbjct: 302 MAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAA 361 Query: 377 FIGS--GAPLHLPGAPWLLAAVLLAAG-WGMAWKRAAR 411 I + G A +LL L G W A +RA R Sbjct: 362 SITTWNGWAWIAGAALYLLCLPALRRGLWSGAGQRADR 399
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 55.6 bits (134), Expect = 9e-11 Identities = 30/208 (14%), Positives = 71/208 (34%), Gaps = 17/208 (8%) Query: 83 YEIALEQARAALAERQATLTQLRREIARDRSLQDLVAAEDAEVRRSNVQKAQAAVATAQS 142 E +A L ++ L Q+ EI + LV +++ + Sbjct: 257 QENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTL 316 Query: 143 AVDLAQLNLDRTQVRSPAEGRVSDRTVR-VGDYVTAGCPVVAVL-DTGSFRVDGYFEETR 200 + + + +R+P +V V G VT ++ ++ + + V + Sbjct: 317 ELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKD 376 Query: 201 LQGVHPGQRVDVQLMGEPLTLHGHVQSIAAGIEDRYRSGSAGALPNVTPAFDWVRLAQRI 260 + ++ GQ +++ P T +G++ I + A+ + L + Sbjct: 377 IGFINVGQNAIIKVEAFPYTRYGYLVGKVKNI-------NLDAIEDQRLG-----LVFNV 424 Query: 261 PVRIVLDHVPA---HVQLIAGRTATVSI 285 + I + + ++ L +G T I Sbjct: 425 IISIEENCLSTGNKNIPLSSGMAVTAEI 452 Score = 47.5 bits (113), Expect = 3e-08 Identities = 25/168 (14%), Positives = 59/168 (35%), Gaps = 19/168 (11%) Query: 10 PALLTLAMVMVAALVLQHLWRYYMQAPWTRDAHVGADVV------QVAPDVSGLVESVAV 63 +A ++ LV+ + A + ++ P + +V+ + V Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVL--GQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIV 112 Query: 64 ADNQPVRRGQLLLVVDRARYEIALEQARAALAERQATLTQLRREIARD----RSLQDLVA 119 + + VR+G +LL + E + +++L QA L Q R +I L +L Sbjct: 113 KEGESVRKGDVLLKLTALGAEADTLKTQSSLL--QARLEQTRYQILSRSIELNKLPELKL 170 Query: 120 AEDAEVRRSNVQKAQAAVATAQSAVD-----LAQLNLDRTQVRSPAEG 162 ++ + + ++ + + Q L+ + R+ Sbjct: 171 PDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLT 218
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 33.6 bits (77), Expect = 0.001 Identities = 16/117 (13%), Positives = 36/117 (30%), Gaps = 5/117 (4%) Query: 357 TLPSSGARARVRATEAGADAALAQFDNTVLQA-LREVQTTLSRYAQDLDRLHLLEQA-QQ 414 LP V E +L + + Q + + L + + + + Sbjct: 169 KLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYEN 228 Query: 415 QAELASSQN---RRLYQGGRTPYLSSLDAERTLASADMTLANAQAQVSQDQIQLFLA 468 + + S+ L + L+ E A L ++Q+ Q + ++ A Sbjct: 229 LSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSA 285
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 33.3 bits (76), Expect = 0.002 Identities = 22/85 (25%), Positives = 42/85 (49%), Gaps = 12/85 (14%) Query: 69 AIFAMTFLMRPIGAWYFGRFADRYGRRLALTISVSVMALCSFVIAITPTVATIGIAAPII 128 A++A LM+ A G +DR+GRR L +S++ A+ ++A P + + Sbjct: 50 ALYA---LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLW--------V 98 Query: 129 LLVARLLQGFATGGEYGTSATYMSE 153 L + R++ G TG + Y+++ Sbjct: 99 LYIGRIVAGI-TGATGAVAGAYIAD 122
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 81.6 bits (201), Expect = 2e-20 Identities = 50/193 (25%), Positives = 84/193 (43%), Gaps = 9/193 (4%) Query: 3 KTWLITGASSGFGRLLAETVLARGDRIVATVRTPQALA------DLQARYGDAATVLQLD 56 K ITGA+ G G +A T+ ++G I A P+ L +AR+ +A D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA---FPAD 65 Query: 57 VRDFAAVHAAVAQAFAALGRIDVVVSNAGYGTLGAAEAATEAQVRAIIDTNLIGSIALIQ 116 VRD AA+ A+ +G ID++V+ AG G + ++ + A N G + Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 117 AVLPRLRQQGGGHVVQVSSEGGQIAYPGFSLYHASKWGIEGFVEAVQQEVAGFGIHFTLA 176 +V + + G +V V S + + Y +SK F + + E+A + I + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 177 EPGPARTNFGAAL 189 PG T+ +L Sbjct: 186 SPGSTETDMQWSL 198
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 27.3 bits (60), Expect = 0.037 Identities = 18/82 (21%), Positives = 30/82 (36%), Gaps = 8/82 (9%) Query: 89 ARALVEQWMDWQATELNTAWRYAFMATVRGSAAH--------ADAQAIAASVEQWNRHMA 140 AR L Q A + N +++++ A H D+ AI + R M Sbjct: 25 ARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAIDEITARIEREMG 84 Query: 141 ILDAQLQRGGPFVLGARFTLAD 162 +D + G G +L+D Sbjct: 85 PIDILVNVAGVLRPGLIHSLSD 106
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 28.3 bits (63), Expect = 0.009 Identities = 9/43 (20%), Positives = 20/43 (46%), Gaps = 3/43 (6%) Query: 71 GLIGVGLVVGIVASFL---PASIGNALSIPLALLAGMSANYAY 110 + LV ++ FL A++ +++P+ LL + A+ Sbjct: 344 LFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAF 386
>TYPE3OMOPROT#Type III secretion system outer membrane O protein family signature. Length = 303 Score = 30.0 bits (67), Expect = 0.029 Identities = 38/140 (27%), Positives = 48/140 (34%), Gaps = 22/140 (15%) Query: 245 VITEAVDACVRDGTSWDLELPLTSATGRRL---------WVHSTGSVEHVDGRKRLIGAV 295 ++ + C R G LE P RL W+ +EHV L GA Sbjct: 14 LLAQTATECQRHGREATLEYPTRQGMWVRLSDAEKRWSAWIKPGDWLEHVS--PALAGAA 71 Query: 296 QDVTDRHRAVDALAASERKFRKMFQYSLGLICTHDMHGRLVSINPAAARSL--GRSVEQM 353 H V LAA+ER F L H RL NP +L G+ + M Sbjct: 72 VSAGAEHLVVPWLAATERPFE--------LPVPHLSCRRLCVENPVPGSALPEGKLLHIM 123 Query: 354 EGRSLVEFVR-PERHAALRG 372 R + F PE A G Sbjct: 124 SDRGGLWFEHLPELPAVGGG 143
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.4 bits (66), Expect = 0.030 Identities = 11/49 (22%), Positives = 21/49 (42%), Gaps = 3/49 (6%) Query: 308 PLQALLAQDRRCQLLKTLSVWFGAGMRMAPTAKALGIHRNTLDYRMQRI 356 +LA+ +L L+ A LG++RNTL +++ + Sbjct: 428 LYDRVLAEMEYPLILAALTA---TRGNQIKAADLLGLNRNTLRKKIREL 473
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 36.4 bits (84), Expect = 1e-04 Identities = 31/143 (21%), Positives = 56/143 (39%), Gaps = 28/143 (19%) Query: 79 ANAAALLILGTLAGSV-YPRATVMALPLLWLGSGLGAWLLGEPGSRH-------LGASGV 130 + L L L S P + ++ +PL +G L A L + + Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938 Query: 131 THGLMFLVFVLGLLR----------------RDRPAIATSMIAFLFYGGMLMTILPHEAG 174 + ++ + F L+ R RP + TS +AF+ G+L + + AG Sbjct: 939 KNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTS-LAFIL--GVLPLAISNGAG 995 Query: 175 VSWQSHLGGAV-AGLIAALLLRL 196 Q+ +G V G+++A LL + Sbjct: 996 SGAQNAVGIGVMGGMVSATLLAI 1018
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.038 Identities = 35/123 (28%), Positives = 47/123 (38%), Gaps = 24/123 (19%) Query: 220 VLIGPPNAGKSSLLNALAGSDRAIVTDV-AGTTRDTLREAIQLDGFELTLVDTAGLRDGG 278 VL G GKS+L+N L G D T GT +D+ + + +EL+ Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELS----------- 648 Query: 279 DAIEREGMRRARAELERADLALVVLDARDPQAARDAIGDAIDAVPRQLWI---HNKCDLL 335 E RRA AE A+ + R A G + PRQ+ I NK L Sbjct: 649 ---EMTAFRRADAE------AVKAFFSSRKDRYRGAYGRYVQDHPRQVVIWCTTNKRQYL 699 Query: 336 ADA 338 D Sbjct: 700 FDI 702
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 31.1 bits (70), Expect = 0.011 Identities = 20/94 (21%), Positives = 36/94 (38%), Gaps = 7/94 (7%) Query: 800 QRYADAAEQF-------AEALKLRPDFALAANNLGFVYYRQGRFAESARWLENTLKIDPS 852 Q Y A E F A ++ D +L F Y+ G++ ++ + + +D Sbjct: 9 QEYQLAMESFLKGGGTIAMLNEISSDTLEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHY 68 Query: 853 RAVAYLNLGDAYAKAGDRDKARKAYSTYLELQPQ 886 + +L LG G D A +YS + + Sbjct: 69 DSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIK 102
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 459 bits (1181), Expect = e-158 Identities = 207/572 (36%), Positives = 300/572 (52%), Gaps = 42/572 (7%) Query: 1 MNQTRVFLIFAWLMVAALLWMEWGKDKAAANAPVVAATQSVPAARDLDAAAPSAPNVPSA 60 M+ R L+ A L V+ ++W W +DK P A Q+ +A + Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKN----PQPQAQQTT-------QTTTTAAGSAAD 49 Query: 61 QAIPQAGALGTVPATSSTAATPAAAGAAPVVTLTSDVLRLKLD--GRSVLDAELLQFPQT 118 Q +P A+G ++++ +DVL L ++ G V A L +P+ Sbjct: 50 QGVP-------------------ASGQGKLISVKTDVLDLTINTRGGDVEQALLPAYPKE 90 Query: 119 KDGTAPVSLLTEDPAHPYNATSGWASEHSPVPGVGGFRA--EQPGTTFELAKGQNTLVVP 176 + T P LL P Y A SG P G R + LA+GQN L VP Sbjct: 91 LNSTQPFQLLETSPQFIYQAQSGLTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVP 150 Query: 177 FVWNGPDGVSIRRTFTLERGRYAISIKDEVINKSGAPWNGYVFRKLSR---VPTILSRGM 233 + G + +TF L+RG YA+++ V N P F +L + +P L G Sbjct: 151 MTYTDAAGNTFTKTFVLKRGDYAVNVNYNVQNAGEKPLEISSFGQLKQSITLPPHLDTGS 210 Query: 234 TNPDSFSFNGATWYSPQEGYERRAFKDYMDDGGLNRQITGGWVALLQHHFFTAWIPQKDQ 293 +N +F GA + +P E YE+ F D+ LN GGWVA+LQ +F TAWIP D Sbjct: 211 SNFALHTFRGAAYSTPDEKYEKYKFDTIADNENLNISSKGGWVAMLQQYFATAWIPHNDG 270 Query: 294 ASLYVLAQDGPRD-VAELRGPAFTVAPGQTASTEARLWVGPKLVSLIAKEDVKGLDRVVD 352 + + A G + V PGQT + + LWVGP++ + LD VD Sbjct: 271 TNNFYTANLGNGIAAIGYKSQPVLVQPGQTGAMNSTLWVGPEIQDKM-AAVAPHLDLTVD 329 Query: 353 YSRFSIMAIIGQGLFWVLSHLHSFLHNWGWAIIGLVVLLRLALYPLSAAQYKSGAKMRRF 412 Y I Q LF +L +HSF+ NWG++II + ++R +YPL+ AQY S AKMR Sbjct: 330 YGWLWF---ISQPLFKLLKWIHSFVGNWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRML 386 Query: 413 QPRLAQLKERYGDDRVKYQQATMELFKKEKINPMGGCLPLLIQMPIFFALYWVLVESVEL 472 QP++ ++ER GDD+ + Q M L+K EK+NP+GGC PLLIQMPIF ALY++L+ SVEL Sbjct: 387 QPKIQAMRERLGDDKQRISQEMMALYKAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVEL 446 Query: 473 RQAPWLGWIQDLTARDPYFILPLLNISIMWATQKLTPTPGMDPMQAKMMQFMPLVFGVMM 532 RQAP+ WI DL+A+DPY+ILP+L M+ QK++PT DPMQ K+M FMP++F V Sbjct: 447 RQAPFALWIHDLSAQDPYYILPILMGVTMFFIQKMSPTTVTDPMQQKIMTFMPVIFTVFF 506 Query: 533 AFMPAGLVLYWVVNGGLGLLIQWWMIRQHGEK 564 + P+GLVLY++V+ + ++ Q + R ++ Sbjct: 507 LWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKR 538