>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 68.3 bits (167), Expect = 1e-14 Identities = 80/339 (23%), Positives = 132/339 (38%), Gaps = 25/339 (7%) Query: 27 LPALPEITQQLQATSTQTQLSLTAALIGLGLGQLFFGP----LSDRIGRLKPLALSLLLF 82 +P LP + + L S L L Q P LSDR GR L +SL Sbjct: 25 MPVLPGLLRDLVH-SNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGA 83 Query: 83 IFASAMCALTRDINMLIVWRFLQGFAGAGGSVLSRSIARDKYQGTLLTQFFALLMTVNGI 142 A+ A + +L + R + G GA G+V IA D G + F + G Sbjct: 84 AVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITDGDERARHFGFMSACFGF 142 Query: 143 APVLSPVLGGYVITAFDWRILFWTMAAIGGVLLVMSLAILRETRPATAAHASRQRPGQPV 202 V PVLGG + F F+ AA+ G+ + +L E+ R+ Sbjct: 143 GMVAGPVLGGL-MGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNP-- 199 Query: 203 LKNRRFLRFCLIQAFMMA-----GLFSYIGSSSFVMQSE--YGMSAMQFSLLFGLNGI-G 254 L + R+ R + A +MA L + ++ +V+ E + A + GI Sbjct: 200 LASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILH 259 Query: 255 LIIAAMIFSRLARRFSAESLLRGGLTLAVSCAAIMLLFA---WLHLPVLALVGL--FFTV 309 + AMI +A R L G+ +A I+L FA W+ P++ L+ Sbjct: 260 SLAQAMITGPVAARLGERRALMLGM-IADGTGYILLAFATRGWMAFPIMVLLASGGIGMP 318 Query: 310 SLMSGISTVAGAEAMSAVDAAQSG--TASALMGTLMFVF 346 +L + +S E + + + + ++++G L+F Sbjct: 319 ALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1144 bits (2961), Expect = 0.0 Identities = 583/1031 (56%), Positives = 754/1031 (73%), Gaps = 6/1031 (0%) Query: 3 SRFFVRRPVFAWVIAILIMLAGVLAIRTLPVGQYPDVAPPAVKISATYTGASAETLENSV 62 + FF+RRP+FAWV+AI++M+AG LAI LPV QYP +APPAV +SA Y GA A+T++++V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 63 TQVIEQQLTGLDHLLYFSSTSSSDGSVSITVTFEQGTDPDTAQVQVQNKVQQAESRLPSE 122 TQVIEQ + G+D+L+Y SSTS S GSV+IT+TF+ GTDPD AQVQVQNK+Q A LP E Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 123 VQQSGVTVEKSQSSFLLILAVYDKTNRATSSDISDWLVSNMQDPLARVEGVGSLQVFGAE 182 VQQ G++VEKS SS+L++ T DISD++ SN++D L+R+ GVG +Q+FGA+ Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ 181 Query: 183 YAMRVWMDPTKLASYSLMPSDVQSAIEAQNVQVSAGKIGALPSSNAQQLTATVRAQSRLQ 242 YAMR+W+D L Y L P DV + ++ QN Q++AG++G P+ QQL A++ AQ+R + Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 243 TPDQFKAIIVKSQADGSVVRLSDVARVEMGSEDYTATANLNGHPAAGIAVMMAPGANALD 302 P++F + ++ +DGSVVRL DVARVE+G E+Y A +NG PAAG+ + +A GANALD Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 303 TATLVKSKIAEFQRQMPQGYDIAYPKDSTEFIKISVEDVIQTLFEAIILVVCVMYLFLQN 362 TA +K+K+AE Q PQG + YP D+T F+++S+ +V++TLFEAI+LV VMYLFLQN Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 363 FRATLIPAVAVPVVLLGTFGVLALFGYSINTLTLFAMVLAIGLLVDDAIVVVENVERIMR 422 RATLIP +AVPVVLLGTF +LA FGYSINTLT+F MVLAIGLLVDDAIVVVENVER+M Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 423 DEGLPAREATEKSMGEISGALVAIALVLSAVFLPMAFFGGSTGVIYRQFSVTIISAMMLS 482 ++ LP +EATEKSM +I GALV IA+VLSAVF+PMAFFGGSTG IYRQFS+TI+SAM LS Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 483 VVVALTLTPALCGALL----SHSKPHTKGFFGAFNRLWGRTEAGYQRRVLGGLRRGAVMM 538 V+VAL LTPALC LL + + GFFG FN + + Y V L + Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 539 GAYALICGAMALAMWKLPGSFLPVEDQGEIMVQYTLPAGATAVRTAEVRRQVTDWFLTKE 598 YALI M + +LP SFLP EDQG + LPAGAT RT +V QVTD++L E Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 599 KANTDVIFTVDGFSFSGSGQNAGMAFVSLKNWSQRKGDDNTAQAIALRATKELGTIRDAT 658 KAN + +FTV+GFSFSG QNAGMAFVSLK W +R GD+N+A+A+ RA ELG IRD Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 659 LFAMTPPSVDGLGQSNGFTFELMASGGTDRDSLMKLRSQLLAAANQS-SELQSVRANDLP 717 + P++ LG + GF FEL+ G D+L + R+QLL A Q + L SVR N L Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721 Query: 718 QMPQLQVDIDSNKAVSLGLSLSDVTDTLSSAWGGTYVNDFIDRGRVKKVYIQGESDARAV 777 Q ++++D KA +LG+SLSD+ T+S+A GGTYVNDFIDRGRVKK+Y+Q ++ R + Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781 Query: 778 PSDLGKWFVRGSDNSMTPFSAFATTHWQYGPESLVRYNGSAAFEIQGENAAGFSSGAAMD 837 P D+ K +VR ++ M PFSAF T+HW YG L RYNG + EIQGE A G SSG AM Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAMA 841 Query: 838 KMEKLADSLPAGSTWAWSGISLQEKLASGQAMSLYAISILVVFLCLAALYESWSVPFSVI 897 ME LA LPAG + W+G+S QE+L+ QA +L AIS +VVFLCLAALYESWS+P SV+ Sbjct: 842 LMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVM 901 Query: 898 MVIPLGLLGAALAATLRGLSNDVYFQVALLTTIGLSSKNAILIVEFAESAVD-EGYSLSR 956 +V+PLG++G LAATL NDVYF V LLTTIGLS+KNAILIVEFA+ ++ EG + Sbjct: 902 LVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVE 961 Query: 957 AAIRAAQTRLRPIVMTSLAFIAGVLPLAIATGAGANSRVAIGTGIIGGTLTATLLAVFFV 1016 A + A + RLRPI+MTSLAFI GVLPLAI+ GAG+ ++ A+G G++GG ++ATLLA+FFV Sbjct: 962 ATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFV 1021 Query: 1017 PLFFVLVKRLF 1027 P+FFV+++R F Sbjct: 1022 PVFFVVIRRCF 1032
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 42.5 bits (100), Expect = 2e-06 Identities = 18/70 (25%), Positives = 35/70 (50%), Gaps = 1/70 (1%) Query: 42 PVPVVSQLTGRTTAS-LSAEVRPQVGGIIQKRLFTEGDMVKAGQALYQIDPSSYRAAWNE 100 V +V+ G+ T S S E++P I+++ + EG+ V+ G L ++ A + Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138 Query: 101 AAAALKQAQA 110 ++L QA+ Sbjct: 139 TQSSLLQARL 148
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 36.5 bits (84), Expect = 1e-05 Identities = 20/72 (27%), Positives = 35/72 (48%), Gaps = 6/72 (8%) Query: 51 GLIAKRKGNW---LCIEYLWVSETTRGRGLGSELMQEAEQQAQAQGCSHLLVDTFSFQ-- 105 G I R NW IE + V++ R +G+G+ L+ +A + A+ L+++T Sbjct: 78 GRIKIRS-NWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINIS 136 Query: 106 ALPFYQKLGYQL 117 A FY K + + Sbjct: 137 ACHFYAKHHFII 148
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 69.7 bits (170), Expect = 7e-17 Identities = 32/175 (18%), Positives = 70/175 (40%), Gaps = 10/175 (5%) Query: 12 RPGRPRGKKPGTANREQLMDIALTLFARDGAGRVSLNAIAKEAGVTPAMLHYYFSSRDAL 71 R + ++ R+ ++D+AL LF++ G SL IAK AGVT ++++F + L Sbjct: 3 RKTKQEAQE----TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDL 58 Query: 72 VTQLIEERFMPLRNHISRIFVDHLQDPVL----ALTMMVETLAHMAEKNAWFAPLWM-QE 126 +++ E + DP+ L ++E+ + ++ E Sbjct: 59 FSEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCE 118 Query: 127 IIGEMPILRQHMDARFGEERFQVMLETVRRWQQEGKINPALAPELLFTTVISLVL 181 +GEM +++Q E + + +T++ + + L + + Sbjct: 119 FVGEMAVVQQ-AQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYIS 172
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 139 bits (351), Expect = 2e-38 Identities = 94/418 (22%), Positives = 179/418 (42%), Gaps = 19/418 (4%) Query: 20 LLLVMLLSALDQTIVSTALPTIVGELGGL-DKLSWVVTAYILSSTIAVPLYGKFGDLFGR 78 L ++ S L++ +++ +LP I + +WV TA++L+ +I +YGK D G Sbjct: 19 LCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGI 78 Query: 79 KIVLQVAIGLFLVGSALCGLAQNMTQLVLM-RGLQGLGGGGLMVISMAAVADVIPPANRG 137 K +L I + GS + + + L++M R +QG G + M VA IP NRG Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 138 RYQGLFGGVFGLATVIGPLIGGFLVQHASWRWIFYINLPLGLFALLVIGAVFHSSNKRSQ 197 + GL G + + +GP IGG + + W ++ +P+ + R + Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPMITIITVPFLMKLLKKEVRIK 196 Query: 198 HQIDWLGAIYLSMALLCIILFTSEGGSVHAWNDPQLWCILAFGIVGIIGFIYEERMAAEP 257 D G I +S+ ++ +LFT+ L ++ + F+ R +P Sbjct: 197 GHFDIKGIILMSVGIVFFMLFTTS----------YSISFLIVSVLSFLIFVKHIRKVTDP 246 Query: 258 IIPLALFRNRSFLLCSLIGFVIGMSLFGSVTFLPLYLQVVKEATPTEAGLQLI-PLMGGL 316 + L +N F++ L G +I ++ G V+ +P ++ V + + E G +I P + Sbjct: 247 FVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSV 306 Query: 317 LLTSIISGRIISRTGKYRLFPILGTLLGVTGMVLLTRITIHSPLWQLYLFTGVLGAGLGL 376 ++ I G ++ R G +G + + + + + + VLG GL Sbjct: 307 IIFGYIGGILVDRRGP-LYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSF 364 Query: 377 VMQVLVLAVQNAMPAQMYGVATSGVTLFRSIGGSIGVALFGAVFTHVLQSNLQQLLPE 434 V+ V +++ Q G S + + G+A+ G + + + Q+LLP Sbjct: 365 TKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS--IPLLDQRLLPM 420
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 60.2 bits (146), Expect = 3e-12 Identities = 50/353 (14%), Positives = 123/353 (34%), Gaps = 89/353 (25%) Query: 47 IERILINKGDNVAAGQELVKIESFDA-------QNIFLRAEEKLSAESALLRNLESGERP 99 ++ I++ +G++V G L+K+ + A Q+ L+A + + L R++E + P Sbjct: 107 VKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLP 166 Query: 100 E-----------------------------------------------ELDIIRSQIKKA 112 E E + ++I + Sbjct: 167 ELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRY 226 Query: 113 QSAESQVKRQLGRYRNLYANHAISLAEWEDIRDELTQKGAQVEEL---INQLKARQLPAR 169 ++ K +L + +L AI+ + ++ + ++ + Q+++ L A+ Sbjct: 227 ENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAK 286 Query: 170 Q--------------DEISKQRSMVAAAKLERDKALWDVQQTTIVSPVNAKVFDI-IYRA 214 + D++ + + LE K Q + I +PV+ KV + ++ Sbjct: 287 EEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTE 346 Query: 215 GERPSAGKPIISLLPPEN-IKVRFFIPEAKLGKFKIGSKVKLICDGCAEP------IAGV 267 G + + ++ ++P ++ ++V + +G +G + + A P + G Sbjct: 347 GGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVE--AFPYTRYGYLVGK 404 Query: 268 INYISPEA---EFTPPVIYSTKRREKLIFMAEAIPALQQAGRMKIGQPFDVEI 317 + I+ +A + V E+ + + + G EI Sbjct: 405 VKNINLDAIEDQRLGLVFNVIISIEE-----NCLSTGNKNIPLSSGMAVTAEI 452
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 44.5 bits (105), Expect = 2e-07 Identities = 35/180 (19%), Positives = 67/180 (37%), Gaps = 6/180 (3%) Query: 190 IMGSILSTTLILMTALSITRERENGALENLLVSPLSGLEVIIGKITPFVIIGLFQATLIL 249 + S ++ + R E +L + L ++++G++ I Sbjct: 74 VATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIG 133 Query: 250 IAAVLLFDIPLHGSVFLLFFVLLIYVFLCLSIGIGISGLAQNQLQALQMSSFYFIPSLML 309 + A L S+ V+ + S+G+ ++ LA + + + P L L Sbjct: 134 VVAAALGYTQWL-SLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFL 192 Query: 310 SGFVSPFISMPDWAKAIGSCLPLTYFIRLVKGIMLKGYSATALLPDLLPLIGLAVIVIGV 369 SG V P +P + LPL++ I L++ IML D+ +G I I + Sbjct: 193 SGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPV-----VDVCQHVGALCIYIVI 247
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 51.9 bits (124), Expect = 1e-10 Identities = 38/185 (20%), Positives = 72/185 (38%), Gaps = 15/185 (8%) Query: 1 MAEK-QTAKRNRREEILQSLALMLESSDGSQRITTAKLAASVGVSEAALYRHFPSKTRMF 59 MA K + + R+ IL AL L S G + ++A + GV+ A+Y HF K+ +F Sbjct: 1 MARKTKQEAQETRQHILDV-ALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLF 59 Query: 60 DSLIEFIEDSLITRIN-LILKDEKDTTARLRLIVLLILGFGERNPGLTRILT-------G 111 + E E ++ K D + LR I++ +L ++ Sbjct: 60 SEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119 Query: 112 HALMFEQDRLQGRIN-QLFERIEAQLRQVMREKKMREGEGYTLDETLLASQLLAFCEGML 170 M + Q + + ++RIE L+ + K + L A + + G++ Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPAD----LMTRRAAIIMRGYISGLM 175 Query: 171 SRFVR 175 ++ Sbjct: 176 ENWLF 180
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 30.1 bits (68), Expect = 0.017 Identities = 18/55 (32%), Positives = 22/55 (40%), Gaps = 15/55 (27%) Query: 50 GHIELGKWADLVILAPA----------TADLIARVAAGMANDLVSTICLATPSPV 94 G +E+GK ADLV+ PA IA G N + TP PV Sbjct: 424 GSLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNA-----SIPTPQPV 473
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 29.9 bits (67), Expect = 0.027 Identities = 13/60 (21%), Positives = 21/60 (35%) Query: 277 GYANLNSGNTAAAKQQFEEVLQTNPQDADALAGMGYIAQRSGDYQAASQYLSRAADLGGD 336 + SG A + F+ + + D+ G+G Q G Y A S A + Sbjct: 43 AFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSYGAIMDIK 102
>PF08280#M protein trans-acting positive regulator Length = 530 Score = 26.7 bits (59), Expect = 0.047 Identities = 19/109 (17%), Positives = 36/109 (33%), Gaps = 12/109 (11%) Query: 49 LETPRWQAILARHETYFPHINPHRPRPLDPLRYL-------LQSLWLLTTRVPEPEKKVN 101 E ++ +L T P++ + + L + LQ T P K N Sbjct: 320 EENDTFRLLLNPIITLLPNLKEQKASLVKALMFFSKSFLFNLQHFIPETNLFVSPYYKGN 379 Query: 102 WRSLAALEGVHGRYTQWLEKLPEQMNARTGHLDKQKELAHLNPKLRRAI 150 + +L+ + +W+ KLP + H ++ LR Sbjct: 380 QKLYTSLKLI---VEEWMAKLPGKRYLNHKHF--HLFCHYVEQILRNIQ 423
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 30.6 bits (69), Expect = 0.011 Identities = 20/97 (20%), Positives = 35/97 (36%), Gaps = 5/97 (5%) Query: 182 TFIPILANTFARRAVEIPVMHAEREFGDSKYSFMRLINLMYDLVTCLTTTPLRLLSIFGS 241 P L T + + FG +F +N + V + + R L I+ Sbjct: 487 ILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYAL 546 Query: 242 VIALLGFAFGLLLVVLRLAFGPQWAAEGVFMLFAVLF 278 ++A + F L +F P+ +GVF+ L Sbjct: 547 IVAGMVVLFLRLPS----SFLPE-EDQGVFLTMIQLP 578
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 108 bits (272), Expect = 5e-28 Identities = 73/361 (20%), Positives = 137/361 (37%), Gaps = 60/361 (16%) Query: 317 RVLILGVNGFIGNHLTERLLQDDNYEIYGLDIGSD--------AISRFLDCPRFHFVEGD 368 + L+ G GFIG H+++RLL + +++ G+D +D A L P F F + D Sbjct: 2 KYLVTGAAGFIGFHVSKRLL-EAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 369 ISIHSEWIE--YHIKKCDVVLPLVAIATPIEYT-RNPLRVFELDFEENLKIIRDCVKYN- 424 ++ E + + + V + Y+ NP + + L I+ C Sbjct: 61 LADR-EGMTDLFASGHFERVFISPHRLA-VRYSLENPHAYADSNLTGFLNILEGCRHNKI 118 Query: 425 KRIIFPSTSEVYGMCTDKNFDEDSSNLVVGPINKQRWIYSVSKQLLDRVIWAYGDKNGLK 484 + +++ S+S VYG+ F D S V P++ +Y+ +K+ + + Y GL Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDS--VDHPVS----LYAATKKANELMAHTYSHLYGLP 172 Query: 485 FTLFRPFNWMGPRLDNLNAARIGSSRAITQLILNLVEGSPIKLIEGGKQKRCFTDISDGI 544 T R F GP A+ + ++EG I + GK KR FT I D Sbjct: 173 ATGLRFFTVYGPWGR--------PDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIA 224 Query: 545 EALFRIIEN---------------KDGRCDGQIINIGNPDNEASIKELAEMLLACFERHP 589 EA+ R+ + ++ NIGN + + + L Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVE-LMDYIQALEDALGIEA 283 Query: 590 LRDRFPPFAGFREVESSDYYGKGYQDVEHRKPSIRNAKRCLNWEPKVEMEETVEHTLDFF 649 ++ P G DV + + + P+ +++ V++ ++++ Sbjct: 284 KKNMLPLQPG---------------DVLETSADTKALYEVIGFTPETTVKDGVKNFVNWY 328 Query: 650 L 650 Sbjct: 329 R 329
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 32.2 bits (73), Expect = 2e-04 Identities = 13/32 (40%), Positives = 18/32 (56%), Gaps = 1/32 (3%) Query: 35 RHILFWLGMALLCLGCGMLLW-LSVLQSIPVS 65 R ILF+L M L C M+ W + + + PVS Sbjct: 15 RRILFYLLMLLFCQQLAMIFWRIGLPDNAPVS 46
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 51.4 bits (123), Expect = 4e-09 Identities = 74/365 (20%), Positives = 133/365 (36%), Gaps = 30/365 (8%) Query: 13 LRLNLRIVSVVIFNFASYLTIGLPLAVLPGYVHDVM--GFSAFWAGLVISLQYFATLLSR 70 ++ N ++ ++ + IGL + VLPG + D++ G++++L Sbjct: 1 MKPNRPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACA 60 Query: 71 PHAGRYADLLGPKKIVVFGLGGCFLSGLSYLLATWGSGWPLISLLLLCLGRVILGI-GQS 129 P G +D G + +++ L + + Y + L +L +GR++ GI G + Sbjct: 61 PVLGALSDRFGRRPVLLVSL---AGAAVDYAIMATAP-----FLWVLYIGRIVAGITGAT 112 Query: 130 FAGTGSTLWGVGVVGSLHIGRVISWNGIVTYGAMAMGAPLGVLC--YSHIGLSGLAGVIM 187 A G+ + + R + M G LG L +S A + Sbjct: 113 GAVAGAYIADITDGDER--ARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALN 170 Query: 188 AVALVAILCALP-------RAAVKAAKGKAMSFR-AVLGRVWPYGMALA-LASAGFGVIA 238 + + LP R + A SFR A V MA+ + V A Sbjct: 171 GLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPA 230 Query: 239 TFITLFYDAK-GWDGAAFALTLFSCAFVGA---RLLFPNAINRLGGLNVAMLCFSVEAIG 294 +F + + WD ++L + + + ++ RLG ML + G Sbjct: 231 ALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTG 290 Query: 295 LLLVGFADTPMMAKIGTFLTGAGFSLVFPALGVVAVKAVPQHNQGSALATYTVFMDLSLG 354 +L+ FA MA L A + PAL + + V + QG + L+ Sbjct: 291 YILLAFATRGWMAFPIMVLL-ASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLT-S 348 Query: 355 VSGPL 359 + GPL Sbjct: 349 IVGPL 353 Score = 32.1 bits (73), Expect = 0.005 Identities = 41/155 (26%), Positives = 59/155 (38%), Gaps = 13/155 (8%) Query: 253 AAFALTLFSCAFVGARLLFPNAINRLGGLNVAMLCFSVEAIGLLLVGFADTPMMAKIGTF 312 A +AL F+CA V L +R G V ++ + A+ ++ A + IG Sbjct: 50 ALYALMQFACAPVLGAL-----SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRI 104 Query: 313 LTGAGFSLVFPALGVVAVKAVPQHNQGSALATYTVFMDLSLG---VSGPLAGLLMAWTGI 369 + G + A G VA + G A + FM G V+GP+ G LM Sbjct: 105 VAG-----ITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSP 159 Query: 370 SMIYLAAAGLVMAALLLGWRLKNGPRLANRRPAHQ 404 + AAA L L G L RRP + Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRR 194
>PF01206#SirA family protein Length = 76 Score = 103 bits (259), Expect = 3e-33 Identities = 27/71 (38%), Positives = 43/71 (60%) Query: 9 DHTLDALGLRCPEPVMMVRKTVRTMPVGETLLIIADDPATTRDIPGFCRFMEHELVAQET 68 D +LDA GL CP P++ +KT+ TM GE L ++A DP + +D F + HEL+ Q+ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 69 EALPYRYLIRK 79 E Y + +++ Sbjct: 65 EDGTYHFRLKR 75
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 51.2 bits (122), Expect = 9e-09 Identities = 43/186 (23%), Positives = 64/186 (34%), Gaps = 13/186 (6%) Query: 18 KEQAQETETEQKVEEQQAVAEEIPAVETPAEPSAPKADPEAFAEDVVEVTETVVESEKAH 77 QA EE V E A P P+ P E AE+ + ++TV ++E+ Sbjct: 1002 NIQADVPSVPSNNEEIARVDE---APVPPPAPATPSETTETVAENSKQESKTVEKNEQD- 1057 Query: 78 LAEPASAQEEEWVETPALTEETPVVEPEPAVSEPPEQPAVVEPLAEEVIAEPVVVEAVAE 137 A +AQ E E V+ +E + + + E VE + Sbjct: 1058 -ATETTAQNRE-----VAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEK 1111 Query: 138 QPVEGIVVQPQETEAPEEDAPLSDEELEAQALAAEAAEEAAVVVPAPEDEAPLEALAQEQ 197 VE + QE E+ E AE A E V E ++ A + Sbjct: 1112 AKVE--TEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTA-DT 1168 Query: 198 EKPTKE 203 E+P KE Sbjct: 1169 EQPAKE 1174 Score = 47.8 bits (113), Expect = 1e-07 Identities = 28/163 (17%), Positives = 46/163 (28%), Gaps = 9/163 (5%) Query: 17 QKEQAQETETEQKVEEQQAVAEEIPAVETPAEPSAPKADPEA-----FAEDVVEVTETVV 71 + +ET+T + E EE VET PK + +E V E Sbjct: 1088 SGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAR 1147 Query: 72 ESEKAHLAEPASAQEEEWVETPALTEETPVVEPEPAVSEPPE----QPAVVEPLAEEVIA 127 E++ + +Q +T +ET +P Sbjct: 1148 ENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATT 1207 Query: 128 EPVVVEAVAEQPVEGIVVQPQETEAPEEDAPLSDEELEAQALA 170 +P V + +P + E A S + AL Sbjct: 1208 QPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALC 1250 Score = 43.5 bits (102), Expect = 2e-06 Identities = 29/188 (15%), Positives = 58/188 (30%), Gaps = 14/188 (7%) Query: 17 QKEQAQETET-EQKVEEQQAVAEEIPAVETPAEPSAPKADPEAFAEDVVEVTETVVESEK 75 K++++ E EQ E A E+ + + + +V + E++ Sbjct: 1044 SKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTN------EVAQSGSETKETQT 1097 Query: 76 AHLAEPASAQEEEWVETPALTEETPVVEPEPAVSEPP--EQPAVVEPLAEEVIAEPVVVE 133 E A+ ++EE + TE+T P+ P EQ V+P AE E Sbjct: 1098 TETKETATVEKEE--KAKVETEKTQ-EVPKVTSQVSPKQEQSETVQPQAEPA-RENDPTV 1153 Query: 134 AVAEQPVEGIVVQPQETEAPEEDAPLSDEELEAQALAAEAAEEAAVVVPAPEDEAPLEAL 193 ++P + +E + ++ + Sbjct: 1154 N-IKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVN 1212 Query: 194 AQEQEKPT 201 ++ KP Sbjct: 1213 SESSNKPK 1220 Score = 36.6 bits (84), Expect = 3e-04 Identities = 21/171 (12%), Positives = 46/171 (26%), Gaps = 4/171 (2%) Query: 26 TEQKVEEQQAVAEEIPAVETPAEPSAPKADPEAFAEDVVEVTETVVESEKAHLAEPASAQ 85 + E ++ E T + K + E E ++ + E++ +P + Sbjct: 1086 AQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEP 1145 Query: 86 EEEWVETPALTEETPVVEPEPAVSEPPEQ--PAVVEPLAEEVIAEPVVVEAVAEQPVEGI 143 E T + E +P ++ V +P+ E + Sbjct: 1146 ARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA 1205 Query: 144 VVQPQETEAPEEDAPLSDEELEAQALAAEAAEEAAVVVPAPEDEAPLEALA 194 QP P + +++ E A A + + Sbjct: 1206 TTQPTVNSESSN-KPKNRHRRSVRSVPHN-VEPATTSSNDRSTVALCDLTS 1254
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 33.9 bits (77), Expect = 7e-04 Identities = 23/92 (25%), Positives = 35/92 (38%), Gaps = 2/92 (2%) Query: 156 AQGCEGKNVIIIGAGT-IGLLALQCARELGANSVTAIDINPQKLELAKTLGATHVFNSRE 214 A+G EGK I GA IG + GA+ + A+D NP+KLE + ++ Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAH-IAAVDYNPEKLEKVVSSLKAEARHAEA 61 Query: 215 MSGQAIQQALESIQFDQLVLETAGTPQTVALA 246 A ++ E V +A Sbjct: 62 FPADVRDSAAIDEITARIEREMGPIDILVNVA 93
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 29.9 bits (67), Expect = 0.026 Identities = 17/81 (20%), Positives = 31/81 (38%), Gaps = 6/81 (7%) Query: 160 ADPNSPQYNVIAATLMKVGQQAFSIMVPVFTAYIAWSISGRPGMVAGFVGGLLANATGAG 219 D + + A Q + +P+ TA + + G V+ + L++ Sbjct: 357 QDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSF---- 412 Query: 220 FLGGIIAGFAAGYFMLLIRHL 240 GI AGF G + +L+ L Sbjct: 413 --NGIAAGFYQGNWAMLLTAL 431
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 1081 bits (2798), Expect = 0.0 Identities = 412/566 (72%), Positives = 475/566 (83%), Gaps = 2/566 (0%) Query: 4 ISRQAYADMFGPTVGDKVRLADTELWIEVEDDLTTYGEEVKFGGGKVIRDGMGQGQML-A 62 +SR AYA+MFGPTVGDKVRLADTEL+IEVE D TT+GEEVKFGGGKVIRDGMGQ Q+ Sbjct: 5 MSRAAYANMFGPTVGDKVRLADTELFIEVEKDFTTHGEEVKFGGGKVIRDGMGQSQVTRE 64 Query: 63 ADCVDLVLTNALIVDHWGIVKADIGVKDGRIFAIGKAGNPDIQPNVTIPIGAATEVIAAE 122 VD V+TNALI+DHWGIVKADIG+KDGRI AIGKAGNPD+QP VTI +G TEVIA E Sbjct: 65 GGAVDTVITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGE 124 Query: 123 GKIVTAGGIDTHIHWICPQQAEEALVSGVTTMVGGGTGPAAGTHATTCTPGPWYISRMLQ 182 GKIVTAGG+D+HIH+ICPQQ EEAL+SG+T M+GGGTGPA GT ATTCTPGPW+I+RM++ Sbjct: 125 GKIVTAGGMDSHIHFICPQQIEEALMSGLTCMLGGGTGPAHGTLATTCTPGPWHIARMIE 184 Query: 183 AADSLPVNIGLLGKGNVSQPDALREQVAAGVIGLKIHEDWGATPAAIDCALTVADEMDIQ 242 AAD+ P+N+ GKGN S P AL E V G LK+HEDWG TPAAIDC L+VADE D+Q Sbjct: 185 AADAFPMNLAFAGKGNASLPGALVEMVLGGATSLKLHEDWGTTPAAIDCCLSVADEYDVQ 244 Query: 243 VALHSDTLNESGFVEDTLAAIGGRTIHTFHTEGAGGGHAPDIITACAHPNILPSSTNPTL 302 V +H+DTLNESGFVEDT+AAI GRTIH +HTEGAGGGHAPDII C PN++PSSTNPT Sbjct: 245 VMIHTDTLNESGFVEDTIAAIKGRTIHAYHTEGAGGGHAPDIIRICGQPNVIPSSTNPTR 304 Query: 303 PYTLNTIDEHLDMLMVCHHLDPDIAEDVAFAESRIRRETIAAEDVLHDLGAFSLTSSDSQ 362 PYT+NT+ EHLDMLMVCHHL P I ED+AFAESRIR+ETIAAED+LHD+GAFS+ SSDSQ Sbjct: 305 PYTVNTLAEHLDMLMVCHHLSPTIPEDIAFAESRIRKETIAAEDILHDIGAFSIISSDSQ 364 Query: 363 AMGRVGEVILRTWQVAHRMKVQRGALAEETGDNDNFRVKRYIAKYTINPALTHGIAHEVG 422 AMGRVGEV +RTWQ A +MK QRG L EETGDNDNFRVKRYIAKYTINPA+ HG++HE+G Sbjct: 365 AMGRVGEVAIRTWQTADKMKRQRGRLKEETGDNDNFRVKRYIAKYTINPAIAHGLSHEIG 424 Query: 423 SIEVGKLADLVVWSPAFFGVKPATVIKGGMIAIAPMGDINASIPTPQPVHYRPMFGALGS 482 S+EVGK ADLV+W+PAFFGVKP V+ GG IA APMGD NASIPTPQPVHYRPMFGA G Sbjct: 425 SLEVGKRADLVLWNPAFFGVKPDMVLLGGTIAAAPMGDPNASIPTPQPVHYRPMFGAYGR 484 Query: 483 ARHHCRLTFLSQAAAANGVAGRLNLRSAIAVVKGCR-TVQKADMVHNSLQPNITVDAQTY 541 +R + +TF+SQA+ G+AGRL + + V+ R + KA M+HNSL P+I VD +TY Sbjct: 485 SRTNSSVTFVSQASLDAGLAGRLGVAKELVAVQNTRGGIGKASMIHNSLTPHIEVDPETY 544 Query: 542 EVRVDGELITSEPADVLPMAQRYFLF 567 EVR DGEL+T EPA VLPMAQRYFLF Sbjct: 545 EVRADGELLTCEPATVLPMAQRYFLF 570
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 29.5 bits (66), Expect = 0.004 Identities = 19/65 (29%), Positives = 29/65 (44%), Gaps = 6/65 (9%) Query: 88 LAVDKSLHGKGVGRALVRDAGLRMIQVAETIGIRGMLVHALSDE--ARDFYLRVGFEPSP 145 +AV K KGVG AL+ A I+ A+ G+++ A FY + F Sbjct: 95 IAVAKDYRKKGVGTALLHKA----IEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150 Query: 146 MDPMM 150 +D M+ Sbjct: 151 VDTML 155
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 85.1 bits (210), Expect = 1e-21 Identities = 59/201 (29%), Positives = 93/201 (46%), Gaps = 8/201 (3%) Query: 1 MNAQ-IEGRVAVVTGGSSGIGFETLRLLLGEGAKVAFCGRNPDRLASAHAALQNE--YPE 57 MNA+ IEG++A +TG + GIG R L +GA +A NP++L ++L+ E + E Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60 Query: 58 GEVFSWRCDVLNEAEVEAFAAAVAARFGGVDMLINNAGQGYVAHFADTPREAWLHEAELK 117 ++ DV + A ++ A + G +D+L+N AG E W + Sbjct: 61 ----AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVN 116 Query: 118 LFGVINPVKAFQSLLEASDIASITCVNSLLALQPEEHMIATSAARAALLNMTLTLSKELV 177 GV N ++ + SI V S A P M A ++++AA + T L EL Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176 Query: 178 DKGIRVNSILLGMVESG-QWQ 197 + IR N + G E+ QW Sbjct: 177 EYNIRCNIVSPGSTETDMQWS 197
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.0 bits (70), Expect = 0.010 Identities = 33/188 (17%), Positives = 73/188 (38%), Gaps = 12/188 (6%) Query: 33 SFYGIRPLLILFMAATVYDGGMGLARENASAIVGIFAGSMYLAALPGGWLADNWLGQQRA 92 SF+ + ++L ++ + + + F + + G L+D LG +R Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ-LGIKRL 81 Query: 93 VWYGSILIALGHLSIALSAWLGNDLFFIGLMFIVL---GSGLFKTCISVMVGTLYKKGDA 149 + +G I+ G ++ ++G+ F + +M + G+ F + V+V K Sbjct: 82 LLFGIIINCFG----SVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPK--E 135 Query: 150 RRDGGFSLFYMGINIGSFIAPLISGWLIKSHGWHWGFGIGGIGMLVALIIFRVFAVPSMK 209 R F L + +G + P I G + +H HW + + + + + F + + Sbjct: 136 NRGKAFGLIGSIVAMGEGVGPAIGG--MIAHYIHWSYLLLIPMITIITVPFLMKLLKKEV 193 Query: 210 RYDAEVGL 217 R + Sbjct: 194 RIKGHFDI 201
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 29.3 bits (66), Expect = 0.050 Identities = 17/109 (15%), Positives = 41/109 (37%), Gaps = 13/109 (11%) Query: 394 LASLLALLLIVFVQPWTDSLTGLLAMSLPV---LALAAWIAAGSERIAYAGIQIGFTFA- 449 + L+ + P++ +L+ ++ L L A IA +Q GF + Sbjct: 53 FSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISG 112 Query: 450 ---------LAFLSWFAPLTNLTELRDRVLGILLGVLVSSIVHLYLWPD 489 + + + ++ L + + IL VL+S ++ + + + Sbjct: 113 EAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSILIWIIIKGN 161
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 61.0 bits (148), Expect = 6e-13 Identities = 36/225 (16%), Positives = 74/225 (32%), Gaps = 24/225 (10%) Query: 5 AKARLTTLDAQIMLTQRTIKAQEYNAQSVAAAVERARALVKQTTSTRIRLEPLVPQGFAS 64 K + +T Q + + + +V A + R L + S L+ + + Sbjct: 191 IKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIA 250 Query: 65 QEDLDQARTAEKAARAELEATLLQAKQASAAVTGVDAMVAQRAGVL-------------- 110 + + + A EL Q +Q + + + Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310 Query: 111 -----AQIALAELHLEFTEVRAPFNGVVVALKT-TVGQYASALKPVFTLL-DDDRWYVIA 163 ++A E + + +RAP + V LK T G + + + ++ +DD V A Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA 370 Query: 164 NFRETDLNNVRPGVAARITVMT-NHNRT--FNGVVDSVGSGVLPE 205 + D+ + G A I V + R G V ++ + + Sbjct: 371 LVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIED 415
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 33.6 bits (77), Expect = 5e-05 Identities = 12/77 (15%), Positives = 29/77 (37%), Gaps = 5/77 (6%) Query: 11 KKWPLLALVLAAILALILVIWQL-----QTSPETNDAYVYADTIDVVPEVSGRIVEMPIR 65 + P L +I I + + + ++ P + + E+ ++ Sbjct: 54 SRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVK 113 Query: 66 DNQRVRKGDLLFRIDPP 82 + + VRKGD+L ++ Sbjct: 114 EGESVRKGDVLLKLTAL 130
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 28.4 bits (63), Expect = 0.002 Identities = 11/55 (20%), Positives = 23/55 (41%) Query: 11 YVNDAQGNQVAEIVFVPTGEHLSIIEHTDVDPSLKGQGVGKQLVAKVVEKMRQEQ 65 ++ + N + I ++IE V + +GVG L+ K +E ++ Sbjct: 68 FLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENH 122
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 426 bits (1096), Expect = e-143 Identities = 163/479 (34%), Positives = 244/479 (50%), Gaps = 14/479 (2%) Query: 10 RLSTAIAIALCCFPPFSSGQENPGTVYQFNDGFIVG-SREKVDLSRFSTS-AITEGTYSL 67 + L F++ FN F+ + DLSRF + GTY + Sbjct: 21 HRLAGFFVRLFVACAFAAQAPLSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRV 80 Query: 68 DVYTNDEWKGRYDLR-IARDKDGRLGVCYTKAMLAQYGIAAEKLNPQLSEQEGYCGSLKS 126 D+Y N+ + D+ D + + C T+A LA G+ ++ + C L S Sbjct: 81 DIYLNNGYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTS 140 Query: 127 WRNEENVKDNLVQSSLRLNISVPQIYEDQRLKNYVSPEFWDKGITALNLGWMANAWNSHT 186 + L RLN+++PQ + R + Y+ PE WD GI A L + + + Sbjct: 141 MI--HDATAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSG--NSV 196 Query: 187 SSVGGSDNSSAYLGVNAGLSWDGWLLKHIGNLNWQQQQG----KAHWNSNQTYLQRPIPQ 242 + G ++ AYL + +GL+ W L+ ++ K W T+L+R I Sbjct: 197 QNRIGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIP 256 Query: 243 LNSIVSGGQIFTNGEFFDTIGLRGVNLSTDDNMFPDGMRSYAPEIRGVAQSNALVTVRQG 302 L S ++ G +T G+ FD I RG L++DDNM PD R +AP I G+A+ A VT++Q Sbjct: 257 LRSRLTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQN 316 Query: 303 SNIIYQTTVPPGPFTLQDVYPSGYGSDLEVSVKEADGSVEVFSVPYASVAQLLRPGMTRY 362 IY +TVPPGPFT+ D+Y +G DL+V++KEADGS ++F+VPY+SV L R G TRY Sbjct: 317 GYDIYNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRY 376 Query: 363 ALSAGKV-DDSALRNKPMLYQATWQHGINNLLTGYTGVTGFDDYQAFLVGTGMNTG-IGA 420 +++AG+ +A + KP +Q+T HG+ T Y G D Y+AF G G N G +GA Sbjct: 377 SITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGA 436 Query: 421 LSFDVTHSRLKS-DAHDDSGQSYRATFNRMFTDTQTSIVLAAYRYSTKGYYNLNDALYA 478 LS D+T + D GQS R +N+ ++ T+I L YRYST GY+N D Y+ Sbjct: 437 LSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYS 495
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 248 bits (635), Expect = 1e-77 Identities = 97/334 (29%), Positives = 151/334 (45%), Gaps = 19/334 (5%) Query: 1 MTFTVNQNLPDGWGGFYLSGRISDYWNRSGTEKQYQVSYNNSFGRLSWSASAQRVYTPDS 60 + TV Q L YLSG YW S ++Q+Q N +F ++W+ S T ++ Sbjct: 529 LQLTVTQQLGR-TSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSYSL--TKNA 585 Query: 61 SGHRRDDRISLNFSYPL-------WFGDNRTANLTSNTSFNNSRFASSQIGINGSLDSEN 113 RD ++LN + P R A+ + + S + + ++ G+ G+L +N Sbjct: 586 WQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDN 645 Query: 114 NLNYGVSTTTATGGQHD----VALNGSYRTPWTTLNGSYSQGEGYRQSGIGASGTMIAHS 169 NL+Y V T A GG + +YR + N YS + +Q G SG ++AH+ Sbjct: 646 NLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHA 705 Query: 170 GGVVLSPESGSTMALIEAKDAAGAMLPGSPGTRVDSNGYAILPYLRPYRINAVEIDPKGS 229 GV L T+ L++A A A + G R D GYA+LPY YR N V +D Sbjct: 706 NGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVALDTNTL 765 Query: 230 HDDVAFDRTVAQVVPWEGSVVKVAFGTKVQNNLTLQARQANHEPLPFAASIFSPDGKEIG 289 D+V D VA VVP G++V+ F +V L + N +PLPF A + S + G Sbjct: 766 ADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLTHNN-KPLPFGAMVTSESSQSSG 824 Query: 290 VIGQGSMMFISDANAK-RAIVKW---SGGQCSVD 319 ++ +++S + VKW C + Sbjct: 825 IVADNGQVYLSGMPLAGKVQVKWGEEENAHCVAN 858
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 593 bits (1530), Expect = 0.0 Identities = 185/571 (32%), Positives = 314/571 (54%), Gaps = 7/571 (1%) Query: 168 QTRIRALPASSGVAIAEGWMDVSLPLMEQVYEASTLDTASERERLTGALEEAANEFRRYS 227 +I + ASSGVAIA+ ++ + + + + S D ++E E+LT ALE++ E R Sbjct: 2 HHKITGIAASSGVAIAKAFIHLEPNV--DIEKTSITDVSTEIEKLTAALEKSKEELRAIK 59 Query: 228 KRYAAGAQKETAAIFDLYSHLLSDARLRRELFAEVDKGAV-AEWAVKKIIEKFAEQFAAL 286 + A + A IF + +L D L + +++ + AE+A+K++ + F F ++ Sbjct: 60 DQTEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESM 119 Query: 287 SDGYLKERAGDLRTLGQRLLFHLDDS-IQGPNTWPARIILVADELSATTLAEVPQDRLAG 345 + Y+KERA D+R + +R+L HL T +++A++L+ + A++ + + G Sbjct: 120 DNEYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKG 179 Query: 346 VVVRDGAANSHAAIMVRALGIPTVMGA-DIQPSLLHGHTLIVDGYRGELLVDPEPVLLQE 404 G SH+AIM R+L IP V+G ++ + HG +IVDG G ++V+P ++ Sbjct: 180 FATDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKA 239 Query: 405 YQRLISEENELSRLAEDDLQRASELKSGERVKVMLNAGLSPEHEEKLGSFVDGIGLYRTE 464 Y+ + + + + S K G V++ N G + + L + +GIGLYRTE Sbjct: 240 YEEKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTE 299 Query: 465 IPFMLQSGFPSEEEQVAQYQGMLQMFNSKPVTLRTLDIGADKQLPYMPISEE-NPCLGWR 523 +M + P+EEEQ Y+ ++Q + KPV +RTLDIG DK+L Y+ + +E NP LG+R Sbjct: 300 FLYMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFR 359 Query: 524 GIRITLDQPEIFLIQVRAMLRANAATGNLSILLPMVTSLEEVDEARRLIDRASREVEEMI 583 IR+ L++ +IF Q+RA+LRA + GNL ++ PM+ +LEE+ +A+ ++ ++ Sbjct: 360 AIRLCLEKQDIFRTQLRALLRA-STYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEG 418 Query: 584 GYAIPRPRLGVMLEVPSMVFMLPQLASRIDFISVGTNDLTQYLLAVDRNNTRVASMYDSL 643 +G+M+E+PS A +DF S+GTNDL QY +A DR N RV+ +Y Sbjct: 419 VDVSDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPY 478 Query: 644 HPAVLRALAMIAHDAERFGIDLRLCGEMAGDPMCVTILIGLGYRHLSMNGRSVARVKYLL 703 HPA+LR + M+ A G + +CGEMAGD + + +L+GLG SM+ S+ + L Sbjct: 479 HPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQL 538 Query: 704 RRIDIEEAQELSRRSLDAQMTAEVRHQVAAF 734 ++ EE + ++++L EV V Sbjct: 539 LKLSKEELKPFAQKALMLDTAEEVEQLVKKT 569
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 28.8 bits (64), Expect = 0.008 Identities = 19/96 (19%), Positives = 36/96 (37%), Gaps = 1/96 (1%) Query: 31 REHGYTLMETLVTLTLMMILSVGGLYGWQRWQQQQRLWQTAVQVRDFLLFLRDDANAYNR 90 R+ G+TL+E ++ L LM + + L + + A + L F++ + Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLA-RFEAQLRFVQQRGLQTGQ 60 Query: 91 DRVLRVGQDEVGWCLSAEGEGPDCASGTSFTLRPRW 126 + V D + + +G D A RW Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRW 96
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 25.7 bits (56), Expect = 0.035 Identities = 10/23 (43%), Positives = 15/23 (65%) Query: 7 RQRGFSLPETVLAMALMVLTVTA 29 RQRGF+L E +L + LM ++ Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGM 24
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 559 bits (1441), Expect = 0.0 Identities = 198/395 (50%), Positives = 270/395 (68%), Gaps = 5/395 (1%) Query: 4 KIMAINAGSSSLKFQLLNMPQGALLCQGLIERIGLPEARFTLKTSAQKWQETLPIADHHE 63 KI+ IN GSSSLK+QL+ G +L +GL ERIG+ ++ T + +K + + DH + Sbjct: 2 KILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKD 61 Query: 64 AVTLLLEALTGR--GILSSLQEIDGVGHRVAHGGERFKDAALVCDDTLREIERLAELAPL 121 A+ L+L+AL G++ + EID VGHRV HGGE F + L+ DD L+ I ELAPL Sbjct: 62 AIKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPL 121 Query: 122 HNPVNALGIRLFRQLLPAVPAVAVFDTAFHQTLAPEAWLYPLPWRYYAELGIRRYGFHGT 181 HNP N GI+ Q++P VP VAVFDTAFHQT+ A+LYP+P+ YY + IR+YGFHGT Sbjct: 122 HNPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGT 181 Query: 182 SHHYVSSALAEKLGVPLSALRVVSCHLGNGCSVCAIKGGQSVNTSMGFTPQSGVMMGTRS 241 SH YVS AE L P+ +L++++CHLGNG S+ A+K G+S++TSMGFTP G+ MGTRS Sbjct: 182 SHKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRS 241 Query: 242 GDIDPSILPWLVEKEGKSAQQLSQLLNNESGLLGVSGVSSDYRDVEQAADA-GNERAALA 300 G IDPSI+ +L+EKE SA+++ +LN +SG+ G+SG+SSD+RD+E AA G++RA LA Sbjct: 242 GSIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLA 301 Query: 301 LSLFAERIRATIGSYIMQMGGLDALIFTGGIGENSARARAAICRNLHFLGLALDDEKNQR 360 L++FA R++ TIGSY MGG+D ++FT GIGEN R I L FLG LD EKN+ Sbjct: 302 LNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKV 361 Query: 361 SA--TFIQADNALVKVAVINTNEELMIARDVMRLA 393 I ++ V V V+ TNEE MIA+D ++ Sbjct: 362 RGEEAIISTADSKVNVMVVPTNEEYMIAKDTEKIV 396
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.007 Identities = 9/58 (15%), Positives = 17/58 (29%), Gaps = 1/58 (1%) Query: 152 ADFVICFYNPRSRGREGHLARAFTLLAASKSADTPVGVVKSAGRKKQEKWLTTLGEMD 209 D+ + G+ + L S + +G K + + L EM Sbjct: 595 FDYSVVLEGTGGIGKSTLINTLVGLDFFSDTHFD-IGTGKDSYEQIAGIVAYELSEMT 651
>PF05616#Neisseria meningitidis TspB protein Length = 501 Score = 27.4 bits (60), Expect = 0.009 Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 1/62 (1%) Query: 18 KFSGQSISQAMQEWDSTDFSITPEILWKQTGKPAKNQKVRVTRGDGTTVEMTTDDQGKLP 77 K+ + ++ ++E S P+ K TG P ++KV V G + TD G P Sbjct: 230 KYKEEMDAKKLEEILSLKVDANPDKYIKATGYPGYSEKVEVAPGTKVNMGPVTDRNGN-P 288 Query: 78 VQ 79 VQ Sbjct: 289 VQ 290
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 378 bits (972), Expect = e-127 Identities = 141/356 (39%), Positives = 196/356 (55%), Gaps = 24/356 (6%) Query: 4 PESPSTAPALI--DPASKAFQSLLDKLAPTEATVLIVGETGTGKEVVARYLHHHSARRQQ 61 + L+ A + +L +L T+ T++I GE+GTGKE+VAR LH + RR Sbjct: 130 EDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNG 189 Query: 62 PFLAVNCGALTESLAEAELFGHEKGAFTGAQQGQPGWFEAAEGGTLLLDEIGELSLPLQV 121 PF+A+N A+ L E+ELFGHEKGAFTGAQ G FE AEGGTL LDEIG++ + Q Sbjct: 190 PFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQT 249 Query: 122 KLLRVLQEREITRVGSRKAIKVNVRVIAATHVDLAQAIRERRFREDLYYRLNIAVVPLPP 181 +LLRVLQ+ E T VG R I+ +VR++AAT+ DL Q+I + FREDLYYRLN+ + LPP Sbjct: 250 RLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPP 309 Query: 182 LRQRRQDIPLLAHHFLSLYARRLGRPTLRLAPESLARLMDYSWPGNIRELENTLHNAVLL 241 LR R +DIP L HF+ + G R E+L + + WPGN+RELEN + L Sbjct: 310 LRDRAEDIPDLVRHFVQQA-EKEGLDVKRFDQEALELMKAHPWPGNVRELENLVRRLTAL 368 Query: 242 SKEEEISPAQLRLATLNDAP-----------------GPASDHELDDFIRHQLALPGEPL 284 ++ I+ + ++ P ++ F ALP L Sbjct: 369 YPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGL 428 Query: 285 WQRVTSA----LIRHAMAHCDDNQSQAAALLGISRHTLRTQLANLGLIKSRRRPPA 336 + RV + LI A+ NQ +AA LLG++R+TLR ++ LG+ R A Sbjct: 429 YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVSVYRSSRSA 484
>HOKGEFTOXIC#Hok/Gef cell toxic protein family signature. Length = 52 Score = 54.4 bits (131), Expect = 1e-14 Identities = 17/50 (34%), Positives = 29/50 (58%) Query: 1 MLTKYALVAIIVLCITVLGFTLLVHSSLCELSIKERNIEFKAVLAYESKK 50 + + ++++C+T+L FT L SLCE+ ++ E A +AYES K Sbjct: 3 LPRSSLVWCVLIVCLTLLIFTYLTRKSLCEIRYRDGYREVAAFMAYESGK 52
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 36.5 bits (84), Expect = 7e-05 Identities = 44/191 (23%), Positives = 75/191 (39%), Gaps = 16/191 (8%) Query: 52 PPAAQKLPDVGYLRQLNAEGILALRPQLVLASAQAQPSLVLHKVQASGVKVVNVPGGESL 111 PP + DVG + N E + ++P ++ SA PS + A G G + L Sbjct: 72 PPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPL 131 Query: 112 SAIDNKVAVIAEALGKTAAGDALRQQLQQQIAAIPTQPV---AKRVLFILSHGGMNTLVA 168 + + +A+ L +A + Q + I ++ + V A+ +L + LV Sbjct: 132 AMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVF 191 Query: 169 GQHTAADGAIRAAGLQNAMQG---FDHYRAMSQEGVAA-SQADLVVISADGLKGMGGEAG 224 G ++ + G+ NA QG F A+S + +AA D++ D K M Sbjct: 192 GPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKDM----- 246 Query: 225 LWKLPGLAQTP 235 L TP Sbjct: 247 ----DALMATP 253
>PF07675#Cleaved Adhesin Length = 1358 Score = 28.1 bits (62), Expect = 0.041 Identities = 17/51 (33%), Positives = 22/51 (43%) Query: 212 YTVMVKGTVLASGPTETTFTAGNLERAFSGVLRHVALTGGEAQIITDDERP 262 YT+ T +ASG TETT+ +L F V GE+ I T Sbjct: 1261 YTIYRNNTQIASGVTETTYRDPDLATGFYTYGVKVVYPNGESAIETATLNI 1311
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 377 bits (970), Expect = e-126 Identities = 136/373 (36%), Positives = 200/373 (53%), Gaps = 41/373 (10%) Query: 350 YREIQRLKERLVDENLALTEQLNNVESEFGEIIGRSEAMNNVLKQVEMVAHSDSTVLILG 409 E+ + R + E +L + + ++GRS AM + + + + +D T++I G Sbjct: 108 LTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITG 167 Query: 410 ETGTGKELIARAIHNLSGRNGRRMVKMNCAAMPAGLLESDLFGHERGAFTGASAQRIGRF 469 E+GTGKEL+ARA+H+ R V +N AA+P L+ES+LFGHE+GAFTGA + GRF Sbjct: 168 ESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRF 227 Query: 470 ELADKSSLFLDEVGDMPLELQPKLLRVLQEQEFERLGSNKLIQTDVRLIAATNRDLKQMV 529 E A+ +LFLDE+GDMP++ Q +LLRVLQ+ E+ +G I++DVR++AATN+DLKQ + Sbjct: 228 EQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSI 287 Query: 530 IDREFRSDLYYRLNVFPIHLPPLRERPDDIPLLVKAFTFKIARRMGRNIDSIPAETLRTL 589 FR DLYYRLNV P+ LPPLR+R +DIP LV+ F + A + G ++ E L + Sbjct: 288 NQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFV-QQAEKEGLDVKRFDQEALELM 346 Query: 590 TRMEWPGNVRELENVIERAVLLTRGNVLQ------------------------------- 618 WPGNVRELEN++ R L +V+ Sbjct: 347 KAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQ 406 Query: 619 -----LSLPERDIVEAPRTPAVLPEEGED-EYQLIVRVLKESNGVVAGPKGAAQRLGLKR 672 + +A + + EY LI+ L + G AA LGL R Sbjct: 407 AVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQI---KAADLLGLNR 463 Query: 673 TTLLSRMKRLGIN 685 TL +++ LG++ Sbjct: 464 NTLRKKIRELGVS 476
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 28.2 bits (63), Expect = 0.040 Identities = 14/74 (18%), Positives = 27/74 (36%), Gaps = 3/74 (4%) Query: 19 ALVVCLALSLSTTMLGVFLLLRRMSLMGDALSHAILP-GVAVGYLLSGMSLLAMTLGG-- 75 + + +ALS L + LM + LP A+ Y++ + L L Sbjct: 31 STALIVALSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFYLCFPL 90 Query: 76 FIAGIVVALVAGWV 89 ++A+ + V Sbjct: 91 LTVAALMAIASHVV 104
>adhesinb#Adhesin B signature. Length = 310 Score = 238 bits (608), Expect = 1e-79 Identities = 92/308 (29%), Positives = 168/308 (54%), Gaps = 17/308 (5%) Query: 1 MKRSAIVVALALGLMAQGAMAKT----------LNVVSSFSVLGDIAQQVGGEHVHVDTL 50 MK+ +V L L + A + LNVV++ S++ DI + + G+ +++ ++ Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTETGSSKLNVVATNSIIADITKNIAGDKINLHSI 60 Query: 51 VGPDGDPHTFEPSPKDSALLSKADVVVVNGLGLE----GWLDRLIKASGFKGE--LVVAS 104 V DPH +EP P+D S+AD++ NG+ LE W +L++ + K S Sbjct: 61 VPVGQDPHEYEPLPEDVKKTSQADLIFYNGINLETGGNAWFTKLVENAKKKENKDYYAVS 120 Query: 105 KGVKTHTLDEEGKTVT-DPHAWNSAANGALYAQNILDGLVKADPEDKAALTSSGKRYIDQ 163 +GV L+ + + DPHAW + NG +YAQNI L + DP +K + K Y+++ Sbjct: 121 EGVDVIYLEGQSEKGKEDPHAWLNLENGIIYAQNIAKRLSEKDPANKETYEKNLKAYVEK 180 Query: 164 LTSLDGWAKAQFSAIPLAKRKVLTSHDAFGYFGRAYHVTFLAPQGLSSESEASAAQVAAL 223 L++LD AK +F+ IP K+ ++TS F YF +AY+V +++E E + Q+ L Sbjct: 181 LSALDKEAKEKFNNIPGEKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKTL 240 Query: 224 IKQIKADGVHTWFMENQLDPRLVKQIASATGAQPGGELYPEALSKPGGVADSYVKMMRHN 283 +++++ V + F+E+ +D R +K ++ T +++ +++++ G DSY MM++N Sbjct: 241 VEKLRKTKVPSLFVESSVDDRPMKTVSKDTNIPIYAKIFTDSVAEKGEEGDSYYSMMKYN 300 Query: 284 VELIANSM 291 +E IA + Sbjct: 301 LEKIAEGL 308
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 720 bits (1860), Expect = 0.0 Identities = 324/851 (38%), Positives = 459/851 (53%), Gaps = 46/851 (5%) Query: 20 PADSAERYNAQFVNG-----IDPLAFNQFVASDGDVMPGTYDVNIYINDLLVDSRPVRFS 74 + + +N +F+ D F ++ PGTY V+IY+N+ + +R V F+ Sbjct: 42 LSSAELYFNPRFLADDPQAVADLSRFEN----GQELPPGTYRVDIYLNNGYMATRDVTFN 97 Query: 75 EDSAHGGLAPCLSAAEYIRYGVKIDD-------DHQPCFALSQTIRQAEQQLDIANHQLI 127 + G+ PCL+ A+ G+ C L+ I A QLD+ +L Sbjct: 98 TGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDATAQLDVGQQRLN 157 Query: 128 IHIPQQYIEHYPRDYVSPMRFDEGINAAFVNYSYS-TDANNGDGGSHQYQYLSLNSGINI 186 + IPQ ++ + R Y+ P +D GINA +NY++S N GG+ Y YL+L SG+NI Sbjct: 158 LTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNI 217 Query: 187 ASWRLRNNAYWNKF-----SGQADKWQSIASWAETNIIPWRSRLVVGQTSTDNSVFDSVQ 241 +WRLR+N W+ SG +KWQ I +W E +IIP RSRL +G T +FD + Sbjct: 218 GAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGIN 277 Query: 242 FRGVQLGTDVEMRPSSQTGFAPVIRGVANSNARVEVRQNNYLIYSENVPAGPFELNDISA 301 FRG QL +D M P SQ GFAPVI G+A A+V ++QN Y IY+ VP GPF +NDI A Sbjct: 278 FRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYA 337 Query: 302 VNRSGDFYVTVIEADGSQTTFTVAYTTLPQLVRAGQWNYQLSAGKYH-DGADGYAPALMQ 360 SGD VT+ EADGS FTV Y+++P L R G Y ++AG+Y A P Q Sbjct: 338 AGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQ 397 Query: 361 SSLSYGLNNTFTLYGGALAAENYRAGAFGVGSNLGEIGALSADYTLAGTTLANGQRKQGG 420 S+L +GL +T+YGG A+ YRA FG+G N+G +GALS D T A +TL + + G Sbjct: 398 STLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQ 457 Query: 421 SVRFLYAKSFLSSKTDFQIAGYRYSTAGYYSLSDAVNERRRWHNGLYENDYWPSDEDESW 480 SVRFLY KS S T+ Q+ GYRYST+GY++ +D R +N +D Sbjct: 458 SVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIE--------TQDGVI 509 Query: 481 QASAPQHYYTSWFYNKKHRFDISARQTLGKNSTFFLNFSQQNYWNSSGSDISLQAGFNST 540 Q Y + YNK+ + ++ Q LG+ ST +L+ S Q YW +S D QAG N+ Sbjct: 510 QVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTA 569 Query: 541 IHNVNYGLYYQNTRSHFTHD-DNSITLRVSIPF-------TLQENRRINTAFTLAHSKSS 592 ++N+ L Y T++ + D + L V+IPF + + R + +++++H + Sbjct: 570 FEDINWTLSYSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNG 629 Query: 593 GTSGQAGVNGTLLDDDRLSWAVTSAYDD----TSHSTNSASLGYLGQYGNLYTGYAYSKN 648 + AGV GTLL+D+ LS++V + Y S ST A+L Y G YGN GY++S + Sbjct: 630 RMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDD 689 Query: 649 HRQASLNLSGGVVAHRGGVTLSQPLGSTFALVEAKDAQGVGIENQTGVRIDPFGYAVVPQ 708 +Q +SGGV+AH GVTL QPL T LV+A A+ +ENQTGVR D GYAV+P Sbjct: 690 IKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPY 749 Query: 709 SVPYRVNSVALNPQDFDAFLDVPNAVADTVPTRGAITRVRFDTFRGYSVLIHTTLADGSY 768 + YR N VAL+ +D+ NAVA+ VPTRGAI R F G +L+ T + Sbjct: 750 ATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLM-TLTHNNKP 808 Query: 769 PPLGAELYRASGISNGLVGPGGDVYVSGVDSGEKLQMKWGETHQQSCEITLPELRQEPQQ 828 P GA + S S+G+V G VY+SG+ K+Q+KWGE C Q Sbjct: 809 LPFGAMVTSESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQ--LPPESQ 866 Query: 829 ATAWRELSLIC 839 +LS C Sbjct: 867 QQLLTQLSAEC 877
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 95.5 bits (237), Expect = 1e-25 Identities = 69/258 (26%), Positives = 122/258 (47%), Gaps = 11/258 (4%) Query: 5 LAGKVALVTASTAGIGFAIAKGLAESGAEVILNGRSEQSVNAAIARLQNEVPGAKARPAI 64 + GK+A +T + GIG A+A+ LA GA + + + + ++ L+ E A+A PA Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPA- 64 Query: 65 ADLSDADG----AAQLLRAVTGVDILVNNAGIYGPQDFYATDDATWDNYWQTNVMSGVRL 120 D+ D+ A++ R + +DILVN AG+ P ++ D W+ + N Sbjct: 65 -DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 121 SRGLLPAMVNKGWGRVVFISSESARNIPADMIHYGVTKTAQLSLARGLAKYVAGSGVTVN 180 SR + M+++ G +V + S A M Y +K A + + L +A + N Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 181 SVLPGPTISDGFAEMLKDEVAKTGQSLEELAKAFVMTHRPSSVIQRAASVAEVANMVVYV 240 V PG T +D + DE E++ K + T + +++ A +++A+ V+++ Sbjct: 184 IVSPGSTETDMQWSLWADENGA-----EQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238 Query: 241 CSPQASATSGAALRVDGG 258 S QA + L VDGG Sbjct: 239 VSGQAGHITMHNLCVDGG 256
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1037 bits (2682), Expect = 0.0 Identities = 430/1040 (41%), Positives = 640/1040 (61%), Gaps = 16/1040 (1%) Query: 4 SRFFIDRPIFAAVLSILIFITGLIAIPLLPVSEYPDVVPPSVQVRAEYPGANPKVIAETV 63 + FFI RPIFA VL+I++ + G +AI LPV++YP + PP+V V A YPGA+ + + +TV Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 64 ATPLEEAINGVENMMYMKSVAGSDGVLVTTVTFRPGTDPDQAQVQVQNRVAQAEARLPED 123 +E+ +NG++N+MYM S + S G + T+TF+ GTDPD AQVQVQN++ A LP++ Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 124 VRRLGITTQKQSPTLTLVVHLFSPKGKYDSLYMRNYATLKVKDELARLPGVGQIQIFGSG 183 V++ GI+ +K S + +V S + +Y VKD L+RL GVG +Q+FG Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-A 180 Query: 184 EYAMRVWLDPNKVAARGLTASDVVTAMQEQNVQVSAGQLGAEPLPQESDFLISINAQGRL 243 +YAMR+WLD + + LT DV+ ++ QN Q++AGQLG P SI AQ R Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 244 HTEEEFGNIILKTAQDGSLVRLRDVARIEMGSGSYALRSQLNNKDAVGIGIFQSPGANAI 303 EEFG + L+ DGS+VRL+DVAR+E+G +Y + +++N K A G+GI + GANA+ Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 304 DLSNAVRAKMAELATRFPEDMQWAAPYDPTVFVRDSIRAVVQTLLEAVVLVVLVVILFLQ 363 D + A++AK+AEL FP+ M+ PYD T FV+ SI VV+TL EA++LV LV+ LFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 364 TWRASIIPLIAVPVSVVGTFSILYLLGFSLNTLSLFGLVLAIGIVVDDAIVVVENVER-N 422 RA++IP IAVPV ++GTF+IL G+S+NTL++FG+VLAIG++VDDAIVVVENVER Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 423 IEEGLAPLAAAHQAMREVSGPIIAIALVLCAVFVPMAFLSGVTGQFYKQFAVTIAISTVI 482 +E+ L P A ++M ++ G ++ IA+VL AVF+PMAF G TG Y+QF++TI + + Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 483 SAINSLTLSPALAALLLKPHGAKKDLPTRLIDRLFGWIFRPFNRFFLRSSNGYQGLVSKT 542 S + +L L+PAL A LLKP A+ FGW FN F S N Y V K Sbjct: 481 SVLVALILTPALCATLLKPVSAE---HHENKGGFFGW----FNTTFDHSVNHYTNSVGKI 533 Query: 543 LGRRGAVFAVYLLLLCAAGVMFKVVPGGFIPTQDKLYLIGGVKMPEGSSLARTDAVIRKM 602 LG G +Y L++ V+F +P F+P +D+ + +++P G++ RT V+ ++ Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593 Query: 603 SEIGMNTEGVDYAVAFPGLNALQFTNTPNTGTVFFGLKPFDQR---KHTAAEINAEINAK 659 ++ + E + F N G F LKP+++R +++A + + Sbjct: 594 TDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKME 653 Query: 660 IAQIQQGFGFSILPPPILGLGQGSGYSLYIQDRGGLGYGALQSAVNAMSGAIMQTPG-MH 718 + +I+ GF P I+ LG +G+ + D+ GLG+ AL A N + G Q P + Sbjct: 654 LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLV 713 Query: 719 FPISTYQANVPQLDVQVDRDKAKAQGVSLTDLFGTLQTYLGSSYVNDFNQFGRTWRVMAQ 778 + Q ++VD++KA+A GVSL+D+ T+ T LG +YVNDF GR ++ Q Sbjct: 714 SVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQ 773 Query: 779 ADGPYRESVEDIANLRTRNNQGEMVPIGSMVNISTTYGPDPVIRYNGYPAADLIGDADPR 838 AD +R ED+ L R+ GEMVP + YG + RYNG P+ ++ G+A P Sbjct: 774 ADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPG 833 Query: 839 VLSSSQAMTHLEELSKQILPNGMNIEWTDLSFQQATQGNTALIVFPVAVLLAFLVLAALY 898 SS AM +E L+ + LP G+ +WT +S+Q+ GN A + ++ ++ FL LAALY Sbjct: 834 T-SSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALY 891 Query: 899 ESWTLPLAVILIVPMTMLSALFGVWLTGGDNNVFVQVGLVVLMGLACKNAILIVEFAREL 958 ESW++P++V+L+VP+ ++ L L N+V+ VGL+ +GL+ KNAILIVEFA++L Sbjct: 892 ESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDL 951 Query: 959 -EIQGKGIMEAALEACRLRLRPIVMTSIAFIAGTIPLILGHGAGAEVRGVTGITVFSGML 1017 E +GKG++EA L A R+RLRPI+MTS+AFI G +PL + +GAG+ + GI V GM+ Sbjct: 952 MEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMV 1011 Query: 1018 GVTLFGLFLTPVFYVTLRKL 1037 TL +F PVF+V +R+ Sbjct: 1012 SATLLAIFFVPVFFVVIRRC 1031 Score = 83.3 bits (206), Expect = 3e-18 Identities = 71/328 (21%), Positives = 123/328 (37%), Gaps = 23/328 (7%) Query: 730 QLDVQVDRDKAKAQGVSLTDLFGTLQT--------YLGSSYVNDFNQFGRTWRVMAQADG 781 + + +D D ++ D+ L+ LG + Q ++AQ Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQL--NASIIAQT-- 238 Query: 782 PYRESVEDIANLRTRNNQ-GEMVPIGSMVNISTTYGPDPVI-RYNGYPAADLI----GDA 835 ++ E+ + R N G +V + + + VI R NG PAA L A Sbjct: 239 -RFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGA 297 Query: 836 DPRVLSSSQAMTHLEELSKQILPNGMNIEWT-DLSFQQATQGNTALIVFPVAVLLAFLVL 894 + + L EL + P GM + + D + + + A++L FLV+ Sbjct: 298 NALDTAK-AIKAKLAEL-QPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVM 355 Query: 895 AALYESWTLPLAVILIVPMTMLSALFGVWLTGGDNNVFVQVGLVVLMGLACKNAILIVE- 953 ++ L + VP+ +L + G N G+V+ +GL +AI++VE Sbjct: 356 YLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVEN 415 Query: 954 FARELEIQGKGIMEAALEACRLRLRPIVMTSIAFIAGTIPLILGHGAGAEVRGVTGITVF 1013 R + EA ++ +V ++ A IP+ G+ + IT+ Sbjct: 416 VERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIV 475 Query: 1014 SGMLGVTLFGLFLTPVFYVTLRKLVTRR 1041 S M L L LTP TL K V+ Sbjct: 476 SAMALSVLVALILTPALCATLLKPVSAE 503
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 52.5 bits (126), Expect = 1e-09 Identities = 26/155 (16%), Positives = 56/155 (36%), Gaps = 13/155 (8%) Query: 70 LRPRVSGYIDKVNYTDGQEVKKGQVLFTIDDRTYRAALEQAQAALARAKT-----QASLA 124 ++P + + ++ +G+ V+KG VL + A + Q++L +A+ Q Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSR 158 Query: 125 QSEANRTDKLVHTN----LVSREEWEQRRSAAVQAQADIRAAQAAVDAAQLNLDFTKVTA 180 E N+ +L + EE R ++ ++ Q Q Q L+ K A Sbjct: 159 SIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQ----KYQKELNLDKKRA 214 Query: 181 PIDGRASRALITSGNLVTAGDTASVLTTLVSQKTV 215 +R ++L+ ++ + Sbjct: 215 ERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAI 249 Score = 39.8 bits (93), Expect = 1e-05 Identities = 19/104 (18%), Positives = 37/104 (35%), Gaps = 7/104 (6%) Query: 102 TYRAALEQAQAALARAKTQASLAQSEANRTDK--LVHTNLVSREEWEQRRSAAVQAQADI 159 +A L K+Q +SE + + T L E ++ R Q +I Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLR----QTTDNI 311 Query: 160 RAAQAAVDAAQLNLDFTKVTAPIDGRASR-ALITSGNLVTAGDT 202 + + + + AP+ + + + T G +VT +T Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 27.9 bits (62), Expect = 0.016 Identities = 18/78 (23%), Positives = 34/78 (43%), Gaps = 8/78 (10%) Query: 36 GSRVLELGPTQMTAAVDVSKAGISKTFTTRNTLTSNQSILMSLVDGPFKKLIGGWK---- 91 G+ VLE+ P+ + +D G + + LT I S+++G +++ + Sbjct: 113 GNAVLEVDPSITFSIIDRLFGGTGQAAKVQRDLT---DIENSVMEGVIVRILANVRESWT 169 Query: 92 -FIPLSPEACKIEFHLDF 108 I L P +IE + F Sbjct: 170 QVIDLRPRLGQIETNPQF 187
>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature. Length = 591 Score = 31.3 bits (70), Expect = 0.017 Identities = 24/64 (37%), Positives = 31/64 (48%), Gaps = 13/64 (20%) Query: 162 GFTLYPDRALIEITGKIFNGNATPRH--FLWW-ANPAVKGGDAHQSVFPPDVTAVFDHGK 218 G DR+ + K+ GNATP +LW A PAV Q +F T VFD+G+ Sbjct: 220 GNEAGRDRSAMRYLSKVQYGNATPAADLYLWTSATPAV------QWLF----TLVFDYGE 269 Query: 219 RDVS 222 R V Sbjct: 270 RGVD 273
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 29.9 bits (67), Expect = 0.007 Identities = 16/82 (19%), Positives = 27/82 (32%), Gaps = 10/82 (12%) Query: 92 DNDIWYLLGYCAEQAGDAQQAAEYYQLARQGGSTLDAGRYYNDQPADYLFWQGIALRKSG 151 D+ + LG C + G A Y + +D +P + F L + G Sbjct: 69 DSRFFLGLGACRQAMGQYDLAIHSYSYG----AIMD-----IKEP-RFPFHAAECLLQKG 118 Query: 152 NPAQAEQHFRHFIDWAAQHRDD 173 A+AE + A + Sbjct: 119 ELAEAESGLFLAQELIADKTEF 140
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 110 bits (276), Expect = 2e-28 Identities = 80/364 (21%), Positives = 139/364 (38%), Gaps = 60/364 (16%) Query: 23 GIDLGTTNSLVATVRSGQAETLPDHQGRYLLPSVVNYHASGLTVGYDARLNAAQDPANTI 82 IDLGT N+L+ G PSVV G + A A Sbjct: 14 SIDLGTANTLIYVKGQGIV---------LNEPSVVA--IRQDRAGSPKSVAAVGHDA--- 59 Query: 83 SSVKRMMGRSLADIQNRYPHLPYQLQASENGLPMIQTAGGLLNPIRVSADILKALAARAT 142 K+M+GR+ +I P G++ V+ +L+ + Sbjct: 60 ---KQMLGRTPGNIAAIRP-----------------MKDGVIADFFVTEKMLQHFIKQVH 99 Query: 143 EALAGE-LDGVVITVPAYFDDAQRQGTKDAARLAGLHVLRLLNEPTAAAIAYGLDSGQEG 201 V++ VP +R+ +++A+ AG + L+ EP AAAI GL + Sbjct: 100 SNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEAT 159 Query: 202 VIAVYDLGGGTFDISILRLSRGVFEVLATGGDSALGGDDFDHLLADYLREQAGF--SDRS 259 V D+GGGT +++++ L+ V +GGD FD + +Y+R G + + Sbjct: 160 GSMVVDIGGGTTEVAVISLNGVV-----YSSSVRIGGDRFDEAIINYVRRNYGSLIGEAT 214 Query: 260 DNRLQRELLDAAIAAKIALSDAEAAHVEVGG---WQG-----DITRSQFNDLIAPLVKRT 311 R++ E+ A E +EV G +G + ++ + + + Sbjct: 215 AERIKHEI-------GSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQEPLTGI 267 Query: 312 LMACRRALKDAGVE-AQEVLE--VVMVGGSTRVPLVRERVGEFFGRTPLTSIDPDKVVAI 368 + A AL+ E A ++ E +V+ GG + + + E G + + DP VA Sbjct: 268 VSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLMEETGIPVVVAEDPLTCVAR 327 Query: 369 GAAI 372 G Sbjct: 328 GGGK 331
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 45.9 bits (109), Expect = 8e-08 Identities = 47/179 (26%), Positives = 70/179 (39%), Gaps = 41/179 (22%) Query: 33 LGIDLGTCD----------------VVSMVVDRDGQP---VAVCLDWADVV--------- 64 L IDLGT + VV++ DR G P AV D ++ Sbjct: 13 LSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAKQMLGRTPGNIAA 72 Query: 65 ----RDGIVWDFFGAVTLVRRHLATLEQQLGCRFT-HAATSFPPGTDP---RISINVLES 116 +DG++ DFF +++ + + R + P G R + Sbjct: 73 IRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQG 132 Query: 117 AGLEISHVLDEPTAVA---DLLQLDNAG--VVDIGGGTTGIAIVKQGRVTYSADEATGG 170 AG +++EP A A L + G VVDIGGGTT +A++ V YS+ GG Sbjct: 133 AGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGG 191
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.3 bits (78), Expect = 8e-04 Identities = 11/33 (33%), Positives = 16/33 (48%) Query: 30 MVALLGPSGSGKTTLLRIIAGLEHQTSGHIRFH 62 V L G G GK+TL+ + GL+ + H Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG 630
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 26.9 bits (59), Expect = 0.023 Identities = 15/60 (25%), Positives = 25/60 (41%), Gaps = 3/60 (5%) Query: 16 DRAISQED--YDTLMSYYAEDAALVVKPGMVVRGKENIRKAFIAIADYFQHRLVVTQGKM 73 DR + Q Y+ L Y ED+ PG V + R + ++DY ++ K+ Sbjct: 141 DRVVLQYQGRYEVLGLYSQEDSGSDGVPGAQVNEQLQQRAST-TMSDYVSFSPIMNDNKL 199
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 746 bits (1927), Expect = 0.0 Identities = 275/571 (48%), Positives = 392/571 (68%), Gaps = 2/571 (0%) Query: 1 MISGILASPGIAFGKALLLKEDEIVIDRKKISADKVDQEVERFLSGRAKASAQLEVIKTK 60 I+GI AS G+A KA + E + I++ I V E+E+ + K+ +L IK + Sbjct: 4 KITGIAASSGVAIAKAFIHLEPNVDIEKTSI--TDVSTEIEKLTAALEKSKEELRAIKDQ 61 Query: 61 AGETFGEEKEAIFEGHIMLLEDEELEQEIIALIKDKHMTADAAANEVIDGQATALEELDD 120 + G +K IF H+++L+D EL I I+++ M A+ A EV D + E +D+ Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121 Query: 121 EYLKERAADVRDIGKRLLRNILGLAIIDLSAIQDEVILVAADLTPSETAQLNLKKVLGFI 180 EY+KERAAD+RD+ KR+L +++G+ L+ I +E +++A DLTPS+TAQLN + V GF Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181 Query: 181 TDAGGRTSHTSIMARSLELPAIVGTGSITAQVKNGDYLILDAVNNQVLINPSNEQIEALR 240 TD GGRTSH++IM+RSLE+PA+VGT +T ++++GD +I+D + V++NP+ E+++A Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241 Query: 241 SLQAQVAEEKAELAKLKDLPAITLDGHQVEVCANIGTVRDVEGAERNGAEGVGLYRTEFL 300 +A ++K E AKL P+ T DG VE+ ANIGT +DV+G NG EG+GLYRTEFL Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301 Query: 301 FMDRDALPTEEEQFAAYKAVAEACGSQAVIVRTMDIGGDKELPYMNFPKEENPFLGWRAV 360 +MDRD LPTEEEQF AYK V + + V++RT+DIGGDKEL Y+ PKE NPFLG+RA+ Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361 Query: 361 RIAMDRKEILRDQVRAILRASAFGKLRIMFPMIISVEEVRALKKEIEIYKQELRDEGKAF 420 R+ +++++I R Q+RA+LRAS +G L++MFPMI ++EE+R K ++ K +L EG Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421 Query: 421 DESIEIGVMVETPAAATIARHLAKEVDFFSIGTNDLTQYTLAVDRGNDMISHLYQPMSPS 480 +SIE+G+MVE P+ A A AKEVDFFSIGTNDL QYT+A DR N+ +S+LYQP P+ Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481 Query: 481 VLNLIKQVIDASHAEGKWTGMCGELAGDERATLLLLGMGLDEFSMSAISIPRIKKIIRNT 540 +L L+ VI A+H+EGKW GMCGE+AGDE A LLLG+GLDEFSMSA SI + + Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541 Query: 541 NFEDAKVLAEQALAQPTTDELMTLVNKFIEE 571 + E+ K A++AL T +E+ LV K + Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKKTYLK 572
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 53.1 bits (127), Expect = 1e-09 Identities = 26/126 (20%), Positives = 40/126 (31%), Gaps = 7/126 (5%) Query: 98 ERQMQQPARPEEPVRQPPQPPRQAPVPPQQQPAPHAAP-QPGWQQPQPAQPPVQPQHQPQ 156 E + Q +E + + Q+ + P +Q + QP +P + Sbjct: 1091 ETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREND 1150 Query: 157 PVV--QQPVAPQPVTPTVAQPQPAAPQQPAPQPVAASQPAVAEPQPVE---PQQPAAPQP 211 P V ++P + T QP QPV S VE PA QP Sbjct: 1151 PTVNIKEPQSQTNTTADTEQPAKETSSNV-EQPVTESTTVNTGNSVVENPENTTPATTQP 1209 Query: 212 KERKET 217 E+ Sbjct: 1210 TVNSES 1215 Score = 47.8 bits (113), Expect = 6e-08 Identities = 23/135 (17%), Positives = 42/135 (31%), Gaps = 1/135 (0%) Query: 78 AHGEHEAPRQSPQHQYQPPYERQMQQPARPEEPVRQPPQPPRQAPVPPQQQPAPHAAPQP 137 A E E ++ P+ Q +++ + +P+ + P P Q QP Sbjct: 1112 AKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP 1171 Query: 138 GWQQPQPAQPPVQPQHQPQPVVQQPVAPQPVTPTVAQPQPAAPQQPAPQPVAASQPAVAE 197 + + PV P+ TP QP + P+ + + Sbjct: 1172 AKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPK-NRHRRSVRSV 1230 Query: 198 PQPVEPQQPAAPQPK 212 P VEP ++ Sbjct: 1231 PHNVEPATTSSNDRS 1245 Score = 35.4 bits (81), Expect = 4e-04 Identities = 22/152 (14%), Positives = 40/152 (26%), Gaps = 11/152 (7%) Query: 42 KRMKSRDDESENDDFDDNVEGVGEVRVHPVTHAPHGAHGEHEAPRQSPQHQYQP-----P 96 K + + E EV +P E P+ P + P Sbjct: 1098 TETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKE 1157 Query: 97 YERQMQQPARPEEPVRQPPQPPRQAPVPPQQQPAPHAAPQ-PGWQQPQPAQPPVQPQHQP 155 + Q A E+P ++ Q ++ + P P QP V + Sbjct: 1158 PQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSN 1217 Query: 156 QPVVQ-----QPVAPQPVTPTVAQPQPAAPQQ 182 +P + + V T + + Sbjct: 1218 KPKNRHRRSVRSVPHNVEPATTSSNDRSTVAL 1249 Score = 32.7 bits (74), Expect = 0.002 Identities = 26/155 (16%), Positives = 40/155 (25%), Gaps = 36/155 (23%) Query: 92 QYQPPYERQMQQP---------ARPEEPVRQPPQPP---------------RQAPVPPQQ 127 P Q P AR +E PP P V + Sbjct: 996 NITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNE 1055 Query: 128 QPAPHAAPQPGWQQPQPAQPPVQPQHQPQPVVQQPVAPQPVTPTVAQPQP---------- 177 Q A Q + + A+ V+ Q V Q + T + Sbjct: 1056 QDATETTAQNR-EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKV 1114 Query: 178 -AAPQQPAPQPVAASQPAVAEPQPVEPQQPAAPQP 211 Q P+ + P + + V+PQ A + Sbjct: 1115 ETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREN 1149
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 35.1 bits (81), Expect = 9e-05 Identities = 20/94 (21%), Positives = 33/94 (35%), Gaps = 15/94 (15%) Query: 12 IQVESRTALENLDAILEVDGIDGVFIGPADL----------SASLGYPDDAGHPDVQRVI 61 I VE + + + +D IG DL + + Y HP + R++ Sbjct: 429 IMVEIPSTAVAANLFAKE--VDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPAILRLV 486 Query: 62 EQSIRRIRAAGKAAGF---LAVDPAMAEKCLAWG 92 + I+ + GK G +A D L G Sbjct: 487 DMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLG 520
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 30.2 bits (68), Expect = 0.013 Identities = 23/119 (19%), Positives = 46/119 (38%), Gaps = 6/119 (5%) Query: 8 GFSRGDLGFALSGISIAYGFSK-FIMGSVSDRSNPRIFLPAGLILAALVMLVMGFVPWAT 66 + +G +L+ I + ++ I G V+ R R L G+I +++ F Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 67 SSIMIMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLF 125 + IM +L G+G P + ++ +G + ++ + PLLF Sbjct: 302 MAFPIMVLLASG-----GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLF 355
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 28.8 bits (64), Expect = 0.039 Identities = 13/54 (24%), Positives = 23/54 (42%), Gaps = 4/54 (7%) Query: 252 NYSYDWMFKPGAMAQIAQYADGIGPDYHMLVAEGSKPGAVKLTAMVKEAHASHL 305 +Y YD+ F A+ +G Y+ L + K + + A+ A + HL Sbjct: 1143 SYGYDFAFFRNALVLKPS----VGVSYNHLGSTNFKSNSNQKVALKNGASSQHL 1192
>PF06580#Sensor histidine kinase Length = 349 Score = 213 bits (543), Expect = 2e-67 Identities = 59/216 (27%), Positives = 117/216 (54%), Gaps = 3/216 (1%) Query: 239 LGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTLKAVIRRDSDQA 298 L G + + + ++ ++++ L AQ+NPHF+FNALN ++A+I D +A Sbjct: 134 LYFGWHFFKNYKQAEIDQWKMASMAQEAQLMALKAQINPHFMFNALNNIRALILEDPTKA 193 Query: 299 GQLVQYLSTFFRKNLKR-PTEIVTLADEIEHVNAYLQIEKARFQANLQIQMAVPEGLAHH 357 +++ LS R +L+ V+LADE+ V++YLQ+ +F+ LQ + + + Sbjct: 194 REMLTSLSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDV 253 Query: 358 QLPAFTLQPIVENAIKHGTSQHLGVGEITIRASQDDRWLQLDIEDNAGL-YRANPQASGL 416 Q+P +Q +VEN IKHG +Q G+I ++ ++D+ + L++E+ L + +++G Sbjct: 254 QVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKESTGT 313 Query: 417 GMNLVDRRLRARFGADCGISVTCEPERFTRVTLRLP 452 G+ V RL+ +G + I ++ + + + +P Sbjct: 314 GLQNVRERLQMLYGTEAQIKLSEKQGKVN-AMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 70.6 bits (173), Expect = 3e-16 Identities = 36/168 (21%), Positives = 70/168 (41%), Gaps = 10/168 (5%) Query: 3 RVLIVDDEPLARENLRILLETQRDIEIVGECGNAVEAIGAVHKLRPDVLFLDIQMPRISG 62 +L+ DD+ R L L + ++ NA + D++ D+ MP + Sbjct: 5 TILVADDDAAIRTVLNQALS-RAGYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 63 LEMVGMLDPEHRPYI--VFLTAFD--EYAVKAFEEHAFDYLLKPIEAARLEKTLARLRQE 118 +++ + + RP + + ++A + A+KA E+ A+DYL KP + L + R E Sbjct: 63 FDLLPRIK-KARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121 Query: 119 RNLQDVSLLDDAQQTLKYIPCTGHSRIWLLQMEDVAFVSSRMSGIYVT 166 + L DD+Q + + G S +A + + +T Sbjct: 122 PKRRPSKLEDDSQDGMPLV---GRSAAMQEIYRVLARLMQTDLTLMIT 166
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.8 bits (77), Expect = 1e-04 Identities = 15/85 (17%), Positives = 35/85 (41%), Gaps = 7/85 (8%) Query: 57 DEQLWVAECDGQPVGFAAV---WTADNFLHHLFVDPDWQGKHIGSALLAQVERTFTASGT 113 + ++ + +G + W + + V D++ K +G+ALL + + Sbjct: 64 GKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHF 123 Query: 114 LKCLMENKN----ALRFYQRHGWTI 134 ++E ++ A FY +H + I Sbjct: 124 CGLMLETQDINISACHFYAKHHFII 148
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 34.7 bits (80), Expect = 0.001 Identities = 27/105 (25%), Positives = 46/105 (43%), Gaps = 17/105 (16%) Query: 13 QAARGESPFDLLLIDAQIVDMATGEIRPADVGIVGEMIASVHPRGSRE----------DA 62 Q R D ++ +A I+D G I AD+G+ IA++ G+ + Sbjct: 60 QVTREGGAVDTVITNALILD-HWG-IVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPG 117 Query: 63 HEVRSMAGGYLSPGLMDTHVHLESSHLPPERYAEIVLTQGTTAVF 107 EV + G ++ G MD+H+H + P++ E L G T + Sbjct: 118 TEVIAGEGKIVTAGGMDSHIHF----ICPQQ-IEEALMSGLTCML 157
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 76.4 bits (188), Expect = 4e-18 Identities = 31/148 (20%), Positives = 67/148 (45%), Gaps = 3/148 (2%) Query: 11 PRILIVEDEPKLGQLLIDYLQAAGYAPALINHGDKVLPYVRQTPPHLILLDLMLPGTDGL 70 IL+ +D+ + +L L AGY + ++ + ++ L++ D+++P + Sbjct: 4 ATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 71 TLCREIR-RFSDVPVVMVTAKIEEIDRLLGLEIGADDYICKPYSPREVVARVKTIL--RR 127 L I+ D+PV++++A+ + + E GA DY+ KP+ E++ + L + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 128 CKPQRDLQALDAQSPLIVDEGRFQASWR 155 +P + PL+ Q +R Sbjct: 124 RRPSKLEDDSQDGMPLVGRSAAMQEIYR 151
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 35.2 bits (81), Expect = 4e-04 Identities = 23/77 (29%), Positives = 31/77 (40%), Gaps = 8/77 (10%) Query: 79 LLAALATFPLARGLLAPVKRLVEGTHKLAA------GDFST--RVTVTGGDELGRLAQDF 130 +A + P L+A V+ V H LA G F V G+ G L Sbjct: 93 AVAKQSEKPHLSQLMAAVRSKVMEGHSLADAMKCFPGSFERLYCAMVAAGETSGHLDAVL 152 Query: 131 NQLASTLERNQQMRRDL 147 N+LA E+ QQMR + Sbjct: 153 NRLADYTEQRQQMRSRI 169
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 123 bits (311), Expect = 5e-33 Identities = 93/436 (21%), Positives = 181/436 (41%), Gaps = 19/436 (4%) Query: 20 FMQSLDTTIVNTALPSMAKSLGESPLHMHMIIVSYVLTVAVMLPASGWLADRVGVRNIFF 79 F L+ ++N +LP +A + P + + +++LT ++ G L+D++G++ + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 80 TAIVLFTAGSLFCAQA-STLDQLVMARVLQGVGGAMMVPVGRLTVMKIVPRDQYMAAMTF 138 I++ GS+ S L+MAR +QG G A + + V + +P++ A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 139 VTLPGQVGPLLGPALGGVLVEYASWHWIFLINIP-VGIVGAIATLCLMPNYTMQTRRFDL 197 + +G +GPA+GG++ Y HW +L+ IP + I+ + L+ FD+ Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201 Query: 198 SGFLLLAAGMATLTLALDGQKGLGISPAWLAGLVAVGLCALLLYLWHARGNARALFSLNL 257 G +L++ G+ L + ++ + V + + L+++ H R L Sbjct: 202 KGIILMSVGIVFFMLF---------TTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGL 252 Query: 258 FRNRTFSLGLGGSFAGRIGSGMLPFMTPVFLQIGLGFSPFHAG-LMMIPMVLGSMGMKRI 316 +N F +G+ M P ++ S G +++ P + + I Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312 Query: 317 VVQVVNRFGYRRVL-VASTLGLAAVSLLFMFSALAGWYYALPLVLFLQGMINASRFSSMN 375 +V+R G VL + T + W+ + +V L G+ + + ++ Sbjct: 313 GGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL--SFTKTVIS 370 Query: 376 TLTLKDLPDDLASSGNSLLSMVMQLSMSIGVTIAGLLLGLYGQQHMSLDAASTHQVFLYT 435 T+ L A +G SLL+ LS G+ I G LL + L +LY+ Sbjct: 371 TIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYS 430 Query: 436 --YLSMAAIIALPALI 449 L + II + L+ Sbjct: 431 NLLLLFSGIIVISWLV 446
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 890 bits (2301), Expect = 0.0 Identities = 281/1035 (27%), Positives = 503/1035 (48%), Gaps = 36/1035 (3%) Query: 6 LFIYRPVATILISLAITLCGILGFRLLPVAPLPQVDFPVIMVSASLPGASPETMASSVAT 65 FI RP+ ++++ + + G L LPVA P + P + VSA+ PGA +T+ +V Sbjct: 4 FFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQ 63 Query: 66 PLERSLGRIAGVNEMTSSS-SLGSTRIILEFNFDRDINGAARDVQAAINAAQSLLPSGMP 124 +E+++ I + M+S+S S GS I L F D + A VQ + A LLP + Sbjct: 64 VIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQ 123 Query: 125 SRPTYRKANPSDAPIMILTLTSDT--YSQGELYDFASTQLAQTIAQIDGVGDVDVGGSSL 182 + S + +M+ SD +Q ++ D+ ++ + T+++++GVGDV + G+ Sbjct: 124 -QQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY 182 Query: 183 PAVRVDLNPQALFNQGVSLDAVRTAISDANVRKPQG------ALEDSAHRWQVQTNDELK 236 A+R+ L+ L ++ V + N + G AL + K Sbjct: 183 -AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 237 TAADYQPLIVHF-QNGAAVRLGDVATVSDSVQDVRNAGMTNAKPAILLMIRKLPEANIIQ 295 ++ + + +G+ VRL DVA V ++ N KPA L I+ AN + Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALD 301 Query: 296 TVDSIRARLPELQQTIPAAIDLQIAQDRSPTIRASLEEVEQTLVISVALVILVVFLFLRS 355 T +I+A+L ELQ P + + D +P ++ S+ EV +TL ++ LV LV++LFL++ Sbjct: 302 TAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQN 361 Query: 356 GRATLIPAVAVPVSLIGTFAAMYLCGFSLNNLSLMALTIATGFVVDDAIVVLENISRHL- 414 RATLIP +AVPV L+GTFA + G+S+N L++ + +A G +VDDAIVV+EN+ R + Sbjct: 362 MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMM 421 Query: 415 EAGMKPLQAALQGSREVGFTVLSMSLSLVAVFLPLLLMGGLPGRLLREFAVTLSVAIGIS 474 E + P +A + ++ ++ +++ L AVF+P+ GG G + R+F++T+ A+ +S Sbjct: 422 EDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALS 481 Query: 475 LAVSLTLTPMMCGWLLKSGKPHQPTRNRGFG----RLLVAVQGGYGKSLKWVLKHSRLTG 530 + V+L LTP +C LLK GF Y S+ +L + Sbjct: 482 VLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYL 541 Query: 531 LVVLGTIALSVWLYISIPKTFFPEQDTGVLMGGIQADQSISFQ----AMRGKLQDFMKII 586 L+ +A V L++ +P +F PE+D GV + IQ + + + ++K Sbjct: 542 LIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNE 601 Query: 587 R-EDPAVDNVTGFT-GGSRVNSGMMFITLKPRDQRH---ETAQQVIDRLRKKLANEPGAN 641 + +V V GF+ G N+GM F++LKP ++R+ +A+ VI R + +L Sbjct: 602 KANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 642 LFLMAVQDIRVGGRQSNASYQYTLLSDDLSALREWEPKIRKALAAL-----PELADVNSD 696 + + I G + ++ L D + + R L + L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFE---LIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPN 718 Query: 697 QQDNGAEMDLVYDRDTMSRLGISVQDANNLLNNAFGQRQISTIYQPLNQYKVVMEVDPAY 756 ++ A+ L D++ LG+S+ D N ++ A G ++ K+ ++ D + Sbjct: 719 GLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKF 778 Query: 757 TQDVSALDKMFVINSDGKPIPLAYFAKWQPANAPLSVNHQGLSAASTISFNLPTGRSLSE 816 +DK++V +++G+ +P + F + + I G S + Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGD 838 Query: 817 ASEAIDRAMTQLGVPSSVRGSFAGTAQVFQQTMNAQVILILAAIATVYIVLGVLYESYVH 876 A ++ ++L P+ + + G + + + N L+ + V++ L LYES+ Sbjct: 839 AMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSI 896 Query: 877 PLTILSTLPSAGVGALLALEIFDAPFSLIALIGIMLLIGIVKKNAIMMVDFALEAQRNGN 936 P++++ +P VG LLA +F+ + ++G++ IG+ KNAI++V+FA + Sbjct: 897 PVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEG 956 Query: 937 LTPEEAIFQACLLRFRPIMMTTLAALFGALPLVLSGGDGSELRQPLGITIVGGLVMSQLL 996 EA A +R RPI+MT+LA + G LPL +S G GS + +GI ++GG+V + LL Sbjct: 957 KGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLL 1016 Query: 997 TLYTTPVVYLFFDRL 1011 ++ PV ++ R Sbjct: 1017 AIFFVPVFFVVIRRC 1031
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 899 bits (2324), Expect = 0.0 Identities = 294/1036 (28%), Positives = 507/1036 (48%), Gaps = 29/1036 (2%) Query: 13 SRLFIMRPVATTLLMVAILLAGIIGYRFLPVSALPEVDYPTIQVVTLYPGASPDVVTSAI 72 + FI RP+ +L + +++AG + LPV+ P + P + V YPGA V + Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 73 TAPLERQFGQMSGLKQMSSQS-SGGASVVTLQFQLTLPLDVAEQEVQAAINAATNLLPSD 131 T +E+ + L MSS S S G+ +TL FQ D+A+ +VQ + AT LLP + Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 132 LPNPPVYSKVNPADPPIMTLAVTSSAIPMTQVE--DMVETRVAQKISQVSGVGLVTLAGG 189 + + S + +M S TQ + D V + V +S+++GVG V L G Sbjct: 122 VQQQGI-SVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 190 QRPAVRVKLNAQAIAALGLTSETVRTAITSANVNSAKGSLDGP------ARAVTLSANDQ 243 Q A+R+ L+A + LT V + N A G L G ++ A + Sbjct: 181 QY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 244 MQSAEDYRRLII-AYQNGAPIRLGDVASVEQGAENSWLGAWANQQRAIVMNVQRQPGANI 302 ++ E++ ++ + +G+ +RL DVA VE G EN + A N + A + ++ GAN Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 303 IDTADSIRQMLPQLTESLPKSVKVQVLSDRTTNIRASVRDTQFELMLAIALVVMIIYLFL 362 +DTA +I+ L +L P+ +KV D T ++ S+ + L AI LV +++YLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 363 RNVPATIIPGVAVPLSLVGTFAVMVFLDFSINNLTLMALTIATGFVVDDAIVVIENISRY 422 +N+ AT+IP +AVP+ L+GTFA++ +SIN LT+ + +A G +VDDAIVV+EN+ R Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 423 I-EKGEKPLAAALKGAGEIGFTIISLTFSLIAVLIPLLFMGDIVGRLFREFAVTLAVAIL 481 + E P A K +I ++ + L AV IP+ F G G ++R+F++T+ A+ Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 482 FSAVVSLTLTPMMCARML---SHESLRKQNRFSRASERFFERVIAVYGRWLSRVLNHPWL 538 S +V+L LTP +CA +L S E + F F+ + Y + ++L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 539 TLGVALSTLALSIILWVFIPKGFFPIQDNGIIQGTLQAPQSVSFASMAERQRQVASIILK 598 L + +A ++L++ +P F P +D G+ +Q P + + QV LK Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 599 DPA--VESLTSFVGVDGTNPALNSARLQINLKPLDERDDR---VQTVISRLQQAVDGVPG 653 + VES+ + G + A N+ ++LKP +ER+ + VI R + + + Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 654 VALYLQPTQDLTIDTTVSRTQYQFTLQ---ANSLEALSTWVPPLLSRLQAQP-QLADVSS 709 ++ P I + T + F L +AL+ LL P L V Sbjct: 660 G--FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 710 DWQDKGLAAYIKVDRDSASRLGISMADVDNALYNAFGQRLISTIYTQANQYRVVLEQDTE 769 + + ++VD++ A LG+S++D++ + A G ++ + ++ ++ D + Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 770 ATPGLAALENIRLTSSDGGIVPLTAIATVEQRFTPLSVNHLDQFPVTTISFNVPDNYSLG 829 ++ + + S++G +VP +A T + + + P I S G Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 830 EAVEAILAAEQSLDFPTDIRTQFQGSSLAFQSALGSTVWLVVAAVVAMYIVLGVLYESFI 889 +A+ + L P I + G S + + LV + V +++ L LYES+ Sbjct: 838 DAMALMENLASKL--PAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 890 HPITILSTLPTAGVGALLALWLAGSELDVIAIIGIILLIGIVKKNAIMMIDFALAAEREQ 949 P++++ +P VG LLA L + DV ++G++ IG+ KNAI++++FA ++ Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 950 GMPPREAIYQACLLRFRPILMTTLAALLGALPLMLSTGVGAELRRPLGIGMVGGLMLSQV 1009 G EA A +R RPILMT+LA +LG LPL +S G G+ + +GIG++GG++ + + Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015 Query: 1010 LTLFTTPVIYLLFDRL 1025 L +F PV +++ R Sbjct: 1016 LAIFFVPVFFVVIRRC 1031
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 44.4 bits (105), Expect = 6e-07 Identities = 26/139 (18%), Positives = 52/139 (37%), Gaps = 16/139 (11%) Query: 55 GAALAPVQAATATEEAVPRYLTGLGTVTAA-NTVTVRSRVDGQLLSLHFQEGQQVKAGDL 113 +A + + E V T G +T + + ++ + + + +EG+ V+ GD+ Sbjct: 67 FLVIAFILSVLGQVEIV---ATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDV 123 Query: 114 LAQIDPSQFKVALAQAQGQLAKDQATLANARRDLARYQQLVKTNLVSRQELDTQQSLVVE 173 L ++ A+ K Q++L AR + RYQ L ++ EL+ L + Sbjct: 124 LLKLTAL-------GAEADTLKTQSSLLQARLEQTRYQILSRS-----IELNKLPELKLP 171 Query: 174 SAGTVKADEAAVASAQLQL 192 + L Sbjct: 172 DEPYFQNVSEEEVLRLTSL 190 Score = 37.5 bits (87), Expect = 8e-05 Identities = 26/170 (15%), Positives = 63/170 (37%), Gaps = 17/170 (10%) Query: 125 ALAQAQGQLAKDQATLANARRDLARYQQLVKTNLVSRQELDTQQSLVVESAGTVKADEAA 184 +A +L ++ L ++ ++ + ++ +++ + Sbjct: 260 KYVEAVNELRVYKSQLEQIESEILSAKEEY------QLVTQLFKNEILDKLRQTTDNIGL 313 Query: 185 V----ASAQLQLDWTRITAPIDGRV-GLKQVDIGNQISSGDTTGIVVLTQTHPIDVVFTL 239 + A + + + I AP+ +V LK G +++ +T ++V ++V + Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPED-DTLEVTALV 372 Query: 240 PESSIATVVQAQKAGKTLSVEAWDRTNKQKISVGE--LLSLDNQIDATTG 287 I + Q A + VEA+ T + G+ ++LD D G Sbjct: 373 QNKDIGFINVGQNA--IIKVEAFPYTRYGYLV-GKVKNINLDAIEDQRLG 419
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 65.6 bits (160), Expect = 1e-14 Identities = 32/141 (22%), Positives = 61/141 (43%), Gaps = 6/141 (4%) Query: 4 RLAIIEDNADLLDELLAWLGYRGFEVWGTRSAEAFWRQLHSHPVDIVLVDIGLPGEDGFS 63 + + +D+A + L L G++V T +A WR + + D+V+ D+ +P E+ F Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 VLNYLHELGHY-GLVVVSARGQQQDKLQALSLGADAYLIKPVNFAH-LAETLTALGARLR 121 +L + + ++V+SA+ ++A GA YL KP + + AL R Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 QDRP----AAPPAEAIGTPPA 138 + + +G A Sbjct: 125 RPSKLEDDSQDGMPLVGRSAA 145
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 45.9 bits (109), Expect = 1e-07 Identities = 34/129 (26%), Positives = 57/129 (44%), Gaps = 20/129 (15%) Query: 96 AMMLH-IKLQAESQLPEQIDQAVIGRPINFQGLGGDEANAQAQGILERAAHRAGFRDVVF 154 M+ H IK + + ++ P+ + E A + +A AG R+V Sbjct: 89 KMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQV---ERRA-----IRESAQGAGAREVFL 140 Query: 155 QFEPVAAGLDFEATLSEEKRVLVVDIGGGTTDCSLLLMGPQWRERADRQQSLLGHSGCRI 214 EP+AA + +SE +VVDIGGGTT+ +++ + ++ S RI Sbjct: 141 IEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN-----------GVVYSSSVRI 189 Query: 215 GGNDLDIAL 223 GG+ D A+ Sbjct: 190 GGDRFDEAI 198 Score = 35.1 bits (81), Expect = 4e-04 Identities = 30/121 (24%), Positives = 50/121 (41%), Gaps = 22/121 (18%) Query: 296 RLSYRLV---RSAEESKIALSSA--ASVETALPFIQDDLATA------IAQQGLEAALDQ 344 R +Y + +AE K + SA + +LA + + AL + Sbjct: 203 RRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFTLNSNEILEALQE 262 Query: 345 PLTRIMEQVRLALDSSQTTPDV--------IYLTGGSARSPLIKKALAAQLPGIPLAGGD 396 PLT I+ V +AL+ Q P++ + LTGG A + + L + GIP+ + Sbjct: 263 PLTGIVSAVMVALE--QCPPELASDISERGMVLTGGGALLRNLDRLLMEET-GIPVVVAE 319 Query: 397 D 397 D Sbjct: 320 D 320
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 37.4 bits (86), Expect = 2e-04 Identities = 21/154 (13%), Positives = 50/154 (32%), Gaps = 5/154 (3%) Query: 248 AVNYLAQNIDRQAAQDAKSLEFLNNQLPKVRSDLDVAEDKLNQFRRLNDSVDLSLEAKSV 307 + + + D+ ++ L + + + E L + + ++ Sbjct: 194 ELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLE- 252 Query: 308 LDQIVNVDNQLNELTFRESEISQLYTKEHPTYKALLEKRKTLQDEKSKLNKRVTAMPETQ 367 + ++ + EL T + K L ++ L+ EK+ L + + Sbjct: 253 -AEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVL---N 308 Query: 368 QEILRLSRDVESGRAVYMQLLNRQQELSIAKSSA 401 L RD+++ R QL Q+L + Sbjct: 309 ANRQSLRRDLDASREAKKQLEAEHQKLEEQNKIS 342
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 150 bits (381), Expect = 6e-46 Identities = 70/281 (24%), Positives = 121/281 (43%), Gaps = 34/281 (12%) Query: 1 MKIIVTGGAGFIGSAVVRHIINNTQDEVIVLDCLT--YAGNL-ESLLPVAKNPRFYFENV 57 MK +VTG AGFIG V + ++ +V+ +D L Y +L ++ L + P F F + Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAG-HQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKI 59 Query: 58 NICDRKELDRVFREYQPGAVMHLAAESHVDRSIDGPAAFIETNIVGTYTLLEATRHYWNS 117 ++ DR+ + +F V V S++ P A+ ++N+ G +LE RH Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN--- 116 Query: 118 LDGKIKESFRFHHISTDEVYGDLHGTEDLFTESTPYA-PSSPYSASKASSDHLVRAWLRT 176 KI+ + S+ VYG + F+ P S Y+A+K +++ + + Sbjct: 117 ---KIQ---HLLYASSSSVYGL--NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHL 168 Query: 177 YGLPTIVTNCSNNYGPYHFPEKLIPLIILNALDGKPLPVYGDGGQIRDWLYVEDHARALY 236 YGLP YGP+ P+ + L+GK + VY G RD+ Y++D A A+ Sbjct: 169 YGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAII 228 Query: 237 KVV------------------TEGLVGETYNIGGHNERKIL 259 ++ YNIG + +++ Sbjct: 229 RLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELM 269
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 34.4 bits (79), Expect = 7e-06 Identities = 8/29 (27%), Positives = 15/29 (51%) Query: 6 DAQKILNELGWQPLETFESGIRKTVEWYL 34 D + + +G+ P T + G++ V WY Sbjct: 301 DTKALYEVIGFTPETTVKDGVKNFVNWYR 329
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 53.6 bits (129), Expect = 2e-10 Identities = 49/239 (20%), Positives = 87/239 (36%), Gaps = 49/239 (20%) Query: 1 MRILLTGANGQLGRCFQDR-VEAGWEVRATDA------------------------NELD 35 M+ L+TGA G +G R +EAG +V D +++D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 36 ITNLKDVVNAIEQFKPDIVVNAAAYTAVDKAESDEVIAEAVNKTGPYNLALAAKNIG-AL 94 + + + + + + V + AV + + N TG N+ ++ Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 95 FIHISTDYVFDGLSNKKYKENDQTN-PLSIYGKTKLAGEI---AVTDIYD-KTIIIRTAW 149 ++ S+ V+ + +D + P+S+Y TK A E+ + +Y +R Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180 Query: 150 VFSEYGN------NFVKTM-----IRLAKDRDQLKIVSDQFGCPTYAGDIASAIIKLIK 197 V+ +G F K M I + + F TY DIA AII+L Sbjct: 181 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKR----DF---TYIDDIAEAIIRLQD 232
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 279 bits (716), Expect = 6e-98 Identities = 124/156 (79%), Positives = 136/156 (87%) Query: 1 MKFLVTGAAGFIGFHVSKRLLNDGHQVVGIDNINDYYDVKLKESRLEQLESPSFTFYKLD 60 MK+LVTGAAGFIGFHVSKRLL GHQVVGIDN+NDYYDV LK++RLE L P F F+K+D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 61 LADRDGMAKLFETEQFERVIHLAAQAGVRYSLENPYAYADSNLTGYLNILEGCRHNKVQH 120 LADR+GM LF + FERV + VRYSLENP+AYADSNLTG+LNILEGCRHNK+QH Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKK 156 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKK Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKK 156
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 299 bits (766), Expect = e-105 Identities = 140/173 (80%), Positives = 157/173 (90%) Query: 1 MAHTYSHLYSIPTTGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYI 60 MAHTYSHLY +P TGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYI Sbjct: 161 MAHTYSHLYGLPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYI 220 Query: 61 DDIVEAIVRIQDVIPQPDHEWTVEEGSPATSSAPYRVYNIGNSSPVELMDYINALEQALG 120 DDI EAI+R+QDVIP D +WTVE G+PA S APYRVYNIGNSSPVELMDYI ALE ALG Sbjct: 221 DDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALG 280 Query: 121 LEAKKNMMPIQPGDVLNTSAETQALYKTIGFKPETPVQQGVKNFVDWYKEYYQ 173 +EAKKNM+P+QPGDVL TSA+T+ALY+ IGF PET V+ GVKNFV+WY+++Y+ Sbjct: 281 IEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDFYK 333
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 585 bits (1511), Expect = 0.0 Identities = 265/334 (79%), Positives = 304/334 (91%) Query: 1 MKFLVTGAAGFIGFHTCKRLLNAGHEVVGLDNMNDYYDVNLKQARLDLLQSPLFRFHKID 60 MK+LVTGAAGFIGFH KRLL AGH+VVG+DN+NDYYDV+LKQARL+LL P F+FHKID Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 61 LADRDGVAQLFANEKFNRVIHLAAQAGVRYSLENPFAYADSNLIGYLNILEGCRHNQVEH 120 LADR+G+ LFA+ F RV + VRYSLENP AYADSNL G+LNILEGCRHN+++H Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGIPTTGLRFFT 180 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG+P TGLRFFT Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGLRFFT 180 Query: 181 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIVEAIVRMQDIIPQPNPE 240 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDI EAI+R+QD+IP + + Sbjct: 181 VYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQ 240 Query: 241 WTVETGSPATSSAPYRVYNIGNSSPVELMDYITALEEALGMEAQKNMMPIQPGDVLDTSA 300 WTVETG+PA S APYRVYNIGNSSPVELMDYI ALE+ALG+EA+KNM+P+QPGDVL+TSA Sbjct: 241 WTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSA 300 Query: 301 DTQPLYDLVGFKPQTSVKDGVKNFVDWYKDYYQI 334 DT+ LY+++GF P+T+VKDGVKNFV+WY+D+Y++ Sbjct: 301 DTKALYEVIGFTPETTVKDGVKNFVNWYRDFYKV 334
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 28.9 bits (64), Expect = 0.047 Identities = 22/135 (16%), Positives = 49/135 (36%), Gaps = 13/135 (9%) Query: 254 ALLHERMNAINNMSQMIDERDRTIHDQKCLIDERDRTIHDQKRLIDERDSTVLTQKNLID 313 L MN S I + + E ++ + + + T + Sbjct: 232 KALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKA 291 Query: 314 ERDLVSAQ---QNQLIEQNNKTIQQQIQNVTDLNSQVSSKEQKVDELQNQNIKLISLIDE 370 + A Q+Q++ N +++++ + + Q+ ++ QK++E QN+ I E Sbjct: 292 ALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEE-QNK-------ISE 343 Query: 371 KDLHIAQLSADLERA 385 L DL+ + Sbjct: 344 ASR--QSLRRDLDAS 356
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 35.4 bits (81), Expect = 6e-04 Identities = 23/92 (25%), Positives = 39/92 (42%), Gaps = 2/92 (2%) Query: 2 NHDYLARIAALEDTLRQKDSQLSLVAETESFLRSALARAEEKIENEEREIEHLRAQIEKL 61 AR A LE L + + + L + A E + + E + + L A + L Sbjct: 255 KAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSL 314 Query: 62 RRMLFGTRSEKLRRQVEEAEALLKQQEQQSDR 93 RR L +R K +Q+E L++Q + S+ Sbjct: 315 RRDLDASREAK--KQLEAEHQKLEEQNKISEA 344
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 27.5 bits (61), Expect = 0.011 Identities = 14/84 (16%), Positives = 33/84 (39%), Gaps = 6/84 (7%) Query: 16 RLYRRKNKLQREIQDVEKKIRDNQKRVLLLDNLSDYIKPGMSVEAIQGIIASMKSDYEDR 75 Y++ NKL++ + +++ N++ + L ++ I + + I+ I E Sbjct: 385 SYYKKYNKLKKSEEAANEQLLQNEEELNYLYSVLTNINNADNYDEIEEIKK------ELI 438 Query: 76 VDDYIIKNAELSKERRDISKKLKV 99 YI ++ SK + Sbjct: 439 ETGYIKFKKIYKSKKSKTSKPMHF 462 Score = 26.4 bits (58), Expect = 0.025 Identities = 10/45 (22%), Positives = 17/45 (37%), Gaps = 1/45 (2%) Query: 13 EFVRLYRRKNKLQREIQDVEKKIRDNQKRVLLLDNLSDYIKPGMS 57 R ++ L ++ E K LL N+ +K G+S Sbjct: 311 NINRCTKKDKILNNTLKKCEDKDIFKLYGELLTANIYA-LKKGLS 354
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.8 bits (67), Expect = 0.021 Identities = 33/153 (21%), Positives = 57/153 (37%), Gaps = 35/153 (22%) Query: 78 LGGIIFGHFGDRLGRKRMLMMTVWMMGIATACIGLLPSFNQIGWWAPVLLVFLRAVQGFA 137 +G ++G D+LG KR+L + GI C G + F +G LL+ R +QG Sbjct: 64 IGTAVYGKLSDQLGIKRLL-----LFGIIINCFGSVIGF--VGHSFFSLLIMARFIQGA- 115 Query: 138 VGGEWGGAALLS---------VENAPQGKK-AFYSSGVQVGYGVGLLLSTGLVSLISSLT 187 G AA + + +GK S V +G GVG + + I Sbjct: 116 -----GAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYI---- 166 Query: 188 SDQQFLSWGWRVPFLFSVVLVLIALWIRNGMAE 220 W L ++ ++ ++ + + Sbjct: 167 --------HWSYLLLIPMITIITVPFLMKLLKK 191
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 28.7 bits (64), Expect = 0.018 Identities = 12/64 (18%), Positives = 22/64 (34%), Gaps = 3/64 (4%) Query: 10 EGVEKVLHSLEARKHKEDPERRQQRLAALAERDP-LAFERDKVKGAIRTDFI--LSAEIV 66 E +L + + E L +++ DP L + D I F+ + E+ Sbjct: 340 ENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVT 399 Query: 67 AITL 70 L Sbjct: 400 CALL 403
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 538 bits (1388), Expect = 0.0 Identities = 286/356 (80%), Positives = 318/356 (89%) Query: 1 MTRPVVASIDLLALRQNLQIVRRAAPGARLWAVVKANAYGHGVARVWSALSAADGFALLN 60 MTRP+ AS+DL AL+QNL IVR+AA AR+W+VVKANAYGHG+ R+WSA+ A DGFALLN Sbjct: 1 MTRPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLN 60 Query: 61 LEEAILLREQGWKGPILLLEGFFHADELAVLDQYRLTTSVHSNWQIKALQQAKLRAPLDI 120 LEEAI LRE+GWKGPIL+LEGFFHA +L + DQ+RLTT VHSNWQ+KALQ A+L+APLDI Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDI 120 Query: 121 YLKVNSGMNRLGCMPERVHTVWQQLRAISNVGEMTLMSHFAEAENPQGIVEPMRRIEQAA 180 YLKVNSGMNRLG P+RV TVWQQLRA++NVGEMTLMSHFAEAE+P GI M RIEQAA Sbjct: 121 YLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAMARIEQAA 180 Query: 181 EGLDCPRSLANSAATLWHPEAHFDWVRPGIVLYGASPSGQWQDIANTGLKPVMTLRSEII 240 EGL+C RSL+NSAATLWHPEAHFDWVRPGI+LYGASPSGQW+DIANTGL+PVMTL SEII Sbjct: 181 EGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEII 240 Query: 241 GVQNLRPGEAIGYGGLYRTTQEQRIGIVACGYADGYPRVAPSGTPVLVDGVRTTTVGRVS 300 GVQ L+ GE +GYGG Y EQRIGIVA GYADGYPR AP+GTPVLVDGVRT TVG VS Sbjct: 241 GVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTVS 300 Query: 301 MDMLAVDLTPCPQAGIGAPVELWGKEIKIDDVAASSGTVGYELMCALAPRVPVVTL 356 MDMLAVDLTPCPQAGIG PVELWGKEIKIDDVAA++GTVGYELMCALA RVPVVT+ Sbjct: 301 MDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPVVTV 356
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 501 bits (1291), Expect = e-180 Identities = 162/395 (41%), Positives = 245/395 (62%), Gaps = 11/395 (2%) Query: 7 VLVINCGSSSIKFSVLDAASCDCLLNGVAEGINAERASLSLNGGE---PVALAQRGYEGA 63 +LVINCGSSS+K+ ++++ + L G+AE I + L+ N + + ++ A Sbjct: 3 ILVINCGSSSLKYQLIESKDGNVLAKGLAERIGINDSLLTHNANGEKIKIKKDMKDHKDA 62 Query: 64 LQAIAGVLAQRDL-----IDSVALIGHRVAHGGDLFTESVIISEEVINNIRQVSSLAPLH 118 ++ + L D + + +GHRV HGG+ FT SV+I+++V+ I LAPLH Sbjct: 63 IKLVLDALVNSDYGVIKDMSEIDAVGHRVVHGGEYFTSSVLITDDVLKAITDCIELAPLH 122 Query: 119 NYASLSGIASAQRLFPEVMQVAVFDTSFHQTLAPEAFLYGLPWEYYQNLGVRRYGFHGTS 178 N A++ GI + ++ P+V VAVFDT+FHQT+ A+LY +P+EYY +R+YGFHGTS Sbjct: 123 NPANIEGIKACTQIMPDVPMVAVFDTAFHQTMPDYAYLYPIPYEYYTKYKIRKYGFHGTS 182 Query: 179 HRYVSQRALALLGLPEQESGLVIAHLGNGASICAVRNGRSVDTSMGMTPLEGLMMGTRSG 238 H+YVSQRA +L P + ++ HLGNG+SI AV+NG+S+DTSMG TPLEGL MGTRSG Sbjct: 183 HKYVSQRAAEILNKPIESLKIITCHLGNGSSIAAVKNGKSIDTSMGFTPLEGLAMGTRSG 242 Query: 239 DVDFGAMAWIAGETRQTLSDLERVANTASGLLGISGLSSDLR-VLEQAWHEGHARARLAI 297 +D ++++ + + ++ + N SG+ GISG+SSD R + + A+ G RA+LA+ Sbjct: 243 SIDPSIISYLMEKENISAEEVVNILNKKSGVYGISGISSDFRDLEDAAFKNGDKRAQLAL 302 Query: 298 KTFVHRIARHIAGHAAALQRLDGIIFTGGIGENSVLIRRLVSERLTVFGLAMDAARNQQP 357 F +R+ + I +AAA+ +D I+FT GIGEN IR + + L G +D +N+ Sbjct: 303 NVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREFILDGLEFLGFKLDKEKNKVR 362 Query: 358 NSAGERLISADGSRVRCAVIPTNEERMIALDAIRL 392 E +IS S+V V+PTNEE MIA D ++ Sbjct: 363 GE--EAIISTADSKVNVMVVPTNEEYMIAKDTEKI 395
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 44.2 bits (104), Expect = 3e-08 Identities = 21/64 (32%), Positives = 26/64 (40%), Gaps = 2/64 (3%) Query: 73 STWLGRNGIYMEDLYVTPDYRGIGAGKALLKTIAQYAVQRQCGRLEWSVLDWNQPAIDFY 132 S W G +ED+ V DYR G G ALL ++A + L D N A FY Sbjct: 84 SNWNGY--ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFY 141 Query: 133 LSIG 136 Sbjct: 142 AKHH 145
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 37.9 bits (88), Expect = 5e-05 Identities = 74/369 (20%), Positives = 138/369 (37%), Gaps = 62/369 (16%) Query: 12 LMRPIGAIVLGAYIDKVGRRKGLIVTLSIMATGTFLIVLIPSYQTIGLWAPLLVLIGRLL 71 LM+ A VLGA D+ GRR L+V+L+ A ++ P LW ++ IGR++ Sbjct: 54 LMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPF-----LW---VLYIGRIV 105 Query: 72 QGFSAGAELGGVSVYLAEIATPGRKGFYTSWQSGSQQVAIMVAAAMGFALNAVLEPSAIS 131 G + GA Y+A+I + + + S ++ +G + Sbjct: 106 AGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGF------- 157 Query: 132 DWGWRIPFLFGCLIVPFIFIL------------RR--KLEETQEFTARRHHLAMRQVFAT 177 PF + F+ RR + E + R M V A Sbjct: 158 --SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAAL 215 Query: 178 LLANWQVVIAGMMMVAMTTTAFYLITVYAPTFGKKVLMLSASD-SLLVTLLVAISNFFWL 236 + V M +V A ++I FG+ A+ + + + + Sbjct: 216 M-----AVFFIMQLVGQVPAALWVI------FGEDRFHWDATTIGISLAAFGILHSLAQA 264 Query: 237 PVGGALSDRFGRRSVLIAMTLLALATAWPALTMLANAPSFLMMLSVLLWLSFIYGMYNGA 296 + G ++ R G R + + ++A T +LA A M +++ L+ G Sbjct: 265 MITGPVAARLGERR-ALMLGMIADGT---GYILLAFATRGWMAFPIMVLLAS-----GGI 315 Query: 297 MIPALTEIMPAEV------RVAGFSLAYSLATAVFGGFTPVISTALIEYTGDKASPGYWM 350 +PAL ++ +V ++ G A + T++ G P++ TA+ + + W+ Sbjct: 316 GMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVG---PLLFTAIYAASITTWNGWAWI 372 Query: 351 SFAAICGLL 359 + AA+ L Sbjct: 373 AGAALYLLC 381 Score = 36.0 bits (83), Expect = 2e-04 Identities = 39/157 (24%), Positives = 62/157 (39%), Gaps = 20/157 (12%) Query: 218 ASDSLLVTLLVAISNFFWLPVGGALSDRFGRRSVLIAMTLLALATAWPALTMLANAPSFL 277 + ++ L A+ F PV GALSDRFGRR VL L++LA A ++A AP Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVL----LVSLAGAAVDYAIMATAPFLW 97 Query: 278 MMLSVLLWLSFIYGMYNGAMIPALT----EIMPAEVRVAGFSLAYSLATAVFGGFT--PV 331 + L++ I GA +I + R F ++ G PV Sbjct: 98 V-----LYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF---MSACFGFGMVAGPV 149 Query: 332 ISTALIEYTGDKASPGYWMSFAAICGLLATCYLYRRS 368 + + ++ +P + + L C+L S Sbjct: 150 LGGLMGGFS--PHAPFFAAAALNGLNFLTGCFLLPES 184
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.011 Identities = 14/76 (18%), Positives = 32/76 (42%), Gaps = 15/76 (19%) Query: 18 FIKDENGENRYFHVIKVANPDLIKKDAAVTFEPTTNNKGLSAYAVKVIPESKYIYIAGER 77 ++ D G R++ V+ +L+ L + ++ E+ ++Y+AGER Sbjct: 698 YLFDITGNRRFWPVLVPGRANLV---------------WLQKFRGQLFAEALHLYLAGER 742 Query: 78 LKLTSIKSYVVYREEE 93 + + +R E+ Sbjct: 743 YFPSPEDEEIYFRPEQ 758
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 102 bits (256), Expect = 8e-29 Identities = 51/152 (33%), Positives = 70/152 (46%), Gaps = 13/152 (8%) Query: 7 DWAPPPPPRPVIKQVVQGPQTIRLDSMALFDTGKSTLKPGSTKLL--VNSLLGIKAKPGW 64 + AP P P VQ + L S LF+ K+TLKP L + S L Sbjct: 195 EAAPVVAPAPAPAPEVQT-KHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDG 253 Query: 65 LIVVAGHTDSIGNDKSNQQLSLKRAEAVRDWMRDTGDVPESCFAVQGYGASRPVASN--- 121 +VV G+TD IG+D NQ LS +RA++V D++ G +P + +G G S PV N Sbjct: 254 SVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKG-IPADKISARGMGESNPVTGNTCD 312 Query: 122 ------ETPEGRAQNRRVEISLVPQKDACLTP 147 + A +RRVEI + KD P Sbjct: 313 NVKQRAALIDCLAPDRRVEIEVKGIKDVVTQP 344
>PF05704#Capsular polysaccharide synthesis protein Length = 307 Score = 27.9 bits (62), Expect = 0.024 Identities = 7/12 (58%), Positives = 9/12 (75%) Query: 1 MKMKPIWICWWQ 12 M+ K I+ICW Q Sbjct: 66 MRQKYIFICWLQ 77
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.019 Identities = 23/74 (31%), Positives = 34/74 (45%), Gaps = 9/74 (12%) Query: 212 SRSAGQGKFEWG-SSQTTVHDGSAGGSTTPPTPIPEP-------RSIWGLPNPAPE-SLP 262 S++A Q E G S + G+ G+ P P PEP + W P PE ++P Sbjct: 87 SKAAAQVAREEGLESVAGIVMGAPAGAPAPKPPRPEPPPRPVVEKECWETIQPVPEHAVP 146 Query: 263 PVPGTPIPEEQEPN 276 P P P+ +EP+ Sbjct: 147 PSFWHPAPKGREPD 160
>SECA#SecA protein signature. Length = 901 Score = 57.6 bits (139), Expect = 1e-12 Identities = 16/28 (57%), Positives = 20/28 (71%) Query: 125 IDGTRPLIGRNDPCPCGSGKKFKKCCGQ 152 +GRNDPCPCGSGKK+K+C G+ Sbjct: 872 AQTGERKVGRNDPCPCGSGKKYKQCHGR 899 Score = 28.3 bits (63), Expect = 0.012 Identities = 8/15 (53%), Positives = 9/15 (60%) Query: 5 CPCGSALEYSSCCQR 19 CPCGS +Y C R Sbjct: 885 CPCGSGKKYKQCHGR 899
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 87.6 bits (217), Expect = 3e-21 Identities = 37/152 (24%), Positives = 61/152 (40%), Gaps = 3/152 (1%) Query: 10 ILIVEDEPVFRSLLHGWLTSLGATTFQAEDGKDALHKMTEVHPDLMICDISMPRMNGLEL 69 IL+ +D+ R++L+ L+ G + + DL++ D+ MP N +L Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 70 VETLRNRGEQLPILMISATENMADIAKALRLGVQDVLLKPVKDFDRLRETVYACLYPAMF 129 + ++ LP+L++SA KA G D L KP D L + L A Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPF-DLTELIGIIGRAL--AEP 122 Query: 130 SSRVEEEERLFEDWDALVSNPIAASRLLQELQ 161 R + E +D LV A + + L Sbjct: 123 KRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLA 154
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 28.8 bits (64), Expect = 0.018 Identities = 23/86 (26%), Positives = 36/86 (41%), Gaps = 2/86 (2%) Query: 99 NLGIDVKLVNQEWKTFLDTRHQGTYDVARAGWCADYNEPTSFLNTMLSDSSMNTAHYKSP 158 G+ V V +E+ F+ + + +G A Y E S ++ MLS S+ + A Sbjct: 54 GNGVYVSGVQREYDAFITNQLRAA-QTQSSGLTARY-EQMSKIDNMLSTSTSSLATQMQD 111 Query: 159 AFDKIMAESVKASDEAQRTAAYAKAE 184 F + A D A R A K+E Sbjct: 112 FFTSLQTLVSNAEDPAARQALIGKSE 137
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.0 bits (70), Expect = 0.008 Identities = 9/16 (56%), Positives = 11/16 (68%) Query: 55 VVGESGCGKSTFARAI 70 + GESG GK ARA+ Sbjct: 165 ITGESGTGKELVARAL 180
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 221 bits (563), Expect = 1e-74 Identities = 174/241 (72%), Positives = 198/241 (82%), Gaps = 7/241 (2%) Query: 1 MTLDLPRRFPWPTLLSVAIHGAVVAGLLYTSVHQVIEQPSPTQPIEITMVAPADLEPPPA 60 MTLDLPRRFPWPTLLSV IHGAVVAGLLYTSVHQVIE P+P QPI +TMV PADLEPP A Sbjct: 1 MTLDLPRRFPWPTLLSVCIHGAVVAGLLYTSVHQVIELPAPAQPISVTMVTPADLEPPQA 60 Query: 61 AQPVVEPVVEPEPEPEPEVAPEPPKEAPVVIHKPEPKPKPKPKPKPKPEKKVEQPKREVK 120 QP EPVVEPEPEPEP PEPPKEAPVVI KP+PKPKPKPKP K + EQPKR+VK Sbjct: 61 VQPPPEPVVEPEPEPEPI--PEPPKEAPVVIEKPKPKPKPKPKPVKKVQ---EQPKRDVK 115 Query: 121 PAAEPRPASPFENNNTAPARTAPSTSTAAAKPTVTAPSGPRAISRVQPSYPARAQALRIE 180 P E RPASPFEN A T+ + + A +KP + SGPRA+SR QP YPARAQALRIE Sbjct: 116 P-VESRPASPFENTAPA-RLTSSTATAATSKPVTSVASGPRALSRNQPQYPARAQALRIE 173 Query: 181 GTVRVKFDVSPDGRIDNLQILSAQPANMFEREVKSAMRRWRYQQGRPGTGVTMTIKFRLN 240 G V+VKFDV+PDGR+DN+QILSA+PANMFEREVK+AMRRWRY+ G+PG+G+ + I F++N Sbjct: 174 GQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAMRRWRYEPGKPGSGIVVNILFKIN 233 Query: 241 G 241 G Sbjct: 234 G 234
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 27.3 bits (60), Expect = 0.005 Identities = 6/41 (14%), Positives = 18/41 (43%), Gaps = 6/41 (14%) Query: 4 LSWIIFGLIAGILAKWIMP------GKDGGGFIVTVILGII 38 + I+ G I+G++ W+ K+ ++ ++ + Sbjct: 163 AAIIMRGYISGLMENWLFAPQSFDLKKEARDYVAILLEMYL 203
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 66.4 bits (162), Expect = 5e-14 Identities = 78/373 (20%), Positives = 136/373 (36%), Gaps = 30/373 (8%) Query: 16 VLLGSQFVFNIGFYAVVPFLALFLRDDMLLSGGLI---GLILGLRTFSQQGMFILGGTLA 72 V+L + + +G ++P L LRD ++ S + G++L L Q + G L+ Sbjct: 9 VILSTVALDAVGIGLIMPVLPGLLRD-LVHSNDVTAHYGILLALYALMQFACAPVLGALS 67 Query: 73 DRYGAKAIILAGCVVRVAGFLLLACGASLWPIILGACLTGVGGALFSPSIEALLARAGTH 132 DR+G + ++L + ++A LW + +G + G+ GA + Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGA-------VAGAYI 120 Query: 133 SQANGKRSRAEWFALFAVCGELGAVIGPVAGGVLSGIGFRHIALAGAGIFLLALAVLFFC 192 + RA F + C G V GPV GG++ G A A + L F Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFL 180 Query: 193 LPADGHTTTTRRRVPWWTPLRQPRFVAFILAYSSWLLSY------NQLYLALPV--EIQR 244 LP R PL R+ + ++ + + Q+ AL V R Sbjct: 181 LPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDR 240 Query: 245 SGGREQDLAPLFMLASLLIITLQLPLA-RFARRMGAVRILPVGFLLLSASFASVALFAAA 303 + +L Q + A R+G R L +G + + +A Sbjct: 241 FHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFA--- 297 Query: 304 PPAEGWLRLMPAAGFVTLLTLGQMLLVPAAKDLIPLFAEESTLGAHYGALATAGGCAVLA 363 GW+ P + LL G + + PA + ++ +E G G+LA + Sbjct: 298 --TRGWM-AFPI---MVLLASGGIGM-PALQAMLSRQVDEERQGQLQGSLAALTSLTSIV 350 Query: 364 GNLLLGHLLDLAL 376 G LL + ++ Sbjct: 351 GPLLFTAIYAASI 363
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.030 Identities = 10/20 (50%), Positives = 12/20 (60%) Query: 33 LLGPNGCGKSSLLRVLAGLR 52 L G G GKS+L+ L GL Sbjct: 601 LEGTGGIGKSTLINTLVGLD 620
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 37.1 bits (86), Expect = 6e-05 Identities = 36/162 (22%), Positives = 60/162 (37%), Gaps = 28/162 (17%) Query: 6 KVLILGASGGIGGEVARRLVADNWQVRA-----------LKRGAQMRDPEDGIQWIAGDA 54 K L+ GA+G IG V++RL+ QV LK+ + G Q+ D Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 55 LDGGQVAA--AAAGCDVIVH-----AV-----NPPGYRHWRQQVLPMLRNTLQAAERQR- 101 D + A+ + + AV NP Y + L N L+ + Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYAD--SNLTGFL-NILEGCRHNKI 118 Query: 102 ALVVLPGTVYNYGPDA-FPLIAEEAAQQPVTRKGAIRVAMEL 142 ++ + YG + P +++ PV+ A + A EL Sbjct: 119 QHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANEL 160
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 70.0 bits (171), Expect = 4e-17 Identities = 37/196 (18%), Positives = 57/196 (29%), Gaps = 14/196 (7%) Query: 19 RDQILDAAMAHFSRYGYEKTTVTDLAKAIGFSKAYIYKFFDSKQAIGEAICASRLEKIMV 78 R ILD A+ FS+ G T++ ++AKA G ++ IY F K + I I Sbjct: 13 RQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGE 72 Query: 79 AVSAAIADAPSASEK-----LRRLFR-ALTEAGSELFFE--DRKLYDIAAVAARDKWPST 130 A P L + +TE L E K + +A + Sbjct: 73 LELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQA--Q 130 Query: 131 EQYAGHLQQLIGQILVEGRQAGEFERKTPLDEATLAVYMVMC--PFINPVQLQYNLDTAP 188 I Q L +A L A+ M + Sbjct: 131 RNLCLESYDRIEQTLKHCIEAKML--PADLMTRRAAIIMRGYISGLMENWLFAPQSFDLK 188 Query: 189 TAAVLLASLILRSLSP 204 A +++L Sbjct: 189 KEARDYVAILLEMYLL 204
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 39.8 bits (93), Expect = 1e-05 Identities = 19/92 (20%), Positives = 35/92 (38%), Gaps = 9/92 (9%) Query: 70 GKVLERRVETGHSVKRGQLLLRLDPADLALQAQSQQRAVDAARARAKKAANDLARYRGLV 129 V E V+ G SV++G +LL+L Q ++ AR L + R + Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQAR---------LEQTRYQI 155 Query: 130 ASGAISAAEFDQINAAAEAARADLSAAQAQAN 161 S +I + ++ E ++S + Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEEVLRL 187 Score = 31.7 bits (72), Expect = 0.005 Identities = 18/84 (21%), Positives = 32/84 (38%), Gaps = 4/84 (4%) Query: 178 GVVVETLAEPGQVVSAGQVVIRLARAGQREARVQLPETLRPAVGSEALATRYGSESQPV- 236 +V E + + G+ V G V+++L G ++ +L A TRY S+ + Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQA---RLEQTRYQILSRSIE 161 Query: 237 TATLRLLSDAADATTRTFEARYVL 260 L L + + VL Sbjct: 162 LNKLPELKLPDEPYFQNVSEEEVL 185 Score = 30.2 bits (68), Expect = 0.013 Identities = 12/128 (9%), Positives = 37/128 (28%), Gaps = 15/128 (11%) Query: 103 SQQRAVDAARARAKKAANDLARYRGLVAS--GAISAAEFDQINAAAEA----------AR 150 ++ ++ + A N+L Y+ + I +A+ + Sbjct: 250 AKHAVLEQENKYVE-AVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTT 308 Query: 151 ADLSAAQAQANVAQNATGYAGLLADADGVVVE-TLAEPGQVVSAGQVVIRLARAGQR-EA 208 ++ + + + + A V + + G VV+ + ++ + E Sbjct: 309 DNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEV 368 Query: 209 RVQLPETL 216 + Sbjct: 369 TALVQNKD 376
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 75.3 bits (185), Expect = 2e-18 Identities = 39/179 (21%), Positives = 71/179 (39%), Gaps = 16/179 (8%) Query: 8 LSALAVRERSVTLFLIILISVAGLVAFFGLGRAEDPPFTVKQMTVITVWPGATAQEMQDQ 67 ++ +R L I++ +AG +A L A+ P ++V +PGA AQ +QD Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 68 VAEPLEKRLQELKWYDRTETYT-RPGMALITLSLQDQTPP----SEVPEQFYQARKKLGD 122 V + +E+ + + + + G ITL+ Q T P +V + A Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPL--- 117 Query: 123 EAKNLPAGVSGPMINDEFADVTFALFAL--KARGEPPRQLVRD--AEALRQQLLHVPGV 177 LP V I+ E + ++ + A + + D A ++ L + GV Sbjct: 118 ----LPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGV 172
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 354 bits (911), Expect = e-110 Identities = 184/850 (21%), Positives = 337/850 (39%), Gaps = 54/850 (6%) Query: 5 GEQAERIYLSFSHDRLATLGLSPEAIFAALNSQNVLTAAGAIET------RGGQIFIRLD 58 + A RI+L D L L+P + L QN AAG + + I Sbjct: 180 AQYAMRIWLDA--DLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQ 237 Query: 59 GAFDRLQQIRDTPIIAG--GRTLKLADVATVERGYEDPATFLIRHQGEPALLLGVVMREG 116 F ++ + G ++L DVA VE G E+ R G+PA LG+ + G Sbjct: 238 TRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIA-RINGKPAAGLGIKLATG 296 Query: 117 WNGLALGKALDAETTSINQSLPLGMSLTKVTDQSVNISAAVDEFMIKFFVA-LLVVMAVC 175 N L KA+ A+ + P GM + D + + ++ E + F A +LV + + Sbjct: 297 ANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMY 356 Query: 176 FVSMGWRVGVVVAAAVPLTLAVVFVVMAATGKNFDRITLGSLILALGLLVDDAIIAIEMM 235 R ++ AVP+ L F ++AA G + + +T+ ++LA+GLLVDDAI+ +E + Sbjct: 357 LFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENV 416 Query: 236 V-VKMEEGYDRLKASAYAWSHTAAPMLAGTLVTAVGFMPNGFAQSTAGEYASNVFWIVGI 294 V ME+ +A+ + S ++ +V + F+P F + G + Sbjct: 417 ERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVS 476 Query: 295 ALIASWIVAVIFTPWLGVHLLPDRKPVAAGHAALYDT----------PRYQRFRRLLTRV 344 A+ S +VA+I TP L LL KPV+A H + + ++ Sbjct: 477 AMALSVLVALILTPALCATLL---KPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKI 533 Query: 345 IAHKWRVAAGVVALFIVAILGMSVVNKQFFPTSDRPEVLVEVQLPYGSSISQTSAAAAKI 404 + R + ++ + F P D+ L +QLP G++ +T ++ Sbjct: 534 LGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQV 593 Query: 405 EHWLQRQPEAKIVASYIGQGAPRFYLAMAPELPDP--SFAKLVVLTDGQGARE---ALKR 459 + + +A + + + G + + + + +F L + G A+ Sbjct: 594 TDYYLKNEKANVESVFTVNG-----FSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIH 648 Query: 460 RLREAV-----ANGLAPEARVRVTQLVFGPYSPYPVAWRVMGPDPHALLDIAERVKSVLQ 514 R + + + V + + +G D AL ++ + Sbjct: 649 RAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHD--ALTQARNQLLGMAA 706 Query: 515 ASPL-MRTVNTDWGSRVPVMHFSLNQDRLQASGLSSQSVAQQLQFLLSGIPITTVREDIR 573 P + +V + ++Q++ QA G+S + Q + L G + + R Sbjct: 707 QHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGR 766 Query: 574 AVQVIGRAAGDIRLDPAKIADFTLVGSGGQRVPLSQIGDVSIRMEDPLLRRRDRTPTITV 633 ++ +A R+ P + + + G+ VP S P L R + P++ + Sbjct: 767 VKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEI 826 Query: 634 RGDVAENLQPPDVSTALMKPLQPIIDSLPPGYRIETAGSIEESGKATRAMVPLFPIMIAL 693 +G+ A P S M ++ + LP G + G + + L I + Sbjct: 827 QGEAA----PGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVV 882 Query: 694 TLLIIILQVRSLSAMVMVFLTAPVGLIGVVPTLLLFNQPFGINALVGLIALSGILMRNTL 753 L + S S V V L P+G++GV+ LFNQ + +VGL+ G+ +N + Sbjct: 883 VFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAI 942 Query: 754 ILIGQIHHNQQA-GLDPFHAVVEATVQRARPVLLTALAAILAFIPLTHSVFWGT-----L 807 +++ + G A + A R RP+L+T+LA IL +PL S G+ + Sbjct: 943 LIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAV 1002 Query: 808 AYTLIGGTLG 817 ++GG + Sbjct: 1003 GIGVMGGMVS 1012
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 65.8 bits (160), Expect = 2e-15 Identities = 29/162 (17%), Positives = 61/162 (37%), Gaps = 4/162 (2%) Query: 6 HDEAQSLKARIFSAAIAVFAEHGLSGARMEQIATEAQTTKRMVVYYFKSKEQLYQEVLQH 65 EAQ + I A+ +F++ G+S + +IA A T+ + ++FK K L+ E+ + Sbjct: 6 KQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL 65 Query: 66 VYARIRETEQQLGLENVPPVEALVR---LVRWSVRYHATHADYMRVICMENMQR-GKWLK 121 + I E E + + +++R + + I + G+ Sbjct: 66 SESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAV 125 Query: 122 SSGELKPLNRTALSILEDILLRGQQQGVFQAGLDARDVHRLI 163 + L + +E L + + A L R ++ Sbjct: 126 VQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIM 167
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 47.5 bits (113), Expect = 6e-08 Identities = 69/370 (18%), Positives = 124/370 (33%), Gaps = 43/370 (11%) Query: 64 GILFSAFAWTYALAQIPGGLFLDRFGNKVTYFLSLTLWSLFTLFHGMAVGLKTLLLCRFG 123 GIL + +A G DRFG + +SL ++ A L L + R Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 124 LGISEAPCFPVNSRVVSAWFPQQERAKA----TAVYTVGEYLGLACFAPLLFWIMDGFGW 179 GI+ A V ++ ERA+ +A + G G P+L +M GF Sbjct: 106 AGITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMVAG-----PVLGGLMGGFSP 159 Query: 180 RVLFVSVGAVGILFALVWWRCYREPHEDPRLSQQEREHIENGGGLSAPTDQQVAFSWPLV 239 F + A+ L L E H+ R + + +F W Sbjct: 160 HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREA-----------LNPLASFRWARG 208 Query: 240 RQLLSKRQIIGASIGQFAGNTVLVFFLTWFPTWLATERHMPWLKVGFFSILPFVAAAGGV 299 +++ + + ++ + E W + + AA G+ Sbjct: 209 MTVVAALMAVFFIMQLVGQVPAALWVIF-------GEDRFHWDA----TTIGISLAAFGI 257 Query: 300 M---FGGWLSDKLLKATGSANLGRKLPIVAGLL--MASCIITANWLESDLAVILVMSFAF 354 + ++ + LG + ++ G++ I+ A +A +++ A Sbjct: 258 LHSLAQAMITGPVAA-----RLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLAS 312 Query: 355 FGQGMVGLGWTLISDIAPKGLGGLTGGLFNFCANLAGILTPLVIGFIVAGFGNFFYALIY 414 G GM L ++S + G G +L I+ PL+ I A + + Sbjct: 313 GGIGMPALQ-AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAW 371 Query: 415 IGGAALLGVV 424 I GAAL + Sbjct: 372 IAGAALYLLC 381
>PF06438#Heme acquisition protein HasAp Length = 205 Score = 27.6 bits (61), Expect = 0.042 Identities = 25/121 (20%), Positives = 39/121 (32%), Gaps = 17/121 (14%) Query: 139 VGSRIRDWSIGFVD-------TVADNASCGLYVIGGPAQRPAGLDLKQCAMHMTRNQE-L 190 V + DWS F D V + + G GP D Q A+ T + Sbjct: 16 VADYLADWSAYFGDVNHRPGQVVDGSNTGGFN--PGP------FDGSQYALKSTASDAAF 67 Query: 191 VSSGRGSECLGHPLNAAVWLARKLASLGEPLRAGDIVLTGALG-PMVTINEGDSFVAHIE 249 ++ G L + +W +LG+ L G AL V+ + + Sbjct: 68 IAGGDLHYTLFSNPSHTLWGKLDSIALGDTLTGGASSGGYALDSQEVSFSNLGLDSPIAQ 127 Query: 250 G 250 G Sbjct: 128 G 128
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 48.7 bits (116), Expect = 2e-08 Identities = 79/400 (19%), Positives = 149/400 (37%), Gaps = 52/400 (13%) Query: 14 VTIGLCFMVALMEGLDLQAAGIAAVGMAQAFALDKMQMGWIFSAGILGLLPGALVGGMLA 73 + I LC + L+ ++ +A F W+ +A +L G V G L+ Sbjct: 15 ILIWLCILS-FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73 Query: 74 DRHGRKRILLGSVLLFGLFSLATALAWS-FPTLLLARLLTGVGLGAALPNLIA-LTSEAA 131 D+ G KR+LL +++ S+ + S F L++AR + G G AA P L+ + + Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAG-AAAFPALVMVVVARYI 132 Query: 132 GSRFRGRAVSLMYCGVPIGAALAAALGFSGLAAAWQIIFWIGGVVPLLLIPLLMRWLPES 191 RG+A L+ V +G + A+G + ++ ++ +P LM+ L + Sbjct: 133 PKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKE 192 Query: 192 QAFQRA---------EASVPLRTLFAPGQAAATLLLWLGYFFTLLVVYMLINWLPMLLVG 242 + + LF + + L++ + F + V ++ P + G Sbjct: 193 VRIKGHFDIKGIILMSVGIVFFMLFTTSYSISFLIVSVLSFL-IFVKHIRKVTDPFVDPG 251 Query: 243 QGFRASQAAGVMFSLQI-GAACGTLLLGALMDK--------------LTPLRMSLLIYS- 286 G GV+ I G G + + M K + P MS++I+ Sbjct: 252 LGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311 Query: 287 --GILAS------LLALGSASSLTGMLLAGFV----------AGLFATGGQSVLYALAPL 328 GIL +L +G L A F+ +F GG S + Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVIST 371 Query: 329 FYPAAIRATGVGTAVA----VGRLGAMSGPLLAGKMLALG 364 ++++ G ++ L +G + G +L++ Sbjct: 372 IVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIP 411
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.005 Identities = 13/32 (40%), Positives = 18/32 (56%), Gaps = 4/32 (12%) Query: 38 LRPG---ESVALL-GPSGCGKSTLLRLLAGLE 65 + PG + +L G G GKSTL+ L GL+ Sbjct: 589 MEPGCKFDYSVVLEGTGGIGKSTLINTLVGLD 620
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 51.0 bits (122), Expect = 4e-09 Identities = 41/147 (27%), Positives = 60/147 (40%), Gaps = 2/147 (1%) Query: 57 VSLYLAGGLALQWLLGPLSDRIGRRPVLLTGAIIFALACFSMIFVTSIDQYLIARFIQGT 116 ++LY A +LG LSDR GRRPVLL A+ M + I R + G Sbjct: 49 LALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI 108 Query: 117 SICFISTVGYVSIQEAFDEKESIRIMAALTSIVLLAPVIGPLAGAGLMNFLHWKLLFAII 176 + + G I + D E R +++ V GP+ G GLM F Sbjct: 109 TGATGAVAGAY-IADITDGDERARHFGFMSACFGFGMVAGPVLG-GLMGGFSPHAPFFAA 166 Query: 177 GAMSLLAWALLIFNMPETVTSQGRGFR 203 A++ L + F +PE+ + R R Sbjct: 167 AALNGLNFLTGCFLLPESHKGERRPLR 193
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 31.3 bits (71), Expect = 0.006 Identities = 63/343 (18%), Positives = 116/343 (33%), Gaps = 23/343 (6%) Query: 48 GLLAALPPAGMMISSFLSPALCRRVEMGVLLSGSLILLALATIASCMTTDMTLLLLPRLL 107 G+L AL + + AL R +L SL A+ + +L + R++ Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 108 TGLASGVIIVLGESWITGGAAGSQRATLTGLYASAFTGCQLAGPLL------ISVGPAWQ 161 G+ V G ++I G +RA G ++ F +AGP+L S + Sbjct: 106 AGITGATGAVAG-AYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFF 164 Query: 162 TSALIAIVAVTAVCLLMLRHLPTGTRE------SLGERASWRSLGAFLPVLASGVFCFAF 215 +A + + C L+ R + W + L + F Sbjct: 165 AAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQL 224 Query: 216 FDASILALLPLYGMDK-GLNEGLAVLLVTVVLTGDAMFQTPL-GWLADRVGIRRVHLSCA 273 AL ++G D+ + + + ++ Q + G +A R+G RR + Sbjct: 225 VGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGM 284 Query: 274 VVFSLSLLALPLMLGSRIQLMAICLLLGAAAG--ALYTLSLVRAGKTFNGQKLIMINALF 331 + + L + + LL G AL + + + GQ + Sbjct: 285 IADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQ----LQGSL 340 Query: 332 GFFWSAGSVAGPVVSGMLIG--ITGYDGLIVTLVASGVLFLLI 372 S S+ GP++ + IT ++G A+ L L Sbjct: 341 AALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLP 383
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.044 Identities = 12/34 (35%), Positives = 17/34 (50%) Query: 31 VVSLLGPSGSGKTTLLRAVAGLEKPTSGRIAIGN 64 V L G G GK+TL+ + GL+ + IG Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 78.3 bits (193), Expect = 7e-20 Identities = 28/111 (25%), Positives = 57/111 (51%), Gaps = 1/111 (0%) Query: 18 IIVAEDDDDIAAILTGYLRKAGMKTLRAEDGEQAINLTRLNKPDLLLLDIQLPVYDGWNV 77 I+VA+DD I +L L +AG + DL++ D+ +P + +++ Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 78 LTTLRKE-TNVPVIMVTALDQDVDKLMGLRLGADDYVIKPFNPSEVIARVE 127 L ++K ++PV++++A + + + GA DY+ KPF+ +E+I + Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1059 bits (2741), Expect = 0.0 Identities = 503/1031 (48%), Positives = 690/1031 (66%), Gaps = 7/1031 (0%) Query: 1 MPHFFIERPIFAWVIALFIVLTGLLSIPRLPVAQYPEVAPPGIIISVSYPGASPEVMNTS 60 M +FFI RPIFAWV+A+ +++ G L+I +LPVAQYP +APP + +S +YPGA + + + Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VVSLIEREISSVDNLLYFESSSDTTGMASITVTFKPGTDIKLAQMDLQNQIKIVESRLPQ 120 V +IE+ ++ +DNL+Y S+SD+ G +IT+TF+ GTD +AQ+ +QN++++ LPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 SVRQNGINVEAANSGFLMMVGLKSPSGAYQEADLSDYFARNVTDELRRVPGVGKVQLFGG 180 V+Q GI+VE ++S +LM+ G S + + D+SDY A NV D L R+ GVG VQLFG Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 EKALRIWLDPMKLHSYGLSVTDVLSAISQQNVIVSPGRTGDEPATSSQEVTYPITVKGQL 240 + A+RIWLD L+ Y L+ DV++ + QN ++ G+ G PA Q++ I + + Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 SSVEEFRNITIKSQVSAARVTLADVARVESGLQSYAFGIRENGVPATAAAIQLSPGANAI 300 + EEF +T++ + V L DVARVE G ++Y R NG PA I+L+ GANA+ Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 STASGIRARLTELSGVLPEGMIFTVPFDTAPFVKLSILKVVETFVEAMVLVFFVMLLFLH 360 TA I+A+L EL P+GM P+DT PFV+LSI +VV+T EA++LVF VM LFL Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 KIRCTLIPAIVAPVALLGTFTVMLLSGYSINILTMFGMILAIGIIVDDAIVVVENVERLM 420 +R TLIP I PV LLGTF ++ GYSIN LTMFGM+LAIG++VDDAIVVVENVER+M Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 EDKKMSPQDATREAMREITPAIIGITLVLTAVFIPMAFASGSVGIIYRQFSISMAISILL 480 + K+ P++AT ++M +I A++GI +VL+AVFIPMAF GS G IYRQFSI++ ++ L Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SAFLALTLTPALCATLLKP-HGIHQGKSSVFSAWFNAHFHRLTSFYATGLGFVLKRTGRM 539 S +AL LTPALCATLLKP H F WFN F + Y +G +L TGR Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 540 MMIYAALCLALFAGLSTLPSSFLPDEDQGYFMSSIQLPSDATMQRTLKVVDTFEEEI--A 597 ++IYA + + LPSSFLP+EDQG F++ IQLP+ AT +RT KV+D + Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 598 HRQAVESNIMILGFGFSGSGQNSAMAFTTLKDWRQRKGT--TAQEEADHIRSQMANVPDA 655 + VES + GF FSG QN+ MAF +LK W +R G +A+ + ++ + D Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 656 VTMSLLPPAISDMGTSSGFTYYLQDRGGKGYQALKKAANELIVQANHNP-HLADVYIDGL 714 + PAI ++GT++GF + L D+ G G+ AL +A N+L+ A +P L V +GL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 715 GEGTSLSLHVDREKAEAMGVSFDEINQTISVAAGSNYVNDYTNNGRVQQVIVQADAPYRM 774 + L VD+EKA+A+GVS +INQTIS A G YVND+ + GRV+++ VQADA +RM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 775 QPEQLLALSVKNRLGQMLPLSTFVTLSWNVAPQQLIRYQGYPAIRITGSSAQGKSSGTAM 834 PE + L V++ G+M+P S F T W +L RY G P++ I G +A G SSG AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 835 AAMDNLAKHLPPGFAGEWAGSSLQEKESASQLPGLIVLSVLVVFMVLAALYESWSIPFAV 894 A M+NLA LP G +W G S QE+ S +Q P L+ +S +VVF+ LAALYESWSIP +V Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 895 MLVVPLGLLGAVLAVSVTNMTNDVFFKVGLITLIGLSAKNAILIIEFARQLM-KEGKSLI 953 MLVVPLG++G +LA ++ N NDV+F VGL+T IGLSAKNAILI+EFA+ LM KEGK ++ Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 954 DATLTAAKLRLRPILMTSLAFTLGVVPLMLASGASDSTQHAIGTGVFGGMISGTLLAIFF 1013 +ATL A ++RLRPILMTSLAF LGV+PL +++GA Q+A+G GV GGM+S TLLAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1014 VPVFFVTITRF 1024 VPVFFV I R Sbjct: 1021 VPVFFVVIRRC 1031
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 99.0 bits (246), Expect = 3e-27 Identities = 61/228 (26%), Positives = 97/228 (42%), Gaps = 10/228 (4%) Query: 1 MAIADYNDATAKAVASEINQAGGRAMAVKVDVSDRDQVFAAVEQARKTLGGFDVIVNNAG 60 +A DYN + V S + A A DV D + + + +G D++VN AG Sbjct: 35 IAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAG 94 Query: 61 VAPSTPIESITPEIVDKVYNINVKGVIWGIQAAVEAFKKEGHGGKIINACSQAGHVGNPE 120 V I S++ E + +++N GV ++ + G I+ S V Sbjct: 95 VLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYMMDR-RSGSIVTVGSNPAGVPRTS 153 Query: 121 LAVYSSSKFAVRGLTQTAARDLAPLGITVNGYCPGIVKTPM----WAEIDRQVSEAAGKP 176 +A Y+SSK A T+ +LA I N PG +T M WA+ + G Sbjct: 154 MAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGS- 212 Query: 177 LGYGTAEFAKRITLGRLSEPEDVAACVSYLASPDSDYMTGQSLLIDGG 224 F I L +L++P D+A V +L S + ++T +L +DGG Sbjct: 213 ----LETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGG 256
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 115 bits (288), Expect = 4e-33 Identities = 66/255 (25%), Positives = 105/255 (41%), Gaps = 9/255 (3%) Query: 3 LHGKTALVTGSTSGIGLGIAKVLAQAGAQLVLNGFGDSSHARAE--VAALGKIPGYHDAD 60 + GK A +TG+ GIG +A+ LA GA + + + + A + AD Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 61 LRDVGQIEAMMRYAESTFGGVDIVINNAGIQHVAPVEQFPVDKWNDILAINLSSVFHTTR 120 +RD I+ + E G +DI++N AG+ + ++W ++N + VF+ +R Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 121 LALPGMRQRNWGRIINIASVHGLVASKEKSAYVAAKHAVVGLTKTVALETARSGITCNAI 180 M R G I+ + S V +AY ++K A V TK + LE A I CN + Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIV 185 Query: 181 CPGWVLTPLVQQQIDKRIAEGVDPEQASAQLLAEKQ---PSGEFVTPQQLGEMALFLCSD 237 PG T + EQ L + P + P + + LFL S Sbjct: 186 SPGSTETDMQWSLWADENGA----EQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241 Query: 238 AAAQVRGAAWNMDGG 252 A + +DGG Sbjct: 242 QAGHITMHNLCVDGG 256
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 29.9 bits (67), Expect = 0.014 Identities = 32/129 (24%), Positives = 47/129 (36%), Gaps = 15/129 (11%) Query: 6 ALAALALLMLAAYRGY----SVILFAPIAALGAVLLTDPGAVGPA----------FTGLF 51 ALA + ++AA GY S++ P+ AL + G V A + L Sbjct: 126 ALAGAGIGVVAAALGYTQWLSLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLV 185 Query: 52 MEKMVGFVKLYFPVFLLGAVFGKLIELSGFSRSIVAAAIRILGRRHAIPVIVLVCALLTY 111 + ++ FPV L VF S SI +LG + V V AL Y Sbjct: 186 ITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHP-VVDVCQHVGALCIY 244 Query: 112 GGVSLFVVA 120 + F+ Sbjct: 245 IVIPFFLST 253
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.023 Identities = 15/42 (35%), Positives = 21/42 (50%), Gaps = 1/42 (2%) Query: 53 ITLLGPSGCGKSTLLKMVAGLVEPSDGKLMLW-RRDSREKAQ 93 + L G G GKSTL+ + GL SD + +DS E+ Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIA 640
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 32.2 bits (73), Expect = 0.003 Identities = 17/72 (23%), Positives = 32/72 (44%), Gaps = 9/72 (12%) Query: 188 LVSRYHDPRPESLRRVVMAPTTVLHSAPGAQ-LREMAKLARQLGIRL------HSHLSET 240 + S + P PE L R+ AP + + G Q L K ++ L +HL++ Sbjct: 101 VWSAGYGPSPEMLARI--APGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAAETHLAQY 158 Query: 241 VDYLDAARQKFA 252 D++ + + +F Sbjct: 159 EDFIRSMKPRFV 170
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 57.6 bits (139), Expect = 9e-13 Identities = 37/142 (26%), Positives = 63/142 (44%), Gaps = 3/142 (2%) Query: 1 MLTFGSFLLAAGVTADAIDRKRIFIAGAALFCLSSLLFCLTHNLFLSGVL-RALQGLAAA 59 MLTF G +D + KR+ + G + C S++ + H+ F ++ R +QG AA Sbjct: 59 MLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAA 118 Query: 60 MILASGSAALAQLYDGAQRTRAFSILGTVFGAGLAFGPLLIGFMTDAVGWRGVYALFALL 119 A +A+ R +AF ++G++ G GP + G + + W Y L + Sbjct: 119 AFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPM 176 Query: 120 SAIVLLIGLAYLPAAEKASRGH 141 I+ + L L E +GH Sbjct: 177 ITIITVPFLMKLLKKEVRIKGH 198
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 28.7 bits (64), Expect = 0.007 Identities = 11/37 (29%), Positives = 17/37 (45%) Query: 5 AISAVPVAKAGMAAGLFNTVRVAGEGIALAIVSAVLT 41 S++ +AG L N EG +AIV +L+ Sbjct: 373 VSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 30.2 bits (68), Expect = 0.004 Identities = 14/44 (31%), Positives = 21/44 (47%), Gaps = 5/44 (11%) Query: 10 QRQALICQILQENGRVVCAELAARLQ-----VSEHTIRRDLHEL 48 QR I +I+ N EL L+ V++ T+ RD+ EL Sbjct: 5 QRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKEL 48
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 37.9 bits (88), Expect = 7e-05 Identities = 36/215 (16%), Positives = 74/215 (34%), Gaps = 17/215 (7%) Query: 52 LGMSEADSITLFSSFSALVYGLVAIGGWLGDKVLGTKRVIMLGAIVLAIGYALVAWSGHD 111 A + + ++F A+ G L D+ LG KR+++ G I+ G + Sbjct: 44 FNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ-LGIKRLLLFGIIINCFGSVIGFVGHSF 102 Query: 112 AAIVYMGMATIAVGNGLFKANPSSLLST-CYDKNDPRLDGAFTMYYMSINIGSFFSMLAT 170 +++ M G F A +++ +N AF + + +G Sbjct: 103 FSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG---KAFGLIGSIVAMGEGVGPAIG 159 Query: 171 PWLAARFGWSVAFALSVVGMVITIINFAFCQKWVKQYGSKPD-FAPVHMGKLLATIAGVV 229 +A WS + +ITII F K +K+ F + + I + Sbjct: 160 GMIAHYIHWSYLLLI----PMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFM 215 Query: 230 VLVAIATWLLHNQGIARMVLGVVALGIVVIFAKET 264 + + +++ V++ I V ++ Sbjct: 216 LFTTSYSISF-------LIVSVLSFLIFVKHIRKV 243
>ARGDEIMINASE#Bacterial arginine deiminase signature. Length = 409 Score = 25.6 bits (56), Expect = 0.044 Identities = 7/29 (24%), Positives = 12/29 (41%) Query: 50 PRVAIVVDKSTWTREIIERNGTFGIVVPG 78 P I ++ T ++ E NG +P Sbjct: 359 PGEIIAYSRNHVTNKLFEENGIKVHRIPS 387
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.4 bits (68), Expect = 0.007 Identities = 12/31 (38%), Positives = 14/31 (45%) Query: 37 LLGPSGCGKSTLLRLLAGLSVPASGEIRFGD 67 L G G GKSTL+ L GL + G Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 65.1 bits (158), Expect = 3e-14 Identities = 48/163 (29%), Positives = 73/163 (44%) Query: 6 ITGGTAGAGKATALRFARAGYHVALIARDETGLQETRQACERFGIKTLAISADVVDAGAL 65 ITG G G+A A A G H+A + + L++ + + A ADV D+ A+ Sbjct: 13 ITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAI 72 Query: 66 QRAAAEVETTLGAIDVWINNAMTTVLAPFRQMSEEEFRRVTEVTYLGYVNGTRAALEVMI 125 A +E +G ID+ +N A +S+EE+ V G N +R+ + M+ Sbjct: 73 DEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKYMM 132 Query: 126 PRDRGVIIQAGSALAWRSIPLQSAYCGAKAAIRGFTDAVRTEL 168 R G I+ GS A +AY +KAA FT + EL Sbjct: 133 DRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL 175
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 96.4 bits (240), Expect = 4e-25 Identities = 42/127 (33%), Positives = 65/127 (51%), Gaps = 1/127 (0%) Query: 6 HILVVDDDRDIRELIVDYLEKSGYRASGAANGKAMWSVLKNHQIDLIVLDIMMPGEDGLT 65 ILV DDD IR ++ L ++GY +N +W + DL+V D++MP E+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 66 LCRQLRANPQQDIPVLMLTARTDDSDRILGLEMGADDYLIKPFVARELLARIKAILRRTR 125 L +++ + D+PVL+++A+ I E GA DYL KPF EL+ I L + Sbjct: 65 LLPRIKK-ARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPK 123 Query: 126 ALPPNLQ 132 P L+ Sbjct: 124 RRPSKLE 130
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 26.8 bits (59), Expect = 0.044 Identities = 11/39 (28%), Positives = 15/39 (38%) Query: 21 TLTLGSLPPARLKLASGLFNLMRNLGGAIGIALCGTVLN 59 T+ SL L N L GIA+ G +L+ Sbjct: 371 TIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 65.7 bits (160), Expect = 3e-14 Identities = 45/235 (19%), Positives = 84/235 (35%), Gaps = 19/235 (8%) Query: 34 RLFQGKQRVIAAATIGGLASLAPTLGPTVGGWITENYNWHWLFFINVVPGIYIAVAVPLL 93 R + R A IG + ++ +GP +GG I +W +L I ++ I + + LL Sbjct: 130 RYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLL 189 Query: 94 VKVDSADPTL-LRGADYLSILLLALSLGCLEYTLEEGPRWGWFDDATLTTTAWVALLCGV 152 K ++G +S+ ++ L Y + V + Sbjct: 190 KKEVRIKGHFDIKGIILMSVGIVFFMLFTTSY------SISFL-------IVSVLSF--L 234 Query: 153 AFVIRTLHHPQPVMDLRALQDRTFSLGCYFSFMAGVGIFATIYLTPLYLGSVRGFSALEI 212 FV P +D ++ F +G + + + + P + V S EI Sbjct: 235 IFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEI 294 Query: 213 GLAV-FSTGLFQVMSIPFYSWLANRVDLRWLLMAGLIGFAVSMY--SFVPITHDW 264 G + F + ++ L +R ++L G+ +VS SF+ T W Sbjct: 295 GSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSW 349
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 107 bits (269), Expect = 4e-28 Identities = 54/370 (14%), Positives = 105/370 (28%), Gaps = 81/370 (21%) Query: 44 VGGDISAISSKVSGYIQQLAVQDNMAVKKGDLLIRIDDRDYRAALAKA------------ 91 G I + ++++ V++ +V+KGD+L+++ A K Sbjct: 92 HSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQT 151 Query: 92 -----------------------------AGEVAAQ-----------QAALADIQATRQL 111 EV Q + Sbjct: 152 RYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDK 211 Query: 112 QQATIAGSAASLLAATAATEKLANDNRRYNALAASSAISAQIRDNASADYRRAHAEQEKA 171 ++A A + + + +++L AI+ Y A E Sbjct: 212 KRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVY 271 Query: 172 KADKTVAERQLAVLDARHQQ--------ILAALAQAQAN-------LEMARLNLSYTDIR 216 K+ E ++ +Q IL L Q N L + IR Sbjct: 272 KSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIR 331 Query: 217 APFDGVIGNRRAWS-GSFVSSGTQLLSLVPA-HGLWIDANFKENQLAHMRAGQPATIVAD 274 AP + + + G V++ L+ +VP L + A + + + GQ A I + Sbjct: 332 APVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVE 391 Query: 275 VLPNHTF---KGHVASLAPATGSRFSILPAENATGNFTKIVQRVPVRIALEGDGAKLDVL 331 P + G V ++ + G ++ + G+ K L Sbjct: 392 AFPYTRYGYLVGKVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGN--KNIPL 442 Query: 332 RPGLSVIVTV 341 G++V + Sbjct: 443 SSGMAVTAEI 452
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.4 bits (76), Expect = 2e-04 Identities = 21/84 (25%), Positives = 35/84 (41%), Gaps = 10/84 (11%) Query: 67 NRLWALISALVIEESSRGSGIGQQLLQAAERLARDKQCAQIELSSSEKESERINFMKITA 126 N ALI + + + R G+G LL A A++ + L E++ IN I+A Sbjct: 87 NGY-ALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLML-----ETQDIN---ISA 137 Query: 127 TRRFANASLNIC-LNRLRARAFPG 149 +A I ++ + FP Sbjct: 138 CHFYAKHHFIIGAVDTMLYSNFPT 161
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 29.1 bits (65), Expect = 0.020 Identities = 15/72 (20%), Positives = 27/72 (37%), Gaps = 13/72 (18%) Query: 58 RLGYDKYKDMRDELRTL-------RQSGMPLTDQRDAV------QGNTLLARHYKQEMAN 104 L Y K D+ + L + +Q+ P+ + Q N L+ M + Sbjct: 273 YLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIIIKAHGQTNALIVTAAPDVMND 332 Query: 105 LTQWVNALDARQ 116 L + + LD R+ Sbjct: 333 LERVIAQLDIRR 344
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.015 Identities = 13/42 (30%), Positives = 22/42 (52%), Gaps = 4/42 (9%) Query: 31 VISIIGRSGSGKSTLLRCINGLEGYQEGSIKLGGMTITNRDS 72 + + G G GKSTL+ + GL+ + + +G T +DS Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIG----TGKDS 635
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 62.1 bits (151), Expect = 1e-12 Identities = 64/279 (22%), Positives = 117/279 (41%), Gaps = 17/279 (6%) Query: 29 LVFLLSDIAHSFHVDLEEVTLAILLTLAVRPVGALIFGRAAEKFGRKPILMLNIVFFSAF 88 L LL D+ HS V L L L ++ A + G +++FGR+P+L++++ + Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYAL-MQFACAPVLGALSDRFGRRPVLLVSLAGAAVD 86 Query: 89 ELLSAAAPSLMLFFLLRVLYGVAMGGIWGVASSLAMETIPDRSR----GLMSGLFQAGYP 144 + A AP L + ++ R++ G+ G VA + + R G MS F G Sbjct: 87 YAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFG-- 143 Query: 145 FGYLLAAVAYGLLFEQLGWRGMFVIGAAPVLLLPFIYFCVEESPVWQAARQNKESTALLP 204 ++A G L F AA + L F+ C + R+ AL P Sbjct: 144 ---MVAGPVLGGLMGGFSPHAPFFAAAA-LNGLNFLTGCFLLPESHKGERRPLRREALNP 199 Query: 205 VLRSHWKLCLYLVVLMAAFNF----FSHGTQDLYPVFLKVQHGFEPKTVSI-IAVCYNIA 259 + W + +V + A F L+ +F + + ++ T+ I +A + Sbjct: 200 LASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILH 259 Query: 260 SIIGGVFFGSLSEKIGRRKAIMIAALLALPVIPLWAFAS 298 S+ + G ++ ++G R+A+M+ + L AFA+ Sbjct: 260 SLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFAT 298 Score = 32.1 bits (73), Expect = 0.004 Identities = 29/120 (24%), Positives = 45/120 (37%), Gaps = 6/120 (5%) Query: 62 ALIFGRAAEKFGRKPILMLNIVFFSAFELLSAAA---PSLMLFFLLRVLYGVAMGGIWGV 118 A+I G A + G + LML ++ +L A A +L G+ M + + Sbjct: 264 AMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAM 323 Query: 119 ASSLAMETIPDRSRGLMSGLFQAGYPFGYLLAAVAYGLLFEQLGWRG-MFVIGAAPVLLL 177 S E + +G ++ L G LL Y W G ++ GAA LL Sbjct: 324 LSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITT--WNGWAWIAGAALYLLC 381
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 67.3 bits (164), Expect = 3e-16 Identities = 31/171 (18%), Positives = 58/171 (33%), Gaps = 15/171 (8%) Query: 2 KPKQADILRHASTLFNREGYQSPSIERIAEHAGISKMTFYRYYADKEALILAILKQKESE 61 + + IL A LF+++G S S+ IA+ AG+++ Y ++ DK L I + Sbjct: 10 QETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWEL-SES 68 Query: 62 FMQDLAQITADK------ASAREKLFAVFDYYHRWFTCETFHGCMFTRALFEYGSSSPAI 115 + +L K + RE L V + +F + F + Sbjct: 69 NIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMAVVQQ 128 Query: 116 REQCSRFKSLLWQFFRDILL------QVLKPEPAERVAMMMVMLIDGAIAA 160 ++ + L + R A++M I G + Sbjct: 129 AQRN--LCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMEN 177
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 43.7 bits (103), Expect = 9e-07 Identities = 48/291 (16%), Positives = 94/291 (32%), Gaps = 28/291 (9%) Query: 70 VTFSLLIILQTFFSPFQGRLVEKFGPRRLIAIGTVMAGMSWVLSAQVNGLATLWL---VY 126 + +L ++Q +P G L ++FG R ++ + A + + + A L L++ V Sbjct: 47 ILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVA 106 Query: 127 GCMGGLGTG----IVYIGVVGLMVKWFPQQRGFAAGAVAAGYGMGAIITTFPISLSLTTN 182 G G G I I + F GF + G G ++ S Sbjct: 107 GITGATGAVAGAYIADITDGDERARHF----GFMSACFGFGMVAGPVLGGLMGGFSP--- 159 Query: 183 GLEHTMTTFGILFALVGFLASQ-GLKLPPLAVSQPVSQTVVQSSRSFTSREMLRQPLFWL 241 H + FL L +P+ + + SF + + L Sbjct: 160 ---HAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMT-VVAAL 215 Query: 242 MFAMMAMMSTSGLMVTSQMAVFAED-FGISQAVV-FGMAALPLALTIDRFTNGLTRPLFG 299 M M + +F ED F + +AA + + + + G Sbjct: 216 MAVFFIMQLVGQVPAA-LWVIFGEDRFHWDATTIGISLAAFGI---LHSLAQAM---ITG 268 Query: 300 FISDRFGREQTMFIAFALEGVAMMLWLACREDPLLFVLLSGVVFFGWGRSS 350 ++ R G + + + +G +L + F ++ + G G + Sbjct: 269 PVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPA 319
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.8 bits (67), Expect = 0.017 Identities = 26/141 (18%), Positives = 48/141 (34%), Gaps = 4/141 (2%) Query: 18 MVIALVQFTNALEYMMFSPVFTFMAADF---AVPVTFSGYVSGMYTSGAVLSGIIAFYWI 74 +VI +A+ + PV + D G + +Y + Sbjct: 8 IVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALS 67 Query: 75 DRCNKKHFLIANMVLLAMATLLTTFTTSFPLLLTLRFFAGLVGGTTMAVGITILINHTPA 134 DR ++ L+ ++ A+ + +L R AG+ G T AV + + T Sbjct: 68 DRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYIADITDG 126 Query: 135 DLRGKMLATVIASFSMVSIVG 155 D R + + A F + G Sbjct: 127 DERARHFGFMSACFGFGMVAG 147
>INTIMIN#Intimin signature. Length = 939 Score = 32.7 bits (74), Expect = 4e-04 Identities = 22/70 (31%), Positives = 40/70 (57%), Gaps = 7/70 (10%) Query: 84 SDGVKVTQSGAESR-FYTVKSGDTLSAISKAMYGSANEYQRIFEANKPMLTHPD---KIY 139 SD +T + ++R FYT+K+G+T++ +SK+ + + I+ NK + + K Sbjct: 49 SDSKLLTHNSYQNRLFYTLKTGETVADLSKSQDINLST---IWSLNKHLYSSESEMMKAE 105 Query: 140 PGQVLIIPAK 149 PGQ +I+P K Sbjct: 106 PGQQIILPLK 115
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 30.6 bits (69), Expect = 0.012 Identities = 33/168 (19%), Positives = 60/168 (35%), Gaps = 10/168 (5%) Query: 25 TPYLKEQLDLSATQI---GLLSSCMLIAYGISKGVMSSLADKASPKVFMACGLVLCAIVN 81 P L L S G+L + + V+ +L+D+ + + L A+ Sbjct: 28 LPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDY 87 Query: 82 VGLGFSTAFWVFAALVVLNGLFQGMGVGPSFITIANWFPRRERGRVGAFWNISHNVGGGI 141 + + WV ++ G+ G IA+ ER R F +S G G+ Sbjct: 88 AIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDERAR--HFGFMSACFGFGM 144 Query: 142 VA-PIVGAAFAILGTEHWQSASYIVPACVAVVFAISVLVLGKGSPREE 188 VA P++G ++G + + A + F +L + E Sbjct: 145 VAGPVLG---GLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGER 189
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 30.6 bits (69), Expect = 0.010 Identities = 5/35 (14%), Positives = 15/35 (42%), Gaps = 4/35 (11%) Query: 312 QRLVQRMFDTAISFRLAQLKDAWRALHSAEVRLKR 346 +++ + LA ++++W + RL + Sbjct: 150 NSVMEGVIVRI----LANVRESWTQVIDLRPRLGQ 180
>PF06580#Sensor histidine kinase Length = 349 Score = 27.9 bits (62), Expect = 0.011 Identities = 8/42 (19%), Positives = 17/42 (40%), Gaps = 1/42 (2%) Query: 28 QVLV-NVLSNALDACPHAAQITVSWQIQGGRLCVLIADNGPG 68 Q LV N + + + P +I + G + + + + G Sbjct: 261 QTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSL 302
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 227 bits (579), Expect = 8e-72 Identities = 112/464 (24%), Positives = 185/464 (39%), Gaps = 73/464 (15%) Query: 7 SILLIDDDADVLDAYTQLLEQAGYHVSACNNPFDAREQVPKDWPGIVLSDVCMPGCSGID 66 +IL+ DDDA + Q L +AGY V +N + +V++DV MP + D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 67 LMTVFHQDDDLLPILLITGHGDVPMAVEAVKKGAWDFLQKPIDPGKLLTLVDAALRQRQS 126 L+ + LP+L+++ A++A +KGA+D+L KP D +L+ ++ AL + + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 127 VIARRQYCQQKLQVELIGRSQWTVRYRQRLQQLAETDIAVWLYGEPGTGRMTGARYLHQL 186 ++ + Q L+GRS + L +L +TD+ + + GE GTG+ AR LH Sbjct: 125 RPSKLEDDSQDGM-PLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDY 183 Query: 187 GRHAEGPFIA--CELTPAN----------------AHTLNE-LIAQAQGGTLVLSHPEHL 227 G+ GPF+A P + A T + QA+GGTL L + Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243 Query: 228 THEQQHQLVQ-LQSHEKRP----------FRLIGIGSASLVELAASSQIVAELYYCFAMT 276 + Q +L++ LQ E R++ + L + +LYY + Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303 Query: 277 QIGCQPLSKRPNDIEPLFHHYLQKTCQRLNHPVPEVDAGLLKGMMRRVWPNNVRELANAA 336 + PL R DI L H++Q+ + V D L+ M WP NVREL N Sbjct: 304 PLRLPPLRDRAEDIPDLVRHFVQQAE-KEGLDVKRFDQEALELMKAHPWPGNVRELENLV 362 Query: 337 ELFAV--------------------------------GVLPLAETVNPLMH--------- 355 G L +++ V M Sbjct: 363 RRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDA 422 Query: 356 IGEPTPLDQRVEDVERQIITEALNIHQGRINEVAEYLLIPRKNF 399 + D+ + ++E +I AL +G + A+ L + R Sbjct: 423 LPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTL 466
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 94.3 bits (234), Expect = 3e-26 Identities = 52/171 (30%), Positives = 88/171 (51%), Gaps = 2/171 (1%) Query: 3 LASKTAIVTGAARGIGFGIAQVLAREGARVIIADRDAHG-EAAAASLRESGAQALFISCN 61 + K A +TGAA+GIG +A+ LA +GA + D + E +SL+ A + Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 62 IAEKTQVEALFSQAEEAFGPVDILVNNAGINRDAMLHKLTEADWDTVIDVNLKGTFLCMQ 121 + + ++ + ++ E GP+DILVN AG+ R ++H L++ +W+ VN G F + Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 122 QAAIRMRERGAGRIINIAS-ASWLGNVGQTNYSASKAGVVGMTKTACRELA 171 + M +R +G I+ + S + + Y++SKA V TK ELA Sbjct: 126 SVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 32.3 bits (73), Expect = 6e-05 Identities = 22/77 (28%), Positives = 31/77 (40%), Gaps = 10/77 (12%) Query: 2 NAICPGFIDTDMTRG--VPENVWQIMISK--------IPAGYAGEAKDVGECVAFLASDG 51 N + PG +TDM EN + +I IP + D+ + V FL S Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 52 ARYINGEVINVGGGMVL 68 A +I + V GG L Sbjct: 243 AGHITMHNLCVDGGATL 259
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.3 bits (84), Expect = 2e-04 Identities = 31/141 (21%), Positives = 57/141 (40%), Gaps = 18/141 (12%) Query: 51 SVDIGLSATAFGLGAGLFFLTYAVLEIPSNLFLTRIGARRWIARIMITWGILSCG----- 105 + IG+S AFG+ L +T A R R + G+++ G Sbjct: 245 ATTIGISLAAFGILHSLA-----------QAMITGPVAARLGERRALMLGMIADGTGYIL 293 Query: 106 MAFVTGPTSFYVMRLLLGAAEAGLYPGIIYYLTLWFGREERAKATGLFLLGVCLANIIGA 165 +AF T + + +LL + G+ P + L+ E + + G L +I+G Sbjct: 294 LAFATRGWMAFPIMVLLASGGIGM-PALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGP 352 Query: 166 PLGGLLLSLDGMSGWHGWQWM 186 L + + ++ W+GW W+ Sbjct: 353 LLFTAIYAAS-ITTWNGWAWI 372
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.3 bits (76), Expect = 6e-04 Identities = 36/171 (21%), Positives = 65/171 (38%), Gaps = 18/171 (10%) Query: 35 WVSLLVCWLIWVLNAYDREMILRLG-PVISKEFSLSPEQWGNIVALIMVALAVLDIPGSI 93 W+ +L VLN EM+L + P I+ +F+ P + M+ ++ Sbjct: 18 WLCILS--FFSVLN----EMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGK 71 Query: 94 WSDRYGSGWKRARFQVPLVLGYTALSFISGIKAISHGLTAFVLL-RVGVNLGAGWGEPVG 152 SD+ G + L+ G F S I + H + +++ R GA + Sbjct: 72 LSDQLG-------IKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALV 124 Query: 153 VSNTAEWWPKEKRGFALGVHHTGYPIGALLSGVVASLVLATFGEGSWRYCF 203 + A + PKE RG A G+ + +G + + ++ W Y Sbjct: 125 MVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIH---WSYLL 172
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 28.2 bits (63), Expect = 0.016 Identities = 31/141 (21%), Positives = 55/141 (39%), Gaps = 6/141 (4%) Query: 1 MGINVVLP--PYLYHVSGLSLAASAGLSIIFTLTGTLGQVIWPWL---SDSFGRKRTLIV 55 +GI +++P P L S +A I+ L + P L SD FGR+ L+V Sbjct: 19 VGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLV 78 Query: 56 CGLWMSIGIALFYFATNMPRLIAIQLFFGLVANAVWPIYYAMASDSAEERATSTANGIIT 115 ++ A+ A + L ++ G + A + A +D + + G ++ Sbjct: 79 SLAGAAVDYAIMATAPFLWVLYIGRIVAG-ITGATGAVAGAYIADITDGDERARHFGFMS 137 Query: 116 TAMFIGGGISPLLMGWLIQFG 136 G P+L G + F Sbjct: 138 ACFGFGMVAGPVLGGLMGGFS 158
>DNABINDNGFIS#DNA-binding protein FIS signature. Length = 98 Score = 28.8 bits (64), Expect = 0.012 Identities = 13/50 (26%), Positives = 30/50 (60%), Gaps = 2/50 (4%) Query: 296 INNVRQLLEHDSGEVLLDTLSSFIANNAEPGKTSLLLGIHRNTLTYRLQQ 345 +N++ +L+ + + LLD + + N + +L++GI+R TL +L++ Sbjct: 47 VNDLYELVLAEVEQPLLDMVMQYTRGNQT--RAALMMGINRGTLRKKLKK 94
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.3 bits (97), Expect = 6e-06 Identities = 26/118 (22%), Positives = 49/118 (41%), Gaps = 1/118 (0%) Query: 55 GLVMSVLLVGAALGSVFGGKFADYFGRRKYLLFLSFVFLIGALLSAAAPDITILLIARAL 114 G+++++ + + G +D FGRR LL + + A AP + +L I R + Sbjct: 46 GILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 115 LGYAVGGASVTAPTFISEVAPTEMRGKLTGLNEVAIVIGQLAAFAINAIIGIIWGHLP 172 G G A +I+++ + R + G G +A + ++G H P Sbjct: 106 AGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAP 162 Score = 32.5 bits (74), Expect = 0.004 Identities = 32/137 (23%), Positives = 51/137 (37%), Gaps = 22/137 (16%) Query: 321 LVDRFKRKTIIIYGFAIMATLHLIIAAVDYTLVGDLKATAIWLLGALFVGVMQGSMGFIT 380 L DRF R+ +++ L AAVDY ++ + +G + G + G+ G + Sbjct: 66 LSDRFGRRPVLLVS--------LAGAAVDYAIMATAPFLWVLYIGRIVAG-ITGATGAVA 116 Query: 381 WVVLAELFPLKFRGLSMGISVFFMWIMNAVVSYLFPL------LQAKLGLGPVFFIFAAI 434 +A++ R G M+A + L FF AA+ Sbjct: 117 GAYIADITDGDERARHFGF-------MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAAL 169 Query: 435 NYLAILFVVFALPETSN 451 N L L F LPE+ Sbjct: 170 NGLNFLTGCFLLPESHK 186 Score = 30.6 bits (69), Expect = 0.013 Identities = 30/152 (19%), Positives = 52/152 (34%), Gaps = 8/152 (5%) Query: 48 ALTPTTEGLVMSVL-LVGAALGSVFGGKFADYFGRRKYLLFLSFVFLIGALLSAAAPDIT 106 TT G+ ++ ++ + ++ G A G R+ L+ G +L A A Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 107 ILLIARALLGYAVGGASVTAPTFISEVAPTEMRGKLTGLN----EVAIVIGQLAAFAINA 162 + LL + G +S E +G+L G + ++G L AI A Sbjct: 302 MAFPIMVLLA-SGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360 Query: 163 IIGIIWGHLPDVWRYMLLVQAIPAICLFVGMW 194 W W + + L G+W Sbjct: 361 ASITTWNGW--AWIAGAALYLLCLPALRRGLW 390
>FIMBRIALPAPE#Escherichia coli: P pili tip fibrillum papE protein signature. Length = 173 Score = 31.9 bits (72), Expect = 9e-04 Identities = 39/168 (23%), Positives = 75/168 (44%), Gaps = 22/168 (13%) Query: 24 ARAAGTLNFTGKIINESCQIANNGGDVNVDFGNVDMSALKSHEAKTAETPFTINLTGCPL 83 AA L F GK+I +C + N V++G++++ L ++ + FT+++ CP Sbjct: 22 VHAADNLTFKGKLIIPACTVQN----AEVNWGDIEIQNLV--QSGGNQKDFTVDMN-CPY 74 Query: 84 AQNISISLEGTPDTNANGTSAAVLALSDAADTAKGVGIEVFSSPDGS-----TEGTQLTF 138 S+ T+ T ++L + + + G+ I +++S + T G+Q+T Sbjct: 75 ----SLGTMKVTITSNGQTGNSILVPNTSTASGDGLLIYLYNSNNSGIGNAVTLGSQVTP 130 Query: 139 DKQSKTAVSQADENGDIAFNFIADLKSDSSQDVTAGNINATANIDIVY 186 K + TA ++ I K + Q + AG +ATA + Y Sbjct: 131 GKITGTAPAR-----KITLYAKLGYKGN-MQSLQAGTFSATATLVASY 172
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 575 bits (1483), Expect = 0.0 Identities = 266/747 (35%), Positives = 391/747 (52%), Gaps = 47/747 (6%) Query: 12 VSLSILLGGQSALLHAQAT--FNMDLLEKNDHLPAVDLQRFNQQAGQPPGAYPVSWQVNG 69 V L + + + A FN L + DL RF PPG Y V +N Sbjct: 28 VRLFVACAFAAQAPLSSAELYFNPRFLADDP-QAVADLSRFENGQELPPGTYRVDIYLNN 86 Query: 70 VTLDARKTVTFRQND-RGQLTPCLKPEDLLQAGVNPAVLSQATGATSRSCPELNALLPGS 128 + + VTF D + PCL L G+N A +S +C L +++ + Sbjct: 87 GYMA-TRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLADDACVPLTSMIHDA 145 Query: 129 TVNFDFAHQRLVMTIPQALMTHRARDNVPSALWDEGISAFQSNYRYSGASQRTREGSTER 188 T D QRL +TIPQA M++RAR +P LWD GI+A NY +SG S + R G Sbjct: 146 TAQLDVGQQRLNLTIPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSH 205 Query: 189 DNYLMLKSGVNVGAWRLRASNSLTAN-----SDDKPQWTTSGAWLERDLTRWQSELTLGD 243 YL L+SG+N+GAWRLR + + + N S K +W WLERD+ +S LTLGD Sbjct: 206 YAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGD 265 Query: 244 TFTSGDVFDAVQFQGISLASSDAMLPDSQKGFAPTIRGIARTNAQVTVRQNGYVLYQTYV 303 +T GD+FD + F+G LAS D MLPDSQ+GFAP I GIAR AQVT++QNGY +Y + V Sbjct: 266 GYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTV 325 Query: 304 TPGAFVIDDLYPTASSGNLEVAVKESDGEIRRFTQPYASVTSMQREGSLKYNLVAGRYHS 363 PG F I+D+Y +SG+L+V +KE+DG + FT PY+SV +QREG +Y++ AG Y S Sbjct: 326 PPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRS 385 Query: 364 DDASQR-PLMMQLSLMRGFTHNLTLFGGLQSAAQYHNLSLGAGQGLGEAGALSLQLLNAR 422 +A Q P Q +L+ G T++GG Q A +Y + G G+ +G GALS+ + A Sbjct: 386 GNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQAN 445 Query: 423 -DQHQQDPIDGRAWQLQYSKGFDRLGTQLTFTGWRYSHQRYATLSEAFSSPGSDDDLQDS 481 DG++ + Y+K + GT + G+RYS Y ++ S + +++ Sbjct: 446 STLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQ 505 Query: 482 D-----------------NKKATLQITASQSLPYDITLYLSLDQDSYWSGGATQRTANMG 524 D NK+ LQ+T +Q L TLYLS +YW G Sbjct: 506 DGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVDEQFQAG 565 Query: 525 ISSQVHGIAWSLSYSDSRSSHGDEEDDEPHSDKVVTLSLSVPLSHLLPG--------SYA 576 +++ I W+LSYS ++++ D+++ L++++P SH L + A Sbjct: 566 LNTAFEDINWTLSYSLTKNAWQKG------RDQMLALNVNIPFSHWLRSDSKSQWRHASA 619 Query: 577 GYTLTSSRHSVGSQMVSLNGTLLDNHALSYAVSQTRDRQ----NGSSGSLTAGYSSGRGD 632 Y+++ + + + + GTLL+++ LSY+V +GS+G T Y G G+ Sbjct: 620 SYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGN 679 Query: 633 LNLGYSHDSQAARLNYGASGGILIHRHGVVFTPEMNGAVVLIDAGGAGGVTLANQKTIAT 692 N+GYSH +L YG SGG+L H +GV +N VVL+ A GA + NQ + T Sbjct: 680 ANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRT 739 Query: 693 NGDGYAVLPFATAYHRNDVSLDSHSLP 719 + GYAVLP+AT Y N V+LD+++L Sbjct: 740 DWRGYAVLPYATEYRENRVALDTNTLA 766
>PF06057#Type IV secretory pathway VirJ component Length = 243 Score = 29.0 bits (65), Expect = 0.019 Identities = 13/46 (28%), Positives = 20/46 (43%), Gaps = 2/46 (4%) Query: 224 PAPTAASAASAADGTFTITLASTGERWPVPGDKTIAQVLQEHGVAV 269 P+ +A+S I L+ G W DK + +LQ+ G V Sbjct: 38 PSTQVNAASSHTKPPLVIFLSGDGG-W-ATLDKAVGGILQQQGWPV 81
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 44.5 bits (105), Expect = 6e-07 Identities = 32/157 (20%), Positives = 59/157 (37%), Gaps = 2/157 (1%) Query: 26 LGVFGLIVAEFLPASLLTPMASSLGVSEGMAGQAVTATALVALVTGLLIATATRNIDRRW 85 L F ++ L SL +A+ TA L + + + + + Sbjct: 22 LSFFSVLNEMVLNVSLPD-IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80 Query: 86 VLMFFSILQIVSSLMVAFADSLAFLL-LGRLLLGIAIGGFWAMSTATAMRLVPAAHVPKA 144 +L+F I+ S++ S LL + R + G F A+ R +P + KA Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKA 140 Query: 145 LAIIFSAVSVATVVAAPLGSYLGELIGWRNVFILCAI 181 +I S V++ V +G + I W + ++ I Sbjct: 141 FGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMI 177
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 25.8 bits (56), Expect = 0.026 Identities = 10/29 (34%), Positives = 19/29 (65%) Query: 52 RRTPWARKEVEAMYLASLDDDAPVEKADP 80 +R WA KEV+A ++ + +D +E+ +P Sbjct: 415 KRLYWASKEVKAQFMRVVQNDKALEEGNP 443
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 31.3 bits (71), Expect = 0.006 Identities = 31/128 (24%), Positives = 50/128 (39%), Gaps = 4/128 (3%) Query: 246 VHLWALFGLAAAPSCLIWHKLVLKWGYRQALTRNLLVQALGVILPACSASLLFCVLSALL 305 + L+AL A AP + L ++G R L +L A+ + A + L + ++ Sbjct: 49 LALYALMQFACAP---VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIV 105 Query: 306 VGFTFMGTVTIALPKAKSLSHQVSFNMIAAMTALYGVGQIAGPLIAGALYQIAASFNPAL 365 G T A M+A +G G +AGP++ G + + P Sbjct: 106 AGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHA-PFF 164 Query: 366 YAAALALL 373 AAAL L Sbjct: 165 AAAALNGL 172
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 34.8 bits (80), Expect = 3e-04 Identities = 15/71 (21%), Positives = 31/71 (43%), Gaps = 3/71 (4%) Query: 22 GRGKVADYIPALASVSGDKLGI-AISTVDGQHFAAGDAHERFSIQSISKVL--SLVVAMN 78 + + I S ++G+ + G+ A A ERF + S KV+ V+A Sbjct: 21 ASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVVLCGAVLARV 80 Query: 79 HYQEEEIWQRV 89 +E++ +++ Sbjct: 81 DAGDEQLERKI 91
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 36.5 bits (84), Expect = 2e-05 Identities = 11/50 (22%), Positives = 21/50 (42%) Query: 80 IDPQHRGQQLGEKLLAALEAKARQRDCHTLRLETGIHQHAAIALYTRNGY 129 + +R + +G LL A++ L LET +A Y ++ + Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 57.2 bits (138), Expect = 4e-11 Identities = 39/155 (25%), Positives = 69/155 (44%), Gaps = 2/155 (1%) Query: 36 LSDIADSFGMETAQVGMMLTIYAWVVALMSLPFMLLTSKVERRRLLIGLFILFIASHVLS 95 L DIA+ F A + T + ++ + + L+ ++ +RLL+ I+ V+ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 96 FFAWN-FDVLVISRIGIAFAHAVFWSITSALAIRMAPPGKRAQALSLIATGTALAMVFGI 154 F + F +L+++R A F ++ + R P R +A LI + A+ G Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 155 PIGRIIGQYFGWRMTFLAIGLGALATLACLVKLLP 189 IG +I Y W L I + + T+ L+KLL Sbjct: 157 AIGGMIAHYIHWSYLLL-IPMITIITVPFLMKLLK 190
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 28.2 bits (63), Expect = 0.010 Identities = 18/106 (16%), Positives = 41/106 (38%), Gaps = 4/106 (3%) Query: 4 TLRRSTIALLASSLLLTIGRGATLPFMTIYLTRRFQLEVDVIGYALSLALVVGVLF-SMG 62 AL+A ++ + I+ RF + IG +L+ ++ L +M Sbjct: 207 RGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMI 266 Query: 63 FGILADRFDKKRYMVWSVLVFILGFSAIPLVNN---ALLVVIFFAL 105 G +A R ++R ++ ++ G+ + A +++ A Sbjct: 267 TGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLAS 312
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.8 bits (72), Expect = 0.002 Identities = 42/167 (25%), Positives = 76/167 (45%), Gaps = 13/167 (7%) Query: 68 AFASCLSQYVLVVASSDFAE---KVVAVVLPVNAA--VVVALQYAVGRRLSAR-NIRPLM 121 +F S L++ VL V+ D A K A VN A + ++ AV +LS + I+ L+ Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82 Query: 122 TFGTVCFVIG-LVGFMFSGASLWAWGISAAIFTLGEVIYAPGEYMLI--DHIAPPGMKAS 178 FG + G ++GF+ G S ++ I A P M++ +I + Sbjct: 83 LFGIIINCFGSVIGFV--GHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKA 140 Query: 179 YFSAQSLGWLGAAFNPMLTGLILTHLPHWS-LFVILIVAIVAAWLMI 224 + S+ +G P + G+I ++ HWS L +I ++ I+ ++ Sbjct: 141 FGLIGSIVAMGEGVGPAIGGMIAHYI-HWSYLLLIPMITIITVPFLM 186
>TYPE3OMBPROT#Type III secretion system outer membrane B protein family signature. Length = 538 Score = 27.7 bits (61), Expect = 0.021 Identities = 14/32 (43%), Positives = 22/32 (68%), Gaps = 1/32 (3%) Query: 19 EQLAEMAGLSVRTIQRIENGER-PGLETLSAL 49 E+ + G+SV + QR++NGER G+E L+ L Sbjct: 23 EETGKHKGVSVISYQRVKNGERNKGIEALNRL 54
>PF05211#Neuraminyllactose-binding hemagglutinin Length = 260 Score = 27.7 bits (61), Expect = 0.039 Identities = 15/55 (27%), Positives = 28/55 (50%), Gaps = 7/55 (12%) Query: 43 LSLAIGVGELRCVIGPNGAGKTTLMDVITGKTRPQSGKALYDQSVDLTTLDPVAI 97 L + G+ ++ V+ P G K T+++ P SG++L ++DL+ LD Sbjct: 145 LLFSTGLDKMEGVLIPAGFVKVTILE-------PMSGESLDSFTMDLSELDIQEK 192
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.3 bits (97), Expect = 5e-06 Identities = 64/288 (22%), Positives = 111/288 (38%), Gaps = 39/288 (13%) Query: 33 PFFPVWLADVNHLTK--TETGIVFSSISLFAIIFQPVFGLMSDKLGLRKHLLWTITVLLI 90 P P L D+ H GI+ + +L PV G +SD+ G R LL V L Sbjct: 26 PVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLL----VSLA 81 Query: 91 LFA-PFFIFVFSPLLQMNIIAGSLVGGIYLGIVFSSGSGAVEAYIERVSRANRFEYGKVR 149 A + I +P L + + G +V GI G + + + RA F + Sbjct: 82 GAAVDYAIMATAPFLWV-LYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGF---- 135 Query: 150 VAGCVGWALCAS--ITGVLFGIDPNITFWIASGFALVLGLLLWLSRPESSNS------AQ 201 ++ C G+ + A + G++ G P+ F+ A+ + L PES + Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRRE 195 Query: 202 VIEALGANRQAFSLRTAAELLRMPRFWGFIVYVVG--VASVYDVFDQQFANFFKSFFASP 259 + L + R A + A L+ FI+ +VG A+++ +F + ++ Sbjct: 196 ALNPLASFRWARGMTVVAALM----AVFFIMQLVGQVPAALWVIFGEDRFHW-------- 243 Query: 260 QRGTEVFGFVTTGGELLNALI-MFCAPAIVNRIGAKNALLTAGMIMSV 306 G +L++L + R+G + AL+ GMI Sbjct: 244 --DATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM-LGMIADG 288
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 437 bits (1126), Expect = e-159 Identities = 285/286 (99%), Positives = 285/286 (99%) Query: 1 MRYIRLCIISLLATLPLAVHASPQPLEQIKQSESQLSGRVGMIEMDLASGRTLTAWRADE 60 MRYIRLCIISLLATLPLAVHASPQPLEQIK SESQLSGRVGMIEMDLASGRTLTAWRADE Sbjct: 1 MRYIRLCIISLLATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADE 60 Query: 61 RFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCA 120 RFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCA Sbjct: 61 RFPMMSTFKVVLCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSEKHLADGMTVGELCA 120 Query: 121 AAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPA 180 AAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPA Sbjct: 121 AAITMSDNSAANLLLATVGGPAGLTAFLRQIGDNVTRLDRWETELNEALPGDARDTTTPA 180 Query: 181 SMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARG 240 SMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARG Sbjct: 181 SMAATLRKLLTSQRLSARSQRQLLQWMVDDRVAGPLIRSVLPAGWFIADKTGAGERGARG 240 Query: 241 IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR 286 IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR Sbjct: 241 IVALLGPNNKAERIVVIYLRDTPASMAERNQQIAGIGAALIEHWQR 286
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 53.7 bits (129), Expect = 5e-10 Identities = 62/295 (21%), Positives = 108/295 (36%), Gaps = 15/295 (5%) Query: 55 VQPILPVLSNEFGVSPASSS---ISLSISTAMLAVGLLFTGPLSDAIGRKPVMVTALLLA 111 + P+LP L + S ++ I L++ M G LSD GR+PV++ +L A Sbjct: 24 IMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGA 83 Query: 112 ACCSLLSTMMTSWHGILIMRALIGLSLSGVAAVGMTYLSEEIHPSFVAFSMGLYISGNSI 171 A + + I R + G++ AV Y+++ A G + Sbjct: 84 AVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGF 142 Query: 172 GGMSGRLLTGVFTDFFGWRVALAAISGFALAAAIMFWRILPES--RHFRPTSLRPKTLLI 229 G ++G +L G+ F A + + +LPES RP L Sbjct: 143 GMVAGPVLGGLMGGF-SPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLA 201 Query: 230 NFRLHWRDRGLPLLFVEGFLLM---GAFVTLFN-YIGYRLMMSPWSLSQAVVGLLSVAYL 285 +FR + L F++ L+ + R ++ ++ + L Sbjct: 202 SFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSL 261 Query: 286 TGTWSSPKAGAMTVRFG-RGPVMLGFTAVMLCGLLLTLFSSLWLIFIGMLLFSAG 339 G + R G R +MLG A +LL + W+ F M+L ++G Sbjct: 262 AQAMI---TGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG 313
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 109 bits (273), Expect = 5e-30 Identities = 27/234 (11%), Positives = 70/234 (29%), Gaps = 47/234 (20%) Query: 26 LSAKDIKTLFFGHDDRKAVNRPEESPWDAIGQLET---ASGNLCTATLISPHLALTAGHC 82 L ++ + ++DR + + + ++ + + ++ LT H Sbjct: 61 LEQREHANVILPNNDRHQITDTTNGHYAPVTYIQVEAPTGTFIASGVVVGKDTLLTNKHV 120 Query: 83 LLTPPRGKPDKAVALRFI------SRKGNWVYE---IHGIDGRVDPSLGRRLKADGDGWI 133 + AL+ N + I G D ++ + + Sbjct: 121 V----DATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVK-FSPNEQN-- 173 Query: 134 VPSAAAPSDFGLIVLRYAPSGITPIPLFPGSKADLTAALKAADRKVTQSGYPEDH-LDNL 192 ++ + P + A ++ +T +GYP D + + Sbjct: 174 ---------------KHIGEVVKPATM-------SNNAETQVNQNITVTGYPGDKPVATM 211 Query: 193 YSHQDCIVTGWAQTSVLSHQCDTLPGDSGSPLLLKTEDGWQVIAVQSSAPGPQD 246 + + + + + + T G+SGSP+ + +VI + + Sbjct: 212 WESK--GKITYLKGEAMQYDLSTTGGNSGSPVF---NEKNEVIGIHWGGVPNEF 260
>PF01206#SirA family protein Length = 76 Score = 92.9 bits (231), Expect = 4e-29 Identities = 16/71 (22%), Positives = 38/71 (53%) Query: 7 DYRLDMVGEPCPYPAVATLEAMPSLQKGEILEVVSDCPQSINNIPLDARNHGYTVLDIQQ 66 D LD G CP P + + + ++ GE+L V++ P S+ + ++ G+ +L+ ++ Sbjct: 5 DQSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKE 64 Query: 67 DGPTIRYLIQK 77 + T + +++ Sbjct: 65 EDGTYHFRLKR 75
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.7 bits (77), Expect = 0.002 Identities = 35/159 (22%), Positives = 65/159 (40%), Gaps = 27/159 (16%) Query: 59 ILSWL--SFSLTFFIRPIGGVIFAHIGDRIGRKKTLVLTLSLMGSATVAIGLLPTYEMVG 116 +W+ +F LTF IG ++ + D++G K+ L+ + + +V VG Sbjct: 50 STNWVNTAFMLTF---SIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVI-------GFVG 99 Query: 117 LWAPALLITLRIIQGMGIGGEWGGALLLAYEYAPEKRK----GFFGSIPQAGVTIGMLMA 172 +LLI R IQG G +++ Y P++ + G GSI G +G + Sbjct: 100 HSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIG 159 Query: 173 TFIVSLMTLFDEAQFLAWGWRIPFLLSSVLVFLGLWIRK 211 I A ++ W + L+ + + ++ K Sbjct: 160 GMI---------AHYIHWSYL--LLIPMITIITVPFLMK 187
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 58.9 bits (142), Expect = 3e-13 Identities = 28/166 (16%), Positives = 58/166 (34%), Gaps = 12/166 (7%) Query: 1 MRADARKNYDLLIEVARDVFVEQGAEA-SLRDIARRAGVGMGTLYRHFPNRDSLLEALLR 59 + +A++ +++VA +F +QG + SL +IA+ AGV G +Y HF ++ L + Sbjct: 5 TKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWE 64 Query: 60 SRFAALTARAESLL------LAADPAAALLEWLAESVAFTHQHRGIIAPLMSAIDDPESA 113 + + + L+ L +V + + E A Sbjct: 65 LSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFVGEMA 124 Query: 114 L-----HSACVALRAAGTSLLTRAQQAGLARPDLSGEELFDLIAAL 154 + + C+ L +A + DL ++ Sbjct: 125 VVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGY 170
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 40.2 bits (94), Expect = 6e-06 Identities = 29/129 (22%), Positives = 47/129 (36%), Gaps = 19/129 (14%) Query: 195 TVLVFGATGQQGGSVARALLHRGWRVRALVRDPFSAG---------AAALAARGAELVVG 245 LV GA G G V++ LL G +V + D + LA G + Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGI--DNLNDYYDVSLKQARLELLAQPGFQFHKI 59 Query: 246 TFEDRAAMRSAMA--GVDGVF------SVQPSSPGGTVTDEQEVRYGITIADLAVECAVK 297 DR M A + VF +V+ S + + + I + ++ Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQ 119 Query: 298 HLVYSSGSA 306 HL+Y+S S+ Sbjct: 120 HLLYASSSS 128
>PF05043#Transcriptional activator Length = 493 Score = 33.8 bits (77), Expect = 9e-04 Identities = 17/66 (25%), Positives = 29/66 (43%), Gaps = 1/66 (1%) Query: 1 MNKIIENDFSRIDLNLLTVLMVLYREGSVTRTAEVLHLGQPAISGALKRLREMFDDPLFV 60 M ++ R L LL +L R + AE+L+ + A+ L ++ F D +F Sbjct: 1 MRDLLSKKSHR-QLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFH 59 Query: 61 RSARGM 66 S G+ Sbjct: 60 SSTNGI 65
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 55.4 bits (133), Expect = 3e-12 Identities = 35/120 (29%), Positives = 56/120 (46%), Gaps = 7/120 (5%) Query: 3 KIALITGANRGLGRQTALDIARQGGDVIVTYRGSLEQAEAVVADIRALGRKAIALPLDMA 62 KIA ITGA +G+G A +A QG + + E+ E VV+ ++A R A A P D+ Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 63 QTASFPAFADSLGSALASVWGRATFDHLINNAGHGEFAPLAETREAQFDGLFNVHVKGVF 122 +A A + + D L+N AG + + +++ F+V+ GVF Sbjct: 68 DSA---AIDEITARIEREM---GPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVF 121
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 38.1 bits (88), Expect = 1e-06 Identities = 20/86 (23%), Positives = 32/86 (37%), Gaps = 7/86 (8%) Query: 4 ELGGRGITVNTIAPGAIATDFGGGL-VRDDAEVN------AQFAAMTALGRVGVPEDIGP 56 EL I N ++PG+ TD L ++ F L ++ P DI Sbjct: 174 ELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIAD 233 Query: 57 MIASLLRDDNRWVTAQRIEVSGGQTI 82 + L+ +T + V GG T+ Sbjct: 234 AVLFLVSGQAGHITMHNLCVDGGATL 259
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 83.2 bits (205), Expect = 3e-21 Identities = 65/249 (26%), Positives = 111/249 (44%), Gaps = 24/249 (9%) Query: 7 KSVLVLGGSRGIGAAIVRRFVADGASVVFSYSGSPEAAERLAAETGSTA-----VQADSA 61 K + G ++GIG A+ R + GA + + +PE E++ + + A AD Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 62 DRDAVISLV----RDSGPLDVLVVNAGIALFGDALEQDSDAIDRLFRINIHAPYHASVEA 117 D A+ + R+ GP+D+LV AG+ G + + F +N ++AS Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 118 ARRMP--EGGRIIVIGSVNGDRMPVPGMAAYAVSKSALQGLARGLARDFGPRGITVNVVQ 175 ++ M G I+ +GS N +P MAAYA SK+A + L + I N+V Sbjct: 128 SKYMMDRRSGSIVTVGS-NPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 176 PGPIDTDA--------NPENGPMKELMHSF---MAIKRHGRPEEVAGMVAWLAGPEASFV 224 PG +TD N +K + +F + +K+ +P ++A V +L +A + Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 225 TGAMHTIDG 233 T +DG Sbjct: 247 TMHNLCVDG 255
>PF06291#Lambda prophage Bor protein Length = 102 Score = 26.9 bits (59), Expect = 0.019 Identities = 17/48 (35%), Positives = 24/48 (50%), Gaps = 3/48 (6%) Query: 1 MKSALISPLLAGLLLLTGCAQPAAQAGGGTIKAINHTKWAINHFSING 48 MK L S LA +L+TGCAQ G A+ + +HF ++G Sbjct: 6 MKKMLFSAALA--MLITGCAQQTFTVGNKPT-AVTPKETITHHFFVSG 50
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.3 bits (71), Expect = 0.013 Identities = 30/98 (30%), Positives = 41/98 (41%), Gaps = 13/98 (13%) Query: 580 GKRVVGQEAALSAIARRL-RAAKTGLTPENGPQGVFLLVGPSGTGKTETALALADALFGG 638 G +VG+ AA+ I R L R +T LT ++ G SGTGK A AL D Sbjct: 136 GMPLVGRSAAMQEIYRVLARLMQTDLT--------LMITGESGTGKELVARALHDYGKRR 187 Query: 639 EKALITINLSEYQEPHTVSQL----KGSPPGYVGYGQG 672 + IN++ S+L KG+ G G Sbjct: 188 NGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTG 225
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 91.1 bits (226), Expect = 4e-22 Identities = 45/146 (30%), Positives = 63/146 (43%), Gaps = 12/146 (8%) Query: 416 PPPPPRPVQHVAPNVIRLDSMSLFDTGKWVLKPGSTKRL--VSSLMDIKARPGWLIVVAG 473 P P P V L S LF+ K LKP L + S + +VV G Sbjct: 200 VAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLG 259 Query: 474 HTDSVGEEKANQLLSLKRAESVRDWMRDTGDVPDSCFAVQGYGESRPIATNDT------- 526 +TD +G + NQ LS +RA+SV D++ G +P + +G GES P+ N Sbjct: 260 YTDRIGSDAYNQGLSERRAQSVVDYLISKG-IPADKISARGMGESNPVTGNTCDNVKQRA 318 Query: 527 --PEGRALNRRVEISLVPQVDACRLP 550 + A +RRVEI + D P Sbjct: 319 ALIDCLAPDRRVEIEVKGIKDVVTQP 344
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 73.9 bits (181), Expect = 8e-18 Identities = 64/254 (25%), Positives = 103/254 (40%), Gaps = 15/254 (5%) Query: 6 KIALVTGGSRGLGRATVEALAQRGVNVVLTYKTRLAEANEVVTRVEALGARAIALPFSAG 65 KIA +TG ++G+G A LA +G ++ + +VV+ ++A A A P Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHI-AAVDYNPEKLEKVVSSLKAEARHAEAFP---- 63 Query: 66 EIDTFDAFVSAFQGALTELDADKFDYLVNNAGNASGMGFLNATEAEFDALYRIHVKSVFF 125 D D+ A E + D LVN AG + ++ E++A + ++ VF Sbjct: 64 -ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122 Query: 126 LSQKLLPLLAD--GGRIVNVSSGLTRIVMANRAPYAIMKSAVETLTRYMAFELGSRGITV 183 S+ + + D G IV V S + + A YA K+A T+ + EL I Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 184 NCVAPGAIATDFSGGVVRDNPQVAQAVANMTA-------LGRPGLPEDIGPMIASLLSDD 236 N V+PG+ TD + D Q + L + P DI + L+S Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 237 HRWVNAQRIEVSGG 250 + + V GG Sbjct: 243 AGHITMHNLCVDGG 256
>SURFACELAYER#Lactobacillus surface layer protein signature. Length = 439 Score = 31.2 bits (70), Expect = 0.006 Identities = 22/90 (24%), Positives = 40/90 (44%), Gaps = 10/90 (11%) Query: 11 VRIVERGSFSAAAADLGVSRPVATAAIKALEVSLGARLLHRTTRHVRPTAEGSLYYQRCV 70 +RIV SAAAA L P+A A+ V+ + + + A+ + + Sbjct: 5 LRIV-----SAAAAALLAVAPIAAT---AMPVNAATTINADSAINANTNAKYDVDVTPSI 56 Query: 71 SILAALEEANRSAG--GSISGTIRVDVAGN 98 S +AA+ +++ GS++G+I G Sbjct: 57 SAIAAVAKSDTMPAIPGSLTGSISASYNGK 86
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 33.3 bits (76), Expect = 0.002 Identities = 21/113 (18%), Positives = 45/113 (39%), Gaps = 2/113 (1%) Query: 76 RKWLLLGLTALMAASGVIIALASSFPVYMLGRALIGIVIGGFWSMSAATAIRLVPQRQVP 135 ++ LL G+ S + S F + ++ R + G F ++ R +P+ Sbjct: 79 KRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRG 138 Query: 136 RALAIFNGGNALATVVAAPLGSYLGATVGWRGAFLCLVPLALLAFVWQCISLP 188 +A + A+ V +G + + W ++L L+P+ + V + L Sbjct: 139 KAFGLIGSIVAMGEGVGPAIGGMIAHYIHW--SYLLLIPMITIITVPFLMKLL 189
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 305 bits (784), Expect = e-101 Identities = 111/377 (29%), Positives = 172/377 (45%), Gaps = 38/377 (10%) Query: 174 VLTGAVAMLRSTVRMGRQLQTMTSQDTSAFSQILAVGPKMRHVVEQARKLAMLSAPLLIV 233 LT + ++ + ++ + D+ ++ M+ + +L L+I Sbjct: 107 DLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMIT 166 Query: 234 GDTGTGKDLLAHACHLASPRAGKPYLALNCGSIPEDAVESELFG-------DALQGKKGF 286 G++GTGK+L+A A H R P++A+N +IP D +ESELFG A G Sbjct: 167 GESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGR 226 Query: 287 FKQANGGSVLLDEIGEMSPRMQTKLLRFLNDGTFRRVGEDHEVHVDVRVICATQKNLIEL 346 F+QA GG++ LDEIG+M QT+LLR L G + VG + DVR++ AT K+L + Sbjct: 227 FEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQS 286 Query: 347 VQKGLFREDLYYRLNVLTLYLPPLRDCPQDIMPLTELFVARFADEQGIPRPKLSADLSTV 406 + +GLFREDLYYRLNV+ L LPPLRD +DI L FV + E G+ + + + Sbjct: 287 INQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALEL 345 Query: 407 LTRYSWPGNVRQLKNAVYRALTQLEGFELRPQDILLP---------------DHDVASLP 451 + + WPGNVR+L+N V R + + I S+ Sbjct: 346 MKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSIS 405 Query: 452 VGEEAM--------------EGSLDDITRRFERSVLTQ-LYRSYPSTRKLAKRLGVSHTA 496 E G D + E ++ L + + K A LG++ Sbjct: 406 QAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNT 465 Query: 497 IANKLREYGLSQKKGDE 513 + K+RE G+S + Sbjct: 466 LRKKIRELGVSVYRSSR 482
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 250 bits (639), Expect = 3e-83 Identities = 84/176 (47%), Positives = 116/176 (65%) Query: 7 AQYKDNLLGEANSFLEVLEQVSRLAPLDKPVLVIGERGTGKELIANRLHYLSSRWQGPFI 66 +Q L+G + + E+ ++RL D +++ GE GTGKEL+A LH R GPF+ Sbjct: 133 SQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFV 192 Query: 67 SLNCAALNDNLLDSELFGHEAGAFTGASKRHPGRFERADGGTLFLDELATAPMLVQEKLL 126 ++N AA+ +L++SELFGHE GAFTGA R GRFE+A+GGTLFLDE+ PM Q +LL Sbjct: 193 AINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLL 252 Query: 127 RVIEYGELERVGGSQPLQVNVRLVCATNADLPQMVEEGHFRADLLDRLAFDVVQLP 182 RV++ GE VGG P++ +VR+V ATN DL Q + +G FR DL RL ++LP Sbjct: 253 RVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLP 308
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 47.5 bits (113), Expect = 5e-10 Identities = 18/66 (27%), Positives = 31/66 (46%), Gaps = 1/66 (1%) Query: 23 PGSPPEAAPGDELPALPLDLRDFQLQQEKRLLQRSLEQAKYHQKQAAELLGLTYHQLRAL 82 A+ GD LP L + E L+ +L + +Q +AA+LLGL + LR Sbjct: 411 NMRQYFASFGDALPPSGL-YDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKK 469 Query: 83 LKKHQL 88 +++ + Sbjct: 470 IRELGV 475
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 89.2 bits (221), Expect = 3e-21 Identities = 78/398 (19%), Positives = 156/398 (39%), Gaps = 22/398 (5%) Query: 35 VINV-VPAMKSSLDISLETLTLAVSLSALFSGCFVVASGGLADKFGRMRMTTLGLGLSIV 93 V+NV +P + + + + + L G L+D+ G R+ G+ ++ Sbjct: 32 VLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCF 91 Query: 94 GSAMLVVAQGP-GLFLAGRVLQGLSAACIMPATLALIKTWYEGRARQRAVSFWVIGSWGG 152 GS + V L + R +QG AA + ++ + R +A G Sbjct: 92 GSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMG 151 Query: 153 SGLCSFVGGAIATGLGWRWIFVFSIAVALLALFLLRGTPESRSASASQHKLDVGGLLSLI 212 G+ +GG IA + W ++ + + + FL++ + H D+ G++ + Sbjct: 152 EGVGPAIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKK--EVRIKGH-FDIKGIILMS 208 Query: 213 VALVLVNLFISKGHGWGWSSPLSLTMLAGALAAGTIFIRNGMRKGEAALIDFALFSNRAY 272 V +V LF + + + L+ L IF+++ +RK +D L N + Sbjct: 209 VGIVFFMLF-TTSYSISFLIVSVLSFL--------IFVKH-IRKVTDPFVDPGLGKNIPF 258 Query: 273 GAAVLSNFLINGAI-GTMMIANIWLQQGHHLTPLESGMMTLGYLVTVLAMIR--VGEKLL 329 VL +I G + G + + ++ H L+ E G + + + T+ +I +G L+ Sbjct: 259 MIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVII-FPGTMSVIIFGYIGGILV 317 Query: 330 QRYGARLPMMAGPVLTAIAIALISCTFLEKALYIGVVFASNVLFGLGLGCYATPSTDTAV 389 R G + G +++ L + LE + + + G GL T + Sbjct: 318 DRRGPLYVLNIGVTFLSVSF-LTASFLLETTSWF-MTIIIVFVLG-GLSFTKTVISTIVS 374 Query: 390 ANAPENKIGVASGIYKMGSSLGGAMGIAVTASLFALFL 427 ++ + + G + S L GIA+ L ++ L Sbjct: 375 SSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPL 412
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.8 bits (72), Expect = 0.006 Identities = 41/201 (20%), Positives = 82/201 (40%), Gaps = 21/201 (10%) Query: 242 IRVLALALVYFGTSAGLYTLGIWSPQII-RSFGASSLEIGFLNAFPA-VIGVIAMILWAR 299 + + + FGT AG ++ P ++ S+ EIG + FP + +I + Sbjct: 259 MIGVLCGGIIFGTVAGFVSM---VPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGI 315 Query: 300 HSDRTKERSWHVIGACLLAAAGLIYAGNV-STLFTVMVALTLVTVGISASKPPLWSMPTL 358 DR IG L+ + L + + +T + + + + V G+S +K + ++ + Sbjct: 316 LVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSS 375 Query: 359 FLSGPAAAAGIAAINSIGNLGGFVGPMMIGV---------------IREQTGSYSWGLYF 403 L A AG++ +N L G ++G + + T YS L Sbjct: 376 SLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLLDQRLLPMEVDQSTYLYSNLLLL 435 Query: 404 VAGLLALSALVVMILSARANR 424 +G++ +S LV + + + R Sbjct: 436 FSGIIVISWLVTLNVYKHSQR 456 Score = 29.5 bits (66), Expect = 0.028 Identities = 27/199 (13%), Positives = 73/199 (36%), Gaps = 20/199 (10%) Query: 16 RIIPFIMLLYFIAFLDRVNIGFAALTMNQDLGFSPTVFGLGAGIFFLGYFLFEVPSNLIL 75 +I+ ++ +L F + L+ + + + + D P + L + Sbjct: 14 QILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPAS----TNWVNTAFMLTFSIGTAVY 69 Query: 76 HKVGARIWIARVMITWG--FVSGCMAFVQGTTSFYIL---RFLLGVAEAGFFPGIILYLS 130 K+ ++ I R+++ G + G + F +L RF+ G A F +++ ++ Sbjct: 70 GKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVA 129 Query: 131 YWFPAARRAQVTAIFMAAAPLSTALGSPVSAALLEMHGFLGYAGWQWMFVLEALPALVLG 190 + P R + + + + +G + + Y W ++ ++ ++ Sbjct: 130 RYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAH------YIHWSYLLLI-----PMIT 178 Query: 191 VVVLFFLTDRPAKAKWLTD 209 ++ + FL K + Sbjct: 179 IITVPFLMKLLKKEVRIKG 197
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 46.6 bits (110), Expect = 2e-08 Identities = 37/188 (19%), Positives = 69/188 (36%), Gaps = 6/188 (3%) Query: 9 ILIVGASRGLGHAMAATFLQHGWEVIGTVRDLSSHTPLHDLAKTHPLRLRLATLDIRDEA 68 I GA++G+G A+A T G + + + K D+RD A Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSA 70 Query: 69 QLAALQATLPP--ASLDMLFVNAGTTNRDPSQTIGDVSTEEFYQVMLTNALAPMRVIERL 126 + + A + +D+L AG ++ D E + V T R + + Sbjct: 71 AIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130 Query: 127 QQAVKPQGLLGVMSSGQGSLTNNLTGQRELYRGSKAALNMFMRSFAARPSSASHPLVVMA 186 + ++ V S+ G ++ Y SKAA MF + + + +++ Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAA----YASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 187 PGWIRTEL 194 PG T++ Sbjct: 187 PGSTETDM 194
>PF05272#Virulence-associated E family protein Length = 892 Score = 27.7 bits (61), Expect = 0.041 Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 4/44 (9%) Query: 11 VRRQPLLREVAFSVAPG----EVLTLMGPSGSGKSTLFAWIIGA 50 V + L+ VA + PG + L G G GKSTL ++G Sbjct: 576 VGKYILMGHVARVMEPGCKFDYSVVLEGTGGIGKSTLINTLVGL 619
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 45.6 bits (108), Expect = 2e-07 Identities = 34/132 (25%), Positives = 59/132 (44%), Gaps = 1/132 (0%) Query: 40 LSALAADFHQTESGVGLAVTAYGWVGALAALLSGAMPARISRKALLVGLMLILAFSCLAA 99 L +A DF++ + TA+ ++ + G + ++ K LL+ ++I F + Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 100 TRSYSMFA-LMSARMIGALAHGAFWALIGLVAAQLVPPHRLGLATAIIFGGVSAASVVGV 158 +S F+ L+ AR I AF AL+ +V A+ +P G A +I V+ VG Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 159 PLASFIATLAGW 170 + IA W Sbjct: 157 AIGGMIAHYIHW 168
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 122 bits (307), Expect = 5e-36 Identities = 70/254 (27%), Positives = 120/254 (47%), Gaps = 10/254 (3%) Query: 2 SKKLADKVALVTGGSAGIGLASAKALAEQGAKVY---ITGRRQEELDAAVRFIGPAARGI 58 +K + K+A +TG + GIG A A+ LA QGA + + E++ ++++ A Sbjct: 3 AKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAF 62 Query: 59 RADAAVLSDLDAVFATIAEESGRLDVLFANAGGGDMLPLSAITEAHVDRIFATNVRGVVF 118 AD + +D + A I E G +D+L AG + ++++ + F+ N GV Sbjct: 63 PADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFN 122 Query: 119 TVQKALPLLTD--GASVILTGSTAAVKGTANFSIYSASKAAVRSLARSWALEVSDRGIRI 176 + + D S++ GS A + + Y++SKAA + LE+++ IR Sbjct: 123 ASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRC 182 Query: 177 NVVSPGPVRTPGLGGLVAEADRQ-----GLFDALAAGVPLGRLGEPEEIGRTVVFLASDE 231 N+VSPG T L A+ + G + G+PL +L +P +I V+FL S + Sbjct: 183 NIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQ 242 Query: 232 SSFINAAEIYVDGG 245 + I + VDGG Sbjct: 243 AGHITMHNLCVDGG 256
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 26.6 bits (58), Expect = 0.045 Identities = 17/64 (26%), Positives = 26/64 (40%) Query: 46 VEKQGLTVGIIILTIGVMAPIASGTLPPSTLIHSFMNWKSLLAIAVGVFVSWLGGRGVSL 105 V Q + L IG + + LPPS ++ N ++ A VS LG ++L Sbjct: 171 VTVQRSAIVDGGLHIGALQSLQPEDLPPSRVVLRDTNVTAVPASGAPAAVSVLGASELTL 230 Query: 106 MGSQ 109 G Sbjct: 231 DGGH 234
>LCRVANTIGEN#Low calcium response V antigen signature. Length = 326 Score = 31.6 bits (71), Expect = 0.007 Identities = 13/44 (29%), Positives = 22/44 (50%), Gaps = 2/44 (4%) Query: 253 ENGLARQR--LEQQRDADWAIRELLARMTQRLQGCETIEDVIKV 294 +NG+ R + LE + W +R +A M L +D++KV Sbjct: 95 QNGIKRVKEFLESSPNTQWELRAFMAVMHFSLTADRIDDDILKV 138
>PF06580#Sensor histidine kinase Length = 349 Score = 29.4 bits (66), Expect = 0.019 Identities = 26/137 (18%), Positives = 43/137 (31%), Gaps = 12/137 (8%) Query: 110 SHRIFWDYAFAGGSLLATFSQGIVVGAFINGFAVADRRFAGSTLDWLTPFNLFCGLGLVV 169 +++ +W G + G A S + GL L Sbjct: 8 ANKYYWYCQGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLM----------GLVLTH 57 Query: 170 AYLLLGTTWLIMKSEGALQQRMRELTRKVLLALMVVIAVVSVWTPLGWRYVAERWFTLPN 229 AY +K Q +R L V++ ++ +A S+W L + FTLP Sbjct: 58 AYRSFIKRQGWLKL-NMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPL 116 Query: 230 FF-WFVPVPILVLALSL 245 V ++ SL Sbjct: 117 ALSIIFNVVVVTFMWSL 133
>PF01540#Adhesin lipoprotein Length = 475 Score = 40.1 bits (93), Expect = 2e-04 Identities = 46/245 (18%), Positives = 92/245 (37%), Gaps = 55/245 (22%) Query: 854 EQTAEALRKEAEDQAKQVSQDIDASAKSITADVDGKI-SAVNKTITDEITSVNEA----- 907 ++ A+A K+A A+++ ++ D S KI +NK I + S EA Sbjct: 38 KEKADAALKQANALAEELKKNPDYS----------KILETLNKEIAEATKSFKEAGSYGD 87 Query: 908 ---LDSGLAQANKGVQEAKSAVADANKQIATVNKSLTDSITQVRQSVTDTAAEINATIDL 964 + S L+ A + + + V ANK+IA N + + ++ + +++ TI L Sbjct: 88 YPAIISKLSAAVENAKSEQQKVDQANKKIADENLKIKEGAKELLK-LSEKIQSFADTIAL 146 Query: 965 EIARVNKTLTDGDAALNAQ-IKTAENGLK-----------------------QSLSQVNT 1000 I ++ D Q I T E K +S + NT Sbjct: 147 TITKLEGKKFQIDETFKKQLISTIELLNKKSAEVKTFATVNTIKKDFLLSELESFKEFNT 206 Query: 1001 TLTNAVKQETA-------DRIADVNAKASQAADELLAATQGIEASIESLTQVMKTADENL 1053 + + E +A++ A+ + +L Q I+ + L ++ + ++ Sbjct: 207 SWLEKIVSEWEEVKKAWSKELAEIKAEDDK---KLAEENQKIKEGAKELLKLSEKI-QSF 262 Query: 1054 AREMS 1058 A ++ Sbjct: 263 ADTIA 267
>SECA#SecA protein signature. Length = 901 Score = 26.8 bits (59), Expect = 0.033 Identities = 15/83 (18%), Positives = 33/83 (39%), Gaps = 4/83 (4%) Query: 19 QHEPLSE---ESLGMFQGMIDEMYETFTGSVAEYRGLNQQAVIDTQAGLYFGPGAVSAG- 74 Q +P E ES MF M++ + ++++ + + V + + ++ Sbjct: 796 QKDPKQEYKRESFSMFAAMLESLKYEVISTLSKVQVRMPEEVEELEQQRRMEAERLAQMQ 855 Query: 75 LADEVSDPQAAINAIAAKYQQPR 97 D AA A+AA+ + + Sbjct: 856 QLSHQDDDSAAAAALAAQTGERK 878
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 113 bits (284), Expect = 1e-32 Identities = 68/252 (26%), Positives = 115/252 (45%), Gaps = 13/252 (5%) Query: 4 KNKVAVITGSTTGIGEAVADQLHKHGSKVVIVSRSSEQAKQKAKQLSSQGQQAVGIGCDV 63 + K+A ITG+ GIGEAVA L G+ + V + E+ ++ L ++ + A DV Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66 Query: 64 SQPEQVRQMIDDVIKHFGRLDYAVNNAGLTGEHGINITEQTIENWDKVIATSLSGVFYCL 123 + ++ + + G +D VN AG+ I + E W+ + + +GVF Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVL--RPGLIHSLSDEEWEATFSVNSTGVFNAS 124 Query: 124 KYEIPQMM-KSGGSIVNLSAVNGLVGIPGLAPYTVAKHGIIGLTQTAALEFACEGIRINA 182 + MM + GSIV + + V +A Y +K + T+ LE A IR N Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNI 184 Query: 183 VAPGYVQTPRMSEF------PENIVRSFANSH----PMKRMAKMQEVADFILFLLSDNSA 232 V+PG +T E +++ + P+K++AK ++AD +LFL+S + Sbjct: 185 VSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAG 244 Query: 233 FCTGGVYPIDGG 244 T +DGG Sbjct: 245 HITMHNLCVDGG 256
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 27.7 bits (61), Expect = 0.005 Identities = 16/48 (33%), Positives = 25/48 (52%) Query: 10 KIIGEQLGVKQEEVTNNASFVEDLGADSLDTVELVMALEEEFDTEIPD 57 ++ G Q + QEE+ N F+E L ++ L +E+F TEI D Sbjct: 380 QLTGSQRALSQEEIQNKIDFMEFLAQNNAKLDNLSEKEKEKFRTEIKD 427
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 156 bits (395), Expect = 4e-49 Identities = 91/250 (36%), Positives = 136/250 (54%), Gaps = 13/250 (5%) Query: 4 EGKIALVTGASRGIGRAIAETLVARGAKVIGTATSESGAQAISDYLGANGK---GLMLNV 60 EGKIA +TGA++GIG A+A TL ++GA + + + + L A + +V Sbjct: 7 EGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADV 66 Query: 61 TDPASIESVLENVRAEFGEVDILVNNAGITRDNLLMRMKDDEWNDIIETNLSSVFRLSKA 120 D A+I+ + + E G +DILVN AG+ R L+ + D+EW N + VF S++ Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 121 VMRAMMKKRHGRIITIGSVVGTMGNAGQANYAAAKAGLIGFSKSLAREVASRGITVNVVA 180 V + MM +R G I+T+GS + A YA++KA + F+K L E+A I N+V+ Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 181 PGFIETDMTRAL-----TDEQR-AGTLA----AVPAGRLGTPNEIASAVAFLASDEASYI 230 PG ETDM +L EQ G+L +P +L P++IA AV FL S +A +I Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 231 TGETLHVNGG 240 T L V+GG Sbjct: 247 TMHNLCVDGG 256
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 50.1 bits (119), Expect = 5e-08 Identities = 43/274 (15%), Positives = 79/274 (28%), Gaps = 35/274 (12%) Query: 522 KRPEQPALAAFVMPDAPPAPMLEEPAAAPVAAAAPVAAAAPAQPGLLSRFFSALKNIFSG 581 + D P P E A APV APA P Sbjct: 992 VDTTNITTPNNIQADVPSVPSNNEEIARV--DEAPVPPPAPATP---------------- 1033 Query: 582 AEEAKPAEVQVEKKAEEKPERQQERRKPRANNRRDRNDRRDNRDNRDNRDNRDNRDTRAD 641 +E + +++++ + +Q+ + A NR + + Sbjct: 1034 SETTETVAENSKQESKTVEKNEQDATETTAQNRE-----------VAKEAKSNVKANTQT 1082 Query: 642 NAEGREPRESREENRRNRREKPSQNVEARDVRQTSGDDAEKAKSRDEQQPRRERTRRRND 701 N + E++E +E + + + + E K + P++E++ Sbjct: 1083 NEVAQSGSETKETQTTETKETATVE-KEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQP 1141 Query: 702 DKRQAQQEAKAQTREEPVVQETEQEERVQTLPR-----RKPRQLAQKVRVESAVVEPVAE 756 A++ +EP Q + Q +P + V ++VVE Sbjct: 1142 QAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPEN 1201 Query: 757 IVPEAVVAEVIAPQSEPVKAELPAGVESVADQDE 790 P V + S K V SV E Sbjct: 1202 TTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVE 1235 Score = 46.6 bits (110), Expect = 7e-07 Identities = 53/287 (18%), Positives = 91/287 (31%), Gaps = 33/287 (11%) Query: 691 PRRERTRRRNDDKRQAQQEAKAQTREEPVVQETEQEERVQTLPRRKPRQLAQKVRVESAV 750 P E+ R + D Q V E+ RV P P S Sbjct: 983 PEVEK-RNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAP-----ATPSET 1036 Query: 751 VEPVAEIVP-EAVVAEVIAPQSEPVKAE----LPAGVESVADQDENGESREANGMPRRSR 805 E VAE E+ E + A+ +V + E ++ + ++ Sbjct: 1037 TETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQ 1096 Query: 806 RSPRHLRVSGQRRRRYRDERYPTQSPMPLTVACASPEMASGKVWIRYPVVRPQDQQPEEV 865 + + ++ + + E TQ P++ S V P+ +Q E V Sbjct: 1097 TTETKETATVEKEEKAKVETEKTQE---------VPKVTSQ--------VSPKQEQSETV 1139 Query: 866 QVQDASVAKTVEAVAAPVAVVETVTAAPVTVEPATMEPVTAEPVVVEPVAAAEPLVVDAA 925 Q Q + V +T T A T +PA V +PV + + + Sbjct: 1140 QPQAEPARENDPTVNIKEPQSQTNTTAD-TEQPAKETS----SNVEQPVTESTTVNTGNS 1194 Query: 926 EVVAPAAVEPAPQEPVTEAPAVEAPQAIAPVTLDAEPVVVEPEAVET 972 V P PA +P + + P+ ++ + P VEP + Sbjct: 1195 VVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSS 1241 Score = 38.1 bits (88), Expect = 2e-04 Identities = 45/286 (15%), Positives = 79/286 (27%), Gaps = 20/286 (6%) Query: 724 EQEERVQTLPRRKPRQLAQKVRVESAVVEPVAEIVPEAVVAEVIAPQSEPVKAELPAGVE 783 E E+R QT+ +V EI A V E AP P A E Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEI---ARVDE--APVPPPAPATPSETTE 1038 Query: 784 SVADQDENGESREANGMPRRSRRSPRHLRVSGQRRRRYRDERYPTQSPMPLTVACASPEM 843 +VA+ + + + ++ V+ + + + VA + E Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKAN------TQTNEVAQSGSET 1092 Query: 844 ASGKVWIRYPVVRPQDQQPEEVQVQDASVAKTVEAVAAPVAVVETVTAAPVTVEPATMEP 903 + + E+ + KT E + V TV+P EP Sbjct: 1093 KETQ-----TTETKETATVEKEEKAKVETEKTQEV-PKVTSQVSPKQEQSETVQPQA-EP 1145 Query: 904 VTAEPVVVEPVAAAEPLVVDAAEVVAPAAVEPAPQEPVTEAPAVEAPQAIAPVTLDAEPV 963 V A ++PVTE+ V ++ + P Sbjct: 1146 ARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPA 1205 Query: 964 VVEPEAVETTPVVAA--PVETIAPVAETVEQAPVTEAAPAEPVKAE 1007 +P + ++ V VE A + + + Sbjct: 1206 TTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCD 1251
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 41.5 bits (97), Expect = 4e-07 Identities = 18/131 (13%), Positives = 45/131 (34%), Gaps = 10/131 (7%) Query: 2 AERAGVSKTNLLYYYPSKEALYVAVLQQILAIWLAPLKAFREDI--SPLVAIREYIRLKL 59 A+ AGV++ + +++ K L+ + + + ++ PL +RE + L Sbjct: 38 AKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLREILIHVL 97 Query: 60 EVSRDHPQASKLF------CLEMLQGAPLLMGELTGDLKALVDEKSAIVSGWIDRGKL-A 112 E + + +L E + ++ D + I+ L A Sbjct: 98 ESTV-TEERRRLLMEIIFHKCEFVGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPA 156 Query: 113 PVDPQHLIFMI 123 + + ++ Sbjct: 157 DLMTRRAAIIM 167
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 72.4 bits (177), Expect = 3e-17 Identities = 44/184 (23%), Positives = 72/184 (39%), Gaps = 23/184 (12%) Query: 4 LPARPESLTFAPQQSALIVVDMQNAYASQGGYLDLAGFDVSATRPVIDNINTAVAAARAA 63 +P S P ++ L++ DMQN + +D S + NI Sbjct: 17 MPQNKVSWVPDPNRAVLLIHDMQNYF------VDAFTAGASPVTELSANIRKLKNQCVQL 70 Query: 64 GMLIIWFQNGWDDQYVEAGGPGSPNYHKSNALKTMRQRPELQGKLLAKGGWDYQLVDELT 123 G+ +++ PGS N L G L G ++ +++ EL Sbjct: 71 GIPVVY-----------TAQPGSQNPDDRALLTDF------WGPGLNSGPYEEKIITELA 113 Query: 124 PQEGDIVLPKPRYSGFFNTPLDSILRSRGIRHLVFTGIATNVCVESTLRDGFFLEYFGIV 183 P++ D+VL K RYS F T L ++R G L+ TGI ++ T + F + Sbjct: 114 PEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFF 173 Query: 184 LEDA 187 + DA Sbjct: 174 VGDA 177
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 496 bits (1278), Expect = e-179 Identities = 222/385 (57%), Positives = 266/385 (69%), Gaps = 29/385 (7%) Query: 2 MKRNILAVVIPALLVAGAANAAEIYNKNGNKLDFYGKMVGEHVWTTNGDTSSDDTTYARI 61 MKR +LA+VIPALL AGAA+AAEIYNK+GNKLD YGK+ G H ++ + + D TY R+ Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDD-SSKDGDQTYMRV 59 Query: 62 GLKGETQINDQLIGYGQWEYNMDASNVEGSQT-TKTRLAFAGLKAGEYGSFDYGRNYGAI 120 G KGETQINDQL GYGQWEYN+ A+ EG + TRLAFAGLK G+YGSFDYGRNYG + Sbjct: 60 GFKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVL 119 Query: 121 YDVEAATDMLVEWGGDGWNYTDNYMTGRTNGVATYRNSDFFGLVDGLSFALQYQGKNDHD 180 YDVE TDML E+GGD + Y DNYMTGR NGVATYRN+DFFGLVDGL+FALQYQGKN+ Sbjct: 120 YDVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQ 179 Query: 181 RA---------------IRKQNGDGFSTAATYAFDNGIALSAGYSSSNRSVDQKA----D 221 A IR NGDGF + TY G + A Y++S+R+ +Q Sbjct: 180 SADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDIGMGFSAGAAYTTSDRTNEQVNAGGTI 239 Query: 222 GNGDKAEAWATSAKYDANNIYAAVMYSQTYNMTP------EEDNHFAGKTQNFEAVVQYQ 275 GDKA+AW KYDANNIY A MYS+T NMTP D A KTQNFE QYQ Sbjct: 240 AGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQ 299 Query: 276 FDFGLRPSIGYVQTKGKDLQSRAGFSGGDADLVKYIEVGTWYYFNKNMNVYAAYKFNQLD 335 FDFGLRP++ ++ +KGKDL + +G D DLVKY +VG YYFNKN + Y YK N LD Sbjct: 300 FDFGLRPAVSFLMSKGKDL-TYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLD 358 Query: 336 DND-YTKAAGVATDDQAAVGIVYQF 359 D+D + K AG++TDD A+G+VYQF Sbjct: 359 DDDPFYKDAGISTDDIVALGMVYQF 383
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 37.0 bits (85), Expect = 2e-04 Identities = 40/307 (13%), Positives = 101/307 (32%), Gaps = 18/307 (5%) Query: 63 QFEQLKEDYAYAQQTQRDARQQAFALAEVVQRRAHFSYSDSAEMLSGNSDLNEKLRQRLE 122 ++ +Q + Q+ E+ SD + D N++L + L Sbjct: 36 NTNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELS 95 Query: 123 QAESERSRARDAMRAHAAQLSQYNQVLASLKSSYDTKKELLNDLYKELQDIGVRADTGAE 182 A+ + + ++ A+++ + A L+ + + +++ + A Sbjct: 96 NAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAA 155 Query: 183 ERA--RARRDELHMQLSNNRSRRNQLEKALTFCEAEMDNLTRKLRKLERDY-------CE 233 +A + + + ++ LE EA L + L Sbjct: 156 RKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKT 215 Query: 234 MREQVVTAKAGWCAVMRLVKDNGVERRLHRRELAYLSAD------ELRSMSDKALGALRL 287 + + A + + ++ ++ L A+ + GA+ Sbjct: 216 LEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNF 275 Query: 288 AVADNEHLRDVLRISEDPKRPERKIQFFVAVYQHLRERIRQDIIRTDDPVEAIEQMEIEL 347 + AD+ ++ + + + ++ V R+ +R+D+ D EA +Q+E E Sbjct: 276 STADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL---DASREAKKQLEAEH 332 Query: 348 SRLTEEL 354 +L E+ Sbjct: 333 QKLEEQN 339 Score = 30.4 bits (68), Expect = 0.024 Identities = 40/228 (17%), Positives = 72/228 (31%), Gaps = 15/228 (6%) Query: 19 LADRVDEIQERLDEAQEAARFIQQHGNQLAKLEPIVSVLQSDPEQFEQLKEDYAYAQQTQ 78 +I+ E + L + + + E K A + Sbjct: 171 STADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL 230 Query: 79 RDARQQAFALAEVVQRRAHFSYSDSAEMLSGNSDLNEKLRQRLEQAESER------SRAR 132 A + A + + ++ A + + ++L + L + + ++ + Sbjct: 231 EKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEK 290 Query: 133 DAMRAHAAQLSQYNQVL----ASLKSSYDTKKELLNDLYKELQDIGVRADTGAEERARAR 188 A+ A A L +QVL SL+ D +E L E Q + + R R Sbjct: 291 AALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLR 350 Query: 189 RDELHMQLSNNRSRRNQLEKALTFCEAEMDNLTRKLRKLERDYCEMRE 236 RD L +R + QLE E + + L RD RE Sbjct: 351 RD-----LDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE 393
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 45.8 bits (108), Expect = 6e-07 Identities = 44/281 (15%), Positives = 93/281 (33%), Gaps = 20/281 (7%) Query: 304 QEKIERYEADLDELQIRLEEQNEVVAEAVDRQEENEARAEAAELEVDELKSQLADYQQAL 363 + K + L+ +E E ++ A ++ +N+ ++ EL+++ AD ++AL Sbjct: 70 KLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKAL 129 Query: 364 DVQQTRAIQYNQALQALERAKALCHLPDLTPESADEWLETFQAKEQEATEKMLSLEQKMS 423 + + + ++ LE KA L AD + + A + K+ Sbjct: 130 EGAMNFSTADSAKIKTLEAEKA-----ALAARKADL-----EKALEGAMNFSTADSAKIK 179 Query: 424 VAQTAHSQFEQAYQLVAAINGPLARNEAWDVARELLRDGVNQRHQAEQAQGLRSRLNELE 483 + + E D + + L +R +LE Sbjct: 180 TLEAEKAALEARQAE--------LEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLE 231 Query: 484 QRLREQQDAERQLAEFCKRQGKRYDIDDLETLHQELEARIASLADSVSNAQEQRMALRQE 543 + L + + K + + LE ELE + + + + L E Sbjct: 232 KALEGAMNFSTADSA--KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAE 289 Query: 544 LEQLQSRTQTLMRRAPVWLAAQNSLNQLCEQSGEQFASGQE 584 L++ L ++ V A + SL + + S E + Sbjct: 290 KAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEA 330 Score = 36.2 bits (83), Expect = 5e-04 Identities = 61/363 (16%), Positives = 117/363 (32%), Gaps = 29/363 (7%) Query: 218 HLISEATNYVAADYMRHANERRIHLDKALEYRRDLFTSRSQLAAEQYKHVDMARELQEHN 277 + E + + + + +L+ + K + L E Sbjct: 53 EKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKA 112 Query: 278 GAEGDLEADY----QAASDHLNLVQTALRQQEKIERYEADLDELQIRLEEQNEVVAEAVD 333 +LEA +A +N + + +E +A L + LE+ E Sbjct: 113 SKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFST 172 Query: 334 RQEENEARAEAAELEVDELKSQLADYQQALDVQQTRAIQYNQALQA-LERAKALCHLPDL 392 EA + ++ +++L + T + L+A A + Sbjct: 173 ADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEK 232 Query: 393 TPESADEWLETFQAKEQEATEKMLSLEQKM-SVAQTAHSQFEQAYQLVAAINGPLARNEA 451 E A + AK + + +LE + + + + A I A A Sbjct: 233 ALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAA 292 Query: 452 WDVARELLRDGVNQRHQAEQAQGLRSRLNELEQRLREQQDAERQLAEFCK------RQGK 505 + + L +Q A + Q LR L + + ++Q +AE Q E RQ Sbjct: 293 LEAEKADLEH-QSQVLNANR-QSLRRDL-DASREAKKQLEAEHQKLEEQNKISEASRQSL 349 Query: 506 RYDID--------------DLETLHQELEARIASLADSVSNAQEQRMALRQELEQLQSRT 551 R D+D LE ++ EA SL + ++E + + + LE+ S+ Sbjct: 350 RRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKL 409 Query: 552 QTL 554 L Sbjct: 410 AAL 412
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 116 bits (293), Expect = 5e-38 Identities = 33/88 (37%), Positives = 57/88 (64%), Gaps = 1/88 (1%) Query: 2 TKSELIERLASQQSHIPAKAVEDAVKEMLEHMASTLAQGERIEIRGFGSFSLHYRAPRTG 61 K +LI ++A + + + K AV + ++S LA+GE++++ GFG+F + RA R G Sbjct: 3 NKQDLIAKVA-EATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKG 61 Query: 62 RNPKTGDKVELEGKYVPHFKPGKELRDR 89 RNP+TG++++++ VP FK GK L+D Sbjct: 62 RNPQTGEEIKIKASKVPAFKAGKALKDA 89
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 118 bits (296), Expect = 2e-34 Identities = 74/255 (29%), Positives = 119/255 (46%), Gaps = 16/255 (6%) Query: 3 LKDKVAIITGAASARGLGFATAKLFAENGAKVVIIDLNGEAS---KTAAAALGEGHLGLA 59 ++ K+A ITGAA +G+G A A+ A GA + +D N E ++ A Sbjct: 6 IEGKIAFITGAA--QGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFP 63 Query: 60 ANVADEVQVQAAIEQILAKYGRVDVLVNNAGITQPLKLMDIKRANYDAVLDVSLRGTLLM 119 A+V D + +I + G +D+LVN AG+ +P + + ++A V+ G Sbjct: 64 ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNA 123 Query: 120 SQAVIPTMRAQKSGSIVCISSVSAQRGGGIFGGPHYSAAKAGVLGLARAMARELGPDNVR 179 S++V M ++SGSIV + S A G Y+++KA + + + EL N+R Sbjct: 124 SRSVSKYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIR 181 Query: 180 VNCITPGLIQTDITAGKLTDD---------MTANILAGIPMNRLGDAIDIARAALFLGSD 230 N ++PG +TD+ D+ GIP+ +L DIA A LFL S Sbjct: 182 CNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSG 241 Query: 231 LSSYSTGITLDVNGG 245 + + T L V+GG Sbjct: 242 QAGHITMHNLCVDGG 256
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 43.3 bits (102), Expect = 1e-06 Identities = 58/400 (14%), Positives = 128/400 (32%), Gaps = 50/400 (12%) Query: 22 LTMIFLVYAINYADRTNIGAVLPFIIDEFHINNFEAGAIASMFFLGYAVSQIP----AGF 77 L +I A++ I VLP ++ + +N + L YA+ Q G Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLAL-YALMQFACAPVLGA 65 Query: 78 FIAKRGTRGLVSLSIFGFSAFTWLMGTVSSVFGLKLVRLGLGLSEGPCPVGLASTINNWF 137 + G R ++ +S+ G + +M T ++ L + R+ G++ V + I + Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAV-AGAYIADIT 124 Query: 138 PPKEKATATGVYIAATMFAPIIVPPLAVWIAVTWGWRWVFFSFAIPGIVAAIAWYLLVKS 197 E+A G +++A ++ P+ + + FF+ A + + L+ Sbjct: 125 DGDERARHFG-FMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL-- 181 Query: 198 KPAESGFVSQSELAEINAGRESHNNSVR-ENILIAERFTWLDKIIRVKKMAPIDTAKGLF 256 ESH R + +A + + Sbjct: 182 -------------------PESHKGERRPLRREALNPLASFRWARGMTVVAAL-----MA 217 Query: 257 TSKNILGDCLAYFMMVSVLYGLLTWIPLYLVKERGFDVMSMGFVASMPCIGGFIGAIGGG 316 +F+M V ++ +D ++G + G + ++ Sbjct: 218 V----------FFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLA---AFGILHSLAQA 264 Query: 317 WISDKLLGR-RRKPTMMFTAVSTVVMMLIMLNIPASTLAVCIGLFFVGFCLNIGWPAFTA 375 I+ + R + +M ++ +++ +A I + IG PA A Sbjct: 265 MITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASG--GIGMPALQA 322 Query: 376 YGMAVSDSKTYPIASSIINSGGNLGGFVAPMAAGFLLDKT 415 D + + + +L V P+ + + Sbjct: 323 MLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS 362
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 37.1 bits (86), Expect = 1e-04 Identities = 21/103 (20%), Positives = 44/103 (42%), Gaps = 3/103 (2%) Query: 54 SGDSTELSFKRGGQVESLDIRQGASVAQGQTLARLNAREAQQRVNERQTAATLAQRQFDR 113 SG S E+ V+ + +++G SV +G L +L A + +T ++L Q + ++ Sbjct: 93 SGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTAL--GAEADTLKTQSSLLQARLEQ 150 Query: 114 FQTLAGRQAISQAEMDVQRANRDAANAALKIAREELAQMSLIA 156 + ++I ++ + + + E L SLI Sbjct: 151 TRYQILSRSIELNKLPELKLPDEPYFQNVS-EEEVLRLTSLIK 192 Score = 34.0 bits (78), Expect = 0.001 Identities = 20/152 (13%), Positives = 45/152 (29%), Gaps = 13/152 (8%) Query: 80 AQGQTLARLNAR--EAQQRVNERQTAATLAQRQFDRFQTLAGRQAISQAEMDVQRANRDA 137 Q + ++ + ++ A+ ++ L + + + N Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQ--TTDNIGL 313 Query: 138 ANAALKIAREELAQMSLIAPFSGIAAGVHIRNH-QVVAAGQPVITLTRTD-LLDVVFSIP 195 L E + AP S + + VV + ++ + D L+V + Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQ 373 Query: 196 ENLFTSL------DIRNTAYRPVVRINTLPGR 221 + I+ A+ P R L G+ Sbjct: 374 NKDIGFINVGQNAIIKVEAF-PYTRYGYLVGK 404
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 393 bits (1011), Expect = e-124 Identities = 187/906 (20%), Positives = 377/906 (41%), Gaps = 59/906 (6%) Query: 9 SGDSFTNPELVRYAE-QLRRELVLVPGVGKVAIGGVIPQQINVDISLAKMAARGITLNQL 67 T ++ Y ++ L + GVG V + G + + + + +T + Sbjct: 145 DNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA-QYAMRIWLDADLLNKYKLTPVDV 203 Query: 68 AAILARLNVVSSAGEIRVGSESI-------RLHPTGEFQSIDELGDLLVSPHGASATTRL 120 L N +AG++ G+ ++ + F++ +E G + + + + RL Sbjct: 204 INQLKVQNDQIAAGQL-GGTPALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRL 262 Query: 121 RDIATLSRGLTDSPASIYHANGRQAVTMGVSFIPGVNVIDVGHALEARLQQMAADKPAGI 180 +D+A + G ++ I NG+ A +G+ G N +D A++A+L ++ P G+ Sbjct: 263 KDVARVELG-GENYNVIARINGKPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGM 321 Query: 181 DIAIFYDQAAEVAHSVNGFITNFLMALAIVVGVLLVFMG-VRSGIIIALSLALNVLGTLL 239 + YD V S++ + A+ +V V+ +F+ +R+ +I +++ + +LGT Sbjct: 322 KVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFA 381 Query: 240 IMYIWGIELQRISLGALIIALSMLVDNAIVIVEGVL-IARQQGSPLLGAINYVLRRSALP 298 I+ +G + +++ +++A+ +LVD+AIV+VE V + + P A + + Sbjct: 382 ILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGA 441 Query: 299 LLGATIIAILAFAPIGLSQDSTGEYCKSLFQVLLISLMLSWFSALTITPVLIKWWLFKNA 358 L+G ++ F P+ STG + ++ ++ LS AL +TP L L Sbjct: 442 LVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLL---K 498 Query: 359 PSAAAAEEKADPYRGSFYR-------GYQQALRILLQQKTLTLLLMGALLAGAIWGFTFV 411 P +A E + G F Y ++ +L LL+ ++AG + F + Sbjct: 499 PVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRL 558 Query: 412 RQNFFPSSNTPIFFVDLWLPYGTDINATEKMTRDIERSI--AGQPGVVTTVSTIGQGSMR 469 +F P + +F + LP G T+K+ + + V + + G Sbjct: 559 PSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEKANVESVFTVNG----- 613 Query: 470 FILTYSGQRQYSNYAQIMVRMDDQR-GIAPVTRHVEDWIARNYPQVNASTKRIMFGP--- 525 ++SGQ Q + A + ++ ++R G V ++ P Sbjct: 614 --FSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFVIPFNMPAIV 671 Query: 526 -----SGDSAIEVRIKGPDPDTLRALASQIGDILAADPAT-DSVRNDWQNRSKVIRPQYS 579 +G + G D L +Q+ + A PA+ SVR + + + + Sbjct: 672 ELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLEDTAQFKLEVD 731 Query: 580 PALGRELGVDKQDIDNALEMNFSGSRAGLYREGADLLPVIVRPPEAERQDANHLNNVLVW 639 + LGV DI+ + G+ + + + + V+ R ++ + V Sbjct: 732 QEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRMLPEDVDKLYVR 791 Query: 640 SQSRQQYIPLSNVINGFALEWED--PLILRRDRTRVLTVQTDPSPLSGQTSGDILARVKP 697 S + + +P S W P + R + + +Q + +P G +SGD +A ++ Sbjct: 792 SAN-GEMVPFSAFTT---SHWVYGSPRLERYNGLPSMEIQGEAAP--GTSSGDAMALMEN 845 Query: 698 RIDALPLPHGYRIEWGGDAENSSEAQQGLFTTLPLGYLVMFIITVLMFSSLKNAVAIWLT 757 LP G +W G + + + + ++V+F+ ++ S V++ L Sbjct: 846 LASKLP--AGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLV 903 Query: 758 VPLALIGVTPGFLLTGIPFGFMALIGLLSLSGMLIRNGIVLVEEIEQ--QKQEKDQRQAI 815 VPL ++GV L ++GLL+ G+ +N I++VE + +K+ K +A Sbjct: 904 VPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEAT 963 Query: 816 IDAATSRLRPILLTAFTTVLGLAPLLRD-----VFFQSMAVVIMFGLAFATVLTLLVLPV 870 + A RLRPIL+T+ +LG+ PL ++ + +M G+ AT+L + +PV Sbjct: 964 LMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPV 1023 Query: 871 IYACFH 876 + Sbjct: 1024 FFVVIR 1029 Score = 67.2 bits (164), Expect = 2e-13 Identities = 79/514 (15%), Positives = 174/514 (33%), Gaps = 46/514 (8%) Query: 384 RILLQQKTLTLLLMGALLAGAIWGFTFVRQNFFPSSNTPIFFVDLWLPYGTDINATEKMT 443 +++ +L L+ + +P+ P V P + +T Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62 Query: 444 RDIERSIAGQPGVVTTVST-IGQGSMRFILTYSGQRQYSNYAQIMVRMDDQRGIAPVTRH 502 + IE+++ G ++ ST GS+ LT+ + AQ+ V+ Q Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQ-SGTDPDIAQVQVQNKLQL-------- 113 Query: 503 VEDWIARNYPQVNASTKRIMFGPSGDSAIEVRIKGPDPDTLRA-----LASQIGDILAAD 557 PQ + S + +P T + +AS + D L+ Sbjct: 114 ----ATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRL 169 Query: 558 PATDSVRNDWQNRSKVIRPQYSPALGRELGVDKQDIDNALEMNFSGSRAG-----LYREG 612 V+ + I L + + D+ N L++ AG G Sbjct: 170 NGVGDVQLFGAQYAMRIWLD--ADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPG 227 Query: 613 ADLLPVIVRPPEAERQDANHLNNVLVWSQSRQQYIPLS---NVINGFALEWEDPLILRRD 669 L I+ + ++ V + S + L V G + Sbjct: 228 QQLNASIIA--QTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIAR-INGK 284 Query: 670 RTRVLTVQTDPSPLSGQTSGDILARVKPRIDALP--LPHGYRIEWGGD-AENSSEAQQGL 726 L ++ +G + D +K ++ L P G ++ + D + + Sbjct: 285 PAAGLGIK----LATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEV 340 Query: 727 FTTLPLGYLVMFIITVLMFSSLKNAVAIWLTVPLALIGVTPGFLLTGIPFGFMALIGLLS 786 TL +++F++ L +++ + + VP+ L+G G + + G++ Sbjct: 341 VKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVL 400 Query: 787 LSGMLIRNGIVLVEEIEQQKQEK--DQRQAIIDAATSRLRPILLTAFTTVLGLAPLL--- 841 G+L+ + IV+VE +E+ E ++A + + ++ A P+ Sbjct: 401 AIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFG 460 Query: 842 --RDVFFQSMAVVIMFGLAFATVLTLLVLPVIYA 873 ++ ++ I+ +A + ++ L++ P + A Sbjct: 461 GSTGAIYRQFSITIVSAMALSVLVALILTPALCA 494
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 28.0 bits (62), Expect = 0.048 Identities = 17/85 (20%), Positives = 30/85 (35%), Gaps = 8/85 (9%) Query: 67 QTDIDSLRGQIQENQYQLNQIVER------QKQILLQIDSLSSGG--GAASGAQAPSSSG 118 +D + E Y N + Q I Q+ + GG GA S AP + Sbjct: 266 TAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGALSNQPAPPNEA 325 Query: 119 DQSAAATSAAPAATSGAPAMTGDAN 143 + T+ A + + + ++N Sbjct: 326 PIATPPTNQQNAQNTPQTSTSTNSN 350
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 116 bits (291), Expect = 7e-34 Identities = 36/119 (30%), Positives = 55/119 (46%), Gaps = 4/119 (3%) Query: 56 EEQARLQMQQLQQNNIVYFDLDKYDIRSDFAAMLDAHANFLRSN--PSYKVTVEGHADER 113 +Q + + V F+ +K ++ + A LD + L + V V G+ D Sbjct: 205 APAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRI 264 Query: 114 GTPEYNIALGERRANAVKMYLQGKGVSADQISIVSYGKEKPAVLGHDEAAYAKNRRAVL 172 G+ YN L ERRA +V YL KG+ AD+IS G+ P V G+ K R A++ Sbjct: 265 GSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP-VTGN-TCDNVKQRAALI 321
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 64.3 bits (156), Expect = 2e-13 Identities = 30/228 (13%), Positives = 71/228 (31%), Gaps = 8/228 (3%) Query: 69 QQQASARRAAEQREKQAQQQAEELREKQAAEQERLKQLEQERLQAQEAAKEAKEQ----Q 124 + A + + AE +++ ++ + + Q +E AKEAK Sbjct: 1021 DEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANT 1080 Query: 125 KQAEEAAAKAAAAAKAKADAQAKEAQEAAAKAAAEAKAKADAQAKAAEQAAAKAAADA-K 183 + E A + + + + E KA E + + ++ + + ++ + Sbjct: 1081 QTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQ 1140 Query: 184 KQAEAAAAKAAAEAKKQAEAEAAKAAAEAQKKAEAAAAKKAQQEAEKKPSRKRLSRRQLK 243 QAE A K+ +++ A Q E ++ + + + + ++ Sbjct: 1141 PQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVE---QPVTESTTVNTGNSVVE 1197 Query: 244 KRLLRKPPRKKRPLRRPPPRKPQPLKKRRQRKRLQQRRLQLIKRPRRQ 291 P + + KP+ +R R R Sbjct: 1198 NPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRS 1245 Score = 54.3 bits (130), Expect = 3e-10 Identities = 29/169 (17%), Positives = 59/169 (34%), Gaps = 3/169 (1%) Query: 102 RLKQLEQERL-QAQEAAKEAKEQQKQAEEAAAKAAAAAKAKAD-AQAKEAQEAAAKAAAE 159 L E E+ Q + QA+ + + A+ D A A E Sbjct: 979 DLYNPEVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTE 1038 Query: 160 AKAKADAQAKAAEQAAAKAAADAKKQAEAAAAKAAAEAKKQAEA-EAAKAAAEAQKKAEA 218 A+ Q + + A + Q A +A + K + E A++ +E ++ Sbjct: 1039 TVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTT 1098 Query: 219 AAAKKAQQEAEKKPSRKRLSRRQLKKRLLRKPPRKKRPLRRPPPRKPQP 267 + A E E+K + +++ K + P++++ P +P Sbjct: 1099 ETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAR 1147 Score = 44.3 bits (104), Expect = 4e-07 Identities = 27/174 (15%), Positives = 53/174 (30%), Gaps = 14/174 (8%) Query: 65 NRQQQQQASARRAAEQREKQAQQQAEELREKQAAEQERLKQLEQERLQAQEAAKEAK--E 122 N Q + + K+ +E EK E E K E ++ +Q + K+ + Sbjct: 1083 NEVAQSGSETKETQTTETKETATVEKE--EKAKVETE--KTQEVPKVTSQVSPKQEQSET 1138 Query: 123 QQKQAEEAAAKAAAAAKAKADAQAKEAQEAAAKAAAEAKAKADAQAKAAEQAAAKAAADA 182 Q QAE A+ + Q++ A + A+ + EQ ++ Sbjct: 1139 VQPQAE--PARENDPTVNIKEPQSQTNTTADTEQPAK------ETSSNVEQPVTESTTVN 1190 Query: 183 KKQAEAAAAKAAAEAKKQAEAEAAKAAAEAQKKAEAAAAKKAQQEAEKKPSRKR 236 + + A Q + + + + + E S R Sbjct: 1191 TGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDR 1244 Score = 34.7 bits (79), Expect = 5e-04 Identities = 23/172 (13%), Positives = 46/172 (26%), Gaps = 11/172 (6%) Query: 68 QQQQASARRAAEQREKQA-----QQQAEELREKQAAEQERLKQLEQERLQAQEAAKE--- 119 ++ Q + ++ KQ Q QAE RE + Q + E + Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETS 1176 Query: 120 -AKEQQKQAEEAAAKAAAAAKAKADAQAKEAQEAAAKAAAEAKAKADAQAKAAEQAAAKA 178 EQ + + + Q ++ ++ + + Sbjct: 1177 SNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEP 1236 Query: 179 AADAKKQAEAAAAKAAAEAKKQAEAEAAKAAAEAQKKAEAAAAKKAQQEAEK 230 A + A A + A A+AQ A +Q ++ Sbjct: 1237 ATTSSNDRSTVALCDLTS--TNTNAVLSDARAKAQFVALNVGKAVSQHISQL 1286
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 31.4 bits (71), Expect = 0.003 Identities = 11/41 (26%), Positives = 23/41 (56%), Gaps = 1/41 (2%) Query: 14 VDDAPHMQDYTLEAEEGRDM-MLLDALIQLKEKDPSLSFRR 53 +++ + T+E + + MLLDAL+++ + DP L + Sbjct: 339 IENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYV 379
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 29.3 bits (65), Expect = 0.036 Identities = 34/115 (29%), Positives = 58/115 (50%), Gaps = 4/115 (3%) Query: 361 ILTLSARWSAAY-GQSSMPLMVLGLAVMGFAELFIDPVAMSQITRIEIPGVTGVLTGIYM 419 +LT+ + +A + G +S+ L +GLAVM E+ +S I + P + VL + Sbjct: 324 LLTIVSVVAAVFTGGASLALAAVGLAVMVADEIVKAATGVSFIQQALNPIMEHVLKPLME 383 Query: 420 LLSGAIANYLAGVIAD-QTSQASFDAAGAVNYSID--AYITVFSQITWGALACVG 471 L+ AI L G+ D +T++ + GA+ +I A I V + + GA A +G Sbjct: 384 LIGKAITKALEGLGVDKKTAEMAGSIVGAIVAAIAMVAVIVVVAVVGKGAAAKLG 438
>PF06580#Sensor histidine kinase Length = 349 Score = 30.2 bits (68), Expect = 0.024 Identities = 24/130 (18%), Positives = 49/130 (37%), Gaps = 28/130 (21%) Query: 527 HIQLDLPDPLQLVHVDGPLFERVLINLLENAHKYAGAR----ARIGIRAEADARQLSLEV 582 + + + V V P ++ L+EN K+ A+ +I ++ D ++LEV Sbjct: 241 QFENQINPAIMDVQV--PPM--LVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296 Query: 583 WDNGPGIPAGQEQTIFDKFARGNKESAIPGVGLGLA-ICQAIVDVHGG--TISASNRPEG 639 + G ++ G GL + + + ++G I S + +G Sbjct: 297 ENTGS----------------LALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEK-QG 339 Query: 640 GASFRVTLPG 649 + V +PG Sbjct: 340 KVNAMVLIPG 349
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 88.3 bits (219), Expect = 2e-22 Identities = 35/122 (28%), Positives = 57/122 (46%), Gaps = 1/122 (0%) Query: 4 VLIIEDEHAIRRFLRTALEADGMRVFEAETLQRGLIEAATRKPDLAILDLGLPDGDGIDF 63 +L+ +D+ AIR L AL G V A DL + D+ +PD + D Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 64 IRDLRQ-WSQMPIIVLSARSEEHDKIAALDAGADDYLSKPFGIGELQARLRVALRRHGAA 122 + +++ +P++V+SA++ I A + GA DYL KPF + EL + AL Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125 Query: 123 QA 124 + Sbjct: 126 PS 127
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 36.7 bits (85), Expect = 1e-04 Identities = 15/32 (46%), Positives = 22/32 (68%) Query: 345 LETLLQENGNVVRAADRLGLHRNTLHQRIQRI 376 L L GN ++AAD LGL+RNTL ++I+ + Sbjct: 442 LAALTATRGNQIKAADLLGLNRNTLRKKIREL 473
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 36.7 bits (85), Expect = 1e-04 Identities = 53/313 (16%), Positives = 106/313 (33%), Gaps = 26/313 (8%) Query: 99 LGLLLSAGMNLMMGMTTNALLLAIFWGINGWAQSMGVGPCAVSLARWYGVKERGTFYGIW 158 + L +A +M +L I + G + G A +A ER +G Sbjct: 78 VSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAY-IADITDGDERARHFGFM 136 Query: 159 STAHNIGEAVTYMVIAAVIAGFGWQMGYLSTAALGAAGVVLLVLFMHDSPQSSGFPSINV 218 S G V V+ ++ GF + + AAL + + +S + P Sbjct: 137 SACFGFG-MVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRP---- 191 Query: 219 IRDEPQEEVEARGSVFKNQLLALRNPALWTLALASAFMYIDRYAVNSWGIFFLEQDKAYS 278 EA + + +A+ + + W I F E + Sbjct: 192 ------LRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVI-FGEDRFHWD 244 Query: 279 TLEASGIIGVN-AIAGIAGTIIAGMLSDRF---FPRNRSVMAGFISLLNTAGFALMLWSP 334 IG++ A GI ++ M++ R++M G I+ + G+ L+ ++ Sbjct: 245 A----TTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIA--DGTGYILLAFAT 298 Query: 335 HNYYTDILAMIIFGATIGALTCFLGGLIAVDISSRKAAGAALGTIGIASYAGAGLGEFLT 394 + M++ A+ G L +++ + + G G++ + + +G L Sbjct: 299 R-GWMAFPIMVLL-ASGGIGMPALQAMLSRQVDEER-QGQLQGSLAALTSLTSIVGPLLF 355 Query: 395 GIIIDKTAILENG 407 I + NG Sbjct: 356 TAIYAASITTWNG 368
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 135 bits (341), Expect = 4e-41 Identities = 86/253 (33%), Positives = 130/253 (51%), Gaps = 15/253 (5%) Query: 5 LTGKKALVTGASRGLGRAIALSLARAGADVVITYEKSVDKAQAVADEIKALGRYGEAVQA 64 + GK A +TGA++G+G A+A +LA GA + + + +K + V +KA R+ EA A Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIA-AVDYNPEKLEKVVSSLKAEARHAEAFPA 64 Query: 65 DSASAQAIQDAVTHAARSLGGLDILVNNAGIARGGPLESMTLADIDALINVNIRGVVIAT 124 D + AI + R +G +DILVN AG+ R G + S++ + +A +VN GV A+ Sbjct: 65 DVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNAS 124 Query: 125 QEALVHMAD--GGRIINIGSCLANRVAMPGISVYAMTKSALNALTRGLARDLGPRGITVN 182 + +M D G I+ +GS A ++ YA +K+A T+ L +L I N Sbjct: 125 RSVSKYMMDRRSGSIVTVGSNPAGVPRT-SMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 183 LVHPGPTNSDMN-----PEDGEQ------AEAQRQMIAVGHYGQPEDIAAAVTFLASPAA 231 +V PG T +DM E+G + E + I + +P DIA AV FL S A Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243 Query: 232 GQISGTGLDVDGG 244 G I+ L VDGG Sbjct: 244 GHITMHNLCVDGG 256
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 31.3 bits (71), Expect = 0.013 Identities = 9/60 (15%), Positives = 25/60 (41%), Gaps = 1/60 (1%) Query: 172 VIILAVLAMIVVKALTHSPWG-TYTVAFTIPLAIFMGIYIRYLRPGRIGEVSVIGLVMLV 230 ++ ++ + + + A + W +V +PL I + L + ++GL+ + Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 340 bits (873), Expect = e-121 Identities = 109/265 (41%), Positives = 151/265 (56%), Gaps = 20/265 (7%) Query: 1 MAALDFRGQTVWVTGAGKGIGYATALAFVEAGANVTGFD---------------LAFDGE 45 M A G+ ++TGA +GIG A A GA++ D A E Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60 Query: 46 SYPFATETLDVADADQVREACSRLLANTERLDVLVNAAGILRMGATDQLSAEDWQQTFAV 105 ++P DV D+ + E +R+ +D+LVN AG+LR G LS E+W+ TF+V Sbjct: 61 AFP-----ADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSV 115 Query: 106 NVGGAFNLFQQTMAQFRRQRGGAIVTVASDAAHTPRIGMSAYGASKAALKSLALTVGLEL 165 N G FN + +R G+IVTV S+ A PR M+AY +SKAA +GLEL Sbjct: 116 NSTGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLEL 175 Query: 166 AGSGVRCNLVSPGSTDTDMQRTLWVSDDAEQQRIRGFGEQFKLGIPLGKIARPQEIANTI 225 A +RCN+VSPGST+TDMQ +LW ++ +Q I+G E FK GIPL K+A+P +IA+ + Sbjct: 176 AEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAV 235 Query: 226 LFLASSHASHITLQDIVVDGGSTLG 250 LFL S A HIT+ ++ VDGG+TLG Sbjct: 236 LFLVSGQAGHITMHNLCVDGGATLG 260
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 427 bits (1098), Expect = e-154 Identities = 151/303 (49%), Positives = 200/303 (66%), Gaps = 20/303 (6%) Query: 1 MAIPKLQAYALPEASDIPANKVNWAFEPSRAALLIHDMQEYFLNFWGENSAMMEKVVANI 60 MAIP +Q Y +P ASD+P NKV+W +P+RA LLIHDMQ YF++ + ++ + ++ ANI Sbjct: 1 MAIPAIQPYQMPTASDMPQNKVSWVPDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANI 60 Query: 61 AALRDFCKQNGIPVYYTAQPKEQSDEDRALLNDMWGPGLTRSPEQQQVIAALAPDEDDTV 120 L++ C Q GIPV YTAQP Q+ +DRALL D WGPGL P ++++I LAP++DD V Sbjct: 61 RKLKNQCVQLGIPVVYTAQPGSQNPDDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLV 120 Query: 121 LVKWRYSAFHRSPLEEMLKETGRDQLIITGVYAHIGCMTTATDAFMRDIKPFFVADALAD 180 L KWRYSAF R+ L EM+++ GRDQLIITG+YAHIGC+ TA +AFM DIK FFV DA+AD Sbjct: 121 LTKWRYSAFKRTNLLEMMRKEGRDQLIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVAD 180 Query: 181 FSREEHLMALKYVAGRSGRVVMTEELL--------PLPASKA-----------ALRALIL 221 FS E+H MAL+Y AGR VMT+ LL + + A +R I Sbjct: 181 FSLEKHQMALEYAAGRCAFTVMTDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIA 240 Query: 222 PLLDESDEPLD-DENLIDYGLDSVRMMALAARWRKVHGDIDFVMLAKNPTIDAWWALLTR 280 LL E+ E + E+L+D GLDSVR+M L +WR+ ++ FV LA+ PTI+ W LLT Sbjct: 241 ELLQETPEDITDQEDLLDRGLDSVRIMTLVEQWRREGAEVTFVELAERPTIEEWQKLLTT 300 Query: 281 EVH 283 Sbjct: 301 RSQ 303
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 52.6 bits (126), Expect = 6e-10 Identities = 59/288 (20%), Positives = 100/288 (34%), Gaps = 31/288 (10%) Query: 40 HTLPSQPLRIVSTSVTLTGSLLAIDAPVVASGATTPNNRVADSQGFLRQWSEVAKARKLA 99 H P RIV+ LLA+ VAD+ + SE + Sbjct: 29 HAAAIDPNRIVALEWLPVELLLALGIVPYG---------VADTINYRLWVSEPPLPDSV- 78 Query: 100 RLYIG---EPSAEAVAAQMPDLILVSATGGDSALPLYDQLKTIAPTLVINYDDKS----- 151 + +G EP+ E + P ++ SA G P + L IAP N+ D Sbjct: 79 -IDVGLRTEPNLELLTEMKPSFMVWSAGYG----PSPEMLARIAPGRGFNFSDGKQPLAM 133 Query: 152 WQTLLTQLGQITGHEQQASARIADFNKQLVSLKEKMKLPPQPVTALVYTAAAHSANIWTP 211 + LT++ + + A +A + + S+K + L ++ P Sbjct: 134 ARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGP 193 Query: 212 ESAQGQLLEQLGFSLATLPGGLPASHSQGKRHDIVQLGGENLAAGLNGQSLFLFAGDQKD 271 S ++L++ G A + + + LAA + L + KD Sbjct: 194 NSLFQEILDEYGIP--------NAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKD 245 Query: 272 ADAIYANPLLAHLPAVAGKRVYPLGTETFRLDYYSALLVLQRLSSLFG 319 DA+ A PL +P V R + F SA+ ++ L + G Sbjct: 246 MDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIG 293
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 36.8 bits (85), Expect = 2e-04 Identities = 39/187 (20%), Positives = 72/187 (38%), Gaps = 8/187 (4%) Query: 24 IARFISILSLGLLGVAIPVQIQMMTHSTWQVGLSVTLTGASMFVGLMVGGVLADRYERKR 83 I F S+L+ +L V++P T + +G V G L+D+ KR Sbjct: 21 ILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKR 80 Query: 84 LILLARGTCGVGFVGLCLNALLPEPSLAAIYLLGIWDGFFASLGVTALLAATPALVGREN 143 L+L G V + + A ++ G F +L ++ + +EN Sbjct: 81 LLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPAL----VMVVVARYIPKEN 136 Query: 144 LMQAGAITMLTVRLGSVISPMIGGLLLATGGVAWNFVLAAAGTFITTLTLLRLPQLPPPP 203 +A + V +G + P IGG++ + W+++L IT +T+ L +L Sbjct: 137 RGKAFGLIGSIVAMGEGVGPAIGGMIAHY--IHWSYLLLIP--MITIITVPFLMKLLKKE 192 Query: 204 QPREHPL 210 + Sbjct: 193 VRIKGHF 199
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 64.3 bits (156), Expect = 5e-15 Identities = 31/172 (18%), Positives = 58/172 (33%), Gaps = 15/172 (8%) Query: 10 RRRQLIDATLDAINEVGMHDATIAQIARRAGVSTGIISHYFKDKNGLLEATMRDITSQLR 69 R+ ++D L ++ G+ ++ +IA+ AGV+ G I +FKDK+ L S + Sbjct: 12 TRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIG 71 Query: 70 DAVLNRLHALPDGSASQRLQAIVGGNFDETQISSAAMKAWLAFWASSMHQP-------ML 122 + L P L+ I+ + T ++ + + Sbjct: 72 ELELEYQAKFPGDPL-SVLREILIHVLEST-VTEERRRLLMEIIFHKCEFVGEMAVVQQA 129 Query: 123 YRLQQVSSRRLLSNLVYEFRRE---LPREQAQEAGYGLAALIDGL---WLRA 168 R + S + + + A + I GL WL A Sbjct: 130 QRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLFA 181
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 83.9 bits (207), Expect = 2e-21 Identities = 52/187 (27%), Positives = 84/187 (44%), Gaps = 5/187 (2%) Query: 6 VVFITGATSGFGEAAAQVFADAGWSLVLSGRRYPRLKALQ--DRLAARVPVHIIELDVRD 63 + FITGA G GEA A+ A G + +L+ + + AR DVRD Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHA-EAFPADVRD 68 Query: 64 SEAVAAAVASLPADFADITTLINNAGLALSPLPAQEVALEDWKTMIDTNVTGLVTVTHAL 123 S A+ A + + I L+N AG+ L P ++ E+W+ N TG+ + ++ Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGV-LRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 124 LPTLIRHGAGASIINIGSIAGQWPYPGSHVYGASKAFVKQFSYNLRCDLLGTGVRVTDLA 183 ++ +G SI+ +GS P Y +SKA F+ L +L +R ++ Sbjct: 128 SKYMMDRRSG-SIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 184 PGIAETE 190 PG ET+ Sbjct: 187 PGSTETD 193
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 869 bits (2246), Expect = 0.0 Identities = 392/990 (39%), Positives = 575/990 (58%), Gaps = 17/990 (1%) Query: 3 VSASWPGASASDVAEAIAAPLETQLNGVDHMLYMESTSSDEGTYRLSITFAAGTDADLAA 62 VSA++PGA A V + + +E +NG+D+++YM STS G+ +++TF +GTD D+A Sbjct: 45 VSANYPGADAQTVQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQ 104 Query: 63 IDVQNRVAQALAQLPAEVQQNGVQVRKRASNLLMGVSLYSPLGTLSPLFVSNYASTQVRE 122 + VQN++ A LP EVQQ G+ V K +S+ LM S + +S+Y ++ V++ Sbjct: 105 VQVQNKLQLATPLLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKD 164 Query: 123 ALARLPGVGEVQMFGARDYSMRIWLRPDRMNALNITTDDVAQALREQNVQGAAGQVGTPP 182 L+RL GVG+VQ+FGA+ Y+MRIWL D +N +T DV L+ QN Q AAGQ+G P Sbjct: 165 TLSRLNGVGDVQLFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTP 223 Query: 183 VFNGQQQTLTINGLGRLNEAASFGEIIIRRGAQGQLVRLADVATIELGARSYSSGAQLNG 242 GQQ +I R FG++ +R + G +VRL DVA +ELG +Y+ A++NG Sbjct: 224 ALPGQQLNASIIAQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARING 283 Query: 243 KASAYLGIYPTPTANALQVASAVRAELNRLHTRFPADLTWEVKFDTTRFVTATIKEIGVS 302 K +A LGI ANAL A A++A+L L FP + +DTT FV +I E+ + Sbjct: 284 KPAAGLGIKLATGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKT 343 Query: 303 LALTLLAVVVVVSLFLQSWRATLIVVLAIPVSLIGTFAVLYLLGYSANTLSLFAIILALT 362 L ++ V +V+ LFLQ+ RATLI +A+PV L+GTFA+L GYS NTL++F ++LA+ Sbjct: 344 LFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIG 403 Query: 363 MVVDDAIVVVENVETKMAE-GLDRLQATAQALRQIAGPVIATTLVLLAVFVPVALLPGIV 421 ++VDDAIVVVENVE M E L +AT +++ QI G ++ +VL AVF+P+A G Sbjct: 404 LLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGST 463 Query: 422 GELYRQFAVTLSTAVALSSLVALTLTPALCALLLRPRPARP----AAVWRAFNRLLDGTR 477 G +YRQF++T+ +A+ALS LVAL LTPALCA LL+P A + FN D + Sbjct: 464 GAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSV 523 Query: 478 YGYGRLVGRMNRRPWLALAATVAAGALVAFSFTSMPKGFLPQEDQGYLFASVQLPEAASL 537 Y VG++ L A + F +P FLP+EDQG +QLP A+ Sbjct: 524 NHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQ 583 Query: 538 ERTEAVMTQARKLLMANPA--VEDVIQVSGFNILNGTSASNGGFISVMLKDWHQRPP--- 592 ERT+ V+ Q + N VE V V+GF+ A N G V LK W +R Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSF--SGQAQNAGMAFVSLKPWEERNGDEN 641 Query: 593 -LDAVMADIQRQLLSLPEATIMTFAPPTLPGLGNASGFDLRILAQAGQSSAELEQVTREI 651 +AV+ + +L + + ++ F P + LG A+GFD ++ QAG L Q ++ Sbjct: 642 SAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQL 701 Query: 652 LQLANQH-SQLSRVFTTWSSNVPQLTLTVDRDRAALLDVPVAQIFSSLQTAFGGTRAGDF 710 L +A QH + L V + Q L VD+++A L V ++ I ++ TA GGT DF Sbjct: 702 LGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDF 761 Query: 711 SRNNRVYHVVMQNEMQWRERAEQISELYVRSRDGERVRLSNLVTITPTVGAPFIQQYNQF 770 RV + +Q + ++R E + +LYVRS +GE V S T G+P +++YN Sbjct: 762 IDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGL 821 Query: 771 PSVSVSGSAAEGVSSRTAMAAMEQILQAHLPPGYDYAWSGISWQEQQTGNQAVWIVLAAV 830 PS+ + G AA G SS AMA ME + LP G Y W+G+S+QE+ +GNQA +V + Sbjct: 822 PSMEIQGEAAPGTSSGDAMALMENLASK-LPAGIGYDWTGMSYQERLSGNQAPALVAISF 880 Query: 831 AMAWLFLVAQYESWTLPASVMLSVLFAIGGALLWLWTAGYANDVYVQIGLVLLIALAAKN 890 + +L L A YESW++P SVML V I G LL NDVY +GL+ I L+AKN Sbjct: 881 VVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKN 940 Query: 891 AILIVEFARSRRE-EGLSIVDAAREGATRRFRAVMMTAVSFIIGIMPMMLATGAGAQSRR 949 AILIVEFA+ E EG +V+A R R ++MT+++FI+G++P+ ++ GAG+ ++ Sbjct: 941 AILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQN 1000 Query: 950 IIGTTVFSGMLVATMVGILFIPSLYVLFQR 979 +G V GM+ AT++ I F+P +V+ +R Sbjct: 1001 AVGIGVMGGMVSATLLAIFFVPVFFVVIRR 1030 Score = 76.8 bits (189), Expect = 3e-16 Identities = 87/522 (16%), Positives = 180/522 (34%), Gaps = 45/522 (8%) Query: 489 RRPWLALAATVAAGALVAFSFTSMPKGFLPQEDQGYLFASVQLPEAASLERTEAVMTQAR 548 RRP A + A + +P P + S P A + + +TQ Sbjct: 7 RRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYP-GADAQTVQDTVTQ-- 63 Query: 549 KLLMANPAVEDVIQVSGFNILNGTSASNGGFISVMLKDWHQRPPLDA---VMADIQRQLL 605 +++ + ++ TS S G +++ L P A V +Q Sbjct: 64 ------VIEQNMNGIDNLMYMSSTSDSAG-SVTITLTFQSGTDPDIAQVQVQNKLQLATP 116 Query: 606 SLPEATIMTFAPPTLPGLGNASGFDLRILAQAGQSSAELEQVTREILQLANQHSQLSRV- 664 LP+ + ++S + + + + Q +N LSR+ Sbjct: 117 LLPQE----VQQQGISVEKSSSSYLMVAGFVS--DNPGTTQDDISDYVASNVKDTLSRLN 170 Query: 665 ----FTTWSSNVPQLTLTVDRDRAALLDVPVAQIFSSLQTAFGGTRAGDF------SRNN 714 + + + + +D D + + + L+ AG Sbjct: 171 GVGDVQLFGAQY-AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQ 229 Query: 715 RVYHVVMQNEMQWRERAEQISELYVR-SRDGERVRLSNLVTITPTVGA-PFIQQYNQFPS 772 ++ Q + E+ ++ +R + DG VRL ++ + I + N P+ Sbjct: 230 LNASIIAQTRFK---NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPA 286 Query: 773 VSVSGSAAEGVSSRTA----MAAMEQILQAHLPPG--YDYAWSGISWQEQQTGNQAVWIV 826 + A G ++ A + + LQ P G Y + + + ++ V + Sbjct: 287 AGLGIKLATGANALDTAKAIKAKLAE-LQPFFPQGMKVLYPYDTTPFVQLSI-HEVVKTL 344 Query: 827 LAAVAMAWLFLVAQYESWTLPASVMLSVLFAIGGALLWLWTAGYANDVYVQIGLVLLIAL 886 A+ + +L + ++ ++V + G L GY+ + G+VL I L Sbjct: 345 FEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGL 404 Query: 887 AAKNAILIVE-FARSRREEGLSIVDAAREGATRRFRAVMMTAVSFIIGIMPMMLATGAGA 945 +AI++VE R E+ L +A + ++ A++ A+ +PM G+ Sbjct: 405 LVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 946 QSRRIIGTTVFSGMLVATMVGILFIPSLYVLFQRMREWAHRR 987 R T+ S M ++ +V ++ P+L + H Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHE 506
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 46.4 bits (110), Expect = 2e-10 Identities = 21/40 (52%), Positives = 28/40 (70%) Query: 1 MLTFFIRRPRFAMVIALLLTFVGAVSLKLIPVEQYPAITP 40 M FFIRRP FA V+A++L GA+++ +PV QYP I P Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAP 40
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 144 bits (365), Expect = 6e-41 Identities = 75/255 (29%), Positives = 122/255 (47%), Gaps = 21/255 (8%) Query: 89 QALCADRQDSLAQLIGAQGSLQEALRQCKAAISYPGAGLPLLLRGPTGTGKSFLARQLWH 148 L D QD L+G ++QE R + L L++ G +GTGK +AR Sbjct: 127 SKLEDDSQDG-MPLVGRSAAMQEIYRVLARLMQTD---LTLMITGESGTGKELVAR---- 178 Query: 149 YAIDEGILPADAPFTVFNCAEYANNPELLTSKLFGHAKGAFTGADKAVPGLIETSNGGVL 208 A+ + + PF N A A +L+ S+LFGH KGAFTGA G E + GG L Sbjct: 179 -ALHDYGKRRNGPFVAINMA--AIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTL 235 Query: 209 FIDEVHRLPPEGQEKLFHFMDNGSWRRLGESADERSATVRLIFASTEDLEK-----HFLA 263 F+DE+ +P + Q +L + G + +G + VR++ A+ +DL++ F Sbjct: 236 FLDEIGDMPMDAQTRLLRVLQQGEYTTVG-GRTPIRSDVRIVAATNKDLKQSINQGLFRE 294 Query: 264 TFIRRIPVI-VKILPIAERGQFERLAFIHHFFRREAQRLNHD-LELDGEIVSQLMRETLE 321 R+ V+ +++ P+ +R E + + F ++A++ D D E + + Sbjct: 295 DLYYRLNVVPLRLPPLRDRA--EDIPDLVRHFVQQAEKEGLDVKRFDQEALELMKAHPWP 352 Query: 322 GNVGGLENLIRNICA 336 GNV LENL+R + A Sbjct: 353 GNVRELENLVRRLTA 367
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 27.9 bits (62), Expect = 0.031 Identities = 10/36 (27%), Positives = 14/36 (38%) Query: 81 HSALMRILLPALLAVVCYGWGFRRQWRETQWHYSTE 116 HS R+L LL + Y W W + Y + Sbjct: 6 HSFFKRVLTGTLLLLSSYSWAQELDWLPIPYVYVAK 41
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 29.8 bits (67), Expect = 0.012 Identities = 18/87 (20%), Positives = 32/87 (36%), Gaps = 10/87 (11%) Query: 199 AMAEHRGDPAWENKLARFFAASSEFEALWHQRYEVRGVENQIKHFNHPQLGRFSLQQMYW 258 A+ + ++E + + + + + YEV I H N PQ+G L Sbjct: 13 ALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEV-----VITHGNGPQVGSLLLHMDAG 67 Query: 259 YSAPRNGSRLLVYLPMDEAGEQALAWL 285 + + PMD AG + W+ Sbjct: 68 QATYGIPA-----QPMDVAGAMSQGWI 89
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 103 bits (257), Expect = 7e-26 Identities = 76/409 (18%), Positives = 160/409 (39%), Gaps = 29/409 (7%) Query: 21 MLPLIDTSITNVALDAITHTLAASATQLELIVALYGVAFAVCLAMGSKLGDNYGRRRLFM 80 +++ + NV+L I + + + + F++ A+ KL D G +RL + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 81 WGVALFGIASLLCGMANSIGALL-AARTLQGAGAALIVPQILATLHVTLKGPAH-ARAIS 138 +G+ + S++ + +S +LL AR +QGAGAA P ++ + + +A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAF-PALVMVVVARYIPKENRGKAFG 142 Query: 139 LYGGIGGIAFIVGQMGGGWLVSADIAGLGWRNAFFINVPICLLVLALSRRYVPETRRETP 198 L G I + VG GG + + W ++ + +P+ ++ + + Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHY----IHW--SYLLLIPMITIITVPFLMKLLKKEVRIK 196 Query: 199 SRIDWQGTLYL-ALILCCLLFPMALGPELHWPLWLQLMLVAVLPLLFAMRQSALRQQQRG 257 D +G + + I+ +LF + L++ + L+F ++ ++ Sbjct: 197 GHFDIKGIILMSVGIVFFMLFTTSY-------SISFLIVSVLSFLIF------VKHIRKV 243 Query: 258 DHPLLPPRLLQLTSIRFGMAIALLFFSAWSGFMFCMALTMQEGLGMAPWQSGNSFIALG- 316 P + P L + G+ + F +GF+ + M++ ++ + G+ I G Sbjct: 244 TDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGT 303 Query: 317 VAYFISALYAPRLIARYSMGRILLTGLAVQIAGLLLLCATFSRFGVATNALTLVPATALI 376 ++ I L+ R +L G+ L F + T + + + Sbjct: 304 MSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTAS-----FLLETTSWFMTIIIVFV 358 Query: 377 GYGQALIVNSFYRIGMRDISASDAGAGSAILSTLQQATLGLGPAILGSL 425 G + I + +AGAG ++L+ + G G AI+G L Sbjct: 359 LGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 68.3 bits (167), Expect = 8e-14 Identities = 106/699 (15%), Positives = 206/699 (29%), Gaps = 99/699 (14%) Query: 125 KDASLSLDTRSFYLELTVNRAAMQAAILPRTNMLGESTAQN--LSSVLNYSMGSYYNKYE 182 DA+ LD L LT+ +A M + + +LNY+ + Sbjct: 143 HDATAQLDVGQQRLNLTIPQAFMS---NRARGYIPPELWDPGINAGLLNYNFSGNSVQNR 199 Query: 183 ---NTDNASSYLTL-DNT--WSLR-EHHLNFNGSLYGIGTGNQESKLYRSMYERDYQGRR 235 N+ A L N W LR ++N S G+ N+ + ERD R Sbjct: 200 IGGNSHYAYLNLQSGLNIGAWRLRDNTTWSYNSSDSSSGSKNKWQHINTW-LERDIIPLR 258 Query: 236 --LAMGMVDTWNLQSIASMSALNSSRIYGVSYGNKSSSQTQDNTLALVPVTVFLPA---- 289 L +G + G + D+ + F P Sbjct: 259 SRLTLG-------DGYTQGDIFDGINFRGAQLAS-------DDNMLPDSQRGFAPVIHGI 304 Query: 290 ---AGEVHVYRDGKLLSIQNFSMGSYELDTSRLPFGIYNVDIQVVV---NGRVVSSRTAN 343 +V + ++G + G + ++ + + D+QV + +G Sbjct: 305 ARGTAQVTIKQNGYDIYNSTVPPGPFTIND--IYAAGNSGDLQVTIKEADGSTQI----- 357 Query: 344 INKTFARKSSVT--GDLSWQTFGGSLEYNKMDYRHKYDIN----YGTKNTWIAGIAAATS 397 ++ + G + G +G W + Sbjct: 358 FTVPYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLA 417 Query: 398 QPWLS---GVNLKTTLYG---FDT--------NGVNETEANVIFNDAFSFNQQGLLATDG 443 + + G+ G D + +V F S N+ G Sbjct: 418 DRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLV 477 Query: 444 SWQ-STSTFNMSLPDGYG--NLWGSRQYSSIGNALPMQQNDYVTIGAN------ANLRKI 494 ++ STS + + D + + + + DY + N + + Sbjct: 478 GYRYSTSGY-FNFADTTYSRMNGYNIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQ 536 Query: 495 APFLGTLSVSRTNNKYTGSTYTNVDYDQSLLAN-RYATVSLRAGIQNYQYNNHENLRDKY 553 TL +S ++ Y G++ + + L +L + N + RD+ Sbjct: 537 LGRTSTLYLSGSHQTYWGTSNVDEQFQAGLNTAFEDINWTLSY---SLTKNAWQKGRDQM 593 Query: 554 VNIDVSIPFSTWLSTGVSSQNGNMLANATLRKSFDDSAITQVGAS--------VSKQIKQ 605 + ++V+IPFS WL + SQ + A+ ++ + G +S ++ Sbjct: 594 LALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQT 653 Query: 606 NKNDDSRYRSDDYAANGYVSYDTKYNAGTVSVSRSSQHSSNYSLSSQGSLAWTEKNVYVG 665 S ++Y Y + S S G + V +G Sbjct: 654 GYAGGGDGNSGST-GYATLNYRGGYGNANIGYSHSDDIKQ-LYYGVSGGVLAHANGVTLG 711 Query: 666 KGTQTAGLVVNTNFSGKGRMMAQINGQNYPLT---GKSNFISLPPYAEYKVELMNDKNSE 722 + ++V G A++ Q T G + Y E +V L + Sbjct: 712 QPLNDTVVLVKA----PGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRVAL-DTNTLA 766 Query: 723 DSVDIVNGRRNKVVLYPGNVSVINPEVKQLVTVFGRVKD 761 D+VD+ N VV G + + + + + + Sbjct: 767 DNVDLDNAVA-NVVPTRGAIVRAEFKARVGIKLLMTLTH 804
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 30.3 bits (68), Expect = 0.022 Identities = 10/23 (43%), Positives = 13/23 (56%) Query: 57 AEQKVQQLTQQQQQTQATTQQVA 79 + + QQ QQQQ QAT Q+ Sbjct: 338 PQAQQQQGQGQQQQAQATAQEAV 360
>PF08280#M protein trans-acting positive regulator Length = 530 Score = 30.6 bits (69), Expect = 0.009 Identities = 23/130 (17%), Positives = 45/130 (34%), Gaps = 21/130 (16%) Query: 114 GETPLDEPISLSPPLSRVSLAAYCHKLNTFADLLLR------------DYDLQLAYHHHL 161 P+ E ++ L+ + L YC +LN F L + + Y + L Sbjct: 57 SSLPITE-VAEKTGLTFLQLNHYCEELNAFFPDSLSMTIQKRMISCQFTHPSKETYLYQL 115 Query: 162 ----MMLVEHDDELERFLSHTHDNVGLAFDTGHAFVAGVEIPRVLHKYGHRIRHLHLKDV 217 +L L + + + L F++ R+ +R+ LK Sbjct: 116 YASSNVL----QLLAFLIKNGSHSRPLTDFARSHFLSNSSAYRMREALIPLLRNFELKLS 171 Query: 218 RPQVLGRLYR 227 + +++G YR Sbjct: 172 KNKIVGEEYR 181
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.4 bits (66), Expect = 0.030 Identities = 41/208 (19%), Positives = 76/208 (36%), Gaps = 14/208 (6%) Query: 44 SHAISLFSAYA-SLVYVTPILGGWLADRLLGNRTAVIAGALLMTLGHVVLGVESTSAWSL 102 +H L + YA P+LG +DR G R ++ + + ++ + W L Sbjct: 43 AHYGILLALYALMQFACAPVLGAL-SDRF-GRRPVLLVSLAGAAVDYAIMAT-APFLWVL 99 Query: 103 YVALAIIICGY-GLFKSNISCLLGELYAHDDPRRDGGFSLLYAAGNVGSIAAPIACGLAA 161 Y + I+ G G + + ++ D+ R F + A G +A P+ GL Sbjct: 100 Y--IGRIVAGITGATGAVAGAYIADITDGDE--RARHFGFMSACFGFGMVAGPVLGGLMG 155 Query: 162 QWYGWHIGFALAGIGMFIGLMIFLSGSRHFRHT-RGVDKPALRAVKFVLPTWGWLLVMLC 220 + H F A + + FL+G + +G +P R L ++ W M Sbjct: 156 G-FSPHAPFFAAA---ALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTV 211 Query: 221 LAPVFFTLLLQNNWSGYLLAIVCLFAAQ 248 +A + + A+ +F Sbjct: 212 VAALMAVFFIMQLVGQVPAALWVIFGED 239
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 29.9 bits (67), Expect = 0.008 Identities = 15/61 (24%), Positives = 23/61 (37%), Gaps = 6/61 (9%) Query: 74 RYIQIGTVMTEPDHRNKGLAGQLIHHILQDWQQEADAFFLFANPTTVD-----FYPKFGF 128 Y I + D+R KG+ L+ H +W +E L ++ FY K F Sbjct: 88 GYALIEDIAVAKDYRKKGVGTALL-HKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146 Query: 129 T 129 Sbjct: 147 I 147
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 526 bits (1357), Expect = e-179 Identities = 200/722 (27%), Positives = 320/722 (44%), Gaps = 48/722 (6%) Query: 1 MPQALLQPHYRGAVNLKKVDSGVPAAVLRYQANSYQSIVDGDSSSHH-YLGLDASLRAFG 59 +PQA + RG + + D G+ A +L Y + +SH+ YL L + L Sbjct: 160 IPQAFMSNRARGYIPPELWDPGINAGLLNYNFSGNSVQNRIGGNSHYAYLNLQSGLNIGA 219 Query: 60 WRLHHQSSYQAQEGN------THWDSIATWAERSVVNWASTLRLGQGWTDGTFFDSVSFI 113 WRL +++ + W I TW ER ++ S L LG G+T G FD ++F Sbjct: 220 WRLRDNTTWSYNSSDSSSGSKNKWQHINTWLERDIIPLRSRLTLGDGYTQGDIFDGINFR 279 Query: 114 GGRLATDVRMLPGSRRGFAPSVSGVARTNARVTVTQNGALLYEATVPPGKFTFDDLYPTN 173 G +LA+D MLP S+RGFAP + G+AR A+VT+ QNG +Y +TVPPG FT +D+Y Sbjct: 280 GAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDIYNSTVPPGPFTINDIYAAG 339 Query: 174 AGGDLQVTIHEADGSQDTFTVPYATLPGLVRAGAVYYDLSLGYLDEDGIAGR-PGFGEAT 232 GDLQVTI EADGS FTVPY+++P L R G Y ++ G P F ++T Sbjct: 340 NSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITAGEYRSGNAQQEKPRFFQST 399 Query: 233 LQYGFNDYISGYTGANATTDYYSTLVGSAFNTY-WGALAVDLSRSAAKGREHGWQEGYRW 291 L +G + Y G Y + G N GAL+VD++++ + + +G Sbjct: 400 LLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVDMTQANSTLPDDSQHDGQSV 459 Query: 292 RVSASKSFT-SDTRMLLSMSHSNDGNYRSIRDAAWEHDRHPND----------------- 333 R +KS S T + L + Y + D + N Sbjct: 460 RFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGYNIETQDGVIQVKPKFTDYY 519 Query: 334 ---WREMTRYSATLSQQAGS-GSLSFNGIWSE--DVRHHRWRSYQLGYANRYGQLNYYLY 387 + + + T++QQ G +L +G + +Q G + +N+ L Sbjct: 520 NLAYNKRGKLQLTVTQQLGRTSTLYLSGSHQTYWGTSNVD-EQFQAGLNTAFEDINWTLS 578 Query: 388 AQQSQDIHHRNNQV-VGVSFSLPFG-----------QAGSLTTRFNHDKNYGSQLQSSYT 435 +++ + + ++ ++PF + S + +HD N + Sbjct: 579 YSLTKNAWQKGRDQMLALNVNIPFSHWLRSDSKSQWRHASASYSMSHDLNGRMTNLAGVY 638 Query: 436 GSAGEKNAFSYGLTASYDMPRENPNEASVAANGSLRTDYAYLNASASAGRHQQQYSLGAS 495 G+ E N SY + Y + + ++ A + R Y N S +Q G S Sbjct: 639 GTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLNYRGGYGNANIGYSHSDDIKQLYYGVS 698 Query: 496 GALVAHQGGMTTTPELGETFAIVEAPGAVGARVANRLGQPINRQGFTIIPYLDPFTANWL 555 G ++AH G+T L +T +V+APGA A+V N+ G + +G+ ++PY + N + Sbjct: 699 GGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVENQTGVRTDWRGYAVLPYATEYRENRV 758 Query: 556 DLDPQGLNDHVEIVSSSTTVVPDSGAAVKVKFVTRTGYPWFAHVTLPDGAAPPLGAEVFD 615 LD L D+V++ ++ VVP GA V+ +F R G +T + P GA V Sbjct: 759 ALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKARVGIKLLMTLT-HNNKPLPFGAMVTS 817 Query: 616 DNGRAVGAVGQGGLLYARVPQDHGSVSVVWGERSGQRCRLAYAIRDDAVQQVSSGTPHQV 675 ++ ++ G V G +Y G V V WGE C Y + ++ QQ+ + Sbjct: 818 ESSQSSGIVADNGQVYLSGMPLAGKVQVKWGEEENAHCVANYQLPPESQQQLLTQLS-AE 876 Query: 676 CR 677 CR Sbjct: 877 CR 878
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 35.2 bits (81), Expect = 1e-05 Identities = 12/49 (24%), Positives = 20/49 (40%), Gaps = 2/49 (4%) Query: 20 WAAEAFSFNRAHLHGAAQ--VDLQKYQYGNPLHAGQYRSTLSVNGRDLG 66 ++ FN L Q DL +++ G L G YR + +N + Sbjct: 42 LSSAELYFNPRFLADDPQAVADLSRFENGQELPPGTYRVDIYLNNGYMA 90
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 29.4 bits (66), Expect = 0.025 Identities = 20/95 (21%), Positives = 34/95 (35%), Gaps = 14/95 (14%) Query: 60 GDGDKGSYKRNG---FDGGTRFRFAADYYLFDDISWISYYELGVNIPALFDWDNHYAEGA 116 +G + + G D G++ F L + + I E +I A Sbjct: 39 HNGAQAASVETGTGIVDLGSKIGFKGQEDLGNGLKAIWQVEQKASI----------AGTD 88 Query: 117 NNTTRRMLYTGLKSDTWGTLTYGQQNSIYYDVVGV 151 + R + GLK +G L G+ NS+ D + Sbjct: 89 SGWGNRQSFIGLKGG-FGKLRVGRLNSVLKDTGDI 122
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.024 Identities = 13/28 (46%), Positives = 16/28 (57%) Query: 33 VKPRQTIALIGESGSGKSTLLAILAGLD 60 K ++ L G G GKSTL+ L GLD Sbjct: 593 CKFDYSVVLEGTGGIGKSTLINTLVGLD 620
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 78.9 bits (194), Expect = 1e-19 Identities = 47/212 (22%), Positives = 79/212 (37%), Gaps = 7/212 (3%) Query: 3 KTVLVTGCSSGIGLESALDLTRQGFRVLAA-CRKAEDVARMQELGLTG-----ILLDLDD 56 K +TG + GIG A L QG + A + + L D+ D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 57 PQSVERAAAEVIALTDNRLYGLFNNAGYGVYGPLNTISRQQMEQQFSANFFGAHQLTMLL 116 +++ A + + L N AG G ++++S ++ E FS N G + + Sbjct: 69 SAAIDEITARIEREMG-PIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 117 LPAMTPHGEGRIVMTSSVMGLIASPGRGAYAASKYALEAWSDALRMELRHSGIQVSLIEP 176 M G IV S + AYA+SK A ++ L +EL I+ +++ P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 177 GPIRTRFTDNVNQTQSDKPVENPGIAARFTLG 208 G T ++ ++ G F G Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTG 219
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 30.4 bits (69), Expect = 0.005 Identities = 8/39 (20%), Positives = 22/39 (56%), Gaps = 2/39 (5%) Query: 488 ESLDKLADEVDESTKEAEKALEPFVERVKNLL--GDRVK 524 + + K+A+ + + K++ A++ V + L G++V+ Sbjct: 6 DLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQ 44
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 30.7 bits (69), Expect = 0.007 Identities = 10/69 (14%), Positives = 18/69 (26%) Query: 290 PLPEPEVQPAQAAAPAPRQPVAPAAPPPQSPQSLPPTTSQVLAARSHLQRSQGATPPKKS 349 P PEPE P P P+ + + + + Sbjct: 76 PEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPVESRPASPFENTAPA 135 Query: 350 EPAAASARG 358 P +++A Sbjct: 136 RPTSSTATA 144
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 47.0 bits (111), Expect = 4e-07 Identities = 42/282 (14%), Positives = 95/282 (33%), Gaps = 4/282 (1%) Query: 31 RAADLPDRAEVQSQLNTLNKQKELTPQDKLVQQDLTQTLETLDKIERIKSETAQLRQQVE 90 + +DL + N +EL+ + ++++ E KI+ +++ A L + +E Sbjct: 72 KNSDLSFNNKALKDHND-ELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALE 130 Query: 91 QAPAKLRQAVESLNNLSDVPNDDATRKTLSTLSLRQLESRVTQTLDDLQNAQNDLATYNS 150 A + L A RK +L + T ++ + + A + Sbjct: 131 GAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEA 190 Query: 151 QLVSLQTQPERVQNAMFNASQQLQQIRNRLNGTSVGD---ETLRPTQQVLLQAQQALLNA 207 + L+ E N S +++ + + E A A + Sbjct: 191 RQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKT 250 Query: 208 QIEQQRKSLEGNTILQDTLQKQRDYVTAWSNRLEHQLQLLQEAVNSKRLTLTEKTAQEAV 267 ++ L+ L+ ++ TA S +++ K + A Sbjct: 251 LEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNAN 310 Query: 268 TPDETARIQANPLVKQELDINHQLSEKLIQATENGNQLVQRN 309 + A+ K++L+ HQ E+ + +E Q ++R+ Sbjct: 311 RQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRD 352
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 185 bits (470), Expect = 2e-61 Identities = 170/213 (79%), Positives = 194/213 (91%) Query: 1 MARKTKQQARETRQLILDVALRLFSQQGVSSTSLATIAKAAGVTRGAIYWHFKNKSDLFN 60 MARKTKQ+A+ETRQ ILDVALRLFSQQGVSSTSL IAKAAGVTRGAIYWHFK+KSDLF+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EIWELSDASISDLEIEYRAKFPNDPLSVIREILVYVLEATVTEERRRLMMEIIYHKCEFV 120 EIWELS+++I +LE+EY+AKFP DPLSV+REIL++VLE+TVTEERRRL+MEII+HKCEFV Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 121 GEMTVVQQAQRQLSLASYERIEQTLKECIAAKLLPANLLTRRAAVLMRSYLSGLMENWLF 180 GEM VVQQAQR L L SY+RIEQTLK CI AK+LPA+L+TRRAA++MR Y+SGLMENWLF Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 181 APDSFDLHAEARDYVAILLEMYQFCPTLRGPES 213 AP SFDL EARDYVAILLEMY CPTLR P + Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLRNPAT 213
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 43.3 bits (102), Expect = 1e-06 Identities = 30/210 (14%), Positives = 75/210 (35%), Gaps = 19/210 (9%) Query: 100 TYQASYDSAKGDLAKAQAAANMDQLTVKRYQKLLGTKYISQQDYDTAVATA-QQSNAAVV 158 + Y A +L ++ + + ++ Q + + +Q+ + Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLV---TQLFKNEILDKLRQTTDNIG 312 Query: 159 AAKAAVETARINLAYTKVTSPISGRIGKSAV-TEGALVQNGQTTALATVQQLDPIYVDVT 217 + + + +P+S ++ + V TEG +V +T + V + D + V Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETL-MVIVPEDDTLEVTAL 371 Query: 218 QSSNDFLRLKQEL-ADGRLKQENGK------AKVELVTNDGLKYPQSGTLEFSDVTVDQT 270 + D + A +++ KV+ + D ++ + G + +++++ Sbjct: 372 VQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEEN 431 Query: 271 TGSITLRAIFPNPDHTLLPGMFVRARLEEG 300 S + I L GM V A ++ G Sbjct: 432 CLSTGNKNIP------LSSGMAVTAEIKTG 455 Score = 29.4 bits (66), Expect = 0.027 Identities = 22/88 (25%), Positives = 38/88 (43%), Gaps = 10/88 (11%) Query: 48 APLQITTELPGR-TSAYRIAEVRPQVSGIILKRNFV-EGSDIQAGVSLYQIDPATYQASY 105 ++I G+ T + R E++P + I+ K V EG ++ G L ++ +A Sbjct: 78 GQVEIVATANGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEA-- 134 Query: 106 DSAKGDLAKAQAAANMDQLTVKRYQKLL 133 D K Q++ +L RYQ L Sbjct: 135 -----DTLKTQSSLLQARLEQTRYQILS 157
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.7 bits (79), Expect = 6e-04 Identities = 13/32 (40%), Positives = 17/32 (53%) Query: 33 VFVGPSGCGKSTLLRMIAGLEEVSEGEVLIGD 64 V G G GKSTL+ + GL+ S+ IG Sbjct: 600 VLEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 30.4 bits (68), Expect = 0.007 Identities = 19/89 (21%), Positives = 34/89 (38%), Gaps = 12/89 (13%) Query: 56 LVHSFHSYFLRPGDSQKPIVYDVEVLRDGNSFSARRVAAIQNGKPIFYMTASFQAPENGY 115 L+H +YF+ + V ++ + + +Q G P+ Y TA P + Sbjct: 34 LIHDMQNYFVDAFTAGASPVTELS-----ANIRKLKNQCVQLGIPVVY-TAQ---PGSQN 84 Query: 116 EHQKAMPA---APSPDGLPSETDIARKLA 141 +A+ P + P E I +LA Sbjct: 85 PDDRALLTDFWGPGLNSGPYEEKIITELA 113
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 31.0 bits (70), Expect = 0.006 Identities = 13/65 (20%), Positives = 27/65 (41%), Gaps = 1/65 (1%) Query: 120 PVGQLISRVTNDTEVIRDLYVTVVATVLRSAALIGAMLVAMFSLDWRMALVAIAIFPAVL 179 P G + + T ++ VV T+ + L+ +++ +F + R L+ P VL Sbjct: 318 PQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLV-FLVMYLFLQNMRATLIPTIAVPVVL 376 Query: 180 IVMII 184 + Sbjct: 377 LGTFA 381
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.022 Identities = 11/64 (17%), Positives = 23/64 (35%), Gaps = 10/64 (15%) Query: 193 LAVLSQHLGFTLQECMAFGDAMNDREMLGSVGRGFIMGN----------AMPQLKAELPH 242 VL+Q L + +A + + ++ + +P++K P Sbjct: 16 RTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPD 75 Query: 243 LPVI 246 LPV+ Sbjct: 76 LPVL 79
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 116 bits (293), Expect = 4e-38 Identities = 49/88 (55%), Positives = 65/88 (73%) Query: 2 NKSQLIDKIAAGADISKAAAGRALDALIASVTESLQAGDDVALVGFGTFAVKERAARTGR 61 NK LI K+A +++K + A+DA+ ++V+ L G+ V L+GFG F V+ERAAR GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 62 NPQTGKEITIAAAKVPGFRAGKALKDAV 89 NPQTG+EI I A+KVP F+AGKALKDAV Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAV 90
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 34.3 bits (78), Expect = 0.002 Identities = 34/133 (25%), Positives = 68/133 (51%), Gaps = 15/133 (11%) Query: 191 ERLEYLMAMMESEIDLLQVEKRIRNRVKKQMEKSQREYYLNEQMKAIQKELGEMDDAPD- 249 LE A +E + +L R +++ ++ S+ +Q++A ++L E + + Sbjct: 291 AALEAEKADLEHQSQVLNAN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEA 344 Query: 250 ENEALKRKIDAAKMPKEAKEKTEAELQKLKMMSPMS-AEATVVRGYIDWMVQVPWNARSK 308 ++L+R +DA++ EAK++ EAE QKL+ + +S A +R +D + A+ + Sbjct: 345 SRQSLRRDLDASR---EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASRE----AKKQ 397 Query: 309 VKKDLRQAQEILD 321 V+K L +A L Sbjct: 398 VEKALEEANSKLA 410
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 38.3 bits (89), Expect = 5e-05 Identities = 42/196 (21%), Positives = 75/196 (38%), Gaps = 15/196 (7%) Query: 221 RNNAWLI-LLLIVLYKLGDAFAMSLTTTFLIRGVGFDAGEVGMVNKTLGLFATILGALYG 279 R+N LI L ++ + + + ++++ + VN L +I A+YG Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 280 GVLMQRLTLFRALLIFGLLQGVSNAGYWLLSITDKHLYSMATAVFFENLCGGMGTAAFVA 339 L +L + R LL ++ S+ +S + + G G AAF A Sbjct: 71 K-LSDQLGIKRLLLFGIIINC-------FGSVIGFVGHSFFSLLIMARFIQGAGAAAFPA 122 Query: 340 LLM----TLCNKSFSATQFALLSALSAVGRVYVGP-IAGWFVEAHGWSTFYLFSVVAAVP 394 L+M K F L+ ++ A+G VGP I G WS L ++ + Sbjct: 123 LVMVVVARYIPKENRGKAFGLIGSIVAMG-EGVGPAIGGMIAHYIHWSYLLLIPMITIIT 181 Query: 395 GIALLLLCRQTLEHTQ 410 L+ L ++ + Sbjct: 182 VPFLMKLLKKEVRIKG 197
>CHANNELTSX#Nucleoside-specific channel-forming protein Tsx signature. Length = 294 Score = 537 bits (1384), Expect = 0.0 Identities = 294/294 (100%), Positives = 294/294 (100%) Query: 1 MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE 60 MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE Sbjct: 1 MKKTLLAAGAVVALSTTFAAGAAENDKPQYLSDWWHQSVNVVGSYHTRFGPQIRNDTYLE 60 Query: 61 YEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFGP 120 YEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFGP Sbjct: 61 YEAFAKKDWFDFYGYIDAPVFFGGNSTAKGIWNKGSPLFMEIEPRFSIDKLTNTDLSFGP 120 Query: 121 FKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNEN 180 FKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNEN Sbjct: 121 FKEWYFANNYIYDMGRNDSQEQSTWYMGLGTDIDTGLPMSLSLNVYAKYQWQNYGASNEN 180 Query: 181 EWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSIASSH 240 EWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSIASSH Sbjct: 181 EWDGYRFKVKYFVPLTDLWGGSLSYIGFTNFDWGSDLGDDNFYDLNGKHARTSNSIASSH 240 Query: 241 ILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF 294 ILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF Sbjct: 241 ILALNYAHWHYSIVARYFHNGGQWADDAKLNFGDGPFSVRSTGWGGYFVVGYNF 294
>ARGREPRESSOR#Bacterial arginine repressor signature. Length = 149 Score = 32.1 bits (73), Expect = 9e-04 Identities = 18/66 (27%), Positives = 28/66 (42%), Gaps = 7/66 (10%) Query: 3 RRADRLFQIVQILRGRRLTT-----AALLAERLGVSERTVYRDIRDLSLSGVPVEGEAGS 57 + R +I +I+ + T L + V++ TV RDI++L L V V GS Sbjct: 2 NKGQRHIKIREIITANEIETQDELVDILKKDGYNVTQATVSRDIKELHL--VKVPTNNGS 59 Query: 58 GYRLLA 63 L Sbjct: 60 YKYSLP 65
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 342 bits (880), Expect = e-120 Identities = 101/308 (32%), Positives = 172/308 (55%), Gaps = 12/308 (3%) Query: 18 DFMRWDYWAFGISGFLLIVSIAIIGVRGFNWGLDFTGGTVIEITLEKPVDLDQMRDSLQK 77 DF RW + FG + ++I S+ + V G N+G+DF GGT I +D+ R +L+ Sbjct: 15 DFFRWQWATFGAAIVMMIASVILPLVIGLNFGIDFKGGTTIRTESTTAIDVGVYRAALEP 74 Query: 78 AGFEEPQVQNFGSSR------DIMVRMPPVHDANGSQELGSKVVTVINE------STSQN 125 + + M+R+ D G++ G++ ++N+ + Sbjct: 75 LELGDVIISEVRDPSFREDQHVAMIRIQMQEDGQGAEGQGAQGQELVNKVETALTAVDPA 134 Query: 126 AAVKRIEFVGPSVGADLAQTGALALIAALVCILIYVGFRFEWRLAAGVVIALAHDVVITM 185 + E VGP V +L T +L+AA V I+ Y+ RFEW+ A G V+AL HDV++T+ Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194 Query: 186 GVLSLFHIEIDLTIVASLMSVIGYSLNDSIVVSDRIRENFRKIRRGTPYEIFNVSLTQTL 245 G+ ++ ++ DLT VA+L+++ GYS+ND++VV DR+REN K + ++ N+S+ +TL Sbjct: 195 GLFAVLQLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNETL 254 Query: 246 HRTLITSGTTLMVILMLFLFGGPILEGFSLTMLIGVSIGTASSIYVASALALKLGMKREH 305 RT++T TTL+ ++ + ++GG ++ GF M+ GV GT SS+YVA + L +G+ R Sbjct: 255 SRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIVLFIGLDRNK 314 Query: 306 LIQQKVEK 313 + +K Sbjct: 315 EKKDPSDK 322
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 69.5 bits (170), Expect = 5e-15 Identities = 37/183 (20%), Positives = 88/183 (48%), Gaps = 5/183 (2%) Query: 422 IQIVEERTIGPTLGMQNIKQGLEACLAGLVVSILFMIL-FYKKFGLIATSALIANLILIV 480 ++I ++GP + + + + + LA VV + ++ + F +F L A AL+ +++L V Sbjct: 135 LKITSFESVGPKVSGELVWTAVWSLLAATVVIMFYIWVRFEWQFALGAVVALVHDVLLTV 194 Query: 481 GIMSLIPGATLTMPGIAGIVLTLAVAVDANVLINERIKEEL--SNGRTVQQAIDEGYRGA 538 G+ +++ + +A ++ +++ V++ +R++E L ++ ++ Sbjct: 195 GLFAVL-QLKFDLTTVAALLTITGYSINDTVVVFDRLRENLIKYKTMPLRDVMNLSVNET 253 Query: 539 FSSIFDANVTTLIKVIILYAVGTGAIKGFAITTGIGIATSMFTAIVGTRAIVNLLYGGKR 598 S +TTL+ ++ + G I+GF G+ T ++++ + IV L G R Sbjct: 254 LSRTVMTGMTTLLALVPMLIWGGDVIRGFVFAMVWGVFTGTYSSVYVAKNIV-LFIGLDR 312 Query: 599 VKK 601 K+ Sbjct: 313 NKE 315
>SALSPVBPROT#Salmonella virulence plasmid 65kDa B protein signature. Length = 591 Score = 29.7 bits (66), Expect = 0.014 Identities = 15/38 (39%), Positives = 21/38 (55%), Gaps = 4/38 (10%) Query: 39 WAVAALQLISPLFLPPPGQVLQKLITIAGPQGFMDATL 76 ++ A L LI+P FLP G+ L + +GP G TL Sbjct: 7 FSSATLALITPPFLPKGGKALSQ----SGPDGLASITL 40
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 29.1 bits (65), Expect = 0.018 Identities = 20/57 (35%), Positives = 30/57 (52%), Gaps = 2/57 (3%) Query: 159 LWLLYRTRY--GMAIRAVAFDVNTVRLMGIDANRIISLVFALGSSLAALGGVFYSIS 213 L L+ R R MA+ + + +NT ID NRI++L + L ALG V Y ++ Sbjct: 4 LPLISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVA 60
>ECOLIPORIN#E.coli/Salmonella-type porin signature. Length = 383 Score = 536 bits (1381), Expect = 0.0 Identities = 225/384 (58%), Positives = 263/384 (68%), Gaps = 35/384 (9%) Query: 1 MKKSTLALMMMGFVASTATQAAEVYNKNANKLDVYGKIKAMHYFSDYDSKDGDQTYVRFG 60 MK+ LAL++ +A+ A AAE+YNK+ NKLD+YGK+ +HYFSD SKDGDQTY+R G Sbjct: 1 MKRKVLALVIPALLAAGAAHAAEIYNKDGNKLDLYGKVDGLHYFSDDSSKDGDQTYMRVG 60 Query: 61 IKGETQINDDLTGYGRWESEFSGNKTESDSSQ-KTRLAFAGVKLKNYGSFDYGRNLGALY 119 KGETQIND LTGYG+WE N TE + + TRLAFAG+K +YGSFDYGRN G LY Sbjct: 61 FKGETQINDQLTGYGQWEYNVQANTTEGEGANSWTRLAFAGLKFGDYGSFDYGRNYGVLY 120 Query: 120 DVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGLVDGLDLTLQYQGKNE--- 176 DVE WTDM PEFGGDS DN+MT RA+G+ATYRNTDFFGLVDGL+ LQYQGKNE Sbjct: 121 DVEGWTDMLPEFGGDSYTYADNYMTGRANGVATYRNTDFFGLVDGLNFALQYQGKNESQS 180 Query: 177 -------------GREAKKQNGDGVGTSLSYDFGGSDFAVSAAYTSSDRTNDQNLLAR-- 221 G + + NGDG G S +YD G F+ AAYT+SDRTN+Q Sbjct: 181 ADDVNIGTNNRNNGDDIRYDNGDGFGISTTYDI-GMGFSAGAAYTTSDRTNEQVNAGGTI 239 Query: 222 GQGSKAEAWATGLKYDANNIYLATMYSETRKMTP-------ISGGFANKAQNFEAVAQYQ 274 G KA+AW GLKYDANNIYLATMYSETR MTP GG ANK QNFE AQYQ Sbjct: 240 AGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGKTDKGYDGGVANKTQNFEVTAQYQ 299 Query: 275 FDFGLRPSLGYVLSKGKDIE----GVGSEDLVNYIDVGLTYYFNKNMNAFVDYKINQLKS 330 FDFGLRP++ +++SKGKD+ +DLV Y DVG TYYFNKN + +VDYKIN L Sbjct: 300 FDFGLRPAVSFLMSKGKDLTYNNVNGDDKDLVKYADVGATYYFNKNFSTYVDYKINLLDD 359 Query: 331 DNKL----GINDDDIVALGMTYQF 350 D+ GI+ DDIVALGM YQF Sbjct: 360 DDPFYKDAGISTDDIVALGMVYQF 383
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 37.9 bits (88), Expect = 4e-05 Identities = 31/140 (22%), Positives = 53/140 (37%), Gaps = 20/140 (14%) Query: 119 DTLRALLDNSI---------VPVINENDAVATAEIKVGDNDNLSALAAILAGADKLLLLT 169 +T++ L++ + VPVI E+ + E V D D A AD ++LT Sbjct: 177 ETIKKLVERGVIVIASGGGGVPVILEDGEIKGVE-AVIDKDLAGEKLAEEVNADIFMILT 235 Query: 170 DQPGLFTADPRSNPQAELIKDVYGIDDALRAIAGDSVSGLGTGGMGTKLQAA-DVACRAG 228 D G + + +++V +++ + G MG K+ AA G Sbjct: 236 DVNGAALY--YGTEKEQWLREV-KVEELRKYYEEG---HFKAGSMGPKVLAAIRFIEWGG 289 Query: 229 IDTIIAAGNRPDVIGHAMAG 248 IIA + A+ G Sbjct: 290 ERAIIAH---LEKAVEALEG 306 Score = 29.0 bits (65), Expect = 0.032 Identities = 16/76 (21%), Positives = 33/76 (43%), Gaps = 13/76 (17%) Query: 4 SQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQ----LHAMGHRIVIVTSG-------- 51 + +V+ LG + L ++ + +++ VR+ A+ + A G+ +VI Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61 Query: 52 -AIAAGREHLGYPELP 66 + AG+ G P P Sbjct: 62 LHMDAGQATYGIPAQP 77
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 30.0 bits (67), Expect = 0.023 Identities = 21/79 (26%), Positives = 33/79 (41%), Gaps = 4/79 (5%) Query: 472 TKDLLDQIKTEKEALAETKPKRSIKRGKKVTSGRKPLLTVLNEQSEPIKPDELMQLAGFS 531 T LLDQ++ + +T K + RK + +L E E I E + G Sbjct: 203 TDSLLDQLQNAPADVQKTSANTGKKNVFTCENIRKQIAELLQETPEDITDQEDLLDRGLD 262 Query: 532 SNE----VEEFYIELAEIS 546 S VE++ E AE++ Sbjct: 263 SVRIMTLVEQWRREGAEVT 281
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.8 bits (67), Expect = 0.011 Identities = 13/56 (23%), Positives = 29/56 (51%), Gaps = 4/56 (7%) Query: 44 SKIVNVLEAPFAGTLRRMLAREGETLQVGAVLALAADASVSDAELDEFVARLATAK 99 SK + +E ++ ++ +EGE+++ G VL A ++A+ + + L A+ Sbjct: 96 SKEIKPIEN---SIVKEIIVKEGESVRKGDVLLK-LTALGAEADTLKTQSSLLQAR 147
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 102 bits (256), Expect = 2e-28 Identities = 74/253 (29%), Positives = 125/253 (49%), Gaps = 16/253 (6%) Query: 8 RTAIVTGGATGLGREFVLSLAKEGVNIC-FTYMREEEHPERLIETVKASANVEIIAVKTD 66 + A +TG A G+G +LA +G +I Y E+ E+++ ++KA A A D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKL--EKVVSSLKAEARHAE-AFPAD 65 Query: 67 LSDEQSRENLFATCIDRLGKADILVNNAGIWLSGYVTEISPQDWDLVMNVNLKAIFHLSQ 126 + D + + + A +G DILVN AG+ G + +S ++W+ +VN +F+ S+ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 127 LFVNHCLQHDQMGSILNITSQAAFHGSTTGHAHYAASKAGLVAFAISLAREVAKQKINVN 186 + + + GSI+ + S A T A YA+SKA V F L E+A+ I N Sbjct: 126 SVSKY-MMDRRSGSIVTVGSNPA-GVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCN 183 Query: 187 NIAVGIMDTAMIRKN-IEQNPDYYVSR---------IPVGRVAQPQEIADIGVFMVSPKT 236 ++ G +T M ++N V + IP+ ++A+P +IAD +F+VS + Sbjct: 184 IVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQA 243 Query: 237 SYMTGATLDVTGG 249 ++T L V GG Sbjct: 244 GHITMHNLCVDGG 256
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 54.4 bits (131), Expect = 1e-10 Identities = 55/258 (21%), Positives = 94/258 (36%), Gaps = 9/258 (3%) Query: 3 LALFALTIGAFAIGTTEFVIVGLVPTIAQQLSISLPSA---GLLVSIYALGVAIGAPVLT 59 + L + + A IG +I+ ++P + + L S G+L+++YAL APVL Sbjct: 9 VILSTVALDAVGIG----LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64 Query: 60 ALTGRMPRKQLLLALMVLFTAGNVLAWQAPGYETLILARLLTGLAHGVFFSIGSTIATSL 119 AL+ R R+ +LL + + AP L + R++ G+ G+ IA Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124 Query: 120 VAKEKAASAIAIMFGGLTVALVTGVPFGTFIGQHFGWRETFLAVSILGVIALISSLILVP 179 E+A M +V G G +G F F A + L + ++ L+P Sbjct: 125 DGDERAR-HFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLP 182 Query: 180 NNIPGRASASLRDQIKVLTHPRLLMIYAITALGYGGVFTAFTFLAPMMQELAGFSPSAVS 239 + G R+ + L R + A F F Sbjct: 183 ESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFH 242 Query: 240 WILLGYGVSVAIGNVWGA 257 W G+S+A + + Sbjct: 243 WDATTIGISLAAFGILHS 260
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 29.8 bits (67), Expect = 0.009 Identities = 18/66 (27%), Positives = 24/66 (36%), Gaps = 14/66 (21%) Query: 120 AEAI-SLLRNNRVVILSAGTGNPFFTT-------------DSAACLRGIEIEADVVLKAT 165 AE I L+ +VI S G G P D A E+ AD+ + T Sbjct: 176 AETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILT 235 Query: 166 KVDGVF 171 V+G Sbjct: 236 DVNGAA 241
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 48.0 bits (114), Expect = 1e-08 Identities = 33/174 (18%), Positives = 61/174 (35%), Gaps = 9/174 (5%) Query: 23 APRVITLSPANTELAFAAGITPVGVSSYSDY------PSQAKTIEQVASWQGMNLERIVA 76 R++ L EL A GI P GV+ +Y P ++ V NLE + Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVIDVGLRTEPNLELLTE 94 Query: 77 LKPDVVLAWRG-GNAERQVNQLQSLGIHVLWVQTSTIEEIIATLRELAQWSPQPEKAQQA 135 +KP ++ G G + + ++ + +L E+A A+ Sbjct: 95 MKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNLQSAAETH 154 Query: 136 AQAMQQEYDALKARYANAPKKRVFL-QFGSAP-LFTSGPGSIQDQVLRLCGGEN 187 + ++K R+ + + L + GP S+ ++L G N Sbjct: 155 LAQYEDFIRSMKPRFVKRGARPLLLTTLIDPRHMLVFGPNSLFQEILDEYGIPN 208
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 430 bits (1106), Expect = e-155 Identities = 212/291 (72%), Positives = 243/291 (83%) Query: 6 LITRRRLLIAMTISPLLWQMRGAQAADVDPQRVVALEWLPAELLLALGVTPYGVADIPNY 65 LI+RRRLL AM +SPLLWQM A AA +DP R+VALEWLP ELLLALG+ PYGVAD NY Sbjct: 6 LISRRRLLTAMALSPLLWQMNTAHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINY 65 Query: 66 RLWVNEPALPDSVIDVGLRTEPNLELLTQMKPSFIVWSAGYGPSPEKLARIAPGRGFTFS 125 RLWV+EP LPDSVIDVGLRTEPNLELLT+MKPSF+VWSAGYGPSPE LARIAPGRGF FS Sbjct: 66 RLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFS 125 Query: 126 DGKRPLAMAQRSLLEMADLLGKTQQAKRHLAEFDALMESLRPRFAGRGDRPLLMISLLDP 185 DGK+PLAMA++SL EMADLL A+ HLA+++ + S++PRF RG RPLL+ +L+DP Sbjct: 126 DGKQPLAMARKSLTEMADLLNLQSAAETHLAQYEDFIRSMKPRFVKRGARPLLLTTLIDP 185 Query: 186 RHVLVFGENCLFQEVLDRFGIKNAWHGEAAFWGSVSVGIDRLAAFNEADVICFDHGNERD 245 RH+LVFG N LFQE+LD +GI NAW GE FWGS +V IDRLAA+ + DV+CFDH N +D Sbjct: 186 RHMLVFGPNSLFQEILDEYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKD 245 Query: 246 MAQLLATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFARVLADAQGSPA 296 M L+ATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHF RVL +A G A Sbjct: 246 MDALMATPLWQAMPFVRAGRFQRVPAVWFYGATLSAMHFVRVLDNAIGGKA 296
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 34.9 bits (80), Expect = 0.002 Identities = 13/27 (48%), Positives = 13/27 (48%) Query: 819 QQQQQQQQQQQQQQQPPKQQEKSDGVA 845 Q QQQQ Q QQQQ QE A Sbjct: 338 PQAQQQQGQGQQQQAQATAQEAVAAAA 364 Score = 34.5 bits (79), Expect = 0.002 Identities = 13/25 (52%), Positives = 14/25 (56%) Query: 814 FNQSGQQQQQQQQQQQQQQQPPKQQ 838 F Q QQQQ Q QQQQ Q Q+ Sbjct: 334 FVMPPQAQQQQGQGQQQQAQATAQE 358 Score = 31.9 bits (72), Expect = 0.012 Identities = 12/34 (35%), Positives = 15/34 (44%) Query: 812 NPFNQSGQQQQQQQQQQQQQQQPPKQQEKSDGVA 845 N + + Q QQQQ Q QQ Q + VA Sbjct: 328 NQIHLNFVMPPQAQQQQGQGQQQQAQATAQEAVA 361
>BCTERIALGSPC#Bacterial general secretion pathway protein C signature. Length = 272 Score = 214 bits (546), Expect = 4e-71 Identities = 100/266 (37%), Positives = 159/266 (59%), Gaps = 7/266 (2%) Query: 17 KLLPQIVTLIILITAIPQLAKLTWRVVFPVSPEDISALPLTMPPAADPELKNVRPAFTLF 76 ++ +I+ ++++ QLA + WR+ P ++ + + PA + FTLF Sbjct: 12 SVIRRILFYLLMLLFCQQLAMIFWRIGLP---DNAPVSSVQITPAQARQQPVTLNDFTLF 68 Query: 77 GLAV-KNSPTPTDAASLNQVPVSSLKLRLAGLLASSNPARSIAIIEKGNQQVSLSTGDPL 135 G++ KN DA+ ++ +P S+L L L G++A + +RSIAII K N+Q S + + Sbjct: 69 GVSPEKNKAGALDASQMSNLPPSTLNLSLTGVMAGDDDSRSIAIISKDNEQFSRGVNEEV 128 Query: 136 PGYDARIAAILPDRIIVNYQGRKEAILLFNDSRAPSPPPTAAGNPPLVKRLREQPQNILT 195 PGY+A+I +I PDR+++ YQGR E + L++ + S G + + + Sbjct: 129 PGYNAKIVSIRPDRVVLQYQGRYEVLGLYSQEDSGSDGVP--GAQVNEQLQQRASTTMSD 186 Query: 196 YLSISPVLSGDKLLGYRLNPGKDASLFRQSGLQANDLAIALNGIDLRDQEQAQQALQNLA 255 Y+S SP+++ +KL GYRLNPG + F + GLQ ND+A+ALNG+DLRD EQA++A++ +A Sbjct: 187 YVSFSPIMNDNKLQGYRLNPGPKSDSFYRVGLQDNDMAVALNGLDLRDAEQAKKAMERMA 246 Query: 256 DMTEITLTVEREGQRHDIAFAL-GDE 280 D+ TLTVER+GQR DI GDE Sbjct: 247 DVHNFTLTVERDGQRQDIYMEFGGDE 272
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 839 bits (2169), Expect = 0.0 Identities = 606/646 (93%), Positives = 631/646 (97%) Query: 10 ALLILTPLLFSPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 69 LLI LLF PAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML Sbjct: 13 TLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 72 Query: 70 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRAKDAKTSAVPVASAAAPGEGDEVVTRVV 129 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVR+KDAKT+AVPVAS AAPG GDEVVTRVV Sbjct: 73 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGIGDEVVTRVV 132 Query: 130 PLTNVAARDLAPLLRQLNDNAGAGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR 189 PLTNVAARDLAPLLRQLNDNAG GSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR Sbjct: 133 PLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGDR 192 Query: 190 SVVTVPLSWASAAEVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRI 249 SVVTVPLSWASAA+VVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRI Sbjct: 193 SVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSRQRI 252 Query: 250 IAMIKQLDRQQAVQGNTKVIYLKYAKAADLVEVLTGISSSLQSDKQSARPVAAIDKNIII 309 IAMIKQLDRQQA QGNTKVIYLKYAKA+DLVEVLTGISS++QS+KQ+A+PVAA+DKNIII Sbjct: 253 IAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNIII 312 Query: 310 KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN 369 KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN Sbjct: 313 KAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKN 372 Query: 370 AGMTQFTNSGLPISTAIAGANQYNKDGTISSSLASALGSFNGIAAGFYQGNWAMLLTALS 429 AGMTQFTNSGLPISTAIAGANQYNKDGT+SSSLASAL SFNGIAAGFYQGNWAMLLTALS Sbjct: 373 AGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALS 432 Query: 430 SSTKNDILATPSIVTLDNMQATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKP 489 SSTKNDILATPSIVTLDNM+ATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKP Sbjct: 433 SSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKP 492 Query: 490 QINEGDAVLLEIEQEVSSVADSASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKT 549 QINEGD+VLLEIEQEVSSVAD+ASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDK+ Sbjct: 493 QINEGDSVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDKS 552 Query: 550 VTDTADKVPLLGDIPVIGALFRSDSKKVSKRNLMLFIRPTIIRDRDEYRQASSGQYTAFN 609 V+DTADKVPLLGDIPVIGALFRS SKKVSKRNLMLFIRPT+IRDRDEYRQASSGQYTAFN Sbjct: 553 VSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYRQASSGQYTAFN 612 Query: 610 NAQTKQRGKESSEASLSNDLLHIYPQQETQAFRQVSAAIDAFNLGG 655 +AQ+KQRGKE+++A L+ DLL IYP+Q+T AFRQVSAAIDAFNLGG Sbjct: 613 DAQSKQRGKENNDAMLNQDLLEIYPRQDTAAFRQVSAAIDAFNLGG 658
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 512 bits (1321), Expect = 0.0 Identities = 277/407 (68%), Positives = 335/407 (82%), Gaps = 4/407 (0%) Query: 1 MALFRYQALDAQGKTRRGLQQADSARHARQLLRDKGWLALEVTTADPARRLWAGGSLT-- 58 MA + YQALDAQGK RG Q+ADSAR ARQLLR++G + L V ++ L+ Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60 Query: 59 --RRTSAGDLALLTRQLATLVAAGIPLEKALDAVAQQCEKPSLRTLMAGVRSKVLEGHSL 116 R S DLALLTRQLATLVAA +PLE+ALDAVA+Q EKP L LMA VRSKV+EGHSL Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120 Query: 117 AEAMRGYPACFDGLFCAMVAAGETSGHLDGVLNRLANYTEQRQQLRARLLQAMIYPIVLT 176 A+AM+ +P F+ L+CAMVAAGETSGHLD VLNRLA+YTEQRQQ+R+R+ QAMIYP VLT Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180 Query: 177 LVAISVIAILLSTVVPKVVEQFVHLKQALPFSTRLLMSLSDIVRSAGPWLALLSLLALLA 236 +VAI+V++ILLS VVPKVVEQF+H+KQALP STR+LM +SD VR+ GPW+ L L +A Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240 Query: 237 LRYLLRQPARRLAWDRMLLRLPVIGRVARSVNSARYARTLSILNASAVPLLLSMRISADV 296 R +LRQ RR+++ R LL LP+IGR+AR +N+ARYARTLSILNASAVPLL +MRIS DV Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300 Query: 297 LSNAWARSQLAAASESVREGVSLHRALESTALFPPMMRYMIASGEQSGELTAMLERAAEN 356 +SN +AR +L+ A+++VREGVSLH+ALE TALFPPMMR+MIASGE+SGEL +MLERAA+N Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360 Query: 357 QDRELSAQIQMALSLFEPLLVVTMAGMVLFIVLAILQPILQLNTLMS 403 QDRE S+Q+ +AL LFEPLLVV+MA +VLFIVLAILQPILQLNTLMS Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLMS 407
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 243 bits (621), Expect = 2e-86 Identities = 98/140 (70%), Positives = 112/140 (80%) Query: 1 MQRQRGFTLLEIMVVIVILGILASLVVPNLMGNKEKADRQKVVSDLVALEGALDMYKLDN 60 +QRGFTLLEIMVVIVI+G+LASLVVPNLMGNKEKAD+QK VSD+VALE ALDMYKLDN Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63 Query: 61 SRYPNTEQGLQALVTAPAAEPHARNYPEGGYIRRLPQDPWGNEYQLLSPGQHGAIDVFSV 120 YP T QGL++LV AP P A NY + GYI+RLP DPWGN+Y L++PG+HGA D+ S Sbjct: 64 HHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLLSA 123 Query: 121 GPDGMPDTNDDIGNWTLGKK 140 GPDG T DDI NW L KK Sbjct: 124 GPDGEMGTEDDITNWGLSKK 143
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 177 bits (451), Expect = 8e-60 Identities = 97/164 (59%), Positives = 124/164 (75%) Query: 1 MSQRGFTLLEMMLVLLLIGVSASMVLLAFPSARTQEATQILARFQTQLDFVRERGQQTGQ 60 M QRGFTLLEMML+LLL+GVSA MVLLAFP++R A Q LARF+ QL FV++RG QTGQ Sbjct: 1 MRQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQ 60 Query: 61 LFGIIIHPERWQFMRLQPADDSAPAAADDRWGNAQWLPLQAGRVTTAETLPRARLTLRFP 120 FG+ +HP+RWQF+ L+ D + PA ADD W +WLPL+AGRV T+ ++ +L L F Sbjct: 61 FFGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAGRVATSGSIAGGKLNLAFA 120 Query: 121 DGQAWTPDEQPDVLIFPGGEVTPFQLRIDAATGINVDAQGDSQP 164 G+AWTP + PDVLIFPGGE+TPF+L + A GI +A+G+S P Sbjct: 121 QGEAWTPGDNPDVLIFPGGEMTPFRLTLGEAPGIAFNARGESLP 164
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 33.7 bits (77), Expect = 6e-05 Identities = 22/99 (22%), Positives = 36/99 (36%), Gaps = 8/99 (8%) Query: 1 MKREAGMTLIEVMVALVIF-ALAGLAV---MQSTLQQTRQLGRMEEKILASWLADNQLVQ 56 ++ G TL+E+MV +VI LA L V M + + +Q + L + L +L Sbjct: 4 TDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN 63 Query: 57 LRLENRWPALS--WSETTVEAAGTRWFVRWQGVETALPQ 93 L T+ + +G LP Sbjct: 64 HHYPTTNQGLESLVEAPTLPPLAANY--NKEGYIKRLPA 100
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 33.3 bits (76), Expect = 3e-04 Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 5/63 (7%) Query: 4 KMRGFTLIETLLALAILAVLSAAAV-MVLQNVIRADGLTREKS-LQIAALQRAFRQIADD 61 K RGFTL+E ++ + I+ VL++ V ++ N +AD ++K+ I AL+ A D Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKAD---KQKAVSDIVALENALDMYKLD 62 Query: 62 VTH 64 H Sbjct: 63 NHH 65
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 35.6 bits (82), Expect = 1e-04 Identities = 31/175 (17%), Positives = 77/175 (44%), Gaps = 14/175 (8%) Query: 7 PTAAGLYLNYLIHGMGVLLITLNMAHLQEQWGTDKAGVSIVISSLGI-GKLA-TIVTGFL 64 AA + + +++ +G + L + ++++ D + I +++ GI LA ++TG + Sbjct: 211 VVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPV 270 Query: 65 SDRFGRKPFIYLGILSYLIFFVGILLTKNIYLAYVFGIMAGLANSFLDSGTYPALMESFP 124 + R G + + LG+++ ++ + ++A+ ++ + PAL Sbjct: 271 AARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGM-----PALQAMLS 325 Query: 125 HSASRANV-----LIKAFVSAGQFLLPFIISFLIWANL--WFGWSFVIAAALFVL 172 + A S + P + + + A++ W GW+++ AAL++L Sbjct: 326 RQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLL 380 Score = 29.8 bits (67), Expect = 0.007 Identities = 34/143 (23%), Positives = 60/143 (41%), Gaps = 6/143 (4%) Query: 60 VTGFLSDRFGRKPFIYLGILSYLIFFVGILLTKNIYLAYVFGIMAGLANS-FLDSGTYPA 118 V G LSDRFGR+P + + + + + + +++ Y+ I+AG+ + +G Y A Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA 121 Query: 119 LMESFPHSASRANVLIKAFVSAGQFLLPFIISFLIWANLWFGWSFVIAAALFVLSGIYLL 178 + +R + A G P + + F AAAL L+ + Sbjct: 122 DITD-GDERARHFGFMSACFGFGMVAGPVLGGLM--GGFSPHAPFFAAAALNGLNFLTGC 178 Query: 179 KMPFPDSQAAKKRKPLRHRRKQP 201 + P+S +R+PLR P Sbjct: 179 FL-LPESHKG-ERRPLRREALNP 199
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 31.3 bits (71), Expect = 0.001 Identities = 25/122 (20%), Positives = 46/122 (37%), Gaps = 8/122 (6%) Query: 13 VAPQLSKEIHID---PAMMGIIFSAFAWTYALAQIPGGMFLDRFGNKVTYALSIFFWSLF 69 V P L +++ A GI+ + +A G DRFG + +S+ ++ Sbjct: 27 VLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVD 86 Query: 70 TLLQSFTLGLKSLLLLRLGLGVSEAPCFPANSRIVSTWFPQHERARA----TATYTVGEY 125 + + L L + R+ G++ A ++ ERAR +A + G Sbjct: 87 YAIMATAPFLWVLYIGRIVAGITGAT-GAVAGAYIADITDGDERARHFGFMSACFGFGMV 145 Query: 126 IG 127 G Sbjct: 146 AG 147
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 38.3 bits (89), Expect = 2e-05 Identities = 27/143 (18%), Positives = 52/143 (36%), Gaps = 13/143 (9%) Query: 98 AAIGILFGGWISDRLLKRTGSVNISRKLPIISGLLLSSC--IIAANWVSANSTVIIIMSV 155 + L I+ + R G ++ G++ I+ A I++ + Sbjct: 256 GILHSLAQAMITGPVAARLGERRA-----LMLGMIADGTGYILLAFATRGWMAFPIMVLL 310 Query: 156 AFFGQGMVGLGWTLISDIAPENMAGLTGGIFNFCANMASIIAPLIIGVIISATGNFFYAL 215 A G GM L ++S E G G ++ SI+ PL+ I +A+ Sbjct: 311 ASGGIGMPALQ-AMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAAS-----IT 364 Query: 216 IYVGLTALIGVIAYIFIIGDIKR 238 + G + G Y+ + ++R Sbjct: 365 TWNGWAWIAGAALYLLCLPALRR 387
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.9 bits (75), Expect = 0.002 Identities = 41/187 (21%), Positives = 67/187 (35%), Gaps = 17/187 (9%) Query: 16 AAFMLVAFMMGVAGALQAPTLSLFLSREVGAQPFWVGLFYTVNAIAGILVSLWLAKRSDS 75 AA M V F+M + G + A +F +G+ I L + + Sbjct: 213 AALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAA 272 Query: 76 RGDRRRLIMFCCLMAVGNALLFAFNRHYLTLITCGVMLASIANAAMPQLFALAREYADSS 135 R RR +M + +L AF V+LAS MP L A+ D Sbjct: 273 RLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGG-IGMPALQAMLSRQVDEE 331 Query: 136 AREVVMFSSVMRAQLSLAWVIGPPLAFMLALNYGFTTMFSIAAG-----IFVISLALIAI 190 + + S SL ++GP L FT +++ + ++ AL + Sbjct: 332 RQGQLQGSLAALT--SLTSIVGPLL---------FTAIYAASITTWNGWAWIAGAALYLL 380 Query: 191 KLPSVPR 197 LP++ R Sbjct: 381 CLPALRR 387
>NEISSPPORIN#Neisseria sp. porin signature. Length = 348 Score = 36.5 bits (84), Expect = 2e-04 Identities = 17/81 (20%), Positives = 30/81 (37%), Gaps = 2/81 (2%) Query: 367 YHAGEHYQ-GNWFPAYGLLPRWHHASNHACEKPAGLETVTLTYYRDHVEHRVIGGIMRDL 425 YH G +YQ +F Y L + + E ++ + HR++GG + Sbjct: 180 YHVGLNYQNSGFFAQYAGLFQRYGEGTKKIEYDDQTYSIPSLFVEKLQVHRLVGGYDNNA 239 Query: 426 LAAHQVKLEIQELEYDAWHRG 446 L V + Q+ + G Sbjct: 240 LYV-SVAAQQQDAKLYGAMSG 259
>2FE2SRDCTASE#Ferric iron reductase signature. Length = 262 Score = 372 bits (955), Expect = e-133 Identities = 171/262 (65%), Positives = 209/262 (79%) Query: 1 MAWRSLPLSDELIWRAPLPTAEHALAESIREKIATLRPHLLDFLRLDEPAPRHALTLAEW 60 MA+RS PL +++IWR L + LA+++R IA R HLL+F+RLDEPAP +A+TLA+W Sbjct: 1 MAYRSAPLYEDVIWRTHLQPQDPTLAQAVRATIAKHREHLLEFIRLDEPAPLNAMTLAQW 60 Query: 61 SQPIALRSLLATWSDHIYRHQPTLPREQKPLLSLWAQWYIGLLVPPLMLALLNEPQGLSL 120 S P L SLLA +SDHIYR+QP + RE KPL+SLWAQWYIGL+VPPLMLALL + + L + Sbjct: 61 SSPNVLSSLLAVYSDHIYRNQPMMIRENKPLISLWAQWYIGLMVPPLMLALLTQEKALDV 120 Query: 121 APEHFHVEFHESGRAACFWIDVHSDADIERLSPQARMDALVTRTLQPVVEALAATGEINS 180 +PEHFH EFHE+GR ACFW+DV D + SPQ RM+ L+++ L PVV+AL ATGEIN Sbjct: 121 SPEHFHAEFHETGRVACFWVDVCEDKNATPHSPQHRMETLISQALVPVVQALEATGEING 180 Query: 181 KLIWSNTGYLINWYLGEMRALLGDERLAALRQHCFFEKQLADGQDNPLWRTVMLREGQLV 240 KLIWSNTGYLINWYL EM+ LLG+ + +LR FFEK L +G+DNPLWRTV+LR+G LV Sbjct: 181 KLIWSNTGYLINWYLTEMKQLLGEATVESLRHALFFEKTLTNGEDNPLWRTVVLRDGLLV 240 Query: 241 RRTCCQRYRLPDVQQCGDCTLK 262 RRTCCQRYRLPDVQQCGDCTLK Sbjct: 241 RRTCCQRYRLPDVQQCGDCTLK 262
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 28.2 bits (62), Expect = 0.033 Identities = 13/38 (34%), Positives = 22/38 (57%), Gaps = 4/38 (10%) Query: 237 PQYEETLMSIAQKLKQEGRE----EGREEGHLEGLQEG 270 P E+ L + + ++G + EGR++GH +G QEG Sbjct: 38 PSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEG 75
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 157 bits (399), Expect = 3e-43 Identities = 79/301 (26%), Positives = 132/301 (43%), Gaps = 27/301 (8%) Query: 1 MAELLAESDRQPEQADHFSLLTGHDGSLRKPIEQMKTALFYPNGGLPLLITGDSGTGKSY 60 +AE + + + L G ++++ + + L L+ITG+SGTGK Sbjct: 119 LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLM---QTDLTLMITGESGTGKEL 175 Query: 61 MAELMHEFAIAQELLAPDAPFVSFNCAQYASNPELLAANLFGYVKGAFTGAQSDKAGAFE 120 +A +H++ + + PFV+ N A A +L+ + LFG+ KGAFTGAQ+ G FE Sbjct: 176 VARALHDYGKRR-----NGPFVAINMA--AIPRDLIESELFGHEKGAFTGAQTRSTGRFE 228 Query: 121 AANGGMLFLDEVHRLDAQGQEKLFTWLDRKEIYRVGETAQGLPISLRLVFATTEDIHS-- 178 A GG LFLDE+ + Q +L L + E VG + +R+V AT +D+ Sbjct: 229 QAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGR-TPIRSDVRIVAATNKDLKQSI 287 Query: 179 ---TFLTTFLRRIPIL-VSLPDLQHRSREEKEALTLQFFWQEARTLAAR-LQLTPRLLQV 233 F R+ ++ + LP L R R E ++ F Q+A + L++ Sbjct: 288 NQGLFREDLYYRLNVVPLRLPPL--RDRAEDIPDLVRHFVQQAEKEGLDVKRFDQEALEL 345 Query: 234 LTQYVYRGNVGELKNVVKYAVASAWARSPGREMLTVTLHDLPENVMAATPALSEAMGQQE 293 + + + GNV EL+N+V+ A P +T + + + P Sbjct: 346 MKAHPWPGNVRELENLVRRLT----ALYPQD---VITREIIENELRSEIPDSPIEKAAAR 398 Query: 294 P 294 Sbjct: 399 S 399
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.4 bits (66), Expect = 0.021 Identities = 39/217 (17%), Positives = 63/217 (29%), Gaps = 22/217 (10%) Query: 1 MLGLGMFFWSLFQALSGMVHSFTQFVLVRIGMGIGEAPMNPCGVKVINDWFNIKERGRPM 60 +L + + ++ A+ + RI GI A G I D + ER R Sbjct: 75 VLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDERARHF 133 Query: 61 GFFNAASTIGVAVSPPILAAMMLMMGWRWMFITIGILGIFIAIGWYMLYR---NREDLPL 117 GF +A G+ P+L +M F L + L E PL Sbjct: 134 GFMSACFGFGMVAG-PVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPL 192 Query: 118 TADEQAYLNAGSVNVRRDPLSFAEWRSLFKNKTMWGMMLGFSGINYTAWLYLAWLPGYLQ 177 R A +R + +M F + + A + + Sbjct: 193 R--------------REALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGE 238 Query: 178 TAYKLDLKSTGFMAAIPFLFGAAGMLINGYVTDWLVK 214 + D + G A FG L +T + Sbjct: 239 DRFHWDATTIGISLA---AFGILHSLAQAMITGPVAA 272
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 36.5 bits (84), Expect = 5e-05 Identities = 12/60 (20%), Positives = 23/60 (38%), Gaps = 4/60 (6%) Query: 80 ALRPAWRGKGLGRKLMQELLMLLQQQGIETVFLEVIRDNHAAVALYQSLGFTRRYGLCGY 139 A+ +R KG+G L+ + + ++ + LE N +A Y F + Sbjct: 96 AVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI----IGAV 151
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 34.1 bits (78), Expect = 1e-04 Identities = 21/93 (22%), Positives = 44/93 (47%), Gaps = 10/93 (10%) Query: 73 VGFI-LTEPLDDALFIVEVAVHQAWQQQGIGRMLLERVIESARQMGYPAVTLTTFREVPW 131 +G I + + I ++AV + ++++G+G LL + IE A++ + + L T +++ Sbjct: 77 IGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLET-QDI-- 133 Query: 132 NAP---FYTRLGFAM--LDELTLPAGLAAKREQ 159 N FY + F + +D L + E Sbjct: 134 NISACHFYAKHHFIIGAVD-TMLYSNFPTANEI 165
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 49.7 bits (119), Expect = 2e-08 Identities = 29/106 (27%), Positives = 49/106 (46%), Gaps = 17/106 (16%) Query: 3 VDWLFKNVTVIDGSGGPQYRADVAVKGDRIMAIAPA--------LDV---AAEQVIDGQG 51 VD + N ++D G +AD+ +K RI AI A + + +VI G+G Sbjct: 68 VDTVITNALILDHWG--IVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEG 125 Query: 52 RVLAPGFIDVHTHDDINVIRMPEYLPKLSQGVTTVIVGNCGISAAT 97 +++ G +D H H I ++ E L G+T ++ G G + T Sbjct: 126 KIVTAGGMDSHIH-FICPQQIEE---ALMSGLTCMLGGGTGPAHGT 167
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 683 bits (1764), Expect = 0.0 Identities = 224/1059 (21%), Positives = 437/1059 (41%), Gaps = 54/1059 (5%) Query: 1 MIEWIIRRSVANRFLVMMAALFLSIWGTWTIIHTPVDALPDLSDVQVIVKTRYPGQAPQI 60 M + IRR + A+ L + G I+ PV P ++ V V YPG Q Sbjct: 1 MANFFIRR----PIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQT 56 Query: 61 VENQVTWPLTTTMLSVPGAKTVRGFSQ-FGDSYVYVIFEDGTDPYWARSRVLEYLNQVQG 119 V++ VT + M + + S G + + F+ GTDP A+ +V L Sbjct: 57 VQDTVTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATP 116 Query: 120 KLPAGVSAEMGP-DATGVGWVFEYALVDRSGKHDLAELRSLQDWFLKYELKTIPNVSEVA 178 LP V + + + ++ V + ++ +K L + V +V Sbjct: 117 LLPQEVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQ 176 Query: 179 SVGGVVKEYQIVVDPMKLTQYGISLGEVKSALDASNQEAGGSSVELA------EAEYMVR 232 G +I +D L +Y ++ +V + L N + + + + Sbjct: 177 LFGAQ-YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASII 235 Query: 233 ASGYLQTLDDFKNIVLKTGDNGVPVYLGDVARVQIGPEMRRGIAELNGEGEVAGGVVILR 292 A + ++F + L+ +G V L DVARV++G E IA +NG+ AG + L Sbjct: 236 AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLA 294 Query: 293 SGKNAREVISAVKAKLASLQSSLPEGVEVVTTYDRSQLIDRAIDNLSYKLLEEFIVVALV 352 +G NA + A+KAKLA LQ P+G++V+ YD + + +I + L E ++V LV Sbjct: 295 TGANALDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLV 354 Query: 353 CALFLWHVRSALVAIISLPLGLCFAFIMMHFQGLNANIMSLGGIAIAVGAMVDAAIVMIE 412 LFL ++R+ L+ I++P+ L F ++ G + N +++ G+ +A+G +VD AIV++E Sbjct: 355 MYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVE 414 Query: 413 NAHKRLEEWEHKHPGEKLSNDIRWKIITEASVEVGPALFISLLIITLSFIPIFTLEGQEG 472 N + + E + P + ++ ++ AL ++++ FIP+ G G Sbjct: 415 NVERVMME-DKLPP---------KEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTG 464 Query: 473 KLFGPLAFTKTWSMAGAALLAIVAIPILMGFWIRGRIPAENSNPLNRF----------LI 522 ++ + T +MA + L+A++ P L ++ AE+ F + Sbjct: 465 AIYRQFSITIVSAMALSVLVALILTPALCATLLKPV-SAEHHENKGGFFGWFNTTFDHSV 523 Query: 523 RIYHPLLLKVLHWPKTTLLIALLSILTVVWPLNRVGGEFLPQINEGDLLYMPSTLPGISA 582 Y + K+L LLI L + +V R+ FLP+ ++G L M G + Sbjct: 524 NHYTNSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQ 583 Query: 583 AQAADMLQKTDKLIMA--VPEVARVFGKTGKAETATDSAPLEMVETTIQLKPQDQW-RPG 639 + +L + + V VF G + + + LKP ++ Sbjct: 584 ERTQKVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQN---AGMAFVSLKPWEERNGDE 640 Query: 640 MTMEKIVEELDKTVRLPGLANLWVPPIRNRIDMLSTGIKSPIGIKVSGTNLADIDAIAGQ 699 + E ++ + + + +++ + I +G + Q Sbjct: 641 NSAEAVIHRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQ 700 Query: 700 IEVVARTVPG-VTSALAERLVGGRYLNIDIQREKAARYGMTVGDVQLFVSSAIGGAMVGE 758 + +A P + S L +++ +EKA G+++ D+ +S+A+GG V + Sbjct: 701 LLGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVND 760 Query: 759 TVEGVERYPINIRYPQSYRDSPETLRQLPILTPLKQQIVLGDVAEVKVVTGPSMLKTENA 818 ++ + ++ +R PE + +L + + + + V G L+ N Sbjct: 761 FIDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNG 820 Query: 819 RPTSWIYIDARDRDMVSVVHDLQQAIGKEVKLKPGISVSYSGQFELLERANQKLKLMVPM 878 P+ I +A L + + KL GI ++G + + +V + Sbjct: 821 LPSMEIQGEAAPGTSSGDAMALMENL--ASKLPAGIGYDWTGMSYQERLSGNQAPALVAI 878 Query: 879 TLMIIFVLLYLAFRRVGEALLIITSVPFALVGGIWFLYWMGFHLSVATGTGFIALAGVAA 938 + +++F+ L + + ++ VP +VG + V G + G++A Sbjct: 879 SFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSA 938 Query: 939 EFGVVMLMYLRHAIEAEPSLENPQTFSVDKLDEALYQGAVLRVRPKAMTVAVIIAGLLPI 998 + ++++ + + +E E + EA +R+RP MT I G+LP+ Sbjct: 939 KNAILIVEFAKDLMEKEGK----------GVVEATLMAVRMRLRPILMTSLAFILGVLPL 988 Query: 999 LWGTGAGSEVMSRIAAPMIGGMITAPLLSLFIIPAAYKL 1037 GAGS + + ++GGM++A LL++F +P + + Sbjct: 989 AISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVV 1027 Score = 74.5 bits (183), Expect = 2e-15 Identities = 58/353 (16%), Positives = 138/353 (39%), Gaps = 28/353 (7%) Query: 705 RTVPGVTSALAERLVGGRY-LNIDIQREKAARYGMTVGDVQLFVSSA----IGGAMVGET 759 + GV +L G +Y + I + + +Y +T DV + G + G Sbjct: 167 SRLNGVGDV---QLFGAQYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTP 223 Query: 760 VEGVERYPINIRYPQSYRDSPETLRQLPILTPLKQQIV-LGDVAEVKV-VTGPSMLKTEN 817 ++ +I Q+ +PE ++ + +V L DVA V++ +++ N Sbjct: 224 ALPGQQLNASII-AQTRFKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARIN 282 Query: 818 ARPTSWIYI----DARDRDMVSVVHDLQQAIGKEVKLKPGISVSY-SGQFELLERANQKL 872 +P + + I A D + + G+ V Y ++ + ++ Sbjct: 283 GKPAAGLGIKLATGANALDTAKAIKAKLAELQPF--FPQGMKVLYPYDTTPFVQLSIHEV 340 Query: 873 KLMVPMTLMIIFVLLYLAFRRVGEALLIITSVPFALVGGIWFLYWMGFHLSVATGTGFIA 932 + +M++F+++YL + + L+ +VP L+G L G+ ++ T G + Sbjct: 341 VKTLFEAIMLVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVL 400 Query: 933 LAGVAAEFGVVMLMYLRHAIEAEPSLENPQTFSVDKLDEALYQGAVLRVRPKAMTVAVII 992 G+ + +V++ + + + P+ + + + QGA++ V+ Sbjct: 401 AIGLLVDDAIVVVENVERVMMEDKL--PPKEATEKSMSQI--QGALV------GIAMVLS 450 Query: 993 AGLLPILWGTGAGSEVMSRIAAPMIGGMITAPLLSLFIIPAAYKLMWLSRHRG 1045 A +P+ + G+ + + + ++ M + L++L + PA + Sbjct: 451 AVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVALILTPALCATLLKPVSAE 503
>BCTLIPOCALIN#Bacterial lipocalin signature. Length = 171 Score = 233 bits (595), Expect = 1e-81 Identities = 85/151 (56%), Positives = 111/151 (73%), Gaps = 1/151 (0%) Query: 25 PKGVQPISGFDASRYLGKWYEVARLENRFERGLEQVTATYGARSDGGISVVNRGYDPVKK 84 P+ V+P+S F+ + YLGKWYEVARL++ FERGL QVTA Y R+DGGISV+NRGY K Sbjct: 20 PESVKPVSDFELNNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLNRGYSEEKG 79 Query: 85 RWNESDGKAYFTGAPTTAALKVSFFGPFYGGYNVIRLD-DDYQYALVSGPNRDYLWILSR 143 W E++GKAYF T LKVSFFGPFYG Y V LD ++Y YA VSGPN +YLW+LSR Sbjct: 80 EWKEAEGKAYFVNGSTDGYLKVSFFGPFYGSYVVFELDRENYSYAFVSGPNTEYLWLLSR 139 Query: 144 TPTIPAAVKQDYLNTARELGFDVDRLVWIRQ 174 TPT+ + ++ ++E GFD +RL++++Q Sbjct: 140 TPTVERGILDKFIEMSKERGFDTNRLIYVQQ 170
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 61.6 bits (149), Expect = 2e-13 Identities = 51/233 (21%), Positives = 85/233 (36%), Gaps = 16/233 (6%) Query: 4 VLITGASSGIGAGLAKSFAADGHLVIACGRDASRLAALQQFSPNISVRL-----FDMTDR 58 ITGA+ GIG +A++ A+ G + A + +L + S R D+ D Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVS-SLKAEARHAEAFPADVRDS 69 Query: 59 DACRQALTGCFA-----DLIILCAGTCEYLDHGQVDAALVERVMATNFLGPVNCLAALQT 113 A + D+++ AG + E + N G N ++ Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSK 129 Query: 114 QLEA--GDRVVLVSSMAHWLPFPRAEAYGASKAALSWFANSLRLDWEPKGVAVTVVSPGF 171 + +V V S +P AY +SKAA F L L+ + +VSPG Sbjct: 130 YMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGS 189 Query: 172 VDTPLTRKNDFAMPGRVSVDRAVAA-IRHGLAKGKNHIAFPTGFSLALRLLAS 223 +T + G V + + G+ K +A P+ + A+ L S Sbjct: 190 TETDMQWSLWADENGAEQVIKGSLETFKTGIPLKK--LAKPSDIADAVLFLVS 240
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 28.3 bits (63), Expect = 0.049 Identities = 25/98 (25%), Positives = 47/98 (47%), Gaps = 10/98 (10%) Query: 19 GERIGYGMGDFAQNLVFGTIGGF---LALHMLTVNTISTATAGFIFLFVRIINV-FWDPM 74 G+ I + +G ++FGT+ GF + M V+ +STA G + +F ++V + + Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI 312 Query: 75 VGTYVDKRTSKAGKYRPWLLRAGVPLVILSALLFAPIP 112 G VD+R ++L GV + +S L + + Sbjct: 313 GGILVDRRGPL------YVLNIGVTFLSVSFLTASFLL 344
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.7 bits (79), Expect = 6e-04 Identities = 14/33 (42%), Positives = 18/33 (54%) Query: 30 VVLVGPSGCGKSTLLRLLAGLEPVSEGQIWLHD 62 VVL G G GKSTL+ L GL+ S+ + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGT 631
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 41.6 bits (97), Expect = 4e-06 Identities = 88/314 (28%), Positives = 139/314 (44%), Gaps = 40/314 (12%) Query: 113 DQKAGDFLHKEFWPAMHKNAQVMGTTYAIPFHNSTPILYYNKTMFDRAGIKQPPQTWAEL 172 D+ D L+ W A+ N +++ A P L YNK + + PP+TW E+ Sbjct: 108 DKAFQDKLYPFTWDAVRYNGKLI----AYPIAVEALSLIYNKDL-----LPNPPKTWEEI 158 Query: 173 LADAKKLTDESKGQWGIMLPSTNDDYGGWIFSALVRANGG---KYFNEDYP-GEVYYNSP 228 A K+L ++KG+ +M + + Y W L+ A+GG KY N Y +V ++ Sbjct: 159 PALDKEL--KAKGKSALMF-NLQEPYFTW---PLIAADGGYAFKYENGKYDIKDVGVDNA 212 Query: 229 TAIGALRFWQDLIYKDKVMPSGVLNSKQISAAFFSGKLGMAMLSTGALGFMRENSKDFEL 288 A L F DLI K+K M + + AAF G+ M + G + ++ Sbjct: 213 GAKAGLTFLVDLI-KNKHMNADT-DYSIAEAAFNKGETAMTI--NGPWAWSNIDTSKVNY 268 Query: 289 GVAMLPA-KEQRAVPIGGASLVSFKGINEA--QKKAAYQFL-TYLVSPEVNGAWSRFTGY 344 GV +LP K Q + P G V GIN A K+ A +FL YL++ E A ++ Sbjct: 269 GVTVLPTFKGQPSKPFVG---VLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPL 325 Query: 345 FSPRKASYDTPEMKAYLQQDPRAAIALEQLKYAHPWYSTWETVAVRKAMENQLAAVVNDA 404 + SY + L +DPR A +E + + + A A+ AV+N A Sbjct: 326 GAVALKSY-----EEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVR---TAVINAA 377 Query: 405 --KVTPEAAVQAAQ 416 + T + A++ AQ Sbjct: 378 SGRQTVDEALKDAQ 391
>BCTLIPOCALIN#Bacterial lipocalin signature. Length = 171 Score = 249 bits (638), Expect = 4e-88 Identities = 71/152 (46%), Positives = 103/152 (67%), Gaps = 1/152 (0%) Query: 25 PPGVTVVSPFDVQRYLGTWYEIARFDHPFESGLEKVTIAWHPRDDGGLDVVNKGYNPDRG 84 P V VS F++ YLG WYE+AR DH FE GL +VT + R+DGG+ V+N+GY+ ++G Sbjct: 20 PESVKPVSDFELNNYLGKWYEVARLDHSFERGLSQVTAEYRVRNDGGISVLNRGYSEEKG 79 Query: 85 MWQKTDGVAYFTGEPSRAALKISFFGPFYGSYNVIALDKE-YRYALVCGPDRDYLWLLAR 143 W++ +G AYF + LK+SFFGPFYGSY V LD+E Y YA V GP+ +YLWLL+R Sbjct: 80 EWKEAEGKAYFVNGSTDGYLKVSFFGPFYGSYVVFELDRENYSYAFVSGPNTEYLWLLSR 139 Query: 144 APTIAPEVRQQMLNIATRQGFDVGKLVWVNQR 175 PT+ + + + ++ +GFD +L++V Q+ Sbjct: 140 TPTVERGILDKFIEMSKERGFDTNRLIYVQQQ 171
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 118 bits (298), Expect = 1e-31 Identities = 67/401 (16%), Positives = 145/401 (36%), Gaps = 65/401 (16%) Query: 30 TGEVITLPHSVNVFAPQQGFVLNQYVKVGDIVKKGQKLYEIDISRNTTNGNVSLAQTAVI 89 G++ S + + V VK G+ V+KG L ++ + Q++++ Sbjct: 87 NGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALG--AEADTLKTQSSLL 144 Query: 90 NEKI--INAESIITKLIRNKDETL--------------------NALNTQLNTIKKSLSE 127 ++ + + + NK L + + Q +T + + Sbjct: 145 QARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQ 204 Query: 128 TTSMLANTQAGLNKMH--------------QNLSSYDKYLKEGLITKDQYNYQHSLYFQQ 173 L +A + L + L + I K Q + Y + Sbjct: 205 KELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEA 264 Query: 174 QSAYQSLVSQKMQLESQITQFTSDKVTKAADFDNQISNQ----QNQINDYKNQLVESDAK 229 + + SQ Q+ES+I + F N+I ++ + I +L +++ + Sbjct: 265 VNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEER 324 Query: 230 -GNVIIKATTDGRIESLAV-TKGQMVDNGSSLAQIKPTGNVEYYLI-LWLPNNSIPYVKP 286 +I+A +++ L V T+G +V +L I P + + + N I ++ Sbjct: 325 QQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDT--LEVTALVQNKDIGFINV 382 Query: 287 GDTINIRYDAFPADKFGQFPGEVISIS--SMPASRQEMSEYTNVNNGTNQQELALYKTIV 344 G I+ +AFP ++G G+V +I+ ++ R + + I+ Sbjct: 383 GQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLV----------------FNVII 426 Query: 345 KIKQKSFSYNGKTLYLSNGLKAEAVVFLEERPLYMWMFTPF 385 I++ S K + LS+G+ A + R + ++ +P Sbjct: 427 SIEENCLSTGNKNIPLSSGMAVTAEIKTGMRSVISYLLSPL 467
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 570 bits (1470), Expect = 0.0 Identities = 258/631 (40%), Positives = 375/631 (59%), Gaps = 42/631 (6%) Query: 2 LTLGESTSDADIFDSVPYRGAQLNSDDYMDAESIQGYAPVVRGIAKSNAKVIIKQSGYVI 61 LTLG+ + DIFD + +RGAQL SDD M +S +G+APV+ GIA+ A+V IKQ+GY I Sbjct: 261 LTLGDGYTQGDIFDGINFRGAQLASDDNMLPDSQRGFAPVIHGIARGTAQVTIKQNGYDI 320 Query: 62 YQSFVPPGAFEITDLYSTGGNGDLNVTIEEADGTQQNFVVAYASLPVLRREGSLKYSITS 121 Y S VPPG F I D+Y+ G +GDL VTI+EADG+ Q F V Y+S+P+L+REG +YSIT+ Sbjct: 321 YNSTVPPGPFTINDIYAAGNSGDLQVTIKEADGSTQIFTVPYSSVPLLQREGHTRYSITA 380 Query: 122 GQYRSSDGSVDYTPFSQATASYGLPYNTTLYGGFQAASKYQSVAIGVGNNLGVLGAVSLD 181 G+YRS + + F Q+T +GLP T+YGG Q A +Y++ G+G N+G LGA+S+D Sbjct: 381 GEYRSGNAQQEKPRFFQSTLLHGLPAGWTIYGGTQLADRYRAFNFGIGKNMGALGALSVD 440 Query: 182 VTQAWSTKQDQDKISGQSVRIRYSKNLNDIGTNIAIAGYRYSTSGFNTLSDVLETYRDDY 241 +TQA ST D + GQSVR Y+K+LN+ GTNI + GYRYSTSG+ +D + + Y Sbjct: 441 MTQANSTLPDDSQHDGQSVRFLYNKSLNESGTNIQLVGYRYSTSGYFNFADTTYSRMNGY 500 Query: 242 K-----------------YYYNDRVKNRTEITVSQSLGDKLGYFNIGGVMEDYWNQRRRN 284 Y + + ++TV+Q LG + G + YW + Sbjct: 501 NIETQDGVIQVKPKFTDYYNLAYNKRGKLQLTVTQQLGR-TSTLYLSGSHQTYWGTSNVD 559 Query: 285 NSLNVGYSNSWSGITYNFNYSHSRSSTDYEGYGRNYSTDNIFSFNINVPLNFWMP----- 339 G + ++ I + +YS ++++ D + + N+N+P + W+ Sbjct: 560 EQFQAGLNTAFEDINWTLSYSLTKNAWQKG-------RDQMLALNVNIPFSHWLRSDSKS 612 Query: 340 ---NTWATYGLNTSDPGSTSNSVGLSGLALADNNLSWNLQQQYDNRD----YSSGTAGVD 392 + A+Y ++ G +N G+ G L DNNLS+++Q Y S+G A ++ Sbjct: 613 QWRHASASYSMSHDLNGRMTNLAGVYGTLLEDNNLSYSVQTGYAGGGDGNSGSTGYATLN 672 Query: 393 YKGTYGEIFGSYNYDHNWQRLNYGINGGIVAHRDGITAGQSFSDTSALVKAPGVNGTRVV 452 Y+G YG Y++ + ++L YG++GG++AH +G+T GQ +DT LVKAPG +V Sbjct: 673 YRGGYGNANIGYSHSDDIKQLYYGVSGGVLAHANGVTLGQPLNDTVVLVKAPGAKDAKVE 732 Query: 453 GNTGVKTDYRGYAIVPNITMYRRNDVVLDTETMPNDVDLDTTVATVVPTRGAIVRAEYSG 512 TGV+TD+RGYA++P T YR N V LDT T+ ++VDLD VA VVPTRGAIVRAE+ Sbjct: 733 NQTGVRTDWRGYAVLPYATEYRENRVALDTNTLADNVDLDNAVANVVPTRGAIVRAEFKA 792 Query: 513 KKGIRALLQLVDTHNKFIPFGAMVNLASENSTNNNSGIVSDNGQVYLAGLPTTGVLLVKW 572 + GI+ L+ L NK +PFGAMV + ++ +SGIV+DNGQVYL+G+P G + VKW Sbjct: 793 RVGIKLLMTLTHN-NKPLPFGAMV----TSESSQSSGIVADNGQVYLSGMPLAGKVQVKW 847 Query: 573 GNSISKQCTVNYQFPGSEKVNGIKQGQFICR 603 G + C NYQ P + + Q CR Sbjct: 848 GEEENAHCVANYQLPPESQQQLLTQLSAECR 878
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 28.9 bits (64), Expect = 0.036 Identities = 22/88 (25%), Positives = 40/88 (45%), Gaps = 4/88 (4%) Query: 175 DAKVRLNVRHEIKSLQKRLGFTSLIVTHDQQEALVMA-DRIAVLNQGRIEQTGTPEEIYQ 233 D +++ V H K +K F + T D + + A DR Q + TP++ +Q Sbjct: 56 DTGIKVTVEHPDKLEEK---FPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQ 112 Query: 234 RPATPFVADFMGADNKIISGELSAAAIS 261 PF D + + K+I+ ++ A+S Sbjct: 113 DKLYPFTWDAVRYNGKLIAYPIAVEALS 140
>PF05272#Virulence-associated E family protein Length = 892 Score = 34.3 bits (78), Expect = 5e-05 Identities = 13/35 (37%), Positives = 18/35 (51%) Query: 32 VVFVGPSGCGKSTLLRMIAGLETITSGDLFIGDTR 66 VV G G GKSTL+ + GL+ + IG + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 756 bits (1952), Expect = 0.0 Identities = 378/396 (95%), Positives = 391/396 (98%) Query: 1 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK 60 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK Sbjct: 1 MKIKTGARILALSALTTMMFSASALAKIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIK 60 Query: 61 VSVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW 120 V+VEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW Sbjct: 61 VTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTW 120 Query: 121 DAVRYNGKLIAYPIAVEALSLIYNKDLVPNPPKTWEEIPALDKELKAKGKSALMFNLQEP 180 DAVRYNGKLIAYPIAVEALSLIYNKDL+PNPPKTWEEIPALDKELKAKGKSALMFNLQEP Sbjct: 121 DAVRYNGKLIAYPIAVEALSLIYNKDLLPNPPKTWEEIPALDKELKAKGKSALMFNLQEP 180 Query: 181 YFTWPLIAADGGYAFKFENGKYDVKDVGVDSAGAKAGLTFLVDLIKNKHMNADTDYSIAE 240 YFTWPLIAADGGYAFK+ENGKYD+KDVGVD+AGAKAGLTFLVDLIKNKHMNADTDYSIAE Sbjct: 181 YFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDYSIAE 240 Query: 241 AAFNKGETAMTINGPWAWSNIDKSKVNYGVTLLPTFKGKPSKPFVGVLSAGINAASPNKE 300 AAFNKGETAMTINGPWAWSNID SKVNYGVT+LPTFKG+PSKPFVGVLSAGINAASPNKE Sbjct: 241 AAFNKGETAMTINGPWAWSNIDTSKVNYGVTVLPTFKGQPSKPFVGVLSAGINAASPNKE 300 Query: 301 LAKEFLENYLMTDQGLEAVNNDKPLGAVALKSFQEKLEKDPRIAATMANAQKGEIMPNIP 360 LAKEFLENYL+TD+GLEAVN DKPLGAVALKS++E+L KDPRIAATM NAQKGEIMPNIP Sbjct: 301 LAKEFLENYLLTDEGLEAVNKDKPLGAVALKSYEEELAKDPRIAATMENAQKGEIMPNIP 360 Query: 361 QMSAFWYAVRTAVINAASGRQTVDAALKDAQSRITK 396 QMSAFWYAVRTAVINAASGRQTVD ALKDAQ+RITK Sbjct: 361 QMSAFWYAVRTAVINAASGRQTVDEALKDAQTRITK 396
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 26.8 bits (59), Expect = 0.036 Identities = 11/43 (25%), Positives = 14/43 (32%) Query: 102 GHRYGEHIFHAVETRAKTAGESWLWLEVLAANPAARRFYERQG 144 G + H AK L LE N +A FY + Sbjct: 103 KKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHH 145
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 31.8 bits (72), Expect = 0.007 Identities = 17/79 (21%), Positives = 34/79 (43%), Gaps = 13/79 (16%) Query: 69 LAKETDLAGAIKSMFSGEKINR-------TEDRAVLHVALRNRSNTPIVVDGKDVMPEVN 121 AK +DL + + S + + D+ ++ + ++N IV DVM ++ Sbjct: 276 YAKASDLVEVLTGISSTMQSEKQAAKPVAALDKNII-IKAHGQTNALIVTAAPDVMNDLE 334 Query: 122 AVLEKM-----KTFSEAII 135 V+ ++ + EAII Sbjct: 335 RVIAQLDIRRPQVLVEAII 353
>56KDTSANTIGN#Rickettsia 56kDa type-specific antigen protein signature. Length = 533 Score = 25.3 bits (55), Expect = 0.038 Identities = 19/75 (25%), Positives = 34/75 (45%), Gaps = 5/75 (6%) Query: 2 ALPRITQKEMTEREQRELKTLLDRARIAHGRPLSNAETNSVKKEYIDKLMAQREAEAKKA 61 LP E + + +EL L+ R + ++NA N + ++ AQ++ Sbjct: 290 GLPNSASIEQIQSKIQELGDTLEELRDSFDGYINNAFVNQIHLNFVMPPQAQQQQG---- 345 Query: 62 RQVRKQQAYKTDKEA 76 Q ++QQA T +EA Sbjct: 346 -QGQQQQAQATAQEA 359
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 29.6 bits (66), Expect = 0.006 Identities = 8/29 (27%), Positives = 16/29 (55%) Query: 9 AKSLVRERARTGLSLAEVARRAGIAKSTL 37 A L ++ + SL E+A+ AG+ + + Sbjct: 20 ALRLFSQQGVSSTSLGEIAKAAGVTRGAI 48
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 65.6 bits (160), Expect = 2e-13 Identities = 43/236 (18%), Positives = 86/236 (36%), Gaps = 31/236 (13%) Query: 36 FPIVYASALNGIAGLDHEDMADDMTPLYQAIVDRVPAPDVDLDGPLQMQISQLDYNNYVG 95 FP+ + SA N I G+D+ L + I ++ + L ++ +++Y+ Sbjct: 214 FPVYHGSAKNNI-GIDN---------LIEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQ 263 Query: 96 VIGIGRIKRGKVKPNQQVTIIDSEGKTRNGKVGKVLTHLGLERIESDVAEAGDIIAITGL 155 + R+ G + V I + E K+ ++ T + E + D A +G+I+ + Sbjct: 264 RLAYIRLYSGVLHLRDSVRISEKEKI----KITEMYTSINGELCKIDKAYSGEIVILQNE 319 Query: 156 G-ELN--ISDTICDPQNVEALPALSVDEPTVSMFFNVNTSPFCGKEGKFVTSRQILDRLN 212 +LN + DT PQ + P + + + D L Sbjct: 320 FLKLNSVLGDTKLLPQR----ERIENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPL- 374 Query: 213 KELVHNVALRVEETEDADAFRVSGRGELHLSVLIENMRRE-GFEMAVSRPKVIFRE 267 LR +S G++ + V ++ + E+ + P VI+ E Sbjct: 375 --------LRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIYME 422 Score = 30.6 bits (69), Expect = 0.014 Identities = 12/75 (16%), Positives = 29/75 (38%), Gaps = 1/75 (1%) Query: 274 EPFENVTLDVEEQHQGSVMQALGERKGDLKNMNPDGKGRVRLDYVIPSRGLIGFRSEFMT 333 EP+ + + +++ + ++ + V L IP+R + +RS+ Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLKN-NEVILSGEIPARCIQEYRSDLTF 595 Query: 334 MTSGTGLLYSTFSHY 348 T+G + + Y Sbjct: 596 FTNGRSVCLTELKGY 610
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 123 bits (310), Expect = 3e-36 Identities = 49/123 (39%), Positives = 72/123 (58%), Gaps = 6/123 (4%) Query: 4 NLRNIAIIAHVDHGKTTLVDKLLQQSGTFD--ARTEAQERVMDSNDLEKERGITILAKNT 61 + NI ++AHVD GKTTL + LL SG + D+ LE++RGITI T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 62 AIKWNDYRINIVDTPGHADFGGEVERVMSMVDSVLLVVDAFDGPMPQTRFVTKKL----L 117 + +W + ++NI+DTPGH DF EV R +S++D +L++ A DG QTR + L + Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121 Query: 118 PTV 120 PT+ Sbjct: 122 PTI 124
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 600 bits (1549), Expect = 0.0 Identities = 203/478 (42%), Positives = 297/478 (62%), Gaps = 11/478 (2%) Query: 1 MQRGIAWIVDDDSSIRWVLERALTGAGLSCTTFESGNEVLDALTTKTPDVLLSDIRMPGM 60 M + DDD++IR VL +AL+ AG + + + D++++D+ MP Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 DGLALLKQIKQRHPMLPVIIMTAHSDLDAAVSAYQQGAFDYLPKPFDIDEAVALVDRAIS 120 + LL +IK+ P LPV++M+A + A+ A ++GA+DYLPKPFD+ E + ++ RA++ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 HYQEQQQPRNAPISSPTADIIGEAPAMQDVFRIIGRLSRSSISVLINGESGTGKELVAHA 180 + + ++G + AMQ+++R++ RL ++ ++++I GESGTGKELVA A Sbjct: 121 EPKRRPSKLEDDSQDG-MPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARA 179 Query: 181 LHRHSPRSKAPFIALNMAAIPKDLIESELFGHEKGAFTGANTVRQGRFEQADGGTLFLDE 240 LH + R PF+A+NMAAIP+DLIESELFGHEKGAFTGA T GRFEQA+GGTLFLDE Sbjct: 180 LHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDE 239 Query: 241 IGDMPLDVQTRLLRVLADGQFYRVGGYAPVKVDVRIIAATHQNLEQRVQEGKFREDLFHR 300 IGDMP+D QTRLLRVL G++ VGG P++ DVRI+AAT+++L+Q + +G FREDL++R Sbjct: 240 IGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYR 299 Query: 301 LNVIRVHLPPLRERREDIPRLARHFLQIAARELGVEAKQLHPETETALTRLAWPGNVRQL 360 LNV+ + LPPLR+R EDIP L RHF+Q A +E G++ K+ E + WPGNVR+L Sbjct: 300 LNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKE-GLDVKRFDQEALELMKAHPWPGNVREL 358 Query: 361 ENTCRWLTVMAAGQEVLTQDLPSELFETTIPDSPTQMQPDSWATLLGQWADRALRS---- 416 EN R LT + + + + +EL + S + + Q + +R Sbjct: 359 ENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFAS 418 Query: 417 -----GHQNLLSEAQPEMERTLLTTALRHTQGHKQEAARLLGWGRNTLTRKLKELGME 469 L EME L+ AL T+G++ +AA LLG RNTL +K++ELG+ Sbjct: 419 FGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>SECA#SecA protein signature. Length = 901 Score = 27.9 bits (62), Expect = 0.022 Identities = 12/57 (21%), Positives = 23/57 (40%) Query: 15 KSREELNQEARDRKRQKKHRGHAAGSRANGGDAASAGKKQRQAQDPRVGSKKPIPLG 71 + EE+ + + R+ + + D+A+A Q + +VG P P G Sbjct: 832 RMPEEVEELEQQRRMEAERLAQMQQLSHQDDDSAAAAALAAQTGERKVGRNDPCPCG 888
>PRPHPHLPASEC#Prokaryotic zinc-dependent phospholipase C signature. Length = 398 Score = 29.6 bits (66), Expect = 0.010 Identities = 11/31 (35%), Positives = 13/31 (41%) Query: 69 NPWLKWDVQGLEGLNKKNWYLLISNHHSWAD 99 N W K +G K +Y S HSW D Sbjct: 214 NAWSKEYARGFAKTGKSIYYSHASMSHSWDD 244
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 64.5 bits (157), Expect = 2e-14 Identities = 24/116 (20%), Positives = 46/116 (39%), Gaps = 5/116 (4%) Query: 2 TTIALIDDHLIVRSGFAQLLGLEADFQVVAEFGSGREALTGLPGRGVQVCICDISMPDIS 61 TI + DD +R+ Q L + V + + + + D+ MPD + Sbjct: 4 ATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDEN 61 Query: 62 GLELLSQLPK---GMATIMLSVHDSPALIEQALNAGARGFLSKRCSPDELIAAVRT 114 +LL ++ K + +++S ++ +A GA +L K ELI + Sbjct: 62 AFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 36.8 bits (85), Expect = 1e-04 Identities = 65/407 (15%), Positives = 130/407 (31%), Gaps = 58/407 (14%) Query: 29 RHILLTIWLGYALFY--FTRKSFNAAVPEILASNVLTRSDIGLLATLFYITYGLSKFFSG 86 RH + IWL F+ N ++P+I + + T F +T+ + G Sbjct: 11 RHNQILIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYG 70 Query: 87 IVSDRSDARYFMGLGLIATGVVNILFGFSSSLWAFALLWALNAFFQGWGS---PVCARLL 143 +SD+ + + G+I +++ S F L + F QG G+ P ++ Sbjct: 71 KLSDQLGIKRLLLFGIIINCFGSVIGFVGHS---FFSLLIMARFIQGAGAAAFPALVMVV 127 Query: 144 TAWY-SRTERGGWWALWNTAHNVGGALIPMVVGAAALHYGWRAGMTIAGCLAILAGLYLC 202 A Y + RG + L + +G + P + G A + W + I I Sbjct: 128 VARYIPKENRGKAFGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLLLIPMITIITV----- 182 Query: 203 WRLRDRPQAVGLPAVGDWRHDALEIAQQQEGAGMSRKAILTRYVLANPYIWLLSLCYVLV 262 P + L +I G + I+ + Y + VL Sbjct: 183 ------PFLMKLLKKEVRIKGHFDIK----GIILMSVGIVFFMLFTTSYSISFLIVSVLS 232 Query: 263 YVV-----RAAINDWGNLYMSETLGVDLVTANSAVTMFELGGFI-----------GALVA 306 +++ R + + + + + + + + + GF+ A Sbjct: 233 FLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTA 292 Query: 307 GWGSDKLFNGNRGPMNLIFAAGILLSVGGLWLMPFASYVMQAACFFTTGFFVFGPQMLI- 365 GS +F G + + GIL+ G + + F T F + + Sbjct: 293 EIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMT 352 Query: 366 --------GMAAAECS---------HKEAAGAATGFVGLFAYLGASL 395 G++ + ++ AGA + ++L Sbjct: 353 IIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGT 399
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.9 bits (80), Expect = 7e-04 Identities = 27/168 (16%), Positives = 62/168 (36%), Gaps = 17/168 (10%) Query: 49 FNIAQNDMISTYGLSMTQLGMIGLGFSITYGVGKTLVSYYADGKNTKQFLPFMLILSAIC 108 N++ D+ + + + F +T+ +G + +D K+ L F +I++ Sbjct: 33 LNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIIN--- 89 Query: 109 MLGFSASMGAGSTSLFLMIAFYALSGFFQSTGGSCSYSTI----TKWTPRRKRGTFLGFW 164 F + +G S F ++ + F Q G + + + ++ P+ RG G Sbjct: 90 --CFGSVIGFVGHSFFSLLIM---ARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLI 144 Query: 165 NISHNLGGAGAAGVALFGANYLFDGHVIGMFIFPSIIALIVGFIGLRY 212 +G + A+Y+ + + + P + I+ L Sbjct: 145 GSIVAMGEGVGPAIGGMIAHYIHWSY---LLLIP--MITIITVPFLMK 187
>PF05043#Transcriptional activator Length = 493 Score = 29.5 bits (66), Expect = 0.020 Identities = 16/66 (24%), Positives = 31/66 (46%), Gaps = 7/66 (10%) Query: 5 IRNL----DLNLLKALDALLDERS---VTRAAARLALTQPAVSGMLTRLRDAFNDPLFIR 57 +R+L L+ L+ L + + + A L T+ AV L+ ++ AF D +F Sbjct: 1 MRDLLSKKSHRQLELLELLFEHKRWFHRSELAELLNCTERAVKDDLSHVKSAFPDLIFHS 60 Query: 58 APHGMV 63 + +G+ Sbjct: 61 STNGIR 66
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.5 bits (74), Expect = 0.003 Identities = 27/151 (17%), Positives = 53/151 (35%), Gaps = 3/151 (1%) Query: 65 AFVAMFSSLFITTVIGKTDRRYVVILFSLLLTLSCLLVSFADSFTLLLLGRACLGLALGG 124 A + + + + + RR V+++ + +++ A +L +GR G+ G Sbjct: 53 ALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GA 111 Query: 125 FWAMSASLTMRLVPMRVVPKALSIIFGAVSIALVIAAPLGSFLGGLIGWRNVFNGAAVMG 184 A++ + + + + +V LG +GG F AA + Sbjct: 112 TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALN 170 Query: 185 VLCTLWVLKALP-SLPGESASQQQNMFGLLK 214 L L LP S GE ++ L Sbjct: 171 GLNFLTGCFLLPESHKGERRPLRREALNPLA 201
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 118 bits (296), Expect = 6e-34 Identities = 43/124 (34%), Positives = 65/124 (52%), Gaps = 11/124 (8%) Query: 108 LNMPNNVTFDSNSANLKPAGANTLTGVAMVLKEYEKT--AVNVVGYTDSTGSKDLNMRLS 165 + ++V F+ N A LKP G L + L + +V V+GYTD GS N LS Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIGSDAYNQGLS 274 Query: 166 QQRADSVASALITQGVAANRIRTTGMGPANPIASNSTAEGK---------AQNRRVEITL 216 ++RA SV LI++G+ A++I GMG +NP+ N+ K A +RRVEI + Sbjct: 275 ERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDRRVEIEV 334 Query: 217 SPLQ 220 ++ Sbjct: 335 KGIK 338
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.8 bits (77), Expect = 1e-04 Identities = 18/52 (34%), Positives = 24/52 (46%), Gaps = 5/52 (9%) Query: 76 VAPGATRQGIGRALLDEVKQ-----HYAWLSLEVYQKNESAVSFYHAQGFRI 122 VA ++G+G ALL + + H+ L LE N SA FY F I Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFII 148
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 41.3 bits (97), Expect = 4e-06 Identities = 48/276 (17%), Positives = 92/276 (33%), Gaps = 33/276 (11%) Query: 44 PVSQVAFSFGLLSLGLALS----SSVAGKLQERFGVKRVTMASGILLGLGFFLTAHSSSL 99 + V +G+L AL + V G L +RFG + V + S + + + A + L Sbjct: 37 HSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFL 96 Query: 100 MMLWLS---AGVLVGLADGAGYLL----TLSNCVKWFPERKGLISAFSIGSYGLGSLGFK 152 +L++ AG+ AG + + F + LG Sbjct: 97 WVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGG---- 152 Query: 153 FIDSHLLATVGLEKTFVIWGAIVLVMIVFGATLMKDAPNHPAATAANGVVENDFTLAESM 212 L+ F A+ + + G L+ + + N Sbjct: 153 -----LMGGFSPHAPFFAAAALNGLNFLTGCFLLPE-SHKGERRPLRREALNPLASFRWA 206 Query: 213 R--KPQYWMLAVMFLTACMSG----LYVIGVAKDIAQGMVHLDVATAANAVTVISIAN-L 265 R ++AV F+ + L+VI + H D T ++ I + L Sbjct: 207 RGMTVVAALMAVFFIMQLVGQVPAALWVI-----FGEDRFHWDATTIGISLAAFGILHSL 261 Query: 266 SGRLVLGILSDKISRIRVITIGQVVSLVGMAALLFA 301 + ++ G ++ ++ R + +G + G L FA Sbjct: 262 AQAMITGPVAARLGERRALMLGMIADGTGYILLAFA 297 Score = 34.0 bits (78), Expect = 9e-04 Identities = 31/119 (26%), Positives = 51/119 (42%), Gaps = 2/119 (1%) Query: 270 VLGILSDKISRIRVITIGQVVSLVGMAALLFAPLNALTFFAAIACVAFNFGGTITVFPSL 329 VLG LSD+ R V+ + + V A + AP + + I VA G T V + Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRI--VAGITGATGAVAGAY 119 Query: 330 VSEFFGLNNLAKNYGVIYLGFGIGSICGSLIASLFGGFYVTFCVIFALLILSLALSTTI 388 +++ + A+++G + FG G + G ++ L GGF A + L T Sbjct: 120 IADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGC 178
>PF06580#Sensor histidine kinase Length = 349 Score = 29.1 bits (65), Expect = 0.049 Identities = 23/137 (16%), Positives = 43/137 (31%), Gaps = 23/137 (16%) Query: 5 RTMTQQKLSFWLALYIGWFMNVAVFFRRFDGYAQEFTFWKGLSGVVELVATVFVTFFLLR 64 T Q +W IGW + F G+A + K S + + ++ Sbjct: 3 STHRQANKYYWYCQGIGWGVYTLTGF----GFASLYGSPKLHSMIFNIAISLMGLVLTHA 58 Query: 65 LLSLFGRRIWRILATLIVLFSAAASYYMTFLNVVIGYGIIASVMTTDIDLSKEVIGWHLI 124 S R+ W L ++ + + G++ V T I W L+ Sbjct: 59 YRSFIKRQGWLKLNMGQIILRVLPA--------CVVIGMVWFVANTSI--------WRLL 102 Query: 125 LWLVAVSAPPLLFIWSN 141 ++ + P+ F Sbjct: 103 AFI---NTKPVAFTLPL 116
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 37.4 bits (86), Expect = 8e-05 Identities = 44/175 (25%), Positives = 72/175 (41%), Gaps = 17/175 (9%) Query: 134 GHLLSQPFNSSTPVLYYNKDAFKKAGLDPEQPPKTWQDLAAYTAKLKAAGMKCGYASGWQ 193 G L++ P L YNKD L P PPKTW+++ A +LKA G + + Sbjct: 127 GKLIAYPIAVEALSLIYNKD------LLP-NPPKTWEEIPALDKELKAKGKSALMFNLQE 179 Query: 194 GWIQIENFSAWHGLPVATKNNGFDGTDAVLEF--NKPEQVKHIALLEEMNKKGDFSYFGR 251 + +A G +N +D D ++ K + L++ + D Y Sbjct: 180 PYFTWPLIAADGGYAFKYENGKYDIKDVGVDNAGAKAGLTFLVDLIKNKHMNADTDY--- 236 Query: 252 KDESTEKFYNGDCAITTASSGSLADIRQYAKFNYGVGMMPYDADVKGAPQNAIIG 306 + F G+ A+T + ++I +K NYGV ++P KG P +G Sbjct: 237 -SIAEAAFNKGETAMTINGPWAWSNIDT-SKVNYGVTVLP---TFKGQPSKPFVG 286
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.9 bits (64), Expect = 0.037 Identities = 10/29 (34%), Positives = 16/29 (55%) Query: 33 IVMVGPSGCGKSTLLRMVAGLERVTSGDI 61 +V+ G G GKSTL+ + GL+ + Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHF 627
>PF04619#Dr-family adhesin Length = 160 Score = 28.4 bits (63), Expect = 0.018 Identities = 14/60 (23%), Positives = 23/60 (38%), Gaps = 4/60 (6%) Query: 29 VGARYGHTMIEFDAKLSKDGQIFLLHDDNLERTSNGWGVAGELAW----DDLLKVDAGSW 84 +G ++ D + G+ FL+ D+N ++ AW K D GSW Sbjct: 70 LGCDARQVALKADTDNFEQGKFFLISDNNRDKLYVNIRPTDNSAWTTDNGVFYKNDVGSW 129
>NAFLGMOTY#Sodium-type flagellar protein MotY precursor signature. Length = 293 Score = 33.2 bits (75), Expect = 0.003 Identities = 27/80 (33%), Positives = 36/80 (45%), Gaps = 13/80 (16%) Query: 276 RTPVSGEYRGYEVYSMPPPSSGGIHIVQILNILENFDMQKYGF-GSADAMQVMAEAEKHA 334 R P+ GE R + SMPPP G H +I N+ F Q G+ G A +++E EK Sbjct: 77 RRPM-GETRNVSLISMPPPWRPGEHADRITNL--KFFKQFDGYVGGQTAWGILSELEKGR 133 Query: 335 YADRSEYLGDPDFVNVPWQA 354 Y P F WQ+ Sbjct: 134 Y---------PTFSYQDWQS 144
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 37.6 bits (87), Expect = 6e-06 Identities = 20/90 (22%), Positives = 34/90 (37%), Gaps = 12/90 (13%) Query: 55 VACIDEQVVGHLSIAVVQRPRRSHVADFGVSVDSRWHNRGVASALMRTMIDMCDNWLRVE 114 + ++ +G + I + + D V+ D R GV +AL+ I+ W + E Sbjct: 69 LYYLENNCIGRIKIRSNWN-GYALIEDIAVAKDYRKK--GVGTALLHKAIE----WAK-E 120 Query: 115 R----IELTVFADNAPAIAVYKKYGFEIEG 140 + L N A Y K+ F I Sbjct: 121 NHFCGLMLETQDINISACHFYAKHHFIIGA 150
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 33.6 bits (76), Expect = 3e-04 Identities = 28/118 (23%), Positives = 44/118 (37%), Gaps = 16/118 (13%) Query: 37 LLVSRTARLQRDFLATLHTTADAQLLASLKQREQAMREAWQQHQRQRQQYQRRSAIAAWQ 96 L + LQ A + A+ K REQA EA +++ + Q R A Sbjct: 192 LFTEAISSLQIRMNTLTAAKASIEAAAANKAREQAAAEA-----KRKAEEQARQQAAIRA 246 Query: 97 PRLQALAAD----LPAQAWLTRLEYQGVLLTLDGLALNLQALTSVESALTRVAGFAPA 150 A+ A+ A +G++ G A QA++ + L RV AP+ Sbjct: 247 ANTYAMPANGSVVATAAG-------RGLIQVAQGAASLAQAISDAIAVLGRVLASAPS 297
>TYPE3OMGPROT#Type III secretion system outer membrane G protein family signature. Length = 607 Score = 226 bits (578), Expect = 3e-70 Identities = 77/282 (27%), Positives = 122/282 (43%), Gaps = 17/282 (6%) Query: 138 GGKLLSARGHLMADKRTNRLLIRDDARHLPALKAWAQEMDLPVGQVELAAHIVSMSETSL 197 SA+ + AD N +++RD +P + +D P ++E+A IV ++ L Sbjct: 237 AATRASAQARVEADPSLNAIIVRDSPERMPMYQRLIHALDKPSARIEVALSIVDINADQL 296 Query: 198 RELGVKWRLAEAGSPPGSGQITTLSSDVSVNDASTRAGFNIGKINGRLLEL---ELSALE 254 ELGV WR+ I T ++ G ++ R L+ ++ LE Sbjct: 297 TELGVDWRVGIRTGNNHQVVIKTTGDQSNIAS----NGALGSLVDARGLDYLLARVNLLE 352 Query: 255 RKQQVEIIASPRLLASHMQPASIKQGSEIPYQVSSGESGATSVEFKEAVLG--MEVTPTV 312 + ++++ P LL A I SE Y +G+ A E K G + +TP V Sbjct: 353 NEGSAQVVSRPTLLTQENAQAVIDH-SETYYVKVTGKEVA---ELKGITYGTMLRMTPRV 408 Query: 313 LQQG---RVRLKLRISENTPGQVLKQENGEALAIDKQEIETLVEVRSGETLALGGIFSQK 369 L QG + L L I + I + ++T+ V G++L +GGI+ + Sbjct: 409 LTQGDKSEISLNLHIEDGNQKPNS-SGIEGIPTISRTVVDTVARVGHGQSLIIGGIYRDE 467 Query: 370 NKTARDSVPLLGDIPVLGRLFRRDGKDNERRELVVFITPRIL 411 A VPLLGDIP +G LFRR + R + I PRI+ Sbjct: 468 LSVALSKVPLLGDIPYIGALFRRKSELTRRTVRLFIIEPRII 509
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 30.6 bits (69), Expect = 0.002 Identities = 27/91 (29%), Positives = 40/91 (43%), Gaps = 18/91 (19%) Query: 32 FYDSDQEIEKRTGADVGWVFDVEGEEGFRD----------REEKIINELTEKQGIVLATG 81 FYD + KR + GW+ + G+R E + I +L E+ IV+A+G Sbjct: 136 FYDEETA--KRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKKLVERGVIVIASG 193 Query: 82 GGSVKSRETRNRLSARGVVVYLETTIEKQLA 112 GG V + +GV E I+K LA Sbjct: 194 GGGVPVILEDGEI--KGV----EAVIDKDLA 218
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 33.9 bits (77), Expect = 0.001 Identities = 32/208 (15%), Positives = 58/208 (27%), Gaps = 12/208 (5%) Query: 125 SSSQQTASGEKSINLSDDQSASMPAAGQDQTAAANSTSQQDVTVPPIAANPTQGQAAVAP 184 S + A + +T A NS + N A Sbjct: 1009 SVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTV----EKNEQDATETTAQ 1064 Query: 185 QGQQRIEVQGDLNNALTQQ---QGQLDGAVANSTLPTEPATVAPIRNGANGTAAPRQATE 241 + E + ++ Q + +T E ATV T ++ + Sbjct: 1065 NREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPK 1124 Query: 242 RQTAATPRPAERKHTVIEAKPQSKPQSVVKTPVESKPVQPKHVESTATTAPAKTPVSESK 301 + +P+ + + +A+P + V K Q + + T PAK S + Sbjct: 1125 VTSQVSPKQEQSETVQPQAEPARENDPTVN----IKEPQSQTNTTADTEQPAKETSSNVE 1180 Query: 302 PVATAQSKPTTTTAAPAATAAAAAPAAK 329 T +S T + PA Sbjct: 1181 QPVT-ESTTVNTGNSVVENPENTTPATT 1207
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 381 bits (979), Expect = e-128 Identities = 119/426 (27%), Positives = 185/426 (43%), Gaps = 71/426 (16%) Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVNHKIGEVHDGAATMDWMEQEQERGITITSAAT 68 + NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 69 TAFWSGMAKQYEPHRVNIIDTPGHVDFTIEVERSMRVLDGAVMVYCAVGGVQPQSETVWR 128 + W +VNIIDTPGH+DF EV RS+ VLDGA+++ A GVQ Q+ ++ Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114 Query: 129 QANKYKVPRIAFVNKMDRMGANFLKVVGQIKTRLGANPVPLQLAIGAEEGFTGVVDLVKM 188 K +P I F+NK+D+ G + V IK +L A V Q V M Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164 Query: 189 KAINWNDADQGVTFEYEDIPADMQDLADEWHQNLIESAAEASEELMEKYLGGEELTEEEI 248 N+ +++Q ++ E +++L+EKY+ G+ L E+ Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYMSGKSLEALEL 200 Query: 249 KKALRQRVLNNEIILVTCGSAFKNKGVQAMLDAVIDYLPSPVDVPAINGILDDGKDTPAE 308 ++ R N + V GSA N G+ +++ + + S Sbjct: 201 EQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH----------------- 243 Query: 309 RHASDDEPFSALAFKIATDPFVGNLTFFRVYSGVVNSGDTVLNSVKAARERFGRIVQMHA 368 FKI L + R+YSGV++ D+V S K + + + Sbjct: 244 ---RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEK-EKIKITEMYTSIN 299 Query: 369 NKREEIKEVRAGDIAAAIG----LKDVTTGDTLCDPDAPIILERMEFPEPVISIAVEPKT 424 + +I + +G+I L V GDT P ER+E P P++ VEP Sbjct: 300 GELCKIDKAYSGEIVILQNEFLKLNSV-LGDTKLLPQR----ERIENPLPLLQTTVEPSK 354 Query: 425 KADQEK 430 +E Sbjct: 355 PQQREM 360
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 212 bits (541), Expect = 5e-67 Identities = 53/242 (21%), Positives = 103/242 (42%), Gaps = 10/242 (4%) Query: 1 MGELHLDIIVDRMKREFNVEANVGKPQVAYREAIRAKVTDIEGKHAKQSGGRGQYGHVVI 60 +G++ +++ ++ +++VE + +P V Y E K E + + + + Sbjct: 391 LGKVQMEVTCALLQEKYHVEIEIKEPTVIYMERPLKKA---EYTIHIEVPPNPFWASIGL 447 Query: 61 DMYPLEPGSNPKGYEFINDIKGGVIPGEYIPAVDKGIQEQLKAGPLAGYPVVDMGIRLHF 120 + PL GS G ++ + + G + + AV +GI+ + G L G+ V D I + Sbjct: 448 SVSPLPLGS---GMQYESSVSLGYLNQSFQNAVMEGIRYGCEQG-LYGWNVTDCKICFKY 503 Query: 121 GSYHDVDSSELAFKLAASIAFKEGFKKAKPVLLEPIMKVEVETPEENTGDVIGDLSRRRG 180 G Y+ S+ F++ A I ++ KKA LLEP + ++ P+E D + Sbjct: 504 GLYYSPVSTPADFRMLAPIVLEQVLKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCA 563 Query: 181 MLRGQESEVTGVKIHAEVPLSEMFGYATQLRSLTKGRASYTMEFLKYDDAPNNVAQAVIE 240 + + + V + E+P + Y + L T GR+ E Y + V + Sbjct: 564 NIVDTQLKNNEVILSGEIPARCIQEYRSDLTFFTNGRSVCLTELKGYHVT---TGEPVCQ 620 Query: 241 AR 242 R Sbjct: 621 PR 622
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 79.5 bits (196), Expect = 4e-18 Identities = 53/154 (34%), Positives = 78/154 (50%), Gaps = 13/154 (8%) Query: 13 VNVGTIGHVDHGKTTLTAAI------TTVLAKTYGGSARAFDQIDNAPEEKARGITINTS 66 +N+G + HVD GKTTLT ++ T L G+ R DN E+ RGITI T Sbjct: 4 INIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRT----DNTLLERQRGITIQTG 59 Query: 67 HVEYDTPTRHYAHVDCPGHADYVKNMITGAAQMDGAILVVAATDGPMPQTREHILLGRQV 126 + +D PGH D++ + + +DGAIL+++A DG QTR R++ Sbjct: 60 ITSFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKM 119 Query: 127 GVPYIIVFLNKCDMVDDEELLELVEMEVRELLSQ 160 G+P I F+NK D + L V +++E LS Sbjct: 120 GIP-TIFFINKIDQNGID--LSTVYQDIKEKLSA 150
>HELNAPAPROT#Helicobacter neutrophil-activating protein A family signature. Length = 153 Score = 36.8 bits (85), Expect = 9e-06 Identities = 18/103 (17%), Positives = 42/103 (40%), Gaps = 10/103 (9%) Query: 44 EYHESIDEMKHADKYIERILFLEGIPN--LQDLGKL------GIGEDVEEMLRSDLRLEL 95 E ++ E D ER+L + G P +++ + G EM+++ + Sbjct: 52 ELYDHAAE--TVDTIAERLLAIGGQPVATVKEYTEHASITDGGNETSASEMVQALVNDYK 109 Query: 96 EGAQNLREAIAYADSVHDYVSRDMMIEILADEEGHIDWLETEL 138 + + + I A+ D + D+ + ++ + E + L + L Sbjct: 110 QISSESKFVIGLAEENQDNATADLFVGLIEEVEKQVWMLSSYL 152
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 127 bits (320), Expect = 1e-38 Identities = 70/150 (46%), Positives = 88/150 (58%), Gaps = 7/150 (4%) Query: 14 LAALPFLLCYSGLTVALCHQDLRHGLLPDRYTCPLLWSGLLFYLCLAPHQLHDAVWGAIA 73 L LL L VAL DL LLPD+ T PLLW GLLF L L DAV GA+A Sbjct: 132 WGTLAALLLTWVL-VALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMA 190 Query: 74 GYLSLAAIYWLYRGIRGYEGLGYGDIKYLAALGAWHGWRLLPQLVLVASLLAGIAWAGAG 133 GYL L ++YW ++ + G EG+GYGD K LAALGAW GW+ LP ++L++SL+ G Sbjct: 191 GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLI 250 Query: 134 LYASCGGRSKWRRSNPLPFGPFLAAAGFWC 163 L +S P+PFGP+LA AG+ Sbjct: 251 L------LRNHHQSKPIPFGPYLAIAGWIA 274
>PF06291#Lambda prophage Bor protein Length = 102 Score = 26.5 bits (58), Expect = 0.005 Identities = 20/65 (30%), Positives = 27/65 (41%), Gaps = 3/65 (4%) Query: 1 MKKYLIVALLASLLAGCAHDSPCV---PVYDSQGRLVHTNTCMKGTTEDNWETAGAIAGG 57 MKK L A LA L+ GCA + V P + + + + G + A I GG Sbjct: 6 MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQKKTVDAAKICGG 65 Query: 58 AAAVA 62 A V Sbjct: 66 AENVV 70
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 292 bits (749), Expect = 5e-95 Identities = 178/209 (85%), Positives = 192/209 (91%) Query: 1 MEILGEASPGKSTGEAMALMETLASKLPSGIGYDWTGMSYQERLSGNQAPALYAISLIVV 60 MEI GEA+PG S+G+AMALME LASKLP+GIGYDWTGMSYQERLSGNQAPAL AIS +VV Sbjct: 824 MEIQGEAAPGTSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVV 883 Query: 61 FLCLAALYESWSIPFSVMLVVPLGVIGALLAATLRGLNNDVYFQVGLLTTIGLSAKNAIL 120 FLCLAALYESWSIP SVMLVVPLG++G LLAATL NDVYF VGLLTTIGLSAKNAIL Sbjct: 884 FLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAIL 943 Query: 121 IVEFAKDLMEKEGKGIIEATLEASRMRLRPILMTSLAFILGVMPLVISHGAGSGAQNAVG 180 IVEFAKDLMEKEGKG++EATL A RMRLRPILMTSLAFILGV+PL IS+GAGSGAQNAVG Sbjct: 944 IVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVG 1003 Query: 181 TGVMGGMLTATLLAIFFVPVFFVVVRRRF 209 GVMGGM++ATLLAIFFVPVFFVV+RR F Sbjct: 1004 IGVMGGMVSATLLAIFFVPVFFVVIRRCF 1032 Score = 56.8 bits (137), Expect = 1e-11 Identities = 31/197 (15%), Positives = 77/197 (39%), Gaps = 1/197 (0%) Query: 17 MALMETLASKLPSGIGYDWT-GMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPF 75 A + L P G+ + + +LS ++ ++++VFL + ++ Sbjct: 307 KAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNMRATL 366 Query: 76 SVMLVVPLGVIGALLAATLRGLNNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMEKEGKG 135 + VP+ ++G G + + G++ IGL +AI++VE + +M ++ Sbjct: 367 IPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMMEDKLP 426 Query: 136 IIEATLEASRMRLRPILMTSLAFILGVMPLVISHGAGSGAQNAVGTGVMGGMLTATLLAI 195 EAT ++ ++ ++ +P+ G+ ++ M + L+A+ Sbjct: 427 PKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSVLVAL 486 Query: 196 FFVPVFFVVVRRRFTRH 212 P + + + Sbjct: 487 ILTPALCATLLKPVSAE 503
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 303 bits (777), Expect = 1e-97 Identities = 186/232 (80%), Positives = 202/232 (87%) Query: 2 IYLLIVVGMAVLFMRLPTSFLPDEDQGVFLTMIQLPSGATQERTQKVLDTVTDYYLHNEK 61 IY LIV GM VLF+RLP+SFLP+EDQGVFLTMIQLP+GATQERTQKVLD VTDYYL NEK Sbjct: 543 IYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEK 602 Query: 62 ANVESVFTVNGFSFSGQGQNSGMAFVSLKPWEARSGDKNSVESIIKRATIAFSQIKDAMV 121 ANVESVFTVNGFSFSGQ QN+GMAFVSLKPWE R+GD+NS E++I RA + +I+D V Sbjct: 603 ANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGFV 662 Query: 122 FPFNMPAIIELGTATGFDFELIDQGGLGHTALTQARNQLLGMVKQHPDQLVRVRPNGLED 181 PFNMPAI+ELGTATGFDFELIDQ GLGH ALTQARNQLLGM QHP LV VRPNGLED Sbjct: 663 IPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLED 722 Query: 182 TPQFKLDVDQEKAQALGVSLSDINETISAALGGYYVNDFIDRGRVKKCTFRL 233 T QFKL+VDQEKAQALGVSLSDIN+TIS ALGG YVNDFIDRGRVKK + Sbjct: 723 TAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQA 774 Score = 33.3 bits (76), Expect = 0.001 Identities = 24/212 (11%), Positives = 76/212 (35%), Gaps = 16/212 (7%) Query: 1 MIYLLIVVGMAVLFMRLPTSFLPDEDQGVFLTMIQLPSGATQERTQKVLDTVTDYYLHNE 60 ++ +++++ A+ ++LP + P + + + + Q V DTVT + Sbjct: 14 VLAIILMMAGALAILQLPVAQYPT----IAPPAVSVSANYPGADAQTVQDTVT-QVIEQN 68 Query: 61 KANVESVFTVNGFSFSGQGQNSGMAFVSLKPWEARSGDKNSVESIIKRATIAFSQ-IKDA 119 ++++ ++ S S S ++ + + V++ ++ AT Q ++ Sbjct: 69 MNGIDNLMYMSSTSDSAG---SVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQQ 125 Query: 120 MVFPFNMPAIIELGTATGFDFELIDQGGLGHTALTQARNQLLGMVKQHPDQLVRVRPNGL 179 + + + D Q + + ++ L + + + V+ G Sbjct: 126 GISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRL-----NGVGDVQLFGA 180 Query: 180 EDTPQFKLDVDQEKAQALGVSLSDINETISAA 211 + ++ +D + ++ D+ + Sbjct: 181 QY--AMRIWLDADLLNKYKLTPVDVINQLKVQ 210
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 682 bits (1762), Expect = 0.0 Identities = 457/508 (89%), Positives = 490/508 (96%) Query: 1 MSKFFIHRPVFAWVLAIIMMIAGGLAILQLPIAQYPTIAPPAVAISATYPGADAQTVQDT 60 M+ FFI RP+FAWVLAII+M+AG LAILQLP+AQYPTIAPPAV++SA YPGADAQTVQDT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFKSGTDPDIAQVQVQNKLQLATPLLPQ 120 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTF+SGTDPDIAQVQVQNKLQLATPLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGISVEKSSSSFLLVAGFISDNPTTTQDDISDYVASNVKDPISRLNGVGDVQLFGA 180 EVQQQGISVEKSSSS+L+VAGF+SDNP TTQDDISDYVASNVKD +SRLNGVGDVQLFGA Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRVWLDGNLLNKYNLTPVDVINALQVQNDQIAAGQLGGTPALKGQQLNASIIAQTRL 240 QYAMR+WLD +LLNKY LTPVDVIN L+VQNDQIAAGQLGGTPAL GQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 KDPQEFGKVTLRVNADGSVVHLKDVARIELGGENYNVVARINGKPASGLGIKLATGANAL 300 K+P+EFGKVTLRVN+DGSVV LKDVAR+ELGGENYNV+ARINGKPA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTATAIKAKLAELQPYFPQGMKVVYPYDTTPFVKISIHEVIKTLFEAIILVFLVMYLFLQ 360 DTA AIKAKLAELQP+FPQGMKV+YPYDTTPFV++SIHEV+KTLFEAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NMRATLIPTIAVPVVLLGTFAVLSMFGYSINTLTMFGMVLAIGLLVDDAIVVVENVECVM 420 NMRATLIPTIAVPVVLLGTFA+L+ FGYSINTLTMFGMVLAIGLLVDDAIVVVENVE VM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 VEEKLSPKEATEKSMSQIQGALVGIAMVLSAVFVPMAFFGGSTGAIYRQFSITIVSAMAL 480 +E+KL PKEATEKSMSQIQGALVGIAMVLSAVF+PMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALVLTPALCATLLKPASAEHHEKK 508 SVLVAL+LTPALCATLLKP SAEHHE K Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENK 508 Score = 92.6 bits (230), Expect = 9e-22 Identities = 79/512 (15%), Positives = 182/512 (35%), Gaps = 44/512 (8%) Query: 5 FIHRPVFAWVLAIIMMIAGGLAILQLPIAQYPTIAPPAVAISATYP-GADAQTVQDTVTQ 63 + ++ +++ + L+LP + P P GA + Q + Q Sbjct: 533 ILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQ 592 Query: 64 VIEQNMNGIDNLMY---------MSSTSDSAGSVTITL-TFKSGTDPDIAQVQVQNKLQL 113 V + + + S + +AG ++L ++ + + V ++ ++ Sbjct: 593 VTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKM 652 Query: 114 ATPLLP--------QEVQQQGISVEKSSSSFLLVAGFISDNPTTTQDDISDYVASNVKDP 165 + + + + AG D T ++ + A + Sbjct: 653 ELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASL 712 Query: 166 IS-RLNGVGDVQLFGAQYAMRVWLDGNLLNKYNLTPVDVINALQVQNDQIAAGQLGGTPA 224 +S R NG+ D ++ +D ++ D+ + Sbjct: 713 VSVRPNGLED------TAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDF----I 762 Query: 225 LKGQQLNASIIA-QTRLKDPQEFGKVTLRVNADGSVVHLKDVARIELGGENYNVVARING 283 +G+ + A P++ K+ +R +A+G +V + R NG Sbjct: 763 DRGRVKKLYVQADAKFRMLPEDVDKLYVR-SANGEMVPFSAFTTSHWV-YGSPRLERYNG 820 Query: 284 KPASGLGIKLATGANALDTATAIKAKLAELQPYFPQGMKVVYPYDTTPFVKISIHEVIKT 343 P+ + + A G ++ D ++ ++L P G + Y + + + Sbjct: 821 LPSMEIQGEAAPGTSSGDAMALMENLASKL----PAG--IGYDWTGMSYQERLSGNQAPA 874 Query: 344 LFE-AIILVFLVMYLFLQNMRATLIPTIAVPVVLLGTFAVLSMFGYSINTLTMFGMVLAI 402 L + ++VFL + ++ + + VP+ ++G ++F + M G++ I Sbjct: 875 LVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTI 934 Query: 403 GLLVDDAIVVVENVECVMVEEKLSPKEAT-EKSMSQIQGALV-GIAMVLSAVFVPMAFFG 460 GL +AI++VE + +M +E EAT +++ L+ +A +L +P+A Sbjct: 935 GLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILG--VLPLAISN 992 Query: 461 GSTGAIYRQFSITIVSAMALSVLVALVLTPAL 492 G+ I ++ M + L+A+ P Sbjct: 993 GAGSGAQNAVGIGVMGGMVSATLLAIFFVPVF 1024
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 38.7 bits (90), Expect = 3e-05 Identities = 35/211 (16%), Positives = 67/211 (31%), Gaps = 30/211 (14%) Query: 97 ATYQAAWNSAKGDEAKAEAAAAIAHLTVKRYVPLLGTKYISQQEYDQAVATA-RQADADV 155 K + E+ A + Q + + RQ ++ Sbjct: 262 VEAVNELRVYKSQLEQIESEILSAKEEYQLV----------TQLFKNEILDKLRQTTDNI 311 Query: 156 IATKAAVETARINLAYTKVTSPISGRIGKSSV-TEGALVTNGQSDAMATVQQLDPIYVDV 214 + + + +P+S ++ + V TEG +VT ++ M V + D + V Sbjct: 312 GLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET-LMVIVPEDDTLEVTA 370 Query: 215 TESSNDFMRLKQESLQRGGDTKSVELVMENGQAYP-LKGSLQ--FSDVTVDESTG----- 266 + D + G +++ Y L G ++ D D+ G Sbjct: 371 LVQNKDIGFINV------GQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNV 424 Query: 267 --SITLRAIFPNPQHV-LLPGMFVRARIDEG 294 SI + +++ L GM V A I G Sbjct: 425 IISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455 Score = 35.6 bits (82), Expect = 3e-04 Identities = 23/127 (18%), Positives = 41/127 (32%), Gaps = 15/127 (11%) Query: 46 APLSVTTELPGR-TSAFRVAEVRPQVSGIILKRNFV-EGSDVEAGQSLYQIDPATYQAAW 103 + + G+ T + R E++P + I+ K V EG V G L ++ +A Sbjct: 78 GQVEIVATANGKLTHSGRSKEIKPIENSIV-KEIIVKEGESVRKGDVLLKLTALGAEA-- 134 Query: 104 NSAKGDEAKAEAAAAIAHLTVKRYVPLLGTKYISQQEYDQAVATARQADADVIATKAAVE 163 D K +++ A L RY L E ++ + Sbjct: 135 -----DTLKTQSSLLQARLEQTRYQILS-----RSIELNKLPELKLPDEPYFQNVSEEEV 184 Query: 164 TARINLA 170 +L Sbjct: 185 LRLTSLI 191
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 118 bits (297), Expect = 2e-35 Identities = 77/201 (38%), Positives = 124/201 (61%), Gaps = 3/201 (1%) Query: 1 MARKTKEEAQRTRQLLIESAIQQFALRGVTNTTLTDIADAAGVTRGAVYWHFASKTELFN 60 MARKTK+EAQ TRQ +++ A++ F+ +GV++T+L +IA AAGVTRGA+YWHF K++LF+ Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 EMW-QQQPPLRDLIQPSQAIEYEHEPLNALRERFIAGLRYIAANPRQRALMQILYQRCEF 119 E+W + + +L QA ++ +PL+ LRE I L R+R LM+I++ +CEF Sbjct: 61 EIWELSESNIGELELEYQA-KFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEF 119 Query: 120 SSDMLSEYEIRQRIGF-NYSLIGGILQCCVRNNILPAETNIEMILIVLHSAFSGLIKNWL 178 +M + ++ + +Y I L+ C+ +LPA+ I++ SGL++NWL Sbjct: 120 VGEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWL 179 Query: 179 LDPQRFDLYQQAPALVDNIMA 199 PQ FDL ++A V ++ Sbjct: 180 FAPQSFDLKKEARDYVAILLE 200
>DNABINDNGFIS#DNA-binding protein FIS signature. Length = 98 Score = 157 bits (399), Expect = 3e-54 Identities = 98/98 (100%), Positives = 98/98 (100%) Query: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ Sbjct: 1 MFEQRVNSDVLTVSTVNSQDQVTQKPLRDSVKQALKNYFAQLNGQDVNDLYELVLAEVEQ 60 Query: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN Sbjct: 61 PLLDMVMQYTRGNQTRAALMMGINRGTLRKKLKKYGMN 98
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 34.0 bits (78), Expect = 0.002 Identities = 22/82 (26%), Positives = 31/82 (37%), Gaps = 18/82 (21%) Query: 188 VLMVGPPGTGKTLLAKAI---AGEAKVPFFT-----ISGSDFVEMFVGV------GASRV 233 +++ G GTGK L+A+A+ PF I G GA Sbjct: 163 LMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTR 222 Query: 234 RD-MFEQAKKAAPCIIFIDEID 254 FEQA+ +F+DEI Sbjct: 223 STGRFEQAEGGT---LFLDEIG 241
>SECGEXPORT#Protein-export SecG membrane protein signature. Length = 110 Score = 149 bits (377), Expect = 2e-50 Identities = 97/110 (88%), Positives = 103/110 (93%), Gaps = 1/110 (0%) Query: 1 MYEALLVVFLIVAIGLVGLVMLQQGKGADMGASFGAGASGTLFGSSGSGNFMTRMTGILA 60 MYEALLVVFLIVAIGLVGL+MLQQGKGADMGASFGAGAS TLFGSSGSGNFMTRMT +LA Sbjct: 1 MYEALLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLA 60 Query: 61 ALFFIISLALGNINSNKTSKGSEWDNLSAPK-TEQTQPTAPAQPTSDIPH 109 LFFIISL LGNINSNKT+KGSEW+NLSAP TEQTQP APA+PTSDIP+ Sbjct: 61 TLFFIISLVLGNINSNKTNKGSEWENLSAPAKTEQTQPAAPAKPTSDIPN 110
>adhesinb#Adhesin B signature. Length = 310 Score = 26.7 bits (59), Expect = 0.033 Identities = 10/29 (34%), Positives = 14/29 (48%), Gaps = 2/29 (6%) Query: 1 MNKIGLLIVAGV--LGLAGCSSTSPSQTV 27 M K L++ + +GLA CSS S Sbjct: 1 MKKCRFLVLLLLAFVGLAACSSQKSSTET 29
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 29.7 bits (66), Expect = 0.006 Identities = 15/44 (34%), Positives = 22/44 (50%) Query: 26 NLHGEAGAEFTNLSASFGAGEPGMTFSSQWAHSDNDGDSVGLGM 69 N H A +++ A+F GE MT + WA S+ D V G+ Sbjct: 227 NKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNYGV 270
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 38.3 bits (89), Expect = 3e-05 Identities = 38/190 (20%), Positives = 69/190 (36%), Gaps = 17/190 (8%) Query: 115 ILLQVMQQFTGMNVIMYYAPKIFELAGYANTTEQMWGTVIV--GLTNVLATFIAIGLVDR 172 IL V G+ +IM P + ++N +G ++ L + L DR Sbjct: 10 ILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDR 69 Query: 173 WGRKPTLILGFIVMAAGMGVLGTMMHIGIHSSTAQYIAVLMLLMFIVGFAMSAGPLIWVL 232 +GR+P L++ A ++ TA ++ VL + + G + G + Sbjct: 70 FGRRPVLLVSLAGAAVDYAIMA----------TAPFLWVLYIGRIVAGITGATGAVAGAY 119 Query: 233 CSEIQPLKGRD--FGITCSTATNWIANMIVGATFLTMLNSLGSANTFWVYGGLNVLFILL 290 ++I R FG + M+ G ++ F+ LN L L Sbjct: 120 IADITDGDERARHFGFMSACFG---FGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLT 176 Query: 291 TLWLIPETKN 300 +L+PE+ Sbjct: 177 GCFLLPESHK 186
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 48.0 bits (114), Expect = 2e-09 Identities = 31/112 (27%), Positives = 56/112 (50%), Gaps = 7/112 (6%) Query: 16 FFVCFLAALAGLLFGLDIGVIAGALPFIANEFQISAHTQEWVVSSMMFGAAVGAVGSGWL 75 ++C L+ L+ V+ +LP IAN+F + WV ++ M ++G G L Sbjct: 17 IWLCILS----FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKL 72 Query: 76 SFKLGRKKSLMIGAILFVAGSLFSAAAPN-VEILLVSRVLLGLAVGVASYTA 126 S +LG K+ L+ G I+ GS+ + +L+++R + G G A++ A Sbjct: 73 SDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQG--AGAAAFPA 122
>PF04183#IucA / IucC family Length = 580 Score = 27.9 bits (62), Expect = 0.015 Identities = 10/42 (23%), Positives = 18/42 (42%), Gaps = 3/42 (7%) Query: 107 QLDPNTLLTQFRDQVKNTGLDDALQQQFLEEFEAGLYGYTYL 148 + TLL Q + + + DA + +++ A L G L Sbjct: 71 PVLAQTLLMQLKQVL---SMSDATVAEHMQDLYATLLGDLQL 109
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.9 bits (75), Expect = 0.002 Identities = 71/342 (20%), Positives = 125/342 (36%), Gaps = 17/342 (4%) Query: 54 FLLGYGFSALLLTPVIESRWHYRQG----LLSSIAIWALVCAISPLLGSLLGMLIARIVL 109 L Y PV+ R G LL S+A A+ AI L + I RIV Sbjct: 48 LLALYALMQFACAPVL-GALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVA 106 Query: 110 GVAEGPLFSLKTRFINDNFAADEIGKPNALTALGVSLGLAVGFPLVTWLMAHVGWIGSFY 169 G+ G ++ +I D DE + + G+ G P++ LM F+ Sbjct: 107 GIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAG-PVLGGLMGGFSPHAPFF 164 Query: 170 ALALLNLLLGGGLIWRFLPAPQVTSQR--AKPGFRQTFALAWRTPLLGWILLVEIATLSY 227 A A LN L LP +R + + W + L+ + + Sbjct: 165 AAAALNGLN-FLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQ 223 Query: 228 LWGSSAWLPAWLRDEHHFSLQATGL---LAAVPFLLSLGSKFLGGVLLDKMRPEQAPLLF 284 L G + E F AT + LAA L SL + G + ++ +A Sbjct: 224 LVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRA---- 279 Query: 285 IIGGLLTAGSVLALMLSQQPAMLALFMLAANVFWGLQGAAIPAVVQHHAPREAVGSAYGI 344 ++ G++ G+ L+ +A ++ G+ A+ A++ E G G Sbjct: 280 LMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGS 339 Query: 345 INGIGNICAAFIPLLMGVVMKSVGSVSSGFSVLVASQLITLC 386 + + ++ + PLL + + + +G++ + + L LC Sbjct: 340 LAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLC 381
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 46.2 bits (109), Expect = 3e-09 Identities = 28/90 (31%), Positives = 47/90 (52%), Gaps = 1/90 (1%) Query: 5 LNGKRIVVTGAARGLGYHFAEACAAQGATVVMCDILQGELAESAHRLQQKGYQVESHAID 64 + GK +TGAA+G+G A A+QGA + D +L + L+ + E+ D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 65 LASQASIEQVFSAIGAQ-GSIDGLVNNAAM 93 + A+I+++ + I + G ID LVN A + Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGV 95
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 68.2 bits (166), Expect = 1e-16 Identities = 44/150 (29%), Positives = 71/150 (47%), Gaps = 16/150 (10%) Query: 9 WDRVMTVNVKGTWLVTRAAVPLM--REGAAIVNVASDTALWGAPR--LMAYVASKGAVIA 64 W+ +VN G + +R+ M R +IV V S+ A G PR + AY +SK A + Sbjct: 109 WEATFSVNSTGVFNASRSVSKYMMDRRSGSIVTVGSNPA--GVPRTSMAAYASSKAAAVM 166 Query: 65 MTRSMARELGEKRIRINAIAPGLTRVE----------ATEYVPAERHQLYENGRALSGAQ 114 T+ + EL E IR N ++PG T + E V + ++ G L Sbjct: 167 FTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLA 226 Query: 115 QPEDVTGSVVWLLSDLSRFITGQLIPVNGG 144 +P D+ +V++L+S + IT + V+GG Sbjct: 227 KPSDIADAVLFLVSGQAGHITMHNLCVDGG 256
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 63.1 bits (153), Expect = 1e-14 Identities = 38/134 (28%), Positives = 67/134 (50%), Gaps = 2/134 (1%) Query: 3 LDAFSLQGKVAVVSGCDTGLGQGMALGLAEAGCDIVGI--NIVEPVETIERVTALGRRFL 60 ++A ++GK+A ++G G+G+ +A LA G I + N + + + + A R Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE 60 Query: 61 SLTADLRQIDGIPQLLERAVAEFGHIDILVNNAGLIRREDALAFSEKDWDDVMNLNIKSV 120 + AD+R I ++ R E G IDILVN AG++R + S+++W+ ++N V Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120 Query: 121 FLCPRRRRNTLSPR 134 F R + R Sbjct: 121 FNASRSVSKYMMDR 134
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 50.8 bits (121), Expect = 5e-11 Identities = 31/105 (29%), Positives = 54/105 (51%), Gaps = 8/105 (7%) Query: 8 IRVPSYTASKSAVMGVTRLLANEWAKHNINVNAIAPGYMATNNTQQLRADEQRSSEILD- 66 + +Y +SK+A + T+ L E A++NI N ++PG T+ L ADE + +++ Sbjct: 152 TSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKG 211 Query: 67 -------RIPAGRWGLPADLMGPVVFLASSASDYINGYTVAVDGG 104 IP + P+D+ V+FL S + +I + + VDGG Sbjct: 212 SLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHITMHNLCVDGG 256
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 54.9 bits (132), Expect = 4e-10 Identities = 66/371 (17%), Positives = 131/371 (35%), Gaps = 34/371 (9%) Query: 38 LDIGVISGALPFITDHFTLSSQLQEWVVSSMMLGAAIGALFNGWLSFRLGRKYSLMAGAV 97 L+ V++ +LP I + F WV ++ ML +IG G LS +LG K L+ G + Sbjct: 28 LNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGII 87 Query: 98 LFVAGSIGSAFAAS-VEVLLVARVVLGVAVGIASYTAPLYLSEMASENVRGKMISMYQLM 156 + GS+ S +L++AR + G + ++ + RGK + + Sbjct: 88 INCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSI 147 Query: 157 VTLGIVLAFLSDTAFSYSGNWRAMLGVLALPAVILIILVVFLPNSPRWLAEKGRHIEAEE 216 V +G + ++ +W L L +I II V FL + H + + Sbjct: 148 VAMGEGVGPAIGGMIAHYIHW----SYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKG 203 Query: 217 VLRMLRDTSEKARDELNEIRESLKLKQGGWALFKV----------------NRNVRRAVF 260 ++ M + L + + +F N V Sbjct: 204 IILMSVGIVFFMLFTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVL 263 Query: 261 LGMLLQAMQQFTGMNIIMYYAPRIFKMAGFTTTEQQMIATLVVGLTFMFATFIAVFTVDK 320 G + G ++ Y + + +T E + ++ + +I VD+ Sbjct: 264 CGGI--IFGTVAGFVSMVPYMMK--DVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDR 319 Query: 321 AGRKPALKIGFSVMALGTLVLGYCLMQFDNGTASSGLSWLSVGMTMMCIAGYAMSAAPVV 380 G L IG + +++ L + T SW + + + G + + + Sbjct: 320 RGPLYVLNIGVTFLSVSFLTASF----LLETT-----SWFMTIIIVFVLGGLSFTKTVIS 370 Query: 381 WILCSEIQPLK 391 I+ S ++ + Sbjct: 371 TIVSSSLKQQE 381
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.0 bits (65), Expect = 0.036 Identities = 31/142 (21%), Positives = 48/142 (33%), Gaps = 8/142 (5%) Query: 43 AGDTGIIYAVLSVSALFAQVCYGFIQDKLGLRKHLLWYITALLILSGPAYLLFGHLLKIN 102 GI+ A+ ++ G + D+ G R LL L + Y + + Sbjct: 42 TAHYGILLALYALMQFACAPVLGALSDRFGRRPVLL----VSLAGAAVDYAIMATAPFLW 97 Query: 103 VL-LGSIFGGIYIGLTFNGGIGVLESYTERVARQSQFEFGRARMWGSLGWAVATFFAGLL 161 VL +G I GI G T + T+ R F F A G GL+ Sbjct: 98 VLYIGRIVAGI-TGATGAVAGAYIADITDGDERARHFGFMSACF--GFGMVAGPVLGGLM 154 Query: 162 FNINPQLNFLVASCSGLVFFIL 183 +P F A+ + F+ Sbjct: 155 GGFSPHAPFFAAAALNGLNFLT 176
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 29.7 bits (66), Expect = 0.024 Identities = 72/298 (24%), Positives = 118/298 (39%), Gaps = 41/298 (13%) Query: 128 NGKLNGIPISVTARVFYFNDEAWKKAGIPFPKTWDELMAAGKTFESKLGKQYYPVVLEHQ 187 NGKL PI+V A +N + PKTW+E+ A K ++K GK L+ Sbjct: 126 NGKLIAYPIAVEALSLIYNKDLLPNP----PKTWEEIPALDKELKAK-GKSALMFNLQEP 180 Query: 188 ----DVLALLNSYMVQKYNQPAIDEKGRKFSYSKAQWADFFGMYKKLIDSHVMPDTRYYA 243 ++A Y KY D K + A+ F + + + H+ DT Y Sbjct: 181 YFTWPLIAADGGYAF-KYENGKYDIKDVGVDNAGAKAGLTF-LVDLIKNKHMNADTDYSI 238 Query: 244 SFGKSNMYEMKPWIQGEWGGTYMWNSTINKYSDNLKPPAKLVLGEYPMLP--GATDAGLF 301 + N E I G W + + S +N Y + P K P P G AG Sbjct: 239 AEAAFNKGETAMTINGPWAWSNIDTSKVN-YGVTVLPTFK----GQPSKPFVGVLSAG-- 291 Query: 302 FKPAQMLSIGKSTKNPQAAAKVINFLLNSKEGVDILGLERGVPLSKAAVTYLTEDGVIKA 361 I ++ N + A + + L + EG++ + ++ PL A+ E+ A Sbjct: 292 --------INAASPNKELAKEFLENYLLTDEGLEAVNKDK--PLGAVALKSYEEE---LA 338 Query: 362 DDPAVSGLKLAQSLPTALPVSPYFDDPQIVA---QFGTTLQYIDYGKKSVEEAAEDFQ 416 DP ++A ++ A + PQ+ A T + G+++V+EA +D Q Sbjct: 339 KDP-----RIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQ 391
>BACINVASINB#Salmonella/Shigella invasin protein B signature. Length = 593 Score = 35.5 bits (81), Expect = 4e-04 Identities = 22/95 (23%), Positives = 44/95 (46%), Gaps = 10/95 (10%) Query: 60 EVRIGDKIVNNLAPKSRGIAM-VFQNYALYPHMTVRENLAFGLKLSKLPKAQIDRQVEEA 118 +V +G ++ N A + G+A VF A E LA L++ QI + ++++ Sbjct: 504 KVALGMEVTNTAAQSAGGVAEGVFIKNA-------SEALA-DFMLARFAMDQIQQWLKQS 555 Query: 119 AKIL-ELEELLDRLPRQLSGGQAQRVAVGRAIVKK 152 +I E +++ L + +S Q R I+++ Sbjct: 556 VEIFGENQKVTAELQKAMSSAVQQNADASRFILRQ 590
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 28.6 bits (64), Expect = 0.027 Identities = 11/25 (44%), Positives = 14/25 (56%) Query: 47 IVGESGSGKSTVGRALLQLHPKKAR 71 I GESG+GK V RAL ++ Sbjct: 165 ITGESGTGKELVARALHDYGKRRNG 189
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.018 Identities = 19/93 (20%), Positives = 28/93 (30%), Gaps = 35/93 (37%) Query: 36 LVGESGSGKTTVLKCLAGLFTHWQGELTI---------------------------DAQP 68 L G G GK+T++ L GL I DA+ Sbjct: 601 LEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRADAEA 660 Query: 69 LGHEISRERCRQVQMVFQDPYGSL---HPRHTI 98 + S + R ++ YG HPR + Sbjct: 661 VKAFFSSRKDR-----YRGAYGRYVQDHPRQVV 688
>INTIMIN#Intimin signature. Length = 939 Score = 229 bits (585), Expect = 7e-69 Identities = 123/448 (27%), Positives = 209/448 (46%), Gaps = 22/448 (4%) Query: 1 MPVSFRLLPTLTFLLLLPGVPVWALTASDTTRPAQAQDPLPDMGIAPQVDDDARHFAEVA 60 +P + LP LL P+ A + PD+ + DD A ++A Sbjct: 117 LPFEYSALP------LLGSAPLVAAGGVAGHTNKLTKMS-PDVTKSNMTDDKALNYAAQQ 169 Query: 61 KKFGEASMSDNGLTAGEQAQLFAISKIGNEVSHQLESWLSPWGNANVDLLVDKEGKFTGS 120 + + L G+ A+ A+ GN+ S QL++WL +G A V+L F GS Sbjct: 170 AASLGSQLQSRSLN-GDYAKDTALGIAGNQASSQLQAWLQHYGTAEVNLQSGN--NFDGS 226 Query: 121 KGSWFVPLQDNDRYLTWNQYSVTRREHDLVGNIGLGQRWRVGGWLLGYNSFYDKVLSESL 180 + +P D+++ L + Q + N+G GQR+ + +LGYN F D+ S Sbjct: 227 SLDFLLPFYDSEKMLAFGQVGARYIDSRFTANLGAGQRFFLPENMLGYNVFIDQDFSGDN 286 Query: 181 ARGSVGAEAWGEYLRLSANYYHPLGDW-QLRDNQTQEQRMAAGYDVTAQARLPFYQHINT 239 R +G E W +Y + S N Y + W + + + ++R A G+D+ LP Y + Sbjct: 287 TRLGIGGEYWRDYFKSSVNGYFRMSGWHESYNKKDYDERPANGFDIRFNGYLPSYPALGA 346 Query: 240 SVSVEQYFGDSVDLFHSGTGYHNPVAVSVGLNYTPVPLVTVTAKHKQGENGVSQNNVGLK 299 + EQY+GD+V LF+S NP A +VG+NYTP+PLVT+ ++ G + ++ Sbjct: 347 KLMYEQYYGDNVALFNSDKLQSNPGAATVGVNYTPIPLVTMGIDYRHGTGNENDLLYSMQ 406 Query: 300 LNYRFGVPLKQQLAADEVAISNSLRGSRFDSPERDNLPVVEYRQRKNLTVYLATP-PWDL 358 Y+F P QQ+ V +L GSR+D +R+N ++EY +K + L P + Sbjct: 407 FRYQFDKPWSQQIEPQYVNELRTLSGSRYDLVQRNNNIILEY--KKQDILSLNIPHDING 464 Query: 359 QSGETVQLKLQIHSLHGIKALHWQGDTQALSLTPPVDASSPDG---WSIIMPVWNSEPGA 415 T +++L + S +G+ + W D+ S + S + I+P + G Sbjct: 465 TERSTQKIQLIVKSKYGLDRIVWD-DSALRSQGGQIQHSGSQSAQDYQAILPAYV--QGG 521 Query: 416 ANRWRLSVVVEDKQGQRVSSNEIALALT 443 +N ++++ D+ G SSN + L +T Sbjct: 522 SNVYKVTARAYDRNGN--SSNNVLLTIT 547
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 74.5 bits (183), Expect = 8e-18 Identities = 35/118 (29%), Positives = 57/118 (48%), Gaps = 2/118 (1%) Query: 6 RATILLIDDHPMLRTGVKQLISMAPDIQVIGEASNGAQGIELAESLDPDLILLDLNMPGM 65 ATIL+ DD +RT + Q +S A V SN A + D DL++ D+ MP Sbjct: 3 GATILVADDDAAIRTVLNQALSRA-GYDVRI-TSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 66 NGLETLDKLREKSLSGRVVVFSVSNHEEDVVTALKRGADGYLLKDMEPEDLLKALQQA 123 N + L ++++ V+V S N + A ++GA YL K + +L+ + +A Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRA 118
>PF06580#Sensor histidine kinase Length = 349 Score = 48.3 bits (115), Expect = 5e-08 Identities = 28/116 (24%), Positives = 49/116 (42%), Gaps = 9/116 (7%) Query: 476 FGFTVQLDYQLPPRFVPSHQAIHLLQIAREALSNALKHASAT-----EVTVTVSQRDNQV 530 F +Q + Q+ P + L+Q E N +KH A ++ + ++ + V Sbjct: 236 FEDRLQFENQINPAIMDVQVPPMLVQTLVE---NGIKHGIAQLPQGGKILLKGTKDNGTV 292 Query: 531 RLVVADNGRGVPDHAERSNHYGLIIMRDRAQSLRG-DCQVRRRETGGTEVIVTFIP 585 L V + G + + S GL +R+R Q L G + Q++ E G + IP Sbjct: 293 TLEVENTGSLALKNTKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.0 bits (70), Expect = 0.011 Identities = 52/282 (18%), Positives = 93/282 (32%), Gaps = 49/282 (17%) Query: 128 TPFSIFVIISLLCGFAGANF-ASSMANISFFFPKAKQGGALGVNGGLGNMGVSVMQLVAP 186 + FS+ ++ + G A F A M ++ + PK +G A G+ G + MG V + Sbjct: 101 SFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGPAIGG 160 Query: 187 ------------LVVSISIFAVFGGNGSEQPDGS--------------------MLYLEN 214 L+ I+I V + + ML+ + Sbjct: 161 MIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFMLFTTS 220 Query: 215 AAWIWVPFLIIFTLAAWFFMNDLSASK-----ASLSEQLPVLKRLHLWIMALLYLATFGS 269 + FLI+ L+ F+ + L + +P + + + +A F S Sbjct: 221 YSIS---FLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVS 277 Query: 270 FIGFSAGFAM-LSKTQFPDVQILHYAFFGPFIGALARSMGGAISDRLGGTRVTLVNFVVM 328 + + LS + V I G + GG + DR G V + + Sbjct: 278 MVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYI----GGILVDRRGPLYVLNIGVTFL 333 Query: 329 AVFCALLFLTLPTNGQGGNFIAFFAVFMVLFLTAGLGSASTF 370 +V L T F+ VF++ L+ ST Sbjct: 334 SVSFLTASFLLETTSW---FMTIIIVFVLGGLSFTKTVISTI 372
>cloacin#Cloacin signature. Length = 551 Score = 30.8 bits (69), Expect = 0.002 Identities = 15/44 (34%), Positives = 20/44 (45%) Query: 23 PAYANPGNGNGNGGGNHGNSGNHGNSGNHGNNGNSGDHGNKGQN 66 +++ N G G G+ + G GN G NGNSG G N Sbjct: 37 SGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80 Score = 29.7 bits (66), Expect = 0.006 Identities = 13/39 (33%), Positives = 16/39 (41%) Query: 27 NPGNGNGNGGGNHGNSGNHGNSGNHGNNGNSGDHGNKGQ 65 NP G G + G HGN G +GN+G G Sbjct: 44 NPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLS 82 Score = 29.7 bits (66), Expect = 0.007 Identities = 11/32 (34%), Positives = 16/32 (50%) Query: 26 ANPGNGNGNGGGNHGNSGNHGNSGNHGNNGNS 57 + G G+G GN G +GN G G N ++ Sbjct: 52 SGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 33.8 bits (77), Expect = 1e-04 Identities = 13/63 (20%), Positives = 27/63 (42%), Gaps = 1/63 (1%) Query: 78 RHTVEHSVYVHPEHQGKGLGRKLLVALIAEARRLNKHVMVAGIESQNHASLHLHETLGFI 137 +E + V +++ KG+G LL I A+ + ++ + N ++ H + FI Sbjct: 89 YALIED-IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFI 147 Query: 138 TTG 140 Sbjct: 148 IGA 150
>PF05272#Virulence-associated E family protein Length = 892 Score = 25.8 bits (56), Expect = 0.035 Identities = 12/26 (46%), Positives = 16/26 (61%) Query: 37 REEAESTVAAVRERAAAAAPASEPPQ 62 REE +VA + A A APA +PP+ Sbjct: 95 REEGLESVAGIVMGAPAGAPAPKPPR 120
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 35.5 bits (82), Expect = 5e-05 Identities = 16/36 (44%), Positives = 19/36 (52%) Query: 96 YTINAAYQEGKEAVRGSIEVGKVADFQVLDRDIFAV 131 YTIN A G GS+EVGK AD + + F V Sbjct: 409 YTINPAIAHGLSHEIGSLEVGKRADLVLWNPAFFGV 444
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 32.8 bits (75), Expect = 0.001 Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 8/72 (11%) Query: 8 DCVLINGKVATVDAHFSFKRAIAVKQGWIINVGE--DQEIQQH----IGPQTQVIDLKGK 61 D V+ N +D K I +K G I +G+ + ++Q +GP T+VI +GK Sbjct: 69 DTVITN--ALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGK 126 Query: 62 LILPAAHDSHIH 73 ++ DSHIH Sbjct: 127 IVTAGGMDSHIH 138
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 101 bits (254), Expect = 5e-26 Identities = 63/409 (15%), Positives = 125/409 (30%), Gaps = 83/409 (20%) Query: 21 SIFTAAAIGLVGVLVILYAWQLPPFTRHSQFTDNAYVRGQTTFISPQVNGYITAVNVKDF 80 A I V+ + + L + G++ I P N + + VK+ Sbjct: 57 PRLVAYFIMGFLVIAFILSV-LGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEG 115 Query: 81 AIVQPGEVLFQIDDR-----IYKQRVHQAQATL------AMKEAALRNNL---------- 119 V+ G+VL ++ K + QA L + + N L Sbjct: 116 ESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPY 175 Query: 120 ------------------------QQRKSAEATIAKNEAALQNARAQNLKIQADLKRIQQ 155 Q+ E + K A A+ + + + + Sbjct: 176 FQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKS 235 Query: 156 -------LTADGSLS---IRERDSARASA----AQGAADIEQAKAALEMSRQD------- 194 L +++ + E+++ A + +EQ ++ + ++++ Sbjct: 236 RLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQL 295 Query: 195 -RESTIVNRDSLEADVASAKAALELAQIDLQNTQIIAPTGGQLGQISVR-LGAYVSAGTH 252 + + ++ L + Q + I AP ++ Q+ V G V+ Sbjct: 296 FKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET 355 Query: 253 LTSLVPPQH--WVIANLKETQLAEVRVGQPVTFTVDALNGETFH---GKVQSISPATGVE 307 L +VP V A ++ + + VGQ V+A + GKV++I+ Sbjct: 356 LMVIVPEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--- 412 Query: 308 FSAISPDNATGNFVKIAQRIPVRITVNDGQNNSERLRPGMSVQVTIDTR 356 D G + I +N L GM+V I T Sbjct: 413 ----IEDQRLGLVFNVIISIEENCLSTGNKN--IPLSSGMAVTAEIKTG 455
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 33.3 bits (76), Expect = 0.002 Identities = 28/235 (11%), Positives = 67/235 (28%), Gaps = 53/235 (22%) Query: 69 SDVLIARERVNEYQARAYAADSSLFPSLDASLTGTRARTQSAATGLPIHSTLYKGGLTAS 128 S +L AR YQ + + + + P L L + + ++L K + Sbjct: 141 SSLLQARLEQTRYQILSRSIELNKLPELK--LPDEPYFQNVSEEEVLRLTSLIKEQFST- 197 Query: 129 YDVDIWGANRSAANAAGASLEAQKAAAAAANLSVASSVAVGYVTLLSLDEQLRVTQQTLT 188 W + A++ A + + RV + L Sbjct: 198 -----WQNQKYQKELNLDKKRAERLTVLA--------------RINRYENLSRVEKSRLD 238 Query: 189 SREDAWRLAKRQFETGYTSRLELM-------QADSELRSTRAQIPPLQHQIAQQENALSV 241 L + ++ ++ +A +ELR ++Q+ ++ +I + + Sbjct: 239 DFS---SLLHK----QAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQL 291 Query: 242 LLGDNPGAVKRGEFAQLTPLRLPSQLPPTLLNRRPDIAQAERQLVAADATLASSQ 296 + +++ L +I +L + +S Sbjct: 292 VTQL-----------------FKNEILDKLRQTTDNIGLLTLELAKNEERQQASV 329
>LIPPROTEIN48#Mycoplasma P48 major surface lipoprotein signature. Length = 428 Score = 28.4 bits (63), Expect = 0.019 Identities = 34/139 (24%), Positives = 52/139 (37%), Gaps = 38/139 (27%) Query: 70 QQEALALSDELIAELKGNDVIVIAAPMYNFNIPTQLKNYFDL---VARAGVTFRY----- 121 QQ D EL+ N + +I +F+I T+ K ++ L + + T Y Sbjct: 129 QQSIKQYIDAHREELERNQIKIIGI---DFDIETEYKWFYSLQFNIKESAFTTGYAIASW 185 Query: 122 -TEKGPEGLVTGKRAVVVTSRGGIHKDTPTDLVTPYLSTFLGFIGITDVNFVFAEGIAY- 179 +E+ KR VV S GG F G+T N FA+GI Y Sbjct: 186 LSEQDES-----KR--VVASFGGGA-----------------FPGVTTFNEGFAKGILYY 221 Query: 180 -GPEVAAKAQSDAKAAIDS 197 ++K + +DS Sbjct: 222 NQKHKSSKIYHTSPVKLDS 240
>PRTACTNFAMLY#Pertactin virulence factor family signature. Length = 910 Score = 28.5 bits (63), Expect = 0.048 Identities = 16/54 (29%), Positives = 23/54 (42%) Query: 5 SKLDAFIQQAVTAMPISGTSLIASLYGDALLQRGGEVWLGSVAALLEGLGFGER 58 +L A AV + S + +AL +R GE+ L A G GF +R Sbjct: 604 RELSAAANAAVNTGGVGLASTLWYAESNALSKRLGELRLNPDAGGAWGRGFAQR 657
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 45.2 bits (107), Expect = 1e-07 Identities = 22/83 (26%), Positives = 38/83 (45%), Gaps = 11/83 (13%) Query: 1 MRIFLTGASGFIGSRILPALQASGHQVIGL---------ARSESTAQALKAAGAEVHCGT 51 M+ +TGA+GFIG + L +GHQV+G+ + ++ + L G + H Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 52 LDAPESL--LAGVGNADAVIHTA 72 L E + L G+ + V + Sbjct: 61 LADREGMTDLFASGHFERVFISP 83
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 89.9 bits (223), Expect = 4e-23 Identities = 33/124 (26%), Positives = 59/124 (47%) Query: 2 RVLVVEDNALLRHHLKVQLQELGHQVDAAEDAREADYYLGEHLPDIAIVDLGLPDEDGLS 61 +LV +D+A +R L L G+ V +A ++ D+ + D+ +PDE+ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 LIRRWRSHDVSLPVLVLTAREGWQDKVEVLSAGADDYVTKPFHIEEVAARMQALLRRNSG 121 L+ R + LPVLV++A+ + ++ GA DY+ KPF + E+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 LASQ 125 S+ Sbjct: 125 RPSK 128
>PF06580#Sensor histidine kinase Length = 349 Score = 27.5 bits (61), Expect = 0.009 Identities = 11/78 (14%), Positives = 25/78 (32%), Gaps = 21/78 (26%) Query: 10 NACKYCLE------FVEVSVRQTTDSHLHILVEDDGPGIPQSQRRAVFDRGQRADTLRPG 63 N K+ + + + + + + + VE+ G ++ + Sbjct: 266 NGIKHGIAQLPQGGKILLKGTKD-NGTVTLEVENTGSLALKNTKE--------------S 310 Query: 64 QGVGLSVAREIVEQYDGE 81 G GL RE ++ G Sbjct: 311 TGTGLQNVRERLQMLYGT 328
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 0.020 Identities = 8/22 (36%), Positives = 14/22 (63%) Query: 46 LTLLGPSGCGKTTVLRLIAGLE 67 + L G G GK+T++ + GL+ Sbjct: 599 VVLEGTGGIGKSTLINTLVGLD 620
>PF06580#Sensor histidine kinase Length = 349 Score = 29.1 bits (65), Expect = 0.020 Identities = 24/108 (22%), Positives = 41/108 (37%), Gaps = 8/108 (7%) Query: 60 LYYDVLLHSLNMALLATLACLALGYPFAWFLARLPQKVRPLLLFLLIVP--------FWT 111 LY LHS+ + +L L L + + F+ R + +L V W Sbjct: 33 LYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKRQGWLKLNMGQIILRVLPACVVIGMVWF 92 Query: 112 NSLIRIYGLKIFLSTKGYLNEFLLWLGVIDTPIRIMFTPSAVIIGLVY 159 + I+ L F++TK L L +I + + F S + G + Sbjct: 93 VANTSIWRLLAFINTKPVAFTLPLALSIIFNVVVVTFMWSLLYFGWHF 140
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 32.5 bits (74), Expect = 0.003 Identities = 22/106 (20%), Positives = 35/106 (33%), Gaps = 6/106 (5%) Query: 301 LMIGMITFQFSSFSFGIGNAAGLLFAGIML-GFLRANHPTFG-YIPQ--GALNMVKEFGL 356 L++ + +L+ G ++ G A G YI + FG Sbjct: 76 LLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGF 135 Query: 357 MVFMAGVGLSAGAGINNGLGAVGGQM--LAAGLIVSLVPVVICFLF 400 M G G+ AG + +G AA + L + CFL Sbjct: 136 MSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 50.4 bits (120), Expect = 4e-10 Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 2/76 (2%) Query: 1 MAR--RPNDPQRRERILQATLDTIAAHGIHAVTHRKIATCANVPLGSLTYYFSGIEALIE 58 MAR + + R+ IL L + G+ + + +IA A V G++ ++F L Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 59 EAFSLFTAEMSAQYQQ 74 E + L + + + Sbjct: 61 EIWELSESNIGELELE 76
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 34.1 bits (78), Expect = 8e-04 Identities = 34/155 (21%), Positives = 62/155 (40%), Gaps = 19/155 (12%) Query: 17 LFMFFFIPGLLMASWATRTPAIRDLLALSTAEMGVVLFGLSVGSMSGILCS---AWLVKR 73 + I G + + ++D+ LSTAE+G V+ + G+MS I+ LV R Sbjct: 262 VLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVI--IFPGTMSVIIFGYIGGILVDR 319 Query: 74 FGTRKVIRTTM-----SFAVLGMLVLSLALWVTSAPLFAFGLAIFGASFGSAEVAINVEG 128 G V+ + SF L+ + + ++T +F G F + S V+ +++ Sbjct: 320 RGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQ 379 Query: 129 AAIEREMNKTVLPMMHGFYSFGTLFGAGVGMAVTG 163 M +F + G G+A+ G Sbjct: 380 QEAGAGM---------SLLNFTSFLSEGTGIAIVG 405 Score = 30.2 bits (68), Expect = 0.016 Identities = 30/150 (20%), Positives = 63/150 (42%), Gaps = 6/150 (4%) Query: 218 LLIGVIVLAMAFAEGSANDWL-PLLMVDGHGFSP-TSGSLIYAGFTLGMTLGRFTGGWFI 275 +IGV+ + F + + P +M D H S GS+I T+ + + + GG + Sbjct: 258 FMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILV 317 Query: 276 DRYSRVAVVRGSAV---MGALGIGLIIFVDNPWVAGISVLLWGIGASLGFPLTISAASDT 332 DR + V+ + L ++ + ++ I V + G + ++ +S Sbjct: 318 DRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSL 377 Query: 333 GP-DAPKRVSVVAITGYLAFLVGPPLLGFL 361 +A +S++ T +L+ G ++G L Sbjct: 378 KQQEAGAGMSLLNFTSFLSEGTGIAIVGGL 407
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 45.2 bits (107), Expect = 3e-07 Identities = 61/267 (22%), Positives = 106/267 (39%), Gaps = 19/267 (7%) Query: 71 LLGPLSDRIGRRPVMLTGVVWFIVTCLATLLAQTIEQFTLLRFLQGISLCFIGAVGYAAI 130 +LG LSDR GRRPV+L + V A + + R + GI+ GAV A I Sbjct: 62 VLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA-TGAVAGAYI 120 Query: 131 QESFEEAVCIKITALMANVALIAPLLGPLVGAAWVHILPWEMMFVLFAVLAAISFFGLQR 190 + + + M+ + GP++G P F A L ++F Sbjct: 121 ADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSP-HAPFFAAAALNGLNFLTGCF 179 Query: 191 AMPET--ATRLGEKLSVKELGRDYRLVLKNLRFVAGALATGFVSLPLLAWIAQSP--VII 246 +PE+ R + +R + + VA +A F ++ + Q P + + Sbjct: 180 LLPESHKGERRPLRREALNPLASFRWA-RGMTVVAALMAVFF----IMQLVGQVPAALWV 234 Query: 247 ISGEQATSYEYGMLQVPI--FGAL--IAGNLVLARLTARRTVRSLIIMGGWPIMFGLILS 302 I GE ++ + + + FG L +A ++ + AR R +++G G IL Sbjct: 235 IFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILL 294 Query: 303 AAATVVSSHAYLWMTAGLSFYAFGIGL 329 A AT ++ + + GIG+ Sbjct: 295 AFAT----RGWMAFPIMVLLASGGIGM 317
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 33.6 bits (77), Expect = 2e-04 Identities = 37/135 (27%), Positives = 54/135 (40%), Gaps = 9/135 (6%) Query: 18 CALLFLVAPAV-QAAEQLPDAPS-IDAR-AWILMDYASGKVLSEGNADEKLDPASLTKIM 74 A L L A Q EQ+ + S + R I MD ASG+ L+ ADE+ S K++ Sbjct: 12 LATLPLAVHASPQPLEQIKLSESQLSGRVGMIEMDLASGRTLTAWRADERFPMMSTFKVV 71 Query: 75 TSYVVGQAIKAGKIKLTDMVTVGRDAWATGNPALRGSSVMFLKPGMQVSVEDLNKGVIIQ 134 V + AG +L + + +P L GM +V +L I Sbjct: 72 LCGAVLARVDAGDEQLERKIHYRQQDLVDYSPVSE----KHLADGM--TVGELCAAAITM 125 Query: 135 SGNDASIAIADYVAG 149 S N A+ + V G Sbjct: 126 SDNSAANLLLATVGG 140
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 61.6 bits (149), Expect = 1e-14 Identities = 17/100 (17%), Positives = 40/100 (40%), Gaps = 7/100 (7%) Query: 4 KGEQAKNQLIAAAIAQFGEYGQHATT-RDIAAQAGQNIAAITYYFGSKDDLYLACAQWIA 62 + ++ + ++ A+ F + G +T+ +IA AG AI ++F K DL+ + Sbjct: 8 EAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSE 67 Query: 63 DFIGDNFRPQAEAAEHLLAGEAPDRQAIRDLILSACHNMI 102 IG+ +R++++ + + Sbjct: 68 SNIGELELEYQAKF------PGDPLSVLREILIHVLESTV 101
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 37.9 bits (88), Expect = 3e-06 Identities = 12/59 (20%), Positives = 21/59 (35%), Gaps = 7/59 (11%) Query: 50 VGGRLASLTVDEGDSIRAGQTLGELDRTPYENALLQAQANVSTAQAQYDLMMAATARKR 108 + + V EG+S+R G L +L A+A+ Q+ R + Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTAL-------GAEADTLKTQSSLLQARLEQTRYQ 154
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 66.0 bits (161), Expect = 6e-15 Identities = 39/213 (18%), Positives = 81/213 (38%), Gaps = 23/213 (10%) Query: 15 YQRQLGLRASSAISANDLENARSSRDQAQATLKSAQDKLRQYRAGNRPQ---EIAQAKAS 71 L + N+L +S +Q ++ + SA+++ + + + ++ Q + Sbjct: 251 KHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDN 310 Query: 72 LEQAQAALAQAKLDLHDTVLTAPSDGTLMTRAV-EPGTMLNAGGTVLTLSLT-HPVWVRA 129 + LA+ + +V+ AP + V G ++ T++ + + V A Sbjct: 311 IGLLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTA 370 Query: 130 YVDEKNLGQAQPGQEVLLYTDSRPDKPYH---GKIGFVSPSAEFTPKTVETPDLRTDLVY 186 V K++G GQ ++ ++ P Y GK+ ++ A D R LV+ Sbjct: 371 LVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDA--------IEDQRLGLVF 422 Query: 187 RLRIVVTDADGA-------LRQGMPVTISFSHG 212 + I + + + L GM VT G Sbjct: 423 NVIISIEENCLSTGNKNIPLSSGMAVTAEIKTG 455
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.8 bits (69), Expect = 0.019 Identities = 11/27 (40%), Positives = 14/27 (51%) Query: 30 IRAGYVTGLVGPDGAGKTTLMRMLAGL 56 + Y L G G GK+TL+ L GL Sbjct: 593 CKFDYSVVLEGTGGIGKSTLINTLVGL 619
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 29.1 bits (65), Expect = 0.005 Identities = 21/82 (25%), Positives = 38/82 (46%) Query: 29 SLLLFYFTMVIYGLSLVGFGLLISSLCATQQQAFIGVFVFMMPAILLSGYVSPVENMPQW 88 SLL + + GL+ G+++++L + + + P + LSG V PV+ +P Sbjct: 146 SLLYALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIV 205 Query: 89 LQDLTWINPIRHFTDITKQIYL 110 Q P+ H D+ + I L Sbjct: 206 FQTAARFLPLSHSIDLIRPIML 227
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1365 bits (3535), Expect = 0.0 Identities = 805/1032 (78%), Positives = 911/1032 (88%) Query: 1 MPNFFIDRPIFAWVIAIIIMLAGGLSILKLPVAQYPTIAPPAISITAMYPGADAETVQNT 60 M NFFI RPIFAWV+AII+M+AG L+IL+LPVAQYPTIAPPA+S++A YPGADA+TVQ+T Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQNMNGIDHLMYMSSNGDSTGTATITLTFESGTDPDIAQVQVQNKLALATPLLPQ 120 VTQVIEQNMNGID+LMYMSS DS G+ TITLTF+SGTDPDIAQVQVQNKL LATPLLPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 EVQQQGISVEKASSSFLMVVGVINTNGTMNQDDISDYVAANMKDPISRTSGVGDVQLFGS 180 EVQQQGISVEK+SSS+LMV G ++ N QDDISDYVA+N+KD +SR +GVGDVQLFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWMDPNKLNNFQLTPVDVISALKAQNAQVAAGQLGGTPPVKGQQLNASIIAQTRL 240 QYAMRIW+D + LN ++LTPVDVI+ LK QN Q+AAGQLGGTP + GQQLNASIIAQTR Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 TNTEEFGNILLKVNQDGSQVRLRDVAKIELGGESYDVVAKFNGQPASGLGIKLATGANAL 300 N EEFG + L+VN DGS VRL+DVA++ELGGE+Y+V+A+ NG+PA+GLGIKLATGANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 DTANAIRAELAKMEPFFPSGMKIVYPYDTTPFVKISIHEVVKTLVEAIILVFLVMYLFLQ 360 DTA AI+A+LA+++PFFP GMK++YPYDTTPFV++SIHEVVKTL EAI+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NFRATLIPTIAVPVVLLGTFAVLAAFGFSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 N RATLIPTIAVPVVLLGTFA+LAAFG+SINTLTMFGMVLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 AEEGLPPKEATRKSMGQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 E+ LPPKEAT KSM QIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATMLKPIQKGSHGATTGFFGWFNRMFDKSTHHYTDSVGNILRSTGRY 540 SVLVALILTPALCAT+LKP+ H GFFGWFN FD S +HYT+SVG IL STGRY Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 LVLYLIIVVGMAWLFVRLPSSFLPDEDQGVFLSMAQLPAGATQERTQKVLDEMTNYYLTK 600 L++Y +IV GM LF+RLPSSFLP+EDQGVFL+M QLPAGATQERTQKVLD++T+YYL Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKDNVESVFAVNGFGFAGRGQNTGIAFVSLKDWSQRPGEENKVEAITARAMGYFSQIKDA 660 EK NVESVF VNGF F+G+ QN G+AFVSLK W +R G+EN EA+ RA +I+D Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 MVFAFNLPAIVELGTATGFDFELIDQGGLGHEKLTQARNQLFGMVAQHPDVLTGVRPNGL 720 V FN+PAIVELGTATGFDFELIDQ GLGH+ LTQARNQL GM AQHP L VRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 721 EDTPQFKIDIDQEKAQALGVSISDINTTLGAAWGGSYVNDFIDRGRVKKVYIMSEAKYRM 780 EDT QFK+++DQEKAQALGVS+SDIN T+ A GG+YVNDFIDRGRVKK+Y+ ++AK+RM Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRM 780 Query: 781 LPEDIGKWYVRGSDGQMVPFSAFSTSRWEYGSPRLERYNGLPSLEILGQAAPGKSTGEAM 840 LPED+ K YVR ++G+MVPFSAF+TS W YGSPRLERYNGLPS+EI G+AAPG S+G+AM Sbjct: 781 LPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 841 ALMEELAGKLPSGIGYDWTGMSYQERLSGNQAPALYAISLIVVFLCLAALYESWSIPFSV 900 ALME LA KLP+GIGYDWTGMSYQERLSGNQAPAL AIS +VVFLCLAALYESWSIP SV Sbjct: 841 ALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSV 900 Query: 901 MLVVPLGVVGALLAATFRGLTNDVYFQVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGLI 960 MLVVPLG+VG LLAAT NDVYF VGLLTTIGLSAKNAILIVEFAKDLMEKEGKG++ Sbjct: 901 MLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVV 960 Query: 961 EATLEAVRMRLRPILMTSLAFILGVMPLVISSGAGSGAQNAVGTGVMGGMVTATILAIFF 1020 EATL AVRMRLRPILMTSLAFILGV+PL IS+GAGSGAQNAVG GVMGGMV+AT+LAIFF Sbjct: 961 EATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFF 1020 Query: 1021 VPVFFVVVRRRF 1032 VPVFFVV+RR F Sbjct: 1021 VPVFFVVIRRCF 1032
>ISCHRISMTASE#Isochorismatase signature. Length = 312 Score = 36.1 bits (83), Expect = 6e-05 Identities = 35/186 (18%), Positives = 67/186 (36%), Gaps = 21/186 (11%) Query: 7 LDPTNSALIFIDHQPQM--SFGVANIDRQTLKNNTVALAKAGKIFNVPVIYT------SV 58 DP + L+ D Q +F L N L +PV+YT + Sbjct: 26 PDPNRAVLLIHDMQNYFVDAFTAGASPVTELSANIRKLKNQCVQLGIPVVYTAQPGSQNP 85 Query: 59 ETKSFSGYIW-PELLAVHPDVKPIERTS-------MNSWEDDAF-----VAAVKATGRKK 105 + ++ W P L + + K I + + W AF + ++ GR + Sbjct: 86 DDRALLTDFWGPGLNSGPYEEKIITELAPEDDDLVLTKWRYSAFKRTNLLEMMRKEGRDQ 145 Query: 106 LVISALWTEVCLTFPALMALEAGYEVYVVTDTSGGTSVDAHERSIDRMVQAGAVPVTWQQ 165 L+I+ ++ + A A + + V D S++ H+ +++ A V Sbjct: 146 LIITGIYAHIGCLVTACEAFMEDIKAFFVGDAVADFSLEKHQMALEYAAGRCAFTVMTDS 205 Query: 166 VLLEYQ 171 +L + Q Sbjct: 206 LLDQLQ 211
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 29.4 bits (66), Expect = 0.028 Identities = 17/75 (22%), Positives = 29/75 (38%) Query: 357 FLVIASLATFATVWVWIMILLSQIAFRRRLSPEEVKALKFKVPGGVLTTVIGLLFLAFII 416 F A+L + ++ S RR L E + L +T V L+ + FI+ Sbjct: 163 FFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIM 222 Query: 417 ALIGYHPDTRISLYV 431 L+G P ++ Sbjct: 223 QLVGQVPAALWVIFG 237
>PF06580#Sensor histidine kinase Length = 349 Score = 31.4 bits (71), Expect = 0.006 Identities = 16/98 (16%), Positives = 30/98 (30%), Gaps = 25/98 (25%) Query: 325 LVYNAVNH----TPPGTEIRVSWQRTPQGALFSVEDNGPGIAPEHIPRLTERFYRVDKAR 380 LV N + H P G +I + + VE+ G Sbjct: 263 LVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEVENTGSLALKN---------------- 306 Query: 381 SRQTGGSGLGLAIVKHAVNH---HDSRLEIDSTVGKGT 415 +G GL V+ + ++++++ GK Sbjct: 307 --TKESTGTGLQNVRERLQMLYGTEAQIKLSEKQGKVN 342
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 95.7 bits (238), Expect = 5e-25 Identities = 34/149 (22%), Positives = 63/149 (42%), Gaps = 9/149 (6%) Query: 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGLQ 63 ILV +D+A IR ++ L + G+ + + + DL++ D ++P + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 64 FIKLLKREAMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRI 123 + +K+ D+PV++++A+ ++ E GA DY+ KPF EL+ I + Sbjct: 65 LLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA-- 120 Query: 124 SPMAVEEVIEMQGLSLDPSSHRVMTGDSP 152 E L D + G S Sbjct: 121 -----EPKRRPSKLEDDSQDGMPLVGRSA 144
>DPTHRIATOXIN#Diphtheria toxin signature. Length = 567 Score = 36.6 bits (84), Expect = 5e-04 Identities = 45/200 (22%), Positives = 89/200 (44%), Gaps = 27/200 (13%) Query: 552 QVKALTQQLQRDTEAAGRLAEEEQALTKAWQETCASLHITRDIAQEIN----DWMQEQER 607 Q KAL+ +L+ + E G+ ++ A+ + + CA + R + ++ DW +++ Sbjct: 187 QAKALSVELEINFETRGKRGQD--AMYEYMAQACAGNRVRRSVGSSLSCINLDWDVIRDK 244 Query: 608 YEQQLYQLSQRLMLQSQLND---QQALERQAEQQLAATRQGLESALQALALS--LPAEGT 662 + ++ L + +++++++ + E +A+Q L Q +AL+ LS GT Sbjct: 245 TKTKIESLKEHGPIKNKMSESPNKTVSEEKAKQYLEEFHQ---TALEHPELSELKTVTGT 301 Query: 663 EAAWLHARESEFAQWQAQQTQ------HDAIQQQIAALRPLLETLPTSDETEVEAESAIP 716 + A +A W Q D +++ AAL LP A+ A+ Sbjct: 302 NPVFAGA---NYAAWAVNVAQVIDSETADNLEKTTAAL----SILPGIGSVMGIADGAVH 354 Query: 717 DNWREIHEECLSLHSQLVAQ 736 N EI + ++L S +VAQ Sbjct: 355 HNTEEIVAQSIALSSLMVAQ 374
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 33.6 bits (77), Expect = 8e-04 Identities = 13/40 (32%), Positives = 23/40 (57%), Gaps = 1/40 (2%) Query: 216 EGDEKAELALSRYEQRLAKSLAHVVNILDP-DVIVLGGGM 254 GD++A+LAL+ + R+ K++ + DVIV G+ Sbjct: 293 NGDKRAQLALNVFAYRVKKTIGSYAAAMGGVDVIVFTAGI 332
>PF05272#Virulence-associated E family protein Length = 892 Score = 31.6 bits (71), Expect = 0.002 Identities = 11/66 (16%), Positives = 19/66 (28%), Gaps = 4/66 (6%) Query: 4 PIFLIGPRGCGKTTVGHALARARHFQFSDTDHRLQAHEQRTVAEIVQAEGWARFRELETL 63 + L G G GK+T+ + L F +D + E + E+ Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFF----SDTHFDIGTGKDSYEQIAGIVAYELSEMTAF 653 Query: 64 SLKAVT 69 Sbjct: 654 RRADAE 659
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 27.8 bits (62), Expect = 0.048 Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 2/76 (2%) Query: 95 RALLEKTEHALHQHSMITILIGRFVGPTRPLVPMVAGMLDLPVAKFVLPNIIGCLLWPPL 154 R LEK + Q + +L G + + PM+A + +L AK ++ LL + Sbjct: 362 RLCLEKQDIFRTQ--LRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGV 419 Query: 155 YFLPGILAGAAIDIPA 170 I G ++IP+ Sbjct: 420 DVSDSIEVGIMVEIPS 435
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 30.6 bits (69), Expect = 0.025 Identities = 22/90 (24%), Positives = 35/90 (38%), Gaps = 11/90 (12%) Query: 208 DYSAAVLAACLRADCCEIWTDVDGVYTCDPRQVPDARLLKSMSYQEA---MELSYFGAKV 264 D + LA + AD I TDV+G + L+ + +E E +F A Sbjct: 216 DLAGEKLAEEVNADIFMILTDVNGAAL-YYGT-EKEQWLREVKVEELRKYYEEGHFKAGS 273 Query: 265 LHPRTIAPIAQFQIPCLIKNTGNPQAPGTL 294 + P+ +A I +F I+ G L Sbjct: 274 MGPKVLAAI-RF-----IEWGGERAIIAHL 297
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 83.0 bits (205), Expect = 2e-20 Identities = 31/122 (25%), Positives = 60/122 (49%), Gaps = 1/122 (0%) Query: 1 MQTPHILIVEDELVTRNTLKSIFEAEGYDVFEATDGAEMHQILSENDINLVIMDINLPGK 60 M IL+ +D+ R L GYDV ++ A + + ++ D +LV+ D+ +P + Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 NGLLLARELRE-QADVALMFLTGRDNEVDKILGLEIGADDYITKPFNPRELTIRARNLLS 119 N L +++ + D+ ++ ++ ++ + I E GA DY+ KPF+ EL L+ Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 120 RT 121 Sbjct: 121 EP 122
>PF06580#Sensor histidine kinase Length = 349 Score = 37.9 bits (88), Expect = 3e-05 Identities = 28/94 (29%), Positives = 41/94 (43%), Gaps = 20/94 (21%) Query: 166 AIDFTPQGGEIALAAEKRNEEVQLSVIDNGCGIPDYALERIFERFYSLPREDGHKSSGLG 225 I PQGG+I L K N V L V + G SL ++ +S+G G Sbjct: 271 GIAQLPQGGKILLKGTKDNGTVTLEVENTG----------------SLALKNTKESTGTG 314 Query: 226 LAFVREVARLHHGD---INLHNRPEGGVVATLRL 256 L VRE ++ +G I L + +G V A + + Sbjct: 315 LQNVRERLQMLYGTEAQIKLSEK-QGKVNAMVLI 347
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 98.8 bits (246), Expect = 4e-26 Identities = 36/146 (24%), Positives = 64/146 (43%) Query: 1 MQQPRIWLVEDEQSIADTLVYMLQQEGFQVSVFGRGLPALEAAAHQAPDVAILDVGLPDI 60 M I + +D+ +I L L + G+ V + A D+ + DV +PD Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 SGFELCRRLLTRYPALPVLFLTARSDEVDKLLGLEIGADDYIAKPFSPREVCARVRTVLR 120 + F+L R+ P LPVL ++A++ + + E GA DY+ KPF E+ + L Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 RLQKFAAPSPVVRVGEFVLDEQAAAI 146 ++ + L ++AA+ Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAM 146
>DPTHRIATOXIN#Diphtheria toxin signature. Length = 567 Score = 29.3 bits (65), Expect = 0.020 Identities = 20/97 (20%), Positives = 44/97 (45%), Gaps = 3/97 (3%) Query: 139 LGVTQSYTCKLEEISDFRNQMRVQFWRDFLGNSPS-IPPVLYGLHEPRPSLEK--DDEQE 195 +G S +++ D ++ + + G P + + G+ +P+ + DD+ + Sbjct: 24 IGAPPSAHAGADDVVDSSKSFVMENFSSYHGTKPGYVDSIQKGIQKPKSGTQGNYDDDWK 83 Query: 196 VFYTTALTPEMANGHLQHAHPVTLEGGEYVMFTYEGL 232 FY+T + A + + +P++ + G V TY GL Sbjct: 84 GFYSTDNKYDAAGYSVDNENPLSGKAGGVVKVTYPGL 120
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 25.7 bits (56), Expect = 0.019 Identities = 11/44 (25%), Positives = 20/44 (45%), Gaps = 8/44 (18%) Query: 44 RLGAIVWRVERAAGA--------ILMLFGLGIIWRFLHDLAVRL 79 RLG +VWR + + + +F G+ + ++A RL Sbjct: 117 RLGGMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRL 160
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 36.4 bits (84), Expect = 2e-05 Identities = 23/82 (28%), Positives = 36/82 (43%), Gaps = 4/82 (4%) Query: 59 SLYMAGGMALQWLLGPLSDRIGRRPVLLTGALIFTLACLATLFTTSMTQFLI-ARFVQGT 117 L + G A+ G LSD++G + +LL G +I + S LI ARF+QG Sbjct: 59 MLTFSIGTAV---YGKLSDQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGA 115 Query: 118 SICFIATVGYVTVQEAFEEKDR 139 + V V +++R Sbjct: 116 GAAAFPALVMVVVARYIPKENR 137
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 32.2 bits (73), Expect = 0.002 Identities = 22/93 (23%), Positives = 43/93 (46%), Gaps = 5/93 (5%) Query: 1 MAVITSVVLVAPIVGPLSGAALMHFIHWKALFGIIAAMGLVAWLGLLLTMPETVRRGDVP 60 +I S+V + VGP G + H+IHW L +I + ++ L+ + + VR Sbjct: 141 FGLIGSIVAMGEGVGPAIGGMIAHYIHWSYLL-LIPMITIITVPFLMKLLKKEVRIKG-H 198 Query: 61 FSPVGVLRDFRNVFRNRIFLLGAATLSLSYIPL 93 F G++ + F+L + S+S++ + Sbjct: 199 FDIKGIILMSVGIV---FFMLFTTSYSISFLIV 228
>ACETATEKNASE#Acetate kinase family signature. Length = 400 Score = 28.2 bits (63), Expect = 0.033 Identities = 13/49 (26%), Positives = 18/49 (36%), Gaps = 6/49 (12%) Query: 178 PEAILAGVINAMARRSANFIGRLSAQ----GPLLFTGGVSHCAAFARML 222 A LA +N A R IG +A ++FT G+ R Sbjct: 296 KRAQLA--LNVFAYRVKKTIGSYAAAMGGVDVIVFTAGIGENGPEIREF 342
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 66.0 bits (161), Expect = 4e-14 Identities = 35/221 (15%), Positives = 73/221 (33%), Gaps = 30/221 (13%) Query: 1 MMTPEQKFARWVRVSIAAFLGI-FAWFIVADIWIPLTPDSTVMRVVTP------VSSRVS 53 + TP + R V I FL I F ++ + +T +T + + Sbjct: 49 IETPVSRRPRLVAYFIMGFLVIAFILSVLG----QVEIVATANGKLTHSGRSKEIKPIEN 104 Query: 54 GYVSHVYVHNNSQVKKGDLLYELDPTPFINKVEAAQIALEQAKLSNQQLDAQIAAARAN- 112 V + V V+KGD+L +L Q +L QA+L + + N Sbjct: 105 SIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNK 164 Query: 113 -------------LRTAQYTARNDKVTLDRYQRLSTMQNVSQSDLDKVRTTWQTSEQSVS 159 + + R + +++ + + +LDK R T ++ Sbjct: 165 LPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARIN 224 Query: 160 ALNAQIQNLLIQRGERDDKRNVTLQKY--RNALEEAQLNLA 198 + + DD ++ ++ ++A+ E + Sbjct: 225 RYENLSRVE---KSRLDDFSSLLHKQAIAKHAVLEQENKYV 262 Score = 49.1 bits (117), Expect = 1e-08 Identities = 30/204 (14%), Positives = 71/204 (34%), Gaps = 21/204 (10%) Query: 86 EAAQIALEQAKLSNQQLDAQIAAARANLRTAQYTARNDKVTLDRYQRLSTMQNVSQSDLD 145 Q Q +L+ + A+ A + + +R +K LD + L Q +++ + Sbjct: 196 STWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVL 255 Query: 146 KVRTTWQTSEQSVSALNAQIQNL--------LIQRGERDDKRNVTLQKYRNA-------- 189 + + + + +Q++ + + +N L K R Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLT 315 Query: 190 --LEEAQLNLAWTKVRAETDGMVSNLQLN-PGIYATAATAVLALVNNNTDIVAD--FREK 244 L + + + +RA V L+++ G T A ++ +V + + + K Sbjct: 316 LELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNK 375 Query: 245 SLRHTAVNTDAAVVFDALPGQVFP 268 + V +A + +A P + Sbjct: 376 DIGFINVGQNAIIKVEAFPYTRYG 399
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 48.4 bits (115), Expect = 1e-08 Identities = 41/171 (23%), Positives = 71/171 (41%), Gaps = 7/171 (4%) Query: 200 REREHGTIEHLLVMPVTPFEIMLAKI-WSMGLVVLVVSGLSLVLMVQGILQVPIEGSIPL 258 R T E +L + +I+L ++ W+ L +G+ +V G + L Sbjct: 93 RMEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGY----TQWLSLL 148 Query: 259 FMLGV-ALSLFATTSIGIFMGTLARSMPQLGLLMILVLLPLQMLSGGSTPRESMPQLVQD 317 + L V AL+ A S+G+ + LA S LV+ P+ LSG P + +P + Q Sbjct: 149 YALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQT 208 Query: 318 IMLTMPTTHFVSLAQAILYRGASFAIVWPQFLTLL-AIGGVFFTIALLRFR 367 +P +H + L + I+ + + + F + ALLR R Sbjct: 209 AARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALCIYIVIPFFLSTALLRRR 259
>BORPETOXINA#Bordetella pertussis toxin A subunit signature. Length = 269 Score = 29.4 bits (65), Expect = 0.028 Identities = 14/56 (25%), Positives = 27/56 (48%) Query: 358 RFVGSPCRVTGDPLMLRRAISNLLSNAIRYTPAGQAVTIQLSESAETVRLVVENPG 413 R+V R +P RR++++++ +R P A + +ES+E + E G Sbjct: 199 RYVSQQTRANPNPYTSRRSVASIVGTLVRMAPVIGACMARQAESSEAMAAWSERAG 254
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 83.0 bits (205), Expect = 1e-20 Identities = 35/117 (29%), Positives = 61/117 (52%) Query: 2 KILIVEDEKKTGEYLTKGLTEAGFVVDLADNGLNGYHLAMTSDYDLLILDIMLPDVNGWD 61 IL+ +D+ L + L+ AG+ V + N + D DL++ D+++PD N +D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 IVRMLRAAGKGMPILLLTALGTIEHRVKGLELGADDYLVKPFAFAELLARVRTLLRR 118 ++ ++ A +P+L+++A T +K E GA DYL KPF EL+ + L Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAE 121
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 32.0 bits (72), Expect = 0.005 Identities = 29/166 (17%), Positives = 57/166 (34%), Gaps = 9/166 (5%) Query: 176 RLKNLSEADRQNFFASEEARRAVHILLIANVSQSYFNQRLAAAQLQVANDTLQNYQQSYA 235 A + A + A A L + + +A+++ + A Sbjct: 204 NFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQA 263 Query: 236 FVEKQLLTGSTTVLALEQARGMIESTRADIAKRQGQLAQANNALQLLLGSYQHLPDDSAS 295 +EK L + I++ A+ A + + A + Q+L + Q L D + Sbjct: 264 ELEKAL---EGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDLDA 320 Query: 296 SAVDLQGVTLPPSLSSAILLQRPDILEAEHSLQAANANIGAARAAF 341 S + + L ++ I EA S Q+ ++ A+R A Sbjct: 321 SREAKKQL----EAEHQKLEEQNKISEA--SRQSLRRDLDASREAK 360
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 29.8 bits (67), Expect = 0.002 Identities = 24/115 (20%), Positives = 47/115 (40%), Gaps = 12/115 (10%) Query: 18 LSLTSLAARADIIDDAIGNIQQAINDAYNPGSSRSDDDDRYDDDGRYDDGRYQGS----- 72 L LT+L A AD + +Q + SRS + ++ + D+ +Q Sbjct: 125 LKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQNVSEEEV 184 Query: 73 -------RQQSRDSQRQYDERQRQLDERRRQLDERQRQLDRDRRQLESDQRRLDD 120 ++Q Q Q +++ LD++R + +++R ++ RLDD Sbjct: 185 LRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDD 239
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 55.6 bits (134), Expect = 4e-10 Identities = 28/121 (23%), Positives = 58/121 (47%), Gaps = 8/121 (6%) Query: 646 LVLEDEEDVRQTLCEQLHQLGWLTLETASGEEALQLLEASPDIALLISDLMLPGALSGAD 705 LV +D+ +R L + L + G+ T++ + + A L+++D+++P + D Sbjct: 7 LVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDE-NAFD 64 Query: 706 VIHTARRRFPALPVLLISGQDLRPAQNPALPE--VEWLRKPF----TRAQLAQALSAAYA 759 ++ ++ P LPVL++S Q+ A + ++L KPF + +AL+ Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 760 R 760 R Sbjct: 125 R 125
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 28.7 bits (64), Expect = 0.043 Identities = 10/40 (25%), Positives = 19/40 (47%), Gaps = 2/40 (5%) Query: 227 FVYGMSGLLSGLGGVMSASRLYSANGNLGVGYELDAIAAV 266 ++G+ + GG A R + N G+G + A+ A+ Sbjct: 400 LLHGLPAGWTIYGGTQLADRYRAF--NFGIGKNMGALGAL 437
>SUBTILISIN#Subtilisin serine protease family (S8) signature. Length = 326 Score = 29.0 bits (65), Expect = 0.010 Identities = 16/65 (24%), Positives = 25/65 (38%), Gaps = 5/65 (7%) Query: 55 KLAGDNVKVTLVSSGYDLGQQVAQIDNFIAAKVDMIIL---NAADSKGIGPAVKRAKEAG 111 L +KV + I I KVD+I + D + AVK+A + Sbjct: 111 DLLI--IKVLNKQGSGQYDWIIQGIYYAIEQKVDIISMSLGGPEDVPELHEAVKKAVASQ 168 Query: 112 IVVVA 116 I+V+ Sbjct: 169 ILVMC 173
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 28.3 bits (63), Expect = 0.044 Identities = 6/21 (28%), Positives = 12/21 (57%) Query: 26 QAQIARELGIYRTTISRLLKR 46 Q + A LG+ R T+ + ++ Sbjct: 452 QIKAADLLGLNRNTLRKKIRE 472
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 115 bits (289), Expect = 4e-33 Identities = 79/271 (29%), Positives = 126/271 (46%), Gaps = 26/271 (9%) Query: 7 LKDNVIIVTGGASGIGLAIVDELLSQGAHVQMIDIHGGDRHHNGDNYHF-------WPTD 59 ++ + +TG A GIG A+ L SQGAH+ +D + + +P D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 60 ISSATEVQQTIDAIIQRWSRIDGLVNNAGVNFPRLLVDEKAPAGRYELNEAAFEKMVNIN 119 + + + + I + ID LVN AGV P L+ + L++ +E ++N Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLI---------HSLSDEEWEATFSVN 116 Query: 120 QKGVFFMSQAVARQMVKQRAGVIVNVSSESGLEGSEGQSCYAATKAALNSFTRSWSKELG 179 GVF S++V++ M+ +R+G IV V S + YA++KAA FT+ EL Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176 Query: 180 KYGIRVVGVAPGILEKTGLRTPEYEEALAWTRNITVEQLREGYT---KNAIPIGRAGKLS 236 +Y IR V+PG E + W EQ+ +G K IP+ + K S Sbjct: 177 EYNIRCNIVSPGSTETDMQWS-------LWADENGAEQVIKGSLETFKTGIPLKKLAKPS 229 Query: 237 EVADFVCYLLSARASYITGVTTNIAGGKTRG 267 ++AD V +L+S +A +IT + GG T G Sbjct: 230 DIADAVLFLVSGQAGHITMHNLCVDGGATLG 260
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 30.5 bits (69), Expect = 0.013 Identities = 22/114 (19%), Positives = 43/114 (37%), Gaps = 3/114 (2%) Query: 331 LTAVVVGILFLLVIFLSPLAGMVPGYAAAGALIYVGVLMTSSLARVKWSDLTEAVPA--- 387 L+ VV +L PL + A A ++ G L++ + + A Sbjct: 72 LSYVVDNVLLEFFYLCFPLLTVAALMAIASHVVQYGFLISGEAIKPDIKKINPIEGAKRI 131 Query: 388 FITAVMMPFSFSITEGIALGFISYCVMKIGTGRLRELSPCVIIVSLLFVLKIVF 441 F ++ F SI + + L + + ++K L +L C I + +I+ Sbjct: 132 FSIKSLVEFLKSILKVVLLSILIWIIIKGNLVTLLQLPTCGIECITPLLGQILR 185
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 57.9 bits (140), Expect = 2e-11 Identities = 66/311 (21%), Positives = 118/311 (37%), Gaps = 14/311 (4%) Query: 5 LLCSFALVLLYPSGIDMYLVGLPRIAQDLGASEAQLHIAFSVYLAGMASAML----FAGR 60 L+ + V L GI + + LP + +DL S + + + LA A G Sbjct: 7 LIVILSTVALDAVGIGLIMPVLPGLLRDLVHSN-DVTAHYGILLALYALMQFACAPVLGA 65 Query: 61 IADRSGRKPVAIVGAAIFVIASLLCAQAHTSSHFLIGRFIQGIGAGSCYVVAFAILRDTL 120 ++DR GR+PV +V A + + A A IGR + GI G+ VA A + D Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADIT 124 Query: 121 DDRRRAKVLSLLNGITCIIPVLAPVLGHLIMLKYPWQSLFYTMTGMGVMIAVLSVFILRE 180 D RA+ ++ V PVLG L M + + F+ + + + F+L E Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGL-MGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183 Query: 181 TRPTAPPQAASPQHDAGESLLNRFFLSRLLITTLSVTVILTYVNVSPVLMMEEMGFDRGT 240 + + S + ++ ++V I+ V P + G DR Sbjct: 184 SHKGERRPLRREALNPLASFRWARGM-TVVAALMAVFFIMQLVGQVPAALWVIFGEDRFH 242 Query: 241 YSMAM------ALMAMISMAVSFSTPFALSLFNPRTLMLTSQVLFLAAGVTLSLATRQAV 294 + A + S+A + T + R ++ + + L+ ATR + Sbjct: 243 WDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWM 302 Query: 295 TLIGLGMICAG 305 + ++ +G Sbjct: 303 AFPIMVLLASG 313
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 38.0 bits (88), Expect = 3e-06 Identities = 17/55 (30%), Positives = 27/55 (49%), Gaps = 3/55 (5%) Query: 69 IVDVAVDPAHQGKGLGRLVMEKLVAWLDANAFDGAYV-TLVADVP--ELYAKFGF 120 I D+AV ++ KG+G ++ K + W N F G + T ++ YAK F Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHF 146
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 814 bits (2105), Expect = 0.0 Identities = 476/549 (86%), Positives = 510/549 (92%), Gaps = 2/549 (0%) Query: 1 MDSQRNLLIIALLFVSFMIWQAWEQDKNPQPQ-QQTTQTTTTAAGSAADQGVPASGQGKL 59 MDSQRNLL+IALLFVSFMIWQAWEQDKNPQPQ QQTTQTTTTAAGSAADQGVPASGQGKL Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTTTAAGSAADQGVPASGQGKL 60 Query: 60 ITVKTDVLELTINTNGGDIEQALLLAYPKTLKSTEPFQLLETTPQFVYQAQSGLTGRDGP 119 I+VKTDVL+LTINT GGD+EQALL AYPK L ST+PFQLLET+PQF+YQAQSGLTGRDGP Sbjct: 61 ISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSGLTGRDGP 120 Query: 120 DNPANGPRPLYNVDKEAFVLADGQDELVIPLTYTDKAGNVFTKTFTLKRGGYAVNVGYSV 179 DNPANGPRPLYNV+K+A+VLA+GQ+EL +P+TYTD AGN FTKTF LKRG YAVNV Y+V Sbjct: 121 DNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKRGDYAVNVNYNV 180 Query: 180 QNASEKPLEVSTFGQLKQTAALPTSRDTQTGGLSTMHTFRGAAFSTADSKYEKYKFDTIL 239 QNA EKPLE+S+FGQLKQ+ LP DT + + +HTFRGAA+ST D KYEKYKFDTI Sbjct: 181 QNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFA-LHTFRGAAYSTPDEKYEKYKFDTIA 239 Query: 240 DNENLNVSTKNGWVAMLQQYFTTAWVPRNNGTNNFYTANLGNGVVAIGYKSQPVLVQPGQ 299 DNENLN+S+K GWVAMLQQYF TAW+P N+GTNNFYTANLGNG+ AIGYKSQPVLVQPGQ Sbjct: 240 DNENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQPVLVQPGQ 299 Query: 300 TDKLQSTLWVGPAIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKFIHSFLGNWGFSII 359 T + STLWVGP IQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLK+IHSF+GNWGFSII Sbjct: 300 TGAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVGNWGFSII 359 Query: 360 VITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRQSQEMMALYKAEKVNP 419 +ITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQR SQEMMALYKAEKVNP Sbjct: 360 IITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALYKAEKVNP 419 Query: 420 LGGCFPLIIQMPIFLALYYMLSASVELRHAPFILWIHDLSAQDPYYILPIIMGATMFFIQ 479 LGGCFPL+IQMPIFLALYYML SVELR APF LWIHDLSAQDPYYILPI+MG TMFFIQ Sbjct: 420 LGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMGVTMFFIQ 479 Query: 480 KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVVYYIVSNLVTIIQQQLIYRGLEKRG 539 KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLV+YYIVSNLVTIIQQQLIYRGLEKRG Sbjct: 480 KMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIYRGLEKRG 539 Query: 540 LHSREKKKS 548 LHSREKKKS Sbjct: 540 LHSREKKKS 548