>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.4 bits (63), Expect = 0.002 Identities = 9/47 (19%), Positives = 20/47 (42%) Query: 14 EKKAEIINLLCELSDENGFIMLKISEICEKLNVSKPTVISTFKLLEE 60 E + I+++ L + G + EI + V++ + FK + Sbjct: 11 ETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSD 57
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 56.8 bits (137), Expect = 3e-13 Identities = 21/74 (28%), Positives = 41/74 (55%) Query: 2 KKGFTMIELIFVIVILGILAAVAIPRLAATRDDAEIAKTAANIQTLVSDLGSYYTSQGSF 61 ++GFT++E++ VIVI+G+LA++ +P L ++ A+ K ++I L + L Y + Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66 Query: 62 AATSGTGSAASTTP 75 T+ + P Sbjct: 67 PTTNQGLESLVEAP 80
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 45.3 bits (107), Expect = 8e-09 Identities = 16/60 (26%), Positives = 36/60 (60%) Query: 2 KKAFTMIELIFVIVVIGVLAAIAIPRISATRDDAVLVKSMAEIRTAIEEINAYYISQGKL 61 ++ FT++E++ VIV+IGVLA++ +P + ++ A K++++I ++ Y + Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDNHHY 66
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 39.6 bits (92), Expect = 3e-05 Identities = 10/49 (20%), Positives = 25/49 (51%) Query: 484 QILANKLEMSNVDLGQALSEVIVTQKAYEASAKSITTSDEMIQTAIQMK 532 Q+ + +S V+L + + Q+ Y A+A+ + T++ + I ++ Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546
>PF08280#M protein trans-acting positive regulator Length = 530 Score = 29.8 bits (67), Expect = 0.023 Identities = 33/138 (23%), Positives = 51/138 (36%), Gaps = 17/138 (12%) Query: 35 SMVLDAAASKANSGQKITEKDVKEIVKTVDIQ-KETIEKAQNESVAKISAALEENLDEDT 93 S+ + A K +E+ TI+K IS E Sbjct: 58 SLPITEVAEKTGLTFLQLNHYCEELNAFFPDSLSMTIQKR------MISCQFTHPSKETY 111 Query: 94 KNELYENANFMQLLQVLEILNGNEKVSKFPNFSDKIANFLSVPQNVEELSNVKSVNDLID 153 +LY ++N +QLL L I NG+ +F+ +FLS S + LI Sbjct: 112 LYQLYASSNVLQLLAFL-IKNGSHSRP-LTDFARS--HFLSNS------SAYRMREALIP 161 Query: 154 LAKKFDLGLENIEISNED 171 L + F+L L +I E+ Sbjct: 162 LLRNFELKLSKNKIVGEE 179
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 195 bits (497), Expect = 2e-56 Identities = 107/447 (23%), Positives = 188/447 (42%), Gaps = 86/447 (19%) Query: 3 KIRNIAVIAHVDHGKTTMVDELLKQSGTFNE--HQNLGERVMDSNDIERERGITILSKNT 60 KI NI V+AHVD GKTT+ + LL SG E + G D+ +ER+RGITI + T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 61 AIRYKDTKINIIDTPGHADFGGEVERVLKMVDGVLLLVDAQEGVMPQTKFVVKKALSLGL 120 + ++++TK+NIIDTPGH DF EV R L ++DG +LL+ A++GV QT+ + +G+ Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121 Query: 121 RPIVVVNKIDKPAGDPDRVINEIFDLFVA----------------------------LDA 152 I +NKID+ D V +I + A ++ Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEG 181 Query: 153 NDEQLE--------------------------FPVVYAAAKNGYAKLKLSDENKDMQPLF 186 ND+ LE FPV + +AKN N + L Sbjct: 182 NDDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKN----------NIGIDNLI 231 Query: 187 ETILAHVPAPSGSDENPLQLQVFTLDYDNYVGKIGIARIFNGKIAKNQNVMLAKADGTKT 246 E I + + ++ L +VF ++Y ++ R+++G + +V ++ K Sbjct: 232 EVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRIS----EKE 287 Query: 247 TGRISKLIGFMGLDRIDINEAGTGDIVAIAGFDA---LDVGDSVVDPNNPHPLDPLHIEE 303 +I+++ + + I++A +G+IV + +GD+ + P +PL Sbjct: 288 KIKITEMYTSINGELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQRERIENPL---- 343 Query: 304 PTLSVVFSVNDGPLAGTEGKHVTSNKIDERLANEMKTNIAMKYENIGEGKFKVSGRGELQ 363 P L + + +++ Y + + +S G++Q Sbjct: 344 PLLQTTVEPSKPQQREMLLDALLE------ISDSDPL--LRYYVDSATHEIILSFLGKVQ 395 Query: 364 ITILAENMRRE-GYEFLLGRPEVIVKE 389 + + ++ + E + P VI E Sbjct: 396 MEVTCALLQEKYHVEIEIKEPTVIYME 422 Score = 41.8 bits (98), Expect = 7e-06 Identities = 20/80 (25%), Positives = 29/80 (36%), Gaps = 1/80 (1%) Query: 396 EPYELLVIDAPDDTTGTVIEKLGKRKAEMVSMNPTGDGQTRIEFEIPARGLIGFRSQFLT 455 EPY I AP + K A +V + + + EIPAR + +RS Sbjct: 537 EPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ-LKNNEVILSGEIPARCIQEYRSDLTF 595 Query: 456 DTKGEGVMNHSFLEFRPLSG 475 T G V + +G Sbjct: 596 FTNGRSVCLTELKGYHVTTG 615
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 29.0 bits (65), Expect = 0.012 Identities = 8/50 (16%), Positives = 22/50 (44%), Gaps = 7/50 (14%) Query: 1 MKFVAICLLLLTSIFLIACSANQASNKINNSEIKELGKKYG---GVYVFN 47 M+++ +C++ L + +A A+ + +IK + G+ + Sbjct: 1 MRYIRLCIISLLATLPLAVHASPQPLE----QIKLSESQLSGRVGMIEMD 46
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 31.6 bits (72), Expect = 0.006 Identities = 16/85 (18%), Positives = 27/85 (31%), Gaps = 8/85 (9%) Query: 102 FGIFIGYFPVLSGSDIFYIGALLLFIVAVFASFVILPVALYPLHYEKYLLNNNTKKLYFS 161 + +L L I + +F LP+ PL+ +L Sbjct: 125 LARIMDMLALLL---FLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFL-----ALTKAG 176 Query: 162 WLTFILTIPVAFFVFLALLVLYYIL 186 L F+ + +A + LL L L Sbjct: 177 SLIFLNGLMLALPLITLLLTLNLAL 201
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 29.3 bits (65), Expect = 0.038 Identities = 22/101 (21%), Positives = 45/101 (44%), Gaps = 21/101 (20%) Query: 6 ILDEKDESVKEARKFINFLKANFSNYEIR--SSKQARLIALLNEENDLFDRLNRTNFAEV 63 I+D+ D ++A + I+ L+ +SN I+ + K +N+ NDL ++ N Sbjct: 46 IVDKNDRDNRQAFEGISQLREEYSNKAIKNPTKKNQYFSDFINKSNDLINKDN------- 98 Query: 64 SKRLGEIKEQITLVILDIKDEITKDFGEQNYEIYKKALSKE 104 L+ ++ + + FG+Q Y I+ +S + Sbjct: 99 ------------LIDVESSTKSFQKFGDQRYRIFTSWVSHQ 127
>PF07675#Cleaved Adhesin Length = 1358 Score = 27.4 bits (60), Expect = 0.015 Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 1/32 (3%) Query: 73 SFSGSNFIKSFEKASEYINNYYKNNGDNFIYT 104 SF+G N AS YIN N DN++ T Sbjct: 1116 SFAGHNSAICVSSAS-YINFEGPQNPDNYLVT 1146
>PF05860#haemagglutination activity domain. Length = 117 Score = 61.0 bits (148), Expect = 7e-13 Identities = 24/130 (18%), Positives = 43/130 (33%), Gaps = 21/130 (16%) Query: 45 IIADPGASNRPDILKAPNETLIINITNPDSKGVSINEYSRFNTPTTGTILNNSNKNIDTK 104 I D + T II + + + F+ PT+GT N+ NI Sbjct: 3 ITPDTTLPI-NSNITTEGNTRIIERGTQAGSNLFHS-FQEFSVPTSGTAFFNNPTNI--- 57 Query: 105 IAGQIDANYRLNKEASLIINKVNSAEKSSLKGNLEVAGSRADVVIANPNGISVDGLNMIN 164 II++V S++ G + A++ + NPNGI ++ Sbjct: 58 ---------------QNIISRVTGGSVSNIDGLIRANA-TANLFLINPNGIIFGQNARLD 101 Query: 165 SRSLTLTTGN 174 + + Sbjct: 102 IGGSFVGSTA 111
>VACCYTOTOXIN#Helicobacter pylori vacuolating cytotoxin signature. Length = 1291 Score = 30.4 bits (68), Expect = 0.033 Identities = 48/196 (24%), Positives = 63/196 (32%), Gaps = 40/196 (20%) Query: 443 GAVGLGKLAKDGVVAVGKASVNGASKIANQIETNALNKITQNLPKNAMY--NKTNGIISI 500 G V +G+L G S SK+ ++ N L N + + NKT+ Sbjct: 255 GNVWMGRLQYVGAYLAPSYSTINTSKVTGEVNFNHLTVGDHNAAQAGIIASNKTHI---- 310 Query: 501 ENKNFIQVGTDTLTNSPILREIGYNKGSYALKGNHYVSTADGLYMIGKNALQKTGTKVIT 560 GT L S L I +G Y K N S N Q Sbjct: 311 --------GTLDLWQSAGLNIIAPPEGGYKDKPNDKPS----------NTTQNNAKNDKQ 352 Query: 561 NSSLKDIMTPSINPISNMYYNGTNFAIRNSTKITDFINGYFIPGTPDYVFWGGVGALTNM 620 SS + T INP N A + + T I+G F G V + + Sbjct: 353 ESSQNNSNTQVINP--------PNSAQKTEIQPTQVIDGPFAGGKNTVV------NINRI 398 Query: 621 GINYDTTIEN--FKNS 634 N D TI FK S Sbjct: 399 NTNADGTIRVGGFKAS 414
>PF06580#Sensor histidine kinase Length = 349 Score = 32.1 bits (73), Expect = 0.002 Identities = 17/89 (19%), Positives = 28/89 (31%), Gaps = 17/89 (19%) Query: 154 QKIEDKNIKIKIDSDEKFAYLSVEDNGGGIDKNVIDEIFKPYFTTKEDAKGTGLGLYMSK 213 Q + I +K D L VE+ G KN + TG GL + Sbjct: 274 QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTK--------------ESTGTGLQNVR 319 Query: 214 QIIDQF---NAEITAGNSDNGACFLIKLP 239 + + A+I ++ +P Sbjct: 320 ERLQMLYGTEAQIKLSEKQGKVNAMVLIP 348
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 84.9 bits (210), Expect = 3e-21 Identities = 33/113 (29%), Positives = 56/113 (49%) Query: 9 TVLLVEDDSDSKKIMHDVLSDNFEKVFTAQNGDEGLKKFKKYNPNMVITDVFMPISDGLD 68 T+L+ +DD+ + +++ LS V N + + ++V+TDV MP + D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 69 MTRYIKEISKDTPVIVLSAHSEKETLLKAIDVGVDKYLIKPIMADDLLKTIEN 121 + IK+ D PV+V+SA + T +KA + G YL KP +L+ I Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 126 bits (317), Expect = 1e-35 Identities = 81/368 (22%), Positives = 128/368 (34%), Gaps = 69/368 (18%) Query: 1 MKNIALAMVAATAVFASNAAY----------------NYEITPTIGGVHPEGNLRVKDHN 44 MK A+A+ A A FA+ A Y T I P ++ Sbjct: 1 MKKTAIAIAVALAGFATVAQAAPKDNTWYTGAKLGWSQYHDTGFINNNGPTHENQLGAGA 60 Query: 45 FVGIRAARNLEDFFFDQVELGVDYSQKLKERTGDVVREGRALRYHANLVKNIVDFGPVSL 104 F G + + E+G D+ ++ + +A + + Sbjct: 61 FGGYQVNPYV------GFEMGYDWLGRMPYKGSVENGAYKAQGVQLTAKLGYPITDDLDI 114 Query: 105 YGLIGAGYEDVPAIFVK---NEDGGF-GQYGLGLRYQVTDRFALKAEARDAIKFEHADHN 160 Y +G N D G + G+ Y +T A + E + H Sbjct: 115 YTRLGGMVWRADTKSNVYGKNHDTGVSPVFAGGVEYAITPEIATRLEYQWTNNI-GDAHT 173 Query: 161 LFYSLGFG---IGLDSKAAPVVAAAPAAPAAAPAPVLDDDNDGVPNDIDQCPNTPAGVVV 217 + G +G+ + AA APA APAP + Sbjct: 174 IGTRPDNGMLSLGVSYRFGQGEAAPVVAPAPAPAPEV----------------------- 210 Query: 218 DERGCEKVIVLRDLDVNFAFDSYKVGPKYAAEIKKVADFMGEH--PDYKVVLAGHTDSVG 275 K L+ DV F F+ + P+ A + ++ + D VV+ G+TD +G Sbjct: 211 ----QTKHFTLKS-DVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGSVVVLGYTDRIG 265 Query: 276 AEAYNQKLSEKRAKAVAEVLAGYGVEKAKISTVGYGELKPIATNKT---------KEGRA 326 ++AYNQ LSE+RA++V + L G+ KIS G GE P+ N + A Sbjct: 266 SDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLA 325 Query: 327 QNRRVEAT 334 +RRVE Sbjct: 326 PDRRVEIE 333
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 96.4 bits (240), Expect = 6e-24 Identities = 72/357 (20%), Positives = 140/357 (39%), Gaps = 14/357 (3%) Query: 3 KSVLPLSFIVASRFFGLFIVLPVLS--LYALNLSGANEFLVGLIVGVYAISQMIFQVPFG 60 + ++ + VA G+ +++PVL L L S G+++ +YA+ Q G Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64 Query: 61 ALSDKIGRKKALTIGLLIFVAGSIVCALASEIYTMLFGRFLQGV-GAVGAVATAMISDFV 119 ALSD+ GR+ L + L + A A ++ + GR + G+ GA GAVA A I+D Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124 Query: 120 AEENRSKAMAIMGAFIGLSFTLSMVLGPLLVKDYGLSSLFYLSAALSLLCVVLLYTVVP- 178 + R++ M A G VLG L+ + + F+ +AAL+ L + ++P Sbjct: 125 DGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPFFAAAALNGLNFLTGCFLLPE 183 Query: 179 ------KEIKVSAKAEKVPFGKLFLQKDYMIINFTSFMQKMLTSIAFLVIPIVLVKEYGY 232 + ++ A F + F+ +++ + + I + + Sbjct: 184 SHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHW 243 Query: 233 ESSELYKVYTLGAVLGFLAMG-LAGALGDGKGLSKVILIAGTLLFALTYVIFALSFTKFI 291 +++ + +L LA + G + L + + ++ T I T+ Sbjct: 244 DATTIGISLAAFGILHSLAQAMITGPV--AARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 292 FVVGIAIFFIGFNLHEPIMQSTATKFVKSSQKGTALGIFNSFGYFGSFVGGAFGGYI 348 I + + P +Q+ ++ V ++G G + S VG I Sbjct: 302 MAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAI 358
>STREPTOPAIN#Streptopain (C10) cysteine protease family signature. Length = 398 Score = 32.0 bits (72), Expect = 0.006 Identities = 34/133 (25%), Positives = 58/133 (43%), Gaps = 29/133 (21%) Query: 179 LANKIFEEKSANFSKNSKESLELLLTPLGEK--ITSFEKRVNDAHSDSQKSAGELSAQLK 236 LAN +F ++ NF++N KE+ + +T + + I + + D D GELS Sbjct: 21 LANPVFADQ--NFARNEKEAKDSAITFIQKSAAIKAGARSAEDIKLDKVNLGGELSGSNM 78 Query: 237 EVVELGKNMSKEANSLSTALKGSNKVLGNWGEMQLERTLEAAGLEKGTHYATQESFDASG 296 V N+S + + K S ++LG Y+T SFDA+G Sbjct: 79 YVY----NISTGGFVIVSGDKRSPEILG---------------------YSTSGSFDANG 113 Query: 297 KKLIPDFVINFPD 309 K+ I F+ ++ + Sbjct: 114 KENIASFMESYVE 126
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 58.3 bits (141), Expect = 6e-11 Identities = 36/148 (24%), Positives = 66/148 (44%), Gaps = 21/148 (14%) Query: 5 IGTAGHIDHGKTALIKELNGFEG---------------DNLEEEKKRGITIDLSFSNLSK 49 IG H+D GKT L + L G DN E++RGITI ++ Sbjct: 6 IGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITSFQW 65 Query: 50 NDENIAFIDVPGHENLIKTMISGAYGFDACLFVVAANDGLMPQSLEHLEILNLLGVKSII 109 + + ID PGH + + + D + +++A DG+ Q+ L +G+ +I Sbjct: 66 ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIPTIF 125 Query: 110 VALTKCDLVDEATINLRK--KEIRDEIS 135 + +D+ I+L ++I++++S Sbjct: 126 FI----NKIDQNGIDLSTVYQDIKEKLS 149
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 32.2 bits (73), Expect = 0.004 Identities = 18/84 (21%), Positives = 38/84 (45%), Gaps = 2/84 (2%) Query: 115 IDEMINKANQLYERGNKFEALKIYENIAVYNQSLSNYNLGVSQMKQ--EKCDEAIISFNK 172 ++++ + A Y+ G +A K+++ + V + S + LG+ +Q + D AI S++ Sbjct: 36 LEQLYSLAFNQYQSGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSY 95 Query: 173 AITDRENTAVSAINAAVCSLELNN 196 +AA C L+ Sbjct: 96 GAIMDIKEPRFPFHAAECLLQKGE 119
>BORPETOXINA#Bordetella pertussis toxin A subunit signature. Length = 269 Score = 37.5 bits (86), Expect = 4e-05 Identities = 26/90 (28%), Positives = 44/90 (48%), Gaps = 8/90 (8%) Query: 237 KLFLDESGQRDLQARYERGGEGHGHFKAYLNELVWD--YFKDAREKFEYYQNNPDEVAKI 294 +++L+ Q ++A ER G G GHF Y+ E+ D ++ A FEY D +I Sbjct: 95 EVYLEHRMQEAVEA--ERAGRGTGHFIGYIYEVRADNNFYGAASSYFEYVDTYGDNAGRI 152 Query: 295 L--DLGAKKAQNVAHTTI--KKVREAVGIY 320 L L +++ +AH I + +R +Y Sbjct: 153 LAGALATYQSEYLAHRRIPPENIRRVTRVY 182
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 34.8 bits (80), Expect = 6e-04 Identities = 30/146 (20%), Positives = 64/146 (43%), Gaps = 7/146 (4%) Query: 196 KNIRVGIIGRVNVGKSSLLNALVKESRAV--VSDV-AGTTIDPVNEIYEHDGRVFEFVDT 252 K I +G++ V+ GK++L +L+ S A+ + V GTT + G + T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 253 AGIRKRGKIEGIERYA----LNRTEKILEETDVALLVLDSSEPLTELDERIAGIASKFEL 308 + + K+ I+ L + L D A+L++ + + + + K + Sbjct: 62 SFQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGI 121 Query: 309 GVIIVLNKWDKSSEEFDELCKEIKDR 334 I +NK D++ + + ++IK++ Sbjct: 122 PTIFFINKIDQNGIDLSTVYQDIKEK 147
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 31.8 bits (72), Expect = 0.004 Identities = 21/89 (23%), Positives = 36/89 (40%), Gaps = 10/89 (11%) Query: 234 DGAKMIIGRDESDNNAL---LAHPNDKFEQVKFKESDDIVGAVSFISKNASKADKEL--A 288 DG + +G++ N+ L A+ +D + K +I G+ + + L A Sbjct: 466 DGIDIYVGKNNIQNDYLTLKFANKHDIWFHTK-----NIPGSHVIVKNIMDIPESTLLEA 520 Query: 289 ARLALAYTKASKDDEFEVSIANEKFIIKP 317 A LA Y+K+ V K + KP Sbjct: 521 ANLAAYYSKSQNSSNVPVDYTEVKNVKKP 549
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 43.1 bits (101), Expect = 2e-06 Identities = 32/127 (25%), Positives = 46/127 (36%), Gaps = 4/127 (3%) Query: 103 EQNESVALLQKQLEEKSNQVKELNVAKAQISQLQREKEEMESAITAKAELALNEKLKEEK 162 E E+VA KQ ++ E N A + Q + E+ KA NE + Sbjct: 1035 ETTETVAENSKQ----ESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGS 1090 Query: 163 EKIQKAADEQNELKFRQKEEQLKQLQEQLQIAQRKAEQGSMQLQGEVQELAIEEWLREKF 222 E + E E +KEE+ K E+ Q + Q S + + E RE Sbjct: 1091 ETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPAREND 1150 Query: 223 PFDTIDE 229 P I E Sbjct: 1151 PTVNIKE 1157 Score = 33.1 bits (75), Expect = 0.002 Identities = 27/193 (13%), Positives = 61/193 (31%), Gaps = 26/193 (13%) Query: 57 ESLRTKEQQLQDQKEKFEEEIKKATQIQLKMERARLQDELRKEILDEQNESVALLQKQLE 116 + E Q+ K E+ + AT+ + + + + + NE + E Sbjct: 1036 TTETVAENSKQESKTV-EKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKE 1094 Query: 117 EKSNQVKELNVAKAQISQLQREKEEMESAITAKAELALNEKLKEEKEKIQKAADEQNELK 176 ++ + KE A + EK K E EK Q+ +++ Sbjct: 1095 TQTTETKE------------------------TATVEKEEKAKVETEKTQEVPKVTSQVS 1130 Query: 177 FRQKEEQLKQLQEQLQIAQRKAEQGSMQLQGEVQELAIEEWLREKFPFDTIDEIKKGARG 236 +Q++ + Q Q + + + Q + A E ++ + + + Sbjct: 1131 PKQEQSETVQPQAEPA-RENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTV 1189 Query: 237 ADCVQIVHTRESQ 249 +V E+ Sbjct: 1190 NTGNSVVENPENT 1202
>PF07328#T-DNA border endonuclease VirD1 Length = 144 Score = 35.8 bits (82), Expect = 1e-05 Identities = 27/118 (22%), Positives = 50/118 (42%), Gaps = 9/118 (7%) Query: 7 DKVLSIRITSQQNSKLSDMARELKISRSEIISYLIDN-GTINSESIKKKELYPTIITYFA 65 DKV+S+++T + ++ EL ++R+ + G K EL + A Sbjct: 20 DKVISVKMTEAELAEFDAQIAELGLNRNRALRIAARRIGGFVENDAKTVELLRDMSRAIA 79 Query: 66 RPFNNINQMAKKLNIAYKTSGNIDLKTILQTQ----EELYKVQSVLTEILSLIRNNYD 119 NINQ+AK A + + + + + EL K+ +VL ++ + R D Sbjct: 80 GVATNINQIAK----AANRTHDPAYHSFMAERKVLGLELSKLSAVLAPLMEVSRRRSD 133
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 28.1 bits (62), Expect = 0.030 Identities = 7/45 (15%), Positives = 15/45 (33%) Query: 49 EKLEKEIDKREEERDKVNKEILKEVSNIQDKEEQNKQLRLLLQEK 93 LEK ++ + +I + E + +L L+ Sbjct: 228 ADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGA 272
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 80.3 bits (198), Expect = 1e-19 Identities = 28/135 (20%), Positives = 65/135 (48%), Gaps = 3/135 (2%) Query: 4 VLMIEDDPEFAQILSEYLDSFNIKVTNFEDPYLGLSA-GIKNYDLLILDLTLPGIDGLEV 62 +L+ +DD +L++ L V + + DL++ D+ +P + ++ Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 63 CKEIRQKY-DIPIIISSARSDISDKVVGLQLGADDYLPKPYDPKEMYARI-TSLIRRYKK 120 I++ D+P+++ SA++ + + GA DYLPKP+D E+ I +L ++ Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125 Query: 121 TNEVQEEVVDSAFRI 135 ++++++ D + Sbjct: 126 PSKLEDDSQDGMPLV 140
>PF06291#Lambda prophage Bor protein Length = 102 Score = 26.2 bits (57), Expect = 0.028 Identities = 9/18 (50%), Positives = 14/18 (77%) Query: 1 MKKFIFALSAALLLAGCA 18 MKK +F+ + A+L+ GCA Sbjct: 6 MKKMLFSAALAMLITGCA 23
>SHIGARICIN#Ribosome inactivating protein family signature. Length = 289 Score = 28.3 bits (63), Expect = 0.015 Identities = 29/129 (22%), Positives = 48/129 (37%), Gaps = 33/129 (25%) Query: 76 VMAYRNEDTAWGFPFYFKFNSADIQAKAQGFTNSDKNVTIKYYGYRISM----------- 124 VM YR DT+ YF ++ +A F ++ + VT+ Y G + Sbjct: 94 VMGYRAGDTS-----YFFNEASATEAAKYVFKDAKRKVTLPYSGNYERLQIAAGKIRENI 148 Query: 125 ---LNEFRNAISIKDSGTNTSWPIASYVLYFIL---------FISLVIWIRKINKAFAP- 171 L +AI+ S AS ++ I FI I ++++K F P Sbjct: 149 PLGLPALDSAITTLFYYNANS--AASALMVLIQSTSEAARYKFIEQQIG-KRVDKTFLPS 205 Query: 172 -KVENLETK 179 + +LE Sbjct: 206 LAIISLENS 214
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 433 bits (1114), Expect = e-152 Identities = 140/385 (36%), Positives = 226/385 (58%), Gaps = 8/385 (2%) Query: 3 IVIVEDDINMRKSLEIALGEYEELNIKSYKSAVEALKKLSDDT-DLIITDINMPKMDGLE 61 I++ +DD +R L AL +++ +A + ++ DL++TD+ MP + + Sbjct: 6 ILVADDDAAIRTVLNQAL-SRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 FIKELN---GKFDVIIMTGNATLNKAIESVRLGVKDFLTKPFDVSTLYEAIKRVEALKQK 118 + + V++M+ T AI++ G D+L KPFD++ L I R A ++ Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 119 TPKSIKKVETKSENNGFLATSKALEATLYIALKAARTDASVMLSGESGVGKEVFAKFIHA 178 P K + + + S A++ + + +TD ++M++GESG GKE+ A+ +H Sbjct: 125 RPS--KLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD 182 Query: 179 NSPRKDAAFIALNMAAIPENLIESELFGFEKGAFTDAATTKKGQFELANSGTLFLDEIGE 238 R++ F+A+NMAAIP +LIESELFG EKGAFT A T G+FE A GTLFLDEIG+ Sbjct: 183 YGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGD 242 Query: 239 MPINLQPKLLRALQEREITRLGATKSEKIDVRIICATNANLELAMKEGRFREDLFYRLNT 298 MP++ Q +LLR LQ+ E T +G + DVRI+ ATN +L+ ++ +G FREDL+YRLN Sbjct: 243 MPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNV 302 Query: 299 IPLFIPPLRERKDEILPIAQDALEKCCKEYGFEAKNFSKAAKEELLGYDYPGNIRELISV 358 +PL +PPLR+R ++I + + +++ KE + K F + A E + + +PGN+REL ++ Sbjct: 303 VPLRLPPLRDRAEDIPDLVRHFVQQAEKEGL-DVKRFDQEALELMKAHPWPGNVRELENL 361 Query: 359 VQRAAILSEGDEILPKDLFLQARSK 383 V+R L D I + + + RS+ Sbjct: 362 VRRLTALYPQDVITREIIENELRSE 386
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 118 bits (298), Expect = 3e-30 Identities = 54/150 (36%), Positives = 82/150 (54%), Gaps = 13/150 (8%) Query: 3 NIRNFSIIAHIDHGKSTLADRL------IQECGAVSDREMSSQIMDTMDIEKERGITIKA 56 I N ++AH+D GK+TL + L I E G+V + + D +E++RGITI+ Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSV---DKGTTRTDNTLLERQRGITIQT 58 Query: 57 QSVRLNYALNGQNFVLNLIDTPGHVDFSYEVSRSLASCEGALLVVDASQGVEAQTIANVY 116 + +N +N+IDTPGH+DF EV RSL+ +GA+L++ A GV+AQT + Sbjct: 59 GITSFQW----ENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114 Query: 117 IALENNLEIIPVINKIDLPAADPARVKDEI 146 + + I INKID D + V +I Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDI 144 Score = 86.1 bits (213), Expect = 1e-19 Identities = 50/218 (22%), Positives = 88/218 (40%), Gaps = 23/218 (10%) Query: 161 SAKTGVGIKELLEAIITRIPAPNGDVSKPTKALIYDSWFDNYLGALALVRVYDGEISKND 220 SAK +GI L+E I + + ++ + LA +R+Y G + D Sbjct: 220 SAKNNIGIDNLIEVITNKFYSSTHRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRD 279 Query: 221 EILVMGTGKKHIV-LDLMYPNPIAPIKTKTLSAGEVGIV---VLGLKNVSDVQVGDTITQ 276 + + K I + + K +GE+ I+ L L +V +GDT Sbjct: 280 SVRISEKEKIKITEMYTSINGEL--CKIDKAYSGEIVILQNEFLKLNSV----LGDTK-- 331 Query: 277 SRNPLKEPVGGFERAKPFVFAGLYPIETDKFEDLRDALDKLKLNDSSISYE--PETSVAL 334 P +E + E P + + P + + E L DAL ++ +D + Y T + Sbjct: 332 -LLPQRERI---ENPLPLLQTTVEPSKPQQREMLLDALLEISDSDPLLRYYVDSATHEII 387 Query: 335 GFGFRVGFLGLLHMEVVKERLEREFDLDLIATAPTVTY 372 + FLG + MEV L+ ++ +++ PTV Y Sbjct: 388 -----LSFLGKVQMEVTCALLQEKYHVEIEIKEPTVIY 420 Score = 44.5 bits (105), Expect = 1e-06 Identities = 22/84 (26%), Positives = 31/84 (36%), Gaps = 10/84 (11%) Query: 399 ILEPYVKATIITPSEFLGNIITLLNNRR----GIQTKMDYITTDRVLLEYDIPMNEIVMD 454 +LEPY+ I P E+L T Q K + V+L +IP I + Sbjct: 535 LLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQLK-----NNEVILSGEIPARCI-QE 588 Query: 455 FYDKLKSSTKGYASFDYEPSDYRV 478 + L T G + E Y V Sbjct: 589 YRSDLTFFTNGRSVCLTELKGYHV 612
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 63.3 bits (154), Expect = 2e-13 Identities = 24/113 (21%), Positives = 54/113 (47%), Gaps = 7/113 (6%) Query: 2 KILIVENEIYLAGSMASKLADFGYDCEIAKSVKEALKF---ENFDVVLLSTTLPGQDFYP 58 IL+ +++ + + L+ GYD I + ++ + D+V+ +P ++ + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 59 VIEKFKSS----IIILLIAYINSDTVLKPIQAGAVDYIQKPFMIEELVRKIRH 107 ++ + K + ++++ A T +K + GA DY+ KPF + EL+ I Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117 Score = 31.3 bits (71), Expect = 0.004 Identities = 7/27 (25%), Positives = 16/27 (59%) Query: 270 TELSKKLGISRKSLWEKRKKYDVSKKK 296 + + LG++R +L +K ++ VS + Sbjct: 453 IKAADLLGLNRNTLRKKIRELGVSVYR 479
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 43.3 bits (102), Expect = 1e-07 Identities = 15/44 (34%), Positives = 31/44 (70%) Query: 2 KKRAFTMIELIFVIVVVGILAAIMIPKLNRNASREAANQILTHI 45 K+R FT++E++ VIV++G+LA++++P L N + + ++ I Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDI 49
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 124 bits (312), Expect = 3e-36 Identities = 68/276 (24%), Positives = 115/276 (41%), Gaps = 37/276 (13%) Query: 7 FFAVFAFVFGICVGSFSNVLIYRLP------------------------RSESINFPASH 42 + F+F + +GSF NV+I+RLP ++ P S Sbjct: 14 LYFSLVFLFSLMIGSFLNVVIHRLPIMLEREWQAEYRSYFNPDDEGVDEPPYNLMVPRSC 73 Query: 43 CPKCSHKLNFYHNVPLFSWLFLGGKCAFCKQKISLVYPLVELVSGLFFLICFFKECGEVL 102 CP C+H + N+PL SWL+L G+C C+ IS YPLVEL++ L + Sbjct: 74 CPHCNHPITALENIPLLSWLWLRGRCRGCQAPISARYPLVELLTALLSVAVAMT------ 127 Query: 103 SLETLLYALFLGLCFIMLLALSVIDIRYKAVPDPLLFAALFFAFAYALLLFIFKGNFAQI 162 L L L +L+AL+ ID+ +PD L L+ + LL A I Sbjct: 128 -LAPGWGTLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVI 186 Query: 163 LNLILFGFIFWALRFVVSYAMKREAMGSADIFIAAIIGAILPAKLALVAIYLAALLTLPV 222 + + + W+L + +E MG D + A +GA L + + + L++L+ + Sbjct: 187 GAMAGYL-VLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFM 245 Query: 223 YALVRK-----KGYELAFVPFLSLGLLVTYTFDAQI 253 + + + F P+L++ + + I Sbjct: 246 GIGLILLRNHHQSKPIPFGPYLAIAGWIALLWGDSI 281
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 68.4 bits (167), Expect = 2e-15 Identities = 32/124 (25%), Positives = 57/124 (45%), Gaps = 16/124 (12%) Query: 124 VRLPAAMLFDKDSAEISGEDAKLFLKRIGMIIAKM-PNEVKTDIIGYTDNTNPSKDSIYK 182 L + +LF+ + A + E L ++ ++ + P + ++GYTD Y Sbjct: 215 FTLKSDVLFNFNKATLKPEGQAA-LDQLYSQLSNLDPKDGSVVVLGYTDRIGSDA---Y- 269 Query: 183 NNWQLSTARALSVLEELVSDGVPQERLITSGRASFDPIASNSTDEGR---------AKNN 233 N LS RA SV++ L+S G+P +++ G +P+ N+ D + A + Sbjct: 270 -NQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNVKQRAALIDCLAPDR 328 Query: 234 RVEI 237 RVEI Sbjct: 329 RVEI 332
>FLGBIOSNFLIP#Escherichia coli: Flagellar biosynthetic protein FliP signature. Length = 245 Score = 255 bits (654), Expect = 2e-88 Identities = 107/239 (44%), Positives = 161/239 (67%), Gaps = 1/239 (0%) Query: 5 LSLAVLFCVVFGADPALPTINLSLNSPQNAEQLVNSLNVLLILTALALAPSLIFMMTSFL 64 ++ +L+ + A LP I S P + + L+ +T+L P+++ MMTSF Sbjct: 7 VAPVLLWLITPLAFAQLPGIT-SQPLPGGGQSWSLPVQTLVFITSLTFIPAILLMMTSFT 65 Query: 65 RLVIVFSFLRQAMGTQQVPPSTVLISLAMVLTFFIMEPVGQRSYDEGIKPYIAEQIGYEE 124 R++IVF LR A+GT PP+ VL+ LA+ LTFFIM PV + Y + +P+ E+I +E Sbjct: 66 RIIIVFGLLRNALGTPSAPPNQVLLGLALFLTFFIMSPVIDKIYVDAYQPFSEEKISMQE 125 Query: 125 MLDKSLKPFKEFMVKNTREKDLALFFRIRNLQNPANIEDIPLSIAMSAFMISELKTSFEI 184 L+K +P +EFM++ TRE DL LF R+ N E +P+ I + A++ SELKT+F+I Sbjct: 126 ALEKGAQPLREFMLRQTREADLGLFARLANTGPLQGPEAVPMRILLPAYVTSELKTAFQI 185 Query: 185 AFLLYLPFLVIDMVVSSVLMAMGMMMLPPVMISLPFKLLIFVLVDGWNLLIGNLVKSFH 243 F +++PFL+ID+V++SVLMA+GMMM+PP I+LPFKL++FVLVDGW LL+G+L +SF+ Sbjct: 186 GFTIFIPFLIIDLVIASVLMALGMMMVPPATIALPFKLMLFVLVDGWQLLVGSLAQSFY 244
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 40.6 bits (95), Expect = 4e-06 Identities = 17/88 (19%), Positives = 29/88 (32%), Gaps = 7/88 (7%) Query: 38 SSGKVDKIFVDVSSHVKKGDALASLDQTSLEIALKKAKNDLALAKNAKEFAKSTFNKFSQ 97 + V +I V V+KGD L L E K ++ L A+ + + Sbjct: 103 ENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQI------- 155 Query: 98 VKDVTSKQEFDEVKYKFDEAALRVQAAE 125 + + E+K + V E Sbjct: 156 LSRSIELNKLPELKLPDEPYFQNVSEEE 183
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 627 bits (1619), Expect = 0.0 Identities = 261/1036 (25%), Positives = 469/1036 (45%), Gaps = 42/1036 (4%) Query: 1 MIKTAINRPITTLMIFLSLVVFGIYSLKTMNVNLYPQVNIPIVKI-TTYANGDMNYIKTK 59 M I RPI ++ + L++ G ++ + V YP + P V + Y D ++ Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 60 ITQKIEDEVSSIEGIKKLYSTSF-DNLSVVSIEFELNKDLESATNDVRDKMQKARLN--- 115 +TQ IE ++ I+ + + STS +++ F+ D + A V++K+Q A Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 116 --ANYEIEKLNGLSSAVFSLFITRLDGNETK--LMQEIDDVAKPFLERISGVSKVKTNGF 171 I SS + + T+ + + K L R++GV V+ G Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 172 LEPAVKILLDRFKLDKNALSANEVANLIKVENLKAPLGKIENEK------IQMAIKSNFS 225 + A++I LD L+K L+ +V N +KV+N + G++ + +I + Sbjct: 181 -QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTR 239 Query: 226 AKSIDEIRNLTIK-----QGVFLKDIASVDLAYKDANEAAIMDKKSGVLLGLELAPDANA 280 K+ +E +T++ V LKD+A V+L ++ N A ++ K LG++LA ANA Sbjct: 240 FKNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANA 299 Query: 281 LTVIALAKAKLDQFKSLLGNEYDVKIAYDKSEVIQKHIDQTAFDMILGVLLTIVIVYLFL 340 L KAKL + + V YD + +Q I + + ++L +++YLFL Sbjct: 300 LDTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFL 359 Query: 341 RNFSITIISVVAIPTSIVATFFIINALGYDINRLSLIALTLGIGIFIDDAIVVTENIASK 400 +N T+I +A+P ++ TF I+ A GY IN L++ + L IG+ +DDAIVV EN+ Sbjct: 360 QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERV 419 Query: 401 LKDEP-NALKASFTGIKEIAFSVFAISLVLLCVFVPIAFMSGIVGKYFNSFAMSVAAGIV 459 + ++ +A+ + +I ++ I++VL VF+P+AF G G + F++++ + + Sbjct: 420 MMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 460 ISFFVSIFLVPTLSARFVNAKESS-------FYIKGEPFFEALENFYEKILTLALKFKLL 512 +S V++ L P L A + + F+ F+ N Y + L Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 513 FLAATLLVVVCSFALAKFVGGDFMPSEDNSEFNIYFKLDPSLSLQASKERLKD--KISLI 570 +L L+V L + F+P ED F +L + + +++ L L Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 571 NADPQVAYAYFILGYTDAKQ-PYLVKAYVRLKELKDRANHE-RQNAIMQRFRDKLKS--D 626 N V + + G++ + Q A+V LK ++R E A++ R + +L D Sbjct: 600 NEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRD 659 Query: 627 DMSVIVADLPVVEGGDVQPVKLTITSENGKELEKFVPKISKILKEINDA----TDVNSPE 682 + +VE G + + G + +++L V Sbjct: 660 GFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNG 719 Query: 683 EDLLKRVQISIDEDKAKRLNLDKASVASAVYSAFSQNEVSVFENENGKEYELYMRLDDKF 742 + + ++ +D++KA+ L + + + + +A V+ F + G+ +LY++ D KF Sbjct: 720 LEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDF-IDRGRVKKLYVQADAKF 778 Query: 743 RSDTNDILKTKIRSNEGFFVTLGDVATISFEQKPASISRFNRADEIKFLANTKNNAPLNS 802 R D+ K +RS G V T + + R+N ++ Sbjct: 779 RMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSS-G 837 Query: 803 VANEISKKLDEILPANFKYKFLGFVELMDDTNASFIFTVSASAVLIYMVLAALYESFLLP 862 A + + L LPA Y + G + V+ S V++++ LAALYES+ +P Sbjct: 838 DAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIP 897 Query: 863 FLIMLAMPLAFCGVVIGLFISGNPFSLFVMVGVILLFGMVGKNAILVVDFANHF-ANSGM 921 +ML +PL GV++ + ++ MVG++ G+ KNAIL+V+FA G Sbjct: 898 VSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGK 957 Query: 922 EANEAVKMAAKKRLRAVMMTTFAMIFAMLPLALSRGAGYEANSPMAISIIFGLISSTLLS 981 EA MA + RLR ++MT+ A I +LPLA+S GAG A + + I ++ G++S+TLL+ Sbjct: 958 GVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLA 1017 Query: 982 LLVVPVLFAWVYNLDK 997 + VPV F + K Sbjct: 1018 IFFVPVFFVVIRRCFK 1033
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 52.2 bits (125), Expect = 1e-11 Identities = 24/69 (34%), Positives = 40/69 (57%), Gaps = 5/69 (7%) Query: 2 KKRAFTLIEIIFVIVILGVLSAIAIPKLFFTRSDAIVANARTQIAAIKSGISLKYNDSVL 61 K+R FTL+EI+ VIVI+GVL+++ +P L + A A + I A+++ + + D+ Sbjct: 6 KQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKLDN-- 63 Query: 62 KGIPKYPDT 70 YP T Sbjct: 64 ---HHYPTT 69
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 30.9 bits (70), Expect = 0.009 Identities = 25/114 (21%), Positives = 42/114 (36%), Gaps = 18/114 (15%) Query: 180 YLMPLLFILMLVMIAKNITLEG---AMEGVKFYLTPDFSKI----------SLKLFVEVL 226 PLL + L+ IA ++ G + E +K PD KI S+K VE L Sbjct: 86 LCFPLLTVAALMAIASHVVQYGFLISGEAIK----PDIKKINPIEGAKRIFSIKSLVEFL 141 Query: 227 GQVFFALSLGFGVMITLSSFVKKDEGLVKISIITGILNTVIAVLAGFMIFPSLF 280 + + L + I + + L I I + +L M+ ++ Sbjct: 142 KSILKVVLLSILIWIIIKGNLVTLLQLP-TCGIECITPLLGQILRQLMVICTVG 194
>PF06580#Sensor histidine kinase Length = 349 Score = 30.6 bits (69), Expect = 0.009 Identities = 24/138 (17%), Positives = 50/138 (36%), Gaps = 5/138 (3%) Query: 68 FFAWISFGLKMLSDACLKEGTTFENIIFVMSFLLISSLLDLPLSIYESFVKDKKLGFSNM 127 W + L A L ++IF ++ L+ +L Y SF+K + NM Sbjct: 17 GIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHA---YRSFIKRQGWLKLNM 73 Query: 128 SARIFLVDTIKSL-ALMLVFGSAFVWLVLLYINFLGDFWWFWAFLLSFGVALIINLIYPT 186 I V + ++ + +W +L +IN + LS +++ + Sbjct: 74 GQIILRVLPACVVIGMVWFVANTSIWRLLAFINTKPVAFT-LPLALSIIFNVVVVTFMWS 132 Query: 187 LIAPIFNKMSPLEDGELK 204 L+ ++ + E+ Sbjct: 133 LLYFGWHFFKNYKQAEID 150
>PF06580#Sensor histidine kinase Length = 349 Score = 28.7 bits (64), Expect = 0.016 Identities = 20/98 (20%), Positives = 33/98 (33%), Gaps = 22/98 (22%) Query: 110 ISLLLRCEISNSDAKNPALKSSFELCRPYL-LEEIR------------------LIKPKI 150 +S L+R + S+A+ +L + YL L I+ + P + Sbjct: 200 LSELMRYSLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQVPPML 259 Query: 151 IITLGEQAFMHLYPNLLSKGGFSSIRGSILKDDDRFIM 188 + TL E H L G I KD+ + Sbjct: 260 VQTLVENGIKHGIAQLPQGG---KILLKGTKDNGTVTL 294
>60KDINNERMP#60kDa inner membrane protein signature. Length = 548 Score = 364 bits (935), Expect = e-122 Identities = 153/553 (27%), Positives = 267/553 (48%), Gaps = 48/553 (8%) Query: 4 MSMQKRLLLAALLSIVFFIVYDFFMPKRAPLEQNQTTISQTMDQSKAPASANDTPKSNEN 63 M Q+ LL+ ALL + F I + K + QTT + T + A+ P S + Sbjct: 1 MDSQRNLLVIALLFVSFMIWQAWEQDKNPQPQAQQTTQTTT--TAAGSAADQGVPASGQG 58 Query: 64 LASNEIIATIKGQSYEAKIDKLG-RIAKFYLTEDKYKTEDGNKIELVSQNPLPLELRFN- 121 + ++K + I+ G + + L + +L+ +P + + Sbjct: 59 K-----LISVKTDVLDLTINTRGGDVEQALLPAYPKELNSTQPFQLLETSPQFIYQAQSG 113 Query: 122 -------DSTLNADAFKVAYSSDVSEIDASSEPKTIKLT-QNLDGVTVTKNIKFYPNGRY 173 D+ N D + + +T + G T TK G Y Sbjct: 114 LTGRDGPDNPANGPRPLYNVEKDAYVLAEGQNELQVPMTYTDAAGNTFTKTFVLKR-GDY 172 Query: 174 EVEVNL------SKSVDYFI------TPGFRPNIAIDS-----YTVHGVMLRNTDDSLNI 216 V VN K ++ + P++ S +T G D+ Sbjct: 173 AVNVNYNVQNAGEKPLEISSFGQLKQSITLPPHLDTGSSNFALHTFRGAAYSTPDEKYEK 232 Query: 217 IE---DGDAKEVKNYANTTIAAASDRYYTALFYSFNKPFEAVVD-KDANNNPIVFVKT-- 270 + D + + + A +Y+ + N N + K+ Sbjct: 233 YKFDTIADNENLNISSKGGWVAMLQQYFATAWIPHNDGTNNFYTANLGNGIAAIGYKSQP 292 Query: 271 -------NDSLKLGAYIGPKEHKILSSMDERLNDVIEYGWFTFIAKPMFAFLNFLHNYIG 323 ++ ++GP+ ++++ L+ ++YGW FI++P+F L ++H+++G Sbjct: 293 VLVQPGQTGAMNSTLWVGPEIQDKMAAVAPHLDLTVDYGWLWFISQPLFKLLKWIHSFVG 352 Query: 324 NWGWAIVVLTLVIRIVLFPLTYKGMLSMNKLKELAPKVKEIQTKYKDDKQKMQVHMMELY 383 NWG++I+++T ++R +++PLT SM K++ L PK++ ++ + DDKQ++ MM LY Sbjct: 353 NWGFSIIIITFIVRGIMYPLTKAQYTSMAKMRMLQPKIQAMRERLGDDKQRISQEMMALY 412 Query: 384 KKHGANPMGGCLPILLQIPVFFAIYRVLLNAIELKGAPWILWIHDLSVMDPYFVLPILMG 443 K NP+GGC P+L+Q+P+F A+Y +L+ ++EL+ AP+ LWIHDLS DPY++LPILMG Sbjct: 413 KAEKVNPLGGCFPLLIQMPIFLALYYMLMGSVELRQAPFALWIHDLSAQDPYYILPILMG 472 Query: 444 LTMFLQQKLTPTTFTDPMQEKVMKFLPLIFTFFFVTFPAGLTLYWFVNNVCSVVQQVFVN 503 +TMF QK++PTT TDPMQ+K+M F+P+IFT FF+ FP+GL LY+ V+N+ +++QQ + Sbjct: 473 VTMFFIQKMSPTTVTDPMQQKIMTFMPVIFTVFFLWFPSGLVLYYIVSNLVTIIQQQLIY 532 Query: 504 KLFEKHKKAAEVK 516 + EK + K Sbjct: 533 RGLEKRGLHSREK 545
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.8 bits (69), Expect = 0.010 Identities = 33/167 (19%), Positives = 56/167 (33%), Gaps = 12/167 (7%) Query: 48 IEANLENQPKPQQKPKNDRNFAKKSDENEPVKEEKKQSKKHDHNDKKRNPKKHKDEKNEA 107 + +N E + + P A S+ E V E KQ K +++ + + A Sbjct: 1010 VPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVA 1069 Query: 108 KPEQKEHK-----NEKQNLSEKNSALAKDAFAEKGEKEAEEPGYVIKR--LDEPKAPKE- 159 K + K NE + E E EE V + PK + Sbjct: 1070 KEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQV 1129 Query: 160 --QREQKEAKEPQASKSAHKNILDTSIIENFNHTDEESAPQALPKEK 204 ++EQ E +PQA + + T I+ +A P ++ Sbjct: 1130 SPKQEQSETVQPQAEPAREND--PTVNIKEPQSQTNTTADTEQPAKE 1174
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 38.2 bits (89), Expect = 5e-05 Identities = 42/193 (21%), Positives = 71/193 (36%), Gaps = 14/193 (7%) Query: 167 RKAVNLAGVQVDNVVLSGYASAIATLTKDEKELGVALIDMGGETCNMVVHAGNSLRYNSY 226 R++ AG + ++ A+AI + G ++D+GG T + V + N + Y+S Sbjct: 127 RESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSS 186 Query: 227 LHVGSANIT------IDLSMALHTPLPKAEEIKLEYGK-LVNKSVDLIELP---RLGDEQ 276 + +G + + AE IK E G V IE+ Sbjct: 187 VRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVP 246 Query: 277 KTHEVSLDVISNVISARAEETVMVLANMLEDSG---YKDLVGAGIVLTGGMTKLDGLKDL 333 + ++ + I + V + LE D+ G+VLTGG L L L Sbjct: 247 RGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRL 306 Query: 334 ASAIFDNMPVRIA 346 +PV +A Sbjct: 307 LME-ETGIPVVVA 318
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 300 bits (770), Expect = e-102 Identities = 128/360 (35%), Positives = 196/360 (54%), Gaps = 18/360 (5%) Query: 4 FLSFVAASVIATSAFATQIKELANIVGVRDNQLIGYGLVVGLNGTGDGST-SKFTIQSLS 62 F + S A ++IK++A++ RDNQLIGYGLVVGL GTGD S FT QS+ Sbjct: 13 FSALPFLSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVGLQGTGDSLRSSPFTEQSMR 72 Query: 63 NMLQGVNVKINPDDIKSKNAAAVMVTAKLPAFARHGDKLDIEISSIGDAKSLQGGTLLMT 122 MLQ + + +KN AAVMVTA LP FA G ++D+ +SS+GDA SL+GG L+MT Sbjct: 73 AMLQNLGITTQGGQSNAKNIAAVMVTANLPPFASPGSRVDVTVSSLGDATSLRGGNLIMT 132 Query: 123 PLKGVDGDIYALAQGPLSIGGKSAGRSG----GNHPTVGTILNGALVEREVTYDIYNQDS 178 L G DG IYA+AQG L + G SA T + NGA++ERE+ + + Sbjct: 133 SLSGADGQIYAVAQGALIVNGFSAQGDAATLTQGVTTSARVPNGAIIERELPSKFKDSVN 192 Query: 179 IKLSLKDTNFKTALDIQNAIN----ANISDDTAKAIDPRTVIVKKPDDVSIIELASAVLD 234 + L L++ +F TA+ + + +N A D A+ D + + V+KP + L + + + Sbjct: 193 LVLQLRNPDFSTAVRVADVVNAFARARYGDPIAEPRDSQEIAVQKPRVADLTRLMAEIEN 252 Query: 235 LDVEYKPDEKIVVDERTGTIVSGINAVVSPVVITHGAITIKIEPNSYEEAAQNDVNIGSD 294 L VE K+V++ERTGTIV G + +S V +++G +T+++ + + Q Sbjct: 253 LTVETDTPAKVVINERTGTIVIGADVRISRVAVSYGTLTVQVTESP--QVIQPAPFSRGQ 310 Query: 295 TSVAPSQNLLK-------ISGEKTTVANVTRALNKLGATPSDIISILENLKRVGAIQVDL 347 T+V P +++ E + + LN +G II+IL+ +K GA+Q +L Sbjct: 311 TAVQPQTDIMAMQEGSKVAIVEGPDLRTLVAGLNSIGLKADGIIAILQGIKSAGALQAEL 370
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 219 bits (558), Expect = 9e-66 Identities = 120/624 (19%), Positives = 226/624 (36%), Gaps = 88/624 (14%) Query: 7 SLGTGVSGLNAAQVQISTTGNNITNADSNYYTRQRVVQSASPAMNTVPGGVGTGTQVDTV 66 + +SGLNAAQ ++T NNI++ + YTRQ + + + + G VG G V V Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVSGV 62 Query: 67 TRLHDEFAYSRLKYSSSNLENTGYKQRILQEATKYFPDLKDNGMVKDIQEYFAAWNNFAS 126 R +D F ++L+ + + + + + + + +Q++F + S Sbjct: 63 QREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNML-STSTSSLATQMQDFFTSLQTLVS 121 Query: 127 NPDEGAQKVNLINKASVLTASINRSSKMLYDMHTQIDETIKININEINSLGKQIANINKQ 186 N ++ A + LI K+ L + + L D Q++ I ++++IN+ KQIA++N Q Sbjct: 122 NAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLNDQ 181 Query: 187 IQRVESGADAGIKINANDLRDKRDELELAMSKLVNTAVYKSDLKSESRIDTGISDQGRYY 246 I R+ G + N+L D+RD+L ++++V V G Y Sbjct: 182 ISRLTG---VGAGASPNNLLDQRDQLVSELNQIVGVEVS--------------VQDGGTY 224 Query: 247 NLNIG-GVSIVDGVNFHEISM-SSTESGQYTKIYYEREDGRRIPMEEKITN-GKIGAALD 303 N+ + G S+V G +++ S+ T + Y I + EK+ N G +G L Sbjct: 225 NITMANGYSLVQGSTARQLAAVPSSADPSRTTVAYVDGTAGNIEIPEKLLNTGSLGGILT 284 Query: 304 LRGRNYEPDNDKFSDGIIQKYIDNLNTFSKTLITSTNNVYAESAVEISNSDPISYLENDK 363 R + + + L + + N + Sbjct: 285 FR------------SQDLDQTRNTLGQLALAFAEAFNTQHKAG----------------- 315 Query: 364 TLMNHDNSIRNGSFE----AIVYDNKGNVVAKKTIEINGTTTMNDTKYGNSVVQDFNSNS 419 + + F A++ + K + + + T Y + D N Sbjct: 316 --FDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDASAVLATDY--KISFDNNQWQ 371 Query: 420 DDN-NDNNMLNDVDDFFEASYFYDKNTHQGTFALIPKQAQGLYSISIVDHGTNFPGVVGI 478 N D F +++ V V+ Sbjct: 372 VTRLASNTTFTVTPDANGKVAFDGLELTFTG----TPAVNDSFTLKPVSDAIVNMDVL-- 425 Query: 479 NRFFSGTNSNTIGINQNFTQDHTKLRAYSKPVVGNNEVANKMIQLQYQKQTFYSSGTALD 538 D K+ S+ G+++ N L Q S+ + Sbjct: 426 ------------------ITDEAKIAMASEEDAGDSDNRNGQALLDLQ-----SNSKTVG 462 Query: 539 RDETIEGYYRYFTTDMASDTEANNTIHDTNTSLQRTAEEEFQSTSGVDTNEELTNLIRFQ 598 ++ Y +D+ + T T T ++ + QS SGV+ +EE NL RFQ Sbjct: 463 GAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQ 522 Query: 599 ASYGAAAKIITTVDQMLDTLLSLK 622 Y A A+++ T + + D L++++ Sbjct: 523 QYYLANAQVLQTANAIFDALINIR 546
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 51.2 bits (122), Expect = 1e-08 Identities = 72/390 (18%), Positives = 148/390 (37%), Gaps = 17/390 (4%) Query: 74 NFVDDKDLNLSDNVKELQEQVRELSKKNEILAADNVDMSEKNLDFISKISEMKRNIENEK 133 + + ++ L +L + L N+ L + + EK +SE I+ + Sbjct: 60 DKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELE 119 Query: 134 NEIVEKNQKALGELEAQ-----HFENIQALTKRLNEAQADMIESSKAYEKKIIDLENAIN 188 + + G + + ++A L +AD+ ++ + I Sbjct: 120 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIK 179 Query: 189 DARNGDESKLKDAEASFNKFKESFEANYTALKEQNNELNATLAQKEALIKEYE------K 242 +++ L+ +A K E TA + L A A A + E Sbjct: 180 TLEA-EKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAM 238 Query: 243 AQSEKDRSEKKEILLLKEEIERVKNDADTQKFSYEKEINALNDGFETQKSVMEDELSKKA 302 S D ++ K + K +E + + + A + +T ++ ++KA Sbjct: 239 NFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKA 298 Query: 303 NKIIDLQEALESNKTALKDRIYELEEIKKNLNSKDLMA----QSYNGKNLELNASLAALH 358 + + L +N+ +L+ + E KK L ++ + L L A Sbjct: 299 -DLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASR 357 Query: 359 KSFDDLKQKSLKSEQENKLANENISSLKKELERANALNKKLEKQNLDANSTLSELSKKLS 418 ++ L+ + K E++NK++ + SL+++L+ + K++EK +ANS L+ L K Sbjct: 358 EAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNK 417 Query: 419 LSEESLKKSQEELKALDTKTTKFLKTLFDQ 448 EES K +++E L K K L ++ Sbjct: 418 ELEESKKLTEKEKAELQAKLEAEAKALKEK 447 Score = 43.5 bits (102), Expect = 3e-06 Identities = 39/287 (13%), Positives = 98/287 (34%), Gaps = 8/287 (2%) Query: 284 NDGFETQKSVMEDELSKKANKIIDLQEALESNKTALKDRIYELEEIKKNLNSKDLMAQSY 343 N ++D + ++ + +E L N +L ++ +++E++ + + Sbjct: 73 NSDLSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADLEKALEGA 132 Query: 344 NGKNLELNASLAALHKSFDDLKQKSLKSEQENKLANENISSLKKELERANALNKKLEKQN 403 + +A + L L + E+ + A ++ +++ A LE + Sbjct: 133 MNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQ 192 Query: 404 LDANSTLSELSKKLSLSEESLKKSQEELKALDTKTTKFLKTLFDQNQTISLQSQKLGSNE 463 + L + +K + E AL + + + ++ Sbjct: 193 AELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADL-------EKALEGAMNFSTADS 245 Query: 464 GELKNLSAKLDLKDAKIKELEENVTKTSQMLLSKQNELETQKRTLKIDMQNYEILRQQIN 523 ++K L A+ +A+ ELE+ + + +++T + L Q Sbjct: 246 AKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQ 305 Query: 524 MLQKKIVDTSTFLTDNNKSGGKNLLSLQNELENAKQKLNESNKTIER 570 +L L D ++ K L + +LE + S +++ R Sbjct: 306 VLNANRQSLRRDL-DASREAKKQLEAEHQKLEEQNKISEASRQSLRR 351 Score = 43.1 bits (101), Expect = 4e-06 Identities = 70/397 (17%), Positives = 149/397 (37%), Gaps = 11/397 (2%) Query: 120 SKISEMKRNIENEKNEIVEKNQKALGELEAQHFENIQALTKRLNEAQADMIESSKAYEKK 179 K+ E E E N + KN L ++ LT+ L+ A+ + ++ K+ +K Sbjct: 53 EKVQERADKFEIENNTLKLKNSD-LSFNNKALKDHNDELTEELSNAKEKLRKNDKSLSEK 111 Query: 180 IIDLENAINDARNGDESKLKDAEASFNKFKESFEANYTALKEQNNELNATLAQKEALIKE 239 + +AR D K + +F+ + A K A L + Sbjct: 112 --ASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMN 169 Query: 240 YEKAQSEKDRSEKKEILLLKEEIERVKNDADTQKFSYEKEINALNDGFETQKSVMEDELS 299 + A S K ++ + E L+ ++ + + +A E +K+ + + Sbjct: 170 FSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTAD-SAKIKTLEAEKAALAARKA 228 Query: 300 KKANKIIDLQEALESNKTALKDRIYELEEIKKNLNSKDLMAQSYNGKNLELNASLAALHK 359 + ++ +K E ++ + + + +A + L Sbjct: 229 DLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEA 288 Query: 360 SFDDLKQKSLKSEQENKLANENISSLKKELERANALNK-------KLEKQNLDANSTLSE 412 L+ + E ++++ N N SL+++L+ + K KLE+QN + ++ Sbjct: 289 EKAALEAEKADLEHQSQVLNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQS 348 Query: 413 LSKKLSLSEESLKKSQEELKALDTKTTKFLKTLFDQNQTISLQSQKLGSNEGELKNLSAK 472 L + L S E+ K+ + E + L+ + + + + + E L+ ++K Sbjct: 349 LRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSK 408 Query: 473 LDLKDAKIKELEENVTKTSQMLLSKQNELETQKRTLK 509 L + KELEE+ T + Q +LE + + LK Sbjct: 409 LAALEKLNKELEESKKLTEKEKAELQAKLEAEAKALK 445 Score = 41.6 bits (97), Expect = 1e-05 Identities = 44/342 (12%), Positives = 110/342 (32%), Gaps = 11/342 (3%) Query: 285 DGFETQKSVMEDELSKKANKIIDLQEALESNKTALKDRIYELEEIKKNLNSKDLMAQSYN 344 ++ +++ ++A+K L+ + L L++ L + A+ Sbjct: 42 AVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKL 101 Query: 345 GKNLELNASLAALHKSFDDLKQKSLK----SEQENKLANENISSLKKELERANALNKKLE 400 KN + + A+ + + K K + + + I +L+ E A LE Sbjct: 102 RKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLE 161 Query: 401 KQNLDANSTLSELSKKLSLSEESLKKSQEELKALDTKTTKFLKTLFDQNQTISLQSQKLG 460 K A + + S K+ E + L+ + + I + Sbjct: 162 KALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKA 221 Query: 461 SNEGELKNLSAKLDLKDAKIKELEENVTKTSQMLLSKQNELETQKRTLKIDMQNYEILRQ 520 + +L L+ + + + ++ L+ M Sbjct: 222 ALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSA 281 Query: 521 QINMLQKKIVDTSTFLTDNNKSGGKNLLSLQNELENAKQKLNESNKTIERLNSKINELSS 580 +I L+ + D L ++ ++ L+ S + ++L ++ +L Sbjct: 282 KIKTLEAEKAALEAEKADLEHQ----SQVLNANRQSLRRDLDASREAKKQLEAEHQKLEE 337 Query: 581 SGHKGGAVNAQIIELQKDIEQNLNRQDELENENVNLKNILQA 622 + A L++D++ + + +LE E+ L+ + Sbjct: 338 ---QNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNKI 376
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 91.8 bits (228), Expect = 9e-24 Identities = 38/132 (28%), Positives = 63/132 (47%), Gaps = 4/132 (3%) Query: 3 RILLVEDDETLLDLISEYLGENGYDVTTTNNAKDALDLAYERNFDLLILDVKLPQGDGFS 62 IL+ +DD + ++++ L GYDV T+NA + DL++ DV +P + F Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 63 LLSSLRELGVTTPSIFTTSLNTIDDLEKGYKSGCDDYLKKPFELKELLIRMQALIKRNFS 122 LL +++ P + ++ NT K + G DYL KPF+L E + +I R + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTE----LIGIIGRALA 120 Query: 123 HQNGEDIKILDD 134 K+ DD Sbjct: 121 EPKRRPSKLEDD 132
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 87.1 bits (216), Expect = 3e-26 Identities = 39/87 (44%), Positives = 51/87 (58%) Query: 3 KAEFIQAVADKAGLSKKDTLKVVDATLETIQAVLEKGDTISFIGFGTFGTADRAARKARV 62 K + I VA+ L+KKD+ VDA + + L KG+ + IGFG F +RAARK R Sbjct: 4 KQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGRN 63 Query: 63 PGTKKVIDVPASKAVKFKVGKKLKEAV 89 P T + I + ASK FK GK LK+AV Sbjct: 64 PQTGEEIKIKASKVPAFKAGKALKDAV 90
>FLAGELLIN#Flagellin signature. Length = 507 Score = 55.4 bits (133), Expect = 4e-10 Identities = 55/340 (16%), Positives = 113/340 (33%), Gaps = 9/340 (2%) Query: 18 KNMVGVNKSYQQLSNGLKIQDPYDGAAVYNDAMRLDYEATTLTQVADATGKSVNFAKNTD 77 K+ ++ + ++LS+GL+I D AA A R LTQ + ++ A+ T+ Sbjct: 19 KSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQTTE 78 Query: 78 NALKEFEKQLENFKTKVVQAASDVHSTTSLEALANDLQGIKNHLVNIAN-TSINGQFLFS 136 AL E L+ + VQA + +S + L+++ +++Q + ++N T NG + S Sbjct: 79 GALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKVLS 138 Query: 137 GSAVDTKPIDGSGKYQGNRDYMKTSAGAQVELPYNIPGFDLFLGKDGDYNKILTTNVMLA 196 + G+ + ++ + L GF++ K+ + ++ + Sbjct: 139 QDNQMKIQV-GANDGETITIDLQKIDVKSLGL----DGFNVNGPKEATVGDLKSSFKNVT 193 Query: 197 DQTRTDIAYAPKYLDENSKIKNMIGLNYASDSVVGSDGSYKGTIEPDFDFLDTSNVNFPD 256 + +D NS V + + D + ++ Sbjct: 194 GYDTYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTT 253 Query: 257 TYFFMQGKKPDGTTFTSKFKMSADTSMAGLMEKIGMEFGNTKTTKVVDVSINNDGQFNIK 316 + K G+ I + GN KV + Sbjct: 254 KSTAGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVA 313 Query: 317 DLTKGNQTIDFHMVAATSVAANRGAIAPNNTLDTVNSLQS 356 D+T G +D A + N N + ++ Sbjct: 314 DITAGAANVDA---ATLQSSKNVYTSVVNGQFTFDDKTKN 350 Score = 32.3 bits (73), Expect = 0.008 Identities = 28/300 (9%), Positives = 65/300 (21%), Gaps = 15/300 (5%) Query: 480 GGVNTPVQFQITSTTAAGVVSPTRNLTVYNSDEFGSYRTYASDFTYRQLMDIIAMAASDN 539 G V + + N+ A + T L A Sbjct: 201 GANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKSTAGTA 260 Query: 540 IPDPQNVENANFDTDIEKVRRDQNYNAYKEALSKTKGAVEVNLDDKGRMVLTDKTKSVTN 599 + + + + G V ++ + + LT + Sbjct: 261 EAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEK-VTLTVADITAGA 319 Query: 600 IELTMYDAKNGDI----FDGDSTGMNTAGAASHPQGKGSVFSFNENNALTIDEPSTSVFQ 655 + ++ + + + I Sbjct: 320 ANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAEYTA 379 Query: 656 DLDDMIFAVRNGYYRADANNHDPRNT----------GMQGALKRLDHLVDHANKELTKIG 705 + + D L +D + + + +G Sbjct: 380 NAAGDKVTLAGKTMFIDKTASGVSTLINEDAAAAKKSTANPLASIDSALSKVDAVRSSLG 439 Query: 706 SQTKLLTSTKERAEIMKVNVLTVKNDVIDADYAESYLKFTQLSLSYQATLQASAKINQLS 765 + S N+ + ++ + DADYA ++ + QA A+ NQ+ Sbjct: 440 AIQNRFDSAITNLGNTVTNLNSARSRIEDADYATEVSNMSKAQILQQAGTSVLAQANQVP 499
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 33.8 bits (77), Expect = 0.001 Identities = 17/85 (20%), Positives = 38/85 (44%), Gaps = 3/85 (3%) Query: 205 GLGSSYNPGFAWDPDKPNILHAHCSNETEISFKFDKDKMKDKDNNSTNSSDKDKENPKPD 264 G+ + +N + + N L + +I F D + ++ N+ D +P+ Sbjct: 255 GVPNEFNGAVFINENVRNFLKQNIE---DIHFANDDQPNNPDNPDNPNNPDNPNNPDEPN 311 Query: 265 KDKDNSNPDKKDNNENSNNSSGESG 289 + +NPD DN +N+N+ + ++ Sbjct: 312 NPDNPNNPDNPDNGDNNNSDNPDAA 336
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 107 bits (268), Expect = 2e-27 Identities = 53/268 (19%), Positives = 109/268 (40%), Gaps = 34/268 (12%) Query: 128 SNSVFFRADDYIFDQVKDAIAKIDKSLEQVTFKLTITETNLKDIKDLGTNLQ----GLLK 183 +N++ A + + ++ IA++D QV + I E D +LG G+ + Sbjct: 318 TNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWANKNAGMTQ 377 Query: 184 PLNHGDLAYYINL-----------ITSPYITNSNVIKNDDSAFFG-----ILNFLDTNGI 227 N G L + ++S + + + F+ +L L ++ Sbjct: 378 FTNSG-LPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSSTK 436 Query: 228 TKIISSPVLTAKNHTEVYFSSVQNIPYLVSKTDISNVNYQKTDSYEYKDIGLKINLKPII 287 I+++P + ++ E F+ Q +P L S N ++ E K +G+K+ +KP I Sbjct: 437 NDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDN--IFNTVERKTVGIKLKVKPQI 494 Query: 288 LSDHIDFDLHLILEDILSQ--------SSSLTPIVSKKELKSSYSLKRGDVLVLSGINKK 339 + L +E +S SS L + + + ++ + G+ +V+ G+ K Sbjct: 495 NEGD---SVLLEIEQEVSSVADAASSTSSDLGATFNTRTVNNAVLVGSGETVVVGGLLDK 551 Query: 340 TTAKQRNGVPVLKDIWLLKYLFSVEQDS 367 + + + VP+L DI ++ LF Sbjct: 552 SVSDTADKVPLLGDIPVIGALFRSTSKK 579
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.4 bits (66), Expect = 0.032 Identities = 9/33 (27%), Positives = 16/33 (48%), Gaps = 3/33 (9%) Query: 51 NILMIGSTGVGKTEIAR---RLSKMMGLPFIKV 80 +++ G +G GK +AR K PF+ + Sbjct: 162 TLMITGESGTGKELVARALHDYGKRRNGPFVAI 194
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 156 bits (395), Expect = 3e-43 Identities = 66/293 (22%), Positives = 132/293 (45%), Gaps = 22/293 (7%) Query: 200 NAGLITVTATPSQLKRVEKYIAEMQRRLKKQVIIDVSIIAVDLNNEYKQGVDWSKFELG- 258 + VTA P + +E+ IA++ R + QV+++ I V + G+ W+ G Sbjct: 317 QTNALIVTAAPDVMNDLERVIAQLDIR-RPQVLVEAIIAEVQDADGLNLGIQWANKNAGM 375 Query: 259 --FNSYIGNPGSSTSSYASWTNKGNSLSDGFGRTLN----IAANLNFSLDGMINFLETNG 312 F + ++ + + G S + A + ++ L ++ Sbjct: 376 TQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAGFYQGNWAMLLTALSSST 435 Query: 313 KTKVVSSPKVTTLNNQQALISVGDNINYRVMEETDNGSNNNNNNRLTTTYKQYSVFIGIL 372 K ++++P + TL+N +A +VG + +T +G N N T +GI Sbjct: 436 KNDILATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKT--------VGIK 487 Query: 373 LNLLPEVSDNNKIMLRINPSLSSFKYAEDDTRSQNTAIREIAPDTVQKKLSTVVQVNSGD 432 L + P++++ + ++L I +SS + ++ ++ + ++ V V SG+ Sbjct: 488 LKVKPQINEGDSVLLEIEQEVSSV------ADAASSTSSDLGATFNTRTVNNAVLVGSGE 541 Query: 433 TIILGGLIGQTKDKQNTAVPLLADIPLIGSVFKSTRDGVRTTELIFVITPRVV 485 T+++GGL+ ++ VPLL DIP+IG++F+ST V L+ I P V+ Sbjct: 542 TVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVI 594
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.024 Identities = 9/52 (17%), Positives = 15/52 (28%), Gaps = 1/52 (1%) Query: 132 DTSKNDPKRQRNSQGWLK-LNIPDEEPLTEQNGINGISVPQDEVIDLESKPA 182 D K+ P + + WL + Q + + E K A Sbjct: 810 DPGKSSPMLEGQVRDWLNENGWEYLRETSGQRRRGYMRPQVWPPVIAEDKEA 861
>PF04183#IucA / IucC family Length = 580 Score = 27.9 bits (62), Expect = 0.030 Identities = 10/30 (33%), Positives = 15/30 (50%), Gaps = 3/30 (10%) Query: 34 LLRAKGLDEANFYDLARVVAQICENYRKKF 63 L+ G+ E FY L +A + +Y KK Sbjct: 495 LMVRLGVPERRFYQL---LAAVLSDYMKKH 521
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 27.8 bits (62), Expect = 0.043 Identities = 18/111 (16%), Positives = 45/111 (40%), Gaps = 12/111 (10%) Query: 91 FSVFDVVMMSANARLGIFERPSKEDEKIALDALKTLNLESFKDKIYTDLSGGERQMVLIA 150 F D+V++ + + ++ AL + ++KI+ ++S +R ++ Sbjct: 245 FVFEDIVLLDDRSIQRVLREIDGQELAKALKS----VDIPVQEKIFKNMS--KRAASMLK 298 Query: 151 RALAQRSKVMLLDEPTANLDFGNQMRVLKEIKKLAKQGYIIILTSHQPEQV 201 + D + Q +++ I+KL +QG I+I + + + Sbjct: 299 EDMEFLGPTRRKDVEES------QQKIVSLIRKLEEQGEIVISRGGEEDVL 343
>PF05211#Neuraminyllactose-binding hemagglutinin Length = 260 Score = 25.4 bits (55), Expect = 0.025 Identities = 10/23 (43%), Positives = 17/23 (73%) Query: 52 MQKIDTQFALESLEVYQKIAEDM 74 MQ+ID + ++LE YQK A+++ Sbjct: 232 MQEIDKKLTQKNLESYQKDAKEL 254
>SYCDCHAPRONE#Gram-negative bacterial type III secretion SycD chaperone signature. Length = 168 Score = 32.2 bits (73), Expect = 0.001 Identities = 15/65 (23%), Positives = 26/65 (40%), Gaps = 6/65 (9%) Query: 173 YNLAVLYHNTPGAKRDYKEAIKLYKKACDSDFSISCY--NLATLYQEQKEYEKANKLYFK 230 Y+LA + Y++A K+++ C D S + L Q +Y+ A Y Sbjct: 40 YSLAFNQYQ----SGKYEDAHKVFQALCVLDHYDSRFFLGLGACRQAMGQYDLAIHSYSY 95 Query: 231 ACKLD 235 +D Sbjct: 96 GAIMD 100
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 49.2 bits (117), Expect = 6e-09 Identities = 11/42 (26%), Positives = 25/42 (59%) Query: 220 EMSNVQLVEEMTDLITGQRAYEANSKAITTSDSMLEIVNGLK 261 +S V L EE +L Q+ Y AN++ + T++++ + + ++ Sbjct: 505 SISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINIR 546 Score = 44.2 bits (104), Expect = 3e-07 Identities = 10/35 (28%), Positives = 19/35 (54%) Query: 4 SLYTAATGMIAEQTQIDVTSHNIANVNTYGYKKNR 38 + A +G+ A Q ++ S+NI++ N GY + Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQT 37
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 35.7 bits (82), Expect = 2e-04 Identities = 11/40 (27%), Positives = 19/40 (47%) Query: 3 NGYYQATAGMVTQFNRLNVISNNLANVNTIGYKRNDVVIG 42 + A +G+ LN SNN+++ N GY R ++ Sbjct: 2 SLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMA 41
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 47.0 bits (112), Expect = 8e-08 Identities = 39/164 (23%), Positives = 62/164 (37%), Gaps = 39/164 (23%) Query: 5 IINGTIVNSDEKFKANILIENGKIAKIGSEKF------------EADKVIDATNKLVMPG 52 I N I++ KA+I +++G+IA IG +VI K+V G Sbjct: 72 ITNALILDHWGIVKADIGLKDGRIAAIGKAGNPDMQPGVTIIVGPGTEVIAGEGKIVTAG 131 Query: 53 LIDMHVHFRDPGQEYKDDIISGSQAAVAGGVTTCLCMANTNPVNDNASIT--------RA 104 +D H+HF P Q + A+ G+T + T P + + T Sbjct: 132 GMDSHIHFICPQQ---------IEEALMSGLTC-MLGGGTGPAHGTLATTCTPGPWHIAR 181 Query: 105 MIEKAKNCGLIDLLPI--AAISKGLGGNEIVEMGDLIEAGAVAF 146 MIE A D P+ A KG + +++ GA + Sbjct: 182 MIEAA------DAFPMNLAFAGKGNASLP-GALVEMVLGGATSL 218
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 31.2 bits (70), Expect = 0.016 Identities = 13/71 (18%), Positives = 27/71 (38%), Gaps = 4/71 (5%) Query: 622 VDEKGQLNFY----ILDTAAQQKLMDAVQYKDGAYHLMINVAQTSSIVQALRREKEKRPM 677 DE ++N+ ++ T + + + D + DG+Y N + +I+ L Sbjct: 95 FDELNRMNYSDLVELIKTISYENVPDLFNFNDGSYTFFSNRDRVQAIIYGLEDSGRTYTA 154 Query: 678 SQHGEMVLCVE 688 + VE Sbjct: 155 DDDKGIPTLVE 165
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 39.4 bits (92), Expect = 2e-05 Identities = 24/100 (24%), Positives = 46/100 (46%), Gaps = 8/100 (8%) Query: 111 RIEKIDTTRLKAELKAGRIVVVAGFQGI---DDKGDITTL-GRGGSDLSAVALAGALEAD 166 + +T +K ++ G IV+ +G G+ + G+I + DL+ LA + AD Sbjct: 172 GHVEAET--IKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNAD 229 Query: 167 LCEIFTDVDGVYTTDPRIEKKAKKLEKISYDEMLELASAG 206 + I TDV+G +K + L ++ +E+ + G Sbjct: 230 IFMILTDVNGAALYYGT--EKEQWLREVKVEELRKYYEEG 267
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 103 bits (259), Expect = 6e-28 Identities = 36/123 (29%), Positives = 61/123 (49%) Query: 2 KILVVEDEIDLNSVITRHLKKNGYSVDSACNGEEAMDFTAVAHYDLIVLDLMMPVMDGLT 61 ILV +D+ + +V+ + L + GY V N + A DL+V D++MP + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 FLQRSRAAKLVTPVLILTAKDDVDDVVKGLDAGADDYLVKPFDFKELLARVRTLIRRNSG 121 L R + A+ PVL+++A++ +K + GA DYL KPFD EL+ + + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 122 NVA 124 + Sbjct: 125 RPS 127
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 35.6 bits (82), Expect = 7e-05 Identities = 20/65 (30%), Positives = 38/65 (58%), Gaps = 3/65 (4%) Query: 2 KRAFTLLELVVVIVVLGIIAMMSFNAIMNIYSNYFQTKTVNELETQTEIALEQISKRLEH 61 +R FTLLE++VVIV++G++A + +M + K V+++ E AL+ +L++ Sbjct: 7 QRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVA-LENALDMY--KLDN 63 Query: 62 RIKPS 66 P+ Sbjct: 64 HHYPT 68
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 31.5 bits (71), Expect = 0.001 Identities = 13/48 (27%), Positives = 24/48 (50%), Gaps = 5/48 (10%) Query: 1 MVKRGFSLIELILSIVVVAIISTSIPLVLKT--TSELNQKAVTQESLM 46 M +RGF+L+E++L ++ ++ S +VL S + A T Sbjct: 1 MRQRGFTLLEMML---ILLLMGVSAGMVLLAFPASRDDSAAQTLARFE 45
>MICOLLPTASE#Microbial collagenase metalloprotease (M9) signature. Length = 1104 Score = 28.1 bits (62), Expect = 0.009 Identities = 9/20 (45%), Positives = 13/20 (65%) Query: 56 PDDFNYNKQFKAFVSNKNRM 75 PD FN+N F SN++R+ Sbjct: 119 PDLFNFNDGSYTFFSNRDRV 138
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 30.9 bits (70), Expect = 0.011 Identities = 25/83 (30%), Positives = 35/83 (42%), Gaps = 22/83 (26%) Query: 271 GNIANPAAVKDLVEAGADGIKV----GIGPGSICTTRIVAGVGVPQISAIDDCASEAAKY 326 GN + P A+ ++V GA +K+ G P +AID C S A +Y Sbjct: 199 GNASLPGALVEMVLGGATSLKLHEDWGTTP-----------------AAIDCCLSVADEY 241 Query: 327 GIPV-IADGGLKYSGDVAKALAA 348 + V I L SG V +AA Sbjct: 242 DVQVMIHTDTLNESGFVEDTIAA 264
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 39.4 bits (92), Expect = 6e-05 Identities = 31/148 (20%), Positives = 59/148 (39%), Gaps = 21/148 (14%) Query: 605 ELALKLKIAALVVAFLLLWFYFSAIISALVMGIII-FGVLLTLFIFAIFGVNLSIFGVFG 663 E+ L A ++V ++ F + + + L+ I + +L T I A FG +++ +FG Sbjct: 339 EVVKTLFEAIMLVFLVMYLFLQN-MRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFG 397 Query: 664 LILASAVGIDYMI--------FALNESLSEKERIYG---------IFCAFITS--FISFF 704 ++LA + +D I + + L KE + A + S FI Sbjct: 398 MVLAIGLLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMA 457 Query: 705 TLSFSQTAALSVFGLSVSLCVLIYGLCA 732 S A F +++ + + L A Sbjct: 458 FFGGSTGAIYRQFSITIVSAMALSVLVA 485 Score = 34.4 bits (79), Expect = 0.002 Identities = 27/161 (16%), Positives = 54/161 (33%), Gaps = 16/161 (9%) Query: 568 YASGFVKGAASDEVLKRHNAFSLNFADSLNESLTQAKELALKLKIAALVVAFLLLWFYFS 627 +SG + K ++ + + + I+ +VV L Y S Sbjct: 834 TSSGDAMALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYES 893 Query: 628 AIISALVMGIIIFGVLLTLFIFAIFGVNLSIFGVFGLI----LASAVGIDYMIFALNESL 683 I VM ++ G++ L +F ++ + GL+ L++ I + FA + Sbjct: 894 WSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLME 953 Query: 684 SE------------KERIYGIFCAFITSFISFFTLSFSQTA 712 E + R+ I + + L+ S A Sbjct: 954 KEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGA 994 Score = 34.0 bits (78), Expect = 0.003 Identities = 33/208 (15%), Positives = 75/208 (36%), Gaps = 35/208 (16%) Query: 242 AIFLMLAF-RNLRIFYVIFIATFGFSVAFVGTLLCLNE----LNILTILISTSLIGLMFD 296 +M F +N+R + IA V +GT L +N LT+ IGL+ D Sbjct: 351 VFLVMYLFLQNMRATLIPTIA---VPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVD 407 Query: 297 Y-------ILHWLSKNEGEAIRAS--SIKNMLKIFLLGLLITLSGYLAFTF---SDLRLL 344 + + +++ A+ S+ + + ++ + ++ F S + Sbjct: 408 DAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIY 467 Query: 345 KEVALFSAFALVAAFLASYFFMPLIF---------------EGVKFYRSKVFDAFLTKFC 389 ++ ++ A+ + L + P + G + + FD + + Sbjct: 468 RQFSITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYT 527 Query: 390 DLSGAVARHLGIKFLAISLILLAIFLVF 417 + G + G L +LI+ + ++F Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLF 555 Score = 32.1 bits (73), Expect = 0.010 Identities = 33/173 (19%), Positives = 73/173 (42%), Gaps = 17/173 (9%) Query: 218 YQAFSKQKNESESLYMSAVSLSLTAIFLMLA--FRNLRI-FYVIFIATFGFSVAFVGTLL 274 + S Q+ S + + V++S +FL LA + + I V+ + G + L Sbjct: 858 WTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATL 917 Query: 275 CLNELNILTILISTSLIG-------LMFDYILHWLSKNEGEAIRASSI---KNMLKIFLL 324 + ++ ++ + IG L+ ++ L + EG+ + +++ + L+ L+ Sbjct: 918 FNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKD-LMEKEGKGVVEATLMAVRMRLRPILM 976 Query: 325 GLLITLSGYLAFTFSD---LRLLKEVALFSAFALVAAFLASYFFMPLIFEGVK 374 L + G L S+ V + +V+A L + FF+P+ F ++ Sbjct: 977 TSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIR 1029
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 109 bits (273), Expect = 5e-31 Identities = 74/249 (29%), Positives = 116/249 (46%), Gaps = 15/249 (6%) Query: 3 KRVFITGSSRGIGASIARRLANEYEVVLHARSKSDELLKMAGELGAKFMT-----FDVAD 57 K FITG+++GIG ++AR LA++ + ++L K+ L A+ DV D Sbjct: 9 KIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRD 68 Query: 58 TAAAKEAIEADMEANGVYYGVILNAGITRDNTFVGLSDEEWFDVIDVNLNGFYNVLRPAL 117 +AA E G ++ AG+ R LSDEEW VN G +N R ++ Sbjct: 69 SAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR-SV 127 Query: 118 MPMIRARKPARIVTLSSVSGVIGNRGQVNYSASKAGIIGASKALAVELASRGITVNCVAP 177 + R+ IVT+ S + Y++SKA + +K L +ELA I N V+P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 178 GLIKTDMSEEILNSD---------FLDEVLKAIPAKRAGEADEVAGLVKFLLSDEASYIT 228 G +TDM + + L+ IP K+ + ++A V FL+S +A +IT Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 229 RQVIGVNGG 237 + V+GG Sbjct: 248 MHNLCVDGG 256
>ENTSNTHTASED#Enterobactin synthetase component D signature. Length = 234 Score = 31.9 bits (72), Expect = 0.001 Identities = 21/87 (24%), Positives = 42/87 (48%), Gaps = 5/87 (5%) Query: 62 LSHKENIAVLAISKEKIGVDVEE-LKQRNFDGVAKFCFNKKESEIYANAKDKMQKFYEI- 119 +SH A+ IS+++IG+D+E+ + Q +A + E +I + + Sbjct: 88 ISHCATTALAVISRQRIGIDIEKIMSQHTATELAPSIIDSDERQILQASLLPFPLALTLA 147 Query: 120 YTAKEAVIKAKNLAFSDLAGVGFDQMQ 146 ++AKE+V KA + + GF+ + Sbjct: 148 FSAKESVYKAFSDRVTLP---GFNSAK 171
>cloacin#Cloacin signature. Length = 551 Score = 36.2 bits (83), Expect = 1e-04 Identities = 18/45 (40%), Positives = 22/45 (48%) Query: 238 NSQGGSLPMGFRRGGSDSNGGGRSSNRGGGFSGGGGGFGGGGASG 282 N GG +G G SD +G +N GG SG G +GGG G Sbjct: 19 NINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHG 63 Score = 29.7 bits (66), Expect = 0.016 Identities = 16/36 (44%), Positives = 17/36 (47%), Gaps = 3/36 (8%) Query: 251 GGSDSN---GGGRSSNRGGGFSGGGGGFGGGGASGS 283 GGS S GGG GGG GGG G GG + Sbjct: 48 GGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSA 83 Score = 28.5 bits (63), Expect = 0.037 Identities = 14/37 (37%), Positives = 16/37 (43%) Query: 238 NSQGGSLPMGFRRGGSDSNGGGRSSNRGGGFSGGGGG 274 N GG G GG +G G + GG SG GG Sbjct: 44 NPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGN 80
>PF01206#SirA family protein Length = 76 Score = 56.3 bits (136), Expect = 4e-13 Identities = 16/69 (23%), Positives = 29/69 (42%) Query: 3 RTIDCRNLECPKPVIMTKNALEGLNEGESLEIIVNALAPKENISRFLKNQNIEFSLESNG 62 +++D L CP P++ K L +N GE L ++ ++ F K E + Sbjct: 6 QSLDATGLNCPLPILKAKKTLATMNAGEVLYVMATDPGSVKDFESFSKQTGHELLEQKEE 65 Query: 63 NETKILAIK 71 + T +K Sbjct: 66 DGTYHFRLK 74
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.6 bits (69), Expect = 0.002 Identities = 24/121 (19%), Positives = 46/121 (38%), Gaps = 11/121 (9%) Query: 8 VVRVKKQEMDKVEAKLVVARLNVRSAEEKI-----ALLRAKLNEFRLPKSGNIGELRENL 62 V + +E + V V+ +L AE +LL+A+L + R L ++ Sbjct: 107 VKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRY------QILSRSI 160 Query: 63 ELINIARAELSACKESLEIAKKEVLHYEHKYKNANLEYEKMKYLEKEEFKKEIKRIQKAE 122 EL + +L ++++EVL K ++ KY ++ K+ Sbjct: 161 ELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVL 220 Query: 123 A 123 A Sbjct: 221 A 221
>PF02370#M protein repeat Length = 168 Score = 27.0 bits (59), Expect = 0.034 Identities = 18/93 (19%), Positives = 43/93 (46%), Gaps = 5/93 (5%) Query: 31 KEEISKELEVIDEQRQALEVFRASSAAAYEENNKKLAKKEADLNATMKVIEQKRKEIDEV 90 ++++ + L+ D +R+ +RA N+ L K+E ++ +E++RKE E Sbjct: 30 QKQLEEYLDSSDSKRENDPQYRA-----LMGENQDLRKREGQYQDKIEELEKERKEKQER 84 Query: 91 VAKNEKILKELRTMTTDKVNESYAKMKDGAAAE 123 + EK ++ + + + + + + AE Sbjct: 85 PERREKFERQHQDKHYQEQQKKHQQEQQQLEAE 117
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 461 bits (1189), Expect = e-166 Identities = 183/338 (54%), Positives = 245/338 (72%), Gaps = 2/338 (0%) Query: 3 LDQVIGFFSSDMGIDLGTANTLVLVKDKGIIINEPSVVAVRREKYGKQK-ILAVGHAAKE 61 L + G FS+D+ IDLGTANTL+ VK +GI++NEPSVVA+R+++ G K + AVGH AK+ Sbjct: 2 LKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRAGSPKSVAAVGHDAKQ 61 Query: 62 MVGKTPGDIEAIRPMRDGVIADFDMTERMIRYFIEKTHRRKNF-LRPRIIISVPYGLTQV 120 M+G+TPG+I AIRPM+DGVIADF +TE+M+++FI++ H PR+++ VP G TQV Sbjct: 62 MLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGATQV 121 Query: 121 ERKAVRESALSAGAREVFLIEEPMAAAIGANLPVREPQGNLVVDIGGGTTEIGVVSLGGL 180 ER+A+RESA AGAREVFLIEEPMAAAIGA LPV E G++VVDIGGGTTE+ V+SL G+ Sbjct: 122 ERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGV 181 Query: 181 VISKSIRTAGDKIDSSIVNYIKEKYNLLIGERTGEEIKIAVGSAVQLEKELSVVVKGRDQ 240 V S S+R GD+ D +I+NY++ Y LIGE T E IK +GSA ++ + V+GR+ Sbjct: 182 VYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGRNL 241 Query: 241 VSGLLSRVELTSEDVREAMREPLKEIADALKTVLEMMPPDLAGDIVETGIVLTGGGALIR 300 G+ L S ++ EA++EPL I A+ LE PP+LA DI E G+VLTGGGAL+R Sbjct: 242 AEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLR 301 Query: 301 GLDKFLSDIVKLPVFVADEPLLAVARGTGKALQEIGLL 338 LD+ L + +PV VA++PL VARG GKAL+ I + Sbjct: 302 NLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMH 339
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 34.0 bits (78), Expect = 0.001 Identities = 31/169 (18%), Positives = 64/169 (37%), Gaps = 30/169 (17%) Query: 43 DTSVEESKNEEAIEYQKLTPKELKAVLDNYVIGQDRAKKVFSVGVYNHYKRIFKQSDIKD 102 + A ++ + E + ++G+ A +Y R+ + Sbjct: 109 TELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAA----MQEIYRVLARLMQ------ 158 Query: 103 DTEISKSNILLVGPTGSGKTLMAQTL---ARFLDVP-IAI-CDA--TSLTEAGYVGEDVE 155 + +++ G +G+GK L+A+ L + + P +AI A L E+ G + Sbjct: 159 ----TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH-EK 213 Query: 156 NILTRLLQAANGDVKKAEQGIVFVDEID--------KIARMSENRSITR 196 T + G ++AE G +F+DEI ++ R+ + T Sbjct: 214 GAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTT 262
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 29.0 bits (65), Expect = 0.016 Identities = 11/37 (29%), Positives = 18/37 (48%), Gaps = 1/37 (2%) Query: 43 ILGQSGSGKSTLAKLISFSEPKSGGK-IYINNEEITD 78 I G+SG+GK +A+ + + G + IN I Sbjct: 165 ITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPR 201
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 27.9 bits (62), Expect = 0.040 Identities = 10/16 (62%), Positives = 13/16 (81%) Query: 32 ITGASGSGKSLFAKSL 47 ITG SG+GK L A++L Sbjct: 165 ITGESGTGKELVARAL 180
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 31.4 bits (71), Expect = 0.020 Identities = 9/26 (34%), Positives = 15/26 (57%) Query: 1064 VELANERANAVKEALIKAGLEASRIN 1089 L+ RA +V + LI G+ A +I+ Sbjct: 271 QGLSERRAQSVVDYLISKGIPADKIS 296
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 56.3 bits (136), Expect = 2e-14 Identities = 21/81 (25%), Positives = 39/81 (48%) Query: 3 STLVSLGVETFKIALYISLPMLLSGLIAGLIISIFQATTQINETTLSFVPKILLVVVVII 62 LV G + + L +S + I GL++ +FQ TQ+ E TL F K+L V + + Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 63 FLMPWMISMMVEFTTRMLDFI 83 L W +++ + +++ Sbjct: 62 LLSGWYGEVLLSYGRQVIFLA 82
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 54.8 bits (132), Expect = 2e-10 Identities = 50/221 (22%), Positives = 84/221 (38%), Gaps = 28/221 (12%) Query: 33 SFIVIGGAGSIGSAVTKEIFIRDPKKLYVVDISENNLVELVRDIRSEFGYISGDFKTFAI 92 ++V G AG IG V+K + ++ +D + ++ R E F+ I Sbjct: 2 KYLVTGAAGFIGFHVSKR-LLEAGHQVVGIDNLNDYYDVSLKQARLEL-LAQPGFQFHKI 59 Query: 93 DVASAEFDALLAQSGGFDYVLNLSALKHVR-SEKDPFTLMRMLETNIFNTDKTLAQALYM 151 D+A E L SG F+ V VR S ++P ++N+ L + Sbjct: 60 DLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYA---DSNLTGFLNILEGCRHN 116 Query: 152 KSKKYFCVST---------------DKAANPVNLMGASKRIMEMFA--FRHSLNIDVSMA 194 K + S+ D +PV+L A+K+ E+ A + H + + Sbjct: 117 KIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPATGL 176 Query: 195 RFANVAFSDGS---LLFGFQKRIEKSQPIVAPND--VRRYF 230 RF V G LF F K + + + I N ++R F Sbjct: 177 RFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDF 217
>NEISSPPORIN#Neisseria sp. porin signature. Length = 348 Score = 30.0 bits (67), Expect = 0.010 Identities = 20/56 (35%), Positives = 25/56 (44%), Gaps = 16/56 (28%) Query: 11 VSLAFGFNFDMDSKNGAIQNTKELGLKDTLTLNLQNDNAITGADYDESKKQFAIVS 66 VS A GF +DS N NT D + GA+YD SK+ A+VS Sbjct: 283 VSYAHGFKGTVDSANH--DNTY--------------DQVVVGAEYDFSKRTSALVS 322
>CABNDNGRPT#NodO calcium binding signature. Length = 479 Score = 33.8 bits (77), Expect = 0.006 Identities = 30/144 (20%), Positives = 43/144 (29%), Gaps = 14/144 (9%) Query: 1341 HITTGDGDDVITVVDGWGGRINNHSSVELGDGTNMLKVARDIDKSTVTAGSGDDTVNVGN 1400 H D I + G + + GD D D T T S +V + Sbjct: 241 HYGGAPMIDDIAAIQRLYG---ANMTTRTGDSVYGFNSNTDRDFYTATDSSKALIFSVWD 297 Query: 1401 WIREHSDINLGNGNNTLTVGNIIVNSTVATGDGNDTIKVKNAIIGSTIKLGAGDDTVEVG 1460 + G NN N S V GN +I I G+G+D + Sbjct: 298 AGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIAHGVTI--ENAIGGSGNDILVGN 355 Query: 1461 NKDITDNTLTGKSNIDGGDGYDKL 1484 + + + GG G D L Sbjct: 356 S---------ADNILQGGAGNDVL 370 Score = 32.6 bits (74), Expect = 0.012 Identities = 22/137 (16%), Positives = 41/137 (29%), Gaps = 12/137 (8%) Query: 1352 TVVDGWGGRINNHSSVELGDGTNMLK---VARDIDKSTVTAGSGDDTVNVGNWIREHSDI 1408 +++ WG G M+ + + + +T +GD + D Sbjct: 224 SIMSYWGENETGADYNGHYGGAPMIDDIAAIQRLYGANMTTRTGDSVYGFNS--NTDRDF 281 Query: 1409 NLGNGNNTLTVGNIIVNSTVATGDGNDTIKVKNAIIGSTIKLGAGDDTVEVGNK-DITDN 1467 ++ + ++ G DT I L G + G K +++ Sbjct: 282 YTATDSSKALIFSVWD------AGGTDTFDFSGYSNNQRINLNEGSFSDVGGLKGNVSIA 335 Query: 1468 TLTGKSNIDGGDGYDKL 1484 N GG G D L Sbjct: 336 HGVTIENAIGGSGNDIL 352
>LCRVANTIGEN#Low calcium response V antigen signature. Length = 326 Score = 27.0 bits (59), Expect = 0.046 Identities = 32/154 (20%), Positives = 59/154 (38%), Gaps = 21/154 (13%) Query: 19 ISLYNSLVAKQNQVKSVEAGIDAQLKRRYDLIPNLVATAKEYMVH---EKSLLENITA-- 73 + +Y+ + A+ N+ S I+ K + NL E + E +LE + Sbjct: 164 LKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYTDEEIFKASAEYKILEKMPQTT 223 Query: 74 --LRESARSASTNEEKFELNNKISSLLNGLRVSVENYPDLKANQNLLHIQST-------- 123 + S + + ++ NK + L L+ +Y K N L H +T Sbjct: 224 IQVDGSEKKIVSIKDFLGSENKRTGALGNLK---NSYSYNKDNNELSHFATTCSDKSRPL 280 Query: 124 ---LNEVEEQISAARRAYNSAVEIYNNATQMFPS 154 +++ Q+S +NSA+E N Q + S Sbjct: 281 NDLVSQKTTQLSDITSRFNSAIEALNRFIQKYDS 314
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 25.8 bits (56), Expect = 0.045 Identities = 12/69 (17%), Positives = 25/69 (36%), Gaps = 1/69 (1%) Query: 47 ALKGERDRAFQSLGRINALFAKLEEQNEQLKAEIEQAKAKFEKLASNYVVLCDEIDELKS 106 + + A A LE + +L+ +E A ++ L E L++ Sbjct: 236 GAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEA 295 Query: 107 KLS-LKEQK 114 + + L+ Q Sbjct: 296 EKADLEHQS 304
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 27.3 bits (60), Expect = 0.031 Identities = 23/97 (23%), Positives = 38/97 (39%), Gaps = 12/97 (12%) Query: 1 MAKIT----DKIREKILAD-----FHTGF--FSIRQIAERAGVSHVAVHKIVKGLTPKFK 49 MA+ T + R+ IL G S+ +IA+ AGV+ A++ K + F Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 50 EKINAEVAFKTELADENLQQIN-SVNEVISEATKHLI 85 E + EL E + V+ E H++ Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVL 97
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 30.1 bits (68), Expect = 0.014 Identities = 29/193 (15%), Positives = 66/193 (34%), Gaps = 35/193 (18%) Query: 209 LIINTVLFLPFFLGFLA--WVLTKDG----FAYNVNGVVSLVAYKYAINLIEMPIAGILL 262 L + ++ + ++ + F+ ++ VV V ++ + P+ + Sbjct: 38 LSAMLMGLSDYYFEHFSKLMLIPAEQSYLPFSQALSYVVDNVLLEFFY--LCFPLLTVAA 95 Query: 263 LVGVLLVLVGIFQGAF---TKSIR-------------GIFAYGVGV-----TLAVTALFL 301 L+ + + Q F ++I+ IF+ V L V L + Sbjct: 96 LMAIA---SHVVQYGFLISGEAIKPDIKKINPIEGAKRIFSIKSLVEFLKSILKVVLLSI 152 Query: 302 ITGLNGTAFYPSFSDLS-SSLT--IKNASSSHYTLGVMSYVSLLVPVVLAYIFIVWRAID 358 + + + L + L V+ V +V + Y F ++ I Sbjct: 153 LIWIIIKGNLVTLLQLPTCGIECITPLLGQILRQLMVICTVGFVVISIADYAFEYYQYIK 212 Query: 359 SKKITQDEIKNDH 371 K+++DEIK ++ Sbjct: 213 ELKMSKDEIKREY 225
>TONBPROTEIN#Gram-negative bacterial tonB protein signature. Length = 239 Score = 66.6 bits (162), Expect = 6e-14 Identities = 25/65 (38%), Positives = 34/65 (52%), Gaps = 2/65 (3%) Query: 167 AIAPVAPVEPSQPENPTPQPKPVEPVEPKPEPEPENPTPQP--EPKPEPTPEPTPQPEPK 224 ++ V P + P+ P P+PV EP+PEP PE P P KP+P P+P P+P K Sbjct: 46 SVTMVTPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 105 Query: 225 PDESK 229 E Sbjct: 106 VQEQP 110 Score = 59.2 bits (143), Expect = 2e-11 Identities = 20/62 (32%), Positives = 30/62 (48%), Gaps = 5/62 (8%) Query: 170 PVAPVEPSQPENPTPQPKPVEPVE---PKPEPEPENPTPQPEPKPEPTPEPTPQPEPKPD 226 V P E P P+P+P+ P +P+ P P+P+PKP + P+ + KP Sbjct: 60 AVQPPPEPVVE-PEPEPEPIPEPPKEAPVVIEKPK-PKPKPKPKPVKKVQEQPKRDVKPV 117 Query: 227 ES 228 ES Sbjct: 118 ES 119 Score = 50.4 bits (120), Expect = 2e-08 Identities = 16/37 (43%), Positives = 21/37 (56%) Query: 804 PEPTPVEPVEPVNPAPAPQPEPKPEPKPEPQPAPDPE 840 PEP P EP AP +PKP+PKP+P+P + Sbjct: 71 PEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQ 107 Score = 48.8 bits (116), Expect = 5e-08 Identities = 15/39 (38%), Positives = 20/39 (51%) Query: 804 PEPTPVEPVEPVNPAPAPQPEPKPEPKPEPQPAPDPEPE 842 EP P P P AP KP+PKP+P+P P + + Sbjct: 69 VEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQ 107 Score = 48.4 bits (115), Expect = 7e-08 Identities = 14/38 (36%), Positives = 20/38 (52%) Query: 804 PEPTPVEPVEPVNPAPAPQPEPKPEPKPEPQPAPDPEP 841 PEP P+ P +P+PKP+PKP+P +P Sbjct: 73 PEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQP 110 Score = 46.1 bits (109), Expect = 4e-07 Identities = 15/44 (34%), Positives = 20/44 (45%), Gaps = 3/44 (6%) Query: 802 AKPEPTPVEP---VEPVNPAPAPQPEPKPEPKPEPQPAPDPEPE 842 P VEP EP+ P P +PKP+P+P P P + Sbjct: 62 QPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 105 Score = 45.0 bits (106), Expect = 9e-07 Identities = 15/39 (38%), Positives = 21/39 (53%) Query: 804 PEPTPVEPVEPVNPAPAPQPEPKPEPKPEPQPAPDPEPE 842 PEP P P E P+P+PKP+PKP + P+ + Sbjct: 75 PEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQPKRD 113 Score = 44.2 bits (104), Expect = 2e-06 Identities = 12/42 (28%), Positives = 18/42 (42%), Gaps = 1/42 (2%) Query: 802 AKPEPTPVEPVEPVNPAPAPQPEPKPEPKPEPQ-PAPDPEPE 842 P P P P+PEP+P P+P + P +P+ Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPK 93 Score = 44.2 bits (104), Expect = 2e-06 Identities = 12/51 (23%), Positives = 21/51 (41%), Gaps = 1/51 (1%) Query: 170 PVAPVEPSQPENPTPQPKPVEPVEPKPEPEPENPTPQPEPKPEPTPEPTPQ 220 P P + +PKP +PKP + + P+ + KP + +P Sbjct: 76 EPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQE-QPKRDVKPVESRPASPF 125 Score = 43.4 bits (102), Expect = 3e-06 Identities = 16/40 (40%), Positives = 18/40 (45%), Gaps = 1/40 (2%) Query: 803 KPEPTPVEPVEPVNPAPAPQPEPKPEPKPEPQPAPDPEPE 842 P P V V PA P+ P PEP P+PEPE Sbjct: 38 LPAPAQPISVTMVTPADLEPPQAVQPP-PEPVVEPEPEPE 76 Score = 43.1 bits (101), Expect = 4e-06 Identities = 19/45 (42%), Positives = 25/45 (55%), Gaps = 1/45 (2%) Query: 806 PTPVEPVEPVNPAPAPQPEPKPEPKPEPQPAPDPEPEYIHEYDSK 850 P +EP + V P P P EP+PEP+P P+P P P I + K Sbjct: 52 PADLEPPQAVQPPPEPVVEPEPEPEPIPEP-PKEAPVVIEKPKPK 95 Score = 42.3 bits (99), Expect = 6e-06 Identities = 15/41 (36%), Positives = 18/41 (43%), Gaps = 1/41 (2%) Query: 802 AKPEPTPVEPVEPVNPAPAPQPEPKPE-PKPEPQPAPDPEP 841 P V+P P P+PEP PE PK P P+P Sbjct: 54 DLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKP 94 Score = 42.3 bits (99), Expect = 7e-06 Identities = 17/46 (36%), Positives = 21/46 (45%), Gaps = 5/46 (10%) Query: 802 AKPEPTPVEPVEPVNPAPAPQPE-----PKPEPKPEPQPAPDPEPE 842 + P EPV P P P PE P KP+P+P P P+P Sbjct: 58 PQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPV 103 Score = 34.6 bits (79), Expect = 0.002 Identities = 16/64 (25%), Positives = 21/64 (32%), Gaps = 4/64 (6%) Query: 169 APVAPVEPSQPENPTPQPKPVEPVEPKPEP---EPENPTPQPEPKPEPTPEPTPQ-PEPK 224 APV +P P P+P +PK + E +P P T K Sbjct: 85 APVVIEKPKPKPKPKPKPVKKVQEQPKRDVKPVESRPASPFENTAPARLTSSTATAATSK 144 Query: 225 PDES 228 P S Sbjct: 145 PVTS 148
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 31.1 bits (70), Expect = 0.010 Identities = 24/105 (22%), Positives = 41/105 (39%), Gaps = 18/105 (17%) Query: 70 DATTSANDVRFDNGNDIVDMTRSIVNDAKIDAGDGDNKLRIHDNIEVRGLRFDAGAGNDE 129 D A+ GND D + + G+GD++L G GND+ Sbjct: 738 DIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQL-------------YGGDGNDK 784 Query: 130 IEIRNNVGIKDHTLLYTNDGDDSVKIYGATMENAAIHTGLDNDVI 174 + +G+ + L DGDD ++ G ++ + G ND + Sbjct: 785 L-----IGVAGNNYLNGGDGDDEFQVQGNSLAKNVLFGGKGNDKL 824
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 32.6 bits (74), Expect = 0.010 Identities = 30/147 (20%), Positives = 61/147 (41%), Gaps = 26/147 (17%) Query: 520 HQLIKNTKIDTGADNDTVNIKSDMYAYVTNN---------------GTTDLTEYAGSRTD 564 +++K ++ G + +S + ++ GTT ++ GS+ Sbjct: 678 QEVVKEQEVSVGKRTEKTQYRSYEFTHINGKNLTETDNLYSVEELIGTTRADKFFGSKFT 737 Query: 565 SFIKTGEGDDTINVTDASISRVDIDTGDSDTGDMLNFISAGIYNSEIKSGNGNDKIVLQD 624 +GDD I D + D GD G+ + +S G + ++ G+GNDK++ Sbjct: 738 DIFHGADGDDLIEGNDGN----DRLYGDK--GN--DTLSGGNGDDQLYGGDGNDKLIGVA 789 Query: 625 TKADVMDIYTGEGNDSLTIKGSTEIKN 651 + G+G+D ++G++ KN Sbjct: 790 GNNYLNG---GDGDDEFQVQGNSLAKN 813
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 55.4 bits (133), Expect = 2e-09 Identities = 37/156 (23%), Positives = 64/156 (41%), Gaps = 10/156 (6%) Query: 1150 NNVITTSEGNDIITVGDGNNTINAGRGENEIRTGNGNNVIITGDNNDVITTGSGNDYIDA 1209 ++ ++G+D+I DGN+ + +G + + GNG++ + GD ND + +GN+Y++ Sbjct: 737 TDIFHGADGDDLIEGNDGNDRLYGDKGNDTLSGGNGDDQLYGGDGNDKLIGVAGNNYLNG 796 Query: 1210 GR-----SGYTGINKGDLVNAGAGNDKVVFTFDD---PRAALSQSLDGGAGTDTLIMRPM 1261 G +++ G GNDK+ + L GG G D Sbjct: 797 GDGDDEFQVQGNSLAKNVLFGGKGNDKLYGSEGADLLDGGEGDDLLKGGYGNDIYRYLSG 856 Query: 1262 AKDGTIDFDKIDNKSLTNAIKNFEEIQLGMDEHGND 1297 ID D L+ A +F + GND Sbjct: 857 YGHHIIDDDGGKEDKLSLADIDFR--DVAFKREGND 890
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 35.0 bits (80), Expect = 0.001 Identities = 13/78 (16%), Positives = 21/78 (26%) Query: 164 TTTPISPVTPVTPSTPVTPPTPSTPSTPGVTVTPGTPSPANPPVITPRPGAVEITTSIDP 223 + T ++P P PP P P P P A + P+P + Sbjct: 51 SVTMVAPADLEPPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKK 110 Query: 224 AASEVRESGAGANGEGGH 241 R+ + Sbjct: 111 VEQPKRDVKPVESRPASP 128
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 77.6 bits (191), Expect = 1e-18 Identities = 31/112 (27%), Positives = 55/112 (49%) Query: 10 KVSILVAEDDEMARELIITGLKPYCDQVVGAKDGQDGLEKFKKQGFDIVMSDIHMPVLNG 69 +ILVA+DD R ++ L V + D+V++D+ MP N Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 70 FEMMNEIKKLKPHQKFIVFTSYDSDENLIKSYEQGATLFLKKPIDIKDLRSM 121 F+++ IKK +P +V ++ ++ IK+ E+GA +L KP D+ +L + Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114
>PF06917#Periplasmic pectate lyase Length = 555 Score = 27.2 bits (60), Expect = 0.046 Identities = 14/51 (27%), Positives = 18/51 (35%), Gaps = 11/51 (21%) Query: 130 LPPLRALANMCGMEF----ADYVYSGGLSYQSRHDEAKLA----LMRQKAL 172 LP L G+ F D +Y+ + D A A L RQ L Sbjct: 207 LPKLPE---TKGLTFVNAGTDLIYAAYKYAEYTGDAAAAAWGKHLYRQYVL 254
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 35.4 bits (81), Expect = 1e-04 Identities = 39/186 (20%), Positives = 80/186 (43%), Gaps = 9/186 (4%) Query: 7 ITGASSGIGAAAAKAFARRGENLILIARRGELLENLKSEIAKFANV--DVVIELCDLSKQ 64 ITGA+ GIG A A+ A +G ++ + E LE + S + A ++ D + Sbjct: 13 ITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAI 72 Query: 65 ENALSLW-RNLEKFELKALINNAGFGDYNKVGEQNLEKITQMINLNIISLVTLSTLFTKK 123 + + R + + L+N AG + + E+ ++N + S +K Sbjct: 73 DEITARIEREMGPID--ILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSVSKY 130 Query: 124 YKDKDT-QLINISSIGGYKIVPNAVTYCASKFFVSAFSEGLYHELAQDKQAKMQAKVLAP 182 D+ + ++ + S + Y +SK F++ L ELA + ++ +++P Sbjct: 131 MMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA---EYNIRCNIVSP 187 Query: 183 AATKTE 188 +T+T+ Sbjct: 188 GSTETD 193
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 30.8 bits (69), Expect = 0.015 Identities = 21/100 (21%), Positives = 41/100 (41%), Gaps = 9/100 (9%) Query: 258 NLNVPARWGQSPFTNVTIDITCPSDLRDQIPTSDDIHLFTNVKDEKILKKANERGRKNLV 317 N+ AR + F + + + +D + +S+ L+T++ D++I ++ L Sbjct: 32 NIRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLYTSMYDQQIAQQMTAGKGLGLA 91 Query: 318 DMTYKDFEPEMARIDKAFYEVLTAGDKCSQPFTFPIPTVN 357 +M K PE + L + P FP+ TV Sbjct: 92 EMMVKQMTPE---------QPLPEESTPAAPMKFPLETVV 122
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 33.6 bits (77), Expect = 0.002 Identities = 27/219 (12%), Positives = 73/219 (33%), Gaps = 25/219 (11%) Query: 248 EILKGAVASAQNLPSSPE-----IKAGISSDVLLRRSDVAKA---LADLKAT--NALVGV 297 ++ A A+ + S I+ I +++++ + + L L A A Sbjct: 79 QVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLK 138 Query: 298 AKADYFPTISLTGLLGFTSIDFENIFVGNANTWNIGGSLAQKIFDYGRTKNNVRVAET-N 356 ++ S E N + F + +R+ Sbjct: 139 TQSSLLQARLEQTRYQILSRSIE------LNKLPELKLPDEPYFQNVSEEEVLRLTSLIK 192 Query: 357 EQIAAVTYEATVRSALGEVRDALISRQNAKLS-LDQVKNLLQSQQKIYS-LAKDQYNAGY 414 EQ + + + + + A A+++ + + + +S+ +S L Sbjct: 193 EQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHK----QA 248 Query: 415 IGHLELLDAQRNLLQAK--LQDISAKLDEVDSAVEVYRA 451 I +L+ + ++A L+ ++L++++S + + Sbjct: 249 IAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILSAKE 287
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 950 bits (2456), Expect = 0.0 Identities = 424/1033 (41%), Positives = 642/1033 (62%), Gaps = 17/1033 (1%) Query: 3 SRFFINRPIFATVISIIIVIAGFMGIKGLPIEEYPSLTPPTVSVSATYSGADAQTIADSV 62 + FFI RPIFA V++II+++AG + I LP+ +YP++ PP VSVSA Y GADAQT+ D+V Sbjct: 2 ANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTV 61 Query: 63 ASAIEDQINGVENMLYMQSTSSSAGTMNISVYFKIGSSAKQATIDVNNRVQAALSRLPQE 122 IE +NG++N++YM STS SAG++ I++ F+ G+ A + V N++Q A LPQE Sbjct: 62 TQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQE 121 Query: 123 VQNMGVTVRERSGSILQVVGFTS--PNMNQVELYNYVNLNIADAIKRVNGIGDTVLIGNK 180 VQ G++V + S S L V GF S P Q ++ +YV N+ D + R+NG+GD L G Sbjct: 122 VQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFG-A 180 Query: 181 EYSMRIWLKPDRLAQFKLTPSDVISQVRIQNSQYAAGKIGEQPSKGENPYVYSVVSEGRF 240 +Y+MRIWL D L ++KLTP DVI+Q+++QN Q AAG++G P+ S++++ RF Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 KDPKQFGEILIKSD-DGTVVKLKEVATVELGAASYASEAMLNGKPAVPLLLFLQNDANAL 299 K+P++FG++ ++ + DG+VV+LK+VA VELG +Y A +NGKPA L + L ANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 300 ATAEAVKAKLEELKKTYPVGLEHTIAYNPTEFITVSIDEVIKTFVEAMVLVLIVMYFFLK 359 TA+A+KAKL EL+ +P G++ Y+ T F+ +SI EV+KT EA++LV +VMY FL+ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 360 SFRATIIPMLAVPVSIIGTFGGLYVMGFSINLITLFALILAIGIVVDDAIIVIENVERIL 419 + RAT+IP +AVPV ++GTF L G+SIN +T+F ++LAIG++VDDAI+V+ENVER++ Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 420 HEDKEISVKDATFKAMEEVQAPVISIVLVLCAVFVPVSFMEGFVGVIQKQFALTLVVAVC 479 EDK K+AT K+M ++Q ++ I +VL AVF+P++F G G I +QF++T+V A+ Sbjct: 421 MEDKL-PPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMA 479 Query: 480 ISGFVALTLTPALCAVMLKKQENKPF----WIVQKFNDFFDFSTKLFTAGVAKILKHVII 535 +S VAL LTPALCA +LK + FN FD S +T V KIL Sbjct: 480 LSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGR 539 Query: 536 SFIVIGIMGFATYGLFQKVPKGLVPSEDKGALMVITSLPPSTNMLKTKEEVKSISNAILS 595 ++ ++ LF ++P +P ED+G + + LP +T++ + +++ L Sbjct: 540 YLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLK 599 Query: 596 N--PNVEFTMGVAGYDMLASSLRENSAISFIKLKDWSERKGATDGADALVGQFNGMLWGS 653 N NVE V G+ S +N+ ++F+ LK W ER G + A+A++ + L Sbjct: 600 NEKANVESVFTVNGFS--FSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKI 657 Query: 654 KNSMTFVVNVPPIMGLSMTGGFEMYLQNKSGKSYNEIEADARKVTAAANARP-ELTGVRT 712 ++ N+P I+ L GF+ L +++G ++ + ++ A P L VR Sbjct: 658 RDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRP 717 Query: 713 TLETNYRQFKITVDKEKAKLFGVSESEIFSTIAASFGSYYINDFNLAGKSYRVYARASDN 772 + QFK+ VD+EKA+ GVS S+I TI+ + G Y+NDF G+ ++Y +A Sbjct: 718 NGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAK 777 Query: 773 FRNNPEDLRKIFVRSYEGGMVPLNSVATLTRSIGPDIVDRFNLFPAAKIMGDPKTGYTSG 832 FR PED+ K++VRS G MVP ++ T G ++R+N P+ +I G+ G +SG Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSG 837 Query: 833 DAIRAIQEVVNDTLSSDEYAISWAGTAYQEVNSQGTGTVAFIFGMVFVFLILAAQYERWL 892 DA+ ++ + + + W G +YQE S V VFL LAA YE W Sbjct: 838 DAMALMENLASKLPAG--IGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWS 895 Query: 893 IPLAVITAVPFAVFGSLLAVWIRGLTNDIYFEIGLLLLIGLAAKNAILIVEFAMQERE-S 951 IP++V+ VP + G LLA + ND+YF +GLL IGL+AKNAILIVEFA E Sbjct: 896 IPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKE 955 Query: 952 GKSIFESAVNAAKLRFRPIVMTSIAFTLGVFPMAISTGAGAASRHSLGTGVVGGMIASTT 1011 GK + E+ + A ++R RPI+MTS+AF LGV P+AIS GAG+ +++++G GV+GGM+++T Sbjct: 956 GKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATL 1015 Query: 1012 IAIFFVPMFYYLL 1024 +AIFFVP+F+ ++ Sbjct: 1016 LAIFFVPVFFVVI 1028 Score = 98.0 bits (244), Expect = 8e-23 Identities = 47/321 (14%), Positives = 121/321 (37%), Gaps = 10/321 (3%) Query: 179 NKEYSMRIWLKPDRLAQFKLTPSDVISQVRIQNSQYAAGKIGEQPSKGENPYVYSVVSEG 238 ++ + ++ ++ SD+ + + + +G +Y Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGT---YVNDFIDRGRVKKLYVQADAK 777 Query: 239 RFKDPKQFGEILIKSDDGTVVKLKEVATVELGAASYASEAMLNGKPAVPLLLFLQNDANA 298 P+ ++ ++S +G +V T S L +P + A Sbjct: 778 FRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGS----PRLERYNGLPSMEIQGEAAPG 833 Query: 299 LATAEAVKAKLEELKKTYPVGLEHTIAYNPTEFITVSIDEVIKTFVEAMVLVLIVMYFFL 358 ++ +A+ A +E L P G+ + + +S ++ + V+V + + Sbjct: 834 TSSGDAM-ALMENLASKLPAGIGYDWTGMSYQER-LSGNQAPALVAISFVVVFLCLAALY 891 Query: 359 KSFRATIIPMLAVPVSIIGTFGGLYVMGFSINLITLFALILAIGIVVDDAIIVIENVERI 418 +S+ + ML VP+ I+G + ++ + L+ IG+ +AI+++E + + Sbjct: 892 ESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDL 951 Query: 419 LHEDKEISVKDATFKAMEEVQAPVISIVLVLCAVFVPVSFMEGFVGVIQKQFALTLVVAV 478 + ++ + V +AT A+ P++ L +P++ G Q + ++ + Sbjct: 952 MEKEGK-GVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGM 1010 Query: 479 CISGFVALTLTPALCAVMLKK 499 + +A+ P V+ + Sbjct: 1011 VSATLLAIFFVPVFFVVIRRC 1031
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 40.2 bits (94), Expect = 1e-05 Identities = 20/129 (15%), Positives = 42/129 (32%), Gaps = 20/129 (15%) Query: 99 KYQASYDSLDAAVGVANANLKNAETEFKRISALYKKNAVSQKDYDAAVAAYDIANANLVS 158 + + + + + +A+ E++ ++ L+K + + N+ Sbjct: 263 EAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDK---------LRQTTDNIGL 313 Query: 159 AKANLKSAKIDLGYTSIVAPFDGVVGDNKV-DVGSLVVASQTQLVRLTKINP------IE 211 L + + I AP V KV G +V ++T L I P + Sbjct: 314 LTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAET----LMVIVPEDDTLEVT 369 Query: 212 ADFYIADVD 220 A D+ Sbjct: 370 ALVQNKDIG 378 Score = 39.4 bits (92), Expect = 2e-05 Identities = 13/108 (12%), Positives = 35/108 (32%), Gaps = 4/108 (3%) Query: 59 VTSNQDVIIYPKVGGTIIKQFFKPGDKVKAGEKLFLIDPEKYQASYDSLDAAVGVANANL 118 S + I P + + K G+ V+ G+ L + +A +++ A Sbjct: 91 THSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQ 150 Query: 119 KNAETEFKRISALYKKNAVSQKDYDAAVAAYDIANANLVSAKANLKSA 166 + + I N + + +++ ++ + +K Sbjct: 151 TRYQILSRSIE----LNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQ 194
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 28.5 bits (63), Expect = 0.022 Identities = 30/100 (30%), Positives = 43/100 (43%), Gaps = 13/100 (13%) Query: 10 YAAVGGFGAI-IMAGLAGCGSDDGGNENALNEVAQKNGAFVIIEESAPGVYKILEEYPST 68 Y GG GA G G N + V KNG+ ++I G+ Y Sbjct: 319 YGGNGGPGARHDWNATVGYKDQQGNNVATIINVHMKNGSGLVIAGGEKGI-NNPSFYLYK 377 Query: 69 ETRVVLKDMNGTERVLSKDEID------KLLAQANAKIDN 102 E + + G++R LS++EI + LAQ NAK+DN Sbjct: 378 EDQ-----LTGSQRALSQEEIQNKIDFMEFLAQNNAKLDN 412
>SHIGARICIN#Ribosome inactivating protein family signature. Length = 289 Score = 30.6 bits (69), Expect = 0.012 Identities = 14/48 (29%), Positives = 24/48 (50%), Gaps = 7/48 (14%) Query: 4 DGSYEIL------SCDDVELGIKR-SSALSFYACYDDVKEAKALLVII 44 G+YE L +++ LG+ SA++ Y+ A AL+V+I Sbjct: 131 SGNYERLQIAAGKIRENIPLGLPALDSAITTLFYYNANSAASALMVLI 178
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 94.6 bits (235), Expect = 2e-28 Identities = 30/85 (35%), Positives = 48/85 (56%) Query: 14 GLFKSYDELMDISVDFIAELGTTTVSINELLKFEAGSVIDLEKPAGESVELYINNRIFGK 73 G + D +MDI V ELG T ++I ELL+ GSV+ L+ AGE +++ IN + + Sbjct: 49 GAMQDIDLIMDIPVKLTVELGRTRMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQ 108 Query: 74 GEVMVYEKNLAIRINEILDSKSVIQ 98 GEV+V +RI +I+ ++ Sbjct: 109 GEVVVVADKYGVRITDIITPSERMR 133
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 35.9 bits (83), Expect = 3e-04 Identities = 21/55 (38%), Positives = 28/55 (50%) Query: 114 EEATFGAIAAKNLLHNLAECVTIDIGGGSTELARISNGKIVDVLSLDIGTVRLKE 168 EE AI A + + +DIGGG+TE+A IS +V S+ IG R E Sbjct: 142 EEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDE 196
>PF06580#Sensor histidine kinase Length = 349 Score = 35.6 bits (82), Expect = 3e-04 Identities = 15/90 (16%), Positives = 34/90 (37%), Gaps = 11/90 (12%) Query: 281 GEEKNLALDLKPEIFNLNIQTGLLTHIVQNFVQNAIKFSPKNSTITISSRVEKSKFIIEV 340 + + P I ++ + L+ +V+N +++ I P+ I + + +EV Sbjct: 237 EDRLQFENQINPAIMDVQVPPMLVQTLVENGIKHGIAQLPQGGKILLKGTKDNGTVTLEV 296 Query: 341 ADEGAGIDESKDLFAPFKRYGNKGGAGLGL 370 + G+ ++ K G GL Sbjct: 297 ENTGSLALKN-----------TKESTGTGL 315
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 78.7 bits (194), Expect = 4e-19 Identities = 35/112 (31%), Positives = 57/112 (50%) Query: 2 RILIVEDEVTLNKTIAEGLQEFGYQTDSSENFKDAEYYIGIRNYDLVLTDWMLQDGDGVD 61 IL+ +D+ + + + L GY + N +I + DLV+TD ++ D + D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 LINIIKHKSPRTSVVVLSAKDDKESEIKALRAGADDYIKKPFDFDILVARLE 113 L+ IK P V+V+SA++ + IKA GA DY+ KPFD L+ + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>FLAGELLIN#Flagellin signature. Length = 507 Score = 52.0 bits (124), Expect = 7e-10 Identities = 31/124 (25%), Positives = 55/124 (44%), Gaps = 3/124 (2%) Query: 16 YLDQAKNSEKKALNAISANSEI---KASGANLQIAESLLSQTNVLNEGMANANDMIGMLQ 72 L+++++S A+ +S+ I K A IA S L + NAND I + Q Sbjct: 16 NLNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQ 75 Query: 73 IADSTLLNLSESADKIGELSSKLSNPALSANEQKGIKGEINALKNAMSDSVKEAKFNGKN 132 + L ++ + ++ ELS + +N S ++ K I+ EI + + +FNG Sbjct: 76 TTEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVK 135 Query: 133 VFDA 136 V Sbjct: 136 VLSQ 139
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 59.6 bits (144), Expect = 4e-12 Identities = 25/75 (33%), Positives = 39/75 (52%), Gaps = 7/75 (9%) Query: 226 LSSSVLFDKGSAVLKEEVKEELKATLSKYFDVLLNDKEIASNIDQIVIEGFTDSDGSYIY 285 L S VLF+ A LK E + L S+ ++ D +V+ G+TD GS Y Sbjct: 217 LKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDG-------SVVVLGYTDRIGSDAY 269 Query: 286 NLELSQKRAYAVMEF 300 N LS++RA +V+++ Sbjct: 270 NQGLSERRAQSVVDY 284
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 29.9 bits (67), Expect = 0.018 Identities = 24/112 (21%), Positives = 39/112 (34%), Gaps = 16/112 (14%) Query: 55 IVMMGVILFVAFIFSRHSALVAYSNFLANAKDYKIRLKEFIIAHLFEISGVKKANAKFED 114 + + VI F+ +V LA + DY I + +I + +SG + Sbjct: 148 LYALPVIALTGLAFASLGMVVTA---LAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPI 204 Query: 115 FFESYTR---------NFRNDNLANIGQAVFPMLGILGTFISIAISMPSFSS 157 F++ R R L + V +G L I I +P F S Sbjct: 205 VFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGAL----CIYIVIPFFLS 252
>LCRVANTIGEN#Low calcium response V antigen signature. Length = 326 Score = 27.3 bits (60), Expect = 0.045 Identities = 19/70 (27%), Positives = 33/70 (47%), Gaps = 4/70 (5%) Query: 11 KKRLLEEFKDAKSELKFRNLYELLVCVMLSAQCT----DKRVNLITPALFEAYKDVFELA 66 + +L EE + +ELK ++ + + LS+ T DK +NL+ L+ + A Sbjct: 150 RSKLREELAELTAELKIYSVIQAEINKHLSSSGTINIHDKSINLMDKNLYGYTDEEIFKA 209 Query: 67 SANLASLKLM 76 SA L+ M Sbjct: 210 SAEYKILEKM 219
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 86.4 bits (214), Expect = 2e-20 Identities = 28/111 (25%), Positives = 55/111 (49%), Gaps = 3/111 (2%) Query: 127 KVLVVEDSLPFRNMIKKILTSLQFKVLAAAHGEEAMSYFADNPDINLIITDYRMPVKDGL 186 +LV +D R ++ + L+ + V ++ + A +L++TD MP ++ Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGD-GDLVVTDVVMPDENAF 63 Query: 187 EVLKEVRKEKDKNHLGVIVMTSPSEKTDASIFLKNGASDFIAKPFSKEELI 237 ++L ++K + L V+VM++ + A + GA D++ KPF ELI Sbjct: 64 DLLPRIKKARP--DLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELI 112
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.1 bits (62), Expect = 0.007 Identities = 14/57 (24%), Positives = 26/57 (45%), Gaps = 3/57 (5%) Query: 9 LAASIAMAGGFVSNHKSENVISVKEALKLNDDAK--VMLEGKIKSHIKSDKYEFADK 63 AA A G+ N + + +AL D K MLEG+++ + + +E+ + Sbjct: 781 PAAEGAAQKGYSVNTTFVTIADLVQALGA-DPGKSSPMLEGQVRDWLNENGWEYLRE 836
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 28.0 bits (62), Expect = 0.019 Identities = 12/57 (21%), Positives = 19/57 (33%), Gaps = 2/57 (3%) Query: 9 GSNPDKINAVIEIPYGSNIKYEIDKDSGAVVVDR-VLYSAMFYPANYGFVPNTLAAD 64 + A++ NI Y SGA+ V ++ A G +P A Sbjct: 56 NLSDQDGGAIVAQLTQMNIPYRFANGSGAIEVPADKVHELRLRLAQQG-LPKGGAVG 111
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 154 bits (389), Expect = 9e-40 Identities = 124/576 (21%), Positives = 203/576 (35%), Gaps = 73/576 (12%) Query: 31 YRDYLDLAQNKGIFKATDAPLEFTQRNGTKFTFDKIPN-------NNARNNKGNFTALGR 83 Y+ + D A+NKG F + +N K +PN + +K T + Sbjct: 34 YQIFRDFAENKGKFSVGATNVLVKDKNN-KDLGTALPNGIPMIDFSVVDVDKRIATLINP 92 Query: 84 SFVVTATHVEKGANAVDYNEKRGF--FGNTKYEYLTRYSSTSTSKVYNTETTYLRTTKFI 141 +VV HV G + + + G GN K V E K + Sbjct: 93 QYVVGVKHVSNGVSELHFGNLNGNMNNGNAKAHRDVSSEENRYFSVEKNEYPTKLNGKTV 152 Query: 142 VEGSVDPIDIPDLEISPASYNDQDIAEIEVRKIENYFKSIKNSGGANGNDIFAYQAGIGL 201 D + ++A IE + + + + G G Sbjct: 153 TTEDQTQKRREDYYMPRLDKFVTEVAPIEASTASSDAGTYNDQN----KYPAFVRLGSGS 208 Query: 202 LSLEK--PRIDPITGNPTGGYDTIVDKDDTNNQTLGASLNNINIINSVAYKKKIPLLGDG 259 + K I N G + + D + + +N + G+ Sbjct: 209 QFIYKKGDNYSLILNNHEVGGNNLKLVGDAYTYGIAGTPYKVN-----HENNGLIGFGNS 263 Query: 260 NEVNGIYVLPFTNDNFRNKLYIGDSGSGFFAYDTLNNKWVLVGVTSVANG--------TQ 311 E + + D N +GDSGS F YD KW+ +G G Sbjct: 264 KEEHSDPKGILSQDPLTNYAVLGDSGSPLFVYDREKGKWLFLGSYDFWAGYNKKSWQEWN 323 Query: 312 NYASIVTARDFNDYKKGY-------------------ENLVSGVNVLGSA---LVQNKDN 349 Y S T N G +NV + + + Sbjct: 324 IYKSQFTKDVLNKDSAGSLIGSKTDYSWSSNGKTSTITGGEKSLNVDLADGKDKPNHGKS 383 Query: 350 IFSSANGSNITLSTNLDLGHGGIVVNSGDFTLNSTNGSKIAKFAGFDIARGASLNLNVTS 409 + +G+ +TL+ N+D G GG+ GD+ + T+ + K AG +A G ++ V + Sbjct: 384 VTFEGSGT-LTLNNNIDQGAGGLFFE-GDYEVKGTSDNTTWKGAGVSVAEGKTVTWKVHN 441 Query: 410 DTS--VHKLGKGSLIVSSSGNKP--LRLGEGVVELR------ALNAFDKIYLTSGRGLLR 459 + K+GKG+LIV +G+ L++G+G V L+ +AF + + SGR L Sbjct: 442 PQYDRLAKIGKGTLIVEGTGDNKGSLKVGDGTVILKQQTNGSGQHAFASVGIVSGRSTLV 501 Query: 460 LGVNENLN-DKIFFGNGGGALDLNGFDQTFDNISANSSDAKITNAN-SQRATLTINGES- 516 L ++ ++ + I+FG GG LDLNG TFD+I A++ N N + + +TI GES Sbjct: 502 LNDDKQVDPNSIYFGFRGGRLDLNGNSLTFDHIRNIDDGARLVNHNMTNASNITITGESL 561 Query: 517 -------GKDTIFHASIDKNIELRHSGQGKELVFDG 545 I D R G +L + Sbjct: 562 ITDPNTITPYNIDAPDEDNPYAFRRIKDGGQLYLNL 597
>PF02370#M protein repeat Length = 168 Score = 33.2 bits (75), Expect = 0.002 Identities = 25/116 (21%), Positives = 54/116 (46%), Gaps = 7/116 (6%) Query: 177 KKQIAYSFFVEENLEQR-LLKLIDYVIEEIEANKLQKEIKNKVHSKIDKTNKEYFLKEQL 235 + Y + EN + R IEE+E + +K+ + + K ++ +++ +EQ Sbjct: 45 ENDPQYRALMGENQDLRKREGQYQDKIEELEKERKEKQERPERREKFERQHQDKHYQEQQ 104 Query: 236 KQIQAELGADTSREEELEEYRKKLDAKKKFMAED------AYKEIKKQIDKLSRMH 285 K+ Q E + +++L + ++ DA ++ + D A KE++ + KL H Sbjct: 105 KKHQQEQQQLEAEKQKLAKEKQISDASRQGLNRDLEASRAAKKELEPKHQKLGTEH 160
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 31.6 bits (71), Expect = 0.010 Identities = 37/192 (19%), Positives = 78/192 (40%), Gaps = 9/192 (4%) Query: 471 EEKAKLLASSVSKVASSANTQANSLQESAAAVEQMS---SSMNAISQKTADVIRQSDEIK 527 E A + A++ A TQA + AA E + ++ +A++Q+ D++ ++ Sbjct: 46 ESSAAIHATAKWSTAQLKKTQAEQAARAKAAAEAQAKAKANRDALTQRLKDIVNEALRHN 105 Query: 528 NIITIIRDIADQTNLLALNAAIEA---ARAGEHGRGFAVVADEVRKLAERTQKSLGEIEA 584 T N A+ A E A+A E R A A++ + AE+ +K + +A Sbjct: 106 ASRTPSATELAHANNAAMQAEDERLRLAKAEEKARKEAEAAEKAFQEAEQRRKEIEREKA 165 Query: 585 NTN---VLAQSINEMSESIKEQSEGINMINQSVAQIDHLTKENVVIANQANEVTSEVDEM 641 T LA++ + ++ E+++ + + + ++ + N S Sbjct: 166 ETERQLKLAEAEEKRLAALSEEAKAVEIAQKKLSAAQSEVVKMDGEIKTLNSRLSSSIHA 225 Query: 642 AKAIVEEVRKKR 653 A ++ + KR Sbjct: 226 RDAEMKTLAGKR 237
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 33.5 bits (76), Expect = 0.002 Identities = 31/208 (14%), Positives = 81/208 (38%), Gaps = 10/208 (4%) Query: 289 LNNKEADEENKNLDDAELDTANLEQEELNLDELAKFDDENSLENELNLEDEPKDEENLDE 348 ++ ++++ ++ +E+ E + E + E + E + N++ + E Sbjct: 1029 APATPSETTETVAENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQS 1088 Query: 349 ISEADEEQIQVDDEKAEEDIEEEALDEISSEELENLESSESENSSNEMPVEELEDVSEPE 408 SE E Q E A +E+E ++ +E+ + + S+ S + E ++ +EP Sbjct: 1089 GSETKETQTTETKETA--TVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPA 1146 Query: 409 AKEDLGLVDETFEEENAQEENVKDDNKDAASSELNFDASSIDDIDENTMLAAFG-LKDIP 467 + D T + Q + + + + E + ++ + E+T + + + P Sbjct: 1147 REN-----DPTVNIKEPQSQTNTTADTEQPAKETS--SNVEQPVTESTTVNTGNSVVENP 1199 Query: 468 QTSSKNDAKEDYKEELTKKITKHVHESL 495 + ++ + E + K S+ Sbjct: 1200 ENTTPATTQPTVNSESSNKPKNRHRRSV 1227
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 118 bits (297), Expect = 2e-34 Identities = 63/240 (26%), Positives = 121/240 (50%), Gaps = 2/240 (0%) Query: 14 VFMLLFARLSGLIVFFPFYSHNQIPLSVKTLLVFVLCVVLFPLSKAHENSIN--FLVGEI 71 ++ R+ LI P S +P VK L ++ + P A++ + F + Sbjct: 15 LYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPANDVPVFSFFALWLA 74 Query: 72 LGEVMLGLSAGLMLTIIFATLQMAGEQISMVMGFSMASVLDPQTGTNSPVIANLINFIAL 131 + ++++G++ G + FA ++ AGE I + MG S A+ +DP + N PV+A +++ +AL Sbjct: 75 VQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASHLNMPVLARIMDMLAL 134 Query: 132 LTFLAFDGHHLLLQFYASSLAVVPLGDFYPRPGIMSYAINLFTNLFMFGFIMSFPIIALS 191 L FL F+GH L+ + +P+G + +F+ G +++ P+I L Sbjct: 135 LLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLIFLNGLMLALPLITLL 194 Query: 192 LLSDSIFGMLMKTMPQFNLLVIGYPIKVTIGFSVLIAILAGIMKIMSDLLLKVINDLPAL 251 L + G+L + PQ ++ VIG+P+ +T+G S++ A++ I L ++ N L + Sbjct: 195 LTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPFCEHLFSEIFNLLADI 254
>SECFTRNLCASE#Bacterial translocase SecF protein signature. Length = 333 Score = 30.6 bits (69), Expect = 0.004 Identities = 22/102 (21%), Positives = 44/102 (43%), Gaps = 10/102 (9%) Query: 74 LLAIRRLHFGIIFQSHYLFKGFSAYENIELASILSGENIEKNDLEALKISSVINQKVGEL 133 L + L+FGI F+ + + I++ + + LE L++ VI +V + Sbjct: 37 LPLVIGLNFGIDFKGGTTIR-TESTTAIDVG-------VYRAALEPLELGDVIISEVRDP 88 Query: 134 SGGQQQRVSIARVLTKKPKIIFADEPTGNLDKQTANEVMQVL 175 S + Q V++ R+ ++ E G ++ N+V L Sbjct: 89 SFREDQHVAMIRIQMQEDGQ--GAEGQGAQGQELVNKVETAL 128
>CHANLCOLICIN#Channel forming colicin signature. Length = 522 Score = 30.0 bits (67), Expect = 0.008 Identities = 21/81 (25%), Positives = 34/81 (41%), Gaps = 5/81 (6%) Query: 140 MGEKFGSRAGEMSVFVGANIKGGCYEVGELDLGEFNAYKIGRNFDMNAALRDE-FNALGV 198 + EK+G + +M+ + KG L F YK N + A RD FNAL Sbjct: 359 LTEKYGEKYSKMAQELADKSKGKKIGNVNEALAAFEKYKDVLNKKFSKADRDAIFNAL-- 416 Query: 199 RNLNFSEVCTHCDE--RYFSY 217 ++ + + H D+ +Y Sbjct: 417 ASVKYDDWAKHLDQFAKYLKI 437
>cloacin#Cloacin signature. Length = 551 Score = 30.5 bits (68), Expect = 0.023 Identities = 27/112 (24%), Positives = 51/112 (45%), Gaps = 15/112 (13%) Query: 461 AQMLKNNEDVAKLLEDRAKALKGY--MQELTTKANKSATSLSEGAAAVEQ--------MS 510 A++ + NEDVA+ E +AKA++ Y + ANK +L++ A ++Q M+ Sbjct: 328 AELNQANEDVARNQERQAKAVQVYNSRKSELDAANK---TLADAIAEIKQFNRFAHDPMA 384 Query: 511 ASMRQVNARSDDVKRQSEEIKNIITIIHDIADQTNL--LALNAAIEAARAGE 560 R +R ++ N A + + AL++A+E+ + E Sbjct: 385 GGHRMWQMAGLKAQRAQTDVNNKQAAFDAAAKEKSDADAALSSAMESRKKKE 436
>FbpA_PF05833#Fibronectin-binding protein Length = 577 Score = 127 bits (320), Expect = 5e-34 Identities = 70/291 (24%), Positives = 123/291 (42%), Gaps = 27/291 (9%) Query: 168 EAFFKSEAARINEARIASLKEAKLASVQKKIDSMSEILNSLEDKDELMKKSEEFANYGTL 227 E F+ ++ R+ S V I+ ++ L + + + + F YG L Sbjct: 285 ENFYYAKDKS---DRLKSKSSDLQKIVMNNINRCTKKDKILNNTLKKCEDKDIFKLYGEL 341 Query: 228 LLANLANFKGYEREICLKDF---DGNEIKLTLSD--TPKNSANEFYSRSKKLRAKALGVE 282 L AN+ K I L ++ + + +K+TL + TP + +Y + KL+ Sbjct: 342 LTANIYALKKGLSHIELANYYSENYDTVKITLDENKTPSQNVQSYYKKYNKLKKSEEAAN 401 Query: 283 IEKRNLSEKIEFLEGLKSLLKEAKSAYELE----------ILSPKNKAKQRERQIKDVSE 332 + E++ +L + + + A + E+E + K K ++ + S+ Sbjct: 402 EQLLQNEEELNYLYSVLTNINNADNYDEIEEIKKELIETGYIKFKKIYKSKKSK---TSK 458 Query: 333 NAEIFYIREFKILVGRNEKGNINL-LDLAKKDDIWLHLKDAPSAHVIIKTNKSKVPEDVL 391 I VG+N N L L A K DIW H K+ P +HVI+K +PE L Sbjct: 459 PMHFISKDGIDIYVGKNNIQNDYLTLKFANKHDIWFHTKNIPGSHVIVKNIMD-IPESTL 517 Query: 392 EMAAKFCVEFS-VKGAGRYEVDYTKRENLRRENGAN---VTYTNYKTIIIN 438 AA +S + + VDYT+ +N+++ NGA V Y+ +TI + Sbjct: 518 LEAANLAAYYSKSQNSSNVPVDYTEVKNVKKPNGAKPGMVIYSTNQTIYVT 568 Score = 46.4 bits (110), Expect = 1e-07 Identities = 33/155 (21%), Positives = 68/155 (43%), Gaps = 7/155 (4%) Query: 38 KIIFDLNKSNSAIYKDDELKEAKIYQAPFDNVLKKRFNASHIKSVECLKDNRILKFTCTQ 97 K++ + + I+ D K I F VL+K + + I + + +RI+ Sbjct: 47 KLLISSSSNYPRIHLTDLTKPNPIKAPMFCMVLRKYISNAKIVDIHQINQDRIVVIDFES 106 Query: 98 SGSY-KSENFILYLEFTGRFTNAVITD-ENDVIIEALRHID---NSYRKIETGEVLKELP 152 + + + L +E GR +N + +++I+++++HI N+YR I G P Sbjct: 107 TDELGFNSIYSLIIEIMGRHSNMTLIRKRDNIIMDSIKHITPDINTYRSIYPGIEY-VYP 165 Query: 153 AIAIKEKPCEPITD-FEAFFKSEAARINEARIASL 186 + K P + D E F K + ++N+ + + Sbjct: 166 PKSPKLNPFDFSYDMIENFTKENSLQLNDNIFSKI 200
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 62.9 bits (153), Expect = 4e-12 Identities = 29/155 (18%), Positives = 70/155 (45%), Gaps = 6/155 (3%) Query: 666 TVAILFVIFCFVFRSIKLATIAIVSNLIPLCTLFGVMGFFGIPLDVMSITIAAISIGIGV 725 + + V++ F+ ++++ I ++ + L F ++ FG ++ +++ ++IG+ V Sbjct: 348 IMLVFLVMYLFL-QNMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLV 406 Query: 726 DDIIHYIHRFKEELLTKGV--FESIKAAHASIGYAMYYTSFTIFLGF-SVMITSNFIPTI 782 DD I + + ++ + E+ + + + I A+ + + F + I Sbjct: 407 DDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAI 466 Query: 783 Y--FGLLTDLVMVFMLLGALIILPSLIASFVKKRE 815 Y F + M +L ALI+ P+L A+ +K Sbjct: 467 YRQFSITIVSAMALSVLVALILTPALCATLLKPVS 501 Score = 33.3 bits (76), Expect = 0.005 Identities = 20/129 (15%), Positives = 47/129 (36%), Gaps = 6/129 (4%) Query: 652 LQNLLSSQVDTFGLTVAILFVIFCFVFRSIKLATIAIVSNLIPLCTLFGVMG--FFGIPL 709 + + ++ ++F+ ++ S + ++ +PL + ++ F Sbjct: 865 ERLSGNQAPALVAISFVVVFLCLAALYESWSIPVSVMLV--VPLGIVGVLLAATLFNQKN 922 Query: 710 DVMSITIAAISIGIGVDDIIHYIHRFKEELLTKG--VFESIKAAHASIGYAMYYTSFTIF 767 DV + +IG+ + I + K+ + +G V E+ A + TS Sbjct: 923 DVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFI 982 Query: 768 LGFSVMITS 776 LG + S Sbjct: 983 LGVLPLAIS 991
>VACJLIPOPROT#VacJ lipoprotein signature. Length = 251 Score = 165 bits (420), Expect = 5e-53 Identities = 73/236 (30%), Positives = 107/236 (45%), Gaps = 20/236 (8%) Query: 5 LAIFCSLLLACASTDLNANSEKDDFDVEFEAKKDVFDPLSGYNRVMTNVN-DFIYINMLT 63 LA+ +LL+ CAS+ + D PL G+NR M N N + + ++ Sbjct: 8 LALGTTLLVGCASSGTDQQGRSD--------------PLEGFNRTMYNFNFNVLDPYIVR 53 Query: 64 PVAKGYAYVVPSTARTMVANFFDNLLFPVRFVNNLLQFKFQNAGEETLRFLANTIIGFGG 123 PVA + VP AR ++NF NL P VN LQ RF NTI+G GG Sbjct: 54 PVAVAWRDYVPQPARNGLSNFTGNLEEPAVMVNYFLQGDPYQGMVHFTRFFLNTILGMGG 113 Query: 124 LTDGAKYYDLKAHNED---FRQTLGYWGLGSGFHIVWPLIGPSNLRDTGGLVGDYFADPI 180 D A + K + F TLG++G+G G ++ P G LRD GG + D + Sbjct: 114 FIDVAGMANPKLQRTEPHRFGSTLGHYGVGYGPYVQLPFYGSFTLRDDGGDMADALYPVL 173 Query: 181 SYVDPMLLSVGIESYRTFNSFAQDPTAYEKLRKDAIDLYPFLRDAYEQRRDKLIKE 236 S++ +SVG + + AQ + L + + D Y +R+AY QR D + Sbjct: 174 SWLT-WPMSVGKWTLEGIETRAQLLDSDG-LLRQSSDPYIMVREAYFQRHDFIANG 227