>PERTACTIN#Pertactin signature. Length = 922 Score = 32.8 bits (74), Expect = 0.004 Identities = 24/93 (25%), Positives = 31/93 (33%) Query: 81 PKAGQRSPAGATPLAPRAPLPSANPAPVGPGPACAPAVDAHAPAPAGMNAATAAAVAAAQ 140 P A + +P P+ P P P P P +A AP P +AAA AA Sbjct: 568 PPAPKPAPQPGPQPGPQPPQPPQPPQPPQPPQPPQRQPEAPAPQPPAGRELSAAANAAVN 627 Query: 141 AAQAAQANAAALNADEAADLDLPSLTAHEAAAG 173 A+ A L L + A G Sbjct: 628 TGGVGLASTLWYAESNALSKRLGELRLNPDAGG 660
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 39.2 bits (91), Expect = 9e-07 Identities = 18/97 (18%), Positives = 28/97 (28%), Gaps = 2/97 (2%) Query: 18 AGCAAFAPRDAAKLECTMPVAAYPENAKPLERRATVLVRAMITASGNAENVTVTTSSRNA 77 + L P YP A+ L V V+ +T G +NV + ++ Sbjct: 147 SKPVTSVASGPRALSRNQPQ--YPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPAN 204 Query: 78 AADRAAVDAMSRIACSQTPARGGEPYPFTLTRPFVFE 114 +R +AM R G E Sbjct: 205 MFEREVKNAMRRWRYEPGKPGSGIVVNILFKINGTTE 241
>PF06580#Sensor histidine kinase Length = 349 Score = 29.1 bits (65), Expect = 0.048 Identities = 17/107 (15%), Positives = 41/107 (38%), Gaps = 14/107 (13%) Query: 205 AALSALLSVGLALTVSRGPWLQVG-----------VMVVAGFWMAFA-QARRDPA--ASR 250 + + +L+ + R WL++ +V+ W R A ++ Sbjct: 49 SLMGLVLTHAYRSFIKRQGWLKLNMGQIILRVLPACVVIGMVWFVANTSIWRLLAFINTK 108 Query: 251 ARAWAIPVVLGVLFVAVNVAVRWANVHYHLGLAESAADRMRDAGQIA 297 A+ +P+ L ++F V V W+ +++ ++ D ++A Sbjct: 109 PVAFTLPLALSIIFNVVVVTFMWSLLYFGWHFFKNYKQAEIDQWKMA 155
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 42.2 bits (99), Expect = 2e-07 Identities = 18/59 (30%), Positives = 30/59 (50%), Gaps = 5/59 (8%) Query: 15 RRRLRARGFTLIELMIVLAIVGVVAAYAIPAYQDYLARSRVGEGLALAASARLAVAENA 73 R + RGFTL+E+M+V+ I+GV+A+ +P + A + + ENA Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLM-----GNKEKADKQKAVSDIVALENA 55
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 98.8 bits (246), Expect = 8e-26 Identities = 35/118 (29%), Positives = 64/118 (54%), Gaps = 1/118 (0%) Query: 2 RILIAEDDSILADGLTRSLRQSGYAVDHVRNGVEADTALSMQTFDLLILDLGLPRMSGLE 61 IL+A+DD+ + L ++L ++GY V N ++ DL++ D+ +P + + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VLRRLRARNSNLPVLILTAADSVDERVKGLDLGADDYMAKPFALNE-LEARVRALTRR 118 +L R++ +LPVL+++A ++ +K + GA DY+ KPF L E + RAL Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122
>PF06580#Sensor histidine kinase Length = 349 Score = 47.2 bits (112), Expect = 7e-08 Identities = 44/198 (22%), Positives = 78/198 (39%), Gaps = 49/198 (24%) Query: 287 LAGLRTQAEF-ALRHEVNADVAH----SLEQIA----TSSEQAARLVTQLLALARAENRA 337 +A + +A+ AL+ ++N H +L I +A ++T L L R Sbjct: 154 MASMAQEAQLMALKAQINP---HFMFNALNNIRALILEDPTKAREMLTSLSELMRY---- 206 Query: 338 TGLTFEPVEIASLARQ--AVRDWV---QVALAKQMDLGYESPDTDAPLRIDGQPVMLREM 392 L + SLA + V ++ + ++ + +++ P ML Sbjct: 207 -SLRYSNARQVSLADELTVVDSYLQLASIQFEDRLQFENQINPAIMDVQV---PPML--- 259 Query: 393 LGNLIDNAIRY----TPAGGRITVRVRAERAAGAVHLEVEDTGPGIPPNERERVVERFYR 448 + L++N I++ P GG+I ++ + G V LEVE+TG N +E Sbjct: 260 VQTLVENGIKHGIAQLPQGGKILLKGTKDN--GTVTLEVENTGSLALKNTKE-------- 309 Query: 449 ILGREGDGSGLGLAIVRE 466 +G GL VRE Sbjct: 310 -------STGTGLQNVRE 320
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 34.8 bits (80), Expect = 8e-04 Identities = 47/264 (17%), Positives = 93/264 (35%), Gaps = 28/264 (10%) Query: 77 AIVFGRLGDLVGRKHTFLITIVIMGISTFVVGFLPGYASIGIAAPVIFIAMRLLQGLALG 136 A V G L D GR+ L+++ + ++ P V++I R++ G+ G Sbjct: 60 APVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAP-------FLWVLYIG-RIVAGIT-G 110 Query: 137 GEYGGAATYVAEHAPSHRRGFYTSWIQTTATLGLFLSLLVILGVRTAIGEEAFGSWGWRV 196 A Y+A+ R + ++ G+ + G +G G + Sbjct: 111 ATGAVAGAYIADITDGDERARHFGFMSACFGFGM------VAG--PVLGGLM-GGFSPHA 161 Query: 197 PFVASILLLAVSVWIRLQLNESPVFLRIKAEGKTSKAPLTEAFGQWKNLKIVILALIGLT 256 PF A+ L ++ L + + + PL + + L Sbjct: 162 PFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASF-----RWARGMTVVAALM 216 Query: 257 AGQAVVWYTGQFYA---LFFLTQTLKVDGASANILIALALLIGTPF-FVFFGSLSDRIGR 312 A ++ GQ A + F D + I +A ++ + + G ++ R+G Sbjct: 217 AVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGE 276 Query: 313 KPIILAGCLIAALTYFPLFKALTH 336 + ++ G +IA T + L T Sbjct: 277 RRALMLG-MIADGTGYILLAFATR 299 Score = 34.8 bits (80), Expect = 8e-04 Identities = 17/42 (40%), Positives = 24/42 (57%) Query: 287 ILIALALLIGTPFFVFFGSLSDRIGRKPIILAGCLIAALTYF 328 IL+AL L+ G+LSDR GR+P++L AA+ Y Sbjct: 47 ILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYA 88
>PF05272#Virulence-associated E family protein Length = 892 Score = 30.0 bits (67), Expect = 0.017 Identities = 14/35 (40%), Positives = 17/35 (48%) Query: 32 VVFVGPSGCGKSTLMRMIAGLEEISGGELLIDGAK 66 VV G G GKSTL+ + GL+ S I K Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGK 633
>PF06776#Invasion associated locus B Length = 214 Score = 33.8 bits (77), Expect = 0.001 Identities = 20/80 (25%), Positives = 25/80 (31%), Gaps = 11/80 (13%) Query: 60 RLCRRIGGRHAAGP------APARESPSENSMKTGRRHFVRSVASASAALAAAAWSPARA 113 R+ RR HA PA SP + + RR R+ A A A A Sbjct: 10 RISRRPVTNHAVPALKAIQMGPAELSPM---LASCRRLARRNGARLMLAGAMAI--ALSF 64 Query: 114 AIDAPASPATALSLTPGRWS 133 A A+ G W Sbjct: 65 GWSDRADAQGAVRSVHGDWQ 84
>BLACTAMASEA#Beta-lactamase class A signature. Length = 286 Score = 29.8 bits (67), Expect = 0.019 Identities = 11/35 (31%), Positives = 15/35 (42%) Query: 57 REDALFRFASVSKPIVSAAAMRAVAAGKLDLDASI 91 R D F S K ++ A + V AG L+ I Sbjct: 57 RADERFPMMSTFKVVLCGAVLARVDAGDEQLERKI 91
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 35.6 bits (82), Expect = 3e-04 Identities = 31/155 (20%), Positives = 59/155 (38%), Gaps = 5/155 (3%) Query: 26 LLALATAGFITIVTEALPAGLLPLMGRDLRVSDALVGQLVTVYAAGSIVAAMPLVAATRG 85 L+ L F +++ E + LP + D A + T + + + Sbjct: 16 LIWLCILSFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQ 75 Query: 86 MRRRPLLLAALAGFVVANTATAASPYYAPVLV-ARCVAGVSAGLLWALLAGYASRMVDAR 144 + + LLL + + + +L+ AR + G A AL+ +R + Sbjct: 76 LGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKE 135 Query: 145 QRGRAIAIAMLGAPVAMSVGI-PL-GTALGAALGW 177 RG+A ++G+ VAM G+ P G + + W Sbjct: 136 NRGKAFG--LIGSIVAMGEGVGPAIGGMIAHYIHW 168
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 40.2 bits (94), Expect = 1e-05 Identities = 88/424 (20%), Positives = 144/424 (33%), Gaps = 52/424 (12%) Query: 50 AVLLAAFAIVLDGFDSQLIGFAIPVLIKEWGITRDA---FAPAVAAGLFGMGVGSACAGL 106 +++ + LD LI +P L+++ + D + +A + G Sbjct: 6 PLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGA 65 Query: 107 FADRFGRRWAIIGSVFVFGAATCAIGFAPNVATIAALRFVAGLGIGGALPTATTMTAEYT 166 +DRFGRR ++ S+ + AP + + R VAG+ G A A+ T Sbjct: 66 LSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGI-TGATGAVAGAYIADIT 124 Query: 167 PARRRTMMVTATIVCVPAGGMLAGLFAHEVLPAYGWRGLFWLGGALPLALGLLLVRALPE 226 R C GM+AG ++ + F+ AL L LPE Sbjct: 125 DGDERARHFGFMSACF-GFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPE 183 Query: 227 SPRYLARNPARWRELGALLARMGRPVADGTAFTDLAEARAHEGQRRGVRALFSAAYARDT 286 S + R P R L AR V AL + + Sbjct: 184 SHKG-ERRPLRREALNP--------------LASFRWARG----MTVVAALMAVFF---I 221 Query: 287 IALWCAFCMCLLAVYSA--FSWLPTMLTSQGLSVSVAGSGLTAYNLGGVLGALGCALAIG 344 + L L ++ F W T + +S+A G+L +L A+ G Sbjct: 222 MQLVGQVPAALWVIFGEDRFHWDATTI-----GISLAA--------FGILHSLAQAMITG 268 Query: 345 RFGSRW-PLAFCCAGGAASAAWLLGVDAGSHAGWLI----VGLAAHGFFVNAVQSTMYAL 399 +R G A + + + GW+ V LA+ G + A+Q+ + Sbjct: 269 PVAARLGERRALMLGMIADGTGYILLAFATR-GWMAFPIMVLLASGGIGMPALQAMLSRQ 327 Query: 400 CTFIYPTPVRATGTAGAVAFGRVGAILSAFAGAYVISAGGANAYLAMLAAAMAVVLVALL 459 ++ + A VG +L F Y S N + + AA+ L+ L Sbjct: 328 VDEERQGQLQGSLAALTSLTSIVGPLL--FTAIYAASITTWNGWAWIAGAAL--YLLCLP 383 Query: 460 ALRR 463 ALRR Sbjct: 384 ALRR 387
>SURFACELAYER#Lactobacillus surface layer protein signature. Length = 439 Score = 30.4 bits (68), Expect = 0.028 Identities = 18/49 (36%), Positives = 25/49 (51%), Gaps = 1/49 (2%) Query: 13 RLAAACAAALAWPAAHAASTAAAVPADSTPAAAAEMTASGKTLDTVKVT 61 R+ +A AAAL A A+TA V A +T A + + A+ V VT Sbjct: 6 RIVSAAAAALL-AVAPIAATAMPVNAATTINADSAINANTNAKYDVDVT 53
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 168 bits (426), Expect = 4e-53 Identities = 75/146 (51%), Positives = 99/146 (67%), Gaps = 3/146 (2%) Query: 74 AQAPAPAPVAPVAPAITSQKITYQADTLFDFDKAVLKPAGKQKLDELAAKIQGMNVE--V 131 AP AP AP + ++ T ++D LF+F+KA LKP G+ LD+L +++ ++ + Sbjct: 195 EAAPVVAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKDGS 254 Query: 132 VVATGYTDRIGSDKYNDRLSLRRAQAVKSYLVSKGVPANKVYTEGKGKRNPVTGNTC-KQ 190 VV GYTDRIGSD YN LS RRAQ+V YL+SKG+PA+K+ G G+ NPVTGNTC Sbjct: 255 VVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVTGNTCDNV 314 Query: 191 KNRKQLIACLAPDRRVEVEVVGTQEV 216 K R LI CLAPDRRVE+EV G ++V Sbjct: 315 KQRAALIDCLAPDRRVEIEVKGIKDV 340
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 354 bits (910), Expect = e-118 Identities = 145/453 (32%), Positives = 217/453 (47%), Gaps = 46/453 (10%) Query: 174 VHVARSANEAARRVKPNQPQAGIADL---DGFAPRELPTLEAVLRQQQVGWIALAGDTRI 230 V + +A R + + D+ D A LP ++ V + ++ Sbjct: 30 VRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKARPDLPV--LVMSAQNTF 87 Query: 231 NDPDVRRLIRQYCFDYMQGLPPHETIDYLVGHAYGMVALCDLDVTAGAAATGDEMVGACD 290 + + +DY+ + ++G A + G +VG Sbjct: 88 MTA--IKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR-RPSKLEDDSQDGMPLVGRSA 144 Query: 291 AMQQLFRTIRKVAATDATVFISGESGTGKELTALAIHERSERRKAPFVAINCGAIPNHLL 350 AMQ+++R + ++ TD T+ I+GESGTGKEL A A+H+ +RR PFVAIN AIP L+ Sbjct: 145 AMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLI 204 Query: 351 QSELFGYERGAFTGASQRKVGRVEAADGGTLFLDEIGDMPLESQASMLRFLQEGKIERLG 410 +SELFG+E+GAFTGA R GR E A+GGTLFLDEIGDMP+++Q +LR LQ+G+ +G Sbjct: 205 ESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVG 264 Query: 411 GHESIPVDVRIISATHVDLDAAMREGRFRDDLYHRLCVLKLDEPPLRARGKDIEILAHHI 470 G I DVRI++AT+ DL ++ +G FR+DLY+RL V+ L PPLR R +DI L H Sbjct: 265 GRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHF 324 Query: 471 LHQFRSDGARRIHGFTSCAIEAMYNYHWPGNVRELINRIRRAIVMSDSRQLSAADLDL-- 528 + Q +G + F A+E M + WPGNVREL N +RR + ++ ++ Sbjct: 325 VQQAEKEG-LDVKRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENEL 383 Query: 529 -----------------------------------APFAARQATTLAEARERAERRTIEA 553 A + E I A Sbjct: 384 RSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILA 443 Query: 554 SLLRHRNHLTEAAAELGVSRATLYRLMVSHGLR 586 +L R + +AA LG++R TL + + G+ Sbjct: 444 ALTATRGNQIKAADLLGLNRNTLRKKIRELGVS 476
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 376 bits (968), Expect = e-129 Identities = 133/388 (34%), Positives = 202/388 (52%), Gaps = 40/388 (10%) Query: 101 FDYVTVPYECDRIVESVGHAYGMVTLSEGLAPAAATVRNEGEMVGTCEAMLALFKMIRKV 160 +DY+ P++ ++ +G A + + ++ +VG AM +++++ ++ Sbjct: 99 YDYLPKPFDLTELIGIIGRA--LAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARL 156 Query: 161 ASTDAPVFISGESGTGKELTAVAIHERSSRAGAPFVAINCGAIPPTLLQAELFGYERGAF 220 TD + I+GESGTGKEL A A+H+ R PFVAIN AIP L+++ELFG+E+GAF Sbjct: 157 MQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAF 216 Query: 221 TGANQRKIGRIEAANGGTLFLDEIGDLPFESQASLLRFLQEHKVERVGGHQSIPVDVRII 280 TGA R GR E A GGTLFLDEIGD+P ++Q LLR LQ+ + VGG I DVRI+ Sbjct: 217 TGAQTRSTGRFEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIV 276 Query: 281 SATHVDMQIALRNGRFREDLYHRLCVLKLEEPPLRERGKDIEILARHMLERFKGDAHRRL 340 +AT+ D++ ++ G FREDLY+RL V+ L PPLR+R +DI L RH +++ + + + Sbjct: 277 AATNKDLKQSINQGLFREDLYYRLNVVPLRLPPLRDRAEDIPDLVRHFVQQAEKEG-LDV 335 Query: 341 RGFTPDAIAALHNYAWPGNVRELINRVRRAIVMSEGRMISAADLELSGYAEVA------- 393 + F +A+ + + WPGNVREL N VRR + +I+ +E +E+ Sbjct: 336 KRFDQEALELMKAHPWPGNVRELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKA 395 Query: 394 ------------------------------PMSLEEARESAERHAIEVALLRHRGRLADA 423 + E I AL RG A Sbjct: 396 AARSGSLSISQAVEENMRQYFASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKA 455 Query: 424 ARELGVSRVTLYRLLCAYGMRDDDGARA 451 A LG++R TL + + G+ +R+ Sbjct: 456 ADLLGLNRNTLRKKIRELGVSVYRSSRS 483
>adhesinmafb#Neisseria meningitidis: adhesin MafB signature. Length = 467 Score = 32.0 bits (72), Expect = 0.002 Identities = 17/67 (25%), Positives = 28/67 (41%), Gaps = 3/67 (4%) Query: 36 SARPAGELTMIAGLSPSAASAHLARLTDGGLLAL---DVRGRHRYYRIATPDIAAAIEAL 92 R A + + ++P A A + G +A + R + P+ A +EA+ Sbjct: 254 GTRYAIDKAAMRNIAPLPAEGKFAVIGGLGSVAGFEKNTREAVDRWIQENPNAAETVEAV 313 Query: 93 ANVAQAA 99 NVA AA Sbjct: 314 FNVAAAA 320
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 49.3 bits (117), Expect = 2e-07 Identities = 55/329 (16%), Positives = 96/329 (29%), Gaps = 32/329 (9%) Query: 280 RAQARPTAPDPRFAPRRPATQAAVSAARNRPMTFTPSRQTTGATPPQPAPPAQTAAPTAE 339 + R D QA V + + PP PA P++T AE Sbjct: 984 EVEKRNQTVDTTNITTPNNIQADVPSVPSNNEEIARV-DEAPVPPPAPATPSETTETVAE 1042 Query: 340 TARKRAPANPARAPLYAWHEKPAERIAPAASVHETLRSIEASAAQWTALAGATSTAATPV 399 +++ + + E A V + +S + Q +A + S Sbjct: 1043 NSKQESKTVEKN------EQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQ 1096 Query: 400 TARESIAAPAAPSGGAAASAARDGRAPTSAETAAPDGHAPTSAETVAPDGHVPTSAETAA 459 T A A + P +P ETV P AE A Sbjct: 1097 TTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQS---ETVQP------QAEPAR 1147 Query: 460 PDGHVPTSAETAAPNDHASTSAETVAPDSHAPTSAETAAPDGHASTITEATAPNGHVSAT 519 + E + +T+A+T P ++ E + T T + + Sbjct: 1148 ENDPTVNIKEPQSQT---NTTADTEQPAKETSSNVEQPV----TESTTVNTGNSVVENPE 1200 Query: 520 VETSAVAAPAGITQAAPPIAADICPAGEHVIAAVEPACTSDSAAIGAGAIAHAEAGAAAS 579 T A P ++++ + V VEPA TS + +A + + + Sbjct: 1201 NTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSN---DRSTVALCDLTSTNT 1257 Query: 580 TAET------ASPIGADTHIAPSREADRT 602 A A + + A S+ + Sbjct: 1258 NAVLSDARAKAQFVALNVGKAVSQHISQL 1286 Score = 41.6 bits (97), Expect = 3e-05 Identities = 53/315 (16%), Positives = 96/315 (30%), Gaps = 51/315 (16%) Query: 578 ASTAETASPIGADTHIAPSREADRTA-QTAPTAPSPAEATPHVDAPHALDVAARALVGNT 636 + T + I AD PS + AP P PA ATP + Sbjct: 994 TTNITTPNNIQADVPSVPSNNEEIARVDEAPVPP-PAPATP----------SETTET-VA 1041 Query: 637 AATAHGAAAVNGSAQRADTASPAASTSGPPAPVAASAASSDRAAPQPVATAAPASIATSG 696 + + V + Q A + VA A S+ +A Q + Sbjct: 1042 ENSKQESKTVEKNEQDATETTAQNRE------VAKEAKSNVKANTQ-------TNEVAQS 1088 Query: 697 ALGTMKASGTAGPQPSTIAAQRASAIDDTGQPPSTGHSTHAAVSNELGRRPHAAPDAVTP 756 T + T + +T+ + + ++ ++ + E + V P Sbjct: 1089 GSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQVSPKQE-------QSETVQP 1141 Query: 757 VLPPAAAAVPTNASAVQRQALASESAEAAQGVARAAAAGDSRETTQVSPAGARPDGAAPS 816 PA PT + + Q+ + +A+ Q ++ + T + + Sbjct: 1142 QAEPARENDPT-VNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTV-------NTGN 1193 Query: 817 AAVANPIAPLPDASAITAHEDAPT--------SAAPDAATPVIAAMDSAMPNAVAPASAI 868 + V NP P + T + ++ S A S + VA Sbjct: 1194 SVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLT 1253 Query: 869 A--SNAGMSPASASA 881 + +NA +S A A A Sbjct: 1254 STNTNAVLSDARAKA 1268
>SECGEXPORT#Protein-export SecG membrane protein signature. Length = 110 Score = 82.7 bits (204), Expect = 8e-24 Identities = 46/102 (45%), Positives = 68/102 (66%), Gaps = 1/102 (0%) Query: 8 IIVVQLLSALGVIGLVLLQHGKGADMGAAFGSGASGSLFGATGSANFLSRTTAVLATIFF 67 ++VV L+ A+G++GL++LQ GKGADMGA+FG+GAS +LFG++GS NF++R TA+LAT+FF Sbjct: 5 LLVVFLIVAIGLVGLIMLQQGKGADMGASFGAGASATLFGSSGSGNFMTRMTALLATLFF 64 Query: 68 VATLALTYLGSYKSAPSVGVLGAAPAPAASAPAASQTPAASA 109 + +L L + S K+ APA + PA Sbjct: 65 IISLVLGNINSNKTNKGSEWEN-LSAPAKTEQTQPAAPAKPT 105
>OUTRMMBRANEA#Outer membrane protein A signature. Length = 346 Score = 29.9 bits (67), Expect = 0.014 Identities = 16/96 (16%), Positives = 28/96 (29%), Gaps = 10/96 (10%) Query: 136 YAVILAGWASNSKYAFLGAMR-------AAAQMVSYEISMGFALVLVLMTAGSLNLSEIV 188 Y GW+ F+ A Y+++ + G + Sbjct: 29 YTGAKLGWSQYHDTGFINNNGPTHENQLGAGAFGGYQVNPYVGFEMGYDWLGRMPY---K 85 Query: 189 GSQQHGFFAGHGVNFLSWNWLPLLPVFVIYFISGIA 224 GS ++G + GV + P+ IY G Sbjct: 86 GSVENGAYKAQGVQLTAKLGYPITDDLDIYTRLGGM 121
>CHLAMIDIAOM6#Chlamydia cysteine-rich outer membrane protein 6 signature. Length = 547 Score = 32.4 bits (73), Expect = 0.007 Identities = 15/61 (24%), Positives = 24/61 (39%), Gaps = 3/61 (4%) Query: 562 FNLG-LDPDKAREFHDETLPKDSAKVAHFC--SMCGPHFCSMKITQDVREFAAQQGVSEN 618 F LG + P + R E P + + S CG H + +T + E Q ++ Sbjct: 265 FTLGDMQPGEHRTITVEFCPLKRGRATNIATVSYCGGHKNTASVTTVINEPCVQVSIAGA 324 Query: 619 D 619 D Sbjct: 325 D 325
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 44.8 bits (106), Expect = 3e-07 Identities = 60/266 (22%), Positives = 95/266 (35%), Gaps = 11/266 (4%) Query: 66 YATGMLVLAPLG----DRFDRRTLILLQIAGLSAALVVAAAAPTLGVLAAASLAIGILAT 121 YA AP+ DRF RR ++L+ +AG + + A AP L VL + GI Sbjct: 52 YALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGA 111 Query: 122 IAQQAVPFAAEIAPPAARGQAVGTVMSGLLLGILLARTAAGFVAEYFGWRAVFAASVAAL 181 A + A+I R + G + + G++ G + + FAA AAL Sbjct: 112 TGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAA--AAL 169 Query: 182 AALAAVIVA-RLPRSSPTSTLPYGKLLASMWQLVRELRGLR--EASMTGGAIFAAFSAFW 238 L + LP S P + + R RG+ A M I Sbjct: 170 NGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVP 229 Query: 239 PVLTLLLAGAPFHLGPQAAGL-FGIVGAAGALAAPY-AGRFADKRGPRAIISLAIALIAA 296 L ++ FH G+ G +LA G A + G R + L + Sbjct: 230 AALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGT 289 Query: 297 SFAIFALSGASLIGLVIGVIVLDVGV 322 + + A + + I V++ G+ Sbjct: 290 GYILLAFATRGWMAFPIMVLLASGGI 315
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.030 Identities = 14/37 (37%), Positives = 20/37 (54%), Gaps = 3/37 (8%) Query: 21 RVLEPLDLAIGAGETLVLLGPSGCGKTTTLRLIAGLD 57 RV+EP ++VL G G GK+T + + GLD Sbjct: 587 RVMEP---GCKFDYSVVLEGTGGIGKSTLINTLVGLD 620
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 67.0 bits (163), Expect = 3e-15 Identities = 74/266 (27%), Positives = 119/266 (44%), Gaps = 19/266 (7%) Query: 1 MADHSIKGKTVIIAGGAKNLGGLIARDLAAQGAQAVAIHYNSAASKGAAAETVAAIEAAG 60 M I+GK I G A+ +G +AR LA+QGA A+ YN + + V++++A Sbjct: 1 MNAKGIEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLE----KVVSSLKAEA 56 Query: 61 ARAVALQADLTAAGAVEKLFVDTVAAIGRPDIAINTVGKVLKKPFVEITEAEYDEMAAVN 120 A A AD+ + A++++ +G DI +N G + +++ E++ +VN Sbjct: 57 RHAEAFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVN 116 Query: 121 SKTAFFFLKEAGRHVND--NGKIVTLVTSLLGAFTPFYAAYAGMKAPVEHFTRAAAKEFG 178 S F + +++ D +G IVT+ ++ G AAYA KA FT+ E Sbjct: 117 STGVFNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELA 176 Query: 179 ARGISVTAVGPGPMDTPFFYPAEGADAVAYHKTAAALSPFSKTGL--------TDIGDVV 230 I V PG +T + + A +L F KTG+ +DI D V Sbjct: 177 EYNIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETF-KTGIPLKKLAKPSDIADAV 235 Query: 231 PFIRHLVSD-GWWITGQTILINGGYT 255 F LVS IT + ++GG T Sbjct: 236 LF---LVSGQAGHITMHNLCVDGGAT 258
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 43.9 bits (103), Expect = 2e-06 Identities = 44/288 (15%), Positives = 69/288 (23%), Gaps = 14/288 (4%) Query: 313 PTRRDKAAVKAAEKERVAPLPEPAE-------TAEGAPMKLKTPAAPTPPAA-PVPASSA 364 P+ A E P P PA AE + + KT A + Sbjct: 1008 PSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNRE 1067 Query: 365 APGTSASSAVAAPAAAGSGPAASAPAAPVRHAAPAPASATAA--ASAPTAASAPAPTPAS 422 + S+ A + S A+ A T + P S Sbjct: 1068 VAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTS 1127 Query: 423 APAPASTPAPASAPT--PTPASAPTPASIPAPAPASAPASTPAPASAPAPAPTTSPASSI 480 +P + P P + PT + + A T PA + S Sbjct: 1128 QVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTEST 1187 Query: 481 APTAAPFASAIPPARAEKFA-PAVTATTAGSTSTPASAAAPSSPSSPWLPPLLPPLLSPD 539 P P V + ++ + S P + S Sbjct: 1188 TVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTV 1247 Query: 540 APSPPADTARTAPLAPAASPATAAAAATNATATAGAMQSAPRDDAATN 587 A T A L+ A + A A + Q ++ Sbjct: 1248 ALCDLTSTNTNAVLSDARAKAQFVALNVGKAVSQHISQLE-MNNEGQY 1294 Score = 41.6 bits (97), Expect = 9e-06 Identities = 30/213 (14%), Positives = 54/213 (25%), Gaps = 5/213 (2%) Query: 312 DPTRRDKAAVKAAEKERVAPLPEPAETAEGAPMKLKTPAAPTPPAAPVPASSAAPGTSAS 371 + T +++ K A K V + E A+ +T T A V A + Sbjct: 1060 ETTAQNREVAKEA-KSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEK 1118 Query: 372 SAVAAPAAAGSGPAASAPAAPVRHAAPAPASATAAASAPTAASAPAPTPASAPAPASTPA 431 + + P A PA + + PA ++ Sbjct: 1119 TQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSN 1178 Query: 432 PASAPTPTPASAPTPASIPAPAPASAPASTPAPASAPAPAPTTSPASSIAPTAA---PFA 488 T + + + P + + P S + P S+ P Sbjct: 1179 VEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPAT 1238 Query: 489 SAIPPARAEKFAPAVTATTAGSTSTPASAAAPS 521 ++ + T S A A A Sbjct: 1239 TSSNDRSTVALCDLTSTNTNAVLSD-ARAKAQF 1270 Score = 41.6 bits (97), Expect = 1e-05 Identities = 37/246 (15%), Positives = 73/246 (29%), Gaps = 10/246 (4%) Query: 334 EPAETAEGAPMKLKTPAAPTPPAAPVPASSAAPGTSASSAVAAPAAAGSGPAASAPAAPV 393 P+ + + A PPA P+ + S + A A Sbjct: 1007 VPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATETTAQNR 1066 Query: 394 RHAAPAPASATAAASAPTAASAPAPTPASAPAPASTPAP------ASAPTPTPASAPTPA 447 A A ++ A A + + T + A A T P Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVT 1126 Query: 448 SIPAPAPASAPASTPAPASAPAPAPTTSPASSIAPTAAPFASAIPPARAEKFAPAVTATT 507 S +P + P A PT + + T + P A++ + V Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQP---AKETSSNVEQPV 1183 Query: 508 AGSTSTPASAAAPSSPSSPWLPPLLPPLLSPDAPSPPADTARTAPLAPAASPATAAAAAT 567 ST+ + +P + P P ++ ++ + P + R + + + A ++ Sbjct: 1184 TESTTVNTGNSVVENPENT-TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSN 1242 Query: 568 NATATA 573 + + A Sbjct: 1243 DRSTVA 1248
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 24.5 bits (53), Expect = 0.044 Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 2/51 (3%) Query: 8 AGFHRGARAFGRAPGASVASVASVASGASGASAAS--GASGAAGAAGAAGA 56 A FH+ A + +ASV+SG S A+ S GA +A G Sbjct: 355 AAFHKETGAIDASLTTISTVLASVSSGISAAATTSLVGAPVSALVGAVTGI 405
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 33.6 bits (77), Expect = 0.002 Identities = 13/49 (26%), Positives = 24/49 (48%), Gaps = 3/49 (6%) Query: 566 SPAQYAQVTSMNPDEWRAELALHAELFDKLSARLPDALAETKARIEKRL 614 P + + + S P E + +A L D+ S P+ + E + +EK+L Sbjct: 148 DPQKASFILSSLPTEVQTNVARRIALMDRTS---PEVVREVERVLEKKL 193
>ADHESNFAMILY#Adhesin family signature. Length = 309 Score = 30.2 bits (68), Expect = 0.019 Identities = 28/128 (21%), Positives = 42/128 (32%), Gaps = 19/128 (14%) Query: 388 LKAGEEADARTPAA---LRRGRKLVVQIGE----------TFGEKNAPMFVEQLDALRLA 434 L+ E P A L G I + F EKN + ++LD L Sbjct: 127 LEGQNEKGKEDPHAWLNLENGIIFAKNIAKQLSAKDPNNKEFYEKNLKEYTDKLDKLDKE 186 Query: 435 DKLALDLAPVMVYGDDVTHVVTEEGIANLLMCRDADEREHAIRGVAGYTEIGRGRDRRLV 494 K + P + +VT EG + I + E + + LV Sbjct: 187 SKDKFNKIP-----AEKKLIVTSEGAFKYF-SKAYGVPSAYIWEINTEEEGTPEQIKTLV 240 Query: 495 ERLRERGV 502 E+LR+ V Sbjct: 241 EKLRQTKV 248
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 31.3 bits (71), Expect = 0.004 Identities = 11/64 (17%), Positives = 28/64 (43%), Gaps = 12/64 (18%) Query: 209 VIAAAAGTVVYAGNGLRGYGNLLIVKHDADFLTTYAHNRALLVKEGQTVAQGQKIAEMGD 268 ++A A G + ++G +K + + ++VKEG++V +G + ++ Sbjct: 82 IVATANGKLTHSGR-------SKEIKP---IENSIV--KEIIVKEGESVRKGDVLLKLTA 129 Query: 269 TDND 272 + Sbjct: 130 LGAE 133
>PF07675#Cleaved Adhesin Length = 1358 Score = 31.2 bits (70), Expect = 0.011 Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 3/46 (6%) Query: 376 SYNVYRNGNKVGSS-TSTAYTDAGLIAGTAYSYTVTEIDPSLGESA 420 +Y +YRN ++ S T T Y D L G Y+Y V ++ GESA Sbjct: 1260 TYTIYRNNTQIASGVTETTYRDPDLATGF-YTYGV-KVVYPNGESA 1303
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 78.7 bits (194), Expect = 9e-19 Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 1/117 (0%) Query: 1 MSAARKVLLVEDDEAQANWAKLVLTRGRFDVTHCQTGGQAIRAMTKEVPDAVVLDMRLPD 60 M+ A +L+ +DD A L+R +DV R + D VV D+ +PD Sbjct: 1 MTGA-TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPD 59 Query: 61 VHGLEVLVWIRRNFFDVPVIVLSNAMQEMQIVEAFSAGADDYVLKPAREAEFLARIA 117 + ++L I++ D+PV+V+S M ++A GA DY+ KP E + I Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIG 116
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 79.9 bits (197), Expect = 2e-19 Identities = 35/135 (25%), Positives = 58/135 (42%), Gaps = 1/135 (0%) Query: 4 IYLIEDDEVQARCYAAILQHAGYSVRVLPDGERALREIQRAAPDLIVLDRRLPDIDGLEI 63 I + +DD L AGY VR+ + R I DL+V D +PD + ++ Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDL 65 Query: 64 IAWVRERCAPLPILVLTNAVLETDLVEALEAGADDYLIKPPREREFVARV-NALRRRASI 122 + +++ LP+LV++ ++A E GA DYL KP E + + AL Sbjct: 66 LPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRR 125 Query: 123 SKQFEGTIEIGGYRI 137 + E + G + Sbjct: 126 PSKLEDDSQDGMPLV 140
>PF03895#Serum resistance protein DsrA. Length = 79 Score = 40.6 bits (95), Expect = 2e-06 Identities = 21/77 (27%), Positives = 40/77 (51%) Query: 1046 VARAAYGGIAAATALTMIPEVDKDKTIAVGIGGGTYRGYQAVALGATARITENIKVRAGV 1105 +++ G+A +AL+M+ + + +V G YR A+A+G +RIT+ +AGV Sbjct: 1 LSKELQTGLANQSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITDRFTAKAGV 60 Query: 1106 GMSSGGTTAGIGASMQW 1122 ++ GAS+ + Sbjct: 61 AFNTYNGGMSYGASVGY 77
>OUTRMMBRANEA#Outer membrane protein A signature. Length = 346 Score = 126 bits (317), Expect = 8e-37 Identities = 67/151 (44%), Positives = 95/151 (62%), Gaps = 10/151 (6%) Query: 87 FQCGEPAQPVAQQPQPAPAAAPAAEPIRLNADAMFAFDRADAASMTEQGRQQLSQLAQRL 146 F GE A VA P PAPA + L +D +F F++A + +G+ L QL +L Sbjct: 191 FGQGEAAPVVA--PAPAPAPEVQTKHFTLKSDVLFNFNKAT---LKPEGQAALDQLYSQL 245 Query: 147 TDRHAQTVSIV--GYTDRLGSDAYNRQLSQARAKTVGDYLIAAGVPADSVHAEGRGASDP 204 ++ + S+V GYTDR+GSDAYN+ LS+ RA++V DYLI+ G+PAD + A G G S+P Sbjct: 246 SNLDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNP 305 Query: 205 LV--QCDQ-RERAALIACLSPNRRVEVVAAG 232 + CD ++RAALI CL+P+RRVE+ G Sbjct: 306 VTGNTCDNVKQRAALIDCLAPDRRVEIEVKG 336
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 38.7 bits (90), Expect = 3e-05 Identities = 30/169 (17%), Positives = 70/169 (41%), Gaps = 5/169 (2%) Query: 1 MFSLVIPALLTAWGIGKGQAGLIGGATLAAGAIGGLLAGMIADRFGRVRALQITVCWFSL 60 + ++ +P + + + A + +IG + G ++D+ G R L + Sbjct: 32 VLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCF 91 Query: 61 FTFLSAFAQNFEQLLVL-KTLQGLGFGGEWTAGAVLLSETIRARHRGKAMGIVQSAWGFG 119 + + +F LL++ + +QG G V+++ I +RGKA G++ S G Sbjct: 92 GSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMG 151 Query: 120 WGGAVLLYTLVFSWLPPEWAWRVLFAIGVLPALLVLYIRRAIPEPPRDD 168 G + ++ ++ W L I ++ + V ++ + + + R Sbjct: 152 EGVGPAIGGMIAHYI----HWSYLLLIPMITIITVPFLMKLLKKEVRIK 196
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 62.9 bits (153), Expect = 2e-12 Identities = 30/122 (24%), Positives = 50/122 (40%), Gaps = 10/122 (8%) Query: 401 RVLVVDDQEMNRIVLRYQLDALGHHARLCASGDEALRALGTAAYDVVLTDCRMPGMDGIA 460 +LV DD R VL L G+ R+ ++ R + D+V+TD MP + Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 461 LTAAIRAH-PDARVRATPIVGVTALVSDAEHARCVDAGMTLCIGKP----TTLDALERAL 515 L I+ PD P++ ++A + + + G + KP + + RAL Sbjct: 65 LLPRIKKARPD-----LPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119 Query: 516 VE 517 E Sbjct: 120 AE 121
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 27.6 bits (61), Expect = 0.042 Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 2/34 (5%) Query: 190 VDIREEALHELIDRLDDLASEFHSAF--LHEAGK 221 + R + L + + L LA F AF H+AG Sbjct: 283 LTFRSQDLDQTRNTLGQLALAFAEAFNTQHKAGF 316
>RTXTOXINA#Gram-negative bacterial RTX toxin determinant A family signature. Length = 1024 Score = 27.2 bits (60), Expect = 0.003 Identities = 14/45 (31%), Positives = 21/45 (46%), Gaps = 3/45 (6%) Query: 7 DSLLERLRRPRGASRVSLCG-GAPLAATASAAAVAASAAARAVAA 50 DSLL + GA SL LA+ + + ++A+A V A Sbjct: 351 DSLLAAFHKETGAIDASLTTISTVLASVS--SGISAAATTSLVGA 393
>PERTACTIN#Pertactin signature. Length = 922 Score = 28.1 bits (62), Expect = 0.012 Identities = 16/41 (39%), Positives = 22/41 (53%) Query: 38 KRLWFYRKPACAEKAWGQWFEQAQQSGIAALQKFAQRLQGY 78 KRL R A AWG+ F Q QQ A ++F Q++ G+ Sbjct: 647 KRLGELRLNPDAGGAWGRGFAQRQQLDNRAGRRFDQKVAGF 687
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.0 bits (70), Expect = 0.010 Identities = 24/111 (21%), Positives = 44/111 (39%), Gaps = 11/111 (9%) Query: 268 LVNTAGMHAKTASNVMTAALFVYMLMQPVFGALSDKIGRR----MSMILFGTGAVIGTVP 323 + N + + V TA + + + V+G LSD++G + +I+ G+VIG V Sbjct: 40 IANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIGFVG 99 Query: 324 LMHALGGVTSPLVAFGLIVVALAIVSFYTSISGLIKAEMFPPEVRAMGVGL 374 L+ + +F ++ ++ A P E R GL Sbjct: 100 HSFF------SLLIMARFIQGAGAAAF-PALVMVVVARYIPKENRGKAFGL 143
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 30.2 bits (68), Expect = 0.003 Identities = 31/145 (21%), Positives = 50/145 (34%), Gaps = 12/145 (8%) Query: 7 LVASWTLASLALADLRTRRLA---TFAVALVGALYAALALVGAPGDGGFASHAALGAAA- 62 L+ +W L +L DL L T + G L+ L + GD + A Sbjct: 138 LLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNLLGGFVSLGDAVIGAMAGYLVLWS 197 Query: 63 -FALGAAMFRAGWIAGGDVKLAAVVFLWAGPAHAWPVAFAIGVGGLAVGAVCIAAGRAPR 121 + + + GD KL A + W G V + G +G I Sbjct: 198 LYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLILLRNH-- 255 Query: 122 VLAWFAPARGVPYGVALAAGGLLAV 146 ++ +P+G LA G +A+ Sbjct: 256 -----HQSKPIPFGPYLAIAGWIAL 275
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 144 bits (364), Expect = 2e-39 Identities = 68/283 (24%), Positives = 116/283 (40%), Gaps = 16/283 (5%) Query: 170 VVQTLKPYLRQQEALVNRLTLARPIQVHLRVRITEVDRNITQQLGINWSALGA------- 222 +V + E ++ +L + RP QV + I EV LGI W+ A Sbjct: 322 IVTAAPDVMNDLERVIAQLDIRRP-QVLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTN 380 Query: 223 SGNFVGGLFNGRTLFDTASKAFDLSPSGAFSVVGGFHTSRYSIDG--VLDALDQEGLITM 280 SG + G ++ S A S G Y + +L AL + Sbjct: 381 SGLPISTAIAGANQYNKDGTVSSSLAS-ALSSFNGIAAGFYQGNWAMLLTALSSSTKNDI 439 Query: 281 LAEPNLTAISGQTASFLAGGEFPIPVAQDTTGA----ITIQFKPYGVSLDFTPTVLADNR 336 LA P++ + A+F G E P+ TT T++ K G+ L P + + Sbjct: 440 LATPSIVTLDNMEATFNVGQEVPVLTGSQTTSGDNIFNTVERKTVGIKLKVKPQINEGDS 499 Query: 337 ISLKVRPEVSEIDPTNSVTTGSIKVPALTVRRVDTTVELSSGQSFAIGGLLQSKSSDVLA 396 + L++ EVS + S T+ + R V+ V + SG++ +GGLL SD Sbjct: 500 VLLEIEQEVSSVADAASSTSSDLGA-TFNTRTVNNAVLVGSGETVVVGGLLDKSVSDTAD 558 Query: 397 ELPGLARLPVLGKLFSSRNYLNDKTEVVVIVTPYIVQPANPGE 439 ++P L +PV+G LF S + K +++ + P +++ + Sbjct: 559 KVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPTVIRDRDEYR 601
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 33.7 bits (77), Expect = 0.001 Identities = 29/165 (17%), Positives = 52/165 (31%), Gaps = 20/165 (12%) Query: 22 GARLVAIVADAASDEVIRNLIADQAMTGAQVARGGIDDAIALMRDLSHGPQHLLVDVSGA 81 GA ++ DAA V+ ++ + + ++ DV Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYD--VRITSNAATLWRWIA--AGDGDLVVTDVV-- 56 Query: 82 AMP----LSDLARLADVCDPSVNVIVIGERNDVGLFRSMLRIGVRDYLVKPL----TVEL 133 MP L R+ P + V+V+ +N G DYL KP + + Sbjct: 57 -MPDENAFDLLPRIKKA-RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGI 114 Query: 134 VHRALSAADPNAAARAGKAIGFVGARGGVGVTSIAVALARHLADR 178 + RAL+ + + + G S A+ + R Sbjct: 115 IGRALAEPKRRPSKLEDDSQDGMPLVG----RSAAMQEIYRVLAR 155
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.7 bits (66), Expect = 0.032 Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 4/50 (8%) Query: 299 IVISGGTGSGKTTLLNAL---SHFIDSHERIVTIEDAAELQLQQPHVVSL 345 +V+ G G GK+TL+N L F D+H I T +D+ E Q+ L Sbjct: 599 VVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYE-QIAGIVAYEL 647
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 31.7 bits (71), Expect = 0.004 Identities = 30/132 (22%), Positives = 49/132 (37%), Gaps = 4/132 (3%) Query: 40 RVAAARNELQNAADAAALAGAASLEAGAGAPAWAAAASAAAAALSLNASDGAALSSGDVQ 99 A A+ + + A A AA+ A PA + + AA + + GAA + + Sbjct: 226 AAAEAKRKAEEQARQQAAIRAANTYA---MPANGSVVATAAGRGLIQVAQGAASLAQAIS 282 Query: 100 TGYWNVTGVPAGLEPTTLAPGEYDVPAVQATVTRAPNQNGGPLSLLMGGLLGLVGTPAAA 159 V G P+ +A G + T + +Q + +G +G P + Sbjct: 283 DAI-AVLGRVLASAPSVMAVGFASLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSV 341 Query: 160 TAVAVAGAPATV 171 AVA A TV Sbjct: 342 NLNAVAKASGTV 353
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 122 bits (308), Expect = 4e-36 Identities = 81/252 (32%), Positives = 118/252 (46%), Gaps = 15/252 (5%) Query: 9 GRSFLVTGASSGIGRAAAVALRGGGARVVAAARNARELERLAHETGC-----EPLELDVG 63 G+ +TGA+ GIG A A L GA + A N +LE++ E DV Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 64 CDASVRAALSG-ERMRDAFDGLINCAGVTSLAAAIDTTADEFDRVMAVNARGAMLVARHV 122 A++ + ER D L+N AGV + +E++ +VN+ G +R V Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 123 ARAMIRAGRGGSIVNVSSQAALVALPSHLAYCASKAALDAMTRVLCVELGPHGIRVNSVN 182 ++ M R GSIV V S A V S AY +SKAA T+ L +EL + IR N V+ Sbjct: 128 SKYM-MDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 183 PTVTLTPMAERAWSDPHASGPMLA--------AIPLGRFARVADVVAPILFLSSDAAAMV 234 P T T M W+D + + ++ IPL + A+ +D+ +LFL S A + Sbjct: 187 PGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHI 246 Query: 235 SGVALPVDGGYT 246 + L VDGG T Sbjct: 247 TMHNLCVDGGAT 258
>PF05272#Virulence-associated E family protein Length = 892 Score = 28.5 bits (63), Expect = 0.040 Identities = 12/23 (52%), Positives = 13/23 (56%) Query: 36 VTALCGPNGCGKSTLLRTLAGLQ 58 L G G GKSTL+ TL GL Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLD 620
>2FE2SRDCTASE#Ferric iron reductase signature. Length = 262 Score = 55.8 bits (134), Expect = 2e-11 Identities = 50/186 (26%), Positives = 72/186 (38%), Gaps = 24/186 (12%) Query: 78 RALVSQWSKYYFNLAASAGFAAALLLGRPLDMAPQRMRVALRGGMPVALLFEADALRPAQ 137 + L+S W+++Y L A L + LD++P+ VA F D Sbjct: 89 KPLISLWAQWYIGLMVPPLMLALLTQEKALDVSPEHFHAEFHETGRVA-CFWVDVCEDKN 147 Query: 138 AEPAS---RYAALVDH-LRATIDTLAVLAKLSPRVLWANAGNLLD-YLFEQCAHAPRAGA 192 A P S R L+ L + L +++ +++W+N G L++ YL E G Sbjct: 148 ATPHSPQHRMETLISQALVPVVQALEATGEINGKLIWSNTGYLINWYLTEM---KQLLGE 204 Query: 193 DA------AWLFGPVDSRGEANPLRLPVRRVKPCSARLPDPFRARRVCCLRNEIPGEDQL 246 A F + GE NPL V L D RR CC R +P Q Sbjct: 205 ATVESLRHALFFEKTLTNGEDNPLWRTV--------VLRDGLLVRRTCCQRYRLPDVQQ- 255 Query: 247 CGSCPL 252 CG C L Sbjct: 256 CGDCTL 261
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 112 bits (282), Expect = 1e-30 Identities = 77/264 (29%), Positives = 112/264 (42%), Gaps = 15/264 (5%) Query: 115 PARIVVLEFMFAEDLAALDITPVGMADPAYYPIWIGYDDARFARVSDVGTRQEPSLEAIA 174 P RIV LE++ E L AL I P G+AD Y +W+ + V DVG R EP+LE + Sbjct: 35 PNRIVALEWLPVELLLALGIVPYGVADTINYRLWVS-EPPLPDSVIDVGLRTEPNLELLT 93 Query: 175 AAKPDLILGVGLRHAPIFDALSRIAPTVLFKYSPNYIEDGRQVTQYDWACAILRTIGCLT 234 KP ++ + P + L+RIAP F +S DG+Q A L + L Sbjct: 94 EMKPSFMVW-SAGYGPSPEMLARIAPGRGFNFS-----DGKQ--PLAMARKSLTEMADLL 145 Query: 235 GRARDARAVQARVDAGLARDARRIAAAGRAGERVAWLQELGLPDRYWAFTGNSASAGIAR 294 A A+ + + R + G R L L P F NS I Sbjct: 146 NLQSAAETHLAQYEDFIRSMKPRFV---KRGARPLLLTTLIDPRHMLVFGPNSLFQEILD 202 Query: 295 ALGLE-PWPGEPTREGTAYVTSEDLLKQPDLAVLFVSATEPGVPLDAKLDSSIWRFVPAR 353 G+ W GE G+ V+ + L D+ VL +DA + + +W+ +P Sbjct: 203 EYGIPNAWQGETNFWGSTAVSIDRLAAYKDVDVLCFDHDNSKD-MDALMATPLWQAMPFV 261 Query: 354 RAGRVALVERNIWGFGGPMSALRL 377 RAGR V +W +G +SA+ Sbjct: 262 RAGRFQRVP-AVWFYGATLSAMHF 284
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 44.4 bits (105), Expect = 6e-07 Identities = 39/176 (22%), Positives = 64/176 (36%), Gaps = 14/176 (7%) Query: 17 DRALPAAYPFSALIGQ-AALQQALLLVA-VDPGLGGVLVSGPRGTAKSTAARALAELLP- 73 + + L+G+ AA+Q+ ++A + ++++G GT K ARAL + Sbjct: 127 SKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHDYGKR 186 Query: 74 -EGRFVTLPLSASDEQVTGSLDLASALADNT--VRFSPGLVARAHLGVLYVDEINLLPDA 130 G FV + ++A + S T S G +A G L++DEI +P Sbjct: 187 RNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDMPMD 246 Query: 131 LVDALLDAAASGVNTVERDGVSHSHAARFALVGTMNP------EEGELRPQLLDRF 180 LL G G + +V N +G R L R Sbjct: 247 AQTRLLRVLQQG--EYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRL 300
>OMADHESIN#Yersinia outer membrane adhesin signature. Length = 455 Score = 29.5 bits (65), Expect = 0.026 Identities = 25/63 (39%), Positives = 28/63 (44%) Query: 147 ADGATPAAIAGALVARGFGPSAMSVFEHLGGPLERRLDARADAWRDARAAALNVVAIECR 206 A GAT A GA VA G G A V GPL + L A + A A + VAI R Sbjct: 74 AIGATAEAAKGAAVAVGAGSIATGVNSVAIGPLSKALGDSAVTYGAASTAQKDGVAIGAR 133 Query: 207 ACA 209 A Sbjct: 134 AST 136
>ARGDEIMINASE#Bacterial arginine deiminase signature. Length = 409 Score = 513 bits (1324), Expect = 0.0 Identities = 130/423 (30%), Positives = 227/423 (53%), Gaps = 21/423 (4%) Query: 8 MSQAIPQVGVHSEVGKLRKVLVCSPGLAHQRLTPSNCDELLFDDVMWVNQAKRDHFDFVS 67 M + + + + SE+G+L+KVL+ PG + LTP LFDD+ ++ A+++H F S Sbjct: 1 MEEYLNPINIFSEIGRLKKVLLHRPGEELENLTPFIMKNFLFDDIPYLEVARQEHEVFAS 60 Query: 68 KMRERGVEVLEMHNLLTETVQNPAALK------WILDRKITPDNVGIGLVDEVRAWLEGL 121 ++ VE+ + +L++E + + AL+ +IL+ +I D ++ ++ + L Sbjct: 61 ILKNNLVEIEYIEDLISEVLVSSVALENKFISQFILEAEIKTDFT----INLLKDYFSSL 116 Query: 122 EPRALAEFLIGGVAASDIAGAERSKVLTLFRDYLGKSSFVLPPLPNMMFTRDTSCWIYGG 181 + +I GV ++ S + G + F++ P+PN++FTRD I G Sbjct: 117 TIDNMISKMISGVVTEELKNYTSSLDDLV----NGANLFIIDPMPNVLFTRDPFASIGNG 172 Query: 182 VTLNPMHWPARRQETLLVAAVYKFHPAFTDAKFDVWYGDPDRDHGMATLEGGDVMPIGRG 241 VT+N M R++ET+ ++K+HP + +W + A+LEGGD + + +G Sbjct: 173 VTINKMFTKVRQRETIFAEYIFKYHPVYK-ENVPIWLNRWE----EASLEGGDELVLNKG 227 Query: 242 VVLVGMGERTSRQAVGQLAQALFA-KGAAERVIVAGLPNSRASMHLDTVFSFCDRDLVTV 300 ++++G+ ERT ++V +LA +LF K + + ++ +P +R+ MHLDTVF+ D + T Sbjct: 228 LLVIGISERTEAKSVEKLAISLFKNKTSFDTILAFQIPKNRSYMHLDTVFTQIDYSVFTS 287 Query: 301 FPEVVNRIVPFTLRPGGDARYGIDIEREDKPFVDVVAQALGLKSLRVVETGGNDFAAERE 360 F + L + I I++E DV++ LG K + GG+ RE Sbjct: 288 FTSDDMYFSIYVLTYNPSSSK-IHIKKEKARIKDVLSFYLGRKIDIIKCAGGDLIHGARE 346 Query: 361 QWDDGNNMVCIEPGVVVGYDRNTYTNTLLRKAGVEVITIGSSELGRGRGGGHCMTCPVLR 420 QW+DG N++ I PG ++ Y RN TN L + G++V I SSEL RGRGG CM+ P++R Sbjct: 347 QWNDGANVLAIAPGEIIAYSRNHVTNKLFEENGIKVHRIPSSELSRGRGGPRCMSMPLIR 406 Query: 421 DPV 423 + + Sbjct: 407 EDI 409
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 93.4 bits (232), Expect = 1e-22 Identities = 74/403 (18%), Positives = 153/403 (37%), Gaps = 16/403 (3%) Query: 18 FMQNLDSTVVATALPSMARELGVNVVFLSSAITSYLVALTVFIPVSGWIAERFGAKRVFI 77 F L+ V+ +LP +A + + T++++ ++ V G ++++ G KR+ + Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 78 AAIAIFTAASVMCAAANGLAT-LVAARILQGAGGALMVPVGRLILYRGVSRHEMLAATTW 136 I I SV+ + + L+ AR +QGAG A + +++ R + + A Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 137 LTMPALVGPLLGPPLGGFLTDALSWRAVFWINVPVGVAGAALAARLVPASAGERRAPADA 196 + +G +GP +GG + + W + + +P+ + + D Sbjct: 144 IGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPMITIITVPFLMKLLKKEVRIKGHFDI 201 Query: 197 RGMLLVGAALAALMLGVETAGRGVLPAGAPALCLGAGVALGGLAIRHCRRVAHPAVDLSL 256 +G++L+ + ML + L + + ++H R+V P VD L Sbjct: 202 KGIILMSVGIVFFMLFTTSYSISFLIVSVLSF---------LIFVKHIRKVTDPFVDPGL 252 Query: 257 L-GIPTFHAATIAGSLFRAGAGALPFLVPLTLQVGFGASASRSGAITLASA-LGSLVMRP 314 IP G +F AG + +VP ++ S + G++ + + ++ Sbjct: 253 GKNIPFMIGVLCGGIIFGTVAGFV-SMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIFGY 311 Query: 315 MTHAALHRAPMRTVLIAGSVSFAAVLAACATLSPAWPDAAVFALLLVGGLSRSLSFASLG 374 + + R VL G + + L ++ V G S + + Sbjct: 312 IGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTVIS 370 Query: 375 ALVFSDVPSERLSAATSFQGTAQQLMRAVGVAVAAGALHLAML 417 +V S + + A S L G+A+ G L + +L Sbjct: 371 TIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLSIPLL 413
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 30.4 bits (68), Expect = 0.024 Identities = 24/171 (14%), Positives = 47/171 (27%), Gaps = 5/171 (2%) Query: 404 ASEVRSLAQRSSSAAKEIKDLINASVQKIHDGSALAGEAGKTMTEVTQAVARVTDIMGEI 463 + + S++ D S + + ++ V + E Sbjct: 1002 NIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETVAENSKQESKTVEKNEQDATET 1061 Query: 464 AAASGEQSRGIEQVNQAIAQMDEVTQQNAALVEEAAAASKSLEEQGRHLTQAVSFFRASA 523 A + E ++ + +A Q +EV Q + E +K + + Sbjct: 1062 TAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKA-----KVET 1116 Query: 524 ASAAPQARHAAPAKPKAKRGVAAPASAPRAAHAAPTFNKPAPALAAAATAS 574 + + PK ++ A A PT N P TA Sbjct: 1117 EKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTAD 1167
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 92.6 bits (230), Expect = 5e-23 Identities = 88/379 (23%), Positives = 140/379 (36%), Gaps = 62/379 (16%) Query: 32 ASTAHAQSSVVLYGLIDTSITYANNQRTHGAGSPGSPGWAVTSGALNASRWGLRGREDLG 91 A A + V LYG I + + + +GA + T S+ G +G+EDLG Sbjct: 12 ALPVAAMADVTLYGTIKAGVETSRSVAHNGAQAASVE--TGTGIVDLGSKIGFKGQEDLG 69 Query: 92 DGVSAIFALENGFSGASGALSQKGVDMFGRQAWIGLKSKEGGALTLGRQYDLILDF--VT 149 +G+ AI+ +E AS A + G RQ++IGLK G L +GR ++ D + Sbjct: 70 NGLKAIWQVE---QKASIAGTDSG--WGNRQSFIGLK-GGFGKLRVGRLNSVLKDTGDIN 123 Query: 150 PLGASGPGWGGNLAVHPYDNDDSNRNIRINNAVKYTSPTYRGWTLGAMYGFSNTAGPFGN 209 P + G N P R I +V+Y SP + G + Y ++ AG N Sbjct: 124 PWDSKSDYLGVNKIAEP-----EARLI----SVRYDSPEFAGLSGSVQYALNDNAG-RHN 173 Query: 210 NAAWSAGLSYANGPLKLGAGYLRINRNPNAANANGALSTTDGSATITGGSQQIWAVAGRY 269 + ++ AG +Y NG + G + QI + Y Sbjct: 174 SESYHAGFNYKNGGFFVQYGGAYKRH-------------HQVQENVNIEKYQIHRLVSGY 220 Query: 270 -AFGPHSIGAAWSHSATDRVSGVLQGGSIAKLDGKSLVFDNFTLDGRYVVTPRLSLAAAY 328 ++ A A + F N VTPR+S A + Sbjct: 221 DNDALYASVAVQQQDAKLVEENYSHNSQTEVAATLAYRFGN--------VTPRVSYAHGF 272 Query: 329 TYTMGRFDARSGETRPKWNHMVAQADYAFSIRTDAYLEAVYQRVSGGNGIPAFNATIWTL 388 + + + ++ +V A+Y FS RT A + A + + G G F +T Sbjct: 273 KGSFDATNYNN-----DYDQVVVGAEYDFSKRTSALVSAGWLQ--EGKGESKFVSTA--- 322 Query: 389 TPSANGNQVVVALGLRHRF 407 +GLRH+F Sbjct: 323 ----------GGVGLRHKF 331
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 120 bits (301), Expect = 5e-35 Identities = 77/261 (29%), Positives = 125/261 (47%), Gaps = 16/261 (6%) Query: 7 LEGKVALITGASSGLGQRFAQVLSQAGAKVVLASRRVERLKELRAEIEAAGGAAHVVSLD 66 +EGK+A ITGA+ G+G+ A+ L+ GA + E+L+++ + ++A A D Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPAD 65 Query: 67 VTDYQSIRAAVAHAETEAGTIDILVNNSGVSTMQKLVDVSPADFEYVFDTNTRGAFFVAQ 126 V D +I A E E G IDILVN +GV + +S ++E F N+ G F ++ Sbjct: 66 VRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASR 125 Query: 127 EVAKRMMMRAGSGNAKPACRIINIASVAGLRPFSQIGLYAMSKAAVVHMTRAMALEWGRH 186 V+K MM R I+ + S P + + YA SKAA V T+ + LE + Sbjct: 126 SVSKYMMDRRSGS-------IVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEY 178 Query: 187 GINVNAICPGYIDTEINHYLWETEQGQ---------KLQSMLPRRRVGKPQDLDGLLLLL 237 I N + PG +T++ LW E G ++ +P +++ KP D+ +L L Sbjct: 179 NIRCNIVSPGSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFL 238 Query: 238 AADESQFINGSIVSADDGLGL 258 + ++ I + D G L Sbjct: 239 VSGQAGHITMHNLCVDGGATL 259
>DNABINDINGHU#Prokaryotic integration host factor signature. Length = 91 Score = 119 bits (301), Expect = 1e-38 Identities = 35/89 (39%), Positives = 53/89 (59%) Query: 37 TKAELAELLFDSVGLNKREAKDMVEAFFEVIRDALENGESVKLSGFGNFQLRDKPQRPGR 96 K +L + ++ L K+++ V+A F + L GE V+L GFGNF++R++ R GR Sbjct: 3 NKQDLIAKVAEATELTKKDSAAAVDAVFSAVSSYLAKGEKVQLIGFGNFEVRERAARKGR 62 Query: 97 NPKTGEAIPIAARRVVTFHASQKLKALVE 125 NP+TGE I I A +V F A + LK V+ Sbjct: 63 NPQTGEEIKIKASKVPAFKAGKALKDAVK 91
>PF00577#Outer membrane usher protein FimD Length = 878 Score = 30.6 bits (69), Expect = 0.024 Identities = 32/179 (17%), Positives = 58/179 (32%), Gaps = 36/179 (20%) Query: 480 APWDAMSDLFNRHLLDYSPRSLNDLKLSADGGALRVRGGIKLWNQVPPGVWLPADMKGSL 539 AP + FN L P+++ DL +G ++PPG + D+ + Sbjct: 40 APLSSAELYFNPRFLADDPQAVADLSRFENG------------QELPPGTY-RVDIYLNN 86 Query: 540 TLLDERHLAFTPTQVSVLGIP--QAKLLRALGIELSSLAPLKRRGAELRGDSLVLDQYTV 597 + R + F +P L ++G+ +S++ + + L Sbjct: 87 GYMATRDVTFNTGDSEQGIVPCLTRAQLASMGLNTASVSGMNLLA---DDACVPLTSM-- 141 Query: 598 FPPPVLIGHMSQATVEPDG----LRLTFRPAPNAPVLRPPANLPGSYLWLEGGDTKMFN 652 + AT + D L LT P A + LW G + + N Sbjct: 142 ---------IHDATAQLDVGQQRLNLTI---PQAFMSNRARGYIPPELWDPGINAGLLN 188
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 30.5 bits (68), Expect = 0.009 Identities = 28/86 (32%), Positives = 37/86 (43%), Gaps = 3/86 (3%) Query: 214 LMNQLKLAPAVRAEIRNDATRIAAAARARQRA-LARPGAPGAAASAGATLAASAAGSNGG 272 MN L A A + R AAA A+++A AA A T A A GS Sbjct: 203 RMNTLTAAKASIEAAAANKAREQAAAEAKRKAEEQARQQ--AAIRAANTYAMPANGSVVA 260 Query: 273 AAAGKGAVAGSGASAPGAAATATAAA 298 AAG+G + + +A A A + A A Sbjct: 261 TAAGRGLIQVAQGAASLAQAISDAIA 286
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 38.3 bits (89), Expect = 5e-05 Identities = 11/63 (17%), Positives = 26/63 (41%) Query: 79 AALRVSHPGLPIVALGSLGEPESALAALRAGVRDFIDFSAPAEDALRITRGLLDHVGDQP 138 ++ + P LP++ + + +A+ A G D++ + + I L +P Sbjct: 67 PRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKRRP 126 Query: 139 SRH 141 S+ Sbjct: 127 SKL 129
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 138 bits (348), Expect = 2e-37 Identities = 58/249 (23%), Positives = 111/249 (44%), Gaps = 11/249 (4%) Query: 151 VQVDVRVVEFSRSVLKQAGLNFFKQNNGFTFGSFAPAGLASVTGGG----TSSMSVSANI 206 V V+ + E + G+ + +N G T + + +++ G S+ Sbjct: 347 VLVEAIIAEVQDADGLNLGIQWANKNAGMTQFTNSGLPISTAIAGANQYNKDGTVSSSLA 406 Query: 207 PIASAFN-LVVGSATRGLFADLSILEANNLARVLAQPTLVALSGQSASFLAGGEIPVPVP 265 S+FN + G L+ L ++ +LA P++V L A+F G E+PV Sbjct: 407 SALSSFNGIAAGFYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTG 466 Query: 266 QSLGT-----ISIDWKPYGVGLTLTPTVLSPRRIALKVAPESSQLDFVHSITINGVTVPA 320 + +++ K G+ L + P + + L++ E S + S + + Sbjct: 467 SQTTSGDNIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSVADAAS-STSSDLGAT 525 Query: 321 LTTRRADTTVELGDGESFAIGGLIDRETTSNVDKVPFLGDLPIIGTFFKHLSYQQNDKEL 380 TR + V +G GE+ +GGL+D+ + DKVP LGD+P+IG F+ S + + + L Sbjct: 526 FNTRTVNNAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNL 585 Query: 381 VIIVTPHLV 389 ++ + P ++ Sbjct: 586 MLFIRPTVI 594
>PREPILNPTASE#Type IV prepilin cysteine protease (C20) family signature. Length = 290 Score = 53.3 bits (128), Expect = 4e-11 Identities = 31/124 (25%), Positives = 52/124 (41%), Gaps = 10/124 (8%) Query: 4 LFSIGFFFAWAAAVAIADCRDRRIPNELVLVGLAAVIIFTVCRQNPFGTTLSGALIGGAV 63 + A+ D +P++L L L ++F + F +L A+IG Sbjct: 134 TLAALLLTWVLVALTFIDLDKMLLPDQLTLPLLWGGLLFNL--LGGF-VSLGDAVIGAMA 190 Query: 64 GLVSLFPFFAL-------RVMGAADVKVFAVLGAWCGLSALPRLWVVASVAAGVHALALM 116 G + L+ + MG D K+ A LGAW G ALP + +++S+ + L+ Sbjct: 191 GYLVLWSLYWAFKLLTGKEGMGYGDFKLLAALGAWLGWQALPIVLLLSSLVGAFMGIGLI 250 Query: 117 LLTR 120 LL Sbjct: 251 LLRN 254
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 26.6 bits (59), Expect = 0.046 Identities = 11/33 (33%), Positives = 17/33 (51%), Gaps = 5/33 (15%) Query: 121 LKMIGEHGVQVALHTDVV-----VDVTVNVIGD 148 L + E+ VQV +HTD + V+ T+ I Sbjct: 235 LSVADEYDVQVMIHTDTLNESGFVEDTIAAIKG 267
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 29.9 bits (67), Expect = 0.003 Identities = 11/56 (19%), Positives = 21/56 (37%) Query: 88 IYLDEAARGSGLGSRLLEAALAKAPALGVHTALGFIFGHNEPSLRLFARYGFTTWG 143 I + + R G+G+ LL A+ A + N + +A++ F Sbjct: 95 IAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 37.2 bits (86), Expect = 6e-06 Identities = 17/59 (28%), Positives = 25/59 (42%), Gaps = 3/59 (5%) Query: 74 IGRVSVLADARGRGVGSRLLDALLAEARGRGDALVRLYAQQR---AVAFYLRIGFRIVG 129 I ++V D R +GVG+ LL + A+ + L Q A FY + F I Sbjct: 92 IEDIAVAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGA 150
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 30.1 bits (67), Expect = 0.024 Identities = 53/222 (23%), Positives = 73/222 (32%), Gaps = 21/222 (9%) Query: 84 DALVAAAELRRLGFAADAWMPIEVKPDDARWALERARAANVPIDEAAPESFDGYGWLVDG 143 + L AA AA A E +A+ E I A + G +V Sbjct: 205 NTLTAAKASIE---AAAANKAREQAAAEAKRKAEEQARQQAAIRAANTYAMPANGSVVAT 261 Query: 144 LFGIGLARPLDGAFAAIAQRIAARARHTGRVLALDVPSGLDSDTGARVGGGTAVTATCTL 203 G GL + GA A++AQ I+ GRVLA S G ++T + Sbjct: 262 AAGRGLIQVAQGA-ASLAQAISDAIAVLGRVLA--------SAPSVMAVGFASLTYSSRT 312 Query: 204 SFIAAKPGLYTGDGRDLAGEIHVAPLDLGEPPAPAIRLNAPELFEAR--LPERAFASHKG 261 + T D A + A L P++ LNA LP R +G Sbjct: 313 AEQWQD---QTPDSVRYALGMDAAKLG----LPPSVNLNAVAKASGTVDLPMRLTNEARG 365 Query: 262 TYGSLGIVGGDTGMCGAPILAARAALFAGAGKVHVGFVGTGA 303 +L +V D + AA A G V T A Sbjct: 366 NTTTLSVVSTDGVSVPKAVPVRMAAYNATTGLYEVTVPSTTA 407
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 40.4 bits (94), Expect = 3e-05 Identities = 35/192 (18%), Positives = 73/192 (38%), Gaps = 15/192 (7%) Query: 92 KVLVEGLQRAQALSIEEQETQFSCEVMPLEPDHADSAETEALRRAIVSQFDQYVKLNKKI 151 L + L+ A S + + E A A+ E ++ K + Sbjct: 193 AELEKALEGAMNFSTADSAKIKTLEAE-KAALAARKADLEKALEGAMNFSTADSAKIKTL 251 Query: 152 PPEILTSLSGIDEAGRLADTIAAHLPLKLDQKQHILEMFPVIERLEHLLAQLEAEIDILQ 211 E + E + + + + + LE A LE + +L Sbjct: 252 EAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAE---KAALEAEKADLEHQSQVLN 308 Query: 212 VEKRIRGRVKRQMEKSQREYYLNEQVKAIQKELGEGEEGAD--LEELEKRINAARMPKEA 269 R ++R ++ S+ +Q++A ++L E + ++ + L + ++A+R EA Sbjct: 309 AN---RQSLRRDLDASREAK---KQLEAEHQKLEEQNKISEASRQSLRRDLDASR---EA 359 Query: 270 KKKADAELKKLK 281 KK+ +AE +KL+ Sbjct: 360 KKQLEAEHQKLE 371
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 31.3 bits (71), Expect = 0.008 Identities = 19/112 (16%), Positives = 40/112 (35%), Gaps = 12/112 (10%) Query: 51 EAAAAGVEASLSKSDLPSPQEIRDILDQYVIGQERAKKILAVAVYNHYKRL-------KH 103 +A+ G L K E+ I+ + + +R L + + + Sbjct: 92 KASEKGAYDYLPKP--FDLTELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEI 149 Query: 104 LDKKDDVELSKSNILLIGPTGSGKTLLAQTLARL---LNVPFVIADATTLTE 152 + + +++ G +G+GK L+A+ L N PFV + + Sbjct: 150 YRVLARLMQTDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPR 201
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 67.1 bits (164), Expect = 1e-14 Identities = 72/323 (22%), Positives = 119/323 (36%), Gaps = 37/323 (11%) Query: 20 AATLAALSGPAHAQSTLTLYGVADAGVQYLSRADGRHAAWRLQN-----YGILPSQLGIK 74 A TLAAL P A + +TLYG AGV SR+ + A L S++G K Sbjct: 7 ALTLAAL--PVAAMADVTLYGTIKAGV-ETSRSVAHNGAQAASVETGTGIVDLGSKIGFK 63 Query: 75 GEEDLGGGWRARFQLEQGINLNDSTATVPGYAFFRGAYVGMGGPAGTVTLGRQFSTLFDK 134 G+EDLG G +A +Q+EQ ++ + + R +++G+ G G + +GR S L D Sbjct: 64 GQEDLGNGLKAIWQVEQKASIAGTDSGW----GNRQSFIGLKGGFGKLRVGRLNSVLKDT 119 Query: 135 TLFYDPLWYASYSGQGVLVPLSANFVDHSIKFQSATFAGFDVEALAAMAGIAGNTRAGRV 194 +P S + + S+++ S FAG ++ A N AGR Sbjct: 120 GDI-NPWDSKSDYLGVNKIAEPEARLI-SVRYDSPEFAGL-SGSVQ----YALNDNAGRH 172 Query: 195 ------LELGGQFTSRGLSASAVLHRSH-GTAQGGADRSAQRRDIGTFAARYAFASLPLT 247 + + R H ++ R + + +AS+ + Sbjct: 173 NSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHRLVSGYDNDALYASVAVQ 232 Query: 248 VHAGVQRLTGELDPARTIV-------WGGARYQASGRFGFAGGIYHTDSPTPQVGHPTLF 300 ++T V +G + S GF G T+ Sbjct: 233 QQDAKLVEENYSHNSQTEVAATLAYRFGNVTPRVSYAHGFKGSFDATNYNN----DYDQV 288 Query: 301 IASTTCSLSKRTVAYLNLGYAKN 323 + SKRT A ++ G+ + Sbjct: 289 VVGAEYDFSKRTSALVSAGWLQE 311
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 31.7 bits (72), Expect = 0.002 Identities = 48/240 (20%), Positives = 79/240 (32%), Gaps = 79/240 (32%) Query: 6 KRVLLKLSGEALM---GDDAFGINRATIERMVADIAEVVRLGTQLAVVIGG----GNIFR 58 KRV++ L G AL ++ + + IAE++ G ++ + G G++ Sbjct: 3 KRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLLL 62 Query: 59 GVAGGAAG-------MDRATADYMGMLATMMNALALQDAMRHAGIEARVQSALRMDQV-- 109 + G A MD A A G + M+ AL++ +R G+E +V + + V Sbjct: 63 HMDAGQATYGIPAQPMDVAGAMSQGWIGYMIQQ-ALKNELRKRGMEKKVVTIITQTIVDK 121 Query: 110 ------------------------------------------VEPYIRP------RAIRQ 121 V P P I++ Sbjct: 122 NDPAFQNPTKPVGPFYDEETAKRLAREKGWIVKEDSGRGWRRVVPSPDPKGHVEAETIKK 181 Query: 122 L-EEGKVVIFAAGTGNPFFTT-------------DTAAALRGSEVGAEVVLKATKVDGVY 167 L E G +VI + G G P D A EV A++ + T V+G Sbjct: 182 LVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAA 241
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 35.0 bits (80), Expect = 0.001 Identities = 36/277 (12%), Positives = 69/277 (24%), Gaps = 12/277 (4%) Query: 5 NPRPATPGRAPVRSGSLTARKVARPDPKAAGAKPAA-AKPAAKSASAAKPAAPRSAANAA 63 NP + V + ++T + D + + A+ PA P Sbjct: 982 NPEVEKRNQ-TVDTTNITTPNNIQADVPSVPSNNEEIARVDEAPVPPPAPATPSETTETV 1040 Query: 64 PKRAPGPSRPAAASEGKRVAKPRTAHDAGRTGGERAPAKRATTPGAASAPRTRRTDAKPA 123 + + S+ +E + + A T A S T+ T Sbjct: 1041 AENSKQESKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTET 1100 Query: 124 RRTNERPAGRDERAPRDSDARAFDAGTRGK-DRAPREGARPGARGATGAKFGGAARRSDD 182 + T + + ++ + E +P A A + Sbjct: 1101 KETATVEKEEKAKVETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQS 1160 Query: 183 ADRRTPRATRADSRARDAAPSSFAGKTATAGKRAPQRADDRYGAAGKRTSPRTE------ 236 T + T + + A + + +E Sbjct: 1161 QTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENTTPATTQPTVNSESSNKPK 1220 Query: 237 -RTERTERPARFGERPATRASASGERRPTARAATGSR 272 R R+ R PAT ++S +R A S Sbjct: 1221 NRHRRSVRSVPHNVEPAT--TSSNDRSTVALCDLTST 1255
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 54.3 bits (130), Expect = 2e-09 Identities = 46/342 (13%), Positives = 112/342 (32%), Gaps = 19/342 (5%) Query: 199 AAGVSKYKERRRETENRLHDTRENLTRVEDIVRELGANLEKLEAQAVVATKYKELVADGE 258 + ++ + + + + D+ A + + + KE + + Sbjct: 46 RSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSNAKEKLRKND 105 Query: 259 EKQRLLWLLRKNEAAAEQDRQRRAIGDAQIELDAQTAKLREVEAQLETLRVAHYSASDAM 318 + + A + D ++ A+ A A +AK++ +EA+ L A+ Sbjct: 106 KSLSEKASKIQELEARKADLEK-ALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKAL 164 Query: 319 QGAQGALYEANAEVSRLEAQIKFIVESRNRVQAQIAALVAQQEQWRAQADKAQGDLEAAE 378 +GA +A++ LEA+ + + ++ + + A+ + + A Sbjct: 165 EGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALA 224 Query: 379 EARAVADEKAAIAEDDAAAKHDALPALEARWRDAQTGLNDERGRIAQTEQALKLEAAHQR 438 +A ++ A + + A + LEA + + + ++A + Sbjct: 225 ARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIK 284 Query: 439 NA-------DQQLQQLQQRHERLKVEAGGLDAPDEAQLEELRMQLAEHEAMLADAQARLA 491 + + L+ + + L A + LR L +A Sbjct: 285 TLEAEKAALEAEKADLEHQSQVL-----------NANRQSLRRDLDASREAKKQLEAEHQ 333 Query: 492 DAQEALPRLDAQRRAAHERVQAESAQIHQLEARLAALKQLQE 533 +E +A R++ + A QLEA L++ + Sbjct: 334 KLEEQNKISEASRQSLRRDLDASREAKKQLEAEHQKLEEQNK 375 Score = 43.1 bits (101), Expect = 6e-06 Identities = 48/278 (17%), Positives = 104/278 (37%), Gaps = 16/278 (5%) Query: 268 RKNEAAAEQDRQRRAIGDAQIELDAQTAKLREVEAQLETLRVAHYSASDAMQGAQGALYE 327 +E E + + L + +K++E+EA+ L A++GA Sbjct: 86 HNDELTEELSNAKEKLRKNDKSLSEKASKIQELEARKADL-------EKALEGAMNFSTA 138 Query: 328 ANAEVSRLEAQIKFIVESRNRVQAQIAALVAQQEQWRAQADKAQGDLEAAEEARAVADEK 387 +A++ LEA+ + + ++ + + A+ + + A E +A ++ Sbjct: 139 DSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAELEKA 198 Query: 388 AAIAEDDAAAKHDALPALEARWRDAQTGLNDERGRIAQTEQALKLEAAHQRNADQQLQQL 447 A + + A + LEA D + ++A + + + L Sbjct: 199 LEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAAL 258 Query: 448 QQRHERLKVEAGGLDAPDEAQLEELRMQLAEHEAMLADA-------QARLADAQEALPRL 500 + R L+ G A +++ AE A+ A+ Q A+ Q L Sbjct: 259 EARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL 318 Query: 501 DAQRRAAHERVQAESAQI-HQLEARLAALKQLQENVQT 537 DA R A ++++AE ++ Q + A+ + L+ ++ Sbjct: 319 DASRE-AKKQLEAEHQKLEEQNKISEASRQSLRRDLDA 355 Score = 40.8 bits (95), Expect = 3e-05 Identities = 40/313 (12%), Positives = 95/313 (30%), Gaps = 20/313 (6%) Query: 734 TEVRAQAERA--TQRVHALQMDVLKLTQAHERYTQRSTQIREELEEIGAQIEEQRALRAE 791 T + T + +Q K + +++ + + + +E + Sbjct: 37 TNEVSAVATRSQTDTLEKVQERADKFEIENNTLKLKNSDLSFNNKALKDHNDELTEELSN 96 Query: 792 SEANFERHDAELAELQARFEDNQLAFESLDETLTNARQEARERERAATDARFAARQSANR 851 ++ ++D L+E ++ ++ + L++ L A + Sbjct: 97 AKEKLRKNDKSLSEKASKIQELEARKADLEKALEGAMNFSTADSAKI-----------KT 145 Query: 852 IDELKRSIQVAHEQAERVAASLEDARAELETINEQTAHTGLQDALEVRAAKEQALGAARA 911 ++ K ++ E+ + +T +A E+AL A Sbjct: 146 LEAEKAALAARKADLEKALEGAMNFSTADSA-KIKTLEAEKAALEARQAELEKALEGAMN 204 Query: 912 ELDDLTAKLRAADEARLAAERSLQPLRDRITELQLKEQAARMTGEQFAEQLATAEVDEAA 971 +AK++ + + A L + A + + A E +A Sbjct: 205 FSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSAKIKTLEAEKAALEARQAE 264 Query: 972 LKEKLMPDMKPSYLQGEVTRINNAINALGPVNMAALDELAAASERKVFLDAQSADLTNAI 1031 L++ L T + I L A E A + L+A L + Sbjct: 265 LEKAL------EGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQVLNANRQSLRRDL 318 Query: 1032 ETLEDAIRKIDQE 1044 + +A ++++ E Sbjct: 319 DASREAKKQLEAE 331 Score = 40.8 bits (95), Expect = 3e-05 Identities = 57/268 (21%), Positives = 99/268 (36%), Gaps = 11/268 (4%) Query: 714 EAKAAAIRAEAAHTQASQALTEVRAQAERATQRVHALQMDVLKLTQAHERYTQRSTQIRE 773 +A E A A T A+ + AL L +A E ST Sbjct: 187 ALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALAARKADLEKALEGAMNFSTADSA 246 Query: 774 ELEEIGAQIEEQRALRAESEANFERHDAELAELQARFEDNQLAFESLDETLTNARQEARE 833 +++ + A+ A +AE E E A+ + + +L+ + +++ Sbjct: 247 KIKTLEAEKAALEARQAELEKALEGAMNFSTADSAKIKTLEAEKAALEAEKADLEHQSQV 306 Query: 834 RERAATDARFAARQSANRIDELKRSIQVAHEQAERVAASLEDARAELETINEQTAHTGLQ 893 R S +L+ Q EQ + AS + R +L+ E Sbjct: 307 LNANRQSLRRDLDASREAKKQLEAEHQKLEEQNKISEASRQSLRRDLDASREAKK----- 361 Query: 894 DALEVRAAK-EQALGAARAELDDLTAKLRAADEARLAAERSLQPLRDRITELQLK----E 948 LE K E+ + A L L A+ EA+ E++L+ ++ L+ E Sbjct: 362 -QLEAEHQKLEEQNKISEASRQSLRRDLDASREAKKQVEKALEEANSKLAALEKLNKELE 420 Query: 949 QAARMTGEQFAEQLATAEVDEAALKEKL 976 ++ ++T ++ AE A E + ALKEKL Sbjct: 421 ESKKLTEKEKAELQAKLEAEAKALKEKL 448
>cloacin#Cloacin signature. Length = 551 Score = 27.8 bits (61), Expect = 0.012 Identities = 18/59 (30%), Positives = 22/59 (37%), Gaps = 1/59 (1%) Query: 49 GTVNVWGGDGWRDRDHWHGGDDRWHGGWRGGGNWRDGNDWHGGRGNGWQGGRGPAGGRN 107 G + G G D W ++ W GG G +W G HG G G G G N Sbjct: 23 GPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHW-GGGSGHGNGGGNGNSGGGSGTGGN 80 Score = 26.2 bits (57), Expect = 0.046 Identities = 20/51 (39%), Positives = 23/51 (45%), Gaps = 3/51 (5%) Query: 74 GGWRGGGNWRDGNDWHGGRGNGWQGGRGPAGGRNVRGGNDWPDGGGNGRGG 124 G G G + N W GG G+G G G G GN GGG+G GG Sbjct: 32 GASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN---SGGGSGTGG 79
>FERRIBNDNGPP#Ferrichrome-binding periplasmic protein signature. Length = 296 Score = 39.9 bits (93), Expect = 7e-06 Identities = 39/186 (20%), Positives = 68/186 (36%), Gaps = 9/186 (4%) Query: 42 AITLAAPARRVVSLAPHVTELIYAAG----GGAKLVGAVSYSDYPPAAKAIARVGSNKAL 97 A A R+V+L EL+ A G G A + + PP ++ VG Sbjct: 28 AHAAAIDPNRIVALEWLPVELLLALGIVPYGVADTINYRLWVSEPPLPDSVIDVGLRTEP 87 Query: 98 DLERIAALKPDLIVVWRHGNAEHETERLRALGIPLYFSEPRH-LDDVAASLDKLGLLLGT 156 +LE + +KP +V E A G FS+ + L SL ++ LL Sbjct: 88 NLELLTEMKPSFMVWSAGYGPSPEMLARIAPGRGFNFSDGKQPLAMARKSLTEMADLLNL 147 Query: 157 HEIASAAADAYRRRIAQLRARYADK--PPVTVFFQAWDKPLITLNGDH-IVSDVIALCGG 213 A Y I ++ R+ + P+ + D + + G + + +++ G Sbjct: 148 QSAAETHLAQYEDFIRSMKPRFVKRGARPL-LLTTLIDPRHMLVFGPNSLFQEILDEYGI 206 Query: 214 RNVFAR 219 N + Sbjct: 207 PNAWQG 212
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 31.8 bits (72), Expect = 0.007 Identities = 13/39 (33%), Positives = 22/39 (56%), Gaps = 3/39 (7%) Query: 190 KKNDAGEVVASILAGG-GLGRTPIVGAIIRENLPWQHLL 227 + A ++ SI+A G G+G P +G +I + W +LL Sbjct: 136 NRGKAFGLIGSIVAMGEGVG--PAIGGMIAHYIHWSYLL 172
>PF06580#Sensor histidine kinase Length = 349 Score = 32.9 bits (75), Expect = 0.002 Identities = 29/141 (20%), Positives = 50/141 (35%), Gaps = 25/141 (17%) Query: 40 ALWLASLRG---HAVAGLSPAISGLAWHVHEMVFGFSAAIIVGFLLTAIRAWTSRETLHG 96 W G + + G A + +H M+F + +++ L A R++ R Sbjct: 11 YYWYCQGIGWGVYTLTGFGFASLYGSPKLHSMIFNIAISLMGLVLTHAYRSFIKR----- 65 Query: 97 APLAALWLPWAAGRLLVWAGPEPLAAVVDSAFLPITAILLLRVLLAARNHRNVFLTVALF 156 WL G++++ P A V + A + LLA N + V T+ L Sbjct: 66 ----QGWLKLNMGQIILRVLP----ACVVIGMVWFVANTSIWRLLAFINTKPVAFTLPLA 117 Query: 157 L---------FGALNALFHGW 168 L + L+ GW Sbjct: 118 LSIIFNVVVVTFMWSLLYFGW 138
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 72.1 bits (177), Expect = 8e-16 Identities = 53/301 (17%), Positives = 108/301 (35%), Gaps = 50/301 (16%) Query: 288 VMVTGAGGSIGSELCRQILKFQPAQLIAFD-LSEYAMYRLTEELRERFPDLPVVPIIGDA 346 +VTGA G IG + +++L+ Q++ D L++Y L + E D Sbjct: 3 YLVTGAAGFIGFHVSKRLLE-AGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 347 KDSLLLDQVMSRYAPHIVFHAAAYKHVPLMEELNAWQALRNNVLGTYRVARAAIRHDVRH 406 D + + + VF + V E N +N+ G + + ++H Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLE-NPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 407 FVLIST---------------DKAVNPTNVMGASKRLAE-MACQALQQTSARTQFETV-- 448 + S+ D +P ++ A+K+ E MA S Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTY----SHLYGLPATGL 176 Query: 449 RFGNVLGSAGS---VIPKFQQQIAKGGPVTV-THPEITRFFMTIPEASQLVLQA------ 498 RF V G G + KF + + +G + V + ++ R F I + ++ +++ Sbjct: 177 RFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPH 236 Query: 499 ------------SSMGQGGEIFILDMGEPVKIVDLARDLIRLYGFTEEQIRIEFSGLRPG 546 ++ ++ + PV+++D + L G + + L+PG Sbjct: 237 ADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGI---EAKKNMLPLQPG 293 Query: 547 E 547 + Sbjct: 294 D 294
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 104 bits (262), Expect = 5e-28 Identities = 67/344 (19%), Positives = 130/344 (37%), Gaps = 42/344 (12%) Query: 3 RVIVTGANGFVGRALCRALLAAGHEVTGL-------------VRRRGVCAEGVSEWVHEA 49 + +VTGA GF+G + + LL AGH+V G+ R + G H+ Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQ--FHKI 59 Query: 50 D--DFDGVADRWPAGLQVDAVVHLAARVHMMRDRSPDPDAAFRASNVAATMRVARAAQQQ 107 D D +G+ D + +G + V R+ + S + A+ SN+ + + + Sbjct: 60 DLADREGMTDLFASG-HFERVFISPHRLAV--RYSLENPHAYADSNLTGFLNILEGCRHN 116 Query: 108 GARRFVFLS--SVKAIAESDGGTPLCE-NSTPAPQDAYGRSKLEAERALEQLRDELSFDT 164 + ++ S SV + P +S P Y +K E Sbjct: 117 KIQHLLYASSSSVYGLNRKM---PFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGLPA 173 Query: 165 VIVRPPLVYGPGVRAN--FLSLMRAVSRGVPLPL-GAVRARRSMVYVDNLADAVMRCVTE 221 +R VYGP R + +A+ G + + + +R Y+D++A+A++R Sbjct: 174 TGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRLQDV 233 Query: 222 PAATNGCFHVADSDMPPTIAEL-LDDIGHHLGRPARLLPVPERLLRVAGALTGRAAQ--- 277 + + V +IA + +IG+ P L+ ++ G A+ Sbjct: 234 IPHADTQWTVETGTPAASIAPYRVYNIGN--SSPVELM----DYIQALEDALGIEAKKNM 287 Query: 278 IDRLTSDLR---LDTTHIRTVLDWRPPRSSEEGLAETACWFKSL 318 + D+ DT + V+ + P + ++G+ W++ Sbjct: 288 LPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRDF 331
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 168 bits (426), Expect = 2e-51 Identities = 83/363 (22%), Positives = 136/363 (37%), Gaps = 58/363 (15%) Query: 6 KILVTGGAGFIGCAISERLAARASRYVVMDNLHPQIHASAVRPGALHEKAE----LVVAD 61 K LVTG AGFIG +S+RL + V +DNL+ + +++ L A+ D Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDY-YDVSLKQARLELLAQPGFQFHKID 60 Query: 62 VTDAGAWDALLSDFQPEIIIHLAAETGTGQSLTEASRHALVNVVGTTRLTDALVKHGIVV 121 + D L + E + SL +A N+ G + + + + Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNK--I 118 Query: 122 EHILLTSSRAVYGEGAWQKDDGTIVYPGQRGRAQLEAAQWDFPGMTMLPSRADRTEPRPT 181 +H+L SS +VYG +P D + P Sbjct: 119 QHLLYASSSSVYGLN------------------------------RKMPFSTDDSVDHPV 148 Query: 182 SVYGATKLAQEHVLRAWSLATKTPLSILRLQNVYGPGQSLTNSYTGIVALFSRLAREKKV 241 S+Y ATK A E + +S P + LR VYGP + F++ E K Sbjct: 149 SLYAATKKANELMAHTYSHLYGLPATGLRFFTVYGPWGRPDMALF----KFTKAMLEGKS 204 Query: 242 IPLYEDGNVTRDFVSIDDVADAIVATLVRTPEA-----------------LSLFDIGSGQ 284 I +Y G + RDF IDD+A+AI+ P A +++IG+ Sbjct: 205 IDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSS 264 Query: 285 ATSILDMARIIAAHYGAPEPQINGAFRDGDVRHAACDLSESLANLGWKPQWSLKRGIGEL 344 ++D + + G + + GDV + D +G+ P+ ++K G+ Sbjct: 265 PVELMDYIQALEDALGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNF 324 Query: 345 QTW 347 W Sbjct: 325 VNW 327
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 30.3 bits (68), Expect = 0.007 Identities = 16/59 (27%), Positives = 24/59 (40%) Query: 195 LFTMVLMFLSPVFYPASALPEKYRFWLELNPLTLFIEQSRGILLEGRVPDFHPLGLAFL 253 L ++FLS +P LP ++ PL+ I+ R I+L V D A Sbjct: 184 LVITPILFLSGAVFPVDQLPIVFQTAARFLPLSHSIDLIRPIMLGHPVVDVCQHVGALC 242
>PYOCINKILLER#Pyocin S killer protein signature. Length = 617 Score = 30.1 bits (67), Expect = 0.028 Identities = 45/160 (28%), Positives = 64/160 (40%), Gaps = 8/160 (5%) Query: 477 ANALSVANPAALTAAANTVAGTLARAANGTPVAGAIGGLVAALPVANPAGALTSAANNAA 536 A A A A AA A T A ANG+ VA A G + VA A +L A ++A Sbjct: 228 AEAKRKAEEQARQQAAIRAANTYAMPANGSVVATAAGR--GLIQVAQGAASLAQAISDAI 285 Query: 537 STIATVAGTNPAAAIGGVAGALTGAAGTGVATASQLGSVGSALMGSGAASAGKVLTSGSA 596 + + V + P+ G A +LT ++ T Q +G AA G + Sbjct: 286 AVLGRVLASAPSVMAVGFA-SLTYSSRTAEQWQDQTPDSVRYALGMDAAKLGLPPSVNLN 344 Query: 597 AFGSAAASAG-----SLLTTGAAATSSVVNSLGSSVGAVV 631 A A+ + + G T SVV++ G SV V Sbjct: 345 AVAKASGTVDLPMRLTNEARGNTTTLSVVSTDGVSVPKAV 384
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 60.5 bits (146), Expect = 4e-12 Identities = 34/189 (17%), Positives = 68/189 (35%), Gaps = 10/189 (5%) Query: 46 QNSTPAGAEAELWTSVPDTSTPQPAPTPPVKVAPPPPPVKNEEADIALQQKRREQQAAAA 105 +TP +A++ SVP + V PP P +E + + ++E + Sbjct: 996 NITTPNNIQADV-PSVPSNNEEIARVDEAP-VPPPAPATPSETTETVAENSKQESKTVEK 1053 Query: 106 REAQLEEQRRQQQLKAQQ-----LAAQQAAQLAAQKAAEREKQKQAEKLKQQQLAEQQQR 160 E E Q + A++ A Q ++A + +E Q K E++ + Sbjct: 1054 NEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAK 1113 Query: 161 KLEQQKLEQQKLEQQ---KKQEQLAAQKKADAEKAEKAEKAAKAAAAAKANAAAKAKLDK 217 ++ E K+ Q K+++ Q +A+ + K + A + K Sbjct: 1114 VETEKTQEVPKVTSQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAK 1173 Query: 218 ERQARLAQL 226 E + + Q Sbjct: 1174 ETSSNVEQP 1182 Score = 41.2 bits (96), Expect = 6e-06 Identities = 18/132 (13%), Positives = 44/132 (33%), Gaps = 8/132 (6%) Query: 93 LQQKRREQQAAAAREAQLEEQRRQQQLKAQQLAAQQAAQLAAQKAAEREKQKQ--AEKLK 150 Q Q + +++A A + A + + AE K Sbjct: 988 RNQTVDTTNITTPNNIQADVPSVPS--NNEEIARVDEAPVPPPAPATPSETTETVAENSK 1045 Query: 151 QQQLAEQQQRKLEQQKLEQQKLEQQKKQEQLAAQKKADAEKAEKAEKAAKAAAAAKANAA 210 Q+ ++ Q + + ++ ++ + KA+ + E A+ ++ Sbjct: 1046 QESKTVEKNE----QDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETK 1101 Query: 211 AKAKLDKERQAR 222 A ++KE +A+ Sbjct: 1102 ETATVEKEEKAK 1113 Score = 33.1 bits (75), Expect = 0.002 Identities = 19/145 (13%), Positives = 44/145 (30%), Gaps = 7/145 (4%) Query: 87 EEADIALQQKRREQQAAAAREAQLEEQRRQQQLKAQQLAAQQAAQLAAQKAAERE----- 141 +EA ++ + + A + E Q + + A ++A + + Sbjct: 1070 KEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVTSQV 1129 Query: 142 --KQKQAEKLKQQQLAEQQQRKLEQQKLEQQKLEQQKKQEQLAAQKKADAEKAEKAEKAA 199 KQ+Q+E ++ Q ++ K Q + EQ A + ++ E+ Sbjct: 1130 SPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTV 1189 Query: 200 KAAAAAKANAAAKAKLDKERQARLA 224 + N + Sbjct: 1190 NTGNSVVENPENTTPATTQPTVNSE 1214
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 100 bits (250), Expect = 5e-28 Identities = 32/137 (23%), Positives = 56/137 (40%), Gaps = 10/137 (7%) Query: 33 QGDAVSTQPNPENVAQVTVDPLNDPNSPLAKRSVYFDFDSYSVQDQYQALLQQHAQYLKS 92 QG+A A S V F+F+ +++ + QA L Q L + Sbjct: 193 QGEAAPVVAPAPAPAPEVQTKHFTLKS-----DVLFNFNKATLKPEGQAALDQLYSQLSN 247 Query: 93 HPQRH--ILIQGNTDERGTSEYNLALGQKRAEAVRRALSLLGVGDAQMEAVSLGKEKPVA 150 + +++ G TD G+ YN L ++RA++V L G+ ++ A +G+ PV Sbjct: 248 LDPKDGSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKISARGMGESNPVT 307 Query: 151 LGHDEASWAQNRRADLV 167 +RA L+ Sbjct: 308 ---GNTCDNVKQRAALI 321
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 127 bits (322), Expect = 4e-36 Identities = 89/386 (23%), Positives = 143/386 (37%), Gaps = 62/386 (16%) Query: 1 MKKTLIVAALSGVFATAAHAQSSVTLYGLIDAGITYTNNQGGHSAWS-----QSTGSVNG 55 MKK+LI L+ + A + VTLYG I AG+ + + + A + + G Sbjct: 1 MKKSLIALTLAALPVAAM---ADVTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLG 57 Query: 56 SRWGLRGAEDLGGGLKAIFVLENGFGINNGTLKQNGREFGRQAFVGLSHEQYGALTLGRQ 115 S+ G +G EDLG GLKAI+ +E I + RQ+F+GL +G L +GR Sbjct: 58 SKIGFKGQEDLGNGLKAIWQVEQKASIAGT----DSGWGNRQSFIGLK-GGFGKLRVGRL 112 Query: 116 YDSVVDYLG--PLSLTGTQFGGTQFAHPFDNDNLNNSFRINNAVKYTSVNWAGLKFGALY 173 + D P G + A P + S V+Y S +AGL Y Sbjct: 113 NSVLKDTGDINPWDSKSDYLGVNKIAEP---EARLIS------VRYDSPEFAGLSGSVQY 163 Query: 174 GFSNNNQFANNRAYSAGVSYSYAGFNIGAGYLQLNNNFGPTVSNASGAVALDNTFVGKRQ 233 ++N N+ +Y AG +Y GF + G ++ ++ Sbjct: 164 ALNDNAGRHNSESYHAGFNYKNGGFFVQYGGAYKRHHQV------------QENVNIEKY 211 Query: 234 RVFGGGLNYTFGPATAGFVFTQSRVNRATAIGAGASGVSSGIALDGTFMRFNNYEVNARY 293 ++ Y A + + A + S S RF N Sbjct: 212 QIHRLVSGYD---NDALYASVAVQQQDAKLVEENYSHNSQTEVAATLAYRFG----NVTP 264 Query: 294 AITPAWTVAGSYTYTAGFIENHHPGWNQFNLQTAYALSKRTDVYLQGVYQKVNNDGTGLG 353 ++ A GS+ T N++ ++Q + Y SKRT + + + +G G Sbjct: 265 RVSYAHGFKGSFDAT-----NYNNDYDQVVVGAEYDFSKRTSALVSAGWLQ---EGKGES 316 Query: 354 AYINGIGGMSSTEKQIAVTAGLRHRF 379 ++ A GLRH+F Sbjct: 317 KFV-----------STAGGVGLRHKF 331
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 30.8 bits (69), Expect = 0.023 Identities = 19/52 (36%), Positives = 28/52 (53%), Gaps = 7/52 (13%) Query: 460 ARLEKIKIEKELEELRAEKAKLEELLANESAMKRLMIKE-------IEADAK 504 +R K ++EK LEE ++ A LE+L K+L KE +EA+AK Sbjct: 391 SREAKKQVEKALEEANSKLAALEKLNKELEESKKLTEKEKAELQAKLEAEAK 442
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 624 bits (1610), Expect = 0.0 Identities = 172/683 (25%), Positives = 295/683 (43%), Gaps = 75/683 (10%) Query: 9 RYRNIGISAHIDAGKTTTTERILFYTGVSHKIGEVHDGAATMDWMEQEQERGITITSAAT 68 + NIG+ AH+DAGKTT TE +L+ +G ++G V G D E++RGITI + T Sbjct: 2 KIINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGIT 61 Query: 69 TAFWKGMAGNYPEHRINIIDTPGHVDFTIEVERSMRVLDGACMVYDSVGGVQPQSETVWR 128 + W+ ++NIIDTPGH+DF EV RS+ VLDGA ++ + GVQ Q+ ++ Sbjct: 62 SFQWEN-------TKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFH 114 Query: 129 QANKYKVPRIAFVNKMDRVGADFFRVQRQIGERLKGVAVPIQIPVGAEEHFQGVVDLVKM 188 K +P I F+NK+D+ G D V + I E+L V Q V M Sbjct: 115 ALRKMGIPTIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQ----------KVELYPNM 164 Query: 189 KAIVWDDESQGVKFTYEDIPANLVELAHEWREKMVEAAAEASEELLEKYLTDHNSLTEDE 248 + + Q + E +++LLEKY+ SL E Sbjct: 165 CVTNFTESEQ------------------------WDTVIEGNDDLLEKYM-SGKSLEALE 199 Query: 249 IKAALRKRTIANEIVPMLCGSAFKNKGVQAMLDAVIDYLPSPADVPAILGHDLDDKEAER 308 ++ R + P+ GSA N G+ +++ + + S Sbjct: 200 LEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSSTH---------------- 243 Query: 309 HPSDDEPFSALAFKIMTDPFVGQLIFFRVYSGVVESGDTLLNATKDKKERLGRILQMHAN 368 FKI +L + R+YSGV+ D++ + K+K ++ + Sbjct: 244 --RGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSVRISEKEKI-KITEMYTSING 300 Query: 369 ERKEIKEVRAGDIAAAVG--LK-EATTGDTLCDPGKPIILEKMEFPEPVISQAVEPKTKA 425 E +I + +G+I LK + GDT P + E++E P P++ VEP Sbjct: 301 ELCKIDKAYSGEIVILQNEFLKLNSVLGDTKLLPQR----ERIENPLPLLQTTVEPSKPQ 356 Query: 426 DQEKMGLALNRLAQEDPSFRVQTDEESGQTIISGMGELHLEIIVDRMKREFGVEATVGKP 485 +E + AL ++ DP R D + + I+S +G++ +E+ ++ ++ VE + +P Sbjct: 357 QREMLLDALLEISDSDPLLRYYVDSATHEIILSFLGKVQMEVTCALLQEKYHVEIEIKEP 416 Query: 486 QVAYRETVRTVAEDVEGKFVKQSGGRGQYGHAVIKLEPNP-GKGYEFLDEIKGGVIPREF 544 V Y E + E + + + + P P G G ++ + G + + F Sbjct: 417 TVIYME---RPLKKAEYTIHIEVPPNPFWASIGLSVSPLPLGSGMQYESSVSLGYLNQSF 473 Query: 545 IPAVNKGIEETLKSGVLAGYPVVDVKVHLTFGSYHDVDSNENAFRMAGSMAFKEAMRRAK 604 AV +GI + G L 
G+ V D K+ +G Y+ S FRM + ++ +++A Sbjct: 474 QNAVMEGIRYGCEQG-LYGWNVTDCKICFKYGLYYSPVSTPADFRMLAPIVLEQVLKKAG 532 Query: 605 PVLLEPMMAVEVETPEDFMGNVMGDLSSRRGIVQGMEDIAGGGGKLVRAEVPLAEMFGYS 664 LLEP ++ ++ P++++ D + + ++ E+P + Y Sbjct: 533 TELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQ--LKNNEVILSGEIPARCIQEYR 590 Query: 665 TSLRSATQGRATYTMEFKHYAET 687 + L T GR+ E K Y T Sbjct: 591 SDLTFFTNGRSVCLTELKGYHVT 613
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 138 bits (349), Expect = 3e-38 Identities = 92/408 (22%), Positives = 171/408 (41%), Gaps = 15/408 (3%) Query: 17 VMLWLVATGFFMQTLDATIVNTALPSMAASLGESPLRMQSVVIAYSLTMAVMIPVSGWLA 76 +++WL FF L+ ++N +LP +A + P V A+ LT ++ V G L+ Sbjct: 15 ILIWLCILSFF-SVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLS 73 Query: 77 DTLGTRRVFFSAILIFTLGSLLCANAHT-LPLLVAFRVIQGVGGAMLLPVGRLAVLRTFP 135 D LG +R+ I+I GS++ H+ LL+ R IQG G A + + V R P Sbjct: 74 DQLGIKRLLLFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIP 133 Query: 136 AERYLPALSFVAIPGLIGPLIGPTLGGWLVKIASWHWIFLINVPVGIAGCIATFYSMPDS 195 E A + +G +GP +GG + HW +L+ +P+ + + Sbjct: 134 KENRGKAFGLIGSIVAMGEGVGPAIGGMIAH--YIHWSYLLLIPMITIITVPFLMKLLKK 191 Query: 196 RNPAAGRFDLKGYLLLTIGMIAISLSLDGLADLGMQHAMVLVLLILSLACFVAYGLYAVR 255 G FD+KG +L+++G++ L L++ +LS FV + + Sbjct: 192 EVRIKGHFDIKGIILMSVGIVFFMLFTT------SYSISFLIVSVLSFLIFVKHIR---K 242 Query: 256 APQPIFSLELFGIHTFSVGLLGNLFARIGSGAMPYLIPLLLQVSLGYGAFEAG-LMMLPV 314 P L F +G+L ++P +++ E G +++ P Sbjct: 243 VTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPG 302 Query: 315 AAAGMFSKRIITVLITRHGYRKVLLANTIMVGLMMASFALVSDAMPTWLKIAQLALFGGF 374 + + I +L+ R G VL + + + + + + ++ I + + GG Sbjct: 303 TMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLGGL 362 Query: 375 NSMQFTAMNTLTLKDLGTGGASSGNSLFSLVQMLSMSLGVTVAGALLA 422 + + T ++T+ L A +G SL + LS G+ + G LL+ Sbjct: 363 SFTK-TVISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIVGGLLS 409
>SECA#SecA protein signature. Length = 901 Score = 24.8 bits (54), Expect = 0.027 Identities = 14/36 (38%), Positives = 20/36 (55%), Gaps = 5/36 (13%) Query: 33 KLAYPIRDGIPVMLVDEARQTVEGTPVDPAGPARGR 68 KL Y + D + +L+DEAR TP+ +GPA Sbjct: 202 KLHYALVDEVDSILIDEAR-----TPLIISGPAEDS 232
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 37.1 bits (86), Expect = 3e-04 Identities = 33/143 (23%), Positives = 59/143 (41%), Gaps = 9/143 (6%) Query: 152 AKPADANAEGEDAGAQKETPLAQFTQNLNQMAKDGR-IDPLIGRESEVERVVQVLCR--R 208 KP D + LA+ + +++ D + PL+GR + ++ + +VL R + Sbjct: 103 PKPFDL----TELIGIIGRALAEPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQ 158 Query: 209 RKNNPLLVGEAGVGKTAIAEGLAYRITRGEVPDILANAQVYSLD-MGALLAGTKYRGDFE 267 ++ GE+G GK +A L R P + N D + + L G + +G F Sbjct: 159 TDLTLMITGESGTGKELVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHE-KGAFT 217 Query: 268 QRLKTVLKELKERPHAILFIDEI 290 ++ LF+DEI Sbjct: 218 GAQTRSTGRFEQAEGGTLFLDEI 240 Score = 32.5 bits (74), Expect = 0.006 Identities = 39/183 (21%), Positives = 63/183 (34%), Gaps = 32/183 (17%) Query: 446 QDDRSKLQTLDRDLKSVVFGQDPAIDALAAAIKMARAGLGKLDKPIGAFLFSGPTGVGKT 505 + SKL+ +D +V G+ A+ + + + D + + +G +G GK Sbjct: 123 KRRPSKLEDDSQDGMPLV-GRSAAMQEIYRVLARL----MQTDLTL---MITGESGTGKE 174 Query: 506 EVAR---QLAFTLGIELIRFDMSEYMERHAVSRLIGAPPGYVGFDQGGLLTEAVTKKPHC 562 VAR + +M+ S L G + G T A T+ Sbjct: 175 LVARALHDYGKRRNGPFVAINMAAIPRDLIESELFGH--------EKGAFTGAQTRSTGR 226 Query: 563 V-------LLLDEIEKAHPDIFNVLLQVMDHGTLT---DNNGRKADFRNVIIIMTTNAGA 612 L LDEI D LL+V+ G T ++D R I+ TN Sbjct: 227 FEQAEGGTLFLDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVR---IVAATNKDL 283 Query: 613 ESM 615 + Sbjct: 284 KQS 286
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 39.3 bits (91), Expect = 1e-04 Identities = 28/123 (22%), Positives = 48/123 (39%), Gaps = 5/123 (4%) Query: 2142 LVVGGTGGLGFASARWMVERGARRLTLASRSGELAVAARDEIECWRATLGVAVDIVSCDV 2201 + G G+G A AR + +GA + +L + DV Sbjct: 12 FITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSL-----KAEARHAEAFPADV 66 Query: 2202 TDAAAVDAMIAAIVRRDIPLKGVLHSAMTIDDGLVRNLDDARMAAVLAPKVAGAWNLHRA 2261 D+AA+D + A I R P+ +++ A + GL+ +L D A + G +N R+ Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 2262 TRS 2264 Sbjct: 127 VSK 129
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 70.5 bits (172), Expect = 2e-16 Identities = 66/249 (26%), Positives = 101/249 (40%), Gaps = 26/249 (10%) Query: 12 ITGASAGLGRALARAYARPGVVLSLGGRDAVRLEESAADCRARGATVFVASIDVRDADAM 71 ITGA+ G+G A+AR A G ++ + +LE+ + +A DVRD+ A+ Sbjct: 13 ITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDSAAI 72 Query: 72 R----RWLEQFDDAHPIHLLIANAGVASTLAHGGDWEARERTAAIVDTNFYGAMNAVLPV 127 R + PI +L+ AGV + E A N G NA V Sbjct: 73 DEITARIEREMG---PIDILVNVAGVLRPGLI--HSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 128 IDRMRARGSGQVALISSLAALRGMAISPAYCASKAALKAWGDSVRPVLKRDGIRLSVVLP 187 M R SG + + S A AY +SKAA + + L IR ++V P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 188 GFVKTAMSDVFPADKPLLWSPDKAAQYIQRGIAARRAEIAFPALLALGMRLLPLL-PAVM 246 G +T M + LW+ + A+ + +G G+ L L P+ + Sbjct: 188 GSTETDM-------QWSLWADENGAEQVIKGSLET---------FKTGIPLKKLAKPSDI 231 Query: 247 ADAILGRLS 255 ADA+L +S Sbjct: 232 ADAVLFLVS 240
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 129 bits (326), Expect = 4e-37 Identities = 80/352 (22%), Positives = 136/352 (38%), Gaps = 52/352 (14%) Query: 4 RVLITGITGMVGSHLADFLLENTDWEIYGLCRWRSPLDNV-SHLLPRINEKNRIRL---- 58 + L+TG G +G H++ LLE ++ G+ DN+ + + + L Sbjct: 2 KYLVTGAAGFIGFHVSKRLLEAGH-QVVGI-------DNLNDYYDVSLKQARLELLAQPG 53 Query: 59 ---VYGDLRDYLSIHEAVKQSTPDFVFHLAAQSYPKTSFDSPLDTLETNVQGTANVLEAL 115 DL D + + + VF + + S ++P ++N+ G N+LE Sbjct: 54 FQFHKIDLADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGC 113 Query: 116 RKNNIDAVTHVCASSEVFGRVPREKLPIDEE-CTFHPASPYAISKVGTDLIGRYYAEAYN 174 R N I + +SS V+G K+P + HP S YA +K +L+ Y+ Y Sbjct: 114 RHNKIQHLL-YASSSSVYGL--NRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170 Query: 175 MTVMTTRMFTHTGPR-RGDVFAESTFAKQIAMIERGLIPPVVKTGNLDSLRTFADVRDAV 233 + R FT GP R D+ A F K + G V G + R F + D Sbjct: 171 LPATGLRFFTVYGPWGRPDM-ALFKFTKAML---EGKSIDVYNYGKM--KRDFTYIDDIA 224 Query: 234 RAYYMLVTINPI-----------------PGAYYNIGGTYSCTVGQMLDTLISMSTSKDV 276 A L + P P YNIG + + + L +D Sbjct: 225 EAIIRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQAL------EDA 278 Query: 277 IRVETDPE--RLRPIDADLQVPNTRKFEAVTGWKPEISFEKTMEDLLNYWRA 326 + +E L+P D +T+ V G+ PE + + +++ +N++R Sbjct: 279 LGIEAKKNMLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 44.8 bits (106), Expect = 1e-07 Identities = 59/332 (17%), Positives = 105/332 (31%), Gaps = 82/332 (24%) Query: 1 MKVFLVGSTGYIGKTLFDA-CSRRWRTLGT-STRDGADIVFSLARAEAFPYEQVSA--GD 56 MK + G+ G+IG + + +G + D D+ AR E D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 57 ------------------VVAVAA------AISSPDACAKDYETAFQVNVTGTLTLIRGV 92 V ++ +P A Y N+TG L ++ G Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHA----Y---ADSNLTGFLNILEG- 112 Query: 93 VARGA---RVIFFSSDTVYGASEQLLSEEAELT--PAGAYGAMKRRVEA---ELGENAAV 144 R +++ SS +VYG + ++ + P Y A K+ E + Sbjct: 113 -CRHNKIQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMAHTYSHLYGL 171 Query: 145 KVIRLSY--VFSLRDR-------FTQYLLGCAKEGKRADIFK--PFSRCVVYLSDVVEGV 193 L + V+ R FT+ +L EGK D++ R Y+ D+ E + Sbjct: 172 PATGLRFFTVYGPWGRPDMALFKFTKAML----EGKSIDVYNYGKMKRDFTYIDDIAEAI 227 Query: 194 VSLIE-------RWD---------AIDERVINFVGPELVAREDFVEKIRNLAAPELDYGF 237 + L + +W RV N V D+++ + + E Sbjct: 228 IRLQDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKNM 287 Query: 238 SEP-EGDFFVNRPRIINVSSARFEKLLGRRPR 268 GD + + +++G P Sbjct: 288 LPLQPGDVLETSA---DTKALY--EVIGFTPE 314
>PF05043#Transcriptional activator Length = 493 Score = 29.9 bits (67), Expect = 0.007 Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 7/76 (9%) Query: 48 RHLEEIGASLRIDIDE---IESWCVDELKSREVGENDGGKQIDISVTDFILANCRQKRLF 104 H + + +L +E W EL+ + D DI +++FI+ KRL Sbjct: 414 YHAKFVAETLSYYCSNNFELEVW--TELELSKESLED--SPYDIIISNFIIPPIENKRLI 469 Query: 105 YTMNHPTAALMREIAA 120 Y+ N T +L+ + A Sbjct: 470 YSNNINTVSLIYLLNA 485
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 38.0 bits (88), Expect = 2e-05 Identities = 32/139 (23%), Positives = 58/139 (41%), Gaps = 7/139 (5%) Query: 88 MAVTPNLALMYHRNVKVIDIFIARILLEVVGNTASFFVLMITFHALGLVDYPEDILEVMF 147 M M + +++ DI + + + + + ALG + +++ Sbjct: 94 MEGQRTWEAMLYTQLRLGDIVLGEMAWAATKAALAGAGIGVVAAALGYTQWLS----LLY 149 Query: 148 AWVMIIWFG---ASLGFIIGALSEKTELVEKLWHPVTYLMFPLSGAIFMVDWLSPAFQKI 204 A +I G ASLG ++ AL+ + V + LSGA+F VD L FQ Sbjct: 150 ALPVIALTGLAFASLGMVVTALAPSYDYFIFYQTLVITPILFLSGAVFPVDQLPIVFQTA 209 Query: 205 VLWLPMVHGVEMLREGYFG 223 +LP+ H ++++R G Sbjct: 210 ARFLPLSHSIDLIRPIMLG 228
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 135 bits (342), Expect = 3e-37 Identities = 81/382 (21%), Positives = 138/382 (36%), Gaps = 71/382 (18%) Query: 5 IGIDLGTTNSCVAIMEGNQVKVIENSEGARTTPSIIAYMDDNEVL-VGAPAKRQSVTNPK 63 + IDLGT N+ + + V + R V VG AK+ P Sbjct: 13 LSIDLGTANTLIYVKGQGIVLNEPSVVAIRQD----RAGSPKSVAAVGHDAKQMLGRTPG 68 Query: 64 NTLFAVKRLIGRRFEEKEVQKDIGLMPYAIIKADNGDAWVEAHGEKLAPPQVSAEVLRK- 122 N + A++ + + V D V+ ++L+ Sbjct: 69 N-IAAIRPM------KDGVIADF---------------------------FVTEKMLQHF 94 Query: 123 MKKTAEDYLGEPVTEAVITVPAYFNDSQRQATKDAGRIAGLEVKRIINEPTAAALAFGLD 182 +K+ + P ++ VP +R+A +++ + AG +I EP AAA+ GL Sbjct: 95 IKQVHSNSFMRPSPRVLVCVPVGATQVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLP 154 Query: 183 KAEKGDRKIAVYDLGGGTFDVSIIEIADVDGEMQFEVLSTNGDTFLGGEDFDQRIIDYII 242 +E V D+GGGT +V++I + V + +GG+ FD+ II+Y+ Sbjct: 155 VSE--ATGSMVVDIGGGTTEVAVISLNGV---------VYSSSVRIGGDRFDEAIINYVR 203 Query: 243 GEFKKEQGVDLSKDVLALQRLKEAAEKAKIELSSS----QQTEINLPYITADASGPKHLN 298 + G + AE+ K E+ S+ + EI + P+ Sbjct: 204 RNYGSLIG-------------EATAERIKHEIGSAYPGDEVREIEVRGRNLAEGVPRGFT 250 Query: 299 LKVTRAKLEALVEDLVERTIEPCRTAIKDAGVKVSDIDD--VILVGGQTRMPKVQEKVKE 356 L + LEAL E L + SDI + ++L GG + + + E Sbjct: 251 LN-SNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGALLRNLDRLLME 309 Query: 357 FFGKEPRRDVNPDEAVAVGAAI 378 G +P VA G Sbjct: 310 ETGIPVVVAEDPLTCVARGGGK 331
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 31.2 bits (70), Expect = 0.002 Identities = 16/77 (20%), Positives = 24/77 (31%), Gaps = 8/77 (10%) Query: 2 ENTQENPTDQTTEETGREAQAAEPAAQAAENAAPAAEAA--------LAEAQAKIAELQE 53 T E TE T + + A+ A + E A + K E Sbjct: 1048 SKTVEKNEQDATETTAQNREVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVE 1107 Query: 54 SFLRAKAETENVRRRAQ 70 +AK ETE + + Sbjct: 1108 KEEKAKVETEKTQEVPK 1124
>cloacin#Cloacin signature. Length = 551 Score = 32.0 bits (72), Expect = 0.003 Identities = 16/39 (41%), Positives = 22/39 (56%), Gaps = 2/39 (5%) Query: 241 GGGMGARVGGPFIGGRGGRGGGNDGFRGGGGGFGGGGAS 279 GGG G+ + + GG G GG +G GGG G GG ++ Sbjct: 47 GGGSGSGIH--WGGGSGHGNGGGNGNSGGGSGTGGNLSA 83
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 48.7 bits (116), Expect = 3e-08 Identities = 34/160 (21%), Positives = 51/160 (31%), Gaps = 34/160 (21%) Query: 216 LDRTGRAQTHIVLASPETGVVSELNVR-DGAMVTPGQTLAKIAGLS-TLWAVIDVPEALA 273 L + Q V+ +P + V +L V +G +VT +TL I TL V Sbjct: 318 LAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALVQNKDI 377 Query: 274 SGVRPGMRVDATFEGDPQRR---VSGAIREILPG------VNATTRTLQARLE------L 318 + G E P R + G ++ I + + + E Sbjct: 378 GFINVGQNAIIKVEAFPYTRYGYLVGKVKNINLDAIEDQRLGLVFNVIISIEENCLSTGN 437 Query: 319 DNRALTPGMLMRARVGASHAASRLVVPSDAVIATGKRSVV 358 N L+ GM A I TG RSV+ Sbjct: 438 KNIPLSSGM-----------------AVTAEIKTGMRSVI 460 Score = 35.2 bits (81), Expect = 5e-04 Identities = 13/58 (22%), Positives = 23/58 (39%), Gaps = 7/58 (12%) Query: 211 SVIANLDRTGRAQTHIV-------LASPETGVVSELNVRDGAMVTPGQTLAKIAGLST 261 SV+ ++ A + + E +V E+ V++G V G L K+ L Sbjct: 75 SVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGA 132
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 665 bits (1718), Expect = 0.0 Identities = 219/1055 (20%), Positives = 437/1055 (41%), Gaps = 48/1055 (4%) Query: 7 RWSIRNRLLVLLATALVAAWGVVSLNRTPLDALPDLSDTQVIVKASYPGKAPRVVEDQVT 66 + IR + + ++ G +++ + P+ P ++ V V A+YPG + V+D VT Sbjct: 3 NFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVT 62 Query: 67 YPLTTTLLGVPGAKTIRAYS-SFGDAFVYVLFDDRTDQYWARSRVLEYLNQVQGRLPQGA 125 + + G+ + + S S G + + F TD A+ +V L LPQ Sbjct: 63 QVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEV 122 Query: 126 -SVALGPDATGVGWVYEYALVDRSGRRDLGELRALNDWFLKFELKAVPDVAEVASVGGMV 184 + + + ++ V + ++ +K L + V +V G Sbjct: 123 QQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQ- 181 Query: 185 RQYQVVLDPDRLRAFGITQAAVVDALGKANSESGG------SVVEMAESEYMVRASGYLR 238 ++ LD D L + +T V++ L N + + + + A + Sbjct: 182 YAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFK 241 Query: 239 SLDDFRNVVLRTSESGTPVLLGDVARVQIGPEMRRGIAELNGEGEVAGGVIVMRSGKNAL 298 + ++F V LR + G+ V L DVARV++G E IA +NG+ AG I + +G NAL Sbjct: 242 NPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGK-PAAGLGIKLATGANAL 300 Query: 299 STIEAVKAKLAELRRSLPAGVELVTTYDRSQLIGRAVDNLKDKLIEEFVVVGLVCALFLF 358 T +A+KAKLAEL+ P G++++ YD + + ++ + L E ++V LV LFL Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 359 HLRSAFVAILSLPLGVLAAFIVMRHQGVNANLMSLGGIAIAIGAMIDAAVVMIENAHKHL 418 ++R+ + +++P+ +L F ++ G + N +++ G+ +AIG ++D A+V++EN + + Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 419 ESHEHAHPGAPLSSAARWELIAASAAEVGPALFFSLLIVTLSFVPVFALEGQEGKLFAPL 478 + E S +++ AL ++++ F+P+ G G ++ Sbjct: 421 MEDK----------LPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQF 470 Query: 479 AFTKTYTIAAAAGLSVTLVPVLMGYLIRGRIPREASNP------LNRL---LVRLYRPLL 529 + T +A + +++ L P L L++ N N V Y + Sbjct: 471 SITIVSAMALSVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSV 530 Query: 530 EATLARPWRAIAIAAAALVLTAIPMSRLGGEFMPPLDEGDLLYMPTALPGISAQKAAELL 589 L 
R + I A + + RL F+P D+G L M G + ++ ++L Sbjct: 531 GKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVL 590 Query: 590 QQTDRLIKT--VPEVATVFGKSGRADTATDPAPLEMFETTIRFRPRGEW-RPGMTPGRLV 646 Q V +VF +G + + +P E + ++ Sbjct: 591 DQVTDYYLKNEKANVESVFTVNGF---SFSGQAQNAGMAFVSLKPWEERNGDENSAEAVI 647 Query: 647 DELDRVVKVPGLSNVWVPPIRNRLDMLSTGIKTPVGVKIAGPELAQIDRIAAQVEAAVKR 706 + V + +++ + + AG + + Q+ + Sbjct: 648 HRAKMELGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQ 707 Query: 707 VPG-VTSALAERLNGGRYVDVDIDRRAAARYGLSVGDVQAVVASAIGGENVGEVIAGRER 765 P + S L +++D+ A G+S+ D+ +++A+GG V + I Sbjct: 708 HPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRV 767 Query: 766 FPINIRYPREVRDSLEKLRALPIVTERGAQILLRDVAAVTIADGPPMIRSENARLSGYVY 825 + ++ + R E + L + + G + G P + N S + Sbjct: 768 KKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQ 827 Query: 826 VDIR-GVDLKTAVGAMQRAVAQQVALPPGYSIAWSGQFEYLERAAATLRTVIPVTLAVIF 884 + G A+ M+ ++ LP G W+G + ++ ++ V+F Sbjct: 828 GEAAPGTSSGDAMALMENLASK---LPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVF 884 Query: 885 VLLFLTFDSAADALLLMTTVPFALVGGLWFVWALGHAVSVATAVGFIALAGVAAEFGVVM 944 + L ++S + + +M VP +VG L V VG + G++A+ +++ Sbjct: 885 LCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILI 944 Query: 945 LLYLKRAYERRIAAGEPPNEATLADAIREGAVLRVRPKAMTVAVVLAGLVPIMIGHGSGS 1004 + + K E+ G+ EATL +R+RP MT + G++P+ I +G+GS Sbjct: 945 VEFAKDLMEKE---GKGVVEATL-----MAVRMRLRPILMTSLAFILGVLPLAISNGAGS 996 Query: 1005 EVMQRIAAPMVGGMVTAPLLSMFVIPAAWLLLQRR 1039 + ++GGMV+A LL++F +P +++++R Sbjct: 997 GAQNAVGIGVMGGMVSATLLAIFFVPVFFVVIRRC 1031
>SSBTLNINHBTR#Streptomyces subtilisin inhibitor signature. Length = 144 Score = 29.0 bits (64), Expect = 0.021 Identities = 21/44 (47%), Positives = 23/44 (52%), Gaps = 3/44 (6%) Query: 21 VLHPLAGRPLLSHVIDTARALAPSRLVVVIGHGAEQVRAAVAAP 64 V PLAG L S A APS LV+ +GHG AA AAP Sbjct: 18 VCGPLAGASLASPATAPASLYAPSALVLTVGHGES---AATAAP 58
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 28.1 bits (62), Expect = 0.043 Identities = 17/136 (12%), Positives = 34/136 (25%), Gaps = 8/136 (5%) Query: 2 GTTIRDVAQAANVSIGTVSRALKNQPGLSEATRARIVE-----IAHRMNYDPTQLRPRIK 56 T++ ++A+AA V+ G + K++ L P ++ Sbjct: 31 STSLGEIAKAAGVTRGAIYWHFKDKSDLFSEIWELSESNIGELELEYQAKFPGDPLSVLR 90 Query: 57 -RLTFLLHRQHNNFATTPFFSHVLHGVEDACRERGIVPSLLTTGPTDDVIRQMRPHAPDA 115 L +L + H E E +V + ++ Sbjct: 91 EILIHVLESTVTEERRRLLMEIIFHKCEFV-GEMAVVQQAQRNLCL-ESYDRIEQTLKHC 148 Query: 116 IAVAGFMEPETLEALA 131 I A Sbjct: 149 IEAKMLPADLMTRRAA 164
>PF04647#Accessory gene regulator B Length = 212 Score = 29.0 bits (65), Expect = 0.007 Identities = 5/41 (12%), Positives = 15/41 (36%) Query: 108 AIVALAGFAISVFTTPFKGMLIIAAALIALFLFILYRPAAT 148 + + + + + +LI+ A + +L + P Sbjct: 86 LVFNVLAYIAHLIDPAYFQLLILIAFITSLLALLFLVPVDN 126
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 56.2 bits (135), Expect = 1e-11 Identities = 51/186 (27%), Positives = 79/186 (42%), Gaps = 16/186 (8%) Query: 7 VVLVTGANRGLGLAFVEGLKAAGAK------------KIYAAARDPARVTTPGVQPVRLD 54 + +TGA +G+G A L + GA K+ ++ + AR VR Sbjct: 10 IAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVRDS 69 Query: 55 VTRAQDIAAAARELRDVNLLVNNAGIFRMGSLLAEADGGGLQAQLDTNFFGPLAMARAFA 114 + A RE+ +++LVN AG+ R G L+ +A N G +R+ + Sbjct: 70 AAIDEITARIEREMGPIDILVNVAGVLRPG-LIHSLSDEEWEATFSVNSTGVFNASRSVS 128 Query: 115 PVLRENGGGAIINVLS-WLGLPNT--GAYGISKAAAWAATNAIRNELREQRTRVLALHSA 171 + + G+I+ V S G+P T AY SKAAA T + EL E R + Sbjct: 129 KYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSPG 188 Query: 172 YIDTDM 177 +TDM Sbjct: 189 STETDM 194
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 73.2 bits (179), Expect = 2e-17 Identities = 50/188 (26%), Positives = 81/188 (43%), Gaps = 10/188 (5%) Query: 9 VFITGASSGLGLALAAEYARHGATLGLVARRADALAEFAP------RFPKASISIYPADV 62 FITGA+ G+G A+A A GA + V + L + R +A +PADV Sbjct: 11 AFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEA----FPADV 66 Query: 63 RDADALALAASRFVAAHGCPDVVIANAGISKGAITGEGDLAAFREIMDVNYYGMIATFEP 122 RD+ A+ +R G D+++ AG+ + + + VN G+ Sbjct: 67 RDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRS 126 Query: 123 FIAPMTAARRGTLVGIASVAGVRGLPGSGAYSASKAAAIKYLEALRVELRPAQVAVVTIA 182 M R G++V + S AY++SKAAA+ + + L +EL + ++ Sbjct: 127 VSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVS 186 Query: 183 PGYIRTPM 190 PG T M Sbjct: 187 PGSTETDM 194
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 32.7 bits (74), Expect = 0.002 Identities = 22/173 (12%), Positives = 50/173 (28%), Gaps = 6/173 (3%) Query: 58 ASQPQQFDPNRALQGKTPGQPVTPQAAQPAPPNTAPGQAANPSQPPLLPEPQIVEVPSSN 117 A ++ DP ++ T QPA ++ + + +VE P + Sbjct: 1143 AEPARENDPTVNIKEPQSQTNTTADTEQPAKETSSNVEQPVTESTTVNTGNSVVENPENT 1202 Query: 118 NNGNGSPSASNNAAD-----NGVAVAPKPAEPAPPPAKKPQTAANGSSAPHVANNNAQAS 172 P+ ++ +++ + +V P P + N NA S Sbjct: 1203 TPATTQPTVNSESSNKPKNRHRRSVRSVPHNVEPATTSSNDRSTVALCDLTSTNTNAVLS 1262 Query: 173 AAATPPKAAQAPKGASSATTTAAKPTSGADANTGYFLQVGAYKTEADAEQQRA 225 A + G + + + + ++ + + Q R Sbjct: 1263 DARAKAQFVALNVGKAVSQHISQLEMNNEGQYN-VWVSNTSMNKNYSSSQYRR 1314
>MALTOSEBP#Maltose binding protein signature. Length = 396 Score = 29.7 bits (66), Expect = 0.018 Identities = 17/49 (34%), Positives = 27/49 (55%) Query: 44 FEKETGIKVRLDVYDSNEALQTKLTTGNSGYDLVFPSNDFLARQIQAGL 92 FEK+TGIKV ++ D E ++ G D++F ++D Q+GL Sbjct: 53 FEKDTGIKVTVEHPDKLEEKFPQVAATGDGPDIIFWAHDRFGGYAQSGL 101
>YERSSTKINASE#Yersinia serine/threonine protein kinase signature. Length = 732 Score = 34.3 bits (78), Expect = 0.004 Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 2/42 (4%) Query: 149 QVLDGLAHAHANGVVHRDLKPQNVMVTTRDGEPCAKILDFGI 190 ++LD H GVVH D+KP NV+ GEP ++D G+ Sbjct: 253 RLLDVTNHLAKAGVVHNDIKPGNVVFDRASGEPV--VIDLGL 292
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.3 bits (65), Expect = 0.016 Identities = 21/68 (30%), Positives = 28/68 (41%), Gaps = 1/68 (1%) Query: 58 CVALTGPSGAGKSTLLRCLYGNYLANRGTIAVRVGTRAAEHVV-LTASEPHEVIALRRDV 116 V L G G GKSTL+ L G + + G + E + + A E E+ A RR Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHFDIGTGKDSYEQIAGIVAYELSEMTAFRRAD 657 Query: 117 IGYVSQFL 124 V F Sbjct: 658 AEAVKAFF 665
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 25.2 bits (55), Expect = 0.040 Identities = 9/38 (23%), Positives = 21/38 (55%), Gaps = 1/38 (2%) Query: 14 IEIDDVIVGLLAI-RLNLPENADPRDAISRHLSEAGGP 50 + +DD IV + + R+ + + P++A + +S+ G Sbjct: 404 LLVDDAIVVVENVERVMMEDKLPPKEATEKSMSQIQGA 441
>PF05272#Virulence-associated E family protein Length = 892 Score = 32.4 bits (73), Expect = 0.002 Identities = 17/53 (32%), Positives = 24/53 (45%), Gaps = 5/53 (9%) Query: 29 VVVVCGPSGSGKSTLIKTVNGLEPFQQGEILVNGQSVGDKKTNLSKLRSKVGM 81 VV+ G G GKSTLI T+ GL+ F +G K + ++ V Sbjct: 598 SVVLEGTGGIGKSTLINTLVGLDFFSDTHF-----DIGTGKDSYEQIAGIVAY 645
>BCTERIALGSPD#Bacterial general secretion pathway protein D signature. Length = 660 Score = 403 bits (1037), Expect = e-133 Identities = 215/691 (31%), Positives = 325/691 (47%), Gaps = 88/691 (12%) Query: 13 TALVVAGIVAAQAAHAQVTLNFVNADIDQVAKAIGAATGKTIIVDPRVKGQLNLVAERPV 72 T L+ A ++ AA + + +F DI + + KT+I+DP V+G + + + + Sbjct: 13 TLLIFAALLFRPAAAEEFSASFKGTDIQEFINTVSKNLNKTVIIDPSVRGTITVRSYDML 72 Query: 73 PEDQALKTLQSALRMQGFALV-QDHGVLKVVPEADAKLQGVPTYIGNAPQVRGDQVVTQV 131 E+Q + S L + GFA++ ++GVLKVV DAK VP AP + GD+VVT+V Sbjct: 73 NEEQYYQFFLSVLDVYGFAVINMNNGVLKVVRSKDAKTAAVPVASDAAPGI-GDEVVTRV 131 Query: 132 FELRNESANNLLPVLRPLI--SPNNTITAYPANNTIVVTDYADNVRRIAQIIAGVDSAAG 189 L N +A +L P+LR L + ++ Y +N +++T A ++R+ I+ VD+A Sbjct: 132 VPLTNVAARDLAPLLRQLNDNAGVGSVVHYEPSNVLLMTGRAAVIKRLLTIVERVDNAGD 191 Query: 190 SQVAVVPLKNANAIDIAAQLTKLLDPGAIGNTDATLKVTVQADPRTNALLLRASNAQRLA 249 V VPL A+A D+ +T+L + ++ V AD RTNA+L+ R Sbjct: 192 RSVVTVPLSWASAADVVKLVTELNKDTSKSALPGSMVANVVADERTNAVLVSGEPNSR-Q 250 Query: 250 AAKKIAQQLDAPSGVPGNMHVVPLRNAEAVKLAKTLRGMLGKGGGESGSSASSNDANAFN 309 + +QLD GN V+ L+ A+A L + L G+ Sbjct: 251 RIIAMIKQLDRQQATQGNTKVIYLKYAKASDLVEVLTGIS-------------------- 290 Query: 310 QGGSQSGSNFSTGASGTPPLPSGLSSNSSGGAGGTTGGGGLGNAGLLGGDKDKGDDNQPG 369 S + S + Sbjct: 291 ---------------------STMQSEKQAAKPVAALDKNI------------------- 310 Query: 370 GMIQADAASNSLIITASDPVYRNLRAVIDQLDSRRAQVYIEALVVELQATTSANLGIQWQ 429 +I+A +N+LI+TA+ V +L VI QLD RR QV +EA++ E+Q NLGIQW Sbjct: 311 -IIKAHGQTNALIVTAAPDVMNDLERVIAQLDIRRPQVLVEAIIAEVQDADGLNLGIQWA 369 Query: 430 VANNALYAGTNLVTGQTGLGNSIVNLTAGAVT--NPGGTLGSLG---SITNGLNIGWLHN 484 N +T T G I AGA G SL S NG+ G Sbjct: 370 NKNAG-------MTQFTNSGLPISTAIAGANQYNKDGTVSSSLASALSSFNGIAAG---- 418 Query: 485 MFGVQGLGALLQFFAGSSDANVLSTPNLVTLDNEEAKIVVGQNVPIPTGSYSNLTSGTTA 544 F LL + S+ ++L+TP++VTLDN EA VGQ VP+ TGS + + Sbjct: 419 -FYQGNWAMLLTALSSSTKNDILATPSIVTLDNMEATFNVGQEVPVLTGS----QTTSGD 473 Query: 545 NAFNTYDRRDVGLTLHVKPQITEGGILKLQLYTEDSAVVPGTNTTSANSPGPTFTKRSIQ 604 N FNT +R+ VG+ L VKPQI EG 
+ L++ E S+V +++++ G TF R++ Sbjct: 474 NIFNTVERKTVGIKLKVKPQINEGDSVLLEIEQEVSSV-ADAASSTSSDLGATFNTRTVN 532 Query: 605 STVLADNGEIIVLGGLMQDNYQVSNTKVPLLGDIPWIGQLFRSEGKTRQKTNLMVFLRPV 664 + VL +GE +V+GGL+ + + KVPLLGDIP IG LFRS K K NLM+F+RP Sbjct: 533 NAVLVGSGETVVVGGLLDKSVSDTADKVPLLGDIPVIGALFRSTSKKVSKRNLMLFIRPT 592 Query: 665 IINDRETAQAVTSNRYDYIQGVTGAYKSDNN 695 +I DR+ + +S +Y + N Sbjct: 593 VIRDRDEYRQASSGQYTAFNDAQSKQRGKEN 623
>BCTERIALGSPF#Bacterial general secretion pathway protein F signature. Length = 408 Score = 382 bits (982), Expect = e-133 Identities = 174/406 (42%), Positives = 266/406 (65%), Gaps = 2/406 (0%) Query: 1 MPAFRFEAIDASGRAQKGVIEADSARNARGQLRTQGLTPLVVEPAASAQRGARSQRLALG 60 M + ++A+DA G+ +G EADSAR AR LR +GL PL V+ Q+ + S L+L Sbjct: 1 MAQYHYQALDAQGKKCRGTQEADSARQARQLLRERGLVPLSVDENRGDQQKSGSTGLSLR 60 Query: 61 R--KLSQREQAILTRQLASLLVAGLPLDEALAVLTEQAERDYIRELMAAIRAEVLGGHSL 118 R +LS + A+LTRQLA+L+ A +PL+EAL + +Q+E+ ++ +LMAA+R++V+ GHSL Sbjct: 61 RKIRLSTSDLALLTRQLATLVAASMPLEEALDAVAKQSEKPHLSQLMAAVRSKVMEGHSL 120 Query: 119 ANALTQHPRDFPEIYRALVAAGEHTGKLGIVLSRLADYIEERNALKQKILLAFTYPAIVT 178 A+A+ P F +Y A+VAAGE +G L VL+RLADY E+R ++ +I A YP ++T Sbjct: 121 ADAMKCFPGSFERLYCAMVAAGETSGHLDAVLNRLADYTEQRQQMRSRIQQAMIYPCVLT 180 Query: 179 VIAFGIVTFLLSYVVPQVVNVFASTKQQLPVLTIVMMALSDFVRHWWWAILIGIAAVVYL 238 V+A +V+ LLS VVP+VV F KQ LP+ T V+M +SD VR + +L+ + A Sbjct: 181 VVAIAVVSILLSVVVPKVVEQFIHMKQALPLSTRVLMGMSDAVRTFGPWMLLALLAGFMA 240 Query: 239 VKATLSRDGPRLAFDRWLLTAPLAGKLVRGYNTVRFASTLGILTAAGVPILRALQAAGET 298 + L ++ R++F R LL PL G++ RG NT R+A TL IL A+ VP+L+A++ +G+ Sbjct: 241 FRVMLRQEKRRVSFHRRLLHLPLIGRIARGLNTARYARTLSILNASAVPLLQAMRISGDV 300 Query: 299 LSNRAMRGNIDDAIVRVREGSALSRALNNVKTFPPVLVHLIRSGEATGDVTTMLDRAAEG 358 +SN R + A VREG +L +AL FPP++ H+I SGE +G++ +ML+RAA+ Sbjct: 301 MSNDYARHRLSLATDAVREGVSLHKALEQTALFPPMMRHMIASGERSGELDSMLERAADN 360 Query: 359 ESRELERRTMFLTSLLEPLLILAMGGIVLVIVLAVMLPIIELNNMV 404 + RE + L EPLL+++M +VL IVLA++ PI++LN ++ Sbjct: 361 QDREFSSQMTLALGLFEPLLVVSMAAVVLFIVLAILQPILQLNTLM 406
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 188 bits (480), Expect = 6e-65 Identities = 67/140 (47%), Positives = 94/140 (67%), Gaps = 3/140 (2%) Query: 10 QAARRQRGFTLIEIMVVVAILGILAALIVPKIMSRPDEARRIAAKQDIGTIMQALKLYRL 69 +A +QRGFTL+EIMVV+ I+G+LA+L+VP +M ++A + A DI + AL +Y+L Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQKAVSDIVALENALDMYKL 61 Query: 70 DNGRYPTQDQGLNALIQKPTTDPIPNNWKDGGYLERLPNDPWGNSYKYLNPGVHGEIDVF 129 DN YPT +QGL +L++ PT P+ N+ GY++RLP DPWGN Y +NPG HG D+ Sbjct: 62 DNHHYPTTNQGLESLVEAPTLPPLAANYNKEGYIKRLPADPWGNDYVLVNPGEHGAYDLL 121 Query: 130 SYGADGKEGGESNDSDIGSW 149 S G DG+ G E DI +W Sbjct: 122 SAGPDGEMGTE---DDITNW 138
>BCTERIALGSPH#Bacterial general secretion pathway protein H signature. Length = 170 Score = 51.5 bits (123), Expect = 1e-10 Identities = 20/101 (19%), Positives = 33/101 (32%), Gaps = 15/101 (14%) Query: 51 RARGFTLLEMLVVLVIAGILVSVASLTLRRNPRTDLREEAQRIALLFETAGDEAQVRARP 110 R RGFTLLEM+++L++ G+ + L + + R + Sbjct: 2 RQRGFTLLEMMLILLLMGVSAGMVLLAFPASRDDSAAQTLARFEAQLRFVQQRGLQTGQF 61 Query: 111 IAWRATEHGFRF---------------DIRTGDGWRPLRDD 136 ++F D +G W PLR Sbjct: 62 FGVSVHPDRWQFLVLEARDGADPAPADDGWSGYRWLPLRAG 102
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 30.2 bits (68), Expect = 0.001 Identities = 10/26 (38%), Positives = 18/26 (69%) Query: 10 RSPARSRGFTMIEVLVALAIIAVALA 35 R+ + RGFT++E++V + II V + Sbjct: 2 RATDKQRGFTLLEIMVVIVIIGVLAS 27
>BCTERIALGSPG#Bacterial general secretion pathway protein G signature. Length = 145 Score = 33.7 bits (77), Expect = 3e-04 Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 3/72 (4%) Query: 33 RGFTLIEMMIAITILAVIA-ILSWRGLDQIIRGREKVAAAMEDERVFAQMFDQMRIDARR 91 RGFTL+E+M+ I I+ V+A ++ + + ++ A+ D D ++D Sbjct: 8 RGFTLLEIMVVIVIIGVLASLVVPNLMGNKEKADKQK--AVSDIVALENALDMYKLDNHH 65 Query: 92 AATDDEAGQPAV 103 T ++ + V Sbjct: 66 YPTTNQGLESLV 77
>FLGMOTORFLIM#Flagellar motor switch protein FliM signature. Length = 344 Score = 274 bits (703), Expect = 4e-93 Identities = 82/324 (25%), Positives = 158/324 (48%), Gaps = 10/324 (3%) Query: 5 EFMSQEEVDALLKGVTGEDDSADEPAEASG---IRPYNIATQERIVRGRMPGLEIINDRF 61 E +SQ+E+D LL ++ D S ++ S I Y+ ++ + +M L ++++ F Sbjct: 3 EVLSQDEIDQLLTAISSGDASIEDARPISDTRKITLYDFRRPDKFSKEQMRTLSLMHETF 62 Query: 62 ARLLRIGIFNFMRRTAEISVSQVKVQKYSEFTRNLPIPTNLNLVHVKPLRGTSLFVFDPN 121 ARL + +R + V+ V Y EF R++P P+ L ++ + PL+G ++ DP+ Sbjct: 63 ARLTTTSLSAQLRSMVHVHVASVDQLTYEEFIRSIPTPSTLAVITMDPLKGNAVLEVDPS 122 Query: 122 LVFFVVDNLFGGDGRFHTRVEGRDFTATEQRIIGKLLNLVFEHYASAWKSVRPLQFEFVR 181 + F ++D LFGG G+ RD T E ++ ++ + + +W V L+ + Sbjct: 123 ITFSIIDRLFGGTGQAAKVQ--RDLTDIENSVMEGVIVRILANVRESWTQVIDLRPRLGQ 180 Query: 182 SEMHTQFANVATPNEIVIATQFSIEFGPTGGTLHICMPYSMIEPIRDVLSSPIQGEAL-- 239 E + QFA + P+E+V+ + G G ++ C+PY IEPI LSS ++ Sbjct: 181 IETNPQFAQIVPPSEMVVLVTLETKVGEEEGMMNFCIPYITIEPIISKLSSQFWFSSVRR 240 Query: 240 EVDRRWVRVLSQQVQSAEVELVADLAEVPTTFEKILNLRTGDVLPLD---ITDSITAKVD 296 +++ VL ++ + ++++VA++ + + IL LR GD++ L + D + Sbjct: 241 SSTTQYMGVLRDKLSTVDMDVVAEVGSLRLSVRDILGLRVGDIIRLHDTHVGDPFVLSIG 300 Query: 297 GVPVMECGYGIFNGQYALRVQRMI 320 C G+ + A ++ I Sbjct: 301 NRKKFLCQPGVVGKKIAAQILERI 324
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 32.0 bits (73), Expect = 0.004 Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 3/67 (4%) Query: 144 AGQPGDAPFAPPTLVGDLGGGALYLAMGVLAGIVDAR-LRGKGQIVDAAIVDGSANLMNL 202 AG P ++V D+GGG +A+ L G+V + +R G D AI++ Sbjct: 151 AGLPVSEATG--SMVVDIGGGTTEVAVISLNGVVYSSSVRIGGDRFDEAIINYVRRNYGS 208 Query: 203 LLSIHAA 209 L+ A Sbjct: 209 LIGEATA 215
>UREASE#Urea amidohydrolase (urease) protein signature. Length = 570 Score = 29.7 bits (67), Expect = 0.004 Identities = 16/44 (36%), Positives = 21/44 (47%), Gaps = 5/44 (11%) Query: 106 GGILVYDQFVTP----PTPQPVRQRRLRWGAHGRSNNGDNFYVV 145 GG + P PTPQPV R + +GA+GRS + V Sbjct: 452 GGTIAAAPMGDPNASIPTPQPVHYRPM-FGAYGRSRTNSSVTFV 494
>OMPADOMAIN#OMPA domain signature. Length = 346 Score = 68.0 bits (166), Expect = 1e-14 Identities = 27/103 (26%), Positives = 46/103 (44%), Gaps = 2/103 (1%) Query: 391 QADGGAAANAASGAAAQTQAQAPALPAAIYFETGKSELPADAKDAIAAAAEYVKAH--PD 448 Q + A A + Q + L + + F K+ L + + A+ + D Sbjct: 193 QGEAAPVVAPAPAPAPEVQTKHFTLKSDVLFNFNKATLKPEGQAALDQLYSQLSNLDPKD 252 Query: 449 AKLALSGFTDKTGSADANAELAKRRAQVVRDALKTAGVAEDRI 491 + + G+TD+ GS N L++RRAQ V D L + G+ D+I Sbjct: 253 GSVVVLGYTDRIGSDAYNQGLSERRAQSVVDYLISKGIPADKI 295
>cloacin#Cloacin signature. Length = 551 Score = 30.8 bits (69), Expect = 0.023 Identities = 31/120 (25%), Positives = 48/120 (40%), Gaps = 8/120 (6%) Query: 176 VVVDGAAPAVLRYDDTDDELRYVETLPADAQNNSPGNAPP--AAAQPVANRALPSVKRQR 233 V + G P+ + DD + + V +LPAD SP ++ P A V R + VK +R Sbjct: 134 VALYGVLPSQIAKDDPNMMSKIVTSLPADDITESPVSSLPLDKATVNVNVRVVDDVKDER 193 Query: 234 ALPGALDLRGVELTLPELPSAQVAALRERAGTLGLDGARVPVWGVVAPRRLPADIAVPGG 293 + GV +++P + A ER G PV + PA + G Sbjct: 194 QNISVVS--GVPMSVPVVD----AKPTERPGVFTASIPGAPVLNISVNNSTPAVQTLSPG 247
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 513 bits (1323), Expect = e-175 Identities = 194/567 (34%), Positives = 312/567 (55%), Gaps = 7/567 (1%) Query: 300 PNTLAGVCAAPGIAVGTLVRWDDAQIVPPELASGTPAAESRLLDRALAEVDAQLETTVRE 359 + + G+ A+ G+A+ + + + + + E L AL + +L + Sbjct: 2 HHKITGIAASSGVAIAKAFIHLEPNVDIEKTSITDVSTEIEKLTAALEKSKEELRAIKDQ 61 Query: 360 ASRRGAIGEAGIFAVHRVLLEDPALVDAARDLI-SLGKSAGYAWRETIRAQTAVLADVDD 418 +A IFA H ++L+DP LVD + I + +A YA +E ++ +D+ Sbjct: 62 TEASMGADKAEIFAAHLLVLDDPELVDGIKGKIENEQMNAEYALKEVSDMFVSMFESMDN 121 Query: 419 TLLAERAADLRDIDKRVLRAL-GYASASARELPAEAVLAAEEFTPSDLASLDRERVAALV 477 + ERAAD+RD+ KRVL L G + S + E V+ AE+ TPSD A L+++ V Sbjct: 122 EYMKERAADIRDVSKRVLGHLIGVETGSLATIAEETVIIAEDLTPSDTAQLNKQFVKGFA 181 Query: 478 MARGGATSHAAIIARQLGIPALVAVGDALYAIAQRTQVVVDASAGRLEYAPSALDVERAR 537 GG TSH+AI++R L IPA+V + I V+VD G + P+ +V+ Sbjct: 182 TDIGGRTSHSAIMSRSLEIPAVVGTKEVTEKIQHGDMVIVDGIEGIVIVNPTEEEVKAYE 241 Query: 538 HERQRLAGVREANRRMSGEAALTRDGHRIEVAANIATLDDARVALDNGADAVGLLRTELM 597 +R ++ ++ GE + T+DG +E+AANI T D L NG + +GL RTE + Sbjct: 242 EKRAAFEKQKQEWAKLVGEPSTTKDGAHVELAANIGTPKDVDGVLANGGEGIGLYRTEFL 301 Query: 598 FIHRQAAPTASEHQQSYQSIVDALQGRTAIIRTLDVGADKEVDYLTLPPEPNPALGLRGI 657 ++ R PT E ++Y+ +V + G+ +IRTLD+G DKE+ YL LP E NP LG R I Sbjct: 302 YMDRDQLPTEEEQFEAYKEVVQRMDGKPVVIRTLDIGGDKELSYLQLPKELNPFLGFRAI 361 Query: 658 RLAQVRPDLLDDQLRGLLAVKPYGSVRILLPMVTDVGELVRIRKRIDD-----FARAMGR 712 RL + D+ QLR LL YG+++++ PM+ + EL + + + + + + Sbjct: 362 RLCLEKQDIFRTQLRALLRASTYGNLKVMFPMIATLEELRQAKAIMQEEKDKLLSEGVDV 421 Query: 713 AQAVEVGVMIEVPSAALLADQLAQHADFLSIGTNDLTQYTLAMDRCQADLAAQADGLHPA 772 + ++EVG+M+E+PS A+ A+ A+ DF SIGTNDL QYT+A DR ++ HPA Sbjct: 422 SDSIEVGIMVEIPSTAVAANLFAKEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPYHPA 481 Query: 773 VLRLVDATVRGAEKHGKWVGVCGALGGDPVAVPVLVGLGVTELSVDPVSVPGIKAQVRRL 832 +LRLVD ++ A GKWVG+CG + GD VA+P+L+GLG+ E S+ S+ ++Q+ +L Sbjct: 482 ILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARSQLLKL 541 Query: 833 
DYQLCRQRAQDLLALESAQAVRAASRE 859 + + AQ L L++A+ V ++ Sbjct: 542 SKEELKPFAQKALMLDTAEEVEQLVKK 568
>ABC2TRNSPORT#ABC-2 type transport system membrane protein signature. Length = 262 Score = 31.1 bits (70), Expect = 0.006 Identities = 33/155 (21%), Positives = 59/155 (38%), Gaps = 7/155 (4%) Query: 163 YGEFFATGILIMVFMSIGVVSTA-TTIATLRERNTFKMYVCFPVSRF-VFLASLIVSRVI 220 Y F A G++ M+ T + + T++ + + + L + + Sbjct: 65 YTAFLAAGMVATSAMTAATFETIYAAFGRMEGQRTWEAMLYTQLRLGDIVLGEMAWAATK 124 Query: 221 LMLAASVTLMLAARYLFQVPLPLWSLRALRAIPVVLLGAAMLLSLGTLLASRARSLAAAE 280 LA + ++AA + SL L A+PV+ L SLG ++ + A S Sbjct: 125 AALAGAGIGVVAAALGY---TQWLSL--LYALPVIALTGLAFASLGMVVTALAPSYDYFI 179 Query: 281 AWCNLIYFPLLFFSDLTIPLRAAPHWLRVVLLVLP 315 + L+ P+LF S P+ P + LP Sbjct: 180 FYQTLVITPILFLSGAVFPVDQLPIVFQTAARFLP 214
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 84.9 bits (210), Expect = 3e-20 Identities = 77/368 (20%), Positives = 143/368 (38%), Gaps = 31/368 (8%) Query: 7 RATTSLAAIFALRMLGLFMIMPVFSVYAKTIPGGENVVL-VGIALGAYGVTQSLLYIFYG 65 R + + AL +G+ +IMPV + + +V GI L Y + Q G Sbjct: 5 RPLIVILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLG 64 Query: 66 WASDKFGRKPVIAAGLLIFALGSFVAAFAHDITWIIVGRVIQGM-GAVSSAVLAFIADLT 124 SD+FGR+PV+ L A+ + A A + + +GR++ G+ GA + A+IAD+T Sbjct: 65 ALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADIT 124 Query: 125 SEHNRTKAMAMVGGSIGMSFAVAIVGAPI--VFHWVGMSGLFAIVGALSVAAIGVVLWVV 182 R + + G + G + + F AL+ +++ Sbjct: 125 DGDERARHFGFMSACFGFGM---VAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLL 181 Query: 183 PDAPRPVHVPAPFAEVLHNVELLRLNFGVLVLHATQTALFLVVPRLLVDGGLPVA----- 237 P++ + P E L+ + R G+ V+ A F+ + + G +P A Sbjct: 182 PESHKGERRPLR-REALNPLASFRWARGMTVVAALMAVFFI----MQLVGQVPAALWVIF 236 Query: 238 ----SHWQ-----VYLPVMGL--AFVMMVPAIIVAEKQGRMKPVLLGGIAAILIGQLLLG 286 HW + L G+ + + VA + G + ++L G+ A G +LL Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALML-GMIADGTGYILLA 295 Query: 287 VATHTILIVAAILFVYFLGFNILEASQPSLVSKLAPGSRKGAATGVYNTTQSIGLALGGV 346 AT + + V I + +++S+ R+G G S+ +G + Sbjct: 296 FATRGWMAF--PIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPL 353 Query: 347 VGGVLLKH 354 + + Sbjct: 354 LFTAIYAA 361
>cloacin#Cloacin signature. Length = 551 Score = 46.2 bits (109), Expect = 2e-08 Identities = 26/65 (40%), Positives = 29/65 (44%) Query: 109 GGRGGSGGGGGGGDDGGYGGGGGGYGGGRDMERGGGGGRASGGGGAGARSGGGGGGGGGG 168 GG G G GGG D G+ +GGG GGG G GG SGGG G GG Sbjct: 22 GGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNL 81 Query: 169 GGGAS 173 A+ Sbjct: 82 SAVAA 86 Score = 40.5 bits (94), Expect = 2e-06 Identities = 25/65 (38%), Positives = 27/65 (41%) Query: 109 GGRGGSGGGGGGGDDGGYGGGGGGYGGGRDMERGGGGGRASGGGGAGARSGGGGGGGGGG 168 G + G GG G GGG G G E GG + G G SG G GGG G Sbjct: 11 TGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGN 70 Query: 169 GGGAS 173 GG S Sbjct: 71 SGGGS 75 Score = 38.2 bits (88), Expect = 1e-05 Identities = 25/73 (34%), Positives = 30/73 (41%) Query: 110 GRGGSGGGGGGGDDGGYGGGGGGYGGGRDMERGGGGGRASGGGGAGARSGGGGGGGGGGG 169 GRG + G + G G G GGG G GGG+G+ GGG G G G Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNG 65 Query: 170 GGASRPSAPAGGG 182 GG +G G Sbjct: 66 GGNGNSGGGSGTG 78 Score = 36.2 bits (83), Expect = 6e-05 Identities = 27/79 (34%), Positives = 30/79 (37%), Gaps = 3/79 (3%) Query: 107 MLGGRGGSGGGGGGGDDGGYGGGGGGYGGGRDMERGGGGGRASGGGGAGARSG---GGGG 163 M GG G G G GG G G G G G + G G+ SG GGG Sbjct: 1 MSGGDGRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGS 60 Query: 164 GGGGGGGGASRPSAPAGGG 182 G G GGG + GG Sbjct: 61 GHGNGGGNGNSGGGSGTGG 79 Score = 30.5 bits (68), Expect = 0.005 Identities = 25/72 (34%), Positives = 27/72 (37%) Query: 109 GGRGGSGGGGGGGDDGGYGGGGGGYGGGRDMERGGGGGRASGGGGAGARSGGGGGGGGGG 168 GG GSG GGG G GGG G GGG A G A S G GG Sbjct: 47 GGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAVAAPVAFGFPALSTPGAGGLAVS 106 Query: 169 GGGASRPSAPAG 180 + +A A Sbjct: 107 ISAGALSAAIAD 118
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 26.8 bits (59), Expect = 0.034 Identities = 24/87 (27%), Positives = 45/87 (51%), Gaps = 9/87 (10%) Query: 5 ATIARPYAEALFRVAEGGDISAWSTLVQELAQVAQLPEVLSVASSPKVSRTQ--VAELLL 62 AT + A+A+F+ GGD+S +Q++ + +P L+V ++ RT+ + ELL Sbjct: 28 ATTTKSAADAVFQQLGGGDVSG---AMQDIDLIMDIPVKLTV----ELGRTRMTIKELLR 80 Query: 63 AALKSPLASGAQAKNFVQMLVDNHRIA 89 S +A A + +L++ + IA Sbjct: 81 LTQGSVVALDGLAGEPLDILINGYLIA 107
>OMADHESIN#Yersinia outer membrane adhesin signature. Length = 455 Score = 27.9 bits (61), Expect = 0.044 Identities = 32/107 (29%), Positives = 45/107 (42%), Gaps = 2/107 (1%) Query: 69 LSLSAIAASEAFSFAYAWTCRRHRWPLALAAGLAAWAAAASALARLPATPPAATAVAFAA 128 +S+SA S FS YA+ P A ++ A A L P PP A A Sbjct: 7 ISVSAALISALFSSPYAFADDYDGIPNLTAVQISPNADPALGLE-YPVRPPVPGAGGLNA 65 Query: 129 TCFGQSCLPRGATLAPRAPLSHADLAGRLAAGAALALAVTSLAGALG 175 + G + GAT + A AG +A G ++A+ L+ ALG Sbjct: 66 SAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVN-SVAIGPLSKALG 111
>SURFACELAYER#Lactobacillus surface layer protein signature. Length = 439 Score = 26.2 bits (57), Expect = 0.035 Identities = 22/68 (32%), Positives = 30/68 (44%), Gaps = 3/68 (4%) Query: 5 KRVKRTMSAAAAAMAVVSCAMAAAPAAHADAGDGLKVARSNACMGCHAVDRKLVGPSFQQ 64 K+ R +SAAAAA+ V+ A A +A A + + VD V PS Sbjct: 2 KKNLRIVSAAAAALLAVAPIAATAMPVNAATTINADSAINANTNAKYDVD---VTPSISA 58 Query: 65 IAERYKND 72 IA K+D Sbjct: 59 IAAVAKSD 66
>SECA#SecA protein signature. Length = 901 Score = 29.1 bits (65), Expect = 0.026 Identities = 18/49 (36%), Positives = 22/49 (44%), Gaps = 4/49 (8%) Query: 198 AAAEVDALRARDATLAGGLP----PVALAAVRAGATLTDTFAAALNALA 242 A+ V +R D L GG+ +A G TLT T A LNAL Sbjct: 74 ASKRVFGMRHFDVQLLGGMVLNERCIAEMRTGEGKTLTATLPAYLNALT 122
>PF03309#Bvg accessory factor Length = 271 Score = 202 bits (516), Expect = 6e-67 Identities = 58/279 (20%), Positives = 102/279 (36%), Gaps = 47/279 (16%) Query: 1 MCLLIDAGNSRIKWALADTARHFVTSGAFEHASDAPDWSTLPAPR------GAWISNVAG 54 M L ID N+ L G+ +HA W P I + G Sbjct: 1 MLLAIDVRNTHTVVGLIS--------GSGDHAKVVQQWRIRTEPEVTADELALTIDGLIG 52 Query: 55 DAAAA---------------RIDALIEARWPALPRTVVRASAAQCGVTNGYAEPARLGSD 99 D A + ++E WP +P ++ G+ P +G+D Sbjct: 53 DDAERLTGASGLSTVPSVLHEVRVMLEQYWPNVPHVLIEPGVR-TGIPLLVDNPKEVGAD 111 Query: 100 RWAGLIGAHAAFADEHLLIATFGTATTLEALRADGHFAGGLIAPGWALMMRSLGMHTAQL 159 R + A+ + +++ FG++ ++ + A G F GG IAPG + + +A L Sbjct: 112 RIVNCLAAYHKYGTAAIVV-DFGSSICVDVVSAKGEFLGGAIAPGVQVSSDAAAARSAAL 170 Query: 160 PTVSIDAATNLLDELAENDAHAPFAIDTPHALSAGCLQAQAGLIE----RAWRDLEKAWQ 215 V + +++ + +T + AG + AGL++ R D++ Sbjct: 171 RRVELTRPRSVIGK------------NTVECMQAGAVFGFAGLVDGLVNRIRDDVDGFSG 218 Query: 216 APVRLVLSGGAADAIVRALTVPHTRHDTLVLTGLALIAH 254 A V +V +G A ++ L L L GL L+ Sbjct: 219 ADVAVVATGHTAPLVLPDLRTVEHYDRHLTLDGLRLVFE 257
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 30.4 bits (68), Expect = 0.002 Identities = 21/80 (26%), Positives = 29/80 (36%), Gaps = 8/80 (10%) Query: 70 PALETAPLNASGAAPAAASDSAPGSPAASAPASAVAPASMPASVAAPAAPA----PSSPP 125 A E A L A A+ + D+ PG+ A A + P AP PS+ Sbjct: 451 QAEELAKLRAGKASDSQTPDAKPGNKAVPGKGQAPQAGTKPNQNKAPMKETKRQLPSTGE 510 Query: 126 AAQP----ARAPILPGASAA 141 A P A ++ A A Sbjct: 511 TANPFFTAAALTVMATAGVA 530
>PF03544#Gram-negative bacterial tonB protein Length = 243 Score = 34.6 bits (79), Expect = 6e-04 Identities = 18/98 (18%), Positives = 28/98 (28%) Query: 55 PVQVELLKPQPIERAPAPEKPAADRPRAAPKRAARASAPPAHAPRASAPVSSAAESSTES 114 P Q P+P+ +P + P+ AP + P P+ V Sbjct: 62 PPQAVQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVEQPKRDVKPV 121 Query: 115 SAESPAAASGTEPASAAGGQAAGATSGAAAGASGASAP 152 + + T PA A ATS + Sbjct: 122 ESRPASPFENTAPARPTSSTATAATSKPVTSVASGPRA 159
>ECOLNEIPORIN#E.coli/Neisseria porin superfamily signature. Length = 331 Score = 88.3 bits (219), Expect = 1e-21 Identities = 90/394 (22%), Positives = 139/394 (35%), Gaps = 71/394 (18%) Query: 1 MKKSLLALVALSAFAGAAHAQSSVTLYGIIDEGFNINTNAGGKHL-----YNLSSGVMQG 55 MKKSL+AL L+A AA A VTLYG I G + + + V G Sbjct: 1 MKKSLIALT-LAALPVAAMAD--VTLYGTIKAGVETSRSVAHNGAQAASVETGTGIVDLG 57 Query: 56 SRWGLRGTEDLGGGLKALFVLENGFDVNSGKLNQGGLEFGRQAYVGLSSGFGTVTLGRQY 115 S+ G +G EDLG GLKA++ +E + G RQ+++GL GFG + +GR Sbjct: 58 SKIGFKGQEDLGNGLKAIWQVEQKASIAGTDSGWGN----RQSFIGLKGGFGKLRVGRLN 113 Query: 116 DSVVDF--VGPLEA-GDQWGGYIAAHPGDLDNFNNAYRVNNAVKFTSANYGGFTFGGLYS 172 + D + P ++ D G A P + + V++ S + G + Y+ Sbjct: 114 SVLKDTGDINPWDSKSDYLGVNKIAEP---EARLIS------VRYDSPEFAGLSGSVQYA 164 Query: 173 FGGVAGDFSRNQTWSLGAGYTNGPLVLGVGYLNARTPSTAGGLFGNNTTSSTPAAVTTPV 232 AG ++++ G Y NG + G R + + Sbjct: 165 LNDNAG-RHNSESYHAGFNYKNGGFFVQYGGAYKRHHQVQENVNIEKYQIHR-------L 216 Query: 233 YAGYASAHTYQVIGAGGAYSFGAATVGITYSNIKFMNFASTVFPNQTATFNNAEINFKYQ 292 +GY + Y A A A + + + T + N + Sbjct: 217 VSGYDNDALY----ASVAVQQQDAKL-VEENYSHNSQTEVAA----TLAYRFG--NVTPR 265 Query: 293 LTPTLLAGAAYDYTQGSKIAGSSAAKYHQGSVGVDYFLSKRTDVYAIGVYQHASGNVIEA 352 ++ ++D T + Y Q VG +Y SKRT + E Sbjct: 266 VSYAHGFKGSFDATNYNND-------YDQVVVGAEYDFSKRTSALVSAGWLQ------EG 312 Query: 353 DGNTVGPATAAINGLTPSSNRNQFAARVGIRHKF 386 G S A VG+RHKF Sbjct: 313 KG---------------ESKFVSTAGGVGLRHKF 331
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 36.3 bits (84), Expect = 1e-04 Identities = 34/129 (26%), Positives = 48/129 (37%), Gaps = 10/129 (7%) Query: 132 GVVPIINENDTVVTDEIKFGDNDTLGALVANLIEGDTLVILTDQPGLFTADPRKDPGATL 191 G VP+I E+ + E D D G +A + D +ILTD G + Sbjct: 195 GGVPVILEDGEIKGVEAVI-DKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQWLR 253 Query: 192 VAEASAGAPELEAMAGGAGSSIGRGGMLTKILAAKRAAHSGANTVIASGRERDVLVRLAA 251 + E AGS M K+LAA R G I + E+ V Sbjct: 254 EVKVEELRKYYEEGHFKAGS------MGPKVLAAIRFIEWGGERAIIAHLEK--AVEALE 305 Query: 252 GEAIGTQLI 260 G+ GTQ++ Sbjct: 306 GKT-GTQVL 313
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 135 bits (341), Expect = 8e-41 Identities = 81/260 (31%), Positives = 130/260 (50%), Gaps = 14/260 (5%) Query: 4 LAGKVAIVTGAGRGIGAAIARAFVREGAAVAIAELDAA---LAEESADAIARDTAGARVL 60 + GK+A +TGA +GIG A+AR +GA +A + + S A AR Sbjct: 6 IEGKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAE----- 60 Query: 61 AVPTDVARAESVAAALARTERAFGPLDVLVNNAGVNVFGDPLALTDEDWRRCFAIDLDGV 120 A P DV + ++ AR ER GP+D+LVN AGV G +L+DE+W F+++ GV Sbjct: 61 AFPADVRDSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGV 120 Query: 121 WNGCRAALPGMVERGRGSIVNIASTHAFKIIPGCFPYPVAKHGVLGLTRALGIEYAPRNV 180 +N R+ M++R GSIV + S A Y +K + T+ LG+E A N+ Sbjct: 121 FNASRSVSKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNI 180 Query: 181 RVNAIAPGYIETQLTHDWWSAQPDPQAARRETLALQ-----PMKRIGRPDEVAMTAVFLA 235 R N ++PG ET + W+ + + + P+K++ +P ++A +FL Sbjct: 181 RCNIVSPGSTETDMQWSLWADE-NGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLV 239 Query: 236 SDEAPFINASCITIDGGRSV 255 S +A I + +DGG ++ Sbjct: 240 SGQAGHITMHNLCVDGGATL 259
>PF05272#Virulence-associated E family protein Length = 892 Score = 29.7 bits (66), Expect = 0.038 Identities = 14/38 (36%), Positives = 17/38 (44%), Gaps = 5/38 (13%) Query: 18 RALD-GISFDVQAGQVHGLMGENGAGKSTLLKILGGEY 54 R ++ G FD L G G GKSTL+ L G Sbjct: 587 RVMEPGCKFDY----SVVLEGTGGIGKSTLINTLVGLD 620
>DHBDHDRGNASE#2,3-dihydro-2,3-dihydroxybenzoate dehydrogenase signature. Length = 261 Score = 123 bits (310), Expect = 3e-36 Identities = 76/249 (30%), Positives = 113/249 (45%), Gaps = 8/249 (3%) Query: 26 GRAVLITGGATGIGASFVEHFARQGARVAFVDLDEKAGRALVARLADAAHEPVFVVCDLT 85 G+ ITG A GIG + A QGA +A VD + + +V+ L A D+ Sbjct: 8 GKIAFITGAAQGIGEAVARTLASQGAHIAAVDYNPEKLEKVVSSLKAEARHAEAFPADVR 67 Query: 86 DIGALRGAIDAIRVRIGPIAVLVNNAANDVRHAVADVTPESFDASIAVNLRHQFFAAQAV 145 D A+ I +GPI +LVN A + ++ E ++A+ +VN F A+++V Sbjct: 68 DSAAIDEITARIEREMGPIDILVNVAGVLRPGLIHSLSDEEWEATFSVNSTGVFNASRSV 127 Query: 146 IDDMKRLGGGAIVNLGSIGWMLKNAGYPVYATAKAAVQGLTRALARELGPFGIRVNTLVP 205 M G+IV +GS + YA++KAA T+ L EL + IR N + P Sbjct: 128 SKYMMDRRSGSIVTVGSNPAGVPRTSMAAYASSKAAAVMFTKCLGLELAEYNIRCNIVSP 187 Query: 206 GWVMTDKQRRLWLDDAGRAAIKAGQCIDAEL--------LPGDLARMALFLAADDSRLIT 257 G TD Q LW D+ G + G + P D+A LFL + + IT Sbjct: 188 GSTETDMQWSLWADENGAEQVIKGSLETFKTGIPLKKLAKPSDIADAVLFLVSGQAGHIT 247 Query: 258 AQDVVVDGG 266 ++ VDGG Sbjct: 248 MHNLCVDGG 256
>FLGPRINGFLGI#Flagellar P-ring protein signature. Length = 373 Score = 371 bits (953), Expect = e-129 Identities = 164/392 (41%), Positives = 225/392 (57%), Gaps = 27/392 (6%) Query: 4 RVVRPLVAARRRAAACCALAACMLALAFAPAAARAERLKDLAQIQGVRDNPLIGYGLVVG 63 RV+R + AA +A L+ PA A R+KD+A +Q RDN LIGYGLVVG Sbjct: 2 RVLRIIAAALVFSALPF--------LSTPPAQADTSRIKDIASLQAGRDNQLIGYGLVVG 53 Query: 64 LDGTGDQTMQTPFTTQTLANMLANLGISINNGSANGGGSSAMTNMQLKNVAAVMVTATLP 123 L GTGD +PFT Q++ ML NLGI+ G +N KN+AAVMVTA LP Sbjct: 54 LQGTGDSLRSSPFTEQSMRAMLQNLGITTQGGQSN-----------AKNIAAVMVTANLP 102 Query: 124 PFARPGEAIDVTVSSLGNAKSLRGGTLLLTPLKGADGQVYALAQGNMAVGGAGASANGSR 183 PFA PG +DVTVSSLG+A SLRGG L++T L GADGQ+YA+AQG + V G A + + Sbjct: 103 PFASPGSRVDVTVSSLGDATSLRGGNLIMTSLSGADGQIYAVAQGALIVNGFSAQGDAAT 162 Query: 184 VQVNQLAAGRIAGGAIVERSVPNAVAQMNGVLQLQLNDMDYGTAQRIVSAVNS----SFG 239 + + R+ GAI+ER +P+ L LQL + D+ TA R+ VN+ +G Sbjct: 163 LTQGVTTSARVPNGAIIERELPSKFKDSV-NLVLQLRNPDFSTAVRVADVVNAFARARYG 221 Query: 240 AGTATALDGRTIQLTAPADSAQQVAFMARLQNLEVSPERAAAKVILNARTGSIVMNQMVT 299 A D + I + P + MA ++NL V + AKV++N RTG+IV+ V Sbjct: 222 DPIAEPRDSQEIAVQKPRVA-DLTRLMAEIENLTVETD-TPAKVVINERTGTIVIGADVR 279 Query: 300 LQNCAVAHGNLSVVVNTQPVVSQPGPFSNGQTVVAQQSQIQLKQDNGSLRMVTAGANLAE 359 + AV++G L+V V P V QP PFS GQT V Q+ I Q+ + + G +L Sbjct: 280 ISRVAVSYGTLTVQVTESPQVIQPAPFSRGQTAVQPQTDIMAMQEGSKV-AIVEGPDLRT 338 Query: 360 VVKALNSLGATPADLMSILQAMKAAGALRADL 391 +V LNS+G +++ILQ +K+AGAL+A+L Sbjct: 339 LVAGLNSIGLKADGIIAILQGIKSAGALQAEL 370
>FLGLRINGFLGH#Flagellar L-ring protein signature. Length = 232 Score = 206 bits (526), Expect = 3e-69 Identities = 127/220 (57%), Positives = 156/220 (70%), Gaps = 7/220 (3%) Query: 25 AALAAAALAGCAQIPREPITQQPMSAMPPMPPAMQAPGSIY---NPGYAG-RPLFEDQRP 80 ++L +L GCA IP P+ Q SA P P A GSI+ P G +PLFED+RP Sbjct: 12 SSLLVLSLTGCAWIPSTPLVQGATSAQPVPGPTPVANGSIFQSAQPINYGYQPLFEDRRP 71 Query: 81 RNVGDILTIVIAENINATKSSGANTNRQGNTSFDVPTAG-FLGGLF--NKANLSAQGANK 137 RN+GD LTIV+ EN++A+KSS AN +R G T+F T +L GLF +A++ A G N Sbjct: 72 RNIGDTLTIVLQENVSASKSSSANASRDGKTNFGFDTVPRYLQGLFGNARADVEASGGNT 131 Query: 138 FAATGGASAANTFNGTITVTVTNVLPNGNLVVSGEKQMLINQGNEFVRFSGIVNPNTISG 197 F GGA+A+NTF+GT+TVTV VL NGNL V GEKQ+ INQG EF+RFSG+VNP TISG Sbjct: 132 FNGKGGANASNTFSGTLTVTVDQVLVNGNLHVVGEKQIAINQGTEFIRFSGVVNPRTISG 191 Query: 198 QNSVYSTQVADARIEYSAKGYINEAETMGWLQRFFLNIAP 237 N+V STQVADARIEY GYINEA+ MGWLQRFFLN++P Sbjct: 192 SNTVPSTQVADARIEYVGNGYINEAQNMGWLQRFFLNLSP 231
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 42.3 bits (99), Expect = 1e-06 Identities = 10/48 (20%), Positives = 23/48 (47%) Query: 213 TLKQGYVESSNVNVVQELVNMIQTQRAYEINSKAVTTSDQMLQTVTQM 260 L S VN+ +E N+ + Q+ Y N++ + T++ + + + Sbjct: 498 QLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545 Score = 40.3 bits (94), Expect = 5e-06 Identities = 19/80 (23%), Positives = 34/80 (42%), Gaps = 14/80 (17%) Query: 4 SLYIAATGMNAQQAQMDVISNNLANVSTNGFKGSRAVFEDLLYQTVRQPGANSTQQTELP 63 + A +G+NA QA ++ SNN+++ + G+ RQ + + L Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYT--------------RQTTIMAQANSTLG 48 Query: 64 SGLQLGTGVQQVATERLYTQ 83 +G +G GV +R Y Sbjct: 49 AGGWVGNGVYVSGVQREYDA 68
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 29.2 bits (65), Expect = 0.019 Identities = 9/34 (26%), Positives = 18/34 (52%) Query: 4 LIYTAMTGATQSLEQQSVVANNLANASTTGFRAQ 37 LI AM+G + + +NN+++ + G+ Q Sbjct: 3 LINNAMSGLNAAQAALNTASNNISSYNVAGYTRQ 36
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 34.2 bits (78), Expect = 0.001 Identities = 17/58 (29%), Positives = 24/58 (41%) Query: 356 ISAPGSTNHGTLQGSALENSNVDLTSQLVKLITAQRNYQANAQTIKTQQTVDQTLINL 413 SA L S V+L + L Q+ Y ANAQ ++T + LIN+ Sbjct: 488 SSATQGNVVTQLSNQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545 Score = 30.3 bits (68), Expect = 0.017 Identities = 11/31 (35%), Positives = 17/31 (54%) Query: 6 GLSGLAGASSDLDVIGNNIANANTVGFKGST 36 +SGL A + L+ NNI++ N G+ T Sbjct: 7 AMSGLNAAQAALNTASNNISSYNVAGYTRQT 37
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 84.9 bits (210), Expect = 3e-21 Identities = 31/114 (27%), Positives = 55/114 (48%), Gaps = 2/114 (1%) Query: 5 ILLVDDHAIVRQGIRQLLIDRGIAREVKEAECGGDALVIAEKSEFDVILLDISLPDMNGI 64 IL+ DD A +R + Q L G +V+ + D+++ D+ +PD N Sbjct: 6 ILVADDDAAIRTVLNQALSRAGY--DVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAF 63 Query: 65 EVLKRLKRRLPSTPVLMFSMYREDQFAVRALKAGAAGYLSKTVNAAQMVSAISQ 118 ++L R+K+ P PVL+ S A++A + GA YL K + +++ I + Sbjct: 64 DLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGR 117
>TYPE3IMSPROT#Type III secretion system inner membrane S protein family signature. Length = 354 Score = 62.1 bits (151), Expect = 5e-15 Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 1/69 (1%) Query: 12 APRVVAKGYGLVAERIIERARDAGLYVHTAPEMV-SLLMQVDLDARIPPQLYQAVAELLA 70 P V K + + + A + G+ + + +L +D IP + +A AE+L Sbjct: 280 LPLVTFKYTDAQVQTVRKIAEEEGVPILQRIPLARALYWDALVDHYIPAEQIEATAEVLR 339 Query: 71 WLYALERDA 79 WL + Sbjct: 340 WLERQNIEK 348
>FLGHOOKFLIE#Flagellar hook-basal body complex protein FliE signature. Length = 103 Score = 61.2 bits (148), Expect = 9e-16 Identities = 47/111 (42%), Positives = 62/111 (55%), Gaps = 8/111 (7%) Query: 3 APVNGIASALQQMQAMAAQAAGGASPATSLAGSGAASAGSFASAMKASLDKISGDQQKAL 62 + + GI + Q+QA A A S SFA + A+LD+IS Q A Sbjct: 1 SAIQGIEGVISQLQATAMSARAQES--------LPQPTISFAGQLHAALDRISDTQTAAR 52 Query: 63 GEAHAFEIGAQNVSLNDVMVDMQKANIGFQFGLQVRNKLVSAYNEIMQMSV 113 +A F +G V+LNDVM DMQKA++ Q G+QVRNKLV+AY E+M M V Sbjct: 53 TQAEKFTLGEPGVALNDVMTDMQKASVSMQMGIQVRNKLVAAYQEVMSMQV 103
>FLGMRINGFLIF#Flagellar M-ring protein signature. Length = 559 Score = 462 bits (1191), Expect = e-159 Identities = 253/562 (45%), Positives = 359/562 (63%), Gaps = 37/562 (6%) Query: 53 LSRMKTNPRLPFLIGAALAIAAIVALVLWSRAPDYRVLYSNLSDRDGGAIIAALQQANVP 112 L+R++ NPR+P ++ + A+A +VA+VLW++ PDYR L+SNLSD+DGGAI+A L Q N+P Sbjct: 16 LNRLRANPRIPLIVAGSAAVAIVVAMVLWAKTPDYRTLFSNLSDQDGGAIVAQLTQMNIP 75 Query: 113 YKFADAGGAILVPANQVHETRLKLAAMGLPKGGSVGFELMDNQKFGISQFAEQVNYQRAL 172 Y+FA+ GAI VPA++VHE RL+LA GLPKGG+VGFEL+D +KFGISQF+EQVNYQRAL Sbjct: 76 YRFANGSGAIEVPADKVHELRLRLAQQGLPKGGAVGFELLDQEKFGISQFSEQVNYQRAL 135 Query: 173 EGELQRTVESSNAVRAARVYLAIPKPSVFVRDREAPSASVLVDLYPGRVLDEGQVLAVTR 232 EGEL RT+E+ V++ARV+LA+PKPS+FVR++++PSASV V L PGR LDEGQ+ AV Sbjct: 136 EGELARTIETLGPVKSARVHLAMPKPSLFVREQKSPSASVTVTLEPGRALDEGQISAVVH 195 Query: 233 MVSSSVPDMPAKNVTIVDQDGNLLTQT-ASATGLDASQLKYVQQIERNTQKRIDAILAPI 291 +VSS+V +P NVT+VDQ G+LLTQ+ S L+ +QLK+ +E Q+RI+AIL+PI Sbjct: 196 LVSSAVAGLPPGNVTLVDQSGHLLTQSNTSGRDLNDAQLKFANDVESRIQRRIEAILSPI 255 Query: 292 FGAGNARSQVSADVDFSKIEQTSESYGPNGTPQQSAIRSQQTSSSTELAQSGASGVPGAL 351 G GN +QV+A +DF+ EQT E Y PNG ++ +RS+Q + S ++ GVPGAL Sbjct: 256 VGNGNVHAQVTAQLDFANKEQTEEHYSPNGDASKATLRSRQLNISEQVGAGYPGGVPGAL 315 Query: 352 SNTPPQPASAPIVA-------------SNGQPAGPAATPVSDRKDSTTNYELDKTVRHVE 398 SN P P API ++ +A P S +++ T+NYE+D+T+RH + Sbjct: 316 SNQPAPPNEAPIATPPTNQQNAQNTPQTSTSTNSNSAGPRSTQRNETSNYEVDRTIRHTK 375 Query: 399 QSMGTIKRLSVAVVVNYQPSTDAKGRVTMQPLAADKLAQVQQLVKDAMGYDEKRGDSVNV 458 ++G I+RLSVAVVVNY+ D K PL AD++ Q++ L ++AMG+ +KRGD++NV Sbjct: 376 MNVGDIERLSVAVVVNYKTLADGKP----LPLTADQMKQIEDLTREAMGFSDKRGDTLNV 431 Query: 459 VNSAFSAAADPFANLPWWRQPDMIELGKDIAKWLGVAAAAAALYFMFVRPALRR---AFP 515 VNS FSA + LP+W+Q I+ +WL V A L+ VRP L R Sbjct: 432 VNSPFSAVDNTGGELPFWQQQSFIDQLLAAGRWLLVLVVAWILWRKAVRPQLTRRVEEAK 491 Query: 516 PPAEPAAAAVPALDGPDDMLALDGLPSPDKKQLAEEDEEHPALLAFENERNRYERNLDYA 575 E A + + L+ D + N+R E Sbjct: 492 AAQEQAQVRQETEEAVEVRLSKDEQLQQRR----------------ANQRLGAEVMSQRI 535 Query: 576 
RTIARQDPKIVATVVKNWVSDE 597 R ++ DP++VA V++ W+S++ Sbjct: 536 REMSDNDPRVVALVIRQWMSND 557
>FLGMOTORFLIG#Flagellar motor switch protein FliG signature. Length = 344 Score = 297 bits (762), Expect = e-102 Identities = 114/324 (35%), Positives = 191/324 (58%) Query: 5 GLNKSALLLMSIGDEEAAQVFKFLAPREVQKIGAAMAALKNVTREQVEDVLNDFVQEAEK 64 G K+A+LL+SIG E +++VFK+L+ E++ + +A L+ +T E ++VL +F + Sbjct: 17 GKQKAAILLVSIGSEISSKVFKYLSQEEIESLTFEIAKLETITSELKDNVLLEFKELMMA 76 Query: 65 HTALSLDSSEYIRTVLTKALGEDKAGVLIDRILQGSDTSGIEGLKWMDSAAVAELIKNEH 124 + +Y R +L K+LG KA +I+ + + E ++ D A + I+ EH Sbjct: 77 QEFIQKGGIDYARELLEKSLGTQKAVDIINNLGSALQSRPFEFVRRADPANILNFIQQEH 136 Query: 125 PQIIATILVHLDRDQASEIASCFTERLRNDVLLRIATLDGIQPTALRELDDVLTGLLSGS 184 PQ IA IL +LD +AS I S ++ +V RIA +D P +RE++ VL L+ Sbjct: 137 PQTIALILSYLDPQKASFILSSLPTEVQTNVARRIALMDRTSPEVVREVERVLEKKLASL 196 Query: 185 DNLKRAPMGGIRTAAEILNFMTSVHEEAVIENVKQYDPDLAQKIIDQMFVFENLLDLEDR 244 + GG+ EI+N E+ +IE++++ DP+LA++I +MFVFE+++ L+DR Sbjct: 197 SSEDYTSAGGVDNVVEIINMADRKTEKFIIESLEEEDPELAEEIKKKMFVFEDIVLLDDR 256 Query: 245 AIQLLLKEVESEALIIALKGAPPALRQKFLSNMSQRAAELLAEDLDARGPVRVSEVETQQ 304 +IQ +L+E++ + L ALK +++K NMS+RAA +L ED++ GP R +VE Q Sbjct: 257 SIQRVLREIDGQELAKALKSVDIPVQEKIFKNMSKRAASMLKEDMEFLGPTRRKDVEESQ 316 Query: 305 RKILQVVRNLAESGQIVIGGKAED 328 +KI+ ++R L E G+IVI E+ Sbjct: 317 QKIVSLIRKLEEQGEIVISRGGEE 340
>FLGFLIH#Flagellar assembly protein FliH signature. Length = 228 Score = 108 bits (271), Expect = 3e-31 Identities = 64/184 (34%), Positives = 106/184 (57%), Gaps = 4/184 (2%) Query: 37 AAAALAAELQRVRDAAHAEGLAAGHVEGQALGYQAGYEQGRAKGFDEGRAEAHTHAAQLA 96 A +L +L +++ AH +G AG EG+ G++ GY++G A+G ++G AEA + A + Sbjct: 36 AEPSLEQQLAQLQMQAHEQGYQAGIAEGRQQGHKQGYQEGLAQGLEQGLAEAKSQQAPIH 95 Query: 97 A----LAASFRDALAGVERDLADDIATLALEIAQQVVRQHVQHDPAALIAAAREVLAAEP 152 A L + F+ L ++ +A + +ALE A+QV+ Q D +ALI +++L EP Sbjct: 96 ARMQQLVSEFQTTLDALDSVIASRLMQMALEAARQVIGQTPTVDNSALIKQIQQLLQQEP 155 Query: 153 ALAGAPHLIVNPADLPVVEAYLKDELDTLGWSVRTDTSIERGGCRAHASTGEIDATLTTR 212 +G P L V+P DL V+ L L GW +R D ++ GGC+ A G++DA++ TR Sbjct: 156 LFSGKPQLRVHPDDLQRVDDMLGATLSLHGWRLRGDPTLHPGGCKVSADEGDLDASVATR 215 Query: 213 WERV 216 W+ + Sbjct: 216 WQEL 219
>FLGFLIJ#Flagellar FliJ protein signature. Length = 147 Score = 59.8 bits (144), Expect = 2e-14 Identities = 43/140 (30%), Positives = 74/140 (52%) Query: 1 MAQSFPLQLLLERAQDDLDTAAKQLGRAQRERTDAQAQLDALMRYRDEYRVRFAESAQSG 60 MA+ L L + A+ +++ AA+ LG +R A+ QL L+ Y++EYR +G Sbjct: 1 MAEHGALATLKDLAEKEVEDAARLLGEMRRGCQQAEEQLKMLIDYQNEYRNNLNSDMSAG 60 Query: 61 MPAGNWRNFQAFLDTLDAAIEQQRRVLAAAQTRIDAARPEWQAKKRTLGSYEILQARGAR 120 + + W N+Q F+ TL+ AI Q R+ L ++D A W+ KK+ L +++ LQ R + Sbjct: 61 ITSNRWINYQQFIQTLEKAITQHRQQLNQWTQKVDIALNSWREKKQRLQAWQTLQERQST 120 Query: 121 QDAQRAAKREQRDADEHAAK 140 + +Q+ DE A + Sbjct: 121 AALLAENRLDQKKMDEFAQR 140
>FLGHOOKFLIK#Flagellar hook-length control protein signature. Length = 375 Score = 73.7 bits (180), Expect = 2e-16 Identities = 79/257 (30%), Positives = 109/257 (42%), Gaps = 8/257 (3%) Query: 213 NGDASAPLAANRAAFDKLLAGAKAPAAQAAPTDASGANPATALANAAANAAQPDASG--A 270 N D +A L+A A K A + T L + AQPD + Sbjct: 124 NEDVTASLSALFAMLPGFDNTPKVTDAPSTVLPTEKPTLFTKLTSEQLTTAQPDDAPGTP 183 Query: 271 LAALQDAADSARATLAASSAPAALQQAA-PAALAANASAAAASAAPSLAPPVGTPDWTDA 329 L A++ S P+ + AA P AAP L+ P+G+ +W + Sbjct: 184 AQPLTPLVAEAQSKAEVISTPSPVTAAASPLITPHQTQPLPTVAAPVLSAPLGSHEWQQS 243 Query: 330 LSQKVVFLSNAHQQSAELTLNPPDLGPLQVVLRVADNHAHALFVSQHAQVRDAVEAALPK 389 LSQ + + QQSAEL L+P DLG +Q+ L+V DN A VS H VR A+EAALP Sbjct: 244 LSQHISLFTRQGQQSAELRLHPQDLGEVQISLKVDDNQAQIQMVSPHQHVRAALEAALPV 303 Query: 390 LREAMEAGGLGLGSASVSDGGFASAQQQQTPQRQSSDGSATRRAFGASTADAALDELAAA 449 LR + G+ LG +++S F+ QQ + Q+QS +A D L Sbjct: 304 LRTQLAESGIQLGQSNISGESFSGQQQAASQQQQSQR-TANHEPLAGEDDDT----LPVP 358 Query: 450 SSGGAARRTVGMVDTFA 466 S VD FA Sbjct: 359 VSLQGRVTGNSGVDIFA 375
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 44.5 bits (105), Expect = 6e-07 Identities = 62/361 (17%), Positives = 114/361 (31%), Gaps = 45/361 (12%) Query: 66 LPEFSKAFGVSPAQSSLALSFATAALAAAVFVAGFVSEALSRHRLMTASLTASSLLTLAA 125 LP+ + F PA ++ + + V G +S+ L RL+ + + ++ Sbjct: 37 LPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLLFGIIINCFGSVIG 96 Query: 126 AFAPHWHQLLIL-RALTGLALGGVPAVAMAYLAEEVHPDGLGLAMGLYVGGTAIGGMAGR 184 + LLI+ R + G PA+ M +A + + G A GL A+G G Sbjct: 97 FVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGLIGSIVAMGEGVGP 156 Query: 185 VITGILTDLFSWRIAVGAIGVLGLASMLAFRMLLPPSRH--------------------- 223 I G++ W + + + ++L R Sbjct: 157 AIGGMIAHYIHWSYLLLIPMITIITVPFLMKLLKKEVRIKGHFDIKGIILMSVGIVFFML 216 Query: 224 ------------FVPRRGLNLAHHRTS----LAHHLRGQRELPVLFAMAFVLMGSFVTLY 267 V + + H R + L + ++ G+ Sbjct: 217 FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVDPGLGKNIPFMIGVLCGGIIFGTVAGFV 276 Query: 268 NYIGYRLLAPPYSMGQATIGA--IFVVYLVGVVASPLSGRLADTLGRGRVLI---ASLAV 322 + + Y ++ + + A IG+ IF + ++ + G L D G VL L+V Sbjct: 277 SMVPY-MMKDVHQLSTAEIGSVIIFPGTMSVIIFGYIGGILVDRRGPLYVLNIGVTFLSV 335 Query: 323 MLGGVALTLLHPVAAIVAGVACVTFGFFAGHAVASGWVGR-LAQHGKGQAAALYLLAYYL 381 + L + + V G V S V L Q G +L +L Sbjct: 336 SFLTASFLLETTSWFMTIIIVFVLGGLSFTKTVISTIVSSSLKQQEAGAGMSLLNFTSFL 395 Query: 382 G 382 Sbjct: 396 S 396
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 95.3 bits (237), Expect = 6e-25 Identities = 36/124 (29%), Positives = 60/124 (48%), Gaps = 1/124 (0%) Query: 2 RILLVEDDRMIAEGVRKALKADGCAVDWVQDGDAALTALGGEAYDLLLLDLGLPKRDGID 61 IL+ +DD I + +AL G V + + DL++ D+ +P + D Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFD 64 Query: 62 VLRTLRARGLALPVLILTARDAVADRVKGLDAGADDYLVKPFDLDE-LAARMRALIRRQS 120 +L ++ LPVL+++A++ +K + GA DYL KPFDL E + RAL + Sbjct: 65 LLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEPKR 124 Query: 121 GRSE 124 S+ Sbjct: 125 RPSK 128
>V8PROTEASE#V8 serine protease family signature. Length = 336 Score = 78.1 bits (192), Expect = 6e-18 Identities = 38/207 (18%), Positives = 71/207 (34%), Gaps = 40/207 (19%) Query: 69 QRRAAPQLPIDPDDP-----FYQFFRHFYGQIPGMGGGRQPQPDDQPSTSLGSGFIISAD 123 ++R + + +D I Q + T + SG ++ Sbjct: 62 EQREHANVILPNNDRHQITDTTNGHYAPVTYI---------QVEAPTGTFIASGVVV-GK 111 Query: 124 GYILTNAHVIDGANVVTVKLTDKR-----------EYKA-KVVGADKQSDVAVLKIDA-- 169 +LTN HV+D + L + A ++ + D+A++K Sbjct: 112 DTLLTNKHVVDATHGDPHALKAFPSAINQDNYPNGGFTAEQITKYSGEGDLAIVKFSPNE 171 Query: 170 ------SGLPIVKIGDPAQSKVGQWVVAIGSPYGFDNTVTSGIISAKSRALPDENYTPFI 223 + + + A+++V Q + G P +K + + + Sbjct: 172 QNKHIGEVVKPATMSNNAETQVNQNITVTGYPGDKPVATMW---ESKGKITYLKGE--AM 226 Query: 224 QTDVPVNPGNSGGPLFNLNGEVIGINS 250 Q D+ GNSG P+FN EVIGI+ Sbjct: 227 QYDLSTTGGNSGSPVFNEKNEVIGIHW 253
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 126 bits (317), Expect = 2e-38 Identities = 81/209 (38%), Positives = 116/209 (55%), Gaps = 1/209 (0%) Query: 1 MARRTKEEALATRDRILDAAEHVFFEKGVSHTSLADIAQHAGVTRGAIYWHFASKSELFD 60 MAR+TK+EA TR ILD A +F ++GVS TSL +IA+ AGVTRGAIYWHF KS+LF Sbjct: 1 MARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFS 60 Query: 61 AMFDRVLLPIDELKAGT-GEPHADPLGRIREILIWCLLGAARDPQLRRVFSILFMKCEYV 119 +++ I EL+ + DPL +REILI L + + R + I+F KCE+V Sbjct: 61 EIWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLMEIIFHKCEFV 120 Query: 120 ADMGPLLQRNREGMRDALRNIEADLAQGVANGQLPADLDTWRATLMLHTLVSGFVRDMLM 179 +M + Q R ++ IE L + LPADL T RA +++ +SG + + L Sbjct: 121 GEMAVVQQAQRNLCLESYDRIEQTLKHCIEAKMLPADLMTRRAAIIMRGYISGLMENWLF 180 Query: 180 LPGEIDAERHAEKLVDGCFDMLRMSPAMR 208 P D ++ A V +M + P +R Sbjct: 181 APQSFDLKKEARDYVAILLEMYLLCPTLR 209
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 41.7 bits (98), Expect = 4e-06 Identities = 42/266 (15%), Positives = 80/266 (30%), Gaps = 75/266 (28%) Query: 92 KIDPAPYIAQLNSAKATLAKAQANLATQNALVARYKVLVAANAVSKQQYDDAVAAQGQAA 151 +++ A+ + A + + + + + + + L+ A++K + +A Sbjct: 206 ELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAV 265 Query: 152 ADVGAGKAAV-------------------------------------------ETAQINL 168 ++ K+ + + Sbjct: 266 NELRVYKSQLEQIESEILSAKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQ 325 Query: 169 GYTDVVSPITGRV-GISQVTPGAYVQASQATLMSTVQQLDPVYVDLTQSSLDGLKLRQDI 227 + + +P++ +V + T G V ++ TLM V + D + V + D + Sbjct: 326 QASVIRAPVSVKVQQLKVHTEGGVVTTAE-TLMVIVPEDDTLEVTALVQNKDIGFINVG- 383 Query: 228 QSGRIK-------TEGPGAAKVTLILEDGKPYPERGKLQFSDVTVDQTTGSVT--IRAI- 277 Q+ IK G KV I D DQ G V I +I Sbjct: 384 QNAIIKVEAFPYTRYGYLVGKVKNI--------------NLDAIEDQRLGLVFNVIISIE 429 Query: 278 -----FPNKQRVLLPGMFVRARIEEG 298 NK L GM V A I+ G Sbjct: 430 ENCLSTGNKNIPLSSGMAVTAEIKTG 455 Score = 30.6 bits (69), Expect = 0.012 Identities = 20/122 (16%), Positives = 35/122 (28%), Gaps = 20/122 (16%) Query: 1 MRVERVPYRLITVATAAVFLAACGKKESAPPPQTPEVGVVTVQPQPVPVVSELPGRTSAY 60 R V Y ++ A L+ G+ E G +T + + Sbjct: 55 RRPRLVAYFIMGFLVIAFILSVLGQVEIVATAN----GKLTHSGRSKEIKPIENSIVKEI 110 Query: 61 LVAQVRARVDGIVLRREFTEGSDVKAGQRLYKIDPAPYIAQLNSAKATLAKAQANLATQN 120 +V EG V+ G L K+ A +++L +A+ Sbjct: 111 IVK----------------EGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQ 154 Query: 121 AL 122 L Sbjct: 155 IL 156
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 1267 bits (3280), Expect = 0.0 Identities = 673/1035 (65%), Positives = 821/1035 (79%), Gaps = 2/1035 (0%) Query: 1 MAKFFIDRPIFAWVIAIILMLAGVAAIFTLPIAQYPTIAPPSIQITANYPGASAKTVEDT 60 MA FFI RPIFAWV+AIILM+AG AI LP+AQYPTIAPP++ ++ANYPGA A+TV+DT Sbjct: 1 MANFFIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDT 60 Query: 61 VTQVIEQQMSGLDNFLYMSSTSDDSGNATITITFAPGTNPDIAQVQVQNKLSLATPILPQ 120 VTQVIEQ M+G+DN +YMSSTSD +G+ TIT+TF GT+PDIAQVQVQNKL LATP+LPQ Sbjct: 61 VTQVIEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQ 120 Query: 121 VVQQLGLSVTKSSSSFLLVLAFNSEDGSMNKYDLANYVASHVKDPISRINGVGTVTLFGS 180 VQQ G+SV KSSSS+L+V F S++ + D+++YVAS+VKD +SR+NGVG V LFG+ Sbjct: 121 EVQQQGISVEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGA 180 Query: 181 QYAMRIWLDPTKLTNYGLTPVDVTSAISAQNVQIAGGQLGGTPAVPGTVLQATITEATLL 240 QYAMRIWLD L Y LTPVDV + + QN QIA GQLGGTPA+PG L A+I T Sbjct: 181 QYAMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRF 240 Query: 241 QTPEQFGNILLKVNQDGSQVRLKDVAQIGLGGETYNFDTKYNGQPTAALGIQLATNANAL 300 + PE+FG + L+VN DGS VRLKDVA++ LGGE YN + NG+P A LGI+LAT ANAL Sbjct: 241 KNPEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANAL 300 Query: 301 ATAKAVRAKIDEMSAYFPHGLVVKYPYDTTPFVRLSIEEVVKTLLEGIVLVFLVMYLFLQ 360 TAKA++AK+ E+ +FP G+ V YPYDTTPFV+LSI EVVKTL E I+LVFLVMYLFLQ Sbjct: 301 DTAKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQ 360 Query: 361 NLRATIIPTIAVPVVLLGTFAIMSMVGFSINVLSMFGLVLAIGLLVDDAIVVVENVERVM 420 N+RAT+IPTIAVPVVLLGTFAI++ G+SIN L+MFG+VLAIGLLVDDAIVVVENVERVM Sbjct: 361 NMRATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVM 420 Query: 421 AEEGLPPKEATRKAMGQITGALVGVALVLSAVFVPVAFSGGSVGAIYRQFSLTIVSAMVL 480 E+ LPPKEAT K+M QI GALVG+A+VLSAVF+P+AF GGS GAIYRQFS+TIVSAM L Sbjct: 421 MEDKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMAL 480 Query: 481 SVLVALILTPALCATILKPIPQGHHEEKKGFFGWFNRTFNSSRDKYHVGVHHVIKRSGRW 540 SVLVALILTPALCAT+LKP+ HHE K GFFGWFN TF+ S + Y V ++ +GR+ 
Sbjct: 481 SVLVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRY 540 Query: 541 LIIYLAVIVAVGLLFVRLPKSFLPDEDQGLMFVIVQTPSGSTQETTARTLANISDYLLTQ 600 L+IY ++ + +LF+RLP SFLP+EDQG+ ++Q P+G+TQE T + L ++DY L Sbjct: 541 LLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKN 600 Query: 601 EKDIVESAFTVNGFSFAGRGQNSGLVFVKLKDYSQRQSSDQKVQALIGRMFGRYAGYKDA 660 EK VES FTVNGFSF+G+ QN+G+ FV LK + +R + +A+I R +D Sbjct: 601 EKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDG 660 Query: 661 LVIPFNPPSIPELGTAAGFDFELTDNAGLGHDALMAARNQLLGMAAKDP-TLRGVRPNGL 719 VIPFN P+I ELGTA GFDFEL D AGLGHDAL ARNQLLGMAA+ P +L VRPNGL Sbjct: 661 FVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGL 720 Query: 720 NDTPQYKVDIDREKANALGVTADAIDQTFSIAWASKYVNNFLDTDGRIKKVYVQSDAPFR 779 DT Q+K+++D+EKA ALGV+ I+QT S A YVN+F+D GR+KK+YVQ+DA FR Sbjct: 721 EDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFID-RGRVKKLYVQADAKFR 779 Query: 780 MTPEDMNIWYVRNGSGGMVPFSAFATGHWTYGSPKLERYNGISAMEIQGQAAPGKSTGQA 839 M PED++ YVR+ +G MVPFSAF T HW YGSP+LERYNG+ +MEIQG+AAPG S+G A Sbjct: 780 MLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDA 839 Query: 840 MTAMETLAKKLPTGIGYSWTGLSFQEIQSGSQAPILYAISILVVFLCLAALYESWSIPFS 899 M ME LA KLP GIGY WTG+S+QE SG+QAP L AIS +VVFLCLAALYESWSIP S Sbjct: 840 MALMENLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPVS 899 Query: 900 VIMVVPLGVIGALLAATLRGLENDVFFQVGLLTTVGLSAKNAILIVEFARELQQTEKMGP 959 V++VVPLG++G LLAATL +NDV+F VGLLTT+GLSAKNAILIVEFA++L + E G Sbjct: 900 VMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKGV 959 Query: 960 IEAALEAARLRLRPILMTSLAFILGVMLLAISNGAGSASQHAIGTGVIGGMITATFLAIF 1019 +EA L A R+RLRPILMTSLAFILGV+ LAISNGAGS +Q+A+G GV+GGM++AT LAIF Sbjct: 960 VEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAIF 1019 Query: 1020 MIPMFFVKVRAVFSG 1034 +P+FFV +R F G Sbjct: 1020 FVPVFFVVIRRCFKG 1034
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 55.2 bits (133), Expect = 3e-11 Identities = 41/159 (25%), Positives = 65/159 (40%), Gaps = 13/159 (8%) Query: 5 VLIADDHPLVLLGVRHMLAGMG-DVSIVGEAHDPAGLLALLAATPCDIVITDFAMPEQPA 63 +L+ADD + + L+ G DV I A L +AA D+V+TD MP+ Sbjct: 6 ILVADDDAAIRTVLNQALSRAGYDVRITSNAAT---LWRWIAAGDGDLVVTDVVMPD--- 59 Query: 64 ADGLAMLTAIRDGYPSVRVIVLTMLDNPVLMHTMRQAGALAVLSKRGDLDEL----PRAL 119 + +L I+ P + V+V++ + + + GA L K DL EL RAL Sbjct: 60 ENAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRAL 119 Query: 120 AAVYQGRPFVGTHAGAAGGGAMRGTDAPRQLSPREIEVV 158 A + + G + G A Q R + + Sbjct: 120 AEPKRRPS--KLEDDSQDGMPLVGRSAAMQEIYRVLARL 156
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 99.2 bits (247), Expect = 2e-24 Identities = 70/331 (21%), Positives = 138/331 (41%), Gaps = 20/331 (6%) Query: 41 AFMEVLDTTIVNVALPHIAGTMSASYDEATWTLTSYLVANGIVLPISGFLGRLLGRKRYF 100 +F VL+ ++NV+LP IA + W T++++ I + G L LG KR Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82 Query: 101 VLCIVAFTICSFLCGIATDLGQLIVF-RVLQGLFGGGLQPNQQSIILDTF-PPEQRNRAF 158 + I+ S + + L++ R +QG G P +++ + P E R +AF Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGA-GAAAFPALVMVVVARYIPKENRGKAF 141 Query: 159 SISAVAIVVAPVLGPTLGGWITDNFSWRWVFLLNVPIGVLTSLAVIQLVEDPPWKRGRAR 218 + + + +GP +GG I W +LL +P+ +T + V L++ K R + Sbjct: 142 GLIGSIVAMGEGVGPAIGGMIAHYIHWS--YLLLIPM--ITIITVPFLMKLLK-KEVRIK 196 Query: 219 GLSIDYIGITLIAIGLGCLQVMLDRGEDEDWFASTFIRTFAGLTAAGLVGATFWLLYAKK 278 G D GI L+++G+ + F +++ +F ++ + + Sbjct: 197 G-HFDIKGIILMSVGIVFFML----------FTTSYSISFLIVSVLSFLIFVKHIRKVTD 245 Query: 279 PVVDLSCLKDRNFTLGCVTIATFAVVLYGSAVLVPQLAQQRLGYTAMLAG-LVLSPGALL 337 P VD K+ F +G + + G +VP + + + G +++ PG + Sbjct: 246 PFVDPGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMS 305 Query: 338 ITLEIPIVSKLMPYVQTRFLVCFGFLLLAAS 368 + + I L+ +++ G L+ S Sbjct: 306 VIIFGYIGGILVDRRGPLYVLNIGVTFLSVS 336
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 99.5 bits (248), Expect = 6e-25 Identities = 63/414 (15%), Positives = 134/414 (32%), Gaps = 91/414 (21%) Query: 51 KRPGKKPLVVLAIIVVLLLVGAFVW-WFATRNQVSTDDA--YTDGNAITIAPKVSGYVVA 107 + P + ++A ++ LV AF+ V+T + G + I P + V Sbjct: 50 ETPVSRRPRLVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKE 109 Query: 108 LAIDDNVYVHRGDLLLVIDQRDYQAQVDAARAQLGLAQAQLDAAQVQLDIA------HVQ 161 + + + V +GD+LL + +A ++ L A+ + Q+ ++ Sbjct: 110 IIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELK 169 Query: 162 FPAQYRQAQA---QIEAAQASFRQALAAYERQHAVDARATSQQAIDVADAQRLTADANVA 218 P + ++ + ++ + ++ Q + + +D A+RLT A + Sbjct: 170 LPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQ-----KYQKELNLDKKRAERLTVLARIN 224 Query: 219 TARAQA----------------------------RTASLVPQQIRQAQTAVEQRRQQVLQ 250 + ++R ++ +EQ ++L Sbjct: 225 RYENLSRVEKSRLDDFSSLLHKQAIAKHAVLEQENKYVEAVNELRVYKSQLEQIESEILS 284 Query: 251 AQA-----------------------------QLEAAQLALSYCEVRAPSDGWITRRNVQ 281 A+ +L + +RAP + + V Sbjct: 285 AKEEYQLVTQLFKNEILDKLRQTTDNIGLLTLELAKNEERQQASVIRAPVSVKVQQLKVH 344 Query: 282 -LGSFLQAGAALFAIVTPQ---LWVTANFKESQLERMRAGDRVSVSVDAYP---NLELHG 334 G + L IV P+ L VTA + + + G + V+A+P L G Sbjct: 345 TEGGVVTTAETLMVIV-PEDDTLEVTALVQNKDIGFINVGQNAIIKVEAFPYTRYGYLVG 403 Query: 335 HVDSIQLGSGSRFSAFPPENATGNFVKIVQRVPVKIAIDGGLPRDPPLGIGLSV 388 V +I L + + G ++ + G ++ PL G++V Sbjct: 404 KVKNINLDA-------IEDQRLGLVFNVIISIEENCLSTGN--KNIPLSSGMAV 448
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 135 bits (342), Expect = 6e-37 Identities = 84/396 (21%), Positives = 159/396 (40%), Gaps = 16/396 (4%) Query: 27 VFMNVLDTSIANVAIPTISGDLGVSSDQGTWVITSFAVANAISVPLTGWLTDRIGQVRLF 86 F +VL+ + NV++P I+ D WV T+F + +I + G L+D++G RL Sbjct: 23 SFFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLL 82 Query: 87 LASIILFVISSWMCGLAPT-LPFLLASRVLQGAVAGPMIPLSQALLLSSYPRAKAPMALA 145 L II+ S + + + L+ +R +QGA A L ++ P+ A Sbjct: 83 LFGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFG 142 Query: 146 LWSMTTLIAPVAGPILGGWISDNYSWPWIFYVNIPVGIAAAAVTWMIYRSRESAVRRAPI 205 L + GP +GG I+ W ++ IP+ I V +++ ++ + Sbjct: 143 LIGSIVAMGEGVGPAIGGMIAHYIHWSYLL--LIPM-ITIITVPFLMKLLKKEVRIKGHF 199 Query: 206 DGVGLALLVIWVGSLQIMLDKGKDLDWFASTTIVVLALTALIAFAFFVVWELTAEHPVVD 265 D G+ L+ + G + ML F ++ + + ++++F FV P VD Sbjct: 200 DIKGIILMSV--GIVFFML--------FTTSYSISFLIVSVLSFLIFVKHIRKVTDPFVD 249 Query: 266 LSLFRMRNFSGGTIALSVGYGLYFGNLVLLPLWLQTQIGYTATDAG-LVMAPVGFFAILL 324 L + F G + + +G G + ++P ++ + + G +++ P I+ Sbjct: 250 PGLGKNIPFMIGVLCGGIIFGTVAGFVSMVPYMMKDVHQLSTAEIGSVIIFPGTMSVIIF 309 Query: 325 SPLTGKFLSRTDPRYIATAAFLTFALCFWMRSRYTTGVDEWSLMAPTFVQGIAMAGFFIP 384 + G + R P Y+ ++ F S + + FV G ++ Sbjct: 310 GYIGGILVDRRGPLYVLNIGVTFLSVSFLTASFLLETTSWFMTIIIVFVLG-GLSFTKTV 368 Query: 385 LVSITLSGLPGHRIPAASGLSNFVRIMCGGIGTSIF 420 + +I S L A L NF + G G +I Sbjct: 369 ISTIVSSSLKQQEAGAGMSLLNFTSFLSEGTGIAIV 404
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 72.6 bits (178), Expect = 6e-16 Identities = 36/270 (13%), Positives = 85/270 (31%), Gaps = 28/270 (10%) Query: 94 ADSQVALQQAEANLAQTVRQVRGLYVNDDQYRAQVALRQSDLS--------------KAQ 139 + Q Q E NL + + + ++Y + +S L Sbjct: 196 STWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLDDFSSLLHKQAIAKHAVL 255 Query: 140 DDLRRRLAVAQTGAVSQEEISHARDAVKAAQASLDAAGQQLASNRALTANTTVADHPNVL 199 + + + V + ++ + +A+ Q + L N+ Sbjct: 256 EQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFKN-EILDKLRQT--TDNIG 312 Query: 200 AAAAKVRDAYLNNARNTLPAPVTGYVAKRSVR-VGQRVSPGTPLMSVVPLNAV-WVDANF 257 ++ + + APV+ V + V G V+ LM +VP + V A Sbjct: 313 LLTLELAKNEERQQASVIRAPVSVKVQQLKVHTEGGVVTTAETLMVIVPEDDTLEVTALV 372 Query: 258 KEVQLKHMRIGQPVELTADIYGSSVKYHGKVIGFSAGTGAAFSLLPAQNATGNWIKVVQR 317 + + + +GQ + + + + +G ++G + + +V Sbjct: 373 QNKDIGFINVGQNAIIKVEAFPYTR--YGYLVG-------KVKNINLDAIEDQRLGLVFN 423 Query: 318 LPVRVELDPKELKEHPLRIGLSMQVDVDIK 347 + + +E + + + M V +IK Sbjct: 424 VIISIEENCLSTGNKNIPLSSGMAVTAEIK 453 Score = 47.1 bits (112), Expect = 8e-08 Identities = 32/207 (15%), Positives = 72/207 (34%), Gaps = 28/207 (13%) Query: 29 VIAIAAIAYGLYYLLVARFHETTDDAYVNGNVV------QITPQVTGTVIAVKADDTQTV 82 ++A + + + +++ + A NG + +I P V + + ++V Sbjct: 59 LVAYFIMGFLVIAFILSVLGQVEIVATANGKLTHSGRSKEIKPIENSIVKEIIVKEGESV 118 Query: 83 KSGDPLVVLDPADSQVALQQAEANLAQT---------------VRQVRGLYVNDDQYRAQ 127 + GD L+ L ++ + +++L Q + ++ L + D+ Y Sbjct: 119 RKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILSRSIELNKLPELKLPDEPYFQN 178 Query: 128 VA----LRQSDLSKAQ-DDLRRRLAVAQ-TGAVSQEEISHARDAVKAAQASLDAAGQQLA 181 V+ LR + L K Q + + + + E + + +L Sbjct: 179 VSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERLTVLARINRYENLSRVEKSRLD 238 Query: 182 SNRALTANTTVADHPNVLAAAAKVRDA 208 +L +A H VL K +A Sbjct: 239 DFSSLLHKQAIAKH-AVLEQENKYVEA 264
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 171 bits (435), Expect = 5e-48 Identities = 102/435 (23%), Positives = 172/435 (39%), Gaps = 62/435 (14%) Query: 5 LRNIAIIAHVDHGKTTLVDQLLRQSGTFRENQQVAE--RVMDSNDIEKERGITILAKNCA 62 + NI ++AHVD GKTTL + LL SG E V + D+ +E++RGITI + Sbjct: 3 IINIGVLAHVDAGKTTLTESLLYNSGAITELGSVDKGTTRTDNTLLERQRGITIQTGITS 62 Query: 63 VEYEGTHINIVDTPGHADFGGEVERVLSMVDSVLLLVDAVEGPMPQTRFVTKKALALGLK 122 ++E T +NI+DTPGH DF EV R LS++D +LL+ A +G QTR + +G+ Sbjct: 63 FQWENTKVNIIDTPGHMDFLAEVYRSLSVLDGAILLISAKDGVQAQTRILFHALRKMGIP 122 Query: 123 PIVVINKIDRPGARIDWV-------------INQTFDLFDKLGATE----EQLDFPIV-- 163 I INKID+ G + V I Q +L+ + T EQ D I Sbjct: 123 TIFFINKIDQNGIDLSTVYQDIKEKLSAEIVIKQKVELYPNMCVTNFTESEQWDTVIEGN 182 Query: 164 -----------------YASGLNGY---ASLDP-----AARDGDMRPLFEAILQHVPVRP 198 + SL P A + + L E I Sbjct: 183 DDLLEKYMSGKSLEALELEQEESIRFHNCSLFPVYHGSAKNNIGIDNLIEVITNKFYSST 242 Query: 199 ADPDAPLQLQITSLDYSTYVGRIGVGRITRGRIKPGQPVVMRFGPEGDVLNRKINQVLSF 258 + L ++ ++YS R+ R+ G + V R + + KI ++ + Sbjct: 243 HRGQSELCGKVFKIEYSEKRQRLAYIRLYSGVLHLRDSV--RISEKEKI---KITEMYTS 297 Query: 259 QGLERVQVDSAEAGDIVLINGIEDVGIGATICAVEAPEALPMITVDEPTLTMNFLVNSSP 318 E ++D A +G+IV++ E + + + + + I P L + Sbjct: 298 INGELCKIDKAYSGEIVILQN-EFLKLNSVLGDTKLLPQRERIENPLPLLQTTVEPSKPQ 356 Query: 319 LAGREGKFVTSRQIRDRLMKELNHNVALRVKDTGDETVFEVSGRGELHLTILVENMRRE- 377 + D L++ V E + +S G++ + + ++ + Sbjct: 357 QREMLLDALLEISDSDPLLR-------YYVDSATHEII--LSFLGKVQMEVTCALLQEKY 407 Query: 378 GYELAVSRPRVVMQE 392 E+ + P V+ E Sbjct: 408 HVEIEIKEPTVIYME 422 Score = 33.7 bits (77), Expect = 0.002 Identities = 17/100 (17%), Positives = 32/100 (32%), Gaps = 1/100 (1%) Query: 387 RVVMQEIDGVKHEPYELLTVDLEDEHQGGVMEELGRRKGEMLDMVSDGRGRTRLEYRIPA 446 V+++ EPY + E+ + + ++D L IPA Sbjct: 525 EQVLKKAGTELLEPYLSFKIYAPQEYLSRAYTDAPKYCANIVDTQL-KNNEVILSGEIPA 583 Query: 447 RGLIGFQSEFLTLTRGTGLMSHIFDSYAPVKEGSVGERRN 486 R + ++S+ T G + Y V + R Sbjct: 584 
RCIQEYRSDLTFFTNGRSVCLTELKGYHVTTGEPVCQPRR 623
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 30.2 bits (68), Expect = 0.015 Identities = 8/86 (9%), Positives = 27/86 (31%), Gaps = 1/86 (1%) Query: 48 EVPAPAAGVLAQVLQNDGDTVVADQVIATID-TEAKAGAAAAAAGAADVQSAAAPVAAPA 106 E+ ++ +++ +G++V V+ + A+A + + + Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157 Query: 107 PAAQPAAAASSTAATSPAASKLMAEK 132 + + P + E+ Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEE 183
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 60.2 bits (146), Expect = 4e-12 Identities = 58/261 (22%), Positives = 103/261 (39%), Gaps = 12/261 (4%) Query: 61 IGALIFGRLADHFGRRPTLMINIACYSLLELASGFAPSLAALLVLRTLFGVAMGGEWGVG 120 A + G L+D FGRRP L++++A ++ AP L L + R + G+ G V Sbjct: 58 ACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGIT-GATGAVA 116 Query: 121 SALTMETVPPRARGAVSGLLQAGYPSGYLLASVVFGLLYPYIGWRGMFMIGVLPALLVLY 180 A + R G + A + G + V+ GL+ + F L L L Sbjct: 117 GAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLT 176 Query: 181 VRAKVPES-PAWKQMEKRARPGLVATLKQNWKLSIYAVVLMTAF--NFFSHGTQDLYPTF 237 +PES ++ +R +A+ + +++ A ++ F L+ F Sbjct: 177 GCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIF 236 Query: 238 LREQHHFDPHTVSWITIVLNI-GAIVGGLTFGWLSERIGRRRAI---FIAAMIALPVLPL 293 ++ H+D T+ I ++ + G ++ R+G RRA+ IA +L Sbjct: 237 GEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAF 296 Query: 294 ----WAFSTGALALAAGAFLM 310 W + LA+G M Sbjct: 297 ATRGWMAFPIMVLLASGGIGM 317
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 67.3 bits (164), Expect = 3e-15 Identities = 21/83 (25%), Positives = 35/83 (42%) Query: 4 RQASRQSGGTKARILDAAEDLFIEHGFEAMSMRQITSRAAVNLAAVNYHFGSKEALIHAM 63 R+ +++ T+ ILD A LF + G + S+ +I A V A+ +HF K L + Sbjct: 3 RKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSEI 62 Query: 64 LSRRLDQLNEERLRILDRFDAQL 86 + E L +F Sbjct: 63 WELSESNIGELELEYQAKFPGDP 85
>cloacin#Cloacin signature. Length = 551 Score = 28.9 bits (64), Expect = 0.018 Identities = 25/85 (29%), Positives = 26/85 (30%), Gaps = 7/85 (8%) Query: 76 GRGPRAGGAHGGGGRPGGREGGGHGPYGSHG----GSREPRGDGGGYGARESRGDGGYGS 131 GRG G G GG G G G S G P G G G G GG G Sbjct: 6 GRGHNTGAHSTSGNINGGPTGLGVGGGASDGSGWSSENNPWGGGSGSGIHWG---GGSGH 62 Query: 132 RESRGDGGYGSREPRGDGGYGSREP 156 G+G G G P Sbjct: 63 GNGGGNGNSGGGSGTGGNLSAVAAP 87
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 27.7 bits (61), Expect = 0.047 Identities = 19/108 (17%), Positives = 32/108 (29%), Gaps = 9/108 (8%) Query: 113 LFQQKAFWRVIRTASEARAEAVYRDFAKQSETLAVNELQAAKLESQKALTDRQIAVA--- 169 ++A V + + T K E K T++ V Sbjct: 1067 EVAKEAKSNVKANTQTNEVAQSGSETKETQTTETKETATVEKEEKAKVETEKTQEVPKVT 1126 Query: 170 ------QERASRLQADLSIAREQRAAVATRQKDKLDETVALREQKSER 211 QE++ +Q ARE V ++ T A EQ ++ Sbjct: 1127 SQVSPKQEQSETVQPQAEPARENDPTVNIKEPQSQTNTTADTEQPAKE 1174
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 298 bits (765), Expect = 1e-98 Identities = 130/475 (27%), Positives = 205/475 (43%), Gaps = 53/475 (11%) Query: 19 ADIVDRVARCMSSFDVEVIRADN-EELSAERTAMRPSLAIISVSMIE-SGAAFLRTWQAE 76 A I + + +S +V N L A L + V M + + L + Sbjct: 13 AAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENAFDLLPRIKKA 72 Query: 77 -IGMPVVWVGA--------------ARDHDPSLYPPEYSHILPLDFTCAELRGMISKLAV 121 +PV+ + A A D+ P P + + ++ + +++ Sbjct: 73 RPDLPVLVMSAQNTFMTAIKASEKGAYDYLPK--PFDLTELIGIIGRA------LAEPKR 124 Query: 122 QLRAHAAKALEPSTLVAHSDCMQALLQEVDTFADCDTNVLLHGETGVGKERIAQLLHEKH 181 + + + LV S MQ + + + D +++ GE+G GKE +A+ LH+ + Sbjct: 125 RPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELVARALHD-Y 183 Query: 182 SRYGMGEFVPVNCGAIPDGLFESLFFGHAKGSFTGAVGTHKGYFEQAAGGTLFLDEVGDL 241 + G FV +N AIP L ES FGH KG+FTGA G FEQA GGTLFLDE+GD+ Sbjct: 184 GKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLFLDEIGDM 243 Query: 242 PLYQQVKLLRVLEDGAVLRIGATAPVKVDFRLVAASNKKLPQLVKDGLFRADLYYRLAVI 301 P+ Q +LLRVL+ G +G P++ D R+VAA+NK L Q + GLFR DLYYRL V+ Sbjct: 244 PMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDLYYRLNVV 303 Query: 302 ELSIPSLEERGPVDKIALFKSFVASIVGEDRLAALPELPYWLAEAVADSYFPGNVRELRN 361 L +P L +R D L + FV E + E + +PGNVREL N Sbjct: 304 PLRLPPLRDR-AEDIPDLVRHFVQQAEKEGL--DVKRFDQEALELMKAHPWPGNVRELEN 360 Query: 362 LAERVGV------------------------TVRQTGGWDTVRLQRLIAHARSAAQPAPA 397 L R+ + ++ + + + + Sbjct: 361 LVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFG 420 Query: 398 ESAPDVFVDRSKWDMTERNRVIAALDANGWRRQDTAQHLGISRKVLWEKMRKYQI 452 ++ P + E ++AAL A + A LG++R L +K+R+ + Sbjct: 421 DALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475
>TCRTETOQM#Tetracycline resistance protein TetO/TetQ/TetM family signature. Length = 639 Score = 31.0 bits (70), Expect = 0.011 Identities = 16/79 (20%), Positives = 27/79 (34%), Gaps = 17/79 (21%) Query: 104 LLQSLAQIASERPALYISGEESGAQIALRAQRLALLEGGASAADLKLLAEIQLEKIQATI 163 LL +L +I+ P L + + +I L L ++Q+E A + Sbjct: 361 LLDALLEISDSDPLLRYYVDSATHEIILS-----------------FLGKVQMEVTCALL 403 Query: 164 DAERPDVAVIDSIQTIYSE 182 + I IY E Sbjct: 404 QEKYHVEIEIKEPTVIYME 422
>ALARACEMASE#Alanine racemase signature. Length = 356 Score = 438 bits (1127), Expect = e-156 Identities = 207/353 (58%), Positives = 270/353 (76%) Query: 1 MPRPISATIHTAALANNLSVVRRHAAQSKVWAIVKANAYGHGLARVFPGLRGTDGFGLLD 60 M RPI A++ AL NLS+VR+ A ++VW++VKANAYGHG+ R++ + TDGF LL+ Sbjct: 1 MTRPIQASLDLQALKQNLSIVRQAATHARVWSVVKANAYGHGIERIWSAIGATDGFALLN 60 Query: 61 LDEAVKLRELGWAGPILLLEGFFRSTDIDVIDRYSLTTAVHNDEQMRMLETARLSKPVNV 120 L+EA+ LRE GW GPIL+LEGFF + D+++ D++ LTT VH++ Q++ L+ ARL P+++ Sbjct: 61 LEEAITLRERGWKGPILMLEGFFHAQDLEIYDQHRLTTCVHSNWQLKALQNARLKAPLDI 120 Query: 121 QLKMNSGMNRLGYTPEKYRAAWERARACPGIGQITLMTHFSDADGERGVAEQMATFERGA 180 LK+NSGMNRLG+ P++ W++ RA +G++TLM+HF++A+ G++ MA E+ A Sbjct: 121 YLKVNSGMNRLGFQPDRVLTVWQQLRAMANVGEMTLMSHFAEAEHPDGISGAMARIEQAA 180 Query: 181 QGIAGARSFANSAAVLWHPSAHFDWVRPGIMLYGASPSGRAADIADRGLKPTMTLASELI 240 +G+ RS +NSAA LWHP AHFDWVRPGI+LYGASPSG+ DIA+ GL+P MTL+SE+I Sbjct: 181 EGLECRRSLSNSAATLWHPEAHFDWVRPGIILYGASPSGQWRDIANTGLRPVMTLSSEII 240 Query: 241 AVQTLAKGQAVGYGSMFVAEDTMRIGVVACGYADGYPRIAPEGTPVVVDGVRTRIVGRVS 300 VQTL G+ VGYG + A D RIG+VA GYADGYPR AP GTPV+VDGVRT VG VS Sbjct: 241 GVQTLKAGERVGYGGRYTARDEQRIGIVAAGYADGYPRHAPTGTPVLVDGVRTMTVGTVS 300 Query: 301 MDMLTVDLTPVPQAGVGARVELWGETLPIDDVAARCMTVGYELMCAVAPRVPV 353 MDML VDLTP PQAG+G VELWG+ + IDDVAA TVGYELMCA+A RVPV Sbjct: 301 MDMLAVDLTPCPQAGIGTPVELWGKEIKIDDVAAAAGTVGYELMCALALRVPV 353
>TCRTETB#Tetracycline resistance protein TetB signature. Length = 458 Score = 29.1 bits (65), Expect = 0.040 Identities = 31/139 (22%), Positives = 54/139 (38%), Gaps = 4/139 (2%) Query: 29 FFSSLADSALLIAAIALLKDLHAPNWMIPLLKLFFVLSYVVLAAFVGAFADSRPKGHVMF 88 FFS L + L ++ + D + P + F+L++ + A G +D ++ Sbjct: 24 FFSVLNEMVLNVSLPDIANDFNKPPASTNWVNTAFMLTFSIGTAVYGKLSDQLGIKRLLL 83 Query: 89 ITNSIKVVGCLIMLFGAHP----LIAYGIVGFGAAAYSPAKYGILTELLPPERLVAANGW 144 I G +I G ++A I G GAAA+ ++ +P E A G Sbjct: 84 FGIIINCFGSVIGFVGHSFFSLLIMARFIQGAGAAAFPALVMVVVARYIPKENRGKAFGL 143 Query: 145 IEGTTVGSIILGTVLGGAL 163 I +G +GG + Sbjct: 144 IGSIVAMGEGVGPAIGGMI 162
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 44.2 bits (104), Expect = 3e-08 Identities = 21/71 (29%), Positives = 32/71 (45%) Query: 79 VAPVAQRSGVGLALLREAVRIARAERLDGVLLEVRPSNPRAIRLYERFGFVSVGRRRNYY 138 VA ++ GVG ALL +A+ A+ G++LE + N A Y + F+ Y Sbjct: 97 VAKDYRKKGVGTALLHKAIEWAKENHFCGLMLETQDINISACHFYAKHHFIIGAVDTMLY 156 Query: 139 PAKHRSREDAI 149 + E AI Sbjct: 157 SNFPTANEIAI 167
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 39.4 bits (92), Expect = 2e-05 Identities = 57/271 (21%), Positives = 97/271 (35%), Gaps = 13/271 (4%) Query: 74 AFTLPIALFALLSGVAADAWDRRTVMLLSQALMFSVALCLVALAAAGAMTPARLLVCMFV 133 + L A + G +D + RR V+L+S +V ++A A + L + V Sbjct: 51 LYALMQFACAPVLGALSDRFGRRPVLLVS-LAGAAVDYAIMATAPFLWV----LYIGRIV 105 Query: 134 GGCAGAMFQPAWQSAVTEQVPARELSAAIALDSFSMNFARTAGPALGGFIVASVSPNAAF 193 G GA + + + E + S F AGP LGG + SP+A F Sbjct: 106 AGITGATG-AVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLM-GGFSPHAPF 163 Query: 194 V---LSGLSYAGLIYVLSRSIRGAAARPPVRERLATMLVQGVRYCGRARGIRGTLIRSSL 250 L RP R A + R+ + + + Sbjct: 164 FAAAALNGLNFLTGCFLLPESHKGERRP--LRREALNPLASFRWARGMTVVAALMAVFFI 221 Query: 251 FGFLGSPVWALLPLFAKTQFGGEARTYGVLLASFGA-GAASGALGGAAGRARLGREALVR 309 +G AL +F + +F +A T G+ LA+FG + + A+ ARLG + Sbjct: 222 MQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALM 281 Query: 310 LCTLTFAAGMLATAWSPCQAVAMLGLAVAGG 340 L + G + A++ +A + + Sbjct: 282 LGMIADGTGYILLAFATRGWMAFPIMVLLAS 312 Score = 35.2 bits (81), Expect = 4e-04 Identities = 31/167 (18%), Positives = 58/167 (34%), Gaps = 8/167 (4%) Query: 21 LAALRGPFAYRTFAAIWVAS-LVGNIGGSIQTVAASWLMTSMAPSPTMVSLVQTAFTLPI 79 LA+ R AA+ ++ +G + + T + + AF + Sbjct: 200 LASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILH 259 Query: 80 ALF-ALLSGVAADAWDRRTVMLLSQALMFSVALCLVALAAAGAMTPARLLVCMFVGGCAG 138 +L A+++G A R ++L M + + LA A A ++ + G Sbjct: 260 SLAQAMITGPVAARLGERRALMLG---MIADGTGYILLAFATRGWMAFPIMVLLASG--- 313 Query: 139 AMFQPAWQSAVTEQVPARELSAAIALDSFSMNFARTAGPALGGFIVA 185 + PA Q+ ++ QV + + GP L I A Sbjct: 314 GIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYA 360
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 41.7 bits (98), Expect = 4e-06 Identities = 17/126 (13%), Positives = 41/126 (32%), Gaps = 21/126 (16%) Query: 87 TVRSQVDGQITHVRFHEGQQVRAGDVLVEIDRRALQATADQATAKLEQDKATLANARLEL 146 ++ + + + EG+ VR GDVL+++ +A + + L Q + ++ Sbjct: 98 EIKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQILS 157 Query: 147 ----------------ARHQRLAEMNAAPVQML-----DTWKARVNELHAQIRGDQAAVQ 185 Q ++E + L TW+ + + + +A Sbjct: 158 RSIELNKLPELKLPDEPYFQNVSEEEVLRLTSLIKEQFSTWQNQKYQKELNLDKKRAERL 217 Query: 186 NARVAV 191 + Sbjct: 218 TVLARI 223 Score = 29.0 bits (65), Expect = 0.035 Identities = 14/94 (14%), Positives = 39/94 (41%), Gaps = 11/94 (11%) Query: 113 LVEIDRRALQATADQAT--AKLEQDKATLANARLELARHQRLAEMNAAPVQMLDTWKARV 170 ++E + + ++A + ++LEQ ++ + +A+ E +L + + ++ Sbjct: 254 VLEQENKYVEAVNELRVYKSQLEQIESEILSAKEEYQLVTQLFK---------NEILDKL 304 Query: 171 NELHAQIRGDQAAVQNARVAVDYTTIRAPISGRI 204 + I + + IRAP+S ++ Sbjct: 305 RQTTDNIGLLTLELAKNEERQQASVIRAPVSVKV 338
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 755 bits (1950), Expect = 0.0 Identities = 272/1033 (26%), Positives = 495/1033 (47%), Gaps = 26/1033 (2%) Query: 9 FIRYPVATCLMTAGILFAGVAAYFHLPVAPLPQVEFPTIQVSAVLPGADPVSVASTLAQP 68 FIR P+ ++ ++ AG A LPVA P + P + VSA PGAD +V T+ Q Sbjct: 5 FIRRPIFAWVLAIILMMAGALAILQLPVAQYPTIAPPAVSVSANYPGADAQTVQDTVTQV 64 Query: 69 LETQFSKIPYVTQMTSQSTLS-STSIVLQFSLERSIDAAANDVQSAIDAAAAQLPADLPS 127 +E + I + M+S S + S +I L F D A VQ+ + A LP ++ Sbjct: 65 IEQNMNGIDNLMYMSSTSDSAGSVTITLTFQSGTDPDIAQVQVQNKLQLATPLLPQEVQQ 124 Query: 128 PPTFQKVNPADSPIMLLSAISSTLPLTTID--DYVETRLTKSLSQIDGVGSVSIGGQQKP 185 + S +M+ +S T D DYV + + +LS+++GVG V + G Q Sbjct: 125 QGIS-VEKSSSSYLMVAGFVSDNPGTTQDDISDYVASNVKDTLSRLNGVGDVQLFGAQY- 182 Query: 186 SIRIQLDPVKLASRGLSSEDVRRALSGLSGVNPKGVFNGT------TRSYTIYTNGQLTE 239 ++RI LD L L+ DV L + G GT + +I + Sbjct: 183 AMRIWLDADLLNKYKLTPVDVINQLKVQNDQIAAGQLGGTPALPGQQLNASIIAQTRFKN 242 Query: 240 PAQWNDAIV-AYRDGTPVRIRDIGQAVLGPEDNTLAAWIDGRRAISVGIYKKPGANTVST 298 P ++ + DG+ VR++D+ + LG E+ + A I+G+ A +GI GAN + T Sbjct: 243 PEEFGKVTLRVNSDGSVVRLKDVARVELGGENYNVIARINGKPAAGLGIKLATGANALDT 302 Query: 299 VDKIRARLPELEASLPPSLKIAVLADRTQTIRASLLDIELTLLLNVVLVVVVIYAFLGSV 358 I+A+L EL+ P +K+ D T ++ S+ ++ TL ++LV +V+Y FL ++ Sbjct: 303 AKAIKAKLAELQPFFPQGMKVLYPYDTTPFVQLSIHEVVKTLFEAIMLVFLVMYLFLQNM 362 Query: 359 RTTIIPAVTVPVSLFGACALMWVCGYSLDNISLMAMTIAVGFVVDDAIVMVENIARH-VE 417 R T+IP + VPV L G A++ GYS++ +++ M +A+G +VDDAIV+VEN+ R +E Sbjct: 363 RATLIPTIAVPVVLLGTFAILAAFGYSINTLTMFGMVLAIGLLVDDAIVVVENVERVMME 422 Query: 418 AGELPLQAALKGLSETSFTIASISLSLVAVLLPLLLMSGIIGRMFREFAVTLSMTIIVSA 477 P +A K +S+ + I++ L AV +P+ G G ++R+F++T+ + +S Sbjct: 423 DKLPPKEATEKSMSQIQGALVGIAMVLSAVFIPMAFFGGSTGAIYRQFSITIVSAMALSV 482 Query: 478 FVSLTLTPMMASYLLRAHRHDAGRPPRP--GLFERAFARTAAAYERALDVALRHRFVTLC 535 V+L LTP + + LL+ + G F F + Y ++ L L Sbjct: 483 LVALILTPALCATLLKPVSAEHHENKGGFFGWFNTTFDHSVNHYTNSVGKILGSTGRYLL 542 Query: 536 
AFLASVAASVFLYVGIPKGFFPQQDTGVITGISEAAQTISVEDMARHSMALAAIIRADPA 595 + VA V L++ +P F P++D GV + + + E + + + Sbjct: 543 IYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQKVLDQVTDYYLKNEK 602 Query: 596 --VEHCQMAVGGSAYAGTTVNNGRWYITLKPRDQRDA---TADEVIRRLRPQFAKVPGVR 650 VE V G +++G N G +++LKP ++R+ +A+ VI R + + K+ Sbjct: 603 ANVES-VFTVNGFSFSGQAQNAGMAFVSLKPWEERNGDENSAEAVIHRAKMELGKIRDGF 661 Query: 651 MYLQAAQDVIIGARLARTQYQLTLQSA-DVGALTTWAPRLLARLSGLP-QLRDVASDQQV 708 + ++ ++L Q+ ALT +LL + P L V + Sbjct: 662 VIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQLLGMAAQHPASLVSVRPNGLE 721 Query: 709 NGSALSVAIDRDQAARYGLTPEAIDGTLYDAFGSRQVAQYFTQLSTYKVIMETLPSLQRD 768 + + + +D+++A G++ I+ T+ A G V + + K+ ++ + Sbjct: 722 DTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDFIDRGRVKKLYVQADAKFRML 781 Query: 769 PGTLDRIYMKAPSGALVPLSSVARWTTDTVQPLSVNHQSHFPSVTISFNLAPGVSLGEAT 828 P +D++Y+++ +G +VP S+ + + + PS+ I APG S G+A Sbjct: 782 PEDVDKLYVRSANGEMVPFSAFTT-SHWVYGSPRLERYNGLPSMEIQGEAAPGTSSGDAM 840 Query: 829 AAIEAARASLRMPPAVVGSFQGTAQAFQSTLATMPMLILSALIVAYLVLGALHGSFIHPW 888 A +E + ++P + + G + + + P L+ + +V +L L AL+ S+ P Sbjct: 841 ALME--NLASKLPAGIGYDWTGMSYQERLSGNQAPALVAISFVVVFLCLAALYESWSIPV 898 Query: 889 TILSTLPSAGVGAIATLWLFKYDFNLIALIGVILLIGIVKKNGIMMVDFAIAATRERNMT 948 +++ +P VG + LF ++ ++G++ IG+ KN I++V+FA + Sbjct: 899 SVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIGLSAKNAILIVEFAKDLMEKEGKG 958 Query: 949 SLDAIRSACLLRLRPIMMTTMTALFGALPLMFTPGMGSELRQPLGYAMVGGLLVSQVLTL 1008 ++A A +RLRPI+MT++ + G LPL + G GS + +G ++GG++ + +L + Sbjct: 959 VVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAGSGAQNAVGIGVMGGMVSATLLAI 1018 Query: 1009 FTTPVIYLYLDTL 1021 F PV ++ + Sbjct: 1019 FFVPVFFVVIRRC 1031 Score = 90.3 bits (224), Expect = 2e-20 Identities = 78/509 (15%), Positives = 163/509 (32%), Gaps = 37/509 (7%) Query: 4 NLFAVFIRYPVATCLMTAGILFAGVAAYFHLPVAPLPQVEFPTIQVSAVLP-GADPVSVA 62 N + L+ A I+ V + LP + LP+ + LP GA Sbjct: 528 NSVGKILGSTGRYLLIYALIVAGMVVLFLRLPSSFLPEEDQGVFLTMIQLPAGATQERTQ 587 Query: 63 STLAQPLETQF---SKIPYVTQMTSQSTLSSTS-------IVLQFSLERSIDAAANDVQS 112 L Q + + + S + 
+ L+ ER + N ++ Sbjct: 588 KVLDQVTDYYLKNEKANVESVFTVNGFSFSGQAQNAGMAFVSLKPWEER--NGDENSAEA 645 Query: 113 AIDAAAAQLPADLPSPPTFQKVNPADSPIMLLSAIS---------STLPLTTIDDYVETR 163 I A + L + I+ L + + L + Sbjct: 646 VIHRAKME----LGKIRDGFVIPFNMPAIVELGTATGFDFELIDQAGLGHDALTQARNQL 701 Query: 164 LTKSLSQIDGVGSVSIGGQQ-KPSIRIQLDPVKLASRGLSSEDVRRALSGLSGVNPKGVF 222 L + + SV G + ++++D K + G+S D+ + +S G F Sbjct: 702 LGMAAQHPASLVSVRPNGLEDTAQFKLEVDQEKAQALGVSLSDINQTISTALGGTYVNDF 761 Query: 223 NGTTRSYTIYTNGQ---LTEPAQWNDAIVAYRDGTPVRIRDIGQAVLGPEDNTLAAWIDG 279 R +Y P + V +G V + L +G Sbjct: 762 IDRGRVKKLYVQADAKFRMLPEDVDKLYVRSANGEMVPFSAFTTSHWVYGSPRLER-YNG 820 Query: 280 RRAISVGIYKKPGANTVSTVDKIRARLPELEASLPPSLKIAVLADRTQTIRASLLDIELT 339 ++ + PG ++ A + L + LP + + R S Sbjct: 821 LPSMEIQGEAAPG----TSSGDAMALMENLASKLPAGIGYDW-TGMSYQERLSGNQAPAL 875 Query: 340 LLLNVVLVVVVIYAFLGSVRTTIIPAVTVPVSLFGACALMWVCGYSLDNISLMAMTIAVG 399 + ++ V+V + + A S + + VP+ + G + D ++ + +G Sbjct: 876 VAISFVVVFLCLAALYESWSIPVSVMLVVPLGIVGVLLAATLFNQKNDVYFMVGLLTTIG 935 Query: 400 FVVDDAIVMVENI-ARHVEAGELPLQAALKGLSETSFTIASISLSLVAVLLPLLLMSGII 458 +AI++VE + G+ ++A L + I SL+ + +LPL + +G Sbjct: 936 LSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFILGVLPLAISNGAG 995 Query: 459 GRMFREFAVTLSMTIIVSAFVSLTLTPMM 487 + + ++ + +++ P+ Sbjct: 996 SGAQNAVGIGVMGGMVSATLLAIFFVPVF 1024
>IGASERPTASE#IgA-specific serine endopeptidase (S6) signature. Length = 1541 Score = 31.2 bits (70), Expect = 0.015 Identities = 34/191 (17%), Positives = 52/191 (27%), Gaps = 22/191 (11%) Query: 335 SDRLSLFADVGYTRNFHG--AAGGMNAFDSDVEMFSIGADYKLSEASRAGALLSSGNANG 392 S+ + L Y RN + A N + + Y G L G Sbjct: 1331 SNNVQLGGVFTYVRNSNNFDKATSKNTL----AQVNFYSKYYADNHWYLGIDLGYGKFQS 1386 Query: 393 SLAGGQGR-IGLHAYRLGVY--HAFERAGLFVRAYAGAGWSR-----YRL--DRAAVLPG 442 L H + G+ AF + G +S + L R V P Sbjct: 1387 KLQTNHNAKFARHTAQFGLTAGKAFNLGNFGITPIVGVRYSYLSNADFALDQARIKVNPI 1446 Query: 443 AVRASTSGFDFGALVKAGYLFALGGVRLGPVADVGYTQLVARGYTEDGDSILAQNVGVQR 502 +V+ + + D Y + LG + P+ Y G A NV Q+ Sbjct: 1447 SVKTAFAQVDLS------YTYHLGEFSVTPILSARYDANQGSGKINVNGYDFAYNVENQQ 1500 Query: 503 LKGVSAGAGVR 513 Sbjct: 1501 QYNAGLKLKYH 1511 Score = 30.4 bits (68), Expect = 0.024 Identities = 29/184 (15%), Positives = 57/184 (30%), Gaps = 34/184 (18%) Query: 342 ADVGYTRNFHGAAGGMNAFDSDVEMFSIGADYKLSEASRAGALLSSGNANGSLAGGQGRI 401 ++ +N+ + F S +G D +S + G + + + + + Sbjct: 1299 SNTSMNKNYSSSQ--YRRFSSKSTQTQLGWDQTISNNVQLGGVFTYVRNSNNFDKATSK- 1355 Query: 402 GLHAYRLGVYHAFERAGLFVRAYA----------GAGWSRYRLDRAAVLPGAVRASTSGF 451 + + + + YA G G + +L A + G Sbjct: 1356 ----------NTLAQVNFYSKYYADNHWYLGIDLGYGKFQSKLQTNHNAKFARHTAQFG- 1404 Query: 452 DFGALVKAGYLFALGGVRLGPVADVGYTQLVARGYTEDGDSILAQNVGVQRLKGVSAGAG 511 + AG F LG + P+ V Y+ L + D I + V+ A A Sbjct: 1405 -----LTAGKAFNLGNFGITPIVGVRYSYLSNADFALDQARIKVNPISVKT-----AFAQ 1454 Query: 512 VRFA 515 V + Sbjct: 1455 VDLS 1458
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 36.0 bits (83), Expect = 2e-04 Identities = 33/119 (27%), Positives = 56/119 (47%), Gaps = 15/119 (12%) Query: 116 IDDERVRRDLDAGKVVIITGFQGV---DPDGHITTL-GRGGSDTSAVAVAAALEADECLI 171 ++ E +++ ++ G +VI +G GV DG I + D + +A + AD +I Sbjct: 174 VEAETIKKLVERGVIVIASGGGGVPVILEDGEIKGVEAVIDKDLAGEKLAEEVNADIFMI 233 Query: 172 YTDVDGVYTTDPRVVEEARRLDSVTFEEMLEMA--------SLGSKVLQ-IRSVEFAGK 221 TDV+G E+ + L V EE+ + S+G KVL IR +E+ G+ Sbjct: 234 LTDVNGAALYYGT--EKEQWLREVKVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGE 290
>SSBTLNINHBTR#Streptomyces subtilisin inhibitor signature. Length = 144 Score = 28.3 bits (62), Expect = 0.027 Identities = 15/50 (30%), Positives = 23/50 (46%) Query: 15 VATAAVAPADAFAATAKTAQSAKGKKSAAKKSLRAASSSAEPRAKGARKR 64 +A+ A APA +A +A G+ +A LRA + + P A G Sbjct: 27 LASPATAPASLYAPSALVLTVGHGESAATAAPLRAVTLTCAPTASGTHPA 76
>RTXTOXIND#Gram-negative bacterial RTX secretion protein D signature. Length = 478 Score = 35.6 bits (82), Expect = 5e-04 Identities = 12/58 (20%), Positives = 22/58 (37%) Query: 49 VPSPSAGTVKEVKVKVGDAVSQGSLIVLLDGAQAAAQPAQANGAATSAAQPAAAPAAA 106 + VKE+ VK G++V +G +++ L A A + + A Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEADTLKTQSSLLQARLEQTRYQIL 156 Score = 31.0 bits (70), Expect = 0.013 Identities = 12/37 (32%), Positives = 21/37 (56%) Query: 162 VPSPAAGVVKDIKVKVGDAVSEGSLIVVLEASGGAAA 198 + +VK+I VK G++V +G +++ L A G A Sbjct: 99 IKPIENSIVKEIIVKEGESVRKGDVLLKLTALGAEAD 135
>PF06580#Sensor histidine kinase Length = 349 Score = 31.8 bits (72), Expect = 0.012 Identities = 18/85 (21%), Positives = 31/85 (36%), Gaps = 18/85 (21%) Query: 700 PVLIEQVLV-NLMKNAAEAMQEARPQAENGVIRVVADLEAGFVDIRVIDQGPGVDEATAE 758 P ++ Q LV N +K+ + G I + + G V + V + G + T E Sbjct: 256 PPMLVQTLVENGIKHGIA------QLPQGGKILLKGTKDNGTVTLEVENTGSLALKNTKE 309 Query: 759 RLFEPFYSTKSDGMGMGLNICRSII 783 S G G+ N+ + Sbjct: 310 ----------STGTGL-QNVRERLQ 323
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 113 bits (283), Expect = 2e-31 Identities = 39/153 (25%), Positives = 67/153 (43%), Gaps = 4/153 (2%) Query: 11 TVFVVDDDEAVRDSLRWLLEANGYRVQCFSSAEQFLDAYQPAQQAGQIACLILDVRMSGM 70 T+ V DDD A+R L L GY V+ S+A AG ++ DV M Sbjct: 5 TILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIA----AGDGDLVVTDVVMPDE 60 Query: 71 SGLELQERLIAENAALPIIFVTGHGDVPMAVSTMKKGAMDFIEKPFDEAELRKLVERMLE 130 + +L R+ LP++ ++ A+ +KGA D++ KPFD EL ++ R L Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 131 KARNESKSVQEQRAASERLSKLTAREQQVLERI 163 + + +++ L +A Q++ + Sbjct: 121 EPKRRPSKLEDDSQDGMPLVGRSAAMQEIYRVL 153
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 58.3 bits (141), Expect = 7e-12 Identities = 32/160 (20%), Positives = 56/160 (35%), Gaps = 27/160 (16%) Query: 1 MKILVTGANGQVGWELARSLAVLGQVV-----------PLTRE--------------QAD 35 MK LVTGA G +G+ +++ L G V ++ + D Sbjct: 1 MKYLVTGAAGFIGFHVSKRLLEAGHQVVGIDNLNDYYDVSLKQARLELLAQPGFQFHKID 60 Query: 36 LGRPETLARIVEDAKPDVVVNAAAYTAVDAAETDGAAANVINGEA-VGVLAAATKRVGGL 94 L E + + + V + AV + + A N + +L Sbjct: 61 LADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHNKIQH 120 Query: 95 FVHYSTDYVFDGTKPSPYIETDPT-CPVNAYGASKLLGEL 133 ++ S+ V+ + P+ D PV+ Y A+K EL Sbjct: 121 LLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANEL 160
>NUCEPIMERASE#Nucleotide sugar epimerase signature. Length = 334 Score = 174 bits (443), Expect = 7e-54 Identities = 90/350 (25%), Positives = 136/350 (38%), Gaps = 45/350 (12%) Query: 2 ILVTGGAGFIGANFVLDWLAQSDEAVLNVDKLT--YAGNLGTLK-SLQGNPKHVFARVDI 58 LVTG AGFIG + L + V+ +D L Y +L + L P F ++D+ Sbjct: 3 YLVTGAAGFIGFHVSKRLLEAGHQ-VVGIDNLNDYYDVSLKQARLELLAQPGFQFHKIDL 61 Query: 59 CDRAAIDALLAQHKPRAIVHFAAESHVDRSIHGPADFVQTNVVGTFTLLEAARQYWSALG 118 DR + L A + V S+ P + +N+ G +LE R Sbjct: 62 ADREGMTDLFASGHFERVFISPHRLAVRYSLENPHAYADSNLTGFLNILEGCRHN----- 116 Query: 119 TDAKAAFRFLHVSTDEVFGSLSPADPQFSETTPYA-PNSPYSATKAGSDHLVRAYHHTYG 177 L+ S+ V+G L+ P FS P S Y+ATK ++ + Y H YG Sbjct: 117 ----KIQHLLYASSSSVYG-LNRKMP-FSTDDSVDHPVSLYAATKKANELMAHTYSHLYG 170 Query: 178 LPVLTTNCSNNYGPYQFPEKLIPLMIANALGGKPLPVYGDGQNVRDWLYVGDHCSAIREV 237 LP YGP+ P+ + L GK + VY G+ RD+ Y+ D AI + Sbjct: 171 LPATGLRFFTVYGPWGRPDMALFKFTKAMLEGKSIDVYNYGKMKRDFTYIDDIAEAIIRL 230 Query: 238 L------------------ARGVPGETYNVGGWNEKKNLDVVHTLCDLLD-EARPKAGGS 278 A P YN+G + + +D + L D L EA+ Sbjct: 231 QDVIPHADTQWTVETGTPAASIAPYRVYNIGNSSPVELMDYIQALEDALGIEAKKN---- 286 Query: 279 YRDQITYVTDRPGHDRRYAIDARKLERELGWKPAETFETGLAKTVRWYLD 328 + +PG + D + L +G+ P T + G+ V WY D Sbjct: 287 ------MLPLQPGDVLETSADTKALYEVIGFTPETTVKDGVKNFVNWYRD 330
>SECA#SecA protein signature. Length = 901 Score = 33.7 bits (77), Expect = 0.001 Identities = 27/87 (31%), Positives = 42/87 (48%), Gaps = 9/87 (10%) Query: 116 LNRRLPRAVARTREGDFSLNGLLGFDLFGKTVGVIGTGLI--GSVFARIMTGFGMRVLAH 173 L +P A A RE + G+ FD V ++G G++ A + TG G + L Sbjct: 60 LENLIPEAFAVVREASKRVFGMRHFD-----VQLLG-GMVLNERCIAEMRTGEG-KTLTA 112 Query: 174 SLPPHDDALIALGVRYVPLDALLAEAD 200 +LP + +AL GV V ++ LA+ D Sbjct: 113 TLPAYLNALTGKGVHVVTVNDYLAQRD 139
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 49.4 bits (118), Expect = 1e-08 Identities = 94/399 (23%), Positives = 148/399 (37%), Gaps = 37/399 (9%) Query: 5 LFALAVAAFGIGTTEFVIMGLLPNVARDLGVSIPAA---GMLVSGYALGVTIGAPILAVV 61 L +A+ A GIG +IM +LP + RDL S G+L++ YAL AP+L + Sbjct: 11 LSTVALDAVGIG----LIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGAL 66 Query: 62 TAKMPRKATLLALIGVFIVGNLFCAIAPGYATLMVARVVTAFCHGAFFGIGSVVASNLVA 121 + + R+ LL + V A AP L + R+V G+ +A ++ Sbjct: 67 SDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIA-DITD 125 Query: 122 PNKRAQAIALMFTGLTLANVLGVPLGTALGQALGWRATFWAVTGIGALAAAALAFCVPKR 181 ++RA+ M V G LG +G A F+A + L F +P+ Sbjct: 126 GDERARHFGFMSACFGFGMVAGPVLGGLMGG-FSPHAPFFAAAALNGLNFLTGCFLLPES 184 Query: 182 LEMPAAGIAREFGVLRNPQVLMVLGISVLASASLFTVFTYIAPI-----------LEDVT 230 + + RE NP + A+L VF + + ED Sbjct: 185 HKGERRPLRRE---ALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRF 241 Query: 231 GFTPHDVTLVLLLFG-LGLTVGGTVGGKLADW---RRMPSLVATLASIGVVLAAFAGTMR 286 + + + L FG L + G +A RR L G +L AFA Sbjct: 242 HWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGW 301 Query: 287 TPLPALVTIFVWGVLAFAIVPPLQILIVDRAS-HAPNLASTLNQGAFNLGNALGAWLGGT 345 P +V + G+ +P LQ ++ + +L + +G L Sbjct: 302 MAFPIMVLLASGGI----GMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTA 357 Query: 346 AIHAGVPLAK-LPW-AGAAL---AMAALALTLWSASLER 379 A + W AGAAL + AL LWS + +R Sbjct: 358 IYAASITTWNGWAWIAGAALYLLCLPALRRGLWSGAGQR 396
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 46.7 bits (111), Expect = 9e-08 Identities = 95/395 (24%), Positives = 152/395 (38%), Gaps = 25/395 (6%) Query: 12 LILSVAVVGLGTGATLPLTALALTEAGHGTRIV---GILTAAQAGGGLAVVPFVTAITKR 68 ++ +VA+ +G G +P+ L + H + GIL A A A P + A++ R Sbjct: 10 ILSTVALDAVGIGLIMPVLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDR 69 Query: 69 LGARQVIVASVVVLAAATALMQFTSNLVVWGVLRVVCGAALMLLFTIGEAWVNQLADDAT 128 G R V++ S+ A A+M L V + R+V G G A++ + D Sbjct: 70 FGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAG-AYIADITDGDE 128 Query: 129 RGRVVAIYATNFTLFQMAGPVLVSQIAGMT-HVRFALSGALFLLAL--------PSLASI 179 R R + F +AGPVL + G + H F + AL L S Sbjct: 129 RARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGE 188 Query: 180 RKTPIADEPHHDAHDRWTRVIPKMPALVVGTAFFALFDTLALSLLPIFAMAR--GVASEA 237 R+ + + A RW R + + AL+ L + +L IF R A+ Sbjct: 189 RRPLRREALNPLASFRWARGMTVVAALMAVFFIMQLVGQVPAALWVIFGEDRFHWDATTI 248 Query: 238 AVLFAAILLFGDTAMQFPIGWLADKLGRERVHLGAGCVVLALLPLLPAVVTTPWLCWPLL 297 + AA + A G +A +LG ER L G + +L A T W+ +P++ Sbjct: 249 GISLAAFGILHSLAQAMITGPVAARLG-ERRALMLGMIADGTGYILLAFATRGWMAFPIM 307 Query: 298 FVLGAAAGSVYTL----SLVACGERFRGSALVTASSLVSASWSAASFGGPLVAGALMEQF 353 +L + + L S ER +G + ++L S S GPL+ A+ Sbjct: 308 VLLASGGIGMPALQAMLSRQVDEER-QGQLQGSLAALT----SLTSIVGPLLFTAIYAAS 362 Query: 354 GGDALIGVLIVSAIAFVGAALWERRALPMQAARRG 388 I A ++ RR L A +R Sbjct: 363 ITTWNGWAWIAGAALYLLCLPALRRGLWSGAGQRA 397
>TCRTETA#Tetracycline resistance protein signature. Length = 399 Score = 43.7 bits (103), Expect = 9e-07 Identities = 78/398 (19%), Positives = 125/398 (31%), Gaps = 59/398 (14%) Query: 50 VAPSVIAEWGVKKQA---LGPVFSASLFGMLLGALGLSVLADRIGRRPVLIGATLFFALA 106 V P ++ + G + + A L L+DR GRRPVL+ + A+ Sbjct: 27 VLPGLLRDLVHSNDVTAHYGILLALYALMQFACAPVLGALSDRFGRRPVLLVSLAGAAVD 86 Query: 107 MLATPFATSIPILIALRFVTGLGLGCIMPNAMALVGECSPGAHRVKRM----MIVSCGFT 162 A + +L R V G+ G A A + + + G R + G Sbjct: 87 YAIMATAPFLWVLYIGRIVAGIT-GATGAVAGAYIADITDGDERARHFGFMSACFGFGMV 145 Query: 163 LGAALGGFVSAALIPAFGWRAVFFVGGAVPLALAAAMAASLPESPQLLVLRGRHDAARAW 222 G LGG + F A FF A+ LPES H R Sbjct: 146 AGPVLGGLMGG-----FSPHAPFFAAAALNGLNFLTGCFLLPES---------HKGERRP 191 Query: 223 LAKFAPRLAVPPDTRLVVREAGPRGAPVAELFRSGRARVTLLLWAINF-MNLIDLYFLSN 281 L + A P+A + V L A+ F M L+ + Sbjct: 192 LRREALN-------------------PLASFRWARGMTVVAALMAVFFIMQLVGQVPAAL 232 Query: 282 WLPTVMRDAGYASGTAVIVGTVLQTGGVIGTLS----LGWFIERHGFARVLFACFACATI 337 W+ + A +G L G++ +L+ G R G R L Sbjct: 233 WVIFGEDRFHW---DATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGT 289 Query: 338 AIGLIGSVAHAFVWLLAAVFVGGFCVVGGQPAVNALAGHYYPTSLRSTGIGWSLGVGRVG 397 L+ ++ V + + G PA+ A+ + G + + Sbjct: 290 GYILLAFATRGWMAFPIMVLLASGGI--GMPALQAMLSRQVDEERQGQLQGSLAALTSLT 347 Query: 398 SVLGPLVGGQLIA--------LGWSNDALFHAAAVPVL 427 S++GPL+ + A W A + +P L Sbjct: 348 SIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPAL 385
>PHPHTRNFRASE#Phosphoenolpyruvate-protein phosphotransferase signature. Length = 572 Score = 44.0 bits (104), Expect = 3e-07 Identities = 36/178 (20%), Positives = 60/178 (33%), Gaps = 34/178 (19%) Query: 87 RALDAGARTLMFPGVETADEAAHAVRLTRFQAPDAPDGLRGVAGIVRAAAYGMRRDYVQT 146 RA G +MFP + T +E LR I++ + + V Sbjct: 380 RASTYGNLKVMFPMIATLEE------------------LRQAKAIMQEEKDKLLSEGVDV 421 Query: 147 ANAQIATIVQIESARGVDEAERIAATPGVDCVFVGPADL----------SASLGHLGDTK 196 ++ I + +E A A VD +G DL + + +L Sbjct: 422 SD-SIEVGIMVEIPSTAVAANLFA--KEVDFFSIGTNDLIQYTMAADRMNERVSYLYQPY 478 Query: 197 HPDVAAALEHVLAAGRRAGVPVGI---FAADTAGARQSLEAGFRVVALSADVVWLLRA 251 HP + ++ V+ A G VG+ A D L G ++SA + R+ Sbjct: 479 HPAILRLVDMVIKAAHSEGKWVGMCGEMAGDEVAIPLLLGLGLDEFSMSATSILPARS 536
>cloacin#Cloacin signature. Length = 551 Score = 31.2 bits (70), Expect = 0.018 Identities = 22/65 (33%), Positives = 29/65 (44%), Gaps = 1/65 (1%) Query: 680 SGADGASGASGAGGEPTEHANAGGNPAGGGIAGGAAGTANNGSGAAAPGGM-PGANGAAM 738 +G GAS G +E+ GG G GG +G N G + GG G N +A+ Sbjct: 25 TGLGVGGGASDGSGWSSENNPWGGGSGSGIHWGGGSGHGNGGGNGNSGGGSGTGGNLSAV 84 Query: 739 GAPPA 743 AP A Sbjct: 85 AAPVA 89
>GPOSANCHOR#Gram-positive coccus surface protein anchor signature. Length = 539 Score = 28.5 bits (63), Expect = 0.046 Identities = 16/64 (25%), Positives = 22/64 (34%), Gaps = 3/64 (4%) Query: 293 KAAKGKKATKGADKSAKAADKGADKDKGAKPAAAPPVPARSRPAGPAQPAAPLKPATAPS 352 K + +KA A A+A A K+K AK A + + P A P Sbjct: 424 KLTEKEKAELQAKLEAEAK---ALKEKLAKQAEELAKLRAGKASDSQTPDAKPGNKAVPG 480 Query: 353 PGAP 356 G Sbjct: 481 KGQA 484
>SHAPEPROTEIN#Bacterial cell shape determinant MreB/Mbl protein signature. Length = 347 Score = 504 bits (1300), Expect = 0.0 Identities = 247/348 (70%), Positives = 294/348 (84%), Gaps = 2/348 (0%) Query: 1 MFGFLRSYFSNDLAIDLGTANTLIYMRGKGIVLDEPSVVSIRQEGGPNGKKTIQAVGKEA 60 M R FSNDL+IDLGTANTLIY++G+GIVL+EPSVV+IRQ+ K++ AVG +A Sbjct: 1 MLKKFRGMFSNDLSIDLGTANTLIYVKGQGIVLNEPSVVAIRQDRA-GSPKSVAAVGHDA 59 Query: 61 KQMLGKVPGNIEAIRPMKDGVIADFTVTEQMIKQFIKTAHESRMFSPSPRIIICVPCGST 120 KQMLG+ PGNI AIRPMKDGVIADF VTE+M++ FIK H + PSPR+++CVP G+T Sbjct: 60 KQMLGRTPGNIAAIRPMKDGVIADFFVTEKMLQHFIKQVHSNSFMRPSPRVLVCVPVGAT 119 Query: 121 QVERRAIKEAAHGAGASQVYLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVGVISLG 180 QVERRAI+E+A GAGA +V+LIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEV VISL Sbjct: 120 QVERRAIRESAQGAGAREVFLIEEPMAAAIGAGLPVSEATGSMVVDIGGGTTEVAVISLN 179 Query: 181 GIVYKGSVRVGGDKFDEAIVNYIRRNYGMLIGEQTAEAIKKEIGSAFPGSEVKEMEVKGR 240 G+VY SVR+GGD+FDEAI+NY+RRNYG LIGE TAE IK EIGSA+PG EV+E+EV+GR Sbjct: 180 GVVYSSSVRIGGDRFDEAIINYVRRNYGSLIGEATAERIKHEIGSAYPGDEVREIEVRGR 239 Query: 241 NLSEGIPRSFTISSNEILEALTDPLNQIVSSVKIALEQTPPELGADIAERGMMLTGGGAL 300 NL+EG+PR FT++SNEILEAL +PL IVS+V +ALEQ PPEL +DI+ERGM+LTGGGAL Sbjct: 240 NLAEGVPRGFTLNSNEILEALQEPLTGIVSAVMVALEQCPPELASDISERGMVLTGGGAL 299 Query: 301 LRDLDRLLAEETGLPVLVAEDPLTCVVRGSGMALERMDKL-GSIFSYE 347 LR+LDRLL EETG+PV+VAEDPLTCV RG G ALE +D G +FS E Sbjct: 300 LRNLDRLLMEETGIPVVVAEDPLTCVARGGGKALEMIDMHGGDLFSEE 347
>TYPE4SSCAGA#Type IV secretion system CagA exotoxin signature. Length = 1147 Score = 30.8 bits (69), Expect = 0.013 Identities = 29/89 (32%), Positives = 43/89 (48%), Gaps = 5/89 (5%) Query: 395 SNKIAKEIFVTIWDEKAADEGAADRIIEAKGLK-QISDTGALEAIIDEVLAANAKSVEEF 453 +N EIF I E D A KG+K ++SD LE + ++ L KS +EF Sbjct: 648 ANSQKDEIFALINKEANRDARAIAYAQNLKGIKRELSDK--LENV-NKNLKDFDKSFDEF 704 Query: 454 RAGKDKAFNALVGQAMKATKGKANPQQVN 482 + GK+K F+ + +KA KG +N Sbjct: 705 KNGKNKDFSK-AEETLKALKGSVKDLGIN 732
>FLGMOTORFLIN#Flagellar motor switch protein FliN signature. Length = 137 Score = 134 bits (339), Expect = 2e-43 Identities = 78/126 (61%), Positives = 97/126 (76%), Gaps = 3/126 (2%) Query: 38 AMDD-WAAALAEQNQQPIETGATGAGVFRPLSKATASSTHNDIDLILDIPVKMTVELGRT 96 A+DD WA AL EQ ++ A VF+ L S DIDLI+DIPVK+TVELGRT Sbjct: 14 ALDDLWADALNEQKATTTKSAADA--VFQQLGGGDVSGAMQDIDLIMDIPVKLTVELGRT 71 Query: 97 KIAIRNLLQLAQGSVVELDGLAGEPMDVLVNGCLIAQGEVVVVNDKFGIRLTDIITPSER 156 ++ I+ LL+L QGSVV LDGLAGEP+D+L+NG LIAQGEVVVV DK+G+R+TDIITPSER Sbjct: 72 RMTIKELLRLTQGSVVALDGLAGEPLDILINGYLIAQGEVVVVADKYGVRITDIITPSER 131 Query: 157 IRKLNR 162 +R+L+R Sbjct: 132 MRRLSR 137
>TYPE3IMQPROT#Type III secretion system inner membrane Q protein family signature. Length = 86 Score = 68.6 bits (168), Expect = 4e-19 Identities = 26/85 (30%), Positives = 46/85 (54%) Query: 4 ENVMTLAHQAMYIGLLLAAPLLLVALAVGLVVSLFQAATQINEATLSFIPKLLAVAATMV 63 ++++ ++A+Y+ L+L+ +VA +GL+V LFQ TQ+ E TL F KLL V + Sbjct: 2 DDLVFAGNKALYLVLILSGWPTIVATIIGLLVGLFQTVTQLQEQTLPFGIKLLGVCLCLF 61 Query: 64 IAGPWMLSTMIDYLRETLLRVATLG 88 + W ++ Y R+ + G Sbjct: 62 LLSGWYGEVLLSYGRQVIFLALAKG 86
>TYPE3IMRPROT#Type III secretion system inner membrane R protein family signature. Length = 261 Score = 161 bits (409), Expect = 5e-51 Identities = 117/250 (46%), Positives = 158/250 (63%), Gaps = 1/250 (0%) Query: 1 MFSVTYAQLNGWLTAFLWPFVRMLALVAIAPVTGHRSTPVRVKIGLAGFMALVVAPTLPP 60 M VT Q WL + WP +R+LAL++ AP+ RS P RVK+GLA + +AP+LP Sbjct: 1 MLQVTSEQWLSWLNLYFWPLLRVLALISTAPILSERSVPKRVKLGLAMMITFAIAPSLPA 60 Query: 61 MPVATVFSAQGVWIIVNQFLIGAALGFTMQIVFAAIEAAGDIIGLSMGLGFATFFDPHSS 120 V VFS +W+ V Q LIG ALGFTMQ FAA+ AG+IIGL MGL FATF DP S Sbjct: 61 NDVP-VFSFFALWLAVQQILIGIALGFTMQFAFAAVRTAGEIIGLQMGLSFATFVDPASH 119 Query: 121 GATPVMGRFLNAVAILAFLAFDGHLQVFAALVDSFRLVPVSANLLRAAGWQTLVAFGAAI 180 PV+ R ++ +A+L FL F+GHL + + LVD+F +P+ L + + L G+ I Sbjct: 120 LNMPVLARIMDMLALLLFLTFNGHLWLISLLVDTFHTLPIGGEPLNSNAFLALTKAGSLI 179 Query: 181 FEMGLLLALPVVAALLIANLALGILNRAAPQIGIFQVGFPVTMLVGLLLVQLMAPNLIPF 240 F GL+LALP++ LL NLALG+LNR APQ+ IF +GFP+T+ VG+ L+ + P + PF Sbjct: 180 FLNGLMLALPLITLLLTLNLALGLLNRMAPQLSIFVIGFPLTLTVGISLMAALMPLIAPF 239 Query: 241 VGRLFDTGVD 250 LF + Sbjct: 240 CEHLFSEIFN 249
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 445 bits (1145), Expect = e-156 Identities = 152/483 (31%), Positives = 231/483 (47%), Gaps = 47/483 (9%) Query: 4 RLQVIYIEDDELVRRASVQSLQLAGFDVVGFGSVEAAEKAIVGDATGVIVSDIRLPGASG 63 ++ +DD +R Q+L AG+DV + + I ++V+D+ +P + Sbjct: 3 GATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDENA 62 Query: 64 LELLAQCRERTPDVPVVLVTGHGDISMAVQAMRDGAYDFIEKPFAAERLTETVRRALERR 123 +LL + ++ PD+PV++++ A++A GAYD++ KPF L + RAL Sbjct: 63 FDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALAEP 122 Query: 124 ALVLENHALRRELAGQGVVAPRIIGRSPAIEQVRRLIANVAPTDASVLINGDTGAGKELI 183 +L ++GRS A++++ R++A + TD +++I G++G GKEL+ Sbjct: 123 K------RRPSKLEDDSQDGMPLVGRSAAMQEIYRVLARLMQTDLTLMITGESGTGKELV 176 Query: 184 ARSLHELSPRRDKPFIAVNCGALPEPMFESEMFGYEPGAFTGAAKRRIGKLEYASGGTLF 243 AR+LH+ RR+ PF+A+N A+P + ESE+FG+E GAFTGA R G+ E A GGTLF Sbjct: 177 ARALHDYGKRRNGPFVAINMAAIPRDLIESELFGHEKGAFTGAQTRSTGRFEQAEGGTLF 236 Query: 244 LDEIESMPLALQVKLLRVLQDGVLERLGSNQPIRVNCRVVAAAKGDMSEHVAAGTFRRDL 303 LDEI MP+ Q +LLRVLQ G +G PIR + R+VAA D+ + + G FR DL Sbjct: 237 LDEIGDMPMDAQTRLLRVLQQGEYTTVGGRTPIRSDVRIVAATNKDLKQSINQGLFREDL 296 Query: 304 LYRLNVVTIALPPLAERREDIVPLFEHFMLDAAVRYGRPAPLLTDRQRASLMQRDWPGNV 363 YRLNVV + LPPL +R EDI L HF + A + G + WPGNV Sbjct: 297 YYRLNVVPLRLPPLRDRAEDIPDLVRHF-VQQAEKEGLDVKRFDQEALELMKAHPWPGNV 355 Query: 364 RELRNAADRFVLGVTEGIVG---------------------------------------- 383 REL N R + ++ Sbjct: 356 RELENLVRRLTALYPQDVITREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQY 415 Query: 384 DAGPETDEHAEQSLKERVEQFERAVIAETLNRTGGAVATTADKLHVGKATLYEKMKRYGL 443 A + + E +I L T G AD L + + TL +K++ G+ Sbjct: 416 FASFGDALPPSGLYDRVLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRELGV 475 Query: 444 SAK 446 S Sbjct: 476 SVY 478
>TYPE4SSCAGX#Type IV secretion system CagX conjugation protein signature. Length = 522 Score = 27.8 bits (61), Expect = 0.025 Identities = 19/64 (29%), Positives = 31/64 (48%), Gaps = 8/64 (12%) Query: 97 EFVAVAMNYDPPMYVANYAQTRQ------LPFKVALDDGSVAK-QFGNVQLTPTTFVIGK 149 ++V A+ +P NY Q + +P ++ DDG+ F N+ L P FV+ Sbjct: 386 QYVHNALKRNPVPRNYNYYQAPEKRSKHIMPSEI-FDDGTFTYFGFKNITLQPAIFVVQP 444 Query: 150 DGKI 153 DGK+ Sbjct: 445 DGKL 448
>SACTRNSFRASE#Streptothricin acetyltransferase signature. Length = 173 Score = 31.8 bits (72), Expect = 6e-04 Identities = 20/83 (24%), Positives = 30/83 (36%), Gaps = 6/83 (7%) Query: 47 GEALLVAQARDE--GIVGFVSVWEPERFVHHLYVAGTRLREGIGAALLRALPGW----PA 100 G+A + + G + S W + + VA ++G+G ALL W Sbjct: 64 GKAAFLYYLENNCIGRIKIRSNWNGYALIEDIAVAKDYRKKGVGTALLHKAIEWAKENHF 123 Query: 101 ARYRLKCLVRNERALAFYRAHGF 123 L+ N A FY H F Sbjct: 124 CGLMLETQDINISACHFYAKHHF 146
>SYCECHAPRONE#Gram-negative bacterial type III secretion SycE chaperone signature. Length = 130 Score = 28.9 bits (64), Expect = 0.024 Identities = 22/84 (26%), Positives = 32/84 (38%), Gaps = 8/84 (9%) Query: 158 ELYLPLPSAAEAALVPGVTVYGAADLPALCAHLADTPDGRLAPVAAPRLDALPAAATADL 217 +L L +P E + GV V C H+ + P G++ P LD T Sbjct: 14 QLSLSIPDTIEPVI--GVKVG-----EFAC-HITEHPVGQILMFTLPSLDNNDEKETLLS 65 Query: 218 ADVIGQAGAKRALEVAAAGGHHML 241 ++ Q K L GGH +L Sbjct: 66 HNIFSQDILKPILSWDEVGGHPVL 89
>ACRIFLAVINRP#Acriflavin resistance protein family signature. Length = 1034 Score = 28.3 bits (63), Expect = 0.021 Identities = 17/63 (26%), Positives = 30/63 (47%), Gaps = 2/63 (3%) Query: 110 YVQQGMMPVTAGLVVASAVLISEASNRSALQWGITAAVAAL-AYRTRVHPLWLLAGGALA 168 Y G++ T GL +A+LI E + + G A L A R R+ P+ + + + Sbjct: 925 YFMVGLL-TTIGLSAKNAILIVEFAKDLMEKEGKGVVEATLMAVRMRLRPILMTSLAFIL 983 Query: 169 GLV 171 G++ Sbjct: 984 GVL 986
>FLAGELLIN#Flagellin signature. Length = 507 Score = 41.2 bits (96), Expect = 6e-06 Identities = 55/369 (14%), Positives = 113/369 (30%), Gaps = 10/369 (2%) Query: 16 MNDQQAQIAQLYQQVSSGISLTTPADNPLAAAQAVQLSATSATLAQYTQNQTIVQTALQT 75 +N Q+ ++ +++SSG+ + + D+ A A + ++ L Q ++N + QT Sbjct: 17 LNKSQSSLSSAIERLSSGLRINSAKDDAAGQAIANRFTSNIKGLTQASRNANDGISIAQT 76 Query: 76 EDTTLTSVNDVLNAAYQALMHAGDGGLSDSDRAALAAQIQGSRDHLLTLANTADGAGNYL 135 + L +N+ L + + A +G SDSD ++ +IQ + + ++N G + Sbjct: 77 TEGALNEINNNLQRVRELSVQATNGTNSDSDLKSIQDEIQQRLEEIDRVSNQTQFNGVKV 136 Query: 136 FAGFQPTTQPFSNKPGGGVTY------AGDYGARAVQIADTRTVSQGDNGANVFMSVPFL 189 + G +T G + + + GD ++ + Sbjct: 137 LSQDNQMKIQVGANDGETITIDLQKIDVKSLGLDGFNVNGPKEATVGDLKSSFKNVTGYD 196 Query: 190 GSLPVPAAGASNTGTGTIGAVSITNPSDPTNTHQFTITFGGTAAAPTYTVTDNSVTPPTT 249 + +G + + T A T D T +T Sbjct: 197 TYAVGANKYRVDVNSGAVVTDTTAPTVPDKVYVNAANGQLTTDDAENNTAVDLFKTTKST 256 Query: 250 TAAQAYSSGQGINLGGQTVAVSGKPAVGDTFTVTPAPQAGTDVFATLD----TVIAALKS 305 + G GG+ V T V T++ T+ A + Sbjct: 257 AGTAEAKAIAGAIKGGKEGDTFDYKGVTFTIDTKTGNDGNGKVSTTINGEKVTLTVADIT 316 Query: 306 PVGNSQTASTALTNTMATASTKLMNTMTNVLTVQASVGGRLQEVKAMQAVTTTNTLQTTN 365 + A+T ++ S + T S E + T+ Sbjct: 317 AGAANVDAATLQSSKNVYTSVVNGQFTFDDKTKNESAKLSDLEANNAVKGESKITVNGAE 376 Query: 366 SLSNLTDTN 374 +N Sbjct: 377 YTANAAGDK 385
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 231 bits (591), Expect = 4e-70 Identities = 162/444 (36%), Positives = 253/444 (56%), Gaps = 12/444 (2%) Query: 3 NTLMNLGVSGLNAALWGLTTTGQNISNAATPGYSVERPVYAEASGQYTSSGYLPQGVSTV 62 ++L+N +SGLNAA L T NIS+ GY+ + + A+A+ + G++ GV Sbjct: 1 SSLINNAMSGLNAAQAALNTASNNISSYNVAGYTRQTTIMAQANSTLGAGGWVGNGVYVS 60 Query: 63 TVERQYNQYLSNQLNAAQTQGSSLSTYYTLVAQLNNYVGSPTAGIATAITNYFTGLQTVA 122 V+R+Y+ +++NQL AAQTQ S L+ Y +++++N + + T+ +AT + ++FT LQT+ Sbjct: 61 GVQREYDAFITNQLRAAQTQSSGLTARYEQMSKIDNMLSTSTSSLATQMQDFFTSLQTLV 120 Query: 123 NNAADPSARQTAMSNAQTLASQLVAAGQQYSQLRQSVNSQLTDTVTQINSYTSQIAQLNE 182 +NA DP+ARQ + ++ L +Q Q + VN + +V QIN+Y QIA LN+ Sbjct: 121 SNAEDPAARQALIGKSEGLVNQFKTTDQYLRDQDKQVNIAIGASVDQINNYAKQIASLND 180 Query: 183 QIA--SASSQGQPPNQLLDQRDLAVSKLSQLAGVQV-VQSNGNYSVFLSGGQPLVVGNAS 239 QI+ + G PN LLDQRD VS+L+Q+ GV+V VQ G Y++ ++ G LV G+ + Sbjct: 181 QISRLTGVGAGASPNNLLDQRDQLVSELNQIVGVEVSVQDGGTYNITMANGYSLVQGSTA 240 Query: 240 YQLATVASPSDPSELTI-VSKGVAGSAQPGPTQYLPDVSLTGGALGGLLAFRSQTLDPAQ 298 QLA V S +DPS T+ G AG+ + +P+ L G+LGG+L FRSQ LD + Sbjct: 241 RQLAAVPSSADPSRTTVAYVDGTAGNIE------IPEKLLNTGSLGGILTFRSQDLDQTR 294 Query: 299 AQLGALAVSFASQVNAQNALGVDMSGNPGGSLFAVGAPAVYANQNNTGSATLSVSFVDGT 358 LG LA++FA N Q+ G D +G+ G FA+G PAV N N G + + D + Sbjct: 295 NTLGQLALAFAEAFNTQHKAGFDANGDAGEDFFAIGKPAVLQNTKNKGDVAIGATVTDAS 354 Query: 359 QPTTSDYALSYDGAKYTLTDRATGSVVGTATPSSTPPTMTIGGLKLSLSSTPNAGDSFTV 418 +DY +S+D ++ +T R + T TP + + GL+L+ + TP DSFT+ Sbjct: 355 AVLATDYKISFDNNQWQVT-RLASNTTFTVTPDAN-GKVAFDGLELTFTGTPAVNDSFTL 412 Query: 419 LPTRGALDGFSLATANGSAIAAAS 442 P A+ + + + IA AS Sbjct: 413 KPVSDAIVNMDVLITDEAKIAMAS 436 Score = 83.1 bits (205), Expect = 9e-19 Identities = 46/105 (43%), Positives = 66/105 (62%) Query: 561 GTNDGRNALALSQLVNSKTMNNGTTTLTGAYAGYVNAIGNAASQLKASSAAQTALVGQIT 620 G +D RN AL L ++ G + AYA V+ IGN + LK SSA Q +V Q++ Sbjct: 441 GDSDNRNGQALLDLQSNSKTVGGAKSFNDAYASLVSDIGNKTATLKTSSATQGNVVTQLS 500 Query: 621 QAQQSVSGVNQNEEAANLMQYQQLYQANAKVIQTANSVFQTVLGL 665 QQS+SGVN +EE NL ++QQ Y ANA+V+QTAN++F ++ + Sbjct: 501 NQQQSISGVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>FLGFLGJ#Flagellar protein FlgJ signature. Length = 313 Score = 227 bits (579), Expect = 3e-75 Identities = 124/297 (41%), Positives = 173/297 (58%), Gaps = 15/297 (5%) Query: 15 ALDVQGFDALRSKATAAAPREGVKMVAGQFDAMFTQMMLKSMRDATPSDGLLDSSSSKMY 74 A D Q + L++KA P ++ VA Q + MF QMMLKSMRDA P DGL S +++Y Sbjct: 12 AWDAQSLNELKAKA-GEDPAANIRPVARQVEGMFVQMMLKSMRDALPKDGLFSSEHTRLY 70 Query: 75 TSMLDQQLAQQMSS-KGIGVADALTKQLLRNANVAPDAQGEGGLAAMNALAKAYANSNGA 133 TSM DQQ+AQQM++ KG+G+A+ + KQ+ + ++ + Y N + Sbjct: 71 TSMYDQQIAQQMTAGKGLGLAEMMVKQMTPEQPLPEESTPAAPMKFPLETVVRYQNQALS 130 Query: 134 PGNGALAGTRGYSAASALTPPLKGNGNSAQADAFVEKMALAAQAASATTGIPARFIVGQA 193 P + + AF+ +++L AQ AS +G+P I+ QA Sbjct: 131 ------------QLVQKAVPRNYDDSLPGDSKAFLAQLSLPAQLASQQSGVPHHLILAQA 178 Query: 194 ALESGWGKREIRGANGESSYNVFGIKATKGWTGRTVSAVTTEYVNGKPHRVVAQFRAYDS 253 ALESGWG+R+IR NGE SYN+FG+KA+ W G TTEY NG+ +V A+FR Y S Sbjct: 179 ALESGWGQRQIRRENGEPSYNLFGVKASGNWKGPVTEITTTEYENGEAKKVKAKFRVYSS 238 Query: 254 YEHAMTDYANLLKNNPRYASVLNAGHNAEGFAHGMQKAGYATDPHYAKKLISIMQQI 310 Y A++DY LL NPRYA+V A +AE A +Q AGYATDPHYA+KL +++QQ+ Sbjct: 239 YLEALSDYVGLLTRNPRYAAVTTAA-SAEQGAQALQDAGYATDPHYARKLTNMIQQM 294
>FLGHOOKAP1#Flagellar hook-associated protein signature. Length = 546 Score = 26.8 bits (59), Expect = 0.029 Identities = 10/38 (26%), Positives = 17/38 (44%) Query: 102 NVDPVQEMVNMISASRSYQANVETLNTAKQLMLKTLTI 139 V+ +E N+ + Y AN + L TA + + I Sbjct: 508 GVNLDEEYGNLQRFQQYYLANAQVLQTANAIFDALINI 545
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 30.6 bits (69), Expect = 0.016 Identities = 13/68 (19%), Positives = 29/68 (42%), Gaps = 15/68 (22%) Query: 17 IIGQAKAKKAVAVALRNRWRRQQVAEPLRQEITPKNILMIGPTGVGKTEIAR---RLAKL 73 ++G++ A + + ++ + T +++ G +G GK +AR K Sbjct: 139 LVGRSAAMQEI------YRVLARLMQ------TDLTLMITGESGTGKELVARALHDYGKR 186 Query: 74 ADAPFIKI 81 + PF+ I Sbjct: 187 RNGPFVAI 194
>HTHFIS#FIS bacterial regulatory protein HTH signature. Length = 484 Score = 87.6 bits (217), Expect = 9e-23 Identities = 30/127 (23%), Positives = 60/127 (47%) Query: 1 MSDKNFLVIDDNEVFAGTLARGLERRGYAVRQAHNKDEALKLAGAEKFEFITVDLHLGND 60 M+ LV DD+ L + L R GY VR N + A + + D+ + ++ Sbjct: 1 MTGATILVADDDAAIRTVLNQALSRAGYDVRITSNAATLWRWIAAGDGDLVVTDVVMPDE 60 Query: 61 SGLSLIAPLCDLQPDARILVLTGYASIATAVQAVKDGADNYLAKPANVESILAALQTNAS 120 + L+ + +PD +LV++ + TA++A + GA +YL KP ++ ++ + + Sbjct: 61 NAFDLLPRIKKARPDLPVLVMSAQNTFMTAIKASEKGAYDYLPKPFDLTELIGIIGRALA 120 Query: 121 EVQAEEA 127 E + + Sbjct: 121 EPKRRPS 127 Score = 45.2 bits (107), Expect = 4e-08 Identities = 16/101 (15%), Positives = 32/101 (31%), Gaps = 3/101 (2%) Query: 75 DARILVLTGYASIATAVQAVKDGADNYLAKPANVESILAALQTNASEVQAEEALENPVVL 134 I+ + I + L+ VE + + + L Sbjct: 375 TREIIENELRSEIPDSPIEKAAARSGSLSISQAVEENMRQYFASFGDALPPSGL---YDR 431 Query: 135 SVDRLEWEHIQRVLAENNNNISATARALNMHRRTLQRKLAK 175 + +E+ I L N A L ++R TL++K+ + Sbjct: 432 VLAEMEYPLILAALTATRGNQIKAADLLGLNRNTLRKKIRE 472
>CARBMTKINASE#Bacterial carbamate kinase signature. Length = 314 Score = 43.7 bits (103), Expect = 5e-07 Identities = 27/99 (27%), Positives = 48/99 (48%), Gaps = 6/99 (6%) Query: 180 IPVISPIGFGEDGLSYNINADLVAGKLATVLNAEKLVMMTNIPGVMDKEG----NLLTDL 235 +PVI G G+ I+ DL KLA +NA+ +++T++ G G L ++ Sbjct: 197 VPVILEDG-EIKGVEAVIDKDLAGEKLAEEVNADIFMILTDVNGAALYYGTEKEQWLREV 255 Query: 236 SAREIDALFEDGT-ISGGMLPKISSALDAAKSGVKSVHI 273 E+ +E+G +G M PK+ +A+ + G + I Sbjct: 256 KVEELRKYYEEGHFKAGSMGPKVLAAIRFIEWGGERAII 294 Score = 36.7 bits (85), Expect = 8e-05 Identities = 21/60 (35%), Positives = 27/60 (45%), Gaps = 10/60 (16%) Query: 31 GKTVVIKYGGNAMTEERLKQGF----------ARDVILLKLVGINPVIVHGGGPQIDQAL 80 GK VVI GGNA+ + K + AR + + G VI HG GPQ+ L Sbjct: 2 GKRVVIALGGNALQQRGQKGSYEEMMDNVRKTARQIAEIIARGYEVVITHGNGPQVGSLL 61
>HTHTETR#TetR bacterial regulatory protein HTH signature. Length = 215 Score = 58.1 bits (140), Expect = 2e-12 Identities = 31/183 (16%), Positives = 62/183 (33%), Gaps = 15/183 (8%) Query: 24 ASRTRPKPGERRVHILQTLASMLESPKSEKITTAALAARLDVSEAALYRHFSSKAQMFEG 83 A +T+ + E R HIL + + +A V+ A+Y HF K+ +F Sbjct: 2 ARKTKQEAQETRQHILDVALRLFSQQGVSSTSLGEIAKAAGVTRGAIYWHFKDKSDLFSE 61 Query: 84 LIEFIEETFFGLVNQIAANEPNGVLQA-RSIALMLLNFSAKNPGMTRVLTGEALVGEHER 142 + E E L + A P L R I + +L + ++ + H+ Sbjct: 62 IWELSESNIGELELEYQAKFPGDPLSVLREILIHVLESTVTEERRRLLME----IIFHKC 117 Query: 143 LAERVNQMLERVEASIKQCLR---VALLEAQAHAAGGAPPPVPLPDDYDPALRASLVISY 199 ++++ + ++ L+ A P + A ++ Y Sbjct: 118 EFVGEMAVVQQAQRNLCLESYDRIEQTLKH-CIEAKMLPADL------MTRRAAIIMRGY 170 Query: 200 VLG 202 + G Sbjct: 171 ISG 173